Proximal Policy Optimization - AI Glossary