Reinforcement-Learning

AutoResearch-RL: Autonomous Neural Architecture Discovery Through Reinforcement Learning

AutoResearch-RL presents a framework where reinforcement learning agents autonomously conduct neural architecture and hyperparameter research without human supervision, using PPO to optimize code modifications based on …

AI · Development Signal Editorial Team

Feb 27 arxiv.org 4 min read

Ferret-UI Lite: Building Efficient 3B On-Device GUI Agents with Reinforcement Learning

Apple researchers present Ferret-UI Lite, a compact 3B multimodal language model designed for on-device GUI automation across mobile, web, and desktop platforms. The model achieves competitive performance through curated …

AI · Development Signal Editorial Team

Feb 2 arxiv.org 3 min read

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

GEPA introduces a novel prompt optimization approach that uses natural language reflection and Pareto-based evolutionary search to optimize compound AI systems, achieving superior performance compared to reinforcement …

AI · Development Signal Editorial Team