Empowering GUI Agents via Autonomous Experience Exploration and Hindsight Experience Utilization for Task Planning
Training smaller AI models to plan complex computer tasks better than much larger ones
Researchers developed a method that lets smaller AI models learn to navigate websites and complete tasks by autonomously exploring environments and reusing past experiences as training data. A 7-billion-parameter model trained this way outperformed a much larger 32-billion-parameter commercial model, reaching 30.6% accuracy on real-world benchmarks. The breakthrough came from focusing training on high-level task planning rather than low-level individual skills.
Smaller AI models are cheaper to run and keep user data private, but they've struggled with planning complex multi-step tasks on websites and generalizing to new situations. This work shows they can match or beat much larger commercial models when trained the right way—meaning organizations could deploy capable web automation agents without expensive hardware or privacy concerns, while still handling unfamiliar websites and task variations they've never seen before.