We proposed a reinforcement learning-based training framework for grounding GUI agents. Utilizing high-quality seed data filtering, dense policy reward, and an attention-based self-evolution mechanism significantly improves the ability to locate UI elements. With only 3k training samples, SE-GUI-7B model achieves state-of-the-art performance among models of the same scale on multiple benchmark tasks.
forked from YXB-NKU/SE-GUI
-
Notifications
You must be signed in to change notification settings - Fork 0
Offical implementation of "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"
rickyHong/SE-GUI
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Offical implementation of "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published