-
JanusCoder Public
Forked from InternLM/JanusCoderJanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence
Jupyter Notebook MIT License UpdatedOct 28, 2025 -
njucckevin.github.io Public
Forked from RayeRen/acad-homepage.github.ioAcadHomepage: A Modern and Responsive Academic Personal Homepage
SCSS MIT License UpdatedSep 20, 2025 -
SeeClick Public
The model, data and code for the visual GUI Agent SeeClick
-
GUI-Actor Public
Forked from microsoft/GUI-ActorGUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
Python MIT License UpdatedJun 6, 2025 -
OS-Genesis Public
Forked from OS-Copilot/OS-GenesisCode and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Jupyter Notebook UpdatedJun 6, 2025 -
CapArena Public
An Arena-style Automated Evaluation Benchmark for Detailed Captioning
-
MM-Self-Improve Public
A Self-Training Framework for Vision-Language Reasoning
-
KnowCap Public
Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
-
ADS-Cap Public
A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora
-