Lists (1)
Sort Name ascending (A-Z)
Stars
Next paradigm for LLM Agent. Unify plan and action through recursive code generation for adaptive, human-like decision-making.
Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
VisJudgeBench: A comprehensive benchmark for aesthetics and quality assessment of visualizations, featuring 3,090 expert-annotated samples with six-dimensional quality scores.
UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.
DataMosaic: Explainable and Verifiable Document-Based Data Analytics
🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )
🤗 smolagents: a barebones library for agents that think in code.
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
🔥[NeurIPS'25] DeepFund: Pilot for Your Next Fund Investment
verl: Volcano Engine Reinforcement Learning for LLMs
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
Witness the aha moment of VLM with less than $3.
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
A fork to add multimodal model training to open-r1
Solve Visual Understanding with Reinforced VLMs
OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next-generation models that surpass DeepSeek.
Famous Vision Language Models and Their Architectures