- Tsinghua University
- https://na-wen.github.io/
- https://scholar.google.cz/citations?user=zBvXAyIAAAAJ&hl=cs
Stars
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
An RL framework for multi-LLM agent systems
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
Build, evaluate and train General Multi-Agent Assistance with ease
Code for paper "Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models".
MarkLLM: An Open-Source Toolkit for LLM Watermarking. (EMNLP 2024 System Demonstration)
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
An open-source implementation for fine-tuning Qwen-VL series by Alibaba Cloud.
A simple screen parsing tool towards a pure vision-based GUI agent
Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"
LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.