Skip to content
View NA-Wen's full-sized avatar
🌵
🌵

Block or report NA-Wen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 176 2 Updated Dec 19, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 572 54 Updated Oct 7, 2025

Cursor for Minecraft

Java 1,080 88 Updated Nov 6, 2025

A RL Framework for multi LLM agent system

Python 86 11 Updated Dec 27, 2025
Python 207 15 Updated Oct 27, 2025

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 407 55 Updated Nov 17, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,172 195 Updated Oct 9, 2025

MarkDiffusion: An Open-Source Toolkit for Generative Watermarking of Latent Diffusion Models

Jupyter Notebook 179 5 Updated Dec 23, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,739 1,025 Updated Dec 4, 2025

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,082 108 Updated Dec 26, 2025

Code for paper "Omni-SafetyBench: A Benchmark for Safety Evaluation of Audio-Visual Large Language Models".

Python 68 Updated Sep 28, 2025

MarkLLM: An Open-Source Toolkit for LLM Watermarking.(EMNLP 2024 System Demonstration)

Python 683 76 Updated Oct 14, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,476 1,999 Updated Nov 1, 2025

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,511 190 Updated Dec 19, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,099 2,064 Updated Sep 12, 2025

Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks

Python 254 11 Updated May 5, 2025

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | Awesome Human-Agent Collaboration | Human-AI Collaboration

172 7 Updated Nov 30, 2025

Awesome List for Agentic RL

HTML 664 27 Updated Dec 9, 2025

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 381 42 Updated Nov 20, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,007 205 Updated Dec 21, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,460 197 Updated Dec 3, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,873 330 Updated Nov 28, 2025

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).

Python 243 25 Updated Dec 11, 2025

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

1,657 91 Updated Oct 30, 2025

清华主题PPT模板

Python 1,526 95 Updated Nov 16, 2025

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,248 194 Updated Dec 19, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,737 2,218 Updated Mar 11, 2025

A Tiny structure of pytorch for learning;

C++ 60 13 Updated Jul 7, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,875 371 Updated Dec 17, 2025
Next