Skip to content
View lehaoqu's full-sized avatar
  • Beihang University
  • No. 37, Xueyuan Road, Haidian District, Beijing
  • 08:09 (UTC -12:00)

Highlights

  • Pro

Block or report lehaoqu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Public repository for Agent Skills

Python 119,542 13,827 Updated Apr 16, 2026

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 49,117 6,709 Updated Apr 17, 2026

🔥[MobiCom'25 Poster] AFL-Lib: An Asynchronous Federated Learning Library and Benchmark

Python 40 3 Updated Jul 23, 2025
Python 1 Updated Mar 14, 2026

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 1 Updated Oct 28, 2025

Official Implementation of "KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills"

Python 791 97 Updated Nov 13, 2025

GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.

Python 1,184 144 Updated Apr 17, 2026

Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research

Python 2,145 326 Updated Apr 17, 2026

世界模型(World Model)调研项目:收集李飞飞、LeCun和Meta的最新世界模型开源代码和研究资料

12 1 Updated Nov 22, 2025

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 700 70 Updated Feb 15, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,495 391 Updated Nov 13, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,807 173 Updated Feb 27, 2026

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 29,441 2,884 Updated Mar 27, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,677 1,440 Updated Feb 27, 2026

[EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Python 80 4 Updated Nov 4, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 516 37 Updated Jun 6, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,524 32,903 Updated Apr 17, 2026

Fully open reproduction of DeepSeek-R1

Python 25,991 2,415 Updated Apr 2, 2026

复现大模型相关算法及一些学习记录

Python 3,272 439 Updated Mar 21, 2026

OpenHuFu is an open-sourced data federation system to support collaborative queries over multi databases with security guarantee.

Java 733 288 Updated Oct 25, 2024

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,368 920 Updated Apr 17, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 70,231 8,598 Updated Apr 12, 2026

Official pytorch Implementation of Relational Knowledge Distillation, CVPR 2019

Python 417 50 Updated May 17, 2021

PyTorch implementation of JEI-DNN (ICLR 2024)

Python 3 Updated Apr 3, 2024

Boosted Dynamic Neural Networks, AAAI 2023

Python 8 3 Updated Dec 1, 2022

Code and pretrained models for paper: Data-Free Adversarial Distillation

Python 99 16 Updated Nov 28, 2022

Code and data accompanying the FedGen paper

Python 261 73 Updated Oct 31, 2024
Python 4 Updated Dec 12, 2023

EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).

Python 79 7 Updated Jun 14, 2024
Next