Skip to content
View xiuxiusen's full-sized avatar

Block or report xiuxiusen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,875 360 Updated Jun 18, 2026

Public repository for Agent Skills

Python 152,499 17,967 Updated Jun 9, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 2,022 198 Updated Jun 9, 2026

RoboTwin 2.0 Offical Repo

Python 2,462 401 Updated May 23, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 13,173 1,585 Updated Feb 27, 2026

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,965 445 Updated Nov 13, 2025

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 2,209 200 Updated Mar 19, 2026

Official Code for RVT-2 and RVT

Jupyter Notebook 406 55 Updated Feb 14, 2025
Python 274 22 Updated Aug 25, 2025

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,835 541 Updated Jun 18, 2026

PWM: Policy Learning with Large World Models

Jupyter Notebook 70 6 Updated Aug 4, 2025

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,918 218 Updated May 26, 2026

official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM method, a cross-task in-context manipulation (VLA) method

Python 69 5 Updated May 28, 2026

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

Python 5,527 765 Updated Jun 3, 2026

High-Fidelity 3D Shape Generation via Scalable Geometric Refinement

Python 759 60 Updated Jan 6, 2026

学术常用的prompts

873 49 Updated Jan 18, 2026

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 1,463 171 Updated Apr 1, 2026

"AnyTool: Universal Tool-Use Layer for AI Agents"

Python 663 91 Updated Feb 28, 2026

MCP to allow AI agents to control Unreal

C++ 589 80 Updated Jun 22, 2025

Model Context Protocol(MCP) 编程极速入门

3,522 220 Updated Apr 23, 2025

A Minecraft MCP Server powered by Mineflayer API. It allows to control a Minecraft character in real-time, allowing AI assistants to build structures, explore the world, and interact with the game …

TypeScript 622 68 Updated Apr 4, 2026

Enable AI assistant clients like Cursor, Windsurf and Claude Desktop to control Unreal Engine through natural language using the Model Context Protocol (MCP).

C++ 1,995 329 Updated Apr 22, 2025

[ICLR 2024] Official code of the paper "Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts.

Python 46 7 Updated Nov 18, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 14,513 1,432 Updated Jun 14, 2026

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,958 425 Updated Mar 15, 2025

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 36,755 5,185 Updated Jun 18, 2026

"RAG-Anything: All-in-One RAG Framework"

Python 21,433 2,501 Updated Jun 15, 2026

Official implementation of paper on Nature Machine Intelligence: "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Learning"

Python 129 12 Updated Apr 3, 2025

Codes of CTPG accompanying the paper "Efficient Multi-Task Reinforcement Learning with Cross-Task Policy Guidance"(NeurIPS 2024).

Python 9 1 Updated Nov 12, 2024

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

Python 865 197 Updated May 21, 2025
Next