-
homepage Public
Forked from shangdongyang/shangdongyang.github.io
Academic Personal Homepage
SCSS MIT License Updated Jun 12, 2026 -
delta-Mem Public
Forked from declare-lab/delta-Mem
The official repo of the paper: delta-Mem: Efficient Online Memory for Large Language Models
Python Updated May 27, 2026 -
stable-worldmodel Public
Forked from galilai-group/stable-worldmodel
A platform for reproducible world model research and evaluation
Python Updated May 26, 2026 -
In-Place-TTT Public
Forked from ByteDance-Seed/In-Place-TTT
Python Apache License 2.0 Updated Apr 21, 2026 -
E2HiL-project-a1x Public
Forked from E2HiL/E2HiL-project-a1x
Python Apache License 2.0 Updated Mar 22, 2026 -
verl-agent Public
Forked from langfengQ/verl-agent
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training"
Python Apache License 2.0 Updated Feb 27, 2026 -
conrft Public
Forked from cccedric/conrft
This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".
Python Apache License 2.0 Updated Nov 11, 2025 -
vlarl Public
Forked from GuanxingLu/vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
Python Apache License 2.0 Updated Nov 8, 2025 -
dyn-O Public
Forked from wangzizhao/dyn-O
Official Implementation of Dyn-O: Building Structured World Models with Object-Centric Representations (NeurIPS 2025)
Python Updated Oct 20, 2025 -
era Public
Forked from nothingbutbut/era
Official code repo for the paper "Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints".
Python Updated Oct 10, 2025 -
EO-1 Public
Forked from EO-Robotics/EO1
EO: Open-source Unified Embodied Foundation Model Series
Jupyter Notebook Updated Sep 15, 2025 -
RLinf Public
Forked from RLinf/RLinf
RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.
Python Apache License 2.0 Updated Sep 1, 2025 -
CoSo Public
Forked from langfengQ/CoSo
Official code for the paper "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"
Python Apache License 2.0 Updated Jun 12, 2025 -
ECL Public
Source code for "Towards Empowerment Gain through Causal Structure Learning in Model-Based RL"
Updated Apr 16, 2025 -
open-r1 Public
Forked from huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 Updated Mar 14, 2025 -
OpenManus Public
Forked from FoundationAgents/OpenManus
No fortress, purely open ground. OpenManus is Coming.
Python MIT License Updated Mar 10, 2025 -
X-Boundary Public
Forked from AI45Lab/X-Boundary
The code repo of the paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability"
Python Updated Mar 7, 2025 -
DPT-Agent Public
Forked from sjtu-marl/DPT-Agent
This is the official implementation of the paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collaboration."
Python MIT License Updated Mar 2, 2025 -
Awesome-LLM-Safety Public
Forked from drivetosouth/Awesome-LLM-Safety
A collection of awesome public projects about LLM Safety.
Updated Feb 27, 2025 -
SPAG Public
Forked from Linear95/SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
Python Apache License 2.0 Updated Feb 24, 2025 -
LEGION Public
Forked from Ghiara/LEGION
Official implementation of the Nature Machine Intelligence paper "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Learning"
Python MIT License Updated Feb 9, 2025