HYeCao

Hongye Cao HYeCao

6 followers · 25 following

Nanjing University

Achievements

homepage Public
Forked from shangdongyang/shangdongyang.github.io

Academic Personal Homepage

SCSS MIT License Updated Jun 12, 2026
delta-Mem Public
Forked from declare-lab/delta-Mem

The official repo of the paper: delta-Mem: Efficient Online Memory for Large Language Models

Python Updated May 27, 2026
stable-worldmodel Public
Forked from galilai-group/stable-worldmodel

A platform for reproducible world model research and evaluation

Python Updated May 26, 2026
ScriptMem Public
Forked from memorax-ai/ScriptMem

Python Other Updated May 8, 2026
CIP Public

source code for causal information prioritization

Python 9 Updated Apr 28, 2026
CausalHIL-SERL Public

Updated Apr 23, 2026
HILRL-A1X Public

Python Apache License 2.0 Updated Apr 23, 2026
In-Place-TTT Public
Forked from ByteDance-Seed/In-Place-TTT

Python Apache License 2.0 Updated Apr 21, 2026
MemFactory Public
Forked from Valsure/MemFactory

Python Apache License 2.0 Updated Apr 7, 2026
E2HiL-project-a1x Public
Forked from E2HiL/E2HiL-project-a1x

Python Apache License 2.0 Updated Mar 22, 2026
verl-agent Public
Forked from langfengQ/verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python Apache License 2.0 Updated Feb 27, 2026
conrft Public
Forked from cccedric/conrft

This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".

Python Apache License 2.0 Updated Nov 11, 2025
vlarl Public
Forked from GuanxingLu/vlarl

Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.

Python Apache License 2.0 Updated Nov 8, 2025
hil-serl Public
Forked from rail-berkeley/hil-serl

Python Apache License 2.0 Updated Oct 27, 2025
dyn-O Public
Forked from wangzizhao/dyn-O

Official Implementation of Dyn-O: Building Structured World Models with Object-Centric Representations (NeurIPS 2025)

Python Updated Oct 20, 2025
era Public
Forked from nothingbutbut/era

An official code repo of paper Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints.

Python Updated Oct 10, 2025
EO-1 Public
Forked from EO-Robotics/EO1

EO: Open-source Unified Embodied Foundation Model Series

Jupyter Notebook Updated Sep 15, 2025
RLinf Public
Forked from RLinf/RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python Apache License 2.0 Updated Sep 1, 2025
CoSo Public
Forked from langfengQ/CoSo

Official code for paper "Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning"

Python Apache License 2.0 Updated Jun 12, 2025
ECL Public

Source code for Towards Empowerment Gain through Causal Structure Learning in Model-Based RL

Updated Apr 16, 2025
dino_wm Public
Forked from gaoyuezhou/dino_wm

Python MIT License Updated Mar 24, 2025
open-r1 Public
Forked from huggingface/open-r1

Fully open reproduction of DeepSeek-R1

Python Apache License 2.0 Updated Mar 14, 2025
SPECTra Public
Forked from funny-rl/SPECTra

Python Updated Mar 14, 2025
OpenManus Public
Forked from FoundationAgents/OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python MIT License Updated Mar 10, 2025
X-Boundary Public
Forked from AI45Lab/X-Boundary

The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability"

Python Updated Mar 7, 2025
DPT-Agent Public
Forked from sjtu-marl/DPT-Agent

This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collaboration."

Python MIT License Updated Mar 2, 2025
Awesome-LLM-Safety Public
Forked from drivetosouth/Awesome-LLM-Safety

A collection of awesome public projects about LLM Safety.

Updated Feb 27, 2025
SPAG Public
Forked from Linear95/SPAG

Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024

Python Apache License 2.0 Updated Feb 24, 2025
LEGION Public
Forked from Ghiara/LEGION

Official implementation of paper on Nature Machine Intelligence: "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Learning"

Python MIT License Updated Feb 9, 2025
Causal-Copilot Public
Forked from Lancelot39/Causal-Copilot

Python Updated Dec 31, 2024

Hongye Cao HYeCao

Achievements

Achievements

homepage Public

Uh oh!

delta-Mem Public

Uh oh!

stable-worldmodel Public

Uh oh!

ScriptMem Public

Uh oh!

CIP Public

Uh oh!

CausalHIL-SERL Public

Uh oh!

HILRL-A1X Public

Uh oh!

In-Place-TTT Public

Uh oh!

MemFactory Public

Uh oh!

E2HiL-project-a1x Public

Uh oh!

verl-agent Public

Uh oh!

conrft Public

Uh oh!

vlarl Public

Uh oh!

hil-serl Public

Uh oh!

dyn-O Public

Uh oh!

era Public

Uh oh!

EO-1 Public

Uh oh!

RLinf Public

Uh oh!

CoSo Public

Uh oh!

ECL Public

Uh oh!

dino_wm Public

Uh oh!

open-r1 Public

Uh oh!

SPECTra Public

Uh oh!

OpenManus Public

Uh oh!

X-Boundary Public

Uh oh!

DPT-Agent Public

Uh oh!

Awesome-LLM-Safety Public

Uh oh!

SPAG Public

Uh oh!

LEGION Public

Uh oh!

Causal-Copilot Public

Uh oh!