Skip to content
View shipengai's full-sized avatar

Block or report shipengai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scalable toolkit for efficient model reinforcement

Python 1,743 428 Updated Jun 19, 2026

Multimodal RL training framework for diffusion & omni models

Python 374 55 Updated Jun 19, 2026

https://avocado-captioner.github.io/

Python 37 1 Updated Oct 16, 2025

[ICML 2026] Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Python 46 Updated Jun 2, 2026

[ICLR 2026] An official implementation of "CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning"

Python 217 8 Updated Jun 10, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python 684 43 Updated May 30, 2026
Python 13 2 Updated Mar 23, 2026

Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training

Python 53 5 Updated Dec 13, 2025

FuseAI Project

Python 598 37 Updated Jan 25, 2025

Codebase for Merging Language Models (ICML 2024)

Python 868 52 Updated May 5, 2024

Official Implementation of OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

Python 884 27 Updated Apr 11, 2026
Python 18 Updated Mar 16, 2026

πŸ”₯ OneThinker: All-in-one Reasoning Model for Image and Video [CVPR 2026]

Python 458 31 Updated Feb 28, 2026
Python 77 4 Updated Apr 9, 2026

[CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"

Python 39 2 Updated Nov 11, 2025

[CVPR 2026] Boosting Reasoning in Large Multimodal Models via Activation Replay

Python 23 Updated May 7, 2026

Official code for "Rethinking Chain-of-Thought Reasoning for Videos"

21 Updated Dec 14, 2025

[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Python 87 5 Updated Feb 27, 2026

[CVPRF 2026] Official PyTorch code of "Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning".

10 Updated Apr 23, 2026

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 725 28 Updated Sep 24, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,312 1,318 Updated Jun 18, 2026

[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 526 19 Updated Nov 18, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,340 173 Updated May 16, 2026

RoboBrain 2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind. πŸŽ‰πŸŽ‰πŸŽ‰

Python 1,103 109 Updated Feb 28, 2026

Spirit-v1.5: A Robotic Foundation Model by Spirit AI

Python 611 34 Updated May 29, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,020 372 Updated Apr 6, 2026
Python 71 2 Updated Jun 18, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,877 361 Updated Jun 18, 2026

Building General-Purpose Robots Based on Embodied Foundation Model

Python 1,098 84 Updated Jun 17, 2026

πŸ€— LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 25,111 4,848 Updated Jun 19, 2026
Next