-
Tsinghua University
- Shenzhen, China
- https://georginhsu.github.io/
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A generative world for general-purpose robotics & embodied AI learning.
Generative Models by Stability AI
Train transformer language models with reinforcement learning.
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
[arXiv 2025] TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System
[CoRL 2025] ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.