-
BSc, THU -> PhD, HKUST
- https://scholar.google.com/citations?user=5U4P54wAAAAJ&hl=zh-CN
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"
Public repo for rbio, a biologically-informed reasoning model trained on virtual cell models as verifiers
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
AlphaFold 3 inference pipeline.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | NeurIPS '25
[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet
[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
verl: Volcano Engine Reinforcement Learning for LLMs
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
My learning notes for ML SYS.
Fully open reproduction of DeepSeek-R1
[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
Scalable RL solution for advanced reasoning of language models
A generative world for general-purpose robotics & embodied AI learning.
Optimizing inference proxy for LLMs
An Open Large Reasoning Model for Real-World Solutions