Skip to content
View chanchimin's full-sized avatar

Highlights

  • Pro

Organizations

@thunlp

Block or report chanchimin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"

238 15 Updated Dec 5, 2025

Public repo for rbio, a biologically-informed reasoning model trained on virtual cell models as verifiers

Python 121 15 Updated Nov 24, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,327 3,317 Updated Dec 20, 2025
Python 35 3 Updated Sep 23, 2025

AlphaFold 3 inference pipeline.

Python 7,346 1,043 Updated Dec 16, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,601 3,625 Updated Dec 20, 2025

BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | NeurIPS '25

Jupyter Notebook 341 48 Updated Nov 26, 2025

[ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet

Python 227 26 Updated Nov 13, 2025

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

10,073 681 Updated Dec 3, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,825 1,813 Updated Oct 13, 2025
Python 953 108 Updated Jun 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,660 2,860 Updated Dec 21, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,633 838 Updated Dec 18, 2025

My learning notes for ML SYS.

Python 4,724 299 Updated Dec 19, 2025

Fully open reproduction of DeepSeek-R1

Python 25,745 2,405 Updated Nov 24, 2025

[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 565 32 Updated May 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,688 4,097 Updated Dec 20, 2025
Python 8,613 608 Updated Nov 12, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 639 51 Updated Apr 8, 2025

A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…

922 22 Updated Dec 17, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 5,149 744 Updated Aug 20, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,785 99 Updated Mar 18, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,822 2,571 Updated Dec 19, 2025

Skywork Reward Model Series

11 1 Updated Sep 6, 2024

Optimizing inference proxy for LLMs

Python 3,234 258 Updated Dec 3, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,537 80 Updated May 30, 2025
Next