Tianhao Cheng crazycth

PhD student at @fudan

97 followers · 163 following

Fudan University
Shanghai
15:00 (UTC +08:00)
crazycth.github.io

Stars

tile-ai / tilelang

A domain-specific language designed to streamline the development of high-performance GPU, CPU, and accelerator kernels

C++ · 4,300 stars · 360 forks · Updated Dec 25, 2025

Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR

The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning."

Python · 24 stars · 1 fork · Updated Dec 22, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python · 21,956 stars · 3,862 forks · Updated Dec 25, 2025

JarvisUSTC / DoctorAgent-RL

DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue

Python · 53 stars · 7 forks · Updated Oct 15, 2025

nex-agi / Nex-N1

94 stars · 3 forks · Updated Dec 5, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB · 13,952 stars · 1,309 forks · Updated Oct 28, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python · 74,118 stars · 8,875 forks · Updated Dec 24, 2025

QwenLM / Qwen3Guard

Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.

Python · 388 stars · 26 forks · Updated Oct 21, 2025

microsoft / nnscaler

nnScaler: Compiling DNN models for Parallel Training

Python · 121 stars · 22 forks · Updated Sep 23, 2025

galtay / hilbertcurve

Maps between a 1-D space-filling Hilbert curve and N-D coordinates

Python · 269 stars · 38 forks · Updated Apr 28, 2024

MoonshotAI / Kimi-Linear

1,244 stars · 57 forks · Updated Nov 17, 2025

MiroMindAI / MiroFlow

MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowseComp and xBench.

Python · 1,619 stars · 175 forks · Updated Nov 30, 2025

seamoke / DPH-RL

This is the official implementation of the paper "The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward."

Python · 10 stars · Updated Oct 17, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python · 4,118 stars · 338 forks · Updated Dec 24, 2025

TheRoadQaQ / ReLIFT

Official Repository of "Learning what reinforcement learning can't"

Python · 71 stars · 1 fork · Updated Nov 16, 2025

SkyworkAI / Skywork-OR1

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python · 736 stars · 44 forks · Updated Jun 6, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust · 54,648 stars · 6,950 forks · Updated Dec 25, 2025

ZeroYuHuang / prefix_rft

Python · 3 stars · Updated Sep 14, 2025

JinjieNi / Quokka

The official GitHub repo for "Training Optimal Large Diffusion Language Models", the first large-scale scaling law for diffusion language models.

Python · 45 stars · 1 fork · Updated Nov 6, 2025

PPPP-kaqiu / Awesome-Parallel-Reasoning

Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.

HTML · 41 stars · 3 forks · Updated Dec 20, 2025

HKUDS / AI-Researcher

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python · 3,825 stars · 453 forks · Updated Oct 16, 2025

adewynter / is-icl-learning

Repository for the paper 'Is In-Context Learning Learning?'

Jupyter Notebook · 3 stars · Updated Sep 16, 2025

XiaomiMiMo / MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python · 1,858 stars · 75 forks · Updated Jun 5, 2025

CharlesQ9 / Physics-Supernova

Python · 25 stars · 4 forks · Updated Dec 7, 2025

ai4protein / VenusREM

🧬 Augmenting zero-shot mutant prediction by retrieval-based logits fusion. (ISMB/ECCB 2025)

Python · 114 stars · 12 forks · Updated Aug 20, 2025

dunnolab / awesome-in-context-rl

Awesome In-Context RL: A curated list of In-Context Reinforcement Learning

261 stars · 14 forks · Updated Sep 8, 2025

MuiseDestiny / zotero-style

Ethereal Style for Zotero

JavaScript · 4,684 stars · 147 forks · Updated Nov 24, 2025

LiveCodeBench / LiveCodeBench

Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"

Python · 748 stars · 155 forks · Updated Jul 16, 2025

k8sgpt-ai / k8sgpt

Giving Kubernetes Superpowers to everyone

Go · 7,255 stars · 912 forks · Updated Dec 23, 2025

UbiquantAI / one-shot-em

One-shot Entropy Minimization

Python · 187 stars · 11 forks · Updated Jun 13, 2025