xu3kev

Wen-Ding Li xu3kev

65 followers · 330 following

https://www.cs.cornell.edu/~wdli/

Achievements

x2 x2

Achievements

x2 x2

Highlights

Organizations

Stars

openai / simple-evals

Python 4,524 491 Updated Apr 22, 2026

heilcheng / DeepMind

Record for work at Google DeepMind

HTML 482 33 Updated Dec 29, 2025

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 18,129 1,151 Updated May 18, 2026

kfdong / STP

The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"

Python 121 10 Updated Mar 28, 2025

jxiw / MambaInLlama

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 242 22 Updated Oct 14, 2025

McGill-NLP / nano-aha-moment

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 623 56 Updated Oct 7, 2025

formal-land / rocq-of-rust

Formal verification tool for Rust: check 100% of execution cases of your programs to make safer applications.

Rocq Prover 1,129 45 Updated Jun 16, 2026

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 4,199 561 Updated Jun 16, 2026

groundlight / r1_vlm

Build your own visual reasoning model

Jupyter Notebook 423 28 Updated Jan 13, 2026

nebius / kvax

A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.

Python 167 9 Updated Nov 11, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,532 443 Updated Jun 8, 2026

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 132,820 21,488 Updated Jun 16, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,705 1,062 Updated Apr 30, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,623 577 Updated Jun 16, 2026

therealoliver / Deepdive-llama3-from-scratch

Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.

Jupyter Notebook 629 52 Updated Feb 24, 2025

DataArcTech / ChartMoE

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

Jupyter Notebook 100 9 Updated Apr 1, 2025

Coobiw / MPP-LLaVA

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 681 34 Updated Mar 10, 2025