Zefan-Cai

Zefan Cai Zefan-Cai

I help community use large language models.

384 followers · 70 following

Achievements

x2 x3

Achievements

x2 x3

Highlights

Stars

tianyi-lab / PoLar

Code for "Skip a Layer or Loop It? Learning Program-of-Layers in LLMs (ICML 2026 Oral)"

Python 6 Updated Jun 11, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 3,466 446 Updated Jun 13, 2026

multica-ai / andrej-karpathy-skills

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

175,028 17,854 Updated Apr 20, 2026

jackwener / OpenCLI

Make Any Website into CLI & Use your logged-in browser by AI agent.

JavaScript 24,313 2,434 Updated Jun 14, 2026

kyegomez / OpenMythos

A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.

Python 13,831 3,118 Updated May 23, 2026

weilicao / SPScanner

[COLM '25] Single-Pass Document Scanning for Question Answering

Python 14 Updated Aug 20, 2025

AutoX-AI-Labs / AutoR

AI handles execution, humans own the direction, and every run becomes an inspectable research artifact on disk.

Python 854 23 Updated Jun 2, 2026

luwill / research-skills

Some commonly used research experiences and processes are encapsulated into Agent skills.

TypeScript 663 82 Updated May 11, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 86,636 12,549 Updated Mar 26, 2026

TheToughCrane / nano-kvllm

This project aims to provide a high effective KV cache manage framework for llm inference and improve memory utilization and inference speed.

Python 61 2 Updated Apr 24, 2026

UniPat-AI / UniScientist

UniScientist is designed to advance universal scientific research intelligence through a unified paradigm

Python 163 12 Updated Mar 14, 2026

SakanaAI / doc-to-lora

Hypernetworks that update LLMs to remember factual information

Python 745 96 Updated May 25, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,637 79,183 Updated Jun 14, 2026

chenfengxu714 / StreamDiffusionV2

StreamDiffusion, Live Stream APP

Python 490 57 Updated May 19, 2026

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,599 10,280 Updated Nov 12, 2025

ModelTC / LightX2V

Light Image Video Generation Inference Framework

Python 2,392 216 Updated Jun 13, 2026

tokenbender / mHC-manifold-constrained-hyper-connections

implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880

Shell 362 34 Updated Feb 17, 2026

test-time-training / e2e

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 620 47 Updated Feb 15, 2026

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,531 265 Updated Apr 15, 2026

lillian039 / VARC

Python 238 17 Updated Nov 26, 2025

Alibaba-Quark / LiveAvatar

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 2,145 241 Updated May 31, 2026

meituan-longcat / LongCat-Video

Python 4,317 679 Updated May 27, 2026

vipshop / cache-dit

A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.

Python 1,199 75 Updated Jun 12, 2026

fla-org / hybrid-distillation

Python 33 3 Updated Dec 31, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,397 274 Updated Sep 12, 2025

Dao-AILab / sonic-moe

Accelerating MoE with IO and Tile-aware Optimizations

Python 713 89 Updated Jun 13, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,394 698 Updated May 17, 2026

kmccleary3301 / nested_learning

A Reproduction of GDM's Nested Learning Paper

Python 697 101 Updated Feb 25, 2026

stepfun-ai / PaCoRe

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 334 15 Updated Feb 5, 2026

LJungang / Awesome-Video-Reasoning-Landscape

🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.

182 9 Updated May 5, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zefan Cai Zefan-Cai

Achievements

Achievements

Highlights

Block or report Zefan-Cai

Stars

tianyi-lab / PoLar

thinking-machines-lab / tinker-cookbook

multica-ai / andrej-karpathy-skills

jackwener / OpenCLI

kyegomez / OpenMythos

weilicao / SPScanner

AutoX-AI-Labs / AutoR

luwill / research-skills

karpathy / autoresearch

TheToughCrane / nano-kvllm

UniPat-AI / UniScientist

SakanaAI / doc-to-lora

openclaw / openclaw

chenfengxu714 / StreamDiffusionV2

karpathy / nanoGPT

ModelTC / LightX2V

tokenbender / mHC-manifold-constrained-hyper-connections

test-time-training / e2e

thu-ml / TurboDiffusion

lillian039 / VARC

Alibaba-Quark / LiveAvatar

meituan-longcat / LongCat-Video

vipshop / cache-dit

fla-org / hybrid-distillation

guandeh17 / Self-Forcing

Dao-AILab / sonic-moe

sgl-project / mini-sglang

kmccleary3301 / nested_learning

stepfun-ai / PaCoRe

LJungang / Awesome-Video-Reasoning-Landscape