Skip to content
View Zefan-Cai's full-sized avatar

Highlights

  • Pro

Block or report Zefan-Cai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for "Skip a Layer or Loop It? Learning Program-of-Layers in LLMs (ICML 2026 Oral)"

Python 6 Updated Jun 11, 2026

Post-training with Tinker

Python 3,466 446 Updated Jun 13, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

175,116 17,861 Updated Apr 20, 2026

Make Any Website into CLI & Use your logged-in browser by AI agent.

JavaScript 24,325 2,435 Updated Jun 14, 2026

A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.

Python 13,838 3,122 Updated May 23, 2026

[COLM '25] Single-Pass Document Scanning for Question Answering

Python 14 Updated Aug 20, 2025

AI handles execution, humans own the direction, and every run becomes an inspectable research artifact on disk.

Python 854 23 Updated Jun 2, 2026

Some commonly used research experiences and processes are encapsulated into Agent skills.

TypeScript 664 82 Updated May 11, 2026

AI agents running research on single-GPU nanochat training automatically

Python 86,651 12,552 Updated Mar 26, 2026

This project aims to provide a high effective KV cache manage framework for llm inference and improve memory utilization and inference speed.

Python 61 2 Updated Apr 24, 2026

UniScientist is designed to advance universal scientific research intelligence through a unified paradigm

Python 163 12 Updated Mar 14, 2026

Hypernetworks that update LLMs to remember factual information

Python 745 96 Updated May 25, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 378,656 79,184 Updated Jun 14, 2026

StreamDiffusion, Live Stream APP

Python 490 57 Updated May 19, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,604 10,282 Updated Nov 12, 2025

Light Image Video Generation Inference Framework

Python 2,394 216 Updated Jun 13, 2026

implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880

Shell 362 34 Updated Feb 17, 2026

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 620 47 Updated Feb 15, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,531 265 Updated Apr 15, 2026
Python 238 17 Updated Nov 26, 2025

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 2,146 241 Updated May 31, 2026

A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.

Python 1,199 75 Updated Jun 12, 2026
Python 33 3 Updated Dec 31, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,397 274 Updated Sep 12, 2025

Accelerating MoE with IO and Tile-aware Optimizations

Python 713 89 Updated Jun 13, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,395 698 Updated May 17, 2026

A Reproduction of GDM's Nested Learning Paper

Python 697 101 Updated Feb 25, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 334 15 Updated Feb 5, 2026

🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.

182 9 Updated May 5, 2026
Next