- University of Virginia
- Charlottesville, VA
- http://tddg.github.io
- https://ds2-lab.github.io/
- @yuecheng87
- in/yue-cheng
Starred repositories
Memory Sparse Attention - an end-to-end trainable memory framework for 100M-token (hundred-million-scale) contexts
AI agents that automatically run research on single-GPU nanochat training
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
ZipLLM: An efficient, lossless data reduction pipeline for large-scale LLM storage (NSDI'26)
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Create Epic Math and Physics Animations & Study Notes From Text and Images.
💫 Toolkit to help you get started with Spec-Driven Development
Open-source implementation of AlphaEvolve
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
What if we could pack single purpose, powerful AI Agents into a single python file?
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Inspect a command's effects before modifying your live system
21 Lessons, Get Started Building with Generative AI
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
A command-line productivity tool powered by AI large language models like GPT-5 that helps you accomplish your tasks faster and more efficiently.
Envision a future where everyone can read all the code of an educational operating system.
Sparsity-aware deep learning inference runtime for CPUs
LLM Serving Performance Evaluation Harness