1a1a11a

Juncheng Yang 1a1a11a

Assistant Professor at Harvard University, building modern data systems

262 followers · 109 following

Harvard University
Cambridge
http://jasony.me
@1a1a11a

Achievements

x4 x3 x2

Achievements

x4 x3 x2

Highlights

Organizations

Lists (2)

Sort

🔮 Future ideas

🚀 My stack

Starred repositories

z-lab / dflash

DFlash: Block Diffusion for Flash Speculative Decoding

Python 5,093 368 Updated May 10, 2026

gi-dellav / zerostack

Minimal coding agent written in Rust, optimized for memory footprint and performance

Rust 1,271 90 Updated Jun 13, 2026

pelikan-io / cache-rs

A collection of Rust implementation of state-of-the-art cache algorithms

Rust 2 1 Updated Jun 13, 2026

ai-christianson / RA.Aid

Develop software autonomously.

Python 2,224 209 Updated Jan 30, 2026

eugr / spark-vllm-docker

Docker configuration for running VLLM on dual DGX Sparks

Shell 1,601 290 Updated Jun 12, 2026

Liquid4All / cookbook

Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK

Jupyter Notebook 2,075 340 Updated Jun 12, 2026

ekzhang / bore

🕳 bore is a simple CLI tool for making tunnels to localhost

Rust 11,229 498 Updated Feb 4, 2026

RichardAtCT / claude-code-openai-wrapper

OpenAI API-compatible wrapper for Claude Code

Python 560 113 Updated May 4, 2026

UWASL / dedup-bench

DedupBench is a benchmarking tool for content-defined chunking techniques used in data deduplication. It currently supports eleven unique CDC techniques and five different vector instruction sets.

C++ 24 1 Updated Feb 20, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,109 893 Updated Jun 13, 2026

daos-stack / daos

DAOS Storage Stack (client libraries, storage engine, control plane)

C 945 349 Updated Jun 12, 2026

Michael-A-Kuykendall / shimmy

⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.

Rust 5,417 515 Updated Jun 11, 2026

cxl-micron-reskit / famfs

Forked from jagalactic/famfs

This is the user space repo for famfs, the fabric-attached memory file system

C 96 6 Updated May 25, 2026

NVIDIA / gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C 1,383 189 Updated Jun 13, 2026

cacheMon / libCacheSim-python

Python bindings for libCacheSim, designed for rapid experimentation with cache simulation models.

Python 7 3 Updated May 18, 2026

alibaba / ServeGen

A framework for generating realistic LLM serving workloads

Python 152 14 Updated May 11, 2026

mozilla-ai / any-agent

A single interface to use and evaluate different agent frameworks

Python 1,174 94 Updated Jun 8, 2026

BerriAI / litellm

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 50,263 8,856 Updated Jun 13, 2026

eunomia-bpf / agentsight

Zero instrucment system-level AI agent tracing in eBPF

C 446 63 Updated Jun 13, 2026

cacheMon / cache_dataset

A comprehensive open-source cache trace dataset

Jupyter Notebook 25 6 Updated Aug 23, 2025

pcodec / pcodec

Lossless codec for numerical data

Rust 486 29 Updated Jun 12, 2026

1a1a11a / libCacheSim

a high performance library for building cache simulators

C++ 332 109 Updated May 4, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 14,012 2,209 Updated Apr 26, 2026

AI-Hypercomputer / maxdiffusion

Python 358 75 Updated Jun 13, 2026

AI-Hypercomputer / JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 445 66 Updated Jan 5, 2026

sir-lab / data-release

Huawei Cloud datasets

Jupyter Notebook 91 13 Updated Jan 8, 2026

NVIDIA / nvbandwidth

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 712 83 Updated Apr 8, 2026

google-ai-edge / gallery

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 23,732 2,479 Updated Jun 12, 2026

facebookresearch / fastgen

Simple high-throughput inference library

Python 158 10 Updated Jun 10, 2026

facebookresearch / param

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

Juncheng Yang 1a1a11a

Highlights

Organizations

Lists (2)

🔮 Future ideas

🚀 My stack

Starred repositories

sieve-cache

s3fifo

s3-fifo

cache-simulation