Yong Wu
yongwww
MLSys Engineer @ Nvidia | FlashInfer and Machine Learning Compiler LLM co-design
@Nvidia Redmond, WA
Gaurav Kumar
liquidslr
AWS | CS Grad USC | Interested in ML, Algorithms, Distributed Systems and System Design
AWS New York
Tom Turney
TheTom
Working on LLM inference systems, KV cache compression, and kernel-level optimizations (TurboQuant).
Texas
SemiAnalysisAI
SemiAnalysisAI
Open Source Projects By SemiAnalysis in collaboration with the community. InferenceX™ & soon Assembly ISA level microbenchmarking
United States of America
Venkat Raman
Venkat2811
staff engineer, oss, distributed systems, low latency, inference
Berlin, Germany
Yuri Chervonyi
ychervonyi
Research engineer, PhD in String theory and Supergravity
Google San Francisco Bay Area
Zane Hambly
Zaneham
Kia ora (Hello).
Former firefighter, former soldier, currently studying and making software independently
Auckland, New Zealand
Peter Steinberger
steipete
Full-Time Open-Sourcerer Vienna & London
Ethan Weber
ethanweber
AI Research Scientist at Meta, Previously PhD at Berkeley, EECS at MIT BS '20 & MEng '21.
PreviousNext