Stars
Full-stack template for FastAPI, Go (chi), React 19 TypeScript, SCSS, TanStack Query, Zustand, Nginx, Docker
A minimal PyTorch re-implementation of Qwen 3.5
A template for rapidly building full-stack web applications in Python, featuring a FastAPI backend, a NiceGUI frontend, PostgreSQL, and Docker.
Model Context Protocol Servers
Visual testing tool for MCP servers
LLM Council works together to answer your hardest questions
Visualize your API endpoints and explore them interactively; also supports Django Ninja and Litestar
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
🚀 Efficient implementations of state-of-the-art linear attention models
Language modeling with linear-cost context
Solution for Waymo Motion Prediction Challenge 2022. Our implementation of MultiPath++
A collaboration-friendly studio for NeRFs
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Notes from the Latent Space paper club. Follow along or start your own!
Training framework with a goal to explore the frontier of sample efficiency of small language models
A lightweight web framework in C for building modern web applications
From baby GPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted from Karpathy’s baby GPT).
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A comprehensive 0-to-1 guide for building self-improving LLM applications with the DSPy framework
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…
Port of OpenAI's Whisper model in C/C++