Benchmark and deploy optimized LLM models on GPU servers with vLLM or SGLang. Choose from a list of optimized recipes for popular models or create your own with custom configurations. Run benchmarks…

Python 60 8 Updated Jun 22, 2026

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,428 434 Updated Jan 17, 2026

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,759 13,072 Updated Jun 22, 2026

Dao-AILab / dao-ailab.github.io

HTML 1 2 Updated Jun 19, 2026

just-every / code

Forked from openai/codex

Every Code - push frontier AI to its limits. A fork of the Codex CLI with validation, automation, browser integration, multi-agents, theming, and much more. Orchestrate agents from OpenAI, Claude, G…

Rust 3,809 232 Updated Jun 22, 2026

Jupyter Notebook 12 2 Updated Dec 19, 2025

NVIDIA-NeMo / DataDesigner

🎨 NeMo Data Designer: Generate high-quality synthetic data from scratch or from seed data.

Python 2,025 187 Updated Jun 22, 2026

Alecco alecco

Stars

IST-DASLab / Quartet-II

fal-ai / flashpack

anonymous452026 / ngpt-nvfp4

zlab-princeton / strong-distill

qlabs-eng / slowrun

open-lm-engine / coda-kernels

ighoshsubho / lighthouse-attention

NousResearch / hermes-agent

SakanaAI / sparser-faster-llms

KellerJordan / Muon

KellerJordan / modded-nanogpt

cornell-zhang / SmoothE

lightseekorg / tokenspeed

NVIDIA / cudnn-frontend

PluralisResearch / node0

gau-nernst / learn-cuda

cloudrift-ai / deplodock

thu-ml / SageAttention

alshedivat / al-folio

Dao-AILab / dao-ailab.github.io

just-every / code

MoonshotAI / FlashKDA

L-z-Chen / data-run

Dao-AILab / sonic-moe

Avarok-Cybersecurity / dgx-vllm

flashinfer-ai / flashinfer

4rtemi5 / rbf_attention

simple-stories / simple_stories_train

simple-stories / simple_stories_generate

NVIDIA-NeMo / DataDesigner