Highlights
- Pro
-
flutter_rust_bridge Public
Flutter/Dart <-> Rust binding generator, feature-rich, but seamless and simple.
-
flutter_smooth Public
Achieve ~60 FPS, no matter how heavy the tree is to build/layout
-
dart_interactive Public
REPL (interactive shell) for Dart, supporting 3rd party packages, hot reload, and full grammar
-
sentry-dart Public
Forked from getsentry/sentry-dartSentry SDK for Dart and Flutter
-
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
-
flutter_convenient_test Public
Write and debug tests easily, with full action history, time travel, screenshots, rapid re-execution, video records, interactivity, isolation and more
-
torchft Public
Forked from meta-pytorch/torchftFault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
Python Other UpdatedMar 27, 2026 -
cargokit Public
Forked from irondash/cargokitIntegrate cargo build with flutter plugins and applications.
-
torch_memory_saver Public
Allow torch tensor memory to be released and resumed later
-
slime Public
Forked from THUDM/slimeslime is a LLM post-training framework aiming at scaling RL.
Python Apache License 2.0 UpdatedMar 3, 2026 -
ome Public
Forked from sgl-project/omeOME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
Go MIT License UpdatedMar 2, 2026 -
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
-
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedJan 25, 2026 -
Megatron-Bridge Public
Forked from NVIDIA-NeMo/Megatron-BridgeTraining library for Megatron-based models
-
FlashMLA Public
Forked from deepseek-ai/FlashMLAFlashMLA: Efficient Multi-head Latent Attention Kernels
C++ MIT License UpdatedJan 16, 2026 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
C++ Apache License 2.0 UpdatedDec 31, 2025 -
NeMo-Skills Public
Forked from NVIDIA-NeMo/SkillsA project to improve skills of large language models
Python Apache License 2.0 UpdatedDec 27, 2025 -
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedDec 19, 2025 -
mlperf-common Public
Forked from NVIDIA/mlperf-commonNVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions
Python Apache License 2.0 UpdatedNov 30, 2025 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
-
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedOct 23, 2025 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Python Apache License 2.0 UpdatedOct 20, 2025 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedOct 20, 2025 -
SpecForge Public
Forked from sgl-project/SpecForgeTrain speculative decoding models effortlessly and port them smoothly to SGLang serving.
-
LongBench Public
Forked from Fridge003/LongBenchLongBench v2 and LongBench (ACL 25'&24')
Python MIT License UpdatedSep 27, 2025 -
-
torch_utils Public
Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocations such as NCCL, ...)
-
flutter_portal Public
Evolved Overlay/OverlayEntry - declarative not imperative, intuitive-context, and easy-alignment
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedAug 26, 2025