Redmond, WA
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Python Apache License 2.0 Updated Mar 31, 2026
tvm Public
Forked from apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 Updated Mar 25, 2026
openclaw Public
Forked from openclaw/openclaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
TypeScript MIT License Updated Mar 20, 2026
ci Public
Forked from tlc-pack/ci
Repository which handles configuration of TVM CI infrastructure.
Python Apache License 2.0 Updated Feb 10, 2026
terraform-aws-github-runner Public
Forked from github-aws-runners/terraform-aws-github-runner
Terraform module for scalable GitHub action runners on AWS
TypeScript MIT License Updated Jan 24, 2026
vibetensor Public
Forked from NVlabs/vibetensor
Our first fully AI generated deep learning system
Python Apache License 2.0 Updated Jan 22, 2026
flashinfer-bench Public
Forked from flashinfer-ai/flashinfer-bench
Building the Virtuous Cycle for AI-driven LLM Systems
Python Apache License 2.0 Updated Dec 17, 2025
dynamo Public
Forked from ai-dynamo/dynamo
A Datacenter Scale Distributed Inference Serving Framework
Rust Apache License 2.0 Updated Dec 10, 2025
NeMo Public
Forked from NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python Apache License 2.0 Updated Dec 10, 2025
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 Updated Dec 9, 2025
tilelang Public
Forked from tile-ai/tilelang
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
C++ Other Updated Oct 17, 2025
cutlass Public
Forked from NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
C++ Other Updated Oct 1, 2025
cutlass_fpA_intB_gemm Public
Forked from tlc-pack/cutlass_fpA_intB_gemm
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
C++ Apache License 2.0 Updated Aug 7, 2025
mlc-llm Public
Forked from mlc-ai/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Python Apache License 2.0 Updated Jul 25, 2025
Genesis Public
Forked from Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
Python Apache License 2.0 Updated Apr 15, 2025
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ MIT License Updated Sep 20, 2024
diffusers Public
Forked from huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Python Apache License 2.0 Updated Jun 13, 2024
rust Public
Forked from rust-lang/rust
Empowering everyone to build reliable and efficient software.
Rust Other Updated Jun 5, 2024
gpt-fast Public
Forked from meta-pytorch/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Python BSD 3-Clause "New" or "Revised" License Updated May 8, 2024
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated May 3, 2024
llm.c Public
Forked from karpathy/llm.c
LLM training in simple, raw C/CUDA
Cuda MIT License Updated May 3, 2024
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Feb 27, 2024
web-llm Public
Forked from mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Python Apache License 2.0 Updated May 18, 2023
stablehlo Public
Forked from openxla/stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
MLIR Apache License 2.0 Updated Mar 22, 2023
jax Public
Forked from jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python Apache License 2.0 Updated Mar 9, 2023
relax Public
Forked from tlc-pack/relax
Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.
Python Apache License 2.0 Updated Feb 23, 2023