Highlights
simple_inference_server Public
A straightforward OpenAI-compatible inference API server for hosting multiple small models at the edge.
annotaterb Public
Forked from drwl/annotaterb
A Ruby Gem that adds annotations to your Rails models and route files.
Ruby Other Updated Dec 19, 2025
logica_compiler.rb Public
Compile Logica programs to digested SQL + manifest.
Ruby MIT License Updated Dec 17, 2025
console-adapter-rails Public
Forked from socketry/console-adapter-rails
Ruby MIT License Updated Dec 9, 2025
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Nov 2, 2025
xformers Public
Forked from facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Python Other Updated Oct 27, 2025
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 Updated Oct 20, 2025
tvm Public
Forked from apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 Updated Oct 18, 2025
FBGEMM Public
Forked from pytorch/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
C++ Other Updated Oct 15, 2025
jetson-containers Public
Forked from dusty-nv/jetson-containers
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Jupyter Notebook Other Updated Oct 13, 2025
Wan2.2 Public
Forked from Wan-Video/Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
Python Apache License 2.0 Updated Oct 12, 2025
DeepGEMM Public
Forked from deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License Updated Oct 9, 2025
index-tts Public
Forked from index-tts/index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Python Other Updated Oct 1, 2025
vllm-flash-attention Public
Forked from vllm-project/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Sep 29, 2025
jetson-stacks Public
Yet another reimplementation of jetson-containers, targeting Jetson Thor, Spark, and x86.
bitsandbytes Public
Forked from bitsandbytes-foundation/bitsandbytes
8-bit CUDA functions for PyTorch
Python MIT License Updated Sep 26, 2025
nccl-tests Public
Forked from NVIDIA/nccl-tests
NCCL Tests
Cuda BSD 3-Clause "New" or "Revised" License Updated Sep 17, 2025
parity-common Public
Forked from paritytech/parity-common
Collection of crates used in Parity projects
Rust Apache License 2.0 Updated Sep 8, 2025
polkadot-sdk Public
Forked from paritytech/polkadot-sdk
The Parity Polkadot Blockchain SDK
Rust Updated Apr 7, 2025
substrate Public archive
Forked from paritytech/substrate
Substrate: The platform for blockchain innovators
Rust Apache License 2.0 Updated Apr 1, 2025
derive_more Public
Forked from JelteF/derive_more
Some more derive(Trait) options
Rust MIT License Updated Mar 26, 2025
http_accept_language Public
Forked from trammel/http_accept_language
Ruby on Rails plugin. Fishes out the Accept-Language header into an array.
Ruby Updated Mar 13, 2025