Starred repositories
π RuView: WiFi DensePose turns commodity WiFi signals into real-time human pose estimation, vital sign monitoring, and presence detection — all without a single pixel of video.
Inference of Mamba, Mamba2 and Mamba3 models in pure C
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
Supercharge Your LLM with the Fastest KV Cache Layer
Lightweight toolkit package to train and fine-tune 1.58bit Language models
Minimalistic 4D-parallelism distributed training framework for educational purposes
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
BertViz: Visualize Attention in Transformer Models
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model Runner, etc., but with fewer moving parts and simple deployments b…
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Examples and guides for using the Gemini API
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Python binding for the curl-impersonate fork via cffi. An HTTP client that can impersonate browser TLS/JA3/HTTP2 fingerprints.
Official inference library for Mistral models
Plug-and-Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
FlashInfer: Kernel Library for LLM Serving
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
🖱️ Generate human-like mouse movements with puppeteer or on any 2D plane
Official JS client for ClickHouse DB
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models
Loads environment variables from .env for Node.js projects.