Stars
- All languages
- ActionScript
- Assembly
- C
- C#
- C++
- CSS
- Clojure
- CoffeeScript
- DTrace
- Dockerfile
- Elixir
- Emacs Lisp
- F#
- Go
- Groovy
- HCL
- HTML
- Haskell
- Java
- JavaScript
- Jsonnet
- Jupyter Notebook
- Kotlin
- Less
- Lua
- MATLAB
- MDX
- Makefile
- OCaml
- Objective-C
- Objective-C++
- PHP
- Perl
- PowerShell
- Python
- QML
- Reason
- Roff
- Ruby
- Rust
- Scala
- Shell
- Smarty
- Standard ML
- Starlark
- Svelte
- Swift
- TeX
- Thrift
- TypeScript
- Vim Script
- Vue
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a perform…
Weighs the soul of incoming HTTP requests to stop AI crawlers
10x faster dynamic Protobuf parsing in Go that’s even 3x faster than generated code.
⎈ Multi pod and container log tailing for Kubernetes -- Friendly fork of https://github.com/wercker/stern
A container init that is so simple it's effectively brain-dead.
Next generation distributed, event-driven, parallel config management!
llm-d enables high-performance distributed LLM inference on Kubernetes
Mount remote repositories, models and datasets managed by Git LFS instantly.
A Rust based DNS client, server, and resolver
Gateway API Inference Extension
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
A flexible distributed key-value database that is optimized for caching and other realtime workloads.
Open-source Pricing and Billing Infrastructure 🚀 Subscription management, Invoicing, Pricing, Usage-based billing, Cost limiting, Grandfathering, Experiments, Revenue analytics & Actionable insights
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
A Datacenter Scale Distributed Inference Serving Framework
The YAML org maintained fork of https://github.com/go-yaml/yaml
Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)
A TTS model capable of generating ultra-realistic dialogue in one pass.