Lists (2)
Sort Name ascending (A-Z)
Starred repositories
SGLang is a fast serving framework for large language models and vision language models.
🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Production-Grade Container Scheduling and Management
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS / MPEG-TS / RTP media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The FreeBSD src tree publish-only repository. Experimenting with 'simple' pull requests....
The repository for high quality TypeScript type definitions.
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
A retargetable MLIR-based machine learning compiler and runtime toolkit.
A high-throughput and memory-efficient inference and serving engine for LLMs
FlashInfer: Kernel Library for LLM Serving
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
An open-source C++ library developed and used at Facebook.
Development repository for the Triton language and compiler
ROCm / llvm-project
Forked from llvm/llvm-projectThis is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit ups…
An open-source, cross-platform terminal for seamless workflows
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A task runner / simpler Make alternative written in Go
oneAPI Threading Building Blocks (oneTBB)
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
🚀 The fast, Pythonic way to build MCP servers and clients
🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。
Simple, scalable AI model deployment on GPU clusters