Starred repositories
The agent that grows with you
vLLM fork for Tesla V100 (SM70) with AWQ 4-bit support, CUDA 12.8 build flow, and validated Qwen3.5 27B/35B deployment on multi-GPU V100.
ProxyPin: free, open-source software for capturing HTTP(S) traffic, with support for all platforms
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Visualizer for neural network, deep learning and machine learning models
SGLang is a high-performance serving framework for large language models and multimodal models.
🧩 Monibuca is a modular, extensible framework for building streaming servers
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex, OpenCode, openclaw & Gemini CLI.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A high-throughput and memory-efficient inference and serving engine for LLMs
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
MCP Server for Computer Use in Windows
A modern GUI client based on Tauri, designed to run on Windows, macOS, and Linux for a tailored proxy experience
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A simple zero-config tool to make locally trusted development certificates with any names you'd like.
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
Simple framework for creating REST APIs
The Python micro framework for building web applications.