Stars
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves layout; supports services such as Google/DeepL/Ollama/OpenAI and provides CLI/GUI/MCP/Docker/Zotero
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
Python APIs for web automation, testing, and bypassing bot-detection with ease.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
🔗 Some useful websites for programmers.
An MLX port of FLUX based on the Huggingface Diffusers implementation.
chsrc: a cross-platform, general-purpose source-switching tool and framework. Change Source everywhere for every software
Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.
Minecraft AI with LLMs+Mineflayer
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team
Get your documents ready for gen AI
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Python sample codes and textbook for robotics algorithms.
Train transformer language models with reinforcement learning.
TinyChatEngine: On-Device LLM Inference Library
Agentic components of the Llama Stack APIs