Stars
Making large AI models cheaper, faster and more accessible
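🎯 Say goodbye to information overload: AI helps you make sense of trending news, with simple public-opinion monitoring and analysis. Multi-platform trending-topic aggregation plus an MCP-based AI analysis tool. Monitors 35 platforms (Douyin, Zhihu, Bilibili, Wallstreetcn, Cailian Press, and others), with smart filtering, automatic push, and conversational AI analysis (dig into the news in natural language with 13 tools, including trend tracking, sentiment analysis, and similarity search). Supports push via WeCom/personal WeChat/Feishu/DingTalk/Telegram/email/ntfy/bark/slack, phone notifications within 1 minute, no need for…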
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
aider is AI pair programming in your terminal
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
Federated query engine for AI - The only MCP Server you'll ever need
Instant voice cloning by MIT and MyShell. Audio foundation model.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
DSPy: The framework for programming—not prompting—language models
Convert PDF to markdown + JSON quickly with high accuracy
Code and documentation to train Stanford's Alpaca models, and generate the data.
A modular graph-based Retrieval-Augmented Generation (RAG) system
The official Python library for the OpenAI API
Open-Sora: Democratizing Efficient Video Production for All
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Fully open reproduction of DeepSeek-R1
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Official inference framework for 1-bit LLMs
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.