Starred repositories
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
A high-throughput and memory-efficient inference and serving engine for LLMs
This is the repo for the survey of Bias and Fairness in IR with LLMs.
A general fine-tuning kit geared toward image/video/audio diffusion models.
m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
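A collection of legal cases in China involving web crawlers. This project gathers news, reference materials, and laws and regulations related to lawsuits and violations involving web crawler developers in mainland China. It aims to help practitioners in the crawler industry working in mainland China understand the relevant laws and avoid crossing data-compliance red lines.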
A guidance language for controlling large language models.
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models