Stars
Fully open reproduction of DeepSeek-R1
Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for creating ID photos.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
OCR, layout analysis, reading order, table recognition in 90+ languages
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
FauxPilot - an open-source alternative to GitHub Copilot server
AKShare is an elegant and simple open-source financial data interface library for Python, built for human beings!
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.
The official GitHub page for the survey paper "A Survey of Large Language Models".
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
Retrieval and Retrieval-augmented LLMs
Implementation of Nougat Neural Optical Understanding for Academic Documents
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Pythonic AI generation of images and videos
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
[EMNLP'23, ACL'24] To speed up LLM inference and enhance the LLM's perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
Create and modify Word documents with Python
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.