Stars
The AI-Native Search Database. Unifies vector, text, structured and semi-structured data in a single engine, enabling hybrid search and in-database AI workflows.
A high-throughput and memory-efficient inference and serving engine for LLMs
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
A generative speech model for daily dialogue.
A service that can convert ChatGPT on the web to OpenAI API format.
Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)
A handy PDF-to-JSON conversion tool for academic papers implemented in Python.
Get [Google, Yandex, Baidu, Bing, DuckDuckGo] search results via API for free 🎉
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
TUDB-Labs / GPTuner
Forked from SolidLao/GPTuner
GPTuner is a manual-reading database tuning system that leverages domain knowledge automatically and extensively to enhance the knob tuning process.
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
An Efficient "Factory" to Build Multiple LoRA Adapters
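[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint), a Chinese-English bilingual multimodal large model series based on CPM foundation models
Fay is an agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.
Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional-simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …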
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
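MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus that aims to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poems, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.
Large Scale Chinese Corpus for NLP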
TigerBot: A multi-language multi-task LLM
QLoRA: Efficient Finetuning of Quantized LLMs
SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue large language model)