Stars
The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》
✨✨Latest Advances on Multimodal Large Language Models
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
Witness the aha moment of VLM with less than $3.
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
A live reading list for LLM data synthesis (Updated to July, 2025).
Summarize existing representative LLMs text datasets.
textcnn for advertising detection,广告检测
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama, and Mistral for Disaster Tweets Analysis with Lora
[NeurlPS D&B 2024] Generative AI for Math: MathPile
Collection of training data management explorations for large language models
Convert PDF to markdown + JSON quickly with high accuracy
DeepSeek LLM: Let there be answers
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
A series of large language models developed by Baichuan Intelligent Technology
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
ReLE评测:中文AI大模型能力评测(持续更新):目前已囊括335个大模型,覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.5、文心ERNIE-X1.1、ERNIE-5.0-Thinking、qwen3-max、百川、讯飞星火、商汤senseChat等商用模型, 以及kimi-k2、ernie4.5、minimax-M2、deepseek-…
CMMLU: Measuring massive multitask language understanding in Chinese