Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,459 142 Updated Mar 7, 2025

google-research / deduplicate-text-datasets

Rust 1,256 127 Updated Jul 30, 2024

baichuan-inc / Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Python 4,124 292 Updated Nov 8, 2024

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,442 705 Updated Dec 17, 2025

CLUEbenchmark / SuperCLUE

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

3,270 112 Updated Sep 8, 2025

opendatalab / WanJuan1.0

万卷1.0多模态语料

569 28 Updated Oct 20, 2023

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,982 1,668 Updated Nov 26, 2025

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,616 588 Updated Oct 24, 2024

jeinlee1991 / chinese-llm-benchmark

ReLE评测：中文AI大模型能力评测（持续更新）：目前已囊括335个大模型，覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.5、文心ERNIE-X1.1、ERNIE-5.0-Thinking、qwen3-max、百川、讯飞星火、商汤senseChat等商用模型，以及kimi-k2、ernie4.5、minimax-M2、deepseek-…

5,302 211 Updated Dec 19, 2025

haonan-li / CMMLU

CMMLU: Measuring massive multitask language understanding in Chinese

Python 797 65 Updated Dec 6, 2024

zhangbc / eBooks

eBook分享大集合：主要以IT领域经典书籍收藏，以备不时之需。

2,011 738 Updated May 14, 2021

csarron / ITBlogs

🔖 Collecting tech blogs and WeChat official accounts

361 81 Updated May 20, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

panbq

Block or report panbq

Stars

OPPO-Mente-Lab / DaMo

BradyFU / Awesome-Multimodal-Large-Language-Models

2U1 / Qwen-VL-Series-Finetune

StarsfieldAI / R1-V

datawhalechina / easy-rl

pengr / LLM-Synthetic-Data

fanqiwan / FuseAI

lmmlzn / Awesome-LLMs-Datasets

jiangnanboy / ad_detect_textcnn

XueFuzhao / OpenMoE

THUDM / LongBench

mehdiir / Roberta-Llama-Mistral

GAIR-NLP / MathPile

ZigeW / data_management_LLM

datalab-to / marker

deepseek-ai / DeepSeek-LLM

wgwang / awesome-LLMs-In-China

lamini-ai / llm-classifier

SkyworkAI / Skywork