brightmart

Organizations

@CLUEbenchmark


Chinese precise instruction-following evaluation benchmark (open-source version)

Python 7 1 Updated Aug 12, 2025

Math24o: High School Olympiad Mathematics Chinese Benchmark (evaluation set of high school Olympiad math competition problems)

Python 11 Updated Mar 27, 2025

Llama3 Chinese Evaluation with SuperCLUE: a comprehensive evaluation of the Chinese versions of the open-source Llama3 models, based on the SuperCLUE benchmark

16 Updated Apr 21, 2024

Chinese-native retrieval-augmented generation evaluation benchmark

130 4 Updated Apr 18, 2024

Chinese-native industrial evaluation benchmark

15 Updated Mar 21, 2024

Chinese-native multi-level text-to-video evaluation benchmark

18 1 Updated Jul 8, 2024

Chinese-native graded coding-ability test benchmark

15 1 Updated Apr 11, 2024

SC-Safety: multi-turn adversarial safety benchmark for Chinese large language models

150 12 Updated Mar 15, 2024

Instruction Tuning with GPT-4

HTML 4,337 309 Updated Jun 11, 2023

Llama2 Chinese evaluation with SuperCLUE: a comprehensive evaluation of the Chinese versions of the open-source Llama2 models, based on SuperCLUE's OPEN benchmark

127 8 Updated Aug 2, 2023

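Llama Chinese community: a real-time roundup of the latest Llama learning materials, building the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use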

Python 14,726 1,304 Updated Apr 6, 2025

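ReLE evaluation: capability evaluation of Chinese AI large models (continuously updated). Currently covers 374 large models, including commercial models such as chatgpt, gpt-5.4, Google gemini-3.1-pro, Claude-4.6, Wenxin ERNIE-X1.1, ERNIE-5.0, qwen3.6-max, qwen3.6-plus, Baichuan, iFlytek Spark, and SenseTime senseChat, as well as step3.5-flash, kimi-k2.6, ernie4.5, Mini…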

5,925 241 Updated Apr 26, 2026

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 16,763 1,780 Updated Mar 10, 2026

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,976 307 Updated Aug 9, 2025

SuperCLUE automatic machine grading system for Gaokao (college entrance exam) essays

19 Updated Jun 8, 2023

SuperCLUE Langya Leaderboard: anonymous battle-based evaluation benchmark for Chinese general-purpose large models

144 6 Updated Jun 19, 2024

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 37,932 6,193 Updated Nov 10, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 183,825 46,230 Updated Apr 27, 2026

LlamaIndex is the leading document agent and OCR platform

Python 48,982 7,321 Updated Apr 27, 2026

BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue large model)

HTML 8,284 767 Updated Oct 16, 2024

ChatYuan: Large Language Model for Dialogue in Chinese and English

Python 1,874 178 Updated Jun 16, 2023

Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)

Python 689 54 Updated Oct 30, 2022

pCLUE: multi-task prompt learning dataset with 1,000,000+ examples

Jupyter Notebook 506 60 Updated Oct 4, 2022

PromptCLUE: a zero-shot learning model supporting all Chinese-language tasks

Jupyter Notebook 665 65 Updated Jun 16, 2023

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,253 545 Updated Feb 6, 2026

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,557 249 Updated Apr 24, 2024

The largest Chinese knowledge graph to date, at a scale of 140 million, open source and available for download

Python 5,169 738 Updated Dec 6, 2023

A latent text-to-image diffusion model

Jupyter Notebook 72,946 10,612 Updated Jun 18, 2024

One-stop automated open-source annotation platform

Java 80 16 Updated Aug 25, 2022