demi6od

Chen Zhang (@demi6od) demi6od

SenseTime Senior AI Research Scientist SenseNova & SenseChat Group Core Member

285 followers · 1 following

https://twitter.com/demi6od

Achievements

Stars

demi6od / o1_agent

Python 1 Updated Feb 7, 2025

demi6od / sim_gpt4

Python 1 Updated Dec 8, 2023

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,370 4,518 Updated Apr 9, 2026

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

30,002 2,421 Updated Jun 18, 2024

MLNLP-World / NLP-Course-Chinese

MLNLP社区翻译的NLP入门课程。

HTML 179 17 Updated Jan 17, 2023

BDBC-KG-NLP / QA-Survey-CN

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答（KBQA），基于文本的问答系统（TextQA），基于表格的问答系统（TableQA）、基于视觉的问答系统（VisualQA）和机器阅读理解（MRC）等，每类任务分别对学术界和工业界进行了相关总结。

1,814 261 Updated Apr 6, 2023

mattzheng / ChineseWiki

维基百科中文语料整理

Python 302 34 Updated Mar 7, 2018

ymcui / PERT

PERT: Pre-training BERT with Permuted Language Model

369 25 Updated Jul 15, 2025

styfeng / DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

833 77 Updated Aug 12, 2022

CLUEbenchmark / CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,246 545 Updated Feb 6, 2026

lizhe2004 / chatbot-list

行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍

1,350 248 Updated Aug 26, 2025

baidu / DuReader

Baseline Systems of DuReader Dataset

Python 1,169 308 Updated May 26, 2022

luhua-rain / MRC_Competition_Dureader

机器阅读理解冠军/亚军代码及中文预训练MRC模型

Python 743 151 Updated Nov 19, 2022

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 39,666 4,323 Updated Apr 9, 2026

thu-coai / KdConv

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation

Python 497 62 Updated May 8, 2023

PaddlePaddle / RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

Python 785 124 Updated Dec 19, 2023

deepset-ai / haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing…

MDX 24,778 2,703 Updated Apr 9, 2026