Skip to content
View demi6od's full-sized avatar
  • https://twitter.com/demi6od

Block or report demi6od

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Feb 7, 2025
Python 1 Updated Dec 8, 2023

Making large AI models cheaper, faster and more accessible

Python 41,371 4,521 Updated Mar 30, 2026

A playbook for systematically maximizing the performance of deep learning models.

29,978 2,423 Updated Jun 18, 2024

MLNLP社区翻译的NLP入门课程。

HTML 179 17 Updated Jan 17, 2023

北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对学术界和工业界进行了相关总结。

1,813 261 Updated Apr 6, 2023

维基百科中文语料整理

Python 302 34 Updated Mar 7, 2018

PERT: Pre-training BERT with Permuted Language Model

369 25 Updated Jul 15, 2025

Collection of papers and resources for data augmentation for NLP.

833 77 Updated Aug 12, 2022

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,242 545 Updated Feb 6, 2026

行业内关于智能客服、聊天机器人的应用和架构、算法分享和介绍

1,349 248 Updated Aug 26, 2025

Baseline Systems of DuReader Dataset

Python 1,170 307 Updated May 26, 2022

机器阅读理解 冠军/亚军代码及中文预训练MRC模型

Python 743 151 Updated Nov 19, 2022

A library for efficient similarity search and clustering of dense vectors.

C++ 39,598 4,312 Updated Apr 3, 2026

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation

Python 497 62 Updated May 8, 2023

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

Python 785 124 Updated Dec 19, 2023

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing…

MDX 24,691 2,695 Updated Apr 3, 2026

⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.

Python 617 191 Updated Apr 30, 2020

Collections of Chinese reading comprehension datasets

221 27 Updated Dec 19, 2019

搜索所有中文NLP数据集,附常用英文NLP数据集

Python 4,426 625 Updated Nov 21, 2022

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Python 1,863 316 Updated Apr 6, 2023

Multiple paper open-source codes of the Microsoft Research Asia DKI group

Python 386 61 Updated Nov 10, 2023

ACL 2019论文复现:Improving Multi-turn Dialogue Modelling with Utterance ReWriter

Python 138 23 Updated Jan 23, 2020

《Machine Learning Systems: Design and Implementation》 (V2 is launching soon)

TeX 4,796 476 Updated Mar 15, 2026

该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记

C++ 4,027 648 Updated Aug 18, 2023

pandas中文教程

Jupyter Notebook 5,114 1,934 Updated Apr 24, 2024

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,632 2,074 Updated Nov 3, 2023

Zero-shot dialogue state tracking (DST)

Python 83 15 Updated Nov 18, 2021

自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)

Python 1,331 187 Updated Jan 5, 2024

Text2Cor: Sequence to Sequence Coreference Resolution

Perl 8 Updated Oct 14, 2021
Next