Skip to content
View wenjunyang's full-sized avatar

Organizations

@NLPchina

Block or report wenjunyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
20 results for source starred repositories written in Python
Clear filter

Making large AI models cheaper, faster and more accessible

Python 41,232 4,537 Updated Nov 10, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,158 5,216 Updated Jun 27, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,491 3,299 Updated Aug 17, 2024

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,745 5,274 Updated Nov 15, 2024

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,965 3,620 Updated Jul 28, 2024

Train transformer language models with reinforcement learning.

Python 16,244 2,287 Updated Nov 10, 2025

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,511 705 Updated Sep 27, 2025

all kinds of text classification models and more with deep learning

Python 7,938 2,561 Updated Sep 28, 2023

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,726 483 Updated Jan 8, 2024

A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型

Python 3,988 750 Updated Nov 21, 2022

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,879 419 Updated Sep 10, 2025

Learning Chinese Character style with conditional GAN

Python 2,678 480 Updated Aug 9, 2019

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Python 2,386 433 Updated Sep 3, 2024

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Python 1,911 410 Updated Jul 7, 2025

A PyTorch-based knowledge distillation toolkit for natural language processing

Python 1,685 247 Updated May 8, 2023

ChatGLM-6B 指令学习|指令数据|Instruct

Python 654 51 Updated Apr 10, 2023

中文医学知识图谱命名实体识别,包括bi-LSTM+CRF,transformer+CRF等模型

Python 248 54 Updated Jun 4, 2019

Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。

Python 86 27 Updated Apr 24, 2018

A HMM-like linear-chain CRF, used Tensorflow API. 🐣

Python 36 13 Updated Mar 21, 2018