Skip to content
View wenjunyang's full-sized avatar

Organizations

@NLPchina

Block or report wenjunyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 16,216 2,279 Updated Nov 7, 2025

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 109,859 11,438 Updated Nov 7, 2025

Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍

Shell 23,741 3,516 Updated Nov 7, 2025

Making large AI models cheaper, faster and more accessible

Python 41,227 4,540 Updated Nov 7, 2025

YSDA course in Natural Language Processing

Jupyter Notebook 10,358 2,716 Updated Nov 7, 2025

A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.

Go 100,405 14,595 Updated Nov 6, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,424 1,305 Updated Nov 6, 2025

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,205 991 Updated Oct 16, 2025

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,509 705 Updated Sep 27, 2025

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.

Python 3,879 419 Updated Sep 10, 2025

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,805 1,560 Updated Sep 8, 2025

An Open-source Neural Hierarchical Multi-label Text Classification Toolkit

Python 1,910 410 Updated Jul 7, 2025

一键命令下载飞书文档为 Markdown

Go 1,801 170 Updated Apr 8, 2025

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,741 5,273 Updated Nov 15, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,260 767 Updated Oct 16, 2024

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

Python 2,386 433 Updated Sep 3, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,493 3,298 Updated Aug 17, 2024

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,963 3,620 Updated Jul 28, 2024

深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI

Jupyter Notebook 3,502 857 Updated Jul 25, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,149 5,216 Updated Jun 27, 2024

Fast parallel CTC.

Cuda 4,073 1,036 Updated Mar 4, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,725 483 Updated Jan 8, 2024

自动构建中文词库:http://www.matrix67.com/blog/archives/5044

Java 656 221 Updated Dec 5, 2023

xkcd styled chart lib

JavaScript 7,744 199 Updated Dec 2, 2023

all kinds of text classification models and more with deep learning

Python 7,938 2,561 Updated Sep 28, 2023

个人博客,看效果进入

CSS 1,417 843 Updated Sep 6, 2023

天涯 kkndme 神贴聊房价

19,281 3,883 Updated Aug 27, 2023

Simple tutorials using Google's TensorFlow Framework

Jupyter Notebook 6,023 1,498 Updated Aug 20, 2023

A PyTorch-based knowledge distillation toolkit for natural language processing

Python 1,684 247 Updated May 8, 2023
Next