Skip to content
View zhuango's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peking

Block or report zhuango

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
191 stars written in Python
Clear filter

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 1,432 109 Updated Aug 5, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,425 228 Updated Mar 20, 2024

Transformer seq2seq model, program that can build a language translator from parallel corpus

Python 1,417 351 Updated May 19, 2023
Python 1,348 53 Updated Nov 21, 2024

Scalable data pre processing and curation toolkit for LLMs

Python 1,202 187 Updated Nov 6, 2025

An implementation of Performer, a linear attention-based transformer, in Pytorch

Python 1,156 148 Updated Feb 2, 2022

利用Python实现中文文本关键词抽取,分别采用TF-IDF、TextRank、Word2Vec词聚类三种方法。

Python 1,154 377 Updated Jan 16, 2018

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Python 1,139 151 Updated Oct 1, 2024

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Python 1,072 305 Updated Mar 25, 2021

Customizable implementation of the self-instruct paper.

Python 1,051 67 Updated Mar 7, 2024

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Python 1,042 44 Updated May 31, 2024

LLMPerf is a library for validating and benchmarking LLMs

Python 1,039 191 Updated Dec 9, 2024

A lightweight multilingual LLM

Python 1,004 48 Updated Aug 8, 2025

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 995 62 Updated Dec 6, 2024

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

Python 963 160 Updated Oct 22, 2022

Fast inference from large lauguage models via speculative decoding

Python 848 89 Updated Aug 22, 2024

Crawl BookCorpus

Python 846 109 Updated Jul 14, 2023

中文实体关系抽取,pytorch,bilstm+attention

Python 766 179 Updated Nov 13, 2021

fastHan是基于fastNLP与pytorch实现的中文自然语言处理工具,像spacy一样调用方便。

Python 759 88 Updated Dec 9, 2023

A series of technical report on Slow Thinking with LLM

Python 743 41 Updated Aug 13, 2025

Plug-and-Play Image Restoration with Deep Denoiser Prior (IEEE TPAMI 2021) (PyTorch)

Python 724 110 Updated Nov 21, 2022

Expert Specialized Fine-Tuning

Python 708 260 Updated May 22, 2025

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text

Python 681 196 Updated Sep 19, 2021

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 676 50 Updated Jan 20, 2025

A BERT-based Chinese Text Encoder Enhanced by N-gram Representations

Python 649 106 Updated Jul 24, 2022

Large-scale model inference.

Python 629 86 Updated Sep 12, 2023

Reference code for the paper: Deep White-Balance Editing (CVPR 2020). Our method is a deep learning multi-task framework for white-balance editing.

Python 589 70 Updated Jul 5, 2023

The codes of paper "Long Text Generation via Adversarial Training with Leaked Information" on AAAI 2018. Text generation using GAN and Hierarchical Reinforcement Learning.

Python 576 183 Updated Jul 2, 2022

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 573 70 Updated Sep 11, 2024