Skip to content
View zhuango's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peking

Block or report zhuango

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
191 stars written in Python
Clear filter

LLM as a Chatbot Service

Python 3,340 380 Updated Nov 20, 2023

Hidden Markov Models in Python, with scikit-learn like API

Python 3,277 747 Updated Oct 31, 2024

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,262 420 Updated Nov 7, 2025

Home of CodeT5: Open Code LLMs for Code Understanding and Generation

Python 3,080 485 Updated Jan 20, 2024

Resources and Implementations of Generative Adversarial Nets: GAN, DCGAN, WGAN, CGAN, InfoGAN

Python 3,073 799 Updated Sep 8, 2017

GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)

Python 3,016 677 Updated Oct 30, 2023

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

Python 2,938 447 Updated Nov 7, 2022

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,890 540 Updated Nov 7, 2025

Make huge neural nets fit in memory

Python 2,821 277 Updated Apr 26, 2020

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,689 290 Updated Aug 14, 2024

Reproduce R1 Zero on Logic Puzzle

Python 2,410 162 Updated Mar 20, 2025

Code for the paper "Improved Techniques for Training GANs"

Python 2,331 623 Updated Nov 21, 2018

Minimalistic large language model 3D-parallelism training

Python 2,299 253 Updated Sep 3, 2025

Multi-Task Deep Neural Networks for Natural Language Understanding

Python 2,257 415 Updated Mar 7, 2024

WaveRNN Vocoder + TTS

Python 2,173 697 Updated Jul 2, 2022

Fully open data curation for reasoning models

Python 2,135 177 Updated Sep 3, 2025

Dataset of GPT-2 outputs for research in detection, biases, and more

Python 2,002 550 Updated Dec 13, 2023

DeepIE: Deep Learning for Information Extraction

Python 1,945 351 Updated Dec 9, 2022

Hopfield Networks is All You Need

Python 1,868 213 Updated Apr 23, 2023

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,823 134 Updated Jan 17, 2025

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Python 1,781 247 Updated Feb 18, 2023

Chinese Data Competitions' Solutions

Python 1,773 397 Updated Apr 5, 2019

Official Repository of Absolute Zero Reasoner

Python 1,737 290 Updated Aug 24, 2025

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,593 193 Updated Aug 12, 2020

A pure Python interface to the Raspberry Pi camera module

Python 1,580 352 Updated Dec 24, 2022

A very simple generative adversarial network (GAN) in PyTorch

Python 1,540 447 Updated Jun 30, 2021

🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI

Python 1,520 285 Updated Aug 9, 2021
Python 1,495 113 Updated May 12, 2023

Neural machine translation and sequence learning using TensorFlow

Python 1,484 383 Updated Oct 14, 2023

Reference implementations of MLPerf® inference benchmarks

Python 1,480 588 Updated Nov 6, 2025