A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,890 540 Updated Nov 7, 2025

cybertronai / gradient-checkpointing

Make huge neural nets fit in memory

Python 2,821 277 Updated Apr 26, 2020

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,689 290 Updated Aug 14, 2024

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,410 162 Updated Mar 20, 2025

openai / improved-gan

Code for the paper "Improved Techniques for Training GANs"

Python 2,331 623 Updated Nov 21, 2018

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 2,299 253 Updated Sep 3, 2025

namisan / mt-dnn

Multi-Task Deep Neural Networks for Natural Language Understanding

Python 2,257 415 Updated Mar 7, 2024

fatchord / WaveRNN

WaveRNN Vocoder + TTS

Python 2,173 697 Updated Jul 2, 2022

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 2,135 177 Updated Sep 3, 2025

openai / gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

Python 2,002 550 Updated Dec 13, 2023

loujie0822 / DeepIE

DeepIE: Deep Learning for Information Extraction

Python 1,945 351 Updated Dec 9, 2022

ml-jku / hopfield-layers

Hopfield Networks is All You Need

Python 1,868 213 Updated Apr 23, 2023

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,823 134 Updated Jan 17, 2025

ChineseGLUE / ChineseGLUE

Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard

Python 1,781 247 Updated Feb 18, 2023

geekinglcq / CDCS

Chinese Data Competitions' Solutions

Python 1,773 397 Updated Apr 5, 2019

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,737 290 Updated Aug 24, 2025

openai / sparse_attention

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,593 193 Updated Aug 12, 2020

waveform80 / picamera

A pure Python interface to the Raspberry Pi camera module

Python 1,580 352 Updated Dec 24, 2022

devnag / pytorch-generative-adversarial-networks

A very simple generative adversarial network (GAN) in PyTorch

Python 1,540 447 Updated Jun 30, 2021

huggingface / pytorch-openai-transformer-lm

🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI

Python 1,520 285 Updated Aug 9, 2021

sahil280114 / codealpaca

Python 1,495 113 Updated May 12, 2023

OpenNMT / OpenNMT-tf

Neural machine translation and sequence learning using TensorFlow

Python 1,484 383 Updated Oct 14, 2023

mlcommons / inference

Reference implementations of MLPerf® inference benchmarks

Python 1,480 588 Updated Nov 6, 2025

Previous Next

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhuang Liu zhuango

Achievements

Achievements

Block or report zhuango

Stars

deep-diver / LLM-As-Chatbot

hmmlearn / hmmlearn

EvolvingLMMs-Lab / lmms-eval

salesforce / CodeT5

yfeng95 / GAN

yangjianxin1 / GPT2-chitchat

IntelLabs / nlp-architect

NVIDIA / TransformerEngine