Skip to content
View ChunyuanLI's full-sized avatar

Block or report ChunyuanLI

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
80 stars written in Python
Clear filter

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,813 6,559 Updated Nov 11, 2025

TensorFlow code and pre-trained models for BERT

Python 39,745 9,708 Updated Jul 23, 2024

The official Python library for the OpenAI API

Python 29,506 4,471 Updated Dec 17, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,472 5,836 Updated Aug 14, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,171 2,681 Updated Aug 12, 2024

Graph Neural Network Library for PyTorch

Python 23,280 3,937 Updated Dec 18, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,885 2,678 Updated Dec 15, 2025

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,235 3,673 Updated Jul 4, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,216 2,044 Updated Oct 21, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,977 1,996 Updated Dec 17, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,544 2,203 Updated Jul 24, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 14,468 4,964 Updated Aug 9, 2024

PyTorch package for the discrete VAE used for DALL·E.

Python 10,867 1,903 Updated Jan 31, 2024

Code for the paper "Jukebox: A Generative Model for Music"

Python 8,036 1,457 Updated Jun 19, 2024

Uniform Manifold Approximation and Projection

Python 8,034 855 Updated Dec 12, 2025

Repo for external large-scale work

Python 6,547 723 Updated Apr 27, 2024

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,179 1,164 Updated May 28, 2023

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,755 454 Updated Aug 19, 2024
Python 4,460 432 Updated Sep 14, 2025

Sequence modeling benchmarks and temporal convolutional networks

Python 4,446 902 Updated Mar 28, 2022

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,242 365 Updated Oct 19, 2025

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,283 209 Updated Mar 5, 2024

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods

Python 2,414 404 Updated Oct 16, 2023

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

Python 2,367 349 Updated Mar 23, 2024

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,261 475 Updated Aug 7, 2024

Multi-Task Deep Neural Networks for Natural Language Understanding

Python 2,258 413 Updated Mar 7, 2024

Open-Set Grounded Text-to-Image Generation

Python 2,184 164 Updated Mar 6, 2024

PyTorch implementation of SwAV https//arxiv.org/abs/2006.09882

Python 2,086 286 Updated Apr 13, 2023

LSTM and QRNN Language Model Toolkit for PyTorch

Python 1,984 488 Updated Feb 12, 2022

Train AI models efficiently on medical images using any framework

Python 1,875 302 Updated Jun 13, 2024
Next