Skip to content
View astonzhang's full-sized avatar

Organizations

@apache @dmlc @d2l-ai

Block or report astonzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
100 results for source starred repositories
Clear filter

Utilities intended for use with Llama models.

Python 7,385 1,297 Updated Dec 16, 2025

The official Meta Llama 3 GitHub site

Python 29,133 3,500 Updated Jan 26, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,109 227 Updated Aug 17, 2024

Official inference library for Mistral models

Jupyter Notebook 10,599 1,000 Updated Nov 21, 2025

Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)

Python 256 19 Updated Jul 16, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,799 1,342 Updated Oct 6, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,081 122 Updated Jun 1, 2023

Code Example for Learning Multimodal Data Augmentation in Feature Space

Python 43 2 Updated Mar 11, 2023

Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"

Python 259 9 Updated May 3, 2024

Source code for the X Recommendation Algorithm

Scala 67,968 12,645 Updated Sep 8, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,175 8,564 Updated Nov 12, 2025

A modular RL library to fine-tune language models to human preferences

Python 2,376 203 Updated Mar 1, 2024

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,984 334 Updated Jun 12, 2024

Official implementation for "Parameter-Efficient Fine-Tuning Design Spaces"

Python 27 Updated Jan 4, 2023

Paper List for In-context Learning 🌷

871 63 Updated Oct 8, 2024

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,870 681 Updated Oct 11, 2025

This project is deprecated. Check my new project ChatHub:

TypeScript 13,179 1,484 Updated Aug 14, 2024

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Jupyter Notebook 1,984 181 Updated Mar 13, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,021 4,667 Updated Dec 17, 2025

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 74,444 12,028 Updated Jul 30, 2024

Course repository for the Spring 2022 COMP790 course "Deep Learning" at UNC

19 Updated Apr 13, 2022

Course repository for the Spring COMP790 course "Deep Learning" at UNC

23 1 Updated Feb 2, 2022

[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

Python 130 17 Updated Dec 7, 2022

Robustness Gym is an evaluation toolkit for machine learning.

Python 443 36 Updated Jun 28, 2022

Hypercomplex Neural Networks with PyTorch

Python 55 9 Updated Dec 14, 2022

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,003 3,862 Updated Jul 23, 2024

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Python 1,015 105 Updated Sep 29, 2022

Grounded Language-Image Pre-training

Python 2,558 213 Updated Jan 24, 2024
Jupyter Notebook 19 16 Updated Apr 24, 2022
Next