Skip to content
View TagoreZhao's full-sized avatar
🥲
Focusing
🥲
Focusing

Highlights

  • Pro

Block or report TagoreZhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
21 stars written in Python
Clear filter

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 94,717 25,792 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,946 3,921 Updated Nov 6, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,613 4,613 Updated Nov 6, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 29,013 3,034 Updated Nov 5, 2025

Fully open reproduction of DeepSeek-R1

Python 25,614 2,401 Updated Sep 8, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,265 1,758 Updated Oct 13, 2025

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 3,506 351 Updated Nov 5, 2025

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 846 114 Updated Aug 20, 2024

A simple and effective LLM pruning approach.

Python 815 118 Updated Aug 9, 2024

Implementation related to the Deep Complex Networks

Python 768 284 Updated Jan 16, 2019

Quantitative Finance book

Python 757 199 Updated Apr 14, 2025

For releasing code related to compression methods for transformers, accompanying our publications

Python 447 52 Updated Jan 16, 2025

Source code to our paper: "Learning a Variational Network for Reconstruction of Accelerated MRI Data"

Python 151 48 Updated Jan 27, 2021
Python 60 7 Updated Dec 15, 2024

Implementation of Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer in PyTorch.

Python 52 7 Updated Nov 8, 2023

code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"

Python 29 2 Updated Oct 31, 2020

A numerical library for High-Dimensional option Pricing problems, including Fourier transform methods, Monte Carlo methods and the Deep Galerkin method

Python 29 6 Updated May 22, 2020

My implementation of the gMLP model from the paper "Pay Attention to MLPs".

Python 24 4 Updated May 25, 2021

ReCoDe project to showcase an implementation of the Euler-Maruyama numerical method to solve Stochastic Differential Equations

Python 13 1 Updated Jul 27, 2023