Skip to content
View ZiangWu-77's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Peking University
  • Shenzhen
  • 06:09 (UTC +08:00)

Block or report ZiangWu-77

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
36 stars written in Jupyter Notebook
Clear filter

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 84,553 12,782 Updated Jan 29, 2026

A latent text-to-image diffusion model

Jupyter Notebook 72,335 10,580 Updated Jun 18, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 20,161 2,872 Updated Aug 18, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,455 2,338 Updated Dec 25, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 18,343 2,122 Updated Oct 10, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,141 1,576 Updated Jan 30, 2026

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,235 2,327 Updated Sep 3, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 16,159 1,548 Updated Jan 19, 2025

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 16,088 2,290 Updated Jul 6, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,242 1,288 Updated May 23, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,361 1,163 Updated Dec 22, 2025
Jupyter Notebook 12,272 1,437 Updated Jan 30, 2026

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,936 923 Updated Sep 1, 2024

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,519 533 Updated Oct 8, 2025

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,846 491 Updated Nov 27, 2024

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,974 819 Updated Dec 22, 2025

Material for gpu-mode lectures

Jupyter Notebook 5,682 570 Updated Feb 1, 2026

🎓 无需编写任何代码即可轻松创建漂亮的学术网站 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.

Jupyter Notebook 4,809 6,484 Updated Feb 1, 2026

Acceptance rates for the major AI conferences

Jupyter Notebook 4,729 316 Updated Sep 23, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,293 361 Updated Nov 27, 2025

A simplified implemention of Faster R-CNN that replicate performance from origin paper

Jupyter Notebook 4,034 1,125 Updated May 15, 2021

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,913 316 Updated Jun 12, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,343 207 Updated May 19, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,257 375 Updated Nov 11, 2025

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,963 126 Updated Dec 4, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,538 60 Updated Jun 14, 2025

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,150 203 Updated Feb 4, 2026

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,106 63 Updated Mar 20, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,102 81 Updated Aug 14, 2025

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 585 66 Updated Jul 11, 2024
Next