ZiangWu-77

🎯

Focusing

Ziang Wu ZiangWu-77

🎯

Focusing

18 followers · 222 following

Peking University
Shenzhen
06:09 (UTC +08:00)

Achievements

Lists (9)

Sort

Stars

36 stars written in Jupyter Notebook

Clear filter

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 84,553 12,782 Updated Jan 29, 2026

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 72,335 10,580 Updated Jun 18, 2024

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook 20,161 2,872 Updated Aug 18, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,455 2,338 Updated Dec 25, 2024

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 18,343 2,122 Updated Oct 10, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,141 1,576 Updated Jan 30, 2026

Infrasys-AI / AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,235 2,327 Updated Sep 3, 2025

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 16,159 1,548 Updated Jan 19, 2025

UFund-Me / Qbot

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 16,088 2,290 Updated Jul 6, 2025

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,242 1,288 Updated May 23, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,361 1,163 Updated Dec 22, 2025

google-research / vision_transformer

Jupyter Notebook 12,272 1,437 Updated Jan 30, 2026

srush / GPU-Puzzles

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,936 923 Updated Sep 1, 2024

OpenBMB / MiniCPM

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,519 533 Updated Oct 8, 2025

01-ai / Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,846 491 Updated Nov 27, 2024

Infrasys-AI / AIInfra

AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,974 819 Updated Dec 22, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 5,682 570 Updated Feb 1, 2026

HugoBlox / hugo-theme-academic-cv

🎓 无需编写任何代码即可轻松创建漂亮的学术网站 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.

Jupyter Notebook 4,809 6,484 Updated Feb 1, 2026

lixin4ever / Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Jupyter Notebook 4,729 316 Updated Sep 23, 2025

Tencent-Hunyuan / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,293 361 Updated Nov 27, 2025

chenyuntc / simple-faster-rcnn-pytorch

A simplified implemention of Faster R-CNN that replicate performance from origin paper

Jupyter Notebook 4,034 1,125 Updated May 15, 2021

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,913 316 Updated Jun 12, 2025

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,343 207 Updated May 19, 2025

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,257 375 Updated Nov 11, 2025

siliconflow / onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,963 126 Updated Dec 4, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,538 60 Updated Jun 14, 2025

NVIDIA / accelerated-computing-hub

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,150 203 Updated Feb 4, 2026

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,106 63 Updated Mar 20, 2025

DAMO-NLP-SG / VideoLLaMA3

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,102 81 Updated Aug 14, 2025

charent / Phi2-mini-Chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Jupyter Notebook 585 66 Updated Jul 11, 2024

Ziang Wu ZiangWu-77

Lists (9)

🔨AI infra

📍Amazing Tools

🚀Efficient llm

🚀Efficient video or image gen

🍓great course

🚏Interesting Work

🔥MoE

🚀 My stack

瓜

Stars