Skip to content
View GUOZHIWEN's full-sized avatar

Block or report GUOZHIWEN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 2 1 Updated Dec 24, 2025

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

Jupyter Notebook 3,657 481 Updated Jul 15, 2024

The code and data of We-Math, accepted by ACL 2025 main conference.

Python 134 8 Updated Dec 11, 2025

Enjoy the magic of Diffusion models!

Python 11,234 1,065 Updated Dec 23, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,220 6,635 Updated Dec 25, 2025

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,503 90 Updated Jul 20, 2024

STAT 453: Intro to Deep Learning @ UW-Madison (Spring 2021)

Jupyter Notebook 531 316 Updated Feb 3, 2022

”数学不难“ 之 《线性代数不难》上下册,66话题完册;欢迎批评指正

Jupyter Notebook 1,281 176 Updated Sep 3, 2025

Book_4_《矩阵力量》 | 鸢尾花书:从加减乘除到机器学习;上架!

Jupyter Notebook 9,735 1,480 Updated Dec 10, 2025

Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)

Jupyter Notebook 168 13 Updated Aug 20, 2023

从零手搓Flow Matching(Rectified Flow)

Python 563 33 Updated Dec 10, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,443 3,354 Updated Dec 25, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,789 181 Updated Dec 20, 2025

Attention is all you need implementation

Jupyter Notebook 1,130 379 Updated Jun 8, 2024

[ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"

Python 372 33 Updated Feb 13, 2024

Accelerator on how to finetune Microsoft's Florance-2 model for a variety of computer vision use cases.

Jupyter Notebook 12 1 Updated May 6, 2025

Quick exploration into fine tuning florence 2

Jupyter Notebook 339 30 Updated Sep 19, 2024

Florence-2

Jupyter Notebook 72 14 Updated Feb 13, 2025
Python 20 8 Updated Jul 4, 2025

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,385 1,229 Updated Jul 30, 2024

Next-Token Prediction is All You Need

Python 2,271 91 Updated Nov 19, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,705 12,227 Updated Dec 21, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,966 2,086 Updated May 19, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,156 4,270 Updated Dec 24, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,789 616 Updated Dec 24, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 394 48 Updated Oct 4, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,063 1,098 Updated Dec 25, 2025

https://transformer-circuits.pub/2025/attribution-graphs/methods.html

JavaScript 90 21 Updated Mar 27, 2025
Jupyter Notebook 192 32 Updated Nov 17, 2024

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,401 1,457 Updated Nov 28, 2025
Next