GUOZHIWEN

GUOZHIWEN

Stars

Peyton-Chen / diffusers

Forked from huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 2 1 Updated Dec 24, 2025

zyds / transformers-code

手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube

Jupyter Notebook 3,657 481 Updated Jul 15, 2024

We-Math / We-Math

The code and data of We-Math, accepted by ACL 2025 main conference.

Python 134 8 Updated Dec 11, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 11,234 1,065 Updated Dec 23, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,220 6,635 Updated Dec 25, 2025

gnobitab / RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,503 90 Updated Jul 20, 2024

rasbt / stat453-deep-learning-ss21

STAT 453: Intro to Deep Learning @ UW-Madison (Spring 2021)

Jupyter Notebook 531 316 Updated Feb 3, 2022

Visualize-ML / Linear-Algebra-Made-Easy---Learn-with-Python-and-Visualization

”数学不难“ 之《线性代数不难》上下册，66话题完册；欢迎批评指正

Jupyter Notebook 1,281 176 Updated Sep 3, 2025

Visualize-ML / Book4_Power-of-Matrix

Book_4_《矩阵力量》 | 鸢尾花书：从加减乘除到机器学习；上架！

Jupyter Notebook 9,735 1,480 Updated Dec 10, 2025

bfshi / AbSViT

Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)

Jupyter Notebook 168 13 Updated Aug 20, 2023

TongTong313 / rectified-flow

从零手搓Flow Matching（Rectified Flow）

Python 563 33 Updated Dec 10, 2025

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 20,443 3,354 Updated Dec 25, 2025

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,789 181 Updated Dec 20, 2025

hkproj / pytorch-transformer

Attention is all you need implementation

Jupyter Notebook 1,130 379 Updated Jun 8, 2024

dingmyu / davit

[ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"

Python 372 33 Updated Feb 13, 2024

microsoft / dstoolkit-finetuning-florence-2

Accelerator on how to finetune Microsoft's Florance-2 model for a variety of computer vision use cases.

Jupyter Notebook 12 1 Updated May 6, 2025

andimarafioti / florence2-finetuning

Quick exploration into fine tuning florence 2

Jupyter Notebook 339 30 Updated Sep 19, 2024

retkowsky / florence-2

Florence-2

Jupyter Notebook 72 14 Updated Feb 13, 2025

ytdeepia / DDPM

Python 20 8 Updated Jul 4, 2025

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 6,385 1,229 Updated Jul 30, 2024

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,271 91 Updated Nov 19, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,705 12,227 Updated Dec 21, 2025

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

21,966 2,086 Updated May 19, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,156 4,270 Updated Dec 24, 2025

jingyaogong / minimind-v

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,789 616 Updated Dec 24, 2025

ElliottYan / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 394 48 Updated Oct 4, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,063 1,098 Updated Dec 25, 2025

anthropics / attribution-graphs-frontend

https://transformer-circuits.pub/2025/attribution-graphs/methods.html

JavaScript 90 21 Updated Mar 27, 2025

jacobdunefsky / transcoder_circuits

Jupyter Notebook 192 32 Updated Nov 17, 2024

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,401 1,457 Updated Nov 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly