
[ICML 2026] Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"

Python 153 7 Updated May 1, 2026

[ICML 2026] Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning

Python 118 7 Updated Jun 14, 2026

📚 "Building Agents from Scratch" (《从零开始构建智能体》): a from-scratch tutorial on agent principles and practice

Python 60,676 7,480 Updated Jun 18, 2026

Using FLUX.1 Kontext for creating segmentation masks for objects absent from images, enabling workflows in inpainting and virtual try-ons.

6 Updated Sep 26, 2025

The ultimate training toolkit for finetuning diffusion models

Python 10,949 1,367 Updated Jun 21, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,712 879 Updated Jun 15, 2026

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

2,098 172 Updated Nov 17, 2025

Clean, scalable, and easy-to-use ResNet implementation in PyTorch

Jupyter Notebook 218 46 Updated Feb 21, 2020

Ultralytics YOLO 🚀

Python 58,632 11,231 Updated Jun 21, 2026

Implementation of the paper "YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information"

Python 9,533 1,624 Updated Aug 9, 2024

Python 21 6 Updated Nov 3, 2025

Using low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,539 493 Updated Mar 22, 2024

Official code repository for the paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

Python 957 87 Updated Aug 3, 2022

PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs

Python 1 Updated Jun 23, 2025

GLIDE: a diffusion-based text-conditional image synthesis model

Jupyter Notebook 1 Updated Jun 21, 2025

Stable Diffusion implemented from scratch in PyTorch

Jupyter Notebook 1 Updated May 31, 2025

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw

Python 1 Updated Dec 6, 2024