Skip to content
View daizuozhuo's full-sized avatar

Block or report daizuozhuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

科技爱好者周刊,每周五发布

78,333 3,692 Updated Oct 31, 2025

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,297 78 Updated Jun 16, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,714 175 Updated May 20, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,235 1,119 Updated Aug 27, 2025

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,298 85 Updated Oct 16, 2025

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,858 91 Updated Oct 31, 2024

UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing

Python 112 4 Updated Apr 16, 2025

Simple script to parallelize download and extract files for SA-1B Dataset.

Python 37 4 Updated Jul 2, 2025

VideoSys: An easy and efficient system for video generation

Python 2,005 132 Updated Aug 27, 2025

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,000 1,079 Updated Nov 18, 2024

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 949 76 Updated Oct 18, 2024

Fine-Grained Open Domain Image Animation with Motion Guidance

9 Updated Dec 8, 2023

Stable diffusion webui based on diffusers.

Python 973 68 Updated Sep 29, 2023

cvpr2024/cvpr2023/cvpr2022/cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集,极市团队整理

12,509 2,267 Updated Apr 25, 2024

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Python 312 29 Updated Dec 28, 2023

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 4,294 462 Updated Oct 5, 2025

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 690 111 Updated Dec 14, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,976 8,203 Updated Dec 9, 2024
Python 668 86 Updated Nov 1, 2024

Aligning pretrained language models with instruction data generated by themselves.

Python 4,516 521 Updated Mar 27, 2023

ChatGPT and Bing AI prompt curation

894 82 Updated Jun 11, 2025

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,681 607 Updated Jul 25, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,394 11,321 Updated Sep 8, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,341 1,352 Updated Oct 1, 2025

Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))

Python 92 8 Updated Jun 12, 2023

SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

C# 3,325 308 Updated Nov 5, 2025

Grounded Language-Image Pre-training

Python 2,528 214 Updated Jan 24, 2024

METER: A Multimodal End-to-end TransformER Framework

Python 373 33 Updated Nov 16, 2022

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,598 940 Updated Apr 24, 2025
Next