Skip to content
View hzphzp's full-sized avatar
  • University of Science and Technology of China
  • University of Science and Technology of China
  • 08:33 (UTC -12:00)

Block or report hzphzp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
TypeScript 18,842 2,213 Updated May 18, 2026

[CVPR 2026] UnicEdit-10M and UnicBench project

Python 41 1 Updated Mar 3, 2026

Create mermaid diagrams in image format on-the-fly.

TypeScript 132 19 Updated Jan 17, 2026

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Python 398 30 Updated Aug 2, 2022

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,893 83 Updated Feb 25, 2026

Google Gen AI Python SDK provides an interface for developers to integrate Google's generative models into their Python applications.

Python 3,708 872 Updated May 19, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 113,582 13,300 Updated May 19, 2026

Windows inside a Docker container.

Shell 51,422 4,315 Updated May 19, 2026

Convert PDF to markdown + JSON quickly with high accuracy

Python 35,250 2,443 Updated May 5, 2026

Open-Source Frontier Voice AI

Python 47,296 5,270 Updated May 6, 2026

Zotero plugin to automatically move attachments and link them

JavaScript 1,347 30 Updated May 9, 2026

Find duplicate files

Python 7,572 508 Updated Jan 6, 2026

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 684 25 Updated Feb 27, 2026

Some tools to help move my notes from LogSeq to Obsidian

Python 225 37 Updated Jul 2, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 2,276 159 Updated May 7, 2026

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 391 24 Updated Jul 8, 2025

Open-source unified multimodal model

Python 5,936 526 Updated May 4, 2026

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

Python 270 16 Updated Apr 15, 2025

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!

Python 1,850 347 Updated Mar 24, 2026

DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)

Python 601 32 Updated Nov 24, 2025

🧡 Everything is RSSible

TypeScript 44,129 9,798 Updated May 19, 2026

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Python 407 20 Updated Dec 6, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,518 411 Updated Nov 11, 2025

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

Python 324 11 Updated Sep 24, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 28,812 2,710 Updated May 19, 2026

Production-ready platform for agentic workflow development.

TypeScript 141,908 22,305 Updated May 19, 2026

[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'

Python 209 6 Updated Jul 17, 2025

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,551 62 Updated May 11, 2026

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,942 204 Updated May 21, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,322 362 Updated Dec 4, 2025
Next