[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s Multimodal Intelligence team.

Python 684 25 Updated Feb 27, 2026

NishantTharani / LogSeqToObsidian

Some tools to help move my notes from LogSeq to Obsidian

Python 225 37 Updated Jul 2, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 2,276 159 Updated May 7, 2026

NVIDIA / logits-processor-zoo

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 391 24 Updated Jul 8, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,936 526 Updated May 4, 2026

VARGPT-family / VARGPT-v1.1

VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning

Python 270 16 Updated Apr 15, 2025

scholarly-python-package / scholarly

Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!

Python 1,850 347 Updated Mar 24, 2026

ssundaram21 / dreamsim

DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data (NeurIPS 2023 Spotlight) / / / / When Does Perceptual Alignment Benefit Vision Representations? (NeurIPS 2024)

Python 601 32 Updated Nov 24, 2025

DIYgod / RSSHub

🧡 Everything is RSSible

TypeScript 44,129 9,798 Updated May 19, 2026

showlab / VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Python 407 20 Updated Dec 6, 2024

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,518 411 Updated Nov 11, 2025

zibojia / COCOCO

Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.

Python 324 11 Updated Sep 24, 2024

Genesis-Embodied-AI / genesis-world

A generative world for general-purpose robotics & embodied AI learning.

Python 28,812 2,710 Updated May 19, 2026

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript 141,908 22,305 Updated May 19, 2026

mrwu-mac / ControlMLLM

[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'

Python 209 6 Updated Jul 17, 2025

overleaf-workshop / Overleaf-Workshop

Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.

TypeScript 1,551 62 Updated May 11, 2026

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,942 204 Updated May 21, 2025

VectorSpaceLab / OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,322 362 Updated Dec 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhipeng Huang hzphzp

Achievements

Achievements

Block or report hzphzp

Stars

JimLiu / baoyu-skills

WeChatCV / UnicBench

benjypng / logseq-mermaid-plugin

facebookresearch / sscd-copy-detection

bytetriper / RAE

googleapis / python-genai

Comfy-Org / ComfyUI

dockur / windows

datalab-to / marker

microsoft / VibeVoice

wileyyugioh / zotmoov

arsenetar / dupeguru

stepfun-ai / NextStep-1