Stars
🔥 [AAAI 2026, Official Code] First work on aesthetics assessment of image color temperature.
🔥 [AAAI 2026, Official Code] Regression Over Classification: Assessing Image Aesthetics via Multimodal Large Language Models. Addresses large models' insensitivity to scores during aesthetics assessment.
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Qwen-Image-Lightning: Speed up Qwen-Image model with distillation
GoatWu / Self-Forcing-Plus
Forked from guandeh17/Self-Forcing. Unofficial extension of Self-Forcing that supports I2V and 14B training.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025)
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
Rectified Flow Inversion (RF-Inversion) - ICLR 2025
[CVPR2025] RORem: Training a Robust Object Remover with Human-in-the-Loop
A simple tool to make SVG paths more smooth. Customizable tolerance and download the result.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Chrome extension to download images with one click, saving time on image dataset creation.
ComfyUI: 163 nodes: Display, manipulate, and edit text, images, videos, LoRAs and more. Manage looping operations, generate randomized content, use logical conditions and work with external AI to…
All my self-trained & released AI upscaling models. After gathering and applying over 600 different upscaling models, I learned how to train my own models, and these are the results.
FlashMLA: Efficient Multi-head Latent Attention Kernels
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)
Minimal reproduction of DeepSeek R1-Zero
Official repository of In-Context LoRA for Diffusion Transformers
A study aiming to transfer a single concept using the DiT model's self-attention capability.
Unofficial custom_node for AnyText v1.1: https://github.com/tyxsspa/AnyText and AnyText v2.0: https://github.com/tyxsspa/AnyText2 and Glyph-ByT5: https://github.com/AIGText/Glyph-ByT5 (Test failed …
An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation