Skip to content
View hanggun's full-sized avatar

Block or report hanggun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal Differentiable Image Reward Functions

Python 110 2 Updated Aug 19, 2025

🔥[AAAI 2026, Official Code] First work of Aesthetics Assessment of Image Color Temperature. 首篇针对色温美学评估的工作

Python 15 Updated Mar 21, 2026

🔥[AAAI 2026, Official Code] Regression Over Classification: Assessing Image Aesthetics via Multimodal Large Language Models. 克服大模型在美学评估过程中对分数不敏感的问题

Python 26 2 Updated Mar 21, 2026

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,264 42 Updated Feb 24, 2026

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,655 90 Updated Oct 29, 2025

Making Flux go brrr on GPUs.

Python 166 17 Updated Jan 5, 2026

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,275 44 Updated Jan 1, 2026

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 361 23 Updated Sep 29, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,632 465 Updated Feb 10, 2026

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)

Python 1,724 126 Updated Jul 25, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 18,041 3,062 Updated Mar 26, 2026

JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.

Jupyter Notebook 1,114 68 Updated Feb 24, 2026

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

Python 472 19 Updated Mar 19, 2025

[CVPR2025] RORem: Training a Robust Object Remover with Human-in-the-Loop

Python 65 3 Updated Sep 9, 2025

A simple tool to make SVG paths more smooth. Customizable tolerance and download the result.

JavaScript 3 2 Updated Jan 29, 2024

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 73,185 10,042 Updated Mar 26, 2026

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

Python 2,245 487 Updated Mar 11, 2024

Chrome extension to download images with one click, saving time on image dataset creation.

JavaScript 9 2 Updated Jul 13, 2025

ComfyUI : 163 nodes : Display, manipulate, and edit text, images, videos, loras and more. Manage looping operations, generate randomized content, use logical conditions and work with external AI to…

Python 522 56 Updated Jun 11, 2025

All my self trained & released AI upscaling models. After gathering and applying over 600 different upscaling models, I learned how to train my own models, and these are the results.

Python 575 37 Updated Nov 14, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,540 1,006 Updated Feb 6, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 69,152 8,433 Updated Mar 27, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,389 1,303 Updated Mar 27, 2026

Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)

Jupyter Notebook 213 9 Updated May 9, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,992 1,585 Updated Feb 27, 2026

Official repository of In-Context LoRA for Diffusion Transformers

2,066 94 Updated Dec 20, 2024

This is a study aim to transfer the single concept by using DIT model self-attention capablity

Python 787 36 Updated Nov 20, 2024

Unofficial custom_node for AnyText v1.1: https://github.com/tyxsspa/AnyText and AnyText v2.0: https://github.com/tyxsspa/AnyText2 and Glyph-ByT5: https://github.com/AIGText/Glyph-ByT5 (Test failed …

Python 99 14 Updated May 28, 2025

An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation

Python 16 2 Updated Oct 27, 2024
Next