Stars
Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"
Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection" (AAAI 2022 Oral)
吴恩达《ChatGPT Prompt Engineering for Developers》课程中英版
Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"
Official implementation of HYPIR: Harnessing Diffusion-Yielded Score Priors for Image Restoration
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
This is official implementtaion of "VmambaIR: Visual State Space Model for Image Restoration"
Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Enjoy the magic of Diffusion models!
The ultimate training toolkit for finetuning diffusion models
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
Official repository of In-Context LoRA for Diffusion Transformers
A high-throughput and memory-efficient inference and serving engine for LLMs
Foundational Models for State-of-the-Art Speech and Text Translation
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
A Gradio interface for the Hunyuan-MT-7B translation model with a public share link enabled.
NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. The NiuTrans system is fully develope…
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Official PyTorch implementation of SegFormer
A latent text-to-image diffusion model