🏠
Working from home
  • Renmin University of China
  • Beijing

Highlights

  • Pro

Organizations

@ML-GSAI


Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,333 41 Updated Feb 3, 2026

Elevate your AI research writing, no more tedious polishing ✨

6,218 481 Updated Feb 11, 2026

[ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Python 121 2 Updated Feb 15, 2026

GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.

Python 785 46 Updated Feb 2, 2026

EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)

Python 26 Updated Jan 13, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,334 231 Updated Feb 16, 2026
Python 147 5 Updated Jan 20, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,769 66 Updated Jan 20, 2026

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 421 41 Updated Feb 11, 2026

dLLM: Simple Diffusion Language Modeling

Python 1,733 171 Updated Feb 17, 2026

Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"

Python 404 50 Updated Jan 26, 2026

Simple MoE - Day 17 of 365 Days of Repos

Python 16 1 Updated Jan 17, 2025

SDAR (Synergy of Diffusion and AutoRegression), a large diffusion language model (1.7B, 4B, 8B, 30B)

Python 332 17 Updated Dec 15, 2025

Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT

Python 164 5 Updated Oct 21, 2025

[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".

Python 61 3 Updated Feb 6, 2026

Awesome Unified Multimodal Models

1,110 36 Updated Feb 6, 2026

Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference

Python 242 16 Updated Feb 3, 2026

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,231 44 Updated Jan 1, 2026
Python 170 8 Updated Dec 22, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,654 148 Updated Feb 18, 2026

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,686 223 Updated Feb 14, 2026

Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.

Python 152 21 Updated Feb 7, 2025

Witness the aha moment of VLM with less than $3.

Python 4,033 285 Updated May 19, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 970 92 Updated Sep 23, 2025

Open-source unified multimodal model

Python 5,675 502 Updated Oct 27, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 94,783 11,236 Updated Feb 18, 2026
Python 717 19 Updated Feb 5, 2026

[NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)

Python 520 36 Updated Sep 27, 2025

OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871

Jupyter Notebook 4,025 17 Updated Dec 2, 2025

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 348 3 Updated Dec 11, 2025