LiYinqi

Yinqi Li LiYinqi

Last-year PhD student at Institute of Computing Technology, Chinese Academy of Sciences (ICT, CAS)

13 followers · 14 following

06:25 (UTC +08:00)

Achievements

Highlights

Lists (12)

Sort

Stars

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

2,503 110 Updated Apr 8, 2026

Paranioar / Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Ins…

443 49 Updated Sep 25, 2025

AIDC-AI / Awesome-Unified-Multimodal-Models

Awesome Unified Multimodal Models

1,181 38 Updated Mar 24, 2026

dw-dengwei / daily-arXiv-ai-enhanced

Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.

JavaScript 2,548 914 Updated Apr 9, 2026

zhaochen0110 / Awesome_Think_With_Images

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,414 42 Updated Mar 9, 2026

AMAP-ML / FE2E

[CVPR 2026] Beyond Generation: Advancing Image Editing Priors for Depth and Normal Estimation

Python 223 8 Updated Mar 31, 2026

bytedance / SuperEdit

[ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing

Python 164 9 Updated Jun 26, 2025

River-Zhang / ICEdit

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,090 115 Updated Dec 19, 2025

peteole / lm-writing-tool

VSCode extension that grammar-checks texts through a local LLM

TypeScript 26 6 Updated Oct 30, 2025

HVision-NKU / Cascade-CLIP

Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Python 57 3 Updated Aug 15, 2024

baaivision / DIVA

[ICLR 2025] Diffusion Feedback Helps CLIP See Better

Python 301 14 Updated Jan 23, 2025

LiYinqi / un2CLIP

[NeurIPS'25] A work to improve CLIP's visual detail capturing ability by inverting the unCLIP generative model.

Python 23 Updated Mar 19, 2026

JackYFL / WiCo

The official implementation of CVPR Workshop 2025 paper: Window Token Concatenation for Efficient Visual Large Language Models.

Python 10 Updated Apr 10, 2025

mashijie1028 / GenHancer

(ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.

Python 77 4 Updated Jun 25, 2025

JackYFL / awesome-VLLMs

This repository collects papers on VLLM applications. We will update new papers irregularly.

212 16 Updated Feb 23, 2026

rom1504 / img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,403 375 Updated Oct 19, 2025

diffusion-classifier / diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Python 486 45 Updated Feb 28, 2024

cvg / NoPoSplat

[ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Python 945 50 Updated Feb 25, 2026

mohuangrui / ucasthesis

LaTeX Thesis Template for the University of Chinese Academy of Sciences

TeX 3,812 943 Updated Feb 29, 2024

Stability-AI / sd3.5

Python 1,482 146 Updated Jan 8, 2025

Darkbblue / generic-diffusion-feature

Official implementation of NeurIPS'24 paper Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features

Python 38 5 Updated May 28, 2025

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 7,549 1,355 Updated Feb 11, 2026

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,393 95 Updated Jan 12, 2026

facebookresearch / fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Python 2,233 237 Updated Mar 15, 2026

facebookresearch / sapiens

High-resolution models for human tasks.

Python 5,319 316 Updated Nov 18, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,899 2,421 Updated Apr 7, 2026

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,891 120 Updated Feb 20, 2026

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,992 137 Updated Nov 7, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,140 67 Updated Mar 20, 2025

TencentARC / SEED-Voken

SEED-Voken: A Series of Powerful Visual Tokenizers

Python 1,003 44 Updated Nov 25, 2025

Yinqi Li LiYinqi

Highlights

Lists (12)

Det & Seg

Diffusion

GAN Inversion

GANs

Img-to-Img

Neural Arch

Pretrained Models

Robustness & Generalization

SelfSL

Shape-Texture

Text-and-Img

Tools

Stars