Songsong Yu song2yu

SJTU

9 followers · 10 following

Tencent
Shanghai
https://song2yu.github.io/

Stars

wusize / OpenUni

Python 167 8 Updated Jun 27, 2025

facebookresearch / metaquery

Official implementation of the paper "Transfer between Modalities with MetaQueries"

Python 278 8 Updated Oct 12, 2025

song2yu / song2yu.github.io

Forked from CaiJimmy/hugo-theme-stack-starter

Academic Websites.

HTML 2 Updated Dec 7, 2025

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,165 713 Updated Dec 11, 2025

LTH14 / JiT

PyTorch implementation of JiT (https://arxiv.org/abs/2511.13720)

Python 1,816 107 Updated Dec 8, 2025

baaivision / Emu3.5

Native Multimodal Models are World Learners

Python 1,367 51 Updated Nov 28, 2025

Pointcept / Concerto

[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Python 449 20 Updated Nov 29, 2025

JiuhaiChen / BLIP3o

Official implementation of BLIP3o-Series

Python 1,610 72 Updated Nov 29, 2025

AvaLovelace1 / BrickGPT

Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.

Python 1,548 94 Updated Nov 9, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,492 1,921 Updated Oct 25, 2025

OpenThinkIMG / OpenThinkIMG

OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images.

Jupyter Notebook 104 6 Updated Jul 11, 2025

inclusionAI / Ming

Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.

Jupyter Notebook 558 46 Updated Oct 30, 2025

WayneJin0918 / SRUM

Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-effective, self-iterative optimization loop.

Python 88 6 Updated Nov 26, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,641 53 Updated Nov 15, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,478 481 Updated Oct 27, 2025

HorizonWind2004 / reconstruction-alignment

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 335 11 Updated Dec 16, 2025