keyu-tian

🧐

Focusing

Keyu Tian keyu-tian

🧐

Focusing

Incoming Ph.D. student. Self-supervised learning & generative models & reinforcement learning.

994 followers · 4 following

Achievements

Highlights

Stars

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 123,518 8,359 Updated Apr 20, 2026

z-x-yang / Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 3,123 357 Updated Mar 13, 2026

pipilurj / bootstrapped-preference-optimization-BPO

code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"

Python 63 1 Updated Aug 23, 2024

FoundationVision / OmniTokenizer

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 323 8 Updated Jul 9, 2024

Shenzhi-Wang / Llama3-Chinese-Chat

This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.

318 21 Updated May 6, 2024

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 587 45 Updated Jun 7, 2024

FoundationVision / vaex

🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

Python 107 8 Updated Jun 23, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,683 566 Updated Nov 10, 2025

FoundationVision / GenerateU

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Python 197 8 Updated Mar 29, 2025

Haiyang-W / GiT

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Python 362 15 Updated Jan 14, 2025

OpenGVLab / MM-Interleaved

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Python 253 12 Updated Apr 3, 2024

LAION-AI / aesthetic-predictor

A linear estimator on top of clip to predict the aesthetic quality of pictures

Jupyter Notebook 697 28 Updated Aug 15, 2022

FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,173 76 Updated Oct 21, 2024

linexjlin / GPTs

leaked prompts of GPTs

31,990 4,393 Updated Sep 27, 2024

keyu-tian / BUAA-datastructure-project-solution

[Ranked No. 1🥇] My solution for the course project of Datastructure 2019'Spring @ BUAA (北航数据结构). Plenty of C language tricks, hacks, and optimizations are used for extreme efficiency. *Ranked 1/800…

C 10 Updated Jan 25, 2024

keyu-tian / BUAA-parallel-computing-project-solution

[Ranked No. 1🥇] My solution for the course project of Parallel Computing 2021'Spring @ BUAA (北航并行程序设计). Plenty of C++ tricks, hacks, and optimizations are used for extreme efficiency. Ranked *1/100…

C++ 6 Updated Nov 14, 2023