-
Peking University
- Lyoko
-
17:18
(UTC) - linkedin.com/in/keyu-tian/?locale=en_US
- https://orcid.org/0000-0001-5909-2091
- @keyutian
Highlights
- Pro
Stars
Python tool for converting files and office documents to Markdown.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model.
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
A linear estimator on top of clip to predict the aesthetic quality of pictures
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
[Ranked No. 1🥇] My solution for the course project of Datastructure 2019'Spring @ BUAA (北航数据结构). Plenty of C language tricks, hacks, and optimizations are used for extreme efficiency. *Ranked 1/800…
[Ranked No. 1🥇] My solution for the course project of Parallel Computing 2021'Spring @ BUAA (北航并行程序设计). Plenty of C++ tricks, hacks, and optimizations are used for extreme efficiency. Ranked *1/100…
High-fidelity performance metrics for generative models in PyTorch
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Official implementation of SEED-LLaMA (ICLR 2024).
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
[T-PAMI'25] PyTorch Implementation of GDRNPP, winner (most of the awards) of the BOP Challenge 2022 at ECCV'22
Peking University - Optimization Method - Gao, Li - Spring, 2018
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
EVA Series: Visual Representation Fantasies from BAAI