Skip to content
View Teddy12155555's full-sized avatar
๐ŸŒด
On vacation
๐ŸŒด
On vacation

Highlights

  • Pro

Block or report Teddy12155555

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning".

152 1 Updated Sep 12, 2025

Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"

Jupyter Notebook 195 12 Updated Jun 10, 2025

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 440 29 Updated Sep 18, 2025

[NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

Python 81 5 Updated Sep 20, 2025

Xiaomi Miloco

Python 1,956 128 Updated Dec 17, 2025

Official repository for VisionZip (CVPR 2025)

Python 392 16 Updated Jul 21, 2025

[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models

Python 37 1 Updated Dec 15, 2025

SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433

Python 114 20 Updated Dec 5, 2024

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'โ€™

Jupyter Notebook 2,286 103 Updated Oct 29, 2025

[CVPR2025 && NTIRE2025] HVI: A New Color Space for Low-light Image Enhancement (Official Implementation)

Python 699 71 Updated Oct 28, 2025

CycleResearcher: Improving Automated Research via Automated Review

Jupyter Notebook 313 25 Updated Jul 10, 2025

๐ŸŒ WorldGen - Generate Any 3D Scene in Seconds

Python 937 72 Updated Nov 11, 2025

Physics-Informed Neural networks for Advanced modeling

Python 685 92 Updated Dec 19, 2025
Python 68 11 Updated Dec 1, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,839 108 Updated Dec 8, 2025

LLM inference in C/C++

C++ 91,803 14,185 Updated Dec 22, 2025

๐Ÿธ๐Ÿ’ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,959 5,859 Updated Aug 16, 2024

A generative speech model for daily dialogue.

Python 38,378 4,166 Updated Dec 3, 2025

A large-scale dataset of music sheet images designed for VQA in music understanding.

Python 6 1 Updated Jul 13, 2025

repo for paper https://arxiv.org/abs/2504.13837

Python 303 17 Updated Dec 17, 2025
Python 105 6 Updated Jun 10, 2025

[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens

Python 59 6 Updated Oct 9, 2025
Python 15 1 Updated Oct 6, 2023

Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.

Jupyter Notebook 1,861 307 Updated Dec 15, 2025

[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Python 210 4 Updated Dec 16, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,935 12,121 Updated Dec 22, 2025

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 608 51 Updated Oct 29, 2025

Contexts Optical Compression

Python 21,527 1,926 Updated Oct 25, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,701 310 Updated Nov 28, 2025
JavaScript 10 1 Updated Dec 11, 2025
Next