Stars
[NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning".
Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"
[NeurIPS 2025] Efficient Reasoning Vision Language Models
[NeurIPS 2025] Official code for the paper "Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs".
Official repository for VisionZip (CVPR 2025)
[AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models https://arxiv.org/pdf/2411.02433
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
[CVPR 2025 & NTIRE 2025] HVI: A New Color Space for Low-light Image Enhancement (Official Implementation)
CycleResearcher: Improving Automated Research via Automated Review
WorldGen - Generate Any 3D Scene in Seconds
Physics-Informed Neural networks for Advanced modeling
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
A deep learning toolkit for Text-to-Speech, battle-tested in research and production
A generative speech model for daily dialogue.
A large-scale dataset of music sheet images designed for VQA in music understanding.
Repository for the paper https://arxiv.org/abs/2504.13837
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.
[CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
A high-throughput and memory-efficient inference and serving engine for LLMs
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.