- UNC Chapel Hill
- Chapel Hill, NC
- https://daeunni.github.io/
- https://daeun-computer-uneasy.tistory.com/
Stars
A continuously updated project to track the latest progress in the field of multi-modal object tracking. This project focuses solely on single-object tracking.
2026 AI/ML internship & new graduate job list updated daily
Official Repository for NeurIPS'25 Paper "Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task"
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Official implementation of RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)
Wan: Open and Advanced Large-Scale Video Generative Models
Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
HunyuanVideo-1.5: A leading lightweight video generation model
🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.
Official code for PRInTS: Rewarding Agents for Long-Horizon Information Seeking
Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
**Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.
This repository collects and organises state-of-the-art papers on spatial reasoning for Multimodal Vision-Language Models (MVLMs).
Official implementation for paper How Can Objects Help Video-Language Understanding
A step-by-step reasoning framework for 3D scene understanding
GraphicBench: A Planning Benchmark for Graphic Design Generation with Language Agents
Official code for EgoGazeVQA, accepted to NeurIPS D&B 2025
Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"