- Seoul, South Korea
- https://hamzashafiq.me/
- https://orcid.org/0000-0003-2212-5878
- in/hamzashafiq28
Highlights
- Pro
Stars
[MICCAI 2025 Oral] Code for "EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training."
[AAAI 2026] CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Awesome papers & datasets specifically focused on long-term videos.
Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]
Learn OpenCV : C++ and Python Examples
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming