Avijit Dasgupta avijit9

🎯

Focusing

PhD @ IIIT Hyderabad

86 followers · 380 following

Hyderabad, India
https://avijit9.github.io/

Stars

aniket004 / DuoLoRA

DuoLoRA implementation

Python 6 Updated Oct 18, 2025

luigifreda / pyslam

pySLAM is a Python-based Visual SLAM pipeline that supports monocular, stereo, and RGB-D cameras. It offers a wide range of modern local and global features, multiple loop-closing strategies, a vol…

Python 2,713 446 Updated Nov 6, 2025

lucas-ventura / chapter-llama

Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"

Python 78 10 Updated Jun 6, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,190 1,665 Updated Sep 24, 2025

zhengrongz / AoTD

[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".

Python 50 Updated May 25, 2025

showlab / UniVTG

[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Python 368 34 Updated May 8, 2024

KAIST-Visual-AI-Group / Diffusion-Assignment1-DDPM

Jupyter Notebook 41 31 Updated Feb 8, 2025

pangzhan27 / GTLA

Group-wise Temporal Logit Adjustment for TAS

Python 10 Updated Oct 24, 2024

kuleshov-group / awesome-discrete-diffusion-models

A curated list for awesome discrete diffusion models resources.

488 19 Updated Sep 9, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,545 1,190 Updated Oct 11, 2025

sidgairo18 / simple_diffusion_models

Building simple diffusion models for image generation. More so for understanding and learning.

Python 8 2 Updated Mar 30, 2025

DavidZhang73 / TDGV

[WACV'25] Temporal Instructional Diagram Grounding in Unconstrained Videos

Python 5 Updated Dec 17, 2024

anucvml / vidat

Video Annotation Tool

Vue 226 29 Updated Jun 18, 2024

jmhb0 / viddiff

[ICLR 2025] Video Action Differencing

Python 47 2 Updated Jul 3, 2025

presmihaylov / booknotes

A collection of my book notes on various subjects, mainly computer science

Java 2,888 749 Updated Mar 1, 2025

olga-zats / GTDA

[ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation

Python 18 1 Updated May 29, 2025

zihuixue / AlignEgoExo

Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment" (NeurIPS 2023)

Python 19 2 Updated Apr 5, 2024

tovacinni / research-website-template

React + Next.js template for research websites (for PhD students, researchers, etc)

TypeScript 202 82 Updated Jan 12, 2025

yiyixuxu / TimeSformer-rolled-attention

Visualizing the learned space-time attention using Attention Rollout

Jupyter Notebook 37 8 Updated Apr 1, 2022

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 22,725 1,378 Updated Nov 6, 2025

SimarKareer / EgoMimic

Jupyter Notebook 139 12 Updated Nov 10, 2024

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,990 223 Updated Oct 14, 2025

chalk-diagrams / chalk

A declarative drawing API in Python

Python 298 15 Updated Aug 28, 2024

BoltzmannEntropy / interviews.ai

It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced research…

4,725 320 Updated Aug 22, 2025

zhengli97 / Awesome-Prompt-Adapter-Learning-for-VLMs

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

698 31 Updated Sep 8, 2025

yenchenlin / nerf-pytorch

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 5,941 1,123 Updated Jul 25, 2024

ViLab-UCSD / LaGTran_ICML2024

Code and models for the ICML 2024 paper "Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos"

Python 6 1 Updated May 18, 2024

LLaVA-VL / LLaVA-NeXT

Python 4,373 417 Updated Sep 14, 2025

facebookresearch / Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Python 1,985 220 Updated Mar 21, 2024

Annusha / xmic

X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024

Python 11 Updated Nov 7, 2024
