- π Final-year M.Tech (Research) at IISc, CDS β working with VAL / VCL.
- π Research interests: sketch-guided localization, handβobject interaction generation, 3D scene representations (NeRF, 3D Gaussians), and grounded detection.
- βοΈ Love building research-grade systems: custom PyTorch ops, DDP/NCCL, SLURM on DGX, Dockerized reproducible setups.
- π± Currently exploring: triplane-guided HOI generation, sketch-conditioned GroundingDINO, and agentic ML systems for real-world impact.
- Sketch-conditioned GroundingDINO β extended grounding with a SketchEncoder for retrieval-aligned detection.
- HOI-Diffusion β triplane intermediate representation for coherent handβobject trajectories in 3D.
- 3D Gaussian Workflows β experiments on generalizable splats & NeRF pipelines with robust camera handling.
- Award β Co-author on IAPR Best Paper (CVIP 2023) for vision-based fire detection & classification.
I enjoy turning messy research ideas into clean, reproducible repos with good docs, configs, and ablations.
- π§© Better open-set / sketch-guided localization for real-world categories.
- π Consistent HOI generation with physically plausible contacts.
- π§± Infra: multi-GPU training (DDP/NCCL), data pipelines, and robust loaders for large mixed-modality datasets.
I refactor dataloaders more than I refactor life. Also: dark mode forever.