Skip to content
View Arka-h's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Arka-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
arka-h/README.md

Hi, I'm Arka πŸ‘‹

Machine Learning β€’ Computer Vision β€’ Generative Models

Typing SVG


πŸ§‘β€πŸ”¬ About Me

  • πŸŽ“ Final-year M.Tech (Research) at IISc, CDS β€” working with VAL / VCL.
  • πŸ”­ Research interests: sketch-guided localization, hand–object interaction generation, 3D scene representations (NeRF, 3D Gaussians), and grounded detection.
  • βš™οΈ Love building research-grade systems: custom PyTorch ops, DDP/NCCL, SLURM on DGX, Dockerized reproducible setups.
  • 🌱 Currently exploring: triplane-guided HOI generation, sketch-conditioned GroundingDINO, and agentic ML systems for real-world impact.

πŸ”— Connect

LinkedIn Β  Google Scholar Β  Email Β  Website


πŸ› οΈ Languages & Tools


πŸ“Œ Selected Highlights

  • Sketch-conditioned GroundingDINO β€” extended grounding with a SketchEncoder for retrieval-aligned detection.
  • HOI-Diffusion β€” triplane intermediate representation for coherent hand–object trajectories in 3D.
  • 3D Gaussian Workflows β€” experiments on generalizable splats & NeRF pipelines with robust camera handling.
  • Award β€” Co-author on IAPR Best Paper (CVIP 2023) for vision-based fire detection & classification.

I enjoy turning messy research ideas into clean, reproducible repos with good docs, configs, and ablations.


πŸ”¬ What I’m Working On

  • 🧩 Better open-set / sketch-guided localization for real-world categories.
  • πŸ‘ Consistent HOI generation with physically plausible contacts.
  • 🧱 Infra: multi-GPU training (DDP/NCCL), data pipelines, and robust loaders for large mixed-modality datasets.

πŸ“ˆ GitHub Stats

stats

streak

summary


⚑ Fun Fact

I refactor dataloaders more than I refactor life. Also: dark mode forever.

views

Pinned Loading

  1. Vision-Reading-Group Vision-Reading-Group Public

    Stay upto date with recent papers in the field!

    1

  2. RudreshVeerkhare/Recruitment-Assisting-Platform RudreshVeerkhare/Recruitment-Assisting-Platform Public

    Recruitment Assisting Platform, an ML and NLP based System to classify resumes into potential job positions.

    Python 6 3

  3. GATE-CS-quiz-generator GATE-CS-quiz-generator Public

    Multi-section GATE CSE quiz generator based on GO's 3 volumes of PYQs

    Python 4

  4. Resume Resume Public

    TeX

  5. ml-hugs ml-hugs Public

    Forked from apple/ml-hugs

    Official repository of HUGS: Human Gaussian Splats (CVPR 2024)

    Python

  6. sam2 sam2 Public

    Forked from facebookresearch/sam2

    The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

    Jupyter Notebook