Sid2697

Focusing

Siddhant Bansal Sid2697

Focusing

Research focused on exploring various aspects of first-person (egocentric) vision. Working with Prof. Dima Damen at University of Bristol.

84 followers · 69 following

MaVi, University of Bristol
Bristol
https://sid2697.github.io
@Sid__Bansal

Achievements

x2 x2

Achievements

x2 x2

Highlights

Organizations

Stars

rolpotamias / WiLoR

WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild

Python 488 41 Updated Apr 7, 2026

lixiny / manotorch

MANO hand model in PyTorch (anatomy consistent, anchors, etc)

Python 282 27 Updated Feb 2, 2026

zc-alexfan / arctic

[CVPR 2023] Official repository for downloading, processing, visualizing, and training models on the ARCTIC dataset.

Python 467 29 Updated Mar 4, 2026

DLR-RM / BlenderProc

A procedural Blender pipeline for photorealistic training image generation

Python 3,503 507 Updated Jan 20, 2026

hughw19 / NOCS_CVPR2019

[CVPR2019 Oral] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation on Python3, Tensorflow, and Keras

Python 493 74 Updated Dec 2, 2022

naver / dust3r

DUSt3R: Geometric 3D Vision Made Easy

Python 7,088 746 Updated Sep 24, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,979 2,422 Updated Apr 7, 2026

zc-alexfan / hold

[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and…

Python 473 14 Updated Mar 10, 2026

sinhasaptarshi / EveryShotCounts

Codebase for "Every Shot Counts: Using Exemplars for Repetition Counting in Videos"

Python 29 Updated Dec 18, 2024

rerun-io / rerun

An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data

Rust 10,555 712 Updated Apr 19, 2026

otaheri / GRAB

GRAB: A Dataset of Whole-Body Human Grasping of Objects

Python 366 35 Updated Mar 8, 2022

abelcabezaroman / definitive-image-comparison-slider

Light Vanilla Javascript library to compare multiples images with sliders. Also, you can add text and filters to your images.

JavaScript 70 6 Updated Mar 1, 2022

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,295 3,530 Updated Jan 26, 2025

xiexh20 / VisTracker

Official implementation for the CVPR'23 paper: Visibility Aware Human-Object Interaction Tracking from Single RGB Camera

Python 76 3 Updated Jun 10, 2023

dcharatan / flowmap

[3DV 2025] Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Python 975 92 Updated Mar 26, 2025

GradientSpaces / LivingScenes

[CVPR 2024, Highlight] Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments

Python 104 8 Updated Jul 5, 2024

janehwu / mcc-ho

MCC-HO

Python 57 7 Updated Dec 2, 2024

geopavlakos / hamer

HaMeR: Reconstructing Hands in 3D with Transformers

Python 949 138 Updated Feb 7, 2026

zhifanzhu / getagrip

Python 33 1 Updated Dec 4, 2025

cvlab-columbia / viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Jupyter Notebook 1,715 130 Updated Jan 29, 2024

epic-kitchens / epic-fields-code

Python 44 6 Updated Jan 13, 2026

LiheYoung / Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 8,068 611 Updated Jul 17, 2024

Sindhu-Hegde / gestsync

Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023

Python 47 2 Updated Sep 1, 2024

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,292 2,723 Updated Apr 17, 2026

apple / ml-ferret

Python 8,687 521 Updated Oct 9, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,741 2,905 Updated Sep 2, 2024

zai-org / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,738 455 Updated May 29, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,624 490 Updated Aug 7, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,663 1,124 Updated Apr 9, 2026

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,337 309 Updated Jun 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Siddhant Bansal Sid2697

Achievements

Achievements

Highlights

Organizations

Block or report Sid2697

Stars

rolpotamias / WiLoR

lixiny / manotorch

zc-alexfan / arctic

DLR-RM / BlenderProc

hughw19 / NOCS_CVPR2019

naver / dust3r

facebookresearch / sam2

zc-alexfan / hold

sinhasaptarshi / EveryShotCounts

rerun-io / rerun

otaheri / GRAB

abelcabezaroman / definitive-image-comparison-slider

meta-llama / llama3

xiexh20 / VisTracker

dcharatan / flowmap

GradientSpaces / LivingScenes

janehwu / mcc-ho

geopavlakos / hamer

zhifanzhu / getagrip

cvlab-columbia / viper

epic-kitchens / epic-fields-code

LiheYoung / Depth-Anything

Sindhu-Hegde / gestsync

meta-llama / llama-cookbook

apple / ml-ferret

Vision-CAIR / MiniGPT-4

zai-org / CogVLM

QwenLM / Qwen-VL

BradyFU / Awesome-Multimodal-Large-Language-Models

Instruction-Tuning-with-GPT-4 / GPT-4-LLM