harshm2601

Follow

Harsh Maheshwari harshm2601

Follow

27 followers · 142 following

Achievements

Achievements

Highlights

Pro

Lists (1)

Sort

✨ Inspiration

Stars

surrealdb / surrealdb

A scalable, distributed, collaborative, document-graph database, for the realtime web

Rust 30,598 1,087 Updated Dec 19, 2025

FalkorDB / FalkorDB

A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 2,612 196 Updated Dec 19, 2025

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,004 141 Updated Dec 19, 2025

apple / foundationdb

FoundationDB - the open source, distributed, transactional key-value store

C++ 16,023 1,457 Updated Dec 19, 2025

apple / ml-sharp

Sharp Monocular View Synthesis in Less Than a Second

Python 3,487 205 Updated Dec 19, 2025

google / A2UI

TypeScript 5,078 344 Updated Dec 19, 2025

NVlabs / Fast-FoundationStereo

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

236 5 Updated Dec 18, 2025

madd86 / awesome-system-design

A curated list of awesome System Design (A.K.A. Distributed Systems) resources.

11,304 1,236 Updated Jun 27, 2024

microsoft / MeshTransformer

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

Python 633 96 Updated Jul 6, 2023

Gy920 / segment-anything-2-real-time

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 553 91 Updated Jun 3, 2025

Aleafy / V-RGBX

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

54 Updated Dec 15, 2025

huanghoujing / AlignedReID-Re-Production-Pytorch

Reproduce AlignedReID: Surpassing Human-Level Performance in Person Re-Identification, using Pytorch.

Python 645 190 Updated Oct 25, 2018

DanceTrack / DanceTrack

[CVPR2022] DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion

Python 439 38 Updated Sep 27, 2024

PeizeSun / TransTrack

Multiple Object Tracking with Transformer

Python 666 110 Updated Apr 30, 2023

ifzhang / FairMOT

[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking

Python 4,203 929 Updated Sep 19, 2023

obss / sahi

Framework agnostic sliced/tiled inference + interactive ui + error analysis plots

Python 4,991 718 Updated Dec 15, 2025

simstudioai / sim

Open-source platform to build and deploy AI agent workflows.

TypeScript 23,675 2,944 Updated Dec 19, 2025

jamjamjon / usls

A Rust library integrated with ONNXRuntime, providing a collection of Computer Vison and Vision-Language models such as YOLO, FastVLM, and more.

Rust 291 40 Updated Dec 11, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,064 521 Updated May 5, 2025

AI-Hypercomputer / google-cloud-mldiagnostics

Python 6 2 Updated Dec 8, 2025

cxh0519 / VTB

Official implementation of "A Simple Visual-Textual Baseline for Pedestrian Attribute Recognition" [TCSVT 2022]

Python 39 Updated Jan 17, 2024

valencebond / Rethinking_of_PAR

Pytorch Pedestrian Attribute Recognition: A strong PyTorch baseline for pedestrian attribute recognition and multi-label classification.

Python 209 38 Updated Feb 12, 2023

Zplusdragon / PLIP

[NeurIPS2024] PLIP: Language-Image Pre-training for Person Representation Learning

Python 131 10 Updated Dec 17, 2024

Syliz517 / CLIP-ReID

Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)

Python 445 63 Updated Nov 21, 2023

MaXDL4Phys / tear

Text-Enhanced Zero-Shot Action Recognition

Python 2 1 Updated Sep 11, 2024

Shahzadnit / T2L

Python 6 Updated May 11, 2025

intel / TVP

Python 15 Updated Aug 4, 2025

YuanGongND / ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 463 40 Updated Apr 24, 2024

anandpathak / AnimeAvataar

creating Anime Avataar from a facial image

C++ 44 14 Updated Mar 26, 2017

zai-org / Open-AutoGLM

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 17,957 2,807 Updated Dec 19, 2025