- Serna.ai
- India (UTC +05:30)
- in/rishi-swethan
- https://medium.com/@rishiswethan.c.r
Stars
[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
A CNN that diagnoses breast cancer from eosin-stained images. Trained on 400 images, it reaches 80% accuracy.
Python 3 library for downloading YouTube videos.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
OpenMMLab YOLO series toolbox and benchmark. Implements RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.
A high-throughput and memory-efficient inference and serving engine for LLMs
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
The official implementation of "Divergence of Features and Mean: A BatchNorm-based Abnormality Criterion for Weakly Supervised Video Anomaly Detection"
Images to inference with no labeling (use foundation models to train supervised models).
A state-of-the-art open visual language model (multimodal pre-trained model).
Classification of Fundus Images into 5 stages of Diabetic Retinopathy, and segmentation of blood vessels in fundus images
Refine high-quality datasets and visual AI models
Zero-shot crack detection with SAM and Grounding DINO.
PLVS is a real-time SLAM system with points, lines, volumetric mapping and 3D unsupervised incremental segmentation.
Efficient vision foundation models for high-resolution generation and perception.
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of YOLO-NAS.
Contains an audio emotion detection model, a facial emotion detection model, and a combined model that predicts emotions from video.
Package for imputing the arterial blood pressure (ABP) waveform from non-invasive physiological waveforms (PPG & ECG) using a deep neural network
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
ImageBind: One Embedding Space to Bind Them All
Feature rich WhatsApp Client for Desktop Linux
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.