SRI International
- Princeton
- @AnirudhSom
Stars
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
An open access book on scientific visualization using Python and matplotlib
Large Language Model Text Generation Inference
Source code for Twitter's Recommendation Algorithm
ImageBind: One Embedding Space to Bind Them All
A Python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
Fully local web research and report writing assistant
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
A curated list of practical financial machine learning tools and applications.
Agent S: an open agentic framework that uses computers like a human
BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection and pose estimation models
Collection of generative models, e.g. GAN, VAE in PyTorch and TensorFlow.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
An unofficial Python package that returns responses from Google Bard using a cookie value.
Cracking the Coding Interview 6th Ed. Python Solutions
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Structured data extraction and instruction calling with ML, LLM and Vision LLM
[ECCV 2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild