- REVSMART WEARABLE TECHNOLOGIES PVT LTD
- Coimbatore
- www.orchidsasia.com
- @jags111
- jagsdesigner
Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A latent text-to-image diffusion model
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Inpaint anything using Segment Anything and inpainting models.
A unified framework for 3D content generation.
Official Code for Stable Cascade
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Try out deep learning models online on Google Colab
Code for Motion Representations for Articulated Animation paper
Tools to train a generative model on arbitrary audio samples
Concept Sliders for Precise Control of Diffusion Models
[ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Implementation of Paint-with-words with Stable Diffusion: a method from eDiff-I that lets you generate an image from a text-labeled segmentation map.
Erasing Concepts from Diffusion Models
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Course content and resources for the AIAIART course.
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
[ECCV-2024] This is the official implementation of ZeST.
OpenAerialMap is an open service to provide access to a commons of openly licensed imagery and map layer services.