REVSMART WEARABLE TECHNOLOGIES PVT LTD
- Coimbatore
- www.orchidsasia.com
- @jags111
- jagsdesigner
Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A latent text-to-image diffusion model
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Inpaint anything using Segment Anything and inpainting models.
A unified framework for 3D content generation.
Official Code for Stable Cascade
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Try out deep learning models online on Google Colab
An extensive node suite for ComfyUI with over 210 new nodes
Code for Motion Representations for Articulated Animation paper
Concept Sliders for Precise Control of Diffusion Models
Tools to train a generative model on arbitrary audio samples
[ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.
[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Erasing Concepts from Diffusion Models
Implementation of Paint-with-words with Stable Diffusion: a method from eDiff-I that lets you generate an image from a text-labeled segmentation map.
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Course content and resources for the AIAIART course.
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
Code for "Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views" (CVPR 2019, T-PAMI 2021)
[ECCV-2024] This is the official implementation of ZeST.