SRLLC
New York
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Real-time face swap and one-click video deepfake with only a single image
Clone a voice in 5 seconds to generate arbitrary speech in real-time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
A Conversational Speech Generation Model
Osintgram is an OSINT tool for Instagram. It offers an interactive shell for analyzing any user's Instagram account by nickname
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
DeepMind's Tacotron-2 TensorFlow implementation
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Non-local Neural Networks for Video Classification
Official PyTorch implementation of BigVGAN (ICLR 2023)
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
A Python library for designing chips (Photonics, Analog, Quantum, MEMS), PCBs, and 3D-printable objects. We aim to make hardware design accessible, intuitive, and fun—empowering everyone to build t…
Code for "Neural Controlled Differential Equations for Irregular Time Series" (NeurIPS 2020 Spotlight)
MelGAN vocoder (compatible with NVIDIA/tacotron2)
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
CLI tool for exploring arXiv (inspired by karpathy's brilliant ArXiv Sanity Preserver)
Data and runtime repository for the Water Supply Forecast Rodeo competition on DrivenData