-
SRLLC
- New York
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
real time face swap and one-click video deepfake with only a single image
Curated list of design and UI resources from stock photos, web templates, CSS frameworks, UI libraries, tools and much more
Ghidra is a software reverse engineering (SRE) framework
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Your self-hosted, globally interconnected microblogging community
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A game theoretic approach to explain the output of any machine learning model.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
NVIDIA Linux open GPU kernel module source
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
A Conversational Speech Generation Model
Osintgram is a OSINT tool on Instagram. It offers an interactive shell to perform analysis on Instagram account of any users by its nickname
A collection of GPT system prompts and various prompt injection/leaking knowledge.
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`
CoTracker is a model for tracking any point (pixel) on a video.
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
Linux-CAN / SocketCAN user space applications
DeepMind's Tacotron-2 Tensorflow implementation
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Non-local Neural Networks for Video Classification
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Official PyTorch implementation of BigVGAN (ICLR 2023)
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation