-
ViveStudios
- Anyang, Gyeonggi, South Korea
Stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Official inference framework for 1-bit LLMs
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Open-Sora: Democratizing Efficient Video Production for All
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A TTS model capable of generating ultra-realistic dialogue in one pass.
Fast and accurate AI powered file content types detection
Bringing Old Photo Back to Life (CVPR 2020 oral)
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.13. You feed it your Python app, it does a lot of clever things, and spits out an executable or exte…
Enjoy the magic of Diffusion models!
Generate 3D objects conditioned on text or images
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
🐍 Geometric Computer Vision Library for Spatial AI
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Accessible large language models via k-bit quantization for PyTorch.
Pythonic AI generation of images and videos
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
🔥 2D and 3D Face alignment library build using pytorch