๐๏ธ ML
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Automatically generate and overlay subtitles for any video.
An unnecessarily tiny implementation of GPT-2 in NumPy.
Examples and guides for using the OpenAI API
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The โฆ
Video Frame Interpolation & Super Resolution using NVIDIA's TensorRT & Tencent's NCNN inference, beautifully crafted and packaged into a single app
๐ค Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
๐ธ๐ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
โ๏ธ Neural Style Transfer: A Review
A list of resources for video enhancement, including video super-resolutio, interpolation, denoising, compression artifact removal et al..
Convolutional Neural Network for 3D meshes in PyTorch
Real-Time High-Resolution Background Matting
Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the iโฆ
Automatic rigging using neural network from RigNet
TensorFlow (Python API) implementation of Neural Style
Flowframes Windows GUI for video interpolation using DAIN (NCNN) or RIFE (CUDA/NCNN)
Official PyTorch implementation of "I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image", ECCV 2020
High-Resolution 3D Human Digitization from A Single Image.
This repo contains source code and materials for the TEmporally COherent GAN SIGGRAPH project.
Attention-Guided Hierarchical Structure Aggregation for Image Matting(CVPR2020)
TensorFlow (Python API) implementation of Neural Style
TensorFlow CNN for fast style transfer โก๐ฅ๐จ๐ผ
The code for the bark-voicecloning model. Training and inference.
Metric depth estimation from a single image
Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)
Paint by Example: Exemplar-based Image Editing with Diffusion Models