- Athens
- https://filby89.github.io/
Starred repositories
A latent text-to-image diffusion model
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
A multi-voice TTS system trained with an emphasis on quality
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Best Practices, code samples, and documentation for Computer Vision.
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links
Automatic extraction of relevant features from time series
The "Python Machine Learning (2nd edition)" book code repository and info resource
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
A small package to create visualizations of PyTorch execution graphs
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
Handout for the tutorial "Creating publication-quality figures with matplotlib"
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
PyTorch implementation of various methods for continual learning (XdG, EWC, SI, LwF, FROMP, DGR, BI-R, ER, A-GEM, iCaRL, Generative Classifier) in three different scenarios.
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
A framework for using LSTMs to detect anomalies in multivariate time series data. Includes spacecraft anomaly data and experiments from the Mars Science Laboratory and SMAP missions.
yzhou359 / MakeItTalk
Forked from adobe-research/MakeItTalk
Efficient face emotion recognition in photos and videos
FaceScape (PAMI2023 & CVPR2020)
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
A high-fidelity 3D face reconstruction library from monocular RGB image(s)
Fine-tuning StyleGAN2 for Cartoon Face Generation
This repo aims to be a useful collection of notebooks/code for understanding and implementing seq2seq neural networks for time series forecasting. Networks are constructed with keras/tensorflow.