Stars
All Algorithms implemented in Python
🦜🔗 The platform for reliable agents.
Godot Engine – Multi-platform 2D and 3D game engine
A list of awesome beginner-friendly projects.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
OpenMMLab Detection Toolbox and Benchmark
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Code for Machine Learning for Algorithmic Trading, 2nd edition.
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
This repository contains demos I made with the Transformers library by HuggingFace.
Large Language Model Text Generation Inference
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town, a virtual town where AI characters live, chat and socialize.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Panel: The powerful data exploration & web app framework for Python
Stable diffusion for real-time music generation
FILM: Frame Interpolation for Large Motion, in ECCV 2022.
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
Metric depth estimation from a single image
Collection of popular and reproducible image denoising works.
A Colab Gradio web UI for running Large Language Models
Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning
A timeline of the latest AI models for audio generation, starting in 2023!
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Collection of Google Colaboratory notebooks for fast and easy experiments
📚 A collection of Deep Learning based Image Colorization and Video Colorization papers.
A ready-to-use curated list of Spectral Indices for Remote Sensing applications.