Lists (1)
Sort Name ascending (A-Z)
Stars
Large Language Model Text Generation Inference
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Panel: The powerful data exploration & web app framework for Python
Godot Engine – Multi-platform 2D and 3D game engine
🦜🔗 The platform for reliable agents.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
All Algorithms implemented in Python
📚 A collection of sketch based application papers.
📚 A collection of Deep Learning based Image Colorization and Video Colorization papers.
Gamma-ray spectroscopy analysis tools with a Graphical User Interface (GUI)
A list of awesome beginners-friendly projects.
Collection of google colaboratory notebooks for fast and easy experiments
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
A curated list of open source projects used in nuclear science and engineering
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Awesome Spectral Indices in Python.
A ready-to-use curated list of Spectral Indices for Remote Sensing applications.
A repository of custom scripts to be used with Sentinel Hub
An easy-to-run OCR model pipeline based on CRNN and CTC loss
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters
C++ library for converting text to phonemes for Piper
This repository contains demos I made with the Transformers library by HuggingFace.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
Metric depth estimation from a single image
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.