Lists (1)
Sort Name ascending (A-Z)
Stars
Large Language Model Text Generation Inference
Godot Engine – Multi-platform 2D and 3D game engine
🦜🔗 The platform for reliable agents.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
📚 A collection of sketch based application papers.
A list of awesome beginners-friendly projects.
Panel: The powerful data exploration & web app framework for Python
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Gamma-ray spectroscopy analysis tools with a Graphical User Interface (GUI)
All Algorithms implemented in Python
A curated list of open source projects used in nuclear science and engineering
Collection of google colaboratory notebooks for fast and easy experiments
📚 A collection of Deep Learning based Image Colorization and Video Colorization papers.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Awesome Spectral Indices in Python.
A ready-to-use curated list of Spectral Indices for Remote Sensing applications.
A repository of custom scripts to be used with Sentinel Hub
An easy-to-run OCR model pipeline based on CRNN and CTC loss
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Seth's AI Tools: A Unity based front end that uses ComfyUI and LLMs to create stories, images, movies, quizzes and posters
C++ library for converting text to phonemes for Piper
This repository contains demos I made with the Transformers library by HuggingFace.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
Metric depth estimation from a single image
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.