-
Epidemic Sound
- Stockholm, Sweden
- https://mmxgn.github.io
Stars
Kandinsky 5.0: A family of diffusion models for Video & Image generation
Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.
This is the git repository for the Dataflux Python client library, providing fast listing and download of small files from GCS in Python. Also see https://github.com/GoogleCloudPlatform/dataflux-py…
A Data Streaming Library for Efficient Neural Network Training
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Raspberry Pi ARM based bare metal examples
dawproject-py – A Python repository with code for parsing, generating, and modifying DAWProject files, enabling seamless DAW interoperability.
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Jenova Runtime is a component for the Godot Engine that brings fully-featured C++ scripting directly into the engine.
A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
Chordonomicon: A Dataset of 666,000 Chord Progressions
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
[ECCV 2024 - Oral] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)
😎 A curated list of the best resources in the Nix community [maintainer=@cyntheticfox]
The "Activate Windows" watermark ported to Linux
This repository gathers the list of online publicly available bioacoustics datasets that can be used together with deep learning.
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
ALIEN is a CUDA-powered artificial life simulation program.
Fine-tune Stable Audio Open with DiT ControlNet.