Skip to content
View accraze's full-sized avatar
πŸ‘½
πŸ‘½
  • Zone 8b

Organizations

@wikimedia @Earth-MGMT-SYS @wiki-ai @sunhypnotic @open-source-botany

Block or report accraze

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
39 stars written in Jupyter Notebook
Clear filter

πŸ”Š Text-Prompted Generative Audio Model

Jupyter Notebook 38,656 4,650 Updated Aug 19, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 23,798 2,039 Updated Sep 12, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,625 2,480 Updated Mar 13, 2025

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 13,576 1,603 Updated Oct 24, 2025

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,816 1,390 Updated Nov 4, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,633 965 Updated Oct 23, 2025

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,429 798 Updated Mar 15, 2025

Image restoration with neural networks but without learning.

Jupyter Notebook 8,039 1,445 Updated Apr 27, 2023

Official Code for Stable Cascade

Jupyter Notebook 6,582 526 Updated Jul 25, 2024

A course in reinforcement learning in the wild

Jupyter Notebook 6,328 1,767 Updated Sep 21, 2025

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 5,102 474 Updated Oct 14, 2025

A collection of infrastructure and tools for research in neural network interpretability.

Jupyter Notebook 4,699 657 Updated Feb 6, 2023

OmniGen2: Exploration to Advanced Multimodal Generation.

Jupyter Notebook 3,925 6 Updated Sep 30, 2025

Algorithms for outlier, adversarial and drift detection

Jupyter Notebook 2,447 237 Updated Oct 27, 2025

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Jupyter Notebook 2,329 150 Updated Nov 19, 2024

Google Colaboratory Notebooks and Repositories (by @firmai)

Jupyter Notebook 1,470 262 Updated Mar 10, 2022

Tools to train a generative model on arbitrary audio samples

Jupyter Notebook 1,109 175 Updated Apr 29, 2024

Utility functions for handling MIDI data in a nice/intuitive way.

Jupyter Notebook 981 162 Updated Oct 8, 2025

A list of Machine Learning Art Colabs

Jupyter Notebook 823 133 Updated Feb 21, 2022

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 778 75 Updated Sep 25, 2024

Machine Learning applied to sound

Jupyter Notebook 283 48 Updated Jun 8, 2025

Tegridy MIDI Dataset for precise and effective Music AI models creation.

Jupyter Notebook 232 16 Updated Oct 19, 2025

A neural attention model for speech command recognition

Jupyter Notebook 187 80 Updated Jul 12, 2025

Multiple notebooks which allow the use of various machine learning methods to generate or modify multimedia content

Jupyter Notebook 177 49 Updated Sep 23, 2023

The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.

Jupyter Notebook 162 3 Updated Dec 22, 2023

Symbolic Music NLP Artificial Intelligence Toolkit

Jupyter Notebook 109 19 Updated Nov 1, 2025

Generate YouTube Shorts using Reddit posts scraped with PRAW, title and captions generated with GPT, images and thumbnails generated with Stable Diffusion and voiceover with 11Labs

Jupyter Notebook 79 15 Updated Sep 25, 2023

Deploy a RAG use case on AWS by using Terraform and Amazon Bedrock

Jupyter Notebook 49 10 Updated Oct 8, 2025
Next