Repositories starred by evan2jiang

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 11,177 1,118 Updated Oct 7, 2025

Examples of my Claude Code infrastructure with skill auto-activation, hooks, and agents

Shell 3,864 535 Updated Oct 31, 2025

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 43,913 14,812 Updated Oct 24, 2025

Microsoft AI

Python 2,156 601 Updated May 10, 2025

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

858 81 Updated Jul 8, 2025

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Python 562 29 Updated Nov 2, 2025

Deezer source separation library including pretrained models.

Python 27,707 3,045 Updated Apr 2, 2025

Code for the blog "Neural audio codecs: how to get audio into LLMs"

Python 122 3 Updated Oct 20, 2025

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 192 14 Updated Jul 29, 2025

Speech recognition

C 1,161 171 Updated Oct 29, 2025

Chinese translation of Hands-On-Large-Language-Models (hands-on-llms), a hands-on guide to learning large language models

Jupyter Notebook 1,625 173 Updated Oct 19, 2025

Implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain

Python 48 7 Updated Nov 4, 2020

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 139 8 Updated Nov 5, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,939 11,503 Updated Nov 3, 2025

Chinese translation of the LLMs-from-scratch project

Jupyter Notebook 1,906 310 Updated Oct 15, 2025

MATLAB 1 Updated Jul 23, 2025

Collection of MATLAB scripts and toolboxes regarding my Master Thesis on psychoacoustics

MATLAB 10 1 Updated Dec 16, 2017

CMSIS-DSP embedded compute library for Cortex-M and Cortex-A

C 848 196 Updated Oct 27, 2025

Collection of papers related to neural nets/machine learning for audio DSP.

144 4 Updated Apr 29, 2025

PDFs and Codelabs for the Efficient Deep Learning book.

Jupyter Notebook 202 25 Updated May 29, 2023

Rust speaker safety daemon for Asahi Linux

Rust 182 14 Updated Mar 29, 2025

Loudspeaker simulation

Jupyter Notebook 5 1 Updated Aug 22, 2025

FEMM loudspeaker model

Lua 4 3 Updated Sep 27, 2021

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,706 9,239 Updated Nov 5, 2025

Domestic environment sound event detection task

Python 149 69 Updated Jun 11, 2024

Automatic speech recognition paper roadmap, including HMM, DNN, RNN, CNN, Seq2Seq, and Attention

1 Updated Jun 28, 2017

Speech Reinforcement for In-Room Communications

MATLAB 7 4 Updated Mar 9, 2025