Skip to content
View evan2jiang's full-sized avatar

Block or report evan2jiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

1086 results for source starred repositories
Clear filter

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 11,178 1,119 Updated Oct 7, 2025

Examples of my Claude Code infrastructure with skill auto-activation, hooks, and agents

Shell 3,870 536 Updated Oct 31, 2025

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 43,917 14,816 Updated Oct 24, 2025

Microsoft AI

Python 2,157 601 Updated May 10, 2025

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

858 81 Updated Jul 8, 2025

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Python 562 29 Updated Nov 2, 2025

Deezer source separation library including pretrained models.

Python 27,705 3,045 Updated Apr 2, 2025

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 192 14 Updated Jul 29, 2025

Speech recognition

C 1,161 171 Updated Oct 29, 2025

中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型

Jupyter Notebook 1,625 173 Updated Oct 19, 2025

implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain

Python 48 7 Updated Nov 4, 2020

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 139 8 Updated Nov 5, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,950 11,503 Updated Nov 3, 2025

LLMs-from-scratch项目中文翻译

Jupyter Notebook 1,906 310 Updated Oct 15, 2025
MATLAB 1 Updated Jul 23, 2025

Collection of MATLAB scripts and toolboxes regarding my Master Thesis on psychoacoustics

MATLAB 10 1 Updated Dec 16, 2017

CMSIS-DSP embedded compute library for Cortex-M and Cortex-A

C 848 196 Updated Oct 27, 2025

Collection of papers related to neural nets/machine learning for audio DSP.

144 4 Updated Apr 29, 2025

PDFs and Codelabs for the Efficient Deep Learning book.

Jupyter Notebook 202 25 Updated May 29, 2023

Rust speaker safety daemon for Asahi Linux

Rust 182 14 Updated Mar 29, 2025

Loudspeaker simulation

Jupyter Notebook 5 1 Updated Aug 22, 2025

FEMM loudspeaker model

Lua 4 3 Updated Sep 27, 2021

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 62,711 9,240 Updated Nov 5, 2025

Domestic environment sound event detection task

Python 149 69 Updated Jun 11, 2024

Speech Reinforcement for In-Room Communications

MATLAB 7 4 Updated Mar 9, 2025

Feedback Analysis and Cancellation Toolkit

MATLAB 24 7 Updated Nov 26, 2018

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,821 159 Updated Oct 9, 2025
Python 67 3 Updated Sep 25, 2025
Next