-
xares-llm Public
Forked from xiaomi-research/xares-llm
XARES-LLM
Python Apache License 2.0 Updated Dec 19, 2025 -
AIGC_music_eval_tools Public
Organized music evaluation tools. Currently comprises Vocal Clarity, Prompt Alignment, SongEval, AudioBox-Aes, and others.
Python Updated Dec 11, 2025 -
MIT6S184 Public
Labs from "Introduction to Flow Matching and Diffusion Models" MIT-6.S184
Jupyter Notebook Updated Dec 5, 2025 -
assignment2-systems Public
Forked from stanford-cs336/assignment2-systems
Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch
Python MIT License Updated Nov 14, 2025 -
DiffRhythm Public
Forked from ASLP-lab/DiffRhythm
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Python Apache License 2.0 Updated Oct 29, 2025 -
assignment1-basics Public
Forked from stanford-cs336/assignment1-basics
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
Jupyter Notebook Updated Oct 21, 2025 -
ADI-SHARC-Command-Line-Tools Public
This project is a command-line toolkit for SHARC ADSP development on Ubuntu Linux. It's designed for embedded developers, researchers, and "coding agents" who need to automate their workflow withou…
Shell Updated Aug 21, 2025 -
MU-LLaMA Public
Forked from shansongliu/MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
Python GNU General Public License v3.0 Updated Aug 18, 2025 -
jamify Public
Forked from declare-lab/jamify
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
Python Other Updated Aug 7, 2025 -
SongEval Public
Forked from ASLP-lab/SongEval
A song aesthetic evaluation toolkit trained on SongEval.
Python Apache License 2.0 Updated Jun 15, 2025 -
audiobox-aesthetics Public
Forked from facebookresearch/audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
Python Creative Commons Attribution 4.0 International Updated Jun 5, 2025 -
SRP-DNN Public
Forked from BingYang-20/SRP-DNN
A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
Jupyter Notebook MIT License Updated Feb 27, 2024 -
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
Python MIT License Updated Feb 16, 2024 -
NTU_ML_2023 Public
HUNG-YI LEE (李宏毅) Machine Learning 2023 Homework
Jupyter Notebook Updated Nov 25, 2023 -
audiocraft Public
Forked from facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Python MIT License Updated Nov 23, 2023 -
so-vits-svc Public
Forked from svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Python GNU Affero General Public License v3.0 Updated Nov 11, 2023 -
stable-diffusion-guide Public
Forked from JiaojiaoYe1994/stable-diffusion-guide
Here is a guide for Stable Diffusion
Jupyter Notebook Updated Oct 31, 2023 -
TriU-Net-module Public
Forked from CaA23187/TriU-Net-module
PyTorch implementation of TriU-Net
Jupyter Notebook Updated Oct 29, 2023 -
di_nn Public
Forked from egrinstein/di_nn
Dual-Input Neural Networks
Python MIT License Updated Oct 24, 2023 -
deep-music-enhancer Public
Forked from serkansulun/deep-music-enhancer
Python Other Updated Oct 4, 2023 -
Transformer-based-SER Public
Forked from HoseinAzad/Transformer-based-SER
Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch
Python Updated Sep 20, 2023 -
elderly_ser Public
Forked from HLTCHKUST/elderly_ser
Transferability of cross-lingual and cross-age speech emotion recognition
Jupyter Notebook MIT License Updated Jun 30, 2023 -
bwe_historical_recordings Public
Forked from eloimoliner/bwe_historical_recordings
Bandwidth Extension of Historical Recordings using Generative Adversarial Networks
Python Updated May 25, 2023 -
Speech-Emotion-Recognition Public
Forked from Renovamen/Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)
Python MIT License Updated Mar 25, 2023 -
cs-self-learning Public
Forked from PKUFlyingPig/cs-self-learning
A self-study guide to computer science
-
seld-dcase2022 Public
Forked from sharathadavanne/seld-dcase2022
Baseline method for sound event localization task of DCASE 2022 challenge
Python Updated Jun 21, 2022 -
blog_my_coding_road Public
Forked from transitive-bullshit/nextjs-notion-starter-kit
Deploy your own Notion-powered website in minutes with Next.js and Vercel.
TypeScript MIT License Updated Apr 26, 2022