Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,190 106 Updated Oct 26, 2025

Multilingual large voice generation model with full-stack support for inference, training, and deployment.

Python 17,128 1,874 Updated Oct 21, 2025

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 1,614 166 Updated Nov 4, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,631 963 Updated Oct 23, 2025

MagicData-RAMC Dataset and Baseline

Shell 56 11 Updated Sep 13, 2022

NeurIPS2023 - A generic biosignal learning framework. Large EEG pre-trained models.

Python 161 26 Updated Dec 11, 2023

Mother of All BCI Benchmarks

Python 864 217 Updated Oct 8, 2025

Official codebase for "Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking" (NeurIPS 2024, Spotlight).

Python 139 34 Updated Oct 27, 2025

[ICLR 2025] CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding

Python 202 26 Updated Aug 11, 2025

[ICLR 2024 spotlight] Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

Python 460 87 Updated Sep 29, 2025

Deep learning software to decode EEG, ECG or MEG signals

Python 1,062 232 Updated Nov 5, 2025

Public repository of the starter kit

Jupyter Notebook 70 37 Updated Oct 11, 2025

Python 13 2 Updated Sep 29, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 14,795 1,669 Updated Oct 30, 2025

A small tool for batch-downloading Ximalaya albums

JavaScript 58 15 Updated Mar 6, 2025

Frontier Open-Source Text-to-Speech

9,849 1,249 Updated Sep 5, 2025

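💯 2025 Software Designer (Ruankao intermediate level) exam preparation resource library, plus accompanying free practice-question software. https://ruankaodaren.com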

1,044 197 Updated Aug 14, 2025

Simple software for downloading podcasts

Python 217 24 Updated Jul 2, 2025

Xiaoyuzhou FM audio downloader.

Python 44 10 Updated Mar 12, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 681 270 Updated Oct 27, 2025

One-click download tool for Ximalaya album audio

JavaScript 1,220 171 Updated Aug 22, 2025

🧡 Everything is RSSible

TypeScript 39,671 8,693 Updated Nov 5, 2025

Collects Apple Podcast data

Python 3 Updated Apr 11, 2025

A script to download all audio, or a specified range of it, from a Bilibili uploader you love, with support for several collection types.

Python 71 14 Updated Sep 12, 2025

A feature-rich command-line audio/video downloader

Python 133,906 10,752 Updated Nov 3, 2025

👾 Fast and simple video download library and CLI tool written in Go

Go 30,582 3,218 Updated Sep 15, 2025

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

SCSS 33 3 Updated Dec 31, 2023

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 691 90 Updated Oct 27, 2025

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,640 448 Updated Sep 25, 2025

A generative speech model for daily dialogue.

Python 38,101 4,132 Updated Jul 6, 2025