Skip to content
View Daisyqk's full-sized avatar

Block or report Daisyqk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

一个开源的多角色、多情绪 AI 配音生成平台,支持小说、剧本、视频等内容的自动配音与导出。

Python 194 25 Updated Nov 4, 2025

使用IndexTTS模型在ComfyUI中实现高质量文本到语音转换的自定义节点。支持中文和英文文本,可以基于参考音频复刻声音特征。

Python 507 46 Updated Oct 23, 2025

This is the GitHub page for publicly available emotional speech data.

368 25 Updated Jan 6, 2022

Long-form streaming TTS system for multi-speaker dialogue generation

Python 1,198 106 Updated Oct 26, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,165 1,877 Updated Oct 21, 2025

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 1,710 177 Updated Nov 6, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,647 966 Updated Oct 23, 2025

MagicData-RAMC Dataset and Baseline

Shell 56 11 Updated Sep 13, 2022

NeurIPS2023 - A generic biosignal learning framework. Large EEG pre-trained models.

Python 162 26 Updated Dec 11, 2023

Mother of All BCI Benchmarks

Python 867 217 Updated Nov 7, 2025

Official codebase for "Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking" (NeurIPS 2024, Spotlight).

Python 140 34 Updated Oct 27, 2025

[ICLR 2025] CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding

Python 203 26 Updated Aug 11, 2025

[ICLR 2024 spotlight] Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

Python 461 87 Updated Sep 29, 2025

Deep learning software to decode EEG, ECG or MEG signals

Python 1,067 233 Updated Nov 7, 2025

Public repository of the start kit

Jupyter Notebook 70 36 Updated Oct 11, 2025
Python 13 2 Updated Sep 29, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 14,903 1,684 Updated Nov 7, 2025

喜马拉雅专辑批量下载小工具

JavaScript 58 15 Updated Mar 6, 2025

Frontier Open-Source Text-to-Speech

9,874 1,250 Updated Sep 5, 2025

💯2025年 软件设计师 (软考中级)备考资源库+配套免费刷题软件。https://ruankaodaren.com

1,053 200 Updated Aug 14, 2025

Simple software for downloading podcasts

Python 217 24 Updated Jul 2, 2025

xiaoyuzhou fm audio downloder.

Python 44 10 Updated Mar 12, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 684 270 Updated Oct 27, 2025

喜马拉雅专辑音频一键下载工具

JavaScript 1,220 171 Updated Aug 22, 2025

🧡 Everything is RSSible

TypeScript 39,707 8,702 Updated Nov 7, 2025

采集Apple Podcast数据

Python 3 Updated Apr 11, 2025

下载指定 B 站 UP 主全部或指定范围的音频,支持多种合集。A script to download all audios of the Bilibili uploader you love.

Python 71 14 Updated Sep 12, 2025

A feature-rich command-line audio/video downloader

Python 134,132 10,772 Updated Nov 5, 2025

👾 Fast and simple video download library and CLI tool written in Go

Go 30,594 3,218 Updated Sep 15, 2025

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

SCSS 33 3 Updated Dec 31, 2023
Next