🎯 Bilibili comment-section data visualization and analysis software. Uploaders can use it to guide their choice of topics and identify their fan base.

Python 448 67 Updated Jun 27, 2025

Bravura music font, reference font for SMuFL (Standard Music Font Layout)

Inno Setup 394 43 Updated May 8, 2022

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 75,388 10,229 Updated Apr 6, 2026

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,902 5,230 Updated Mar 3, 2026

The New Yorker (azw3/epub/pdf)

150 15 Updated Apr 6, 2026

Download sheet music

TypeScript 2,826 115 Updated Apr 11, 2026

⚠️ This repo has moved to https://github.com/LibreScore/dl-librescore ⚠️ | Download sheet music (MSCZ, PDF, MusicXML, MIDI, MP3, download individual parts as PDF) from musescore.com for free, no lo…

TypeScript 2,754 194 Updated Feb 28, 2023

A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

Jupyter Notebook 260 35 Updated Apr 7, 2026

Generative models for conditional audio generation

Python 3,663 440 Updated Feb 14, 2026

Web video downloader, mainly supporting Bilibili, iQIYI, Tencent Video, MGTV, WeTV, and the iQIYI Taiwan site.

Python 1,344 269 Updated Jan 16, 2024

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,583 456 Updated Jan 5, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,021 6,034 Updated Aug 16, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,573 6,176 Updated Feb 9, 2026

A roundup of free online AI watermark-removal tools: remove watermarks from images and videos in one click

588 35 Updated Apr 15, 2025

Latest generation of Audiveris OMR engine

Java 2,413 364 Updated Apr 11, 2026

Guidance for courses in the Department of Computer Science and Technology, Tsinghua University

HTML 36,931 7,842 Updated Apr 9, 2026

Plainchant Analyser tool for MEI Neumes (PAM)

Svelte 3 1 Updated Apr 5, 2026

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 4,512 633 Updated Jul 15, 2025

Mamba SSM architecture

Python 17,936 1,688 Updated Apr 10, 2026

Deezer source separation library including pretrained models.

Python 28,155 3,065 Updated Apr 2, 2025

A list of tools, papers and code related to Fake Audio Detection.

Python 239 12 Updated Apr 10, 2026

A curated list of awesome articles, tutorials, libraries, webpages, etc.

191 10 Updated Jul 10, 2023

An "awesome music theory" kinda wiki with books, resources and courses for studying everything about music and sound

2,172 81 Updated Feb 11, 2026

Voice Recognition to Text Tool / A local tool that runs offline and converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats

Python 4,404 472 Updated Jan 22, 2026

✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output | Turn your audio into accurate text in an instant!

Python 3,165 297 Updated Nov 25, 2025

Using AI models to automatically provide commentary and edit videos with a single click.

Python 8,738 1,169 Updated Apr 8, 2026
MATLAB 17 2 Updated May 31, 2025

OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 7,617 1,910 Updated Jun 1, 2024