Skip to content
View honggaoyao's full-sized avatar

Block or report honggaoyao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🎯 哔哩哔哩(bilibili)评论区数据可视化分析软件-- up主可用于指导自己的题材选择,明确自己的粉丝群体

Python 453 69 Updated Jun 27, 2025

Bravura music font, reference font for SMuFL (Standard Music Font Layout)

Inno Setup 396 43 Updated May 8, 2022

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 75,881 10,255 Updated Apr 16, 2026

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,907 5,228 Updated Mar 3, 2026

The New Yorker (azw3/epub/pdf)

152 15 Updated Apr 13, 2026

Download sheet music

TypeScript 2,839 116 Updated Apr 11, 2026

⚠️ This repo has moved to https://github.com/LibreScore/dl-librescore ⚠️ | Download sheet music (MSCZ, PDF, MusicXML, MIDI, MP3, download individual parts as PDF) from musescore.com for free, no lo…

TypeScript 2,756 194 Updated Feb 28, 2023

A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

Jupyter Notebook 262 35 Updated Apr 12, 2026

Generative models for conditional audio generation

Python 3,671 440 Updated Feb 14, 2026

Web video downloader for Bilibili, iQIYI, Tencent Video, MGTV and WeTV. 网站视频下载器,主要支持Bilibili、爱奇艺、腾讯视频、芒果TV、WeTV、愛奇藝台灣站。

Python 1,346 269 Updated Jan 16, 2024

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,593 455 Updated Jan 5, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,099 6,039 Updated Aug 16, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,774 6,198 Updated Apr 18, 2026

免费AI去水印在线工具汇总:一键去除图片和视频水印

608 36 Updated Apr 15, 2025

Latest generation of Audiveris OMR engine

Java 2,428 364 Updated Apr 18, 2026

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 36,954 7,843 Updated Apr 13, 2026

Plainchant Analyser tool for MEI Neumes (PAM)

Svelte 3 1 Updated Apr 5, 2026

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 4,523 632 Updated Jul 15, 2025

Mamba SSM architecture

Python 18,006 1,699 Updated Apr 16, 2026

Deezer source separation library including pretrained models.

Python 28,173 3,065 Updated Apr 2, 2025

A list of tools, papers and code related to Fake Audio Detection.

Python 244 12 Updated Apr 10, 2026

A curated list of awesome article, tutorial, library, webpage, etc.

191 10 Updated Jul 10, 2023

An "awesome music theory" kinda wiki with books, resources and courses for studying everything about music and sound

2,178 81 Updated Feb 11, 2026

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Python 4,429 475 Updated Jan 22, 2026

✨ AsrTools: Smart Voice-to-Text Tool | Efficient Batch Processing | User-Friendly Interface | No GPU Required | Supports SRT/TXT Output | Turn your audio into accurate text in an instant!

Python 3,177 298 Updated Nov 25, 2025

利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.

Python 8,846 1,189 Updated Apr 8, 2026
MATLAB 17 2 Updated May 31, 2025

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 7,626 1,912 Updated Jun 1, 2024
Next