The purpose of mastery is application, and the extent of application should do justice to one's talent.

A library to manipulate font files from Python.

Python 5,100 517 Updated May 14, 2026

AI-automated battling for Roco Kingdom: World (洛克王国世界)

Python 147 16 Updated Apr 20, 2026

[ICLR 2025] Binary Spherical Quantization + [CVPR 2026] Leech Spherical Quantization

Python 210 7 Updated Dec 18, 2025
Swift 1 Updated May 5, 2026

[ICLR 2026] Official implementation of Toward Complex-Valued Neural Networks for Waveform Generation

Python 17 1 Updated Apr 10, 2026

The repo provides information about KeSpeech dataset.

177 12 Updated Oct 13, 2022

Large, modern dataset for speech recognition

Shell 726 66 Updated Feb 26, 2024

A Large-scale Cantonese Speech Corpus with Multi-dimensional Annotation

Python 311 17 Updated Feb 5, 2026

Official repository for the WenetSpeech-Chuan dataset.

Python 185 6 Updated Feb 5, 2026

A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations

Python 143 3 Updated Feb 6, 2026

List of papers studying machine learning through the lens of category theory

Python 1,510 101 Updated Apr 17, 2026

Helper for managing arXiv papers in Zotero

TypeScript 347 6 Updated May 9, 2026

OLaPh (Optimal Language Phonemizer) is a multilingual phonemization framework that converts text into phonemes, surpassing the quality of comparable frameworks.

Python 17 2 Updated May 13, 2026
Python 6 Updated Apr 7, 2026

Comparison of Python audio resampling implementations

Jupyter Notebook 2 Updated Dec 1, 2023

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"

Python 65 6 Updated May 19, 2023

The Smallest English TTS Model with only 1M parameters

Python 381 35 Updated Apr 10, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 3,049 263 Updated May 6, 2026

A library to analyze PyTorch traces.

Python 517 92 Updated May 13, 2026

Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative modeling.

Python 98 12 Updated Apr 3, 2026

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 764 45 Updated Nov 19, 2024

dLLM: Simple Diffusion Language Modeling

Python 2,493 260 Updated Apr 15, 2026

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 976 143 Updated Dec 2, 2025

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Python 760 83 Updated Apr 23, 2026

High-Quality Voice Cloning TTS for 600+ Languages

Python 5,974 857 Updated May 6, 2026

RWKV-LM-V7 (https://github.com/BlinkDL/RWKV-LM) under the Lightning framework

HIP 59 13 Updated May 13, 2026

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,568 128 Updated Mar 23, 2025

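MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and so on.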

4,193 289 Updated May 5, 2026

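An authentic, unaltered Claude Code that can be run, built, and debugged; production-grade engineering, enterprise-grade reliability; safe and malware-free, with memory leaks fixed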

TypeScript 18,263 15,810 Updated May 14, 2026