Stars
Train transformer language models with reinforcement learning.
Open-source framework for the research and development of foundation models.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A beautiful, simple, clean, and responsive Jekyll theme for academics
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.
Your faithful, impartial partner for audio evaluation: know yourself and your rivals. Authentic evaluation; know yourself, know your rivals.
Align Anything: Training All-modality Model with Feedback
FSA/FST algorithms, differentiable, with PyTorch compatibility.
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…
A Framework for Speech, Language, Audio, Music Processing with Large Language Model
State-of-the-art pretrained music models for training, evaluation, inference
Speech Human Evaluation Estimation Toolkit (SHEET)
kaldi-asr/kaldi is the official location of the Kaldi project.
openslr-org / openslr
Forked from danpovey/openslr. Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
A simple library for Fréchet Audio Distance (FAD) calculation
Awesome speech/audio LLMs, representation learning, and codec models
Google Drive Public File Downloader when Curl/Wget Fails