Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
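Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…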
🚀🚀 "Large models": 🌏 Train a 64M-parameter GPT from scratch in just 2h!
PyTorch Tutorial for Deep Learning Researchers
Graph Neural Network Library for PyTorch
Open standard for machine learning interoperability
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
Python Toolkit for Causal and Probabilistic Reasoning
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
PyTorch Geometric Temporal: Spatiotemporal Signal Processing with Neural Machine Learning Models (CIKM 2021)
The PyTorch-based audio source separation toolkit for researchers
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
A best practice for deep learning project template architecture.
Unofficial PyTorch implementation of Google AI's VoiceFilter system
In defence of metric learning for speaker recognition
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Open Source Tools for Speaker Recognition
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
End-to-end ASR/LM implementation with PyTorch
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'