Skip to content
View WWWWWLI's full-sized avatar

Block or report WWWWWLI

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 156,429 32,051 Updated Feb 12, 2026

a.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

HTML 145,155 19,164 Updated Feb 12, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,535 11,750 Updated Dec 15, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,227 8,180 Updated Feb 12, 2026

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06

JavaScript 57,148 15,979 Updated Jun 26, 2024

A generative speech model for daily dialogue.

Python 38,699 4,207 Updated Jan 18, 2026

深度学习经典、新论文逐段精读

32,562 2,776 Updated Mar 22, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,078 2,079 Updated Feb 2, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,648 1,203 Updated Feb 11, 2026

Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…

Python 11,078 1,279 Updated Feb 12, 2026

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 9,713 1,460 Updated Jan 5, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,689 790 Updated May 27, 2025

基于 OpenAI API 的文本翻译、文本润色、语法纠错 Bob 插件,让我们一起迎接不需要巴别塔的新时代!Licensed under CC BY-NC-SA 4.0

TypeScript 5,656 264 Updated Feb 12, 2026

PyTorch deep learning projects made easy.

Python 5,092 1,106 Updated Jun 4, 2024

A data augmentations library for audio, image, text, and video.

Python 5,067 310 Updated Feb 12, 2026

Scalable and user friendly neural 🧠 forecasting algorithms.

Python 3,968 479 Updated Feb 12, 2026

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

Python 2,938 434 Updated Dec 3, 2025

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python 2,421 617 Updated Oct 20, 2021

PyTorch implementation of adversarial attacks [torchattacks]

Python 2,141 368 Updated Jun 29, 2024

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,789 479 Updated Feb 4, 2026

FinRL­®-Meta: Dynamic datasets and market environments for FinRL.

Python 1,786 735 Updated Jan 19, 2026

A list of tools, papers and code related to Deepfake Detection.

1,677 149 Updated Sep 2, 2025

SALMONN family: A suite of advanced multi-modal LLMs

1,390 112 Updated Feb 3, 2026

SincNet is a neural architecture for efficiently processing raw audio samples.

Python 1,232 271 Updated Apr 28, 2021

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,197 179 Updated Feb 11, 2026

In defence of metric learning for speaker recognition

Python 1,161 287 Updated Mar 26, 2024

Audio processing by using pytorch 1D convolution network

Python 1,118 97 Updated Dec 7, 2025

VideoX: a collection of video cross-modal models

Python 1,061 165 Updated Jun 3, 2024

A comprehensive benchmark of deepfake detection

Python 977 160 Updated Aug 20, 2025
Next