Skip to content
View Daisyqk's full-sized avatar

Block or report Daisyqk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A feature-rich command-line audio/video downloader

Python 134,028 10,764 Updated Nov 5, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,443 11,325 Updated Sep 8, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,320 11,076 Updated Nov 6, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,090 5,707 Updated Sep 10, 2025

🧡 Everything is RSSible

TypeScript 39,689 8,697 Updated Nov 6, 2025

A generative speech model for daily dialogue.

Python 38,108 4,133 Updated Jul 6, 2025

Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。

Python 34,043 2,423 Updated Oct 10, 2025

👾 Fast and simple video download library and CLI tool written in Go

Go 30,590 3,219 Updated Sep 15, 2025

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

29,655 3,376 Updated Sep 30, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 25,738 2,588 Updated Nov 4, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,271 1,762 Updated Oct 13, 2025

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,359 3,427 Updated Oct 28, 2025

SOTA Open Source TTS

Python 23,994 1,957 Updated Nov 6, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,815 2,665 Updated Jul 3, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 19,993 2,084 Updated Nov 5, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,102 1,949 Updated Apr 4, 2024

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,766 1,629 Updated Jul 6, 2025

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,279 4,643 Updated Jun 21, 2022

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,150 1,874 Updated Oct 21, 2025

沉浸式双语网页翻译扩展 , 支持输入框翻译, 鼠标悬停翻译, PDF, Epub, 字幕文件, TXT 文件翻译 - Immersive Dual Web Page Translation Extension

16,443 942 Updated Nov 5, 2025

Mamba SSM architecture

Python 16,344 1,481 Updated Oct 10, 2025

Translate the video from one language to another and add dubbing.

Python 15,108 1,765 Updated Nov 4, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 14,860 1,677 Updated Oct 30, 2025

A Conversational Speech Generation Model

Python 14,256 1,426 Updated May 27, 2025

Ongoing research training transformer models at scale

Python 14,108 3,247 Updated Nov 6, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,351 1,352 Updated Oct 1, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,903 1,347 Updated Nov 5, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,689 1,163 Updated Nov 14, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 10,718 1,094 Updated Apr 30, 2025
Next