Stars
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
A high-throughput and memory-efficient inference and serving engine for LLMs
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A generative speech model for daily dialogue.
Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。
👾 Fast and simple video download library and CLI tool written in Go
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
A TTS model capable of generating ultra-realistic dialogue in one pass.
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
沉浸式双语网页翻译扩展 , 支持输入框翻译, 鼠标悬停翻译, PDF, Epub, 字幕文件, TXT 文件翻译 - Immersive Dual Web Page Translation Extension
Translate the video from one language to another and add dubbing.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
A Conversational Speech Generation Model
Ongoing research training transformer models at scale
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Foundational Models for State-of-the-Art Speech and Text Translation
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题