Stars
Sourcetrail - free and open-source interactive source explorer
Lets make video diffusion practical!
Transcription from mp3 files to html with or without embedded player
Multilingual automatic speech recognition (ASR) with speaker segmentation (SS) / speaker diarization (SD) and word-level timestamps (WLT)
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
An unprofessional open-source Chinese font derived from Fontworks' Klee One. 一款非专业的开源中文字体,基于 FONTWORKS 出品字体 Klee One 衍生。
ChID: A Large-scale Chinese IDiom Dataset for Cloze Test
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | 中文个性情感对话数据集
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
Task manager with Todoist, Nextcloud & CalDAV support designed for GNOME
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Translation for Trilium Notes. Trilium Notes 中文适配, 体验优化
faster_whisper GUI with PySide6
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
计算机专业课(408)思维导图和笔记:计算机组成原理(第五版 王爱英),数据结构(王道),计算机网络(第七版 谢希仁),操作系统(第四版 汤小丹)
Build Conversational AI in minutes ⚡️
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Based QLabel, create a waveform monitor for Qt, Support for mouse wheel control chart scaling.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.
100+ Chinese Word Vectors 上百种预训练中文词向量