maosth

Follow

maosth

Follow

3 followers · 7 following

Lists (4)

Sort

AI_todo

Daily Skill

Resource

Study

Starred repositories

docling-project / docling

Get your documents ready for gen AI

Python 47,278 3,325 Updated Dec 19, 2025

datalab-to / chandra

OCR model that handles complex tables, forms, handwriting with full layout.

Python 3,798 425 Updated Dec 19, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,502 1,923 Updated Oct 25, 2025

zai-org / GLM-TTS

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 742 89 Updated Dec 17, 2025

microsoft / call-center-ai

Send a phone call from AI agent, in an API call. Or, directly call the bot from the configured phone number!

Python 6,020 685 Updated Oct 27, 2025

lss233 / kirara-ai

🤖 可 DIY 的多模态 AI 聊天机器人 | 🚀 快速接入微信、 QQ、Telegram、等聊天平台 | 🦈支持DeepSeek、Grok、Claude、Ollama、Gemini、OpenAI | 工作流系统、网页搜索、AI画图、人设调教、虚拟女仆、语音对话 |

Python 17,659 1,787 Updated Jun 28, 2025

meituan-longcat / LongCat-Video

Python 1,519 198 Updated Dec 20, 2025

univa-agent / univa

Official Code Repo for UniVA: Universal Video Agents

TypeScript 256 38 Updated Nov 29, 2025

SkyworkAI / SkyReels-V1

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 2,595 294 Updated Mar 10, 2025

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,139 816 Updated Mar 5, 2025

zai-org / Open-AutoGLM

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 18,090 2,827 Updated Dec 19, 2025

yoyoung / zquant

ZQuant量化分析平台是一个功能完整的股票量化分析系统，基于 FastAPI 构建，提供数据服务、回测引擎、策略管理等功能，旨在为量化分析者提供从数据采集、策略开发、回测分析到结果管理的一站式解决方案。

Python 28 4 Updated Dec 13, 2025

TommyZihao / ChatTTS_Tutorials

Step-by-step Jupyter notebook tutorials for ChatTTS

Jupyter Notebook 172 30 Updated Jun 15, 2024

libukai / Awesome-ChatTTS

官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,827 110 Updated Jul 3, 2024

edtechre / pybroker

Algorithmic Trading in Python with Machine Learning

Python 2,972 386 Updated Dec 5, 2025

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,488 213 Updated Dec 16, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,675 153 Updated Sep 22, 2025

k2-fsa / ZipVoice

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 742 103 Updated Dec 2, 2025

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,296 1,427 Updated Jun 12, 2024

ruc-datalab / DeepAnalyze

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈你的AI数据分析师，自动分析大量数据，一键生成专业分析报告！

Python 3,244 478 Updated Dec 15, 2025

sugarforever / unsloth-tutorials

Jupyter Notebook 5 2 Updated May 9, 2025

lansinuote / Simple_Text_to_Speech

Python 21 2 Updated Mar 13, 2025

snakers4 / silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Jupyter Notebook 5,669 358 Updated Dec 5, 2025

nari-labs / dia2

TTS model capable of streaming conversational audio in realtime.

Python 920 77 Updated Nov 29, 2025

HKoon / ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

Python 459 62 Updated Nov 7, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,660 3,972 Updated Apr 19, 2025

neosun100 / supertonic-tts-enhanced

Enhanced Supertonic TTS with Docker, FastAPI, Web UI, and comprehensive API documentation

Python 13 Updated Dec 7, 2025

warmshao / ChatTTSPlus

Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment

Python 174 18 Updated Feb 9, 2025

Ksuriuri / index-tts-vllm

Added vLLM support to IndexTTS for faster inference.

Python 957 128 Updated Oct 24, 2025

asr-pub / index-tts-lora

High-quality speech synthesis with LoRA fine-tuning on index-tts, enhancing prosody and naturalness for single and multi-speaker voices.

Python 272 19 Updated Dec 19, 2025

Starred topics

Artificial Intelligence