Skip to content
View liumingxiy's full-sized avatar

Block or report liumingxiy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
22 stars written in Python
Clear filter

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 146,311 8,659 Updated Mar 27, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,737 11,930 Updated Mar 27, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 73,194 10,042 Updated Mar 26, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 56,165 6,134 Updated Feb 9, 2026

Ultralytics YOLO 🚀

Python 55,088 10,589 Updated Mar 27, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 49,960 5,960 Updated Mar 27, 2026

Easily train a good VC model with voice data <= 10 mins!

Python 34,991 4,945 Updated Nov 24, 2024

Contexts Optical Compression

Python 22,756 2,093 Updated Jan 27, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,268 2,303 Updated Mar 16, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 19,626 2,418 Updated Mar 16, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,891 1,806 Updated Mar 17, 2026

Android in docker solution with noVNC supported and video recording

Python 14,432 1,648 Updated Mar 26, 2026

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,199 820 Updated Mar 5, 2025

Simple Online Realtime Tracking with a Deep Association Metric

Python 6,103 1,566 Updated Mar 2, 2025

Towards Human-Sounding Speech

Python 6,038 513 Updated Dec 5, 2025

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Python 5,366 533 Updated Sep 10, 2025

使用盲水印保护创作者的知识产权using invisible watermark to protect creator's intellectual property

Python 1,622 194 Updated Aug 30, 2024

Yuan 2.0 Large Language Model

Python 689 84 Updated Jul 11, 2024

[CVPR2026]🚀🚀🚀Official code for the paper "YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection." *(YOLO = You Only Look Once)* 🔥🔥🔥

Python 450 50 Updated Mar 9, 2026
Python 246 11 Updated Mar 4, 2026

follow my CSDN:https://blog.csdn.net/u012465304

Python 22 9 Updated Aug 6, 2018