View anhlbt's full-sized avatar

  • PaddleOCR Public

    Python Apache License 2.0 Updated Dec 15, 2025
  • Amphion Public

    Forked from open-mmlab/Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

    Python MIT License Updated Dec 5, 2025
  • BabelDOC Public

    Forked from funstory-ai/BabelDOC

    Yet Another Document Translator

    Python GNU Affero General Public License v3.0 Updated Dec 1, 2025
  • A topic-centric list of high-quality open datasets in public domains. By everyone, for everyone!

    MIT License Updated Nov 29, 2025
  • PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

    Python Apache License 2.0 Updated Nov 18, 2025
  • ART Public

    Forked from OpenPipe/ART

    Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, Kimi, and more!

    Python Apache License 2.0 Updated Aug 12, 2025
  • dots.ocr Public

    Forked from rednote-hilab/dots.ocr

    Multilingual Document Layout Parsing in a Single Vision-Language Model

    Python MIT License Updated Aug 12, 2025
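  • Convert audio and video with one click into documents in various styles, such as Xiaohongshu posts, WeChat Official Account articles, knowledge notes, mind maps, and video subtitles.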

    Vue MIT License Updated Jul 23, 2025
  • olmocr Public

    Forked from allenai/olmocr

    Toolkit for linearizing PDFs for LLM datasets/training

    Python Apache License 2.0 Updated Jul 14, 2025
  • AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

    MIT License Updated Jul 8, 2025
  • fonts Public

    Forked from google/fonts

    Font files available from Google Fonts, and a public issue tracker for all things Google Fonts

    HTML Updated Jul 7, 2025
  • fixdesktop Public

    Shell Updated Jul 7, 2025
  • ai_wiki Public

    Forked from charliedream1/ai_wiki

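    "AI Full Stack: A Collection of the Best Resources from Across the Web": gathers high-quality resources from across the web, records solution strategies and key points for engineering-practice problems, shares practical case studies, tracks frontier technology developments, and covers full-stack AI knowledge, including large models, programming techniques, machine learning, deep learning, reinforcement learning, graph neural networks, speech recognition, NLP, and image recognition.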

    Jupyter Notebook Updated Jul 6, 2025
  • RAG-Anything Public

    Forked from HKUDS/RAG-Anything

    "RAG-Anything: All-in-One RAG System"

    Python MIT License Updated Jul 5, 2025
  • ACE-Step Public

    Forked from ace-step/ACE-Step

    ACE-Step: A Step Towards Music Generation Foundation Model

    Python Apache License 2.0 Updated Jun 27, 2025
  • docling Public

    Forked from docling-project/docling

    Get your documents ready for gen AI

    Python MIT License Updated Jun 25, 2025
  • GPT-SoVITS Public

    Forked from RVC-Boss/GPT-SoVITS

    Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)

    Python MIT License Updated Jun 15, 2025
  • nerd-fonts Public

    Forked from ryanoasis/nerd-fonts

    Iconic font aggregator, collection, & patcher. 3,600+ icons, 50+ patched fonts: Hack, Source Code Pro, more. Glyph collections: Font Awesome, Material Design Icons, Octicons, & more

    CSS Other Updated Jun 13, 2025
  • 🍦 Speech-AI-Forge is a project developed around TTS generation models, implementing an API server and a Gradio-based WebUI.

    Python GNU Affero General Public License v3.0 Updated Jun 7, 2025
  • YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

    Python Apache License 2.0 Updated Jun 4, 2025
  • Real-time face swap and one-click video deepfake with only a single image

    Python GNU Affero General Public License v3.0 Updated Jun 3, 2025
  • opik Public

    Forked from comet-ml/opik

    Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

    Python Apache License 2.0 Updated Jun 3, 2025
  • F5R-TTS Public

    Forked from FrontierLabs/F5R-TTS

    Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"

    Python MIT License Updated Jun 3, 2025
  • agenticSeek Public

    Forked from Fosowl/agenticSeek

    Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via Twitter @Martin9…

    Python GNU General Public License v3.0 Updated Jun 3, 2025
  • WeClone Public

    Forked from xming521/WeClone

    🚀 One-stop solution for creating your digital avatar from chat logs 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. From chat…

    Python GNU Affero General Public License v3.0 Updated Jun 2, 2025
  • CosyVoice Public

    Forked from FunAudioLLM/CosyVoice

    Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

    Python Apache License 2.0 Updated Jun 2, 2025
  • ragflow Public

    Forked from infiniflow/ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

    TypeScript Apache License 2.0 Updated May 31, 2025
  • Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…

    Python MIT License Updated May 31, 2025
  • Qwen-Agent Public

    Forked from QwenLM/Qwen-Agent

    Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

    Python Apache License 2.0 Updated May 29, 2025
  • :octocat: Share interesting, entry-level open source projects on GitHub.

    Python Updated May 28, 2025