Stars
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
一个简单好用的 Word 文档(.docx/.doc)转 Markdown 工具,支持图片、公式(LaTeX)、表格与批量转换。提供图形界面与一键可执行程序,开箱即用。
A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words from books, articles, or videos, and revisit them through in…
携程评论爬虫,使用线程池来爬取热门景区评论,简单易用。一键爬取任意省的所有热门景区并分析评论数据,可视化展示。
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
Mirror of redmine code source - Official Subversion repository is at https://svn.redmine.org/redmine - contact: @vividtone or maeda (at) farend (dot) jp
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
A library of shared system prompts for creating customized educational GPT agents.
🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.
Vibe Workflow Platform for Non-technical Creators.
Unified framework for building enterprise RAG pipelines with small, specialized models
No fortress, purely open ground. OpenManus is Coming.
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Task-Aware Agent-driven Prompt Optimization Framework
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
A framework to enable autonomous android and computer use using any LLM (local or remote)
An alternative, self-hosted solution that allows you to continue using Snap Camera with all Snapchat filters after its shutdown on January 25, 2023.
Memory-Guided Diffusion for Expressive Talking Video Generation
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
A lightweight next-gen data explorer - Postgres, MySQL, SQLite, MongoDB, Redis, MariaDB, Elastic Search, and Clickhouse with Chat interface
3D-printed open-source humanoid robot platform for sim-to-real and RL
first base model for full-duplex conversational audio
A browser extension for automating your browser by connecting blocks