Starred repositories
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
very good whiteboard SDK / infinite canvas SDK
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
The most advanced Nano Banana image generator and editor application. Your central hub for AI image generation and revisions. Intuitive UI features reference images, editing with image masks, versi…
faster_whisper GUI with PySide6
from Google AI Studio
一键生成产品营销与泛内容短视频,AI批量自动剪辑,高颜值跨平台桌面端工具 One click generation of product marketing and general content short videos, AI batch automatic cliping, beautiful cross platform desktop tool
A MapBoxGL and D3 web mapping tool for exploring the dynamic population of Manhattan.
A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on addi…
This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
可以实现按下 Option 按钮开始录制,抬起按钮就结束录制,并调用 Groq Whisper Large V3 Turbo 模型进行转译,由于 Groq 的速度非常快,所以大部分的语音输入都可以在 1-2s 内反馈。并且得益于 whisper 的强大能力,转译效果非常不错。
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
ゲームの字幕にルビ(ふりがな)を表示するためにフォントにルビを埋め込むプログラム
Simultaneous speech-to-text model
一个使用Flutter开发,支持诸多云平台AI大模型API调用的智能工作生活助手应用。除了常规大模型应用,还有极简记账、随机菜品、猫狗之家、waifu图片、MAL动漫排行、BGM动漫资讯、饮食健康等生活日常工具。
Take notes with your voice & transform them with AI
🚀 The open-source alternative to Twilio.
Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)
A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words from books, articles, or videos, and revisit them through in…
The open-source CapCut alternative
LiYing is an automated photo processing program designed for automating the post-processing workflow of ID photos in general photo studios. | LiYing 是一套适用于自动化 完成一般照相馆后期证件照处理流程的照片自动处理的程序。
Multilingual Voice Understanding Model
A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.