Stars
AI-powered image description generator for Immich that analyzes photos with Ollama models and updates database metadata
Chris Titus Tech's Windows Utility - Install Programs, Tweaks, Fixes, and Updates
A free, open-source, and cross-platform iDevice management tool
Lightning-Fast, On-Device TTS — running natively via ONNX.
Uses a local language model to simulate Twitch chat
Video chat with Modal's mascots, Moe and Dal, about Modal and its documentation.
Open Source framework for voice and multimodal conversational AI
Extracts iPhone messages for use with LLMs.
A free, local, self-hosted video compressor web UI designed for performance and ease of use. Inspired by 8mb.video.
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via Twitter @Martin9…
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback…
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A high-throughput and memory-efficient inference and serving engine for LLMs
(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
Foligo is a comprehensive AI-powered platform that generates polished portfolio content from simple verbal descriptions. Go from idea to a live portfolio piece in minutes, just by talking. The plat…
A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM
Automate browser based workflows with AI
A bytebot variant that uses Holo 1.5 7b to control the desktop
zhound420 / bytebot-hawkeye
Forked from bytebot-ai/bytebot
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Agent S: an open agentic framework that uses computers like a human
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
A talking LLM that runs on your own computer without needing the internet.
Have a natural, spoken conversation with AI!
Liquid Audio - Speech-to-Speech audio models by Liquid AI
A real-time, fully local voice AI system optimized for low-resource devices like an 8GB Ubuntu laptop with no GPU, achieving sub-second STT-to-TTS latency using Ollama, Vosk, Piper, and JACK/PipeWi…