Skip to content
View JMS1717's full-sized avatar

Highlights

  • Pro

Block or report JMS1717

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AI-powered image description generator for Immich that analyzes photos with Ollama models and updates database metadata

Rust 3 Updated Dec 17, 2025

Chris Titus Tech's Windows Utility - Install Programs, Tweaks, Fixes, and Updates

PowerShell 44,193 2,342 Updated Dec 18, 2025

A free, open-source, and cross-platform iDevice management tool

C++ 2,067 82 Updated Dec 16, 2025

Lightning-Fast, On-Device TTS — running natively via ONNX.

JavaScript 1,848 166 Updated Dec 15, 2025

Uses a local language model to simulate Twitch chat

Python 75 4 Updated Nov 12, 2025

Video chat with Modal's mascots, Moe and Dal, about Modal and its documentation.

Python 50 13 Updated Dec 9, 2025

Open Source framework for voice and multimodal conversational AI

Python 9,425 1,533 Updated Dec 18, 2025

Extracts iPhone messages for use with LLMs.

Python 1 Updated Oct 20, 2025

a free local self hosted video compressor webui designed for performance and ease of use. inspired by 8mb.video

Python 519 23 Updated Dec 15, 2025

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…

Python 24,103 2,644 Updated Nov 15, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,185 831 Updated Nov 20, 2025

Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback…

TypeScript 271 51 Updated Apr 14, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,164 16,634 Updated Dec 16, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,725 12,047 Updated Dec 18, 2025

(NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis

Python 115 6 Updated Nov 8, 2025

Foligo is a comprehensive AI-powered platform that generates polished portfolio content from simple verbal descriptions. Go from idea to a live portfolio piece in minutes, just by talking. The plat…

Vue 2 Updated Nov 27, 2025

A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM

Python 73 12 Updated May 19, 2025

Automate browser based workflows with AI

Python 19,815 1,727 Updated Dec 18, 2025

A bytebot variant that uses Holo 1.5 7b to control the desktop

TypeScript 20 5 Updated Nov 4, 2025

Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.

TypeScript 26 4 Updated Oct 15, 2025

Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.

TypeScript 9,986 1,280 Updated Sep 12, 2025

🖥️ Run AI Agent in your browser.

Python 15,332 2,660 Updated Aug 31, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 73,899 8,835 Updated Dec 18, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,129 190 Updated Oct 9, 2025

Agent S: an open agentic framework that uses computers like a human

Python 8,867 988 Updated Dec 16, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,257 1,446 Updated Nov 28, 2025

A talking LLM that runs on your own computer without needing the internet.

Python 736 148 Updated Oct 20, 2025

Have a natural, spoken conversation with AI!

Python 3,409 382 Updated Jul 11, 2025

Liquid Audio - Speech-to-Speech audio models by Liquid AI

Python 298 44 Updated Sep 30, 2025

A real-time, fully local voice AI system optimized for low-resource devices like an 8GB Ubuntu laptop with no GPU, achieving sub-second STT-to-TTS latency using Ollama, Vosk, Piper, and JACK/PipeWi…

Python 14 2 Updated Jul 21, 2025
Next