Skip to content
View patrickkeenan's full-sized avatar

Block or report patrickkeenan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
25 stars written in Python
Clear filter

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,518 46,104 Updated Nov 6, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,426 11,325 Updated Sep 8, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 72,193 8,566 Updated Nov 5, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 55,479 5,557 Updated Nov 6, 2025

Real-time face swap for PC streaming or video calls

Python 29,996 966 Updated Nov 8, 2024

WebUI extension for ControlNet

Python 17,834 2,023 Updated Aug 12, 2024

Automate browser based workflows with AI

Python 16,593 1,410 Updated Nov 6, 2025

SoTA open-source TTS

Python 14,437 1,944 Updated Sep 25, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,073 828 Updated Nov 3, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 8,865 749 Updated Jul 11, 2025

Open Source framework for voice and multimodal conversational AI

Python 8,733 1,412 Updated Nov 6, 2025

The #1 open-source voice interface for desktop, mobile, and ESP32 chips.

Python 5,094 539 Updated Nov 1, 2024

Converts text to speech in realtime

Python 3,608 348 Updated Jul 22, 2025

Design circuit boards with code! ✨ Get software-like design reuse 🚀, validation, version control and collaboration in hardware; starting with electronics ⚡️

Python 2,555 151 Updated Nov 6, 2025

A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!

Python 1,853 295 Updated May 30, 2024

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

Python 899 38 Updated Nov 25, 2023

Examples for Cerebrium Serverless GPUs

Python 512 74 Updated Oct 30, 2025

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

Python 478 52 Updated Jun 25, 2025

HTTP service wrapper for BASNet: Boundary-Aware Salient Object Detection

Python 450 60 Updated Nov 22, 2022

Project an image centroid to another image using OpenCV

Python 442 52 Updated Oct 12, 2021

A high-level programming language for using computer vision.

Python 344 18 Updated Apr 11, 2024

A python program to detect and classify hand pose using deep learning techniques

Python 248 67 Updated Mar 24, 2023

Public web scraping scripts for the University of Toronto.

Python 51 14 Updated May 16, 2020

Python BLE Server for RPi Accepts setting wifi (SSID) via bluetooth

Python 46 13 Updated Mar 22, 2025

A pipecat bot demo implementation of a Spotify assistant for creating playlists

Python 18 2 Updated Oct 14, 2025