Skip to content
View KoljaB's full-sized avatar

Block or report KoljaB

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 5,844 968 Updated Mar 29, 2026

Awesome AI Memory | LLM Memory | A curated knowledge base on AI memory for LLMs and agents, covering long-term memory, reasoning, retrieval, and memory-native system design. Awesome-AI-Memory 是一个 集…

Python 898 83 Updated May 20, 2026

26m function call model that runs on incredibly small devices

Python 2,327 149 Updated May 16, 2026
Python 312 30 Updated Jan 2, 2026

🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life.

Python 17,922 1,528 Updated May 13, 2026

Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms

Python 7,756 1,011 Updated May 15, 2026

Shannon Lite is an autonomous, white-box AI pentester for web applications and APIs. It analyzes your source code, identifies attack vectors, and executes real exploits to prove vulnerabilities bef…

TypeScript 43,322 4,955 Updated May 19, 2026

Converts text to speech in realtime

Python 3,921 392 Updated May 10, 2026

Model Context Protocol Servers

TypeScript 85,979 10,764 Updated May 17, 2026

Have a natural, spoken conversation with AI!

Python 12 4 Updated Aug 23, 2025

[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.

Python 210 14 Updated Dec 9, 2025

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 976 142 Updated Dec 2, 2025

SoTA open-source TTS

Python 137 26 Updated Jun 7, 2025

Streaming and Fine-tuning for Chatterbox TTS

Python 283 55 Updated Jun 15, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,793 835 Updated May 10, 2026

Have a natural, spoken conversation with AI!

Python 3,733 439 Updated Jul 11, 2025

Roomey is a multi-purpose Voice Agent designed to run your personal and business life.

Python 65 10 Updated Jun 15, 2025

A real-time silent speech recognition tool.

Python 733 84 Updated Nov 2, 2025

Make Qwen3 Think like Gemini 2.5 Pro | Open webui function

Python 25 1 Updated May 10, 2025

Build Real-Time Knowledge Graphs for AI Agents

Python 26,293 2,616 Updated May 14, 2026

Run Orpheus 3B Locally With LM Studio

Python 541 116 Updated Mar 20, 2025

Towards Human-Sounding Speech

Python 6,148 523 Updated Dec 5, 2025

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 892 37 Updated Dec 10, 2025

A generative speech model for daily dialogue.

Python 39,293 4,260 Updated Apr 10, 2026

Multilingual Voice Understanding Model

Python 8,184 744 Updated May 19, 2026

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,870 241 Updated Jul 22, 2025

A pattern for an always on AI Assistant powered by Deepseek-V3, RealtimeSTT, and Typer for engineering

Python 988 216 Updated Jan 12, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 28,243 2,570 Updated Sep 30, 2025
Next