R1im

Sponsoring

@protomaps


An MCP server plus a CLI tool that indexes local code into a graph database to provide context to AI assistants.

Python 2,901 530 Updated Apr 9, 2026

Generate map templates for Farming Simulator from real places.

Python 139 51 Updated Apr 3, 2026

This app can now use Android, just like a human.

Kotlin 895 127 Updated Jan 13, 2026

Make your meetings accessible to AI Agents

Python 484 77 Updated Mar 19, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 19,939 2,449 Updated Mar 16, 2026
TypeScript 1,329 100 Updated Apr 10, 2026

[ICCV 2025] UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding.

Python 76 7 Updated Feb 28, 2026

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

752 28 Updated Dec 8, 2025

Generate audiobooks from e-books, voice cloning & 1158+ languages!

Python 18,654 1,530 Updated Apr 10, 2026

Upscales Video 2x or 4x using AI

Python 122 11 Updated Mar 16, 2024

A consolidation of various compiled open-source AI image/video upscaling products into a working, CLI-friendly image and video upscaling program.

Shell 480 39 Updated May 8, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,016 6,032 Updated Aug 16, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,214 4,032 Updated Apr 19, 2025

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 20,260 1,891 Updated Apr 8, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.

Python 60,952 5,255 Updated Apr 11, 2026

SoTA open-source TTS

Python 24,247 3,226 Updated Mar 26, 2026

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 853 58 Updated Sep 8, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 12,160 1,143 Updated Nov 5, 2025

This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support

TypeScript 226 20 Updated Mar 2, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,797 481 Updated Oct 27, 2025

Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.

Jupyter Notebook 29 4 Updated May 7, 2025

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you c…

Rust 401 20 Updated Aug 20, 2025

A powerful MCP toolkit for coding, providing semantic retrieval and editing capabilities - the IDE for your agent

Python 22,760 1,527 Updated Apr 10, 2026

video description generation vision-language model

Python 21 3 Updated Jan 21, 2025

This Windows batch script helps set up a Mingw-w64 compiler environment for building ffmpeg and other media tools under Windows.

Shell 1,770 288 Updated Apr 7, 2026

Scripts to build a trimmed-down Windows 11 image.

PowerShell 18,350 1,420 Updated Sep 12, 2025

Automatically convert epubs to audiobooks

Python 259 19 Updated Mar 8, 2025

A real-time silent speech recognition tool.

Python 721 82 Updated Nov 2, 2025

Auto-AVSR: Lip-Reading Sentences Project

Python 411 74 Updated Jan 8, 2025