An MCP server plus a CLI tool that indexes local code into a graph database to provide context to AI assistants.

Python 2,642 490 Updated Mar 24, 2026

Generate map templates for Farming Simulator from real places.

Python 136 51 Updated Mar 27, 2026

This app can now use Android, just like a human.

Kotlin 886 124 Updated Jan 13, 2026

Make your meetings accessible to AI Agents

Python 475 73 Updated Mar 19, 2026

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 19,627 2,418 Updated Mar 16, 2026
TypeScript 1,332 102 Updated Mar 27, 2026

[ICCV 2025] UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding.

Python 72 7 Updated Feb 28, 2026

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

740 28 Updated Dec 8, 2025

Generate audiobooks from e-books, voice cloning & 1158+ languages!

Python 18,575 1,523 Updated Mar 10, 2026

Upscales Video 2x or 4x using AI

Python 121 11 Updated Mar 16, 2024

A consolidation of various compiled open-source AI image/video upscaling products into a working, CLI-friendly image and video upscaling program.

Shell 478 40 Updated May 8, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,923 6,015 Updated Aug 16, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,162 4,041 Updated Apr 19, 2025

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 15,484 1,365 Updated Mar 27, 2026

Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.

Python 58,425 4,933 Updated Mar 27, 2026

SoTA open-source TTS

Python 24,004 3,183 Updated Mar 26, 2026

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 848 57 Updated Sep 8, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 12,069 1,134 Updated Nov 5, 2025

This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support

TypeScript 227 20 Updated Mar 2, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,747 476 Updated Oct 27, 2025

Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.

Jupyter Notebook 29 4 Updated May 7, 2025

Groundhog's primary purpose is to teach people how Cursor and all these other coding agents work under the hood. If you understand how these coding assistants work from first principles, then you c…

Rust 399 21 Updated Aug 20, 2025

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Python 22,181 1,481 Updated Mar 27, 2026

video description generation vision-language model

Python 21 3 Updated Jan 21, 2025

This Windows batch script helps set up a Mingw-w64 compiler environment for building ffmpeg and other media tools under Windows.

Shell 1,766 285 Updated Mar 10, 2026

Scripts to build a trimmed-down Windows 11 image.

PowerShell 18,228 1,410 Updated Sep 12, 2025

Automatically convert epubs to audiobooks

Python 259 19 Updated Mar 8, 2025

A real-time silent speech recognition tool.

Python 718 79 Updated Nov 2, 2025

Auto-AVSR: Lip-Reading Sentences Project

Python 409 75 Updated Jan 8, 2025