Skip to content
View hermanprawiro's full-sized avatar

Highlights

  • Pro

Block or report hermanprawiro

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images - Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phone. Supports text-…

TypeScript 1,903 163 Updated Apr 30, 2026

React Native binding of llama.cpp

C++ 928 100 Updated Apr 17, 2026

TinyChatEngine: On-Device LLM Inference Library

C++ 952 97 Updated Jul 4, 2024

A metasearch library that aggregates results from diverse web search services

Python 2,579 249 Updated Apr 20, 2026

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,825 279 Updated Apr 27, 2026

Fork of the Triton language and compiler for Windows support and easy installation

MLIR 1,915 93 Updated Feb 18, 2026

Helper Project with Nvidia 50 Series support

33 2 Updated Sep 4, 2025

An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data

Rust 10,611 719 Updated Apr 30, 2026

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,520 1,837 Updated Apr 17, 2026

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,293 1,682 Updated Nov 19, 2025

fufufafa dan kearifan lokal-nya

559 78 Updated Apr 13, 2025

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python 4,935 510 Updated Apr 30, 2026

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,115 704 Updated Feb 10, 2025

🎦 Extract video hard subtitles and automatically generate corresponding srt files.

Python 498 60 Updated Sep 11, 2025

TTS with kokoro and onnx runtime

Python 2,510 268 Updated Jan 30, 2026

Tag manager and captioner for image datasets

Python 1,294 70 Updated Oct 11, 2025

Convert any PDF into a podcast episode!

Python 2,582 278 Updated Dec 7, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,707 2,163 Updated Apr 13, 2026
TypeScript 9 2 Updated Jul 26, 2024

DSPy: The framework for programming—not prompting—language models

Python 34,118 2,859 Updated Apr 30, 2026

The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.

Swift 2,215 136 Updated Mar 1, 2026

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

Python 150 10 Updated Jan 7, 2026

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,193 359 Updated Dec 30, 2025

SoftVC VITS Singing Voice Conversion

Python 28,052 5,067 Updated Nov 11, 2023

A feature-rich command-line audio/video downloader

Python 159,905 13,260 Updated Apr 30, 2026

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,099 4,685 Updated Aug 19, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 57,081 6,230 Updated Apr 30, 2026

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.

Python 859 130 Updated Feb 2, 2025
Next