Skip to content
View hermanprawiro's full-sized avatar

Highlights

  • Pro

Block or report hermanprawiro

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TinyChatEngine: On-Device LLM Inference Library

C++ 940 94 Updated Jul 4, 2024

DDGS | Dux Distributed Global Search. A metasearch library that aggregates results from diverse web search services

Python 2,144 213 Updated Dec 19, 2025

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,748 271 Updated Feb 4, 2026

Fork of the Triton language and compiler for Windows support and easy installation

MLIR 1,830 94 Updated Feb 5, 2026

Helper Project with Nvidia 50 Series support

32 2 Updated Sep 4, 2025

An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data

Rust 10,149 645 Updated Feb 5, 2026

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,315 1,829 Updated Dec 23, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,084 1,665 Updated Nov 19, 2025

fufufafa dan kearifan lokal-nya

549 78 Updated Apr 13, 2025

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python 4,784 500 Updated Feb 5, 2026

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,075 701 Updated Feb 10, 2025

🎦 Extract video hard subtitles and automatically generate corresponding srt files.

Python 478 58 Updated Sep 11, 2025

TTS with kokoro and onnx runtime

Python 2,361 244 Updated Jan 30, 2026

Tag manager and captioner for image datasets

Python 1,259 65 Updated Oct 11, 2025

Convert any PDF into a podcast episode!

Python 2,561 283 Updated Dec 7, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,341 2,112 Updated Sep 12, 2025
TypeScript 9 2 Updated Jul 26, 2024

DSPy: The framework for programming—not prompting—language models

Python 32,017 2,607 Updated Feb 5, 2026

The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.

Swift 2,118 117 Updated Feb 4, 2026

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

Python 150 10 Updated Jan 7, 2026

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,213 367 Updated Dec 30, 2025

SoftVC VITS Singing Voice Conversion

Python 27,979 5,086 Updated Nov 11, 2023

A feature-rich command-line audio/video downloader

Python 145,934 11,824 Updated Feb 4, 2026

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,958 4,682 Updated Aug 19, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 54,759 5,989 Updated Dec 30, 2025

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.

Python 849 126 Updated Feb 2, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9,958 952 Updated Dec 12, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 123,033 17,367 Updated Feb 5, 2026
Next