hermanprawiro
TinyChatEngine: On-Device LLM Inference Library

C++ 932 94 Updated Jul 4, 2024

DDGS | Dux Distributed Global Search. A metasearch library that aggregates results from diverse web search services

Python 2,027 196 Updated Dec 19, 2025

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,669 264 Updated Dec 21, 2025

Fork of the Triton language and compiler for Windows support and easy installation

MLIR 1,678 93 Updated Dec 15, 2025

Helper Project with Nvidia 50 Series support

28 2 Updated Sep 4, 2025

An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data

Rust 9,800 597 Updated Dec 21, 2025

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,192 1,810 Updated Dec 1, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,985 1,651 Updated Nov 19, 2025

fufufafa and its local wisdom

542 77 Updated Apr 13, 2025

Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source.

Python 4,597 470 Updated Dec 21, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,038 701 Updated Feb 10, 2025

🎦 Extract video hard subtitles and automatically generate corresponding srt files.

Python 468 57 Updated Sep 11, 2025

TTS with kokoro and onnx runtime

Python 2,302 235 Updated Jun 20, 2025

Tag manager and captioner for image datasets

Python 1,213 60 Updated Oct 11, 2025

Convert any PDF into a podcast episode!

Python 2,535 284 Updated Dec 7, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,042 2,059 Updated Sep 12, 2025
TypeScript 9 2 Updated Jul 26, 2024

DSPy: The framework for programming—not prompting—language models

Python 30,933 2,488 Updated Dec 21, 2025

The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.

Swift 2,054 116 Updated Dec 16, 2025

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

Python 150 10 Updated Feb 20, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,219 368 Updated Sep 11, 2025

SoftVC VITS Singing Voice Conversion

Python 27,867 5,077 Updated Nov 11, 2023

A feature-rich command-line audio/video downloader

Python 139,005 11,223 Updated Dec 20, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,846 4,675 Updated Aug 19, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 53,351 5,839 Updated Dec 19, 2025

An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.

Python 838 124 Updated Feb 2, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9,629 915 Updated Dec 12, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 118,369 16,676 Updated Dec 21, 2025