- Indonesia
Highlights
- Pro
Stars
TinyChatEngine: On-Device LLM Inference Library
DDGS | Dux Distributed Global Search. A metasearch library that aggregates results from diverse web search services
A general fine-tuning kit geared toward image/video/audio diffusion models.
woct0rdho / triton-windows
Forked from triton-lang/tritonFork of the Triton language and compiler for Windows support and easy installation
Helper Project with Nvidia 50 Series support
An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A TTS model capable of generating ultra-realistic dialogue in one pass.
fufufafa dan kearifan lokal-nya
Open Source Machine Learning Research Platform designed for frontier AI/ML workflows. Local, on-prem, or in the cloud. Open source.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
🎦 Extract video hard subtitles and automatically generate corresponding srt files.
Convert any PDF into a podcast episode!
A simple screen parsing tool towards pure vision based GUI agent
DSPy: The framework for programming—not prompting—language models
The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
SoftVC VITS Singing Voice Conversion
A feature-rich command-line audio/video downloader
🔊 Text-Prompted Generative Audio Model
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)