- Indonesia
Stars
The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images - Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phone. Supports text-…
TinyChatEngine: On-Device LLM Inference Library
A metasearch library that aggregates results from diverse web search services
A general fine-tuning kit geared toward image/video/audio diffusion models.
woct0rdho / triton-windows
Forked from triton-lang/triton
Fork of the Triton language and compiler for Windows support and easy installation
Helper Project with Nvidia 50 Series support
An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A TTS model capable of generating ultra-realistic dialogue in one pass.
fufufafa and its local wisdom
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
🎦 Extract video hard subtitles and automatically generate corresponding srt files.
Convert any PDF into a podcast episode!
A simple screen parsing tool towards pure vision based GUI agent
DSPy: The framework for programming—not prompting—language models
The world's smartest system-wide grammar assistant; a better version of the Apple Intelligence Writing Tools. Works on Windows, Linux, & macOS, with the free Gemini API, local LLMs, & more.
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
SoftVC VITS Singing Voice Conversion
A feature-rich command-line audio/video downloader
🔊 Text-Prompted Generative Audio Model
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.