View ggerganov's full-sized avatar

Organizations

@ggml-org


Port of Nvidia LocateAnything-3B on ggml

C++ 93 6 Updated Jun 12, 2026

Vim-fork focused on extensibility and usability

Vim Script 100,482 6,922 Updated Jun 18, 2026

GitHub Action to speed up building using ccache

TypeScript 181 73 Updated Jun 15, 2026

Visualizer for neural network, deep learning and machine learning models

JavaScript 33,093 3,131 Updated Jun 17, 2026

Fast state-of-the-art image and video segmentation in portable C/C++

C++ 321 31 Updated Apr 10, 2026

Mount Hugging Face Buckets and repos as local filesystems. No download, no copy, no waiting.

Rust 747 54 Updated Jun 18, 2026

Portable C++17 implementation of ACE-Step 1.5 AI Music Generator using GGML. Text + lyrics in, stereo 48kHz MP3 or WAV out. Runs on CPU, CUDA, ROCm, Metal, Vulkan.

C++ 341 56 Updated May 20, 2026

A C++17 single-file header-only wrapper for llama.cpp

C++ 24 Updated Jun 15, 2026

Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.

TypeScript 2,104 199 Updated Jun 18, 2026

A free, open source, and extensible speech-to-text application that works completely offline.

Rust 24,058 2,026 Updated Jun 18, 2026

A cosy home for your LLMs.

Swift 1,326 76 Updated Jun 17, 2026

Audio playback and capture library written in C, in a single source file.

C 6,911 572 Updated May 10, 2026

Local LLM-assisted text completion for Qt Creator.

C++ 61 7 Updated Apr 16, 2026

Simple GUI around whisper.cpp for voice-to-text on Linux

Python 76 15 Updated Apr 22, 2026

Local LLM-assisted text completion for Qt Creator.

C++ 66 13 Updated Apr 16, 2026

MLPerf Client is a benchmark for Windows, Linux and macOS, focusing on client form factors in ML inference scenarios.

C++ 86 8 Updated Apr 20, 2026

Low-latency AI engine for mobile devices & wearables

C++ 5,350 430 Updated Jun 18, 2026

Emacs package for LLM-assisted code/text completion

Emacs Lisp 43 2 Updated May 22, 2026

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

C++ 4,524 352 Updated Jun 18, 2026

The application performs real-time inference on audio from an ALSA capture device

C++ 39 1 Updated Jun 19, 2025

TTS support with GGML

C++ 242 30 Updated Oct 5, 2025

Python 537 56 Updated Jun 11, 2026

LLM plugin for interacting with llama-server models

Python 31 6 Updated May 28, 2025

Running any GGUF SLMs/LLMs locally, on-device in Android

Kotlin 851 140 Updated Jun 17, 2026

DINOv2 inference engine written in C/C++ using ggml and OpenCV.

C++ 96 7 Updated May 6, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,310 1,990 Updated Jan 9, 2026

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,557 898 Updated May 12, 2025

📎 Clippy, now with some AI

TypeScript 1,311 74 Updated Nov 15, 2025