Skip to content
View atiorh's full-sized avatar

Block or report atiorh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Very fast, accurate speaker diarization

Python 148 14 Updated Oct 3, 2025

The Package Manager for the Swift Programming Language

Swift 1 Updated Sep 18, 2025

Superwhisper is an AI voice-to-text tool for quick and accurate transcription. Download free now.

55 Updated Aug 23, 2025

State-of-the-art TTS model under 25MB 😻

Python 8,873 435 Updated Aug 23, 2025

Apple and Android Device Knowledge Base for On-device Inference Deployment Configurations

Python 5 Updated Sep 20, 2025

Argmax SDK Swift Playground

Swift 13 1 Updated Sep 26, 2025

Local-first AI Notepad for Private Meetings

TypeScript 6,313 380 Updated Oct 10, 2025

Voice-to-text app for macOS to transcribe what you say to text almost instantly

Swift 1,901 236 Updated Oct 9, 2025

VOICE → WORDS

Swift 324 37 Updated Sep 10, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,836 3,126 Updated Oct 10, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,099 393 Updated Sep 10, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,738 460 Updated May 5, 2025

An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.

Python 515 41 Updated Oct 5, 2025

LiteRT continues the legacy of TensorFlow Lite as the trusted, high-performance runtime for on-device AI. Now with LiteRT Next, we're expanding our vision with a new generation of APIs designed for…

C++ 852 120 Updated Oct 10, 2025

Open-source reproducible benchmarks from Argmax

Jupyter Notebook 61 2 Updated Oct 9, 2025

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 2,728 217 Updated Sep 25, 2025

Towards Human-Sounding Speech

Python 5,610 468 Updated May 6, 2025

SmolVLM2 Demo

Swift 174 20 Updated Mar 20, 2025
Swift 3 1 Updated Jan 27, 2025

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 4,285 308 Updated Oct 10, 2025

Pipelines for running pretrained Core ML models

Swift 43 14 Updated Jan 29, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,448 950 Updated Oct 8, 2025
Python 7 1 Updated Jan 10, 2025

Python tools for WhisperKit: Model conversion, optimization and evaluation

Python 228 78 Updated Jul 31, 2025

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,560 77 Updated Oct 3, 2025
Swift 120 9 Updated Jun 26, 2025

Example PIR service & documentation for Live Caller ID Lookup & NEURLFilter

Swift 188 24 Updated Oct 1, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 150,824 30,710 Updated Oct 9, 2025

The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Java 310 75 Updated Oct 1, 2025
Next