Lists (5)
Sort Name ascending (A-Z)
Starred repositories
A generative AI extension for JupyterLab
Fourth evolution of Code Injection for Xcode
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
Korean stock analysis MCP server with 6 investment gurus' strategies | 한국 주식 6대 투자대가 전략 분석 MCP 서버
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Bitmap fonts inspired by the font design from Nintendo DS
Simultaneous speech-to-text model
A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…
A Model Context Protocol (MCP) server that provides Xcode-related tools for integration with AI assistants and other MCP clients.
🚀 Give Claude AI superpowers for GitHub workflows. Transform "fix stuff" commits into professional messages, generate intelligent changelogs, and get AI code reviews - all with one command.
A browser-based API emulator for ChatGPT.
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
A Web UI for easy subtitle using whisper model.
kaldi-asr/kaldi is the official location of the Kaldi project.
Record spatial features of real-world objects, then use the results to find those objects in the user's environment and trigger AR content.
[CORL 2025 Oral]One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.
LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.
AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
a pose estimation result visualization application based on Swift
This GitHub repository provides cutting-edge tools for body pose estimation. It enables real-time pose estimation from images and camera feeds, along with visualization options such as heatmaps, li…
Bringing the scanning box from SceneKit to RealityKit