- Melbourne, Australia
Lists (1)
Sort Name ascending (A-Z)
Stars
Marketing skills for Claude Code and AI agents. CRO, copywriting, SEO, analytics, and growth engineering.
Pure-MLX port of Zyphra ZONOS2 (8B MoE TTS) for Apple Silicon — voice cloning, 44.1 kHz.
Pure-MLX port of Google's UMT5-XXL text encoder for Apple Silicon — pre-quantized int8, fast lazy-mmap load, parity-checked. The text encoder behind open video models like Wan2.1/2.2.
[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
SoulX-FlashHead: A unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.
Pure-MLX port of rednote-hilab/dots.tts — multilingual zero-shot voice-clone TTS on Apple Silicon
Production LLM inference on the Apple Neural Engine — a practitioner's guide, complete with converters, Swift runtimes, and validated model manifests
Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
The retrieval layer for production AI systems. Lightning-fast (<10ms) search without vector databases. Built for browser, edge, on-device, and cloud.
SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.
The open-source app everyone uses to manage agents at work
The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.
A variety of custom ComfyUI nodes and workflows
Apple MLX port of FasterLivePortrait for Apple Silicon
Memory library for building stateful agents
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…
The open source AI operating system
AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
Local banking voice assistant focused on banking
2.24x decode TPS increase On Qwen 3.6 27B @ temp 0.6 | Native MTP Speculative Decoding On Apple Silicon With No External Drafter.
Memory-efficient Cut Cross-Entropy for MLX on Apple Silicon