- Berlin, Germany
- https://andywer.com
- @andywritescode
Stars
Under 1KB each! Super Tiny Icons are minuscule SVG versions of your favourite website and app logos
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
pix2code: Generating Code from a Graphical User Interface Screenshot
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
Super Resolution for images using deep learning.
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This framework enables Claud…
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
A collaboration friendly studio for NeRFs
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Open Source framework for voice and multimodal conversational AI
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
The LBRY SDK for building decentralized, censorship resistant, monetized digital content apps.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
[EMNLP'23, ACL'24] To speed up LLM inference and improve LLMs' perception of key information, compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild