- Düsseldorf
- (UTC +01:00)
- https://kate941-su.github.io/website/
- https://medium.com/@kworkshere
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Portable file server with accelerated resumable uploads, dedup, WebDAV, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file, no deps
The official Python SDK for Model Context Protocol servers and clients
Realtime Voice Changer
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.
Download pictures (or videos) along with their captions and other metadata from Instagram.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
Securely and anonymously share files, host websites, and chat with friends using the Tor network
The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software
A community-driven distribution of up-to-date WebRTC framework binaries for iOS and macOS