-
timedomAIn
- Beijing
- seanweichat
Lists (28)
Sort Name ascending (A-Z)
3d-rendering
unity or other 3D rendering relatedAI_tricks
audio_framework
audio-generation
models for audio generationbigData
blockchain
chatGPTxxx
dataset
DeepLearning—learning
dsp
game_framework
game_graphic
game_physics
image_generation
xxGAN, diffusion..infra
interesting
large_model
MIR_ASR
ML_model deploy/optimization
music-generation
nlp
other_tools
server_dev
TTS_or_singing-sythesis
deep-learning paper for MIR, TTS for SInging-synthesisui_framework
vocoder
voice-conversion
webui
Stars
A Model Context Protocol server for searching and analyzing arXiv papers
A high-throughput and memory-efficient inference and serving engine for LLMs
NVIDIA Linux open GPU with P2P support
A scriptable music downloader for Qobuz, Tidal, SoundCloud, and Deezer
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…
Krita is a free and open source cross-platform application that offers an end-to-end solution for creating digital art files from scratch built on the KDE and Qt frameworks.
Cross-Platform, GPU Accelerated Whisper 🏎️
Automatic fingering generator for piano scores
Implementation of "Attention Is Off By One" by Evan Miller
Your API ⇒ Paid MCP. Instantly.
a.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
The definitive Web UI for local AI, with powerful features and easy setup.
The Secure CommsOS™ for mission-critical operations
An open-source, self-hosted note-taking service. Your thoughts, your data, your control — no tracking, no ads, no subscription fees.
An open-source UI-first Identity and Access Management (IAM) / Single-Sign-On (SSO) platform with web UI supporting OAuth 2.0, OIDC, SAML, CAS, LDAP, SCIM, WebAuthn, TOTP, MFA, Face ID, RADIUS, Goo…
リアルタイムボイスチェンジャー Realtime Voice Changer
A timeline of the latest AI models for audio generation, starting in 2023!
AudioLDM: Generate speech, sound effects, music and beyond, with text.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
DALL·E Mini - Generate images from a text prompt
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)
A curated list of JUCE modules, templates, plugins, oh my!
Qt-oriented static code analyzer based on the Clang framework
Deep Performer: Score-to-audio music performance synthesis