Highlights
- Pro
Stars
[WIP] Better (FP8) attention for Hopper
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Replacement classic layers with KAN Block for Hyperspectral data
Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
30 days of React Native demos
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
A curated list of awesome audio technology resources for developers
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
GPU tensor framework with support for running ONNX models
Efficient Deep Learning Systems course materials (HSE, YSDA)
polyriddim is being known about how "unplayable" of the SV in osu!mania, due to the overwhelming popularity, I decided to create polyriddim map with new svs
Deep Generative Models Course
A course in reinforcement learning in the wild
Audio Generation and Enhancement
A lightweight, scalable, and general framework for visual question answering research
This is a pytorch implementation of our Recurrent Aggregation of Multimodal Embeddings Network (RAMEN) from our CVPR-2019 paper.
Конспекты для подготовки к экзамену по курсу Непрерывная оптимизация 2020 для специализации МОП ПМИ ФКН ВШЭ.
😎 Awesome lists about all kinds of interesting topics
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome