shigabeev

🥐

Ilya Shigabeev shigabeev

🥐

59 followers · 46 following

langswap.app
https://langswap.app

Achievements

Highlights

stretch_audio Public

Pretty decent algorithm to stretch audio with far less artifacts than WSOLA/librosa.

Python 9 Updated Mar 4, 2026
awesome-russian-speech Public
Forked from alphacep/awesome-russian-speech

Russian speech technology links

2 Apache License 2.0 Updated Oct 23, 2025
vits2-inference Public

inference code for p0p4k/vits2_pytorch

Python 3 Updated Aug 1, 2025
mini_chat_gpt Public

Python 1 Updated Jun 20, 2025
whisperX Public
Forked from m-bain/whisperX

WhisperX: Automatic Speech Recognition with Accurate Word-level Timestamps.

Python BSD 2-Clause "Simplified" License Updated May 1, 2025
llama-cpp-python Public
Forked from abetlen/llama-cpp-python

Python bindings for llama.cpp

Python MIT License Updated Apr 11, 2025
vocos Public
Forked from gemelo-ai/vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python MIT License Updated Mar 8, 2025
yt-dlp-ui Public

Download videos from mulitple plaforms with an easy UI

Python Updated Mar 6, 2025
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion Forge with support for SD3.5

Python GNU Affero General Public License v3.0 Updated Jan 14, 2025
DataProcessingFramework Public
Forked from ai-forever/DataProcessingFramework

Framework for processing and filtering datasets

Python Apache License 2.0 Updated Dec 18, 2024
CRAFT-text-detection Public
Forked from boomb0om/CRAFT-text-detection

An unofficial PyTorch implementation of CRAFT text detector with better interface and fp16 support

Jupyter Notebook Updated Dec 2, 2024
Grounded-Segment-Anything Public
Forked from IDEA-Research/Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook Apache License 2.0 Updated Oct 22, 2024
sd-scripts Public
Forked from kohya-ss/sd-scripts

Python Apache License 2.0 Updated Sep 26, 2024
kohya_ss Public
Forked from bmaltais/kohya_ss

Python Apache License 2.0 Updated Sep 9, 2024
stable-diffusion-docker Public
Forked from FurkanGozukara/stable-diffusion-docker

Docker image for Stable Diffusion WebUI with ControlNet, After Detailer, Dreambooth, Deforum and ReActor extensions, as well as Kohya_ss and ComfyUI

Shell 2 5 GNU General Public License v3.0 Updated Jul 29, 2024
Q-VITS2-Voice-Cloning Public
Forked from FENRlR/MB-iSTFT-VITS2

WIP: VITS 2 with quantized output of text-encoder and voice cloning

Python 6 4 MIT License Updated Jul 19, 2024
OpenVoice Public
Forked from myshell-ai/OpenVoice

Instant voice cloning by MyShell.

Python MIT License Updated Jul 5, 2024
demucs Public
Forked from facebookresearch/demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python MIT License Updated Jul 5, 2024
flask-elastic-image-search Public
Forked from radoondas/flask-elastic-image-search

Python Apache License 2.0 Updated Jun 21, 2024
russian_tts_normalization Public

Normalize Text in Russian

Python 29 3 Updated Nov 7, 2023
vits2_pytorch_bigvgan Public
Forked from p0p4k/vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Python 6 MIT License Updated Sep 2, 2023
shigabeev Public

Updated Sep 1, 2023
HiFi-GAN Public
Forked from rishikksh20/HiFi-GAN

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python MIT License Updated May 28, 2023
serverless-template-whisper Public template
Forked from sahil280114/serverless-template-whisper

Python MIT License Updated Feb 7, 2023
address-normalizer Public

Ищет выбранный адрес в ФИАС

address standardization fias

Python 39 6 Updated Oct 10, 2022
ParlAI Public
Forked from facebookresearch/ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python MIT License Updated Aug 13, 2022
gruut-ipa Public
Forked from rhasspy/gruut-ipa

Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)

Python MIT License Updated Jun 24, 2022
awesome-speech-recognition-speech-synthesis-papers Public
Forked from zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

MIT License Updated Jun 3, 2022
Master-s-Thesis Public

Диплом ВКР

TeX Updated Jun 18, 2021
Paper-Template-for-INTERSPEECH-2021 Public

Paper Template for INTERSPEECH 2021

TeX Updated Feb 19, 2021

Ilya Shigabeev shigabeev

Achievements

Achievements

Highlights

stretch_audio Public

Uh oh!

awesome-russian-speech Public

Uh oh!

vits2-inference Public

Uh oh!

mini_chat_gpt Public

Uh oh!

whisperX Public

Uh oh!

llama-cpp-python Public

Uh oh!

vocos Public

Uh oh!

yt-dlp-ui Public

Uh oh!

stable-diffusion-webui Public

Uh oh!

DataProcessingFramework Public

Uh oh!

CRAFT-text-detection Public

Uh oh!

Grounded-Segment-Anything Public

Uh oh!

sd-scripts Public

Uh oh!

kohya_ss Public

Uh oh!

stable-diffusion-docker Public

Uh oh!

Q-VITS2-Voice-Cloning Public

Uh oh!

OpenVoice Public

Uh oh!

demucs Public

Uh oh!

flask-elastic-image-search Public

Uh oh!

russian_tts_normalization Public

Uh oh!

vits2_pytorch_bigvgan Public

Uh oh!

shigabeev Public

Uh oh!

HiFi-GAN Public

Uh oh!

serverless-template-whisper Public template

Uh oh!

address-normalizer Public

Uh oh!

ParlAI Public

Uh oh!

gruut-ipa Public

Uh oh!

awesome-speech-recognition-speech-synthesis-papers Public

Uh oh!

Master-s-Thesis Public

Uh oh!

Paper-Template-for-INTERSPEECH-2021 Public

Uh oh!