- UFMT
- Cuiabá, Mato Grosso - Brazil
- https://www.fredso.com.br
- @fred_s0
Stars
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join the Discord: https://discord.gg/5TUQKqFWd. Built in Rust using oh-my-codex.
MR-RATE: A Vision-Language Foundation Model and Dataset for Magnetic Resonance Imaging
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Multilingual TTS model with voice cloning and duration control, based on T5Gemma encoder-decoder LLM
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning
AI Edge Quantizer: flexible post-training quantization for LiteRT models.
Audio-to-Audio Schrödinger Bridges: a diffusion-based audio restoration model for bandwidth extension and inpainting.
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
Open Source Text-To-Speech Portuguese Dataset
Spotify scraper to extract track information from Spotify and download MP3s with cover art.
Running any GGUF SLMs/LLMs locally, on-device in Android
Awesome speech/audio LLMs, representation learning, and codec models
SoTA open-source TTS
Fine-tune the LLM component of the Spark-TTS model.
stlohrey/dia-finetuning
Forked from nari-labs/dia. A TTS model capable of generating ultra-realistic dialogue in one pass.
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis