Skip to content

Tags: edwko/OuteTTS

Tags

0.4.2

Toggle 0.4.2's commit message
Update 0.4.2

0.4.0

Toggle 0.4.0's commit message
Info update

0.3.2

Toggle 0.3.2's commit message
fix

0.2.3

Toggle 0.2.3's commit message
fix

0.2.2

Toggle 0.2.2's commit message
refactor: streamline WavTokenizer interface and reduce size

- Remove model dependencies, consolidate into model.py

0.2.1

Toggle 0.2.1's commit message
Whisper integration for speaker generation

Added Whisper-based transcription for speaker creation when `transcript` is None (#28).

0.2.0

Toggle 0.2.0's commit message
Release OuteTTS v0.2.0

### Major Changes
- Added support for OuteTTS-0.2-500M model
- Introduced default speaker presets for each supported language
- **Breaking Changes**:
  - Incompatible speaker files from versions <0.2.0
  - Revised interface usage (see README.md)

### New Features
- Added voice cloning guidelines and interface usage in README.md
- Implemented Gradio example playground for OuteTTS-0.2-500M
- Multi-language alignment support
- Enhanced speaker management:
  - Methods: `print_default_speakers()` and `load_default_speaker(name)`
  - JSON format for speaker saving with language info
- Option to load WavTokenizer from custom path (fixes #24)
- Support for multiple interface version initialization

### Improvements
- Restructured library files for better organization
- Added hash verification for WavTokenizer downloads (fixes #3)
- Reworked interface for improved usability
- Made sounddevice optional with better error handling
- Included training data preparation examples

### Error Handling
- Improved validation for audio token detection
- Enhanced error messages for long inputs and EOS cases
- Better library-wide error handling and feedback