F5-TTS Web Application

A modern web interface for Gujarati Text-to-Speech synthesis using the F5-TTS model.

Features

Text-to-Speech Conversion: Convert Gujarati text to natural-sounding speech
Advanced Generation Parameters: Customize speech generation with adjustable parameters
Dark/Light Theme: Toggle between dark and light mode for comfortable viewing
Audio History: View, play, and download previously generated audio
Responsive Design: Works on desktop and mobile devices
User Authentication: Basic login system to protect the application

Technologies Used

Backend: Flask (Python)
Frontend: HTML, CSS, JavaScript
TTS Engine: F5-TTS model via Gradio API
Audio Processing: Generated WAV files with customizable settings

Setup & Installation

Prerequisites

Python 3.8+
Gradio API running locally on port 7860
The F5-TTS model files for Gujarati

Installation

Clone the repository:

git clone https://github.com/Ahir7/f5-tts-web-app.git
cd f5-tts-web-app

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Configure the model paths in app.py: Update the LANGUAGES dictionary with your local model paths.

Running the Application

Start the Gradio API for the TTS model on port 7860.
Run the Flask application:

python app.py

Access the web interface at http://localhost:5000

Usage

Login: Use the default credentials (admin/admin123) or update them in the app
Enter Text: Type or paste Gujarati text in the text area
Adjust Parameters (optional):
- NFE Step: Adjust the number of function evaluations (default: 32)
- Speed: Control speech speed (0.5-2.0)
- Random Seed: Set for consistent output (or -1 for random)
- Remove Silence: Toggle to trim silence
- Use EMA: Toggle Exponential Moving Average usage
Generate: Click "Generate Speech" to process the text
Listen & Download: Play the audio in the browser or download it
View History: Access previously generated audio files

Project Structure

app.py: Main Flask application
templates/: HTML templates
- index.html: Main TTS interface
- login.html: Login page
- history.html: Audio history page
static/: Static assets
- css/style.css: Application styling
- js/script.js: Client-side functionality
- audio/: Generated audio files

Customization

Adding Languages: Add new language configurations to the LANGUAGES dictionary in app.py
Styling: Modify the static/css/style.css file to change the appearance
Users: Update the users dictionary in app.py to manage authentication

License

MIT License

Acknowledgements

F5-TTS Model Team for the text-to-speech technology
Contributors and maintainers of the original F5-TTS project

Future Enhancements

Database integration for user management
Multiple language support
Batch processing capabilities
API key authentication
Customizable voice profiles

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
static		static
templates		templates
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

F5-TTS Web Application

Features

Technologies Used

Setup & Installation

Prerequisites

Installation

Running the Application

Usage

Project Structure

Customization

License

Acknowledgements

Future Enhancements

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

F5-TTS Web Application

Features

Technologies Used

Setup & Installation

Prerequisites

Installation

Running the Application

Usage

Project Structure

Customization

License

Acknowledgements

Future Enhancements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages