🧠 AI Audio Transcriber 🎙️

A Python + FastAPI web app that transcribes audio recordings using OpenAI Whisper. Supports multilingual transcription (including English and Hindi) and provides a beautiful web interface with real-time progress tracking.

🚀 Features

✅ Upload audio recordings (MP3, WAV, etc.)
✅ YouTube video transcription
✅ Transcribes audio using Whisper
✅ Supports English, Hindi & other languages
✅ Clean, responsive UI with multi-line output
✅ Dockerized for easy deployment
✅ GitHub Actions CI to auto-publish Docker image

🖼️ Demo UI

🛠️ Tech Stack

Backend: FastAPI, Whisper
Frontend: HTML + CSS (Jinja2 templating)
Container: Docker
CI/CD: GitHub Actions

📦 Installation (Local)

1. Clone the repo

git clone https://github.com/Amrish-Sharma/ata.git
cd ata

2. Install dependencies

pip install -r requirements.txt

3. Run the app

uvicorn app.main:app --reload

Open in browser: http://localhost:8000
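
If you start the app from a script or CI job, it can help to wait until the server actually answers before opening the browser or sending requests. The following is a minimal sketch using only the Python standard library; the function name `wait_for_server` and its `timeout`/`interval` parameters are illustrative and not part of this repo.

```python
import time
import urllib.error
import urllib.request


def wait_for_server(url: str = "http://localhost:8000",
                    timeout: float = 30.0,
                    interval: float = 0.5) -> bool:
    """Poll `url` until it returns any HTTP response or `timeout` seconds pass.

    Hypothetical helper, not part of the project code.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=interval + 1):
                return True
        except urllib.error.HTTPError:
            # The server answered with a non-2xx status, so it is running.
            return True
        except (urllib.error.URLError, OSError):
            # Connection refused or timed out: not ready yet, try again.
            time.sleep(interval)
    return False
```

Calling `wait_for_server()` with no arguments checks the default address used above and returns `True` once uvicorn is accepting connections.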

🐳 Docker Setup

Build & Run

docker build -t audio-transcriber .
docker run -p 8000:8000 audio-transcriber

Visit: http://localhost:8000

📡 GitHub Action – Docker Publish

This project includes a GitHub Action that automatically:

Builds the Docker image
Pushes it to GitHub Container Registry (GHCR) on every push to the main branch

The image will be available at (Docker image references must be lowercase):

ghcr.io/amrish-sharma/ata:latest

📱 Android Client (Coming Soon)

An Android app is in development to let users record or select audio and get transcriptions directly on their phones.

📁 File Structure

.
├── app/
│   ├── main.py         # FastAPI entrypoint
│   └── utils.py        # Whisper transcription logic
├── static/             # Static assets
│   ├── css/
│   │   └── style.css   # Main stylesheet
│   └── js/
│       └── main.js     # Frontend logic & AJAX handlers
├── templates/
│   ├── index.html      # UI frontend
│   └── result.html     # Result interface
├── uploads/            # Uploaded audio (gitignored)
├── Dockerfile
├── .gitignore
├── requirements.txt
└── README.md
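
For orientation, here is a rough sketch of what Whisper transcription logic like that in app/utils.py could look like. It is an assumption, not the actual code in this repo: the function names `load_model` and `transcribe_file`, the `"base"` model size, and the returned dictionary shape are all illustrative. It relies only on `whisper.load_model` and `model.transcribe` from the `openai-whisper` package.

```python
from pathlib import Path
from typing import Any, Optional


def load_model(name: str = "base") -> Any:
    """Load a Whisper model by size name (hypothetical wrapper).

    The import is deferred so transcribe_file can be exercised without
    the openai-whisper package installed.
    """
    import whisper  # provided by the `openai-whisper` package

    return whisper.load_model(name)


def transcribe_file(model: Any, audio_path: str,
                    language: Optional[str] = None) -> dict:
    """Transcribe one audio file and return one text line per segment.

    language=None lets Whisper detect the language; passing a code such
    as "en" or "hi" forces English or Hindi.
    """
    result = model.transcribe(str(Path(audio_path)), language=language)

    # Whisper returns timed segments; one line per segment gives
    # multi-line output instead of a single long string.
    lines = [seg["text"].strip() for seg in result.get("segments", [])]
    if not lines:
        # Fall back to the full text if no segments were returned.
        text = result.get("text", "").strip()
        lines = [text] if text else []

    return {"language": result.get("language"), "lines": lines}
```

Passing the model in as an argument means it can be loaded once at startup and reused for every upload, rather than reloaded per request.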

🤝 Contributing

Pull requests welcome! For major changes, please open an issue first to discuss what you'd like to change.

📄 License

MIT License
