Video Transcription Tool

A command-line utility that converts video files to readable transcripts and summaries using OpenAI's APIs.

Features

Extracts optimized audio from video files using FFmpeg
Transcribes audio to text using OpenAI's Whisper API
Formats transcripts into readable HTML with proper structure
Generates concise summaries of video content
Handles batch processing of multiple videos
Intelligently caches intermediate files to avoid redundant processing

Requirements

Python 3.6+
FFmpeg installed and available in PATH
OpenAI API key

Installation

Clone this repository:

git clone https://github.com/iliakan/transcript.git
cd transcript

Install required Python packages:
```
pip install requests
```
Set your OpenAI API key as an environment variable:
```
export OPENAI_API_KEY="your-api-key-here"
```

Usage

Process a single video

python video_to_text.py /path/to/your/video.mp4

Process all videos in the 'video' directory

python video_to_text.py

How It Works

Audio Extraction: The tool extracts audio from the video file, optimizing it for transcription with appropriate bitrate and filtering.
Transcription: The audio is sent to OpenAI's Whisper API for transcription.
Formatting: The raw transcript is processed by GPT-4o to create a well-formatted HTML version with proper paragraphs and sections.
Summarization: GPT-4o generates a concise summary of the video content in HTML format.
Output: Both the formatted transcript and summary are saved in the project directory structure.

Output Directory Structure

The tool organizes processed files in a flat structure with separate directories for each file type:

├── audio/
│   └── video_name.mp3       # Extracted audio files
├── transcript/
│   ├── video_name.txt       # Raw VTT transcripts
│   └── video_name.html      # Formatted HTML transcripts
├── summary/
│   └── video_name.html      # HTML summaries
└── video/
    └── video_name.mp4       # Source video files

Notes

The tool optimizes audio extraction to keep files under 25MB (OpenAI's file size limit)
All files (audio, transcripts, HTML, and summaries) are cached to avoid redundant processing
Place your video files in the 'video' directory for batch processing
Detailed error information is displayed for OpenAI API errors to help with troubleshooting

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
video_to_text.py		video_to_text.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Video Transcription Tool

Features

Requirements

Installation

Usage

Process a single video

Process all videos in the 'video' directory

How It Works

Output Directory Structure

Notes

License

About

Uh oh!

Releases

Packages

Languages

iliakan/transcript

Folders and files

Latest commit

History

Repository files navigation

Video Transcription Tool

Features

Requirements

Installation

Usage

Process a single video

Process all videos in the 'video' directory

How It Works

Output Directory Structure

Notes

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages