top of page

Video Frame Extraction and AI Narration

Automates video downloading, frame extraction using OpenCV, script generation with GPT-4 multimodal LLM, and text-to-speech audio creation via OpenAI API. The workflow batches image frames, creates narration scripts, synthesizes voiceovers, and uploads results to Google Drive.

bottom of page