Extract Text or subtitles From Video File


What is the use case

This robot, will extract text or subtitles from any video File along with the timestamp at which that text was present in the video file

How do you see a solution for the use case?

I am not certain at this point but this is what i am going to do Step by Step:

  1. Play the video
  2. at Every 200 mili-Second
  • Take the screenshot and pause the video, Get the Playback time and save it in a timestamp variable
  • Use any OCR reader and extract the text from screenshot
  • Save the extracted Text as well as timestamp variable value in CSV ( or PDF, Word etc. ) file
  • Goto Step 1
  1. Repeat all the above steps until video ends

Scope: ______________

  • Custom Activity
  • Reusable Component
  • Template
  • Automation Framework
  • Application Connector
  • Data Connector
  • RPA Documentation
  • Machine learning model
  • Dashboard


This is a nice trick. You can collect all the subtitles and actually train a text-Speech Model .
Building a text to speech engine like google or alexa would take atleast 50,000 hours of speech along with the text just for 1 language .
I double vote.


Awesome Solution for Extracting Hardcoded Text from video files


There is no available software or app, that can do this kind of thing, it would be nice, if you or someone else can make it happen


Hey this is an awesome idea. I like it and I keep doing these stuff as I’m OCR guy. Thumbs-Up!!!


thanks Ashok :slight_smile:


It so hard to implement…
I think its better to use some python with CV library, like OpenCV to do this


Curious to know the business use case this idea solves…


i think its easy if we use UiPath


Just give me idea how to read list of images from a folder, i can send u xaml file😅