transcriber

Description

This transcribe script is intended primarily for use with the Bonsai Tutorials video collection, where it creates SRT subtitle text files for the tutorial videos.

With so many videos, it became difficult to remember which one contained a valuable explanation and exactly when in the video that explanation occurred.

I found the OpenAI Whisper Python library and set about using it to transcribe the videos to SRT subtitle text files (with help from the pydub and pysrt modules).
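As a rough illustration of what "transcribing to SRT" involves, the sketch below turns a list of timed text segments into SRT-formatted text using only the standard library. It is not this project's code, which relies on pysrt for this step. The function names `format_timestamp` and `segments_to_srt` are hypothetical, and the segment shape (dicts with `start`, `end`, and `text` keys, times in seconds) is an assumption modelled on the segments Whisper returns.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render timed segments as SRT text.

    Assumption: each segment is a dict with "start", "end" (seconds)
    and "text" keys. Each cue is a 1-based index, a timing line, the
    text, and a blank line separating it from the next cue.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)
```

Because each cue carries its own start and end time, a plain text search through the resulting file is enough to find when in a video a given explanation occurs.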

SRT subtitle text files are described in the Wikipedia entry on the SubRip format.

I used the cookiecutter-uv project template to try to ensure that I had the basis of a modern Python project.

Prerequisites

You can either:

  1. Use uv (the Python package manager) and make to manage the script's dependencies. See the 🚀 Getting Started document for more details.
  2. Use the included Dockerfile and the docker command to make use of this script. See the 🚢 Using Docker document for more details.

Notes

Each matching video found in the target directory will have an .srt file created as a sibling. This makes it easy for video players to match the video with the subtitles.
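The sibling naming can be sketched with pathlib as follows. This is a minimal illustration, not the script's actual implementation: the helper names, the set of video extensions, and the non-recursive directory scan are all assumptions made for the example.

```python
from pathlib import Path

# Hypothetical set of extensions treated as "matching" videos;
# the real script may match files differently.
VIDEO_SUFFIXES = {".mp4", ".mkv", ".mov"}


def sibling_srt_path(video: Path) -> Path:
    """Return the .srt path next to the video, sharing its base name."""
    return video.with_suffix(".srt")


def find_videos(target_dir: Path) -> list[Path]:
    """List video files directly inside target_dir (assumed non-recursive)."""
    return sorted(
        p for p in target_dir.iterdir()
        if p.is_file() and p.suffix.lower() in VIDEO_SUFFIXES
    )
```

For example, `sibling_srt_path(Path("videos/intro.mp4"))` gives `videos/intro.srt`, which sits in the same directory as the video it belongs to.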