Speech Translation with OpenAI Whisper

Ng Wai Foong
5 min readJun 27, 2023

An experimental hack that works out-of-the-box

Photo by Hannah Wright on Unsplash

Whisper is a general-purpose speech recognition model built by OpenAI. It was officially released to the public in the late 2022 and is now one of the state-of-the-art model for speech recognition.

The model is trained on a large dataset of diverse audio and is capable of performing the following tasks:

--

--

Ng Wai Foong

Senior AI Engineer@Yoozoo | Content Writer #NLP #datascience #programming #machinelearning | Linkedin: https://www.linkedin.com/in/wai-foong-ng-694619185/