
Whisper AI: Live English Subtitles for 96 Languages


Formal Metadata

Title
Whisper AI: Live English Subtitles for 96 Languages
Title of Series
Number of Parts
141
Author
Contributors
License
CC Attribution - NonCommercial - ShareAlike 4.0 International:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal and non-commercial purpose as long as the work is attributed to the author in the manner specified by the author or licensor and the work or content is shared, also in adapted form, only under the conditions of this license.
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Whisper AI, a model from OpenAI, has been largely overlooked despite its impressive ability to accurately transcribe and translate human speech from audio. In this talk I will explore the architecture of the model and explain why it works so well. Additionally, I will live demo the model's capabilities in three languages, showing how you can use it on your own computer to generate English subtitles for a wide range of content.
Transcript: English (auto-generated)
[Japanese] Hello everyone, welcome to my talk. Thank you very much for coming. Speaking in Japanese the whole time would be a bit difficult, so from here it will continue in German.
As a matter of fact, I'd like to introduce a very special person here today. Yes, welcome to this talk again.
I'll continue in English and tell you a little bit about this tool that I have built and how it works. We start by introducing the AI
that I was using, Whisper AI. Then I'll tell you how I use it to generate the subtitles that you just saw. And lastly, we'll go a little bit into how to do asynchronous programming in Python, whether to use threading or multiprocessing,
and what the differences between them are. So, the AI I was using is called Whisper AI. It is by OpenAI and was released last September, between the releases of Stable Diffusion and ChatGPT, which I think is a reason why this AI has been a bit
overlooked since then. I think it is very useful, and maybe you just saw why. It's actually completely open source, unlike ChatGPT. You can see the code on GitHub and the weights are on Hugging Face, so you can use it on any computer you have that can run it.
Let's continue with this overwhelming slide. Don't worry, we won't go through everything, but this is basically all the detail there is for Whisper; it's also from the OpenAI paper.
Let's just start with the training of the AI, what they have done. How did they achieve what you just saw? They trained on mostly three different kinds of data. They trained on English sound with English subtitles.
They trained on non-English sound with non-English subtitles. And also some on, for example, German audio with English subtitles, so the translation was also in the training data. They used a lot of different kinds of audio, I think mostly from YouTube, but probably also other sources,
which means that this model is very, very robust and doesn't really need any fine-tuning. What you just saw was the pure model that you can download, and maybe you noticed that I didn't even have to tell it what language I was speaking. I didn't switch anything. I just spoke in different languages and it realized what was going on and translated all of it
to English, because that's what I told it to do. I could also have told it to write down what I'm saying in the same language, but that's not quite as useful, maybe. Back to this overview. On the top right, we have the architecture of the model, which we're going to look at next.
If you have been following AI news in the last year, you might recognize this shape. It is actually pretty much the exact same architecture as a transformer. So, the same as ChatGPT; the T stands for transformer.
I'll just go through it in a little bit of detail; I'm not a super expert on this. ChatGPT has inputs and outputs and both are text, right? You put some text in and it gives you the next piece of text. And the architecture has something called
attention, which means that to predict the next token, it looks at all of the tokens, or the context window of tokens, that came before, but not all with the same weight. The attention part means that some tokens relate to other tokens more strongly than others.
It's actually quite simple mathematics and a genius concept, and it works really well. It just means, for example, that if I start speaking Japanese and there is a keyword that's obviously Japanese, then that information spreads to all the other words and says: hey, we are speaking Japanese. That's kind of how it works.
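To make the idea of attention a bit more concrete, here is a minimal toy sketch of scaled dot-product attention, the core operation of a transformer layer. This is only an illustration of the "simple mathematics" mentioned above; the real model uses many attention heads and learned projections on top of this.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Each token's output is a mix of all tokens' values,
    weighted by how strongly it relates to each of them."""
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)                     # pairwise token-to-token relevance
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # softmax over the context
    return weights @ V                                # weighted mix of the other tokens

# toy example: 4 tokens, each an 8-dimensional embedding
tokens = np.random.randn(4, 8)
out = scaled_dot_product_attention(tokens, tokens, tokens)
print(out.shape)  # (4, 8): one updated representation per token
```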
Whisper doesn't have text on the input side, but sound waves; the output, however, is exactly the same, also tokens of text. So, Whisper learned to listen to sound and it also learned the correct text output. It tries to predict the next text token, just like ChatGPT,
just the input is a little bit different. So, when I'm speaking, it listens to the sound and tries to predict the next text token; it is basically the same thing as a text transformer, which is quite a nice parallel. The last part I'm not really going to explain; it is just about how the tokens work in Whisper exactly,
which is not extremely interesting. Instead, I'll show you another example of what I mean by robustness of the model. I'm going to play this clip. It's the beginning of a song, and I want you to try to understand the lyrics.
Sound, break it down, thinking of making a new sound, playing a different show every night in front of a new crowd that's new now. Ciao seems to laugh, it's great now. See me lose focus, I'll just sing to you loud. Who understood everything?
One, two. Two people. Maybe you are from Britain. Maybe you understand sign language. Or maybe you are Ed Sheeran super fans, I don't know. But two people out of maybe 50. Of course, I also don't really understand everything. One of the first things I did was put this song into Whisper, just the base Whisper that you can download,
and the base version just takes sound, basically an MP3 for example, works for a while and then gives you all the text that is in the sound, in this MP3. So, let's see what Whisper heard. You can check the text after these brackets, and we'll play it again.
Not too bad I think.
Well done, OpenAI. Actually, this text was not generated by the real Whisper, the largest model; it was one step down. They have a few different versions, and I used the medium one, which runs on my own computer at home. I have a gaming computer with a graphics card
from three years ago, and the one thing that is limiting is mainly the graphics memory: I have eight gigabytes and the real Whisper needs 11. So, this is not even its full, not even its final form.
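As a rough sketch of what this basic offline usage looks like with the open-source openai-whisper package (the file name is just a placeholder; as mentioned above, the medium model needs a few gigabytes of GPU memory, or it falls back to a slow CPU run):

```python
import whisper

# download / load the medium checkpoint (runs on GPU if available, else CPU)
model = whisper.load_model("medium")

# feed it an audio file and ask for English output regardless of the input language
result = model.transcribe("song.mp3", task="translate")

print(result["text"])             # the full transcribed/translated text
for seg in result["segments"]:    # per-phrase timestamps, useful later for subtitles
    print(f'{seg["start"]:6.1f}s  {seg["text"]}')
```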
Another example of how good Whisper is: many people tell me Microsoft Teams has the same feature, right? Yes, it does have the same feature. You can speak any language and it will give you English subtitles. So here's a little example; I turned it on while we had a meeting at work.
It says: "yes, so it may be that she does not jump in that at all. The problem was and then the relaxed it was really because of this." Okay, very helpful. At the same time I was running my own program, again on my medium-sized graphics card, roughly the same one that I demoed at the beginning.
And that says: "so it could be that this print wasn't the problem. And then it was really because of these block things." So, I mean, if you know we're in a retro, we're doing Scrum and everything, that makes sense. So that is one difference. But the biggest difference here is that the thing
on the left is running in the cloud, right? There are some more implications, which we'll get into later. But the thing on the right is running locally on my machine. So it's really, really portable and you can do a lot with it. I think that's really fascinating, and that is why I want
to introduce Whisper to the world a bit more. So, let's talk about Whisper AI for subtitles. That's basically how I used Whisper - the program that just takes an MP3 and gives out text - to make the tool that you saw at the beginning, which transcribes
and translates live while I'm speaking, which is not really the same thing. First of all, why did I do this? What was the challenge or the idea? I used to work at a Munich-based company. This company was founded over 20 years ago
and used to only have German employees. Now, they have expanded. They have hired people all over Germany and also all over the EU. Some are even working in Australia now. So, 90% of the employees are still German and maybe don't speak perfect English.
They usually do speak very good English, but maybe not perfect. Maybe it's nicer for them to speak and hear something in German. And 5% don't understand German at all. So, this is kind of a conflict. And there was a weekly meeting that gives updates about the company.
And it used to be only in German, obviously. And it continued to be in German for a while while there were non-German speakers in the company. So, the solution was to have like a distilled version of that meeting just after that in English for the English colleagues, which is a workable solution. But what I really wanted is that everybody
in the company can come together again into this big meeting where we have important announcements. We learn about the new hires. And we can also celebrate things together all in one meeting, no matter the language you understand. Yeah, of course, another solution would be to speak English, but it has its downsides as well
in a previously only German company. So, that is basically why I got the idea to develop this tool. And the version that I built at the company was that this meeting is in Zoom. So, there's 500 people in the same Zoom meeting.
There's a big presentation from an office, and it's streamed to all the participants. In this meeting, a lot of very sensitive topics were discussed. We trusted Zoom, and it also has end-to-end encryption, but we did not want to just send all the data to any server.
So, this is where Whisper comes in again. We have a big, very expensive computer in the office with a very expensive graphics card. I just put Zoom on that PC and it joined the meeting. On that PC, Whisper AI was also running. So, Whisper AI was basically also in the meeting, listening
in to the German and generating the subtitles live, as you saw just now. The subtitles are combined with the video from the meeting in a little tool called OBS Studio, which is also very useful for streaming, for example. In OBS Studio, I define something called a virtual
camera, and I stream the combination of the meeting with the live-generated subtitles back into the meeting, so that if people want subtitles, they can just watch the user which is logged in on the AI computer. So, basically, Whisper is in the meeting as well, listening in and typing really fast.
And people can just look at its camera just to get subtitles. Yeah, this worked reasonably well. Since we are at EuroPython here, I'm going to zoom in on the Python part a bit and explain a little bit more about the challenges with the live transcription and translation.
The basic structure of the program - it's a bit small on the slide - is that the Python code records a little bit of audio, for example one second, and keeps adding to it to make a longer and longer piece of audio,
then gives it to Whisper, because Whisper can only transcribe pieces, like files of audio; it can't really do live yet. So I'm kind of making it transcribe really fast, many, many small pieces. It does its thing, it gives me the subtitles, and I show them; so far no problem. But if you just implement it like this,
without thinking about asynchrony, you realize that while Whisper is working - the AI needs one or two seconds to run - the audio code is not running. So I'm always losing as much sound as Whisper takes time to translate. That is, of course, not good.
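A minimal sketch of that naive, blocking version - `record_one_second()`, `write_wav()` and `show_subtitles()` are hypothetical placeholders here - makes the problem visible: while `transcribe` runs, nothing is recording, so that audio is simply lost.

```python
import whisper

model = whisper.load_model("medium")
audio_buffer = b""

while True:
    audio_buffer += record_one_second()           # hypothetical helper: 1 s of raw audio
    write_wav("buffer.wav", audio_buffer)         # hypothetical helper: dump buffer to a WAV file
    result = model.transcribe("buffer.wav", task="translate")  # blocks for 1-2 seconds...
    show_subtitles(result["text"])                # ...and during that time no audio is recorded
```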
So, parallelism has to come in. And what I really want is I want a thread or a piece of code to run in the loop and record one second of audio every second, really without a gap. And I want another piece of code to use Whisper, send it the audio that we have currently and get the results back.
Any time it's done, it immediately gets a new audio file to transcribe. And then on top, of course, I want to show the subtitles. I want to have them available not just while the AI is not thinking, but all the time. So, three kind of separate parts of the code.
What I used is threading, the threading library from Python. I basically just started a thread for the listening code, which does this loop, and I started a thread for the transcribing, and that's it. The rendering is a trick,
because it's actually shown via a browser: I just use setTimeout in JavaScript and ask every five seconds to get the new text. Quite easy.
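Here is a rough sketch of that threading structure, reusing the hypothetical `record_one_second()` and `append_to_wav()` helpers from above. The real tool synchronizes via a WAV file on disk, as explained next, but the two-threads-plus-queue shape is the same.

```python
import queue
import threading
import whisper

model = whisper.load_model("medium")
work = queue.Queue()      # listening thread -> transcribing thread
subtitles = []            # read periodically by the rendering side

def listen():
    """Record one second of audio every second, without gaps."""
    while True:
        chunk = record_one_second()          # hypothetical helper
        append_to_wav("buffer.wav", chunk)   # hypothetical helper: grow the WAV buffer
        work.put("buffer.wav")               # tell the transcriber there is new audio

def transcribe():
    """Whenever new audio arrives, run Whisper on the whole buffer."""
    while True:
        path = work.get()                    # blocks until the listener adds something
        while not work.empty():              # skip stale notifications; always use the latest buffer
            path = work.get()
        result = model.transcribe(path, task="translate")
        subtitles.append(result["text"])

threading.Thread(target=listen, daemon=True).start()
threading.Thread(target=transcribe, daemon=True).start()
```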
Now the details of what I just described. The audio side uses a temporary file, a WAV file, so the synchronization between these two loops is actually done via the hard drive. Operating systems are quite good at having multiple processes read the same file, and WAV is really nice because you can append to the end, read from the beginning and delete from the beginning. It's basically an array of audio on the hard drive.
So, I put one second at the end of the file every second, and whenever Whisper is done, I tell the Whisper thread: hey, there's a new piece of audio, go. I did this via a queue: I just add the audio file
to the queue, and the Whisper thread reacts every time something is added to this queue, which is basically every time it's done. It runs, takes these three seconds, and outputs some text, hopefully. And then, as you saw at the beginning, sometimes the text is gray, because it's not really clear if that's really what was said.
Sometimes it's black, because I say: okay, I'm sure about this now, it's committed. I do this for two reasons. I don't want this WAV file to grow and grow and grow, because then the AI would have to think about more and more sound, it would get slower and slower, and in the end it just wouldn't work. And also, to display subtitles,
I kind of need to say: okay, this subtitle is done now, the next one is coming, things like that. So, whenever it finds a whole sentence - and Whisper very nicely gives us these things; remember, with the song it was in four lines, so it says there's a phrase here, a phrase here, a phrase here - whenever Whisper tells me a phrase has finished and a new phrase has started, I take the text and commit it.
It also tells me where this phrase was in the sound file, so I can just delete that time span from the sound file. So, the file never really grows much: it grows a little bit, then the committed sentence gets removed from the beginning, then it grows again, and so on.
The file gets trimmed from the beginning, and that's it.
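As a sketch of that commit-and-trim step: Whisper's result contains timestamped segments, so once a phrase is considered finished you can commit its text and drop that many seconds from the front of the buffer. The segment fields are what openai-whisper actually returns; `trim_wav_start()` is a hypothetical helper, and treating every segment except the last as final is just one possible heuristic.

```python
result = model.transcribe("buffer.wav", task="translate")

committed = []
for seg in result["segments"][:-1]:      # treat all but the last, still-open phrase as final
    committed.append(seg["text"])        # black, committed subtitle text
    last_end = seg["end"]                # where this phrase ends inside the buffer (seconds)

if committed:
    # drop the committed audio so the buffer, and Whisper's work, stays small
    trim_wav_start("buffer.wav", seconds=last_end)   # hypothetical helper

pending = result["segments"][-1]["text"] if result["segments"] else ""  # gray, provisional text
```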
Anyway, when the text is committed, it goes into another queue to the front end, basically, to add another line of text to the result array, so to say. And then the front end, like I said, polls every half second and gets the new text.
There is an open source version of this. It's not exactly what is shown here, but it has the back-end part, basically: the parallel recording and transcribing. So, those are the details of the program. Don't worry, there will be a link to the slides later.
Now, just some words about the Python details of threading and multiprocessing, to compare the two, because they are different things: the threading library and the multiprocessing library, and another option later. Threads have shared memory. So, when I start these two threads,
I can actually share objects between them, and I don't really have to think much about allocation of resources; Python does everything for me. They communicate via a queue: both threads are running asynchronously, and if I give the same queue reference to both threads,
they can send each other messages through it. That leads to the asynchronous code you saw. However, threads actually cannot run in parallel, which might be surprising. Threads are blocked by the GIL, the global interpreter lock,
which means that in Python, only one piece of Python code can run at the same time. However, when using an AI, we are calling into C code or some ML framework; it is not Python that is actually running the model,
which is why, at that moment, the GIL - the lock - is released, and the rest of the code can run again. So, it is asynchronous, not exactly parallel, but that works just well enough for my use case.
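A tiny illustration of why this is good enough: in the toy below, `time.sleep` stands in for the GPU-bound Whisper call, since both release the GIL while they wait, so the other thread keeps making progress.

```python
import threading
import time

def fake_whisper_call():
    time.sleep(2)          # releases the GIL while blocked, like a GPU inference call
    print("transcription done")

def keep_recording():
    for i in range(4):
        time.sleep(0.5)    # the "audio loop" keeps running during the fake inference
        print(f"recorded second {i + 1}")

t1 = threading.Thread(target=fake_whisper_call)
t2 = threading.Thread(target=keep_recording)
t1.start(); t2.start()
t1.join(); t2.join()
# both threads make progress concurrently, even though Python bytecode itself
# never runs in two threads at once because of the GIL
```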
The more complicated and stronger version would be processes, i.e. multiprocessing. When you use multiprocessing in Python, you actually start whole Python instances, so you have two Pythons running. They are completely separate, which makes the code quite a lot more difficult to write, because you have to think about whether this code has now been spawned in the child process or whether it is the main code,
and so on. They also communicate via a queue, but these queues are not the same queues - that's why you can see the different colors on the slide. The kind of queue that you need for multiprocessing actually pickles the data, so it has to be serializable.
I'm not exactly sure of all the details of pickling - I'm not actually a Python expert on this point - but it is much stricter about what data you can send through the queue. The threading queue is very easy: you can send references and things like that. There's also something called a pipe, which just connects two processes. A queue can be read by multiple consumers and producers.
So, a queue is really general, but it also makes the code a bit harder; pipes are just one way in, one way out, and they are also very strict. The main positive of using processes, of course, is that it actually uses all your cores if you want: if you spawn eight processes,
you can use all eight of your CPU cores. But note that in my example, with recording audio and transcribing with Whisper, I don't care about the CPU. The audio maybe uses the CPU, but Whisper does not; Whisper runs on the GPU. So, I don't need multiprocessing. I actually did build the latest version with multiprocessing.
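For comparison, a minimal sketch of the same split done with processes instead of threads. Note that a `multiprocessing.Queue` pickles whatever you put on it, so only serializable things like file paths or plain strings can be passed, not open file handles or live objects.

```python
import multiprocessing as mp
import whisper

def transcriber(work: mp.Queue, results: mp.Queue):
    # this runs in a completely separate Python process
    model = whisper.load_model("medium")
    while True:
        path = work.get()                      # the path arrives pickled from the parent
        result = model.transcribe(path, task="translate")
        results.put(result["text"])            # plain text is picklable, so this is fine

if __name__ == "__main__":                     # required: child processes re-import this module
    work, results = mp.Queue(), mp.Queue()
    mp.Process(target=transcriber, args=(work, results), daemon=True).start()
    work.put("buffer.wav")                     # hand the child a file path, not an object
    print(results.get())
```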
But the speed is exactly the same; it is just a little bit more fancy and takes longer to stop. So, since that is the case - since I don't actually need real multiprocessing in Python - in the very newest version I used async/await,
which isn't parallel at all. It is a single event loop, single-threaded. You just have to write your code in a way that it never blocks itself, and that's fine, because you can await the AI while your Python code keeps running.
So, I can just use that. It makes coding easier because you don't really need a queue; you can just use in-memory variables if you're lazy, like me. And if you just use one file, it's super easy to read, as long as it's not too much code. It just makes everything easy enough. Also, it works well with JavaScript and with being a server.
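A rough sketch of that async/await shape, under the assumption that the blocking Whisper call is pushed onto a worker thread with `asyncio.to_thread` so the single event loop never blocks; `record_one_second()` and `append_to_wav()` are again hypothetical placeholders.

```python
import asyncio
import whisper

model = whisper.load_model("medium")
subtitles = []            # plain in-memory state, no queue needed

async def listen():
    while True:
        chunk = await asyncio.to_thread(record_one_second)    # hypothetical helper
        append_to_wav("buffer.wav", chunk)                     # hypothetical helper

async def transcribe():
    while True:
        # the blocking model call runs in a worker thread; the event loop stays free
        result = await asyncio.to_thread(model.transcribe, "buffer.wav", task="translate")
        subtitles.append(result["text"])

async def main():
    await asyncio.gather(listen(), transcribe())

asyncio.run(main())
```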
So, what I showed at the beginning is actually an API for the AI, which leads me to my next topic. What I presented up to now was running Whisper AI on your own computer, which is very nice. It can be very fast,
if you just buy a nice GPU. The main good thing about it is probably the privacy and data safety. Your data does not leave your room. You can just do it at home. But the downside, of course, is it's quite a heavy thing, you know, the computer. I don't have it here. Maybe you noticed I didn't bring my gaming PC with me.
So, instead, the new version I wrote - which is on my GitHub, link at the end - is actually a server version of this. What I showed you at the beginning is running right now in the cloud, which is actually not AWS or Google or whatever; I use Lambda Labs, which is quite a nice company.
There you can rent really nice, really fast GPUs for not too much money. On AWS, if you use the cheapest GPU, it works - it has enough RAM - but it is so slow that the same tool is basically unusable. On Lambda Labs I use the A10,
which is kind of the weakest one that I always have available. You could also use the H100, the best GPU in the world; you can get it with a click for two euros per hour, which is quite nice. And this is the current version of my code, which runs quite well in the cloud. So, for the future, my idea, or kind of my dream, is
to have one of these tools, or maybe something that is not quite so big, to be able to go to any country with just internet access, put it on, and understand what everybody around me is saying. If you have one of these things, talk to me. I want to try it out.
And that's it. Thank you very much for this very inspiring talk. We do have time for some questions. If somebody has a question about this, then please step to the microphone.
Hi, loved your speech. You tested out a few major languages, but what if you tried some more niche languages - not like Japanese or Chinese, but, I don't know, Lithuanian or something from a small country?
Would it work? Who in the room is from a very small country? Can we have a show of hands, please? I tried Lithuanian; it works okay. Lithuanian is not great because there's not much training data. I have a Lithuanian friend; they were not happy. But yeah, if someone wants to ask the next question in an interesting language, I would be happy to try.
Can you please, for this question, come here so that your microphone will work? I think this will do, no? Try. Hi.
So I think it can't hear you. I'm not sure what he said. Was that half right, at least? No, it can't hear you.
The microphone is over here. So it can only hear me. Yes, please, go ahead. Okay, let's give this another try. Is it this one? One more time, we have time.
So let's try this once more. Yeah, in the end it got something. Hopefully it will be better after the second one. Of course, silence also confuses it a bit, and when you switch languages around it's not that easy, but the longer you talk the easier it gets, and there are a few sentences
that he said, I think. So, another question, please. Yes, I also took a look at this Whisper AI, and there are these smaller models - are they able to run on a normal laptop, or is that simply impossible? Whisper can run on a CPU, and
if you have a lot of RAM you can even use the big models, but it's really slow. It needs some space, but it actually needs speed a little more than space. If you can wait, then it's okay. If you have RAM and you can just use the CPU, it's fine. But I recommend any GPU,
especially gaming GPUs because they are faster. The professional ones are not super fast sometimes. The next question, please. You mentioned that the AI outputs these phrases, is it also possible to classify which person is speaking the phrase? The AI cannot do it. So you would have to do some
kind of voice recognition, I'm not sure; it's a different problem, quite interesting too. Okay, the next question, please. So, there are some alternative implementations of Whisper: there is whisper.cpp, faster-whisper, WhisperX. Did you try those? The example at the beginning is not plain Whisper, it's Whisper JAX.
Oh, that's yet another one. I didn't mention it, because I don't really know what JAX means, but it is a faster implementation. It needs a little bit more installation at the beginning, but it works basically the same, and really fast. Thanks, I will try it. Yeah, go ahead. Whisper JAX. All right.
You were threading the audio and then joining it and sending it to Whisper. Did you have any trouble doing that? Was it straightforward, or were there issues? Yeah, it's quite
annoying sometimes, handling audio. In this last version, I'm handling audio from the browser, and the browser sometimes sends OGG and sometimes not, and sometimes it compresses it. There are some fiddly bits. But overall, especially when you're not using the browser
and sending the sound over the internet, but just using the microphone input into Python directly, it wasn't that bad. Regarding noise: did you also manage to deal with noise or ambient sound? No, I didn't do that. I only used Whisper; I didn't really massage the sounds.
Whisper is very good with some noise. But what's interesting is when there's applause, it always says, thank you. Because of course, that's what people say. Okay, thank you. Actually, that is a good last word to thank you again for your talk, because we are out of time. Let's have another round of applause for Matthias. Thank you.