Whisper (speech recognition system)

Whisper (speech recognition system)
Whisper (speech recognition system)
Original author(s)	OpenAI
Initial release	September 21, 2022
Repository	github.com/openai/whisper
Written in	Python
Type	Transcription software; Encoder-decoder transformer; Foundation model; Acoustic model;
License	MIT License

Whisper is a machine learning model for speech recognition and transcription, created by OpenAI and first released as open-source software in September 2022.

It is capable of transcribing speech in English and several other languages, and is also capable of translating several non-English languages into English. OpenAI claims that the combination of different training data used in its development has led to improved recognition of accents, background noise and jargon compared to previous approaches.

Whisper is a weakly-supervised deep learning acoustic model, made using an encoder-decoder transformer architecture.

Whisper Large V2 was released on December 8, 2022. Whisper Large V3 was released in November 2023, on the OpenAI Dev Day.