Introducing Whisper

Other existing approaches frequently use smaller, more closely paired audio-text training datasets,[^reference-1][^reference-2] or use broad but unsupervised audio pretraining.[^reference-4][^reference-6] Because Whisper was trained on a large and diverse dataset and was not fine-tuned to any specific one, it does not beat models that specialize in LibriSpeech performance, a famously competitive benchmark in speech recognition. However, when we measure Whisper’s zero-shot performance across many diverse datasets, we find it is much more robust and makes 50% fewer errors than those models.
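
To make the comparison concrete, the errors behind claims like “50% fewer errors” are measured as word error rate (WER): the number of word substitutions, insertions, and deletions divided by the number of words in the reference transcript. Below is a minimal sketch of a WER computation; the `jiwer` package and the sample strings are illustrative assumptions, not the evaluation code used for Whisper itself.

```python
# Hedged sketch of a word-error-rate (WER) check; `jiwer` and the
# example strings are assumptions for illustration only.
from jiwer import wer

reference = "the quick brown fox jumps over the lazy dog"
hypothesis = "the quick brown fox jumped over a lazy dog"

# WER = (substitutions + insertions + deletions) / reference word count;
# here there are 2 substitutions over 9 reference words, about 22%.
print(f"WER: {wer(reference, hypothesis):.2%}")
```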

About a third of Whisper’s audio dataset is non-English, and the model is alternately given the task of transcribing in the original language or translating to English. We find this approach is particularly effective at learning speech-to-text translation, and it outperforms the supervised SOTA on CoVoST2-to-English translation in the zero-shot setting.
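
The released open-source `whisper` Python package exposes both of these tasks through the same interface. A minimal sketch is below; the checkpoint size and the audio file path are placeholder assumptions.

```python
# Minimal sketch using the open-source `whisper` package
# (https://github.com/openai/whisper); the checkpoint size and the
# audio file path are placeholder assumptions.
import whisper

model = whisper.load_model("small")  # any multilingual checkpoint works

# Task 1: transcribe the audio in its original spoken language.
result = model.transcribe("speech.mp3", task="transcribe")
print(result["language"], result["text"])

# Task 2: translate the same audio directly into English.
result = model.transcribe("speech.mp3", task="translate")
print(result["text"])
```

Internally, the task is selected by a special token at the start of decoding, which is what lets a single model serve both jobs.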
