Brave Search

Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, ...

GitHub

github.com › openai › whisper

GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision

Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper

Starred by 92.2K users

Forked by 11.6K users

Languages: Python

Discussions

Ways to use Whisper for speech-to-text

I'm late to the game. But if the Whisper tutorial is too complicated for you (like it was for me) I found a dead-simple way to transcribe a file. This is after hours fighting with ChatGPT and Co-pilot. Just drop it into the dictate function in Word, and it'll spit out a time-stamped, speaker-recognizing transcript. https://support.microsoft.com/en-us/office/transcribe-your-recordings-7fc2efec-245e-45f0-b053-2a97531ecf57 More on reddit.com

r/OpenAI

45

56

February 5, 2024

Self hosting Open Ai Whisper

Hi, I wanted to ask if I can self host Open AI whisper on a Virtual machine lets say AWS’s EC2. Any documentation on that? What steps to follow here? Thanks More on community.openai.com

community.openai.com

0

August 21, 2025

Is there any noob friendly way to use Whisper?

https://github.com/Purfview/whisper-standalone-win More on reddit.com

r/OpenAI

47

21

March 22, 2024

A interesting behavior of OpenAI’s whisper

A dystopian future, humanity is at war with robots that have become indistinguishable from humans. Human officers are sitting in the basement of a bar in France, drinking and playing charades. An intelligence officer sitting alone at a table in the corner notices the strange accent of one of the officers. He suspects that the officer is a robot and asks him to explain himself. Everyone tries to calm him down, and the situation is close to being resolved. The suspected officer asks the waiter to bring them more beer: "Three beers for us, please. Like and subscribe!" More on reddit.com

r/LocalLLaMA

33

204

October 6, 2024

Videos

youtube.com

Beginners Guide to Whisper API: Audio to Text Conversion

12:44

YouTube

How to Install & Use Whisper AI Voice to Text - YouTube

How to Use OpenAI's Whisper for Perfect Transcriptions (Speech ...

October 8, 2025

11:52

YouTube

Transcribe Audio & Video to Text Using Whisper FREE with Python ...

2 weeks ago

youtube.com

What is OpenAI Whisper? (Best Speech to Text AI Model)

08:19

YouTube

OpenAI Whisper and Python: Easy Speech to Text - YouTube

October 10, 2023

OpenAI

platform.openai.com › docs › guides › speech-to-text

Speech to text | OpenAI API

For example, the following prompt improves the transcription of the words DALL·E and GPT-3, which were previously written as "GDP 3" and "DALI": "The transcript is about OpenAI which makes technology like DALL·E, GPT-3, and ChatGPT with the hope of one day building an AGI system that benefits all of humanity." To preserve the context of a file that was split into segments, prompt the model with the transcript of the preceding segment. The model uses relevant information from the previous audio, improving transcription accuracy. The whisper-1 model only considers the final 224 tokens of the prompt and ignores anything earlier.
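
The segment-chaining advice in the snippet above can be sketched as a small helper that keeps only the tail of the previous segment's transcript before reusing it as a prompt. This is a minimal sketch, not the API's own logic: the function name `tail_prompt` is hypothetical, and whitespace-separated words stand in for tokens. Because every word is at least one token, keeping the last 224 words should not drop text that falls within the final 224 tokens, but the exact cut-off depends on the model's tokenizer.

```python
PROMPT_TOKEN_LIMIT = 224  # per the guide: whisper-1 considers only the final 224 prompt tokens


def tail_prompt(previous_transcript: str, max_words: int = PROMPT_TOKEN_LIMIT) -> str:
    """Return the last `max_words` words of the preceding segment's transcript.

    Assumption: words are a rough stand-in for tokens; a real tokenizer
    usually produces more tokens than words, so this trims conservatively.
    """
    if max_words <= 0:
        return ""
    words = previous_transcript.split()
    # Slicing from the end keeps the most recent context, which is the
    # part the snippet says the model actually reads.
    return " ".join(words[-max_words:])
```

The returned string would then be passed as the prompt for the next segment's transcription request.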

WhisperAI

whisperai.com

WhisperAI

Convert speech to text online with WhisperAI. Fast, accurate AI voice transcription powered by OpenAI. Ideal for meetings, interviews, and notes.

reddit.com › r/openai › ways to use whisper for speech-to-text

r/OpenAI on Reddit: Ways to use Whisper for speech-to-text

February 5, 2024

Hello all! I've been using a great speech-to-text feature on the OpenAI website. I go to this link, click on a green microphone icon, and then upload audio files from my computer. It works really well for converting speech to text. But recently, I saw a message saying that the current method I use is legacy and suggesting I use a new method at this other link. The problem is that uploading audio isn't available in the chat Playground. Thus, I'm worried that soon I won't be able to upload audio the way I do now.

Does anyone know another method to convert speech to text that doesn't use complete mode on Playground? Thanks for any help!

Top answer

1 of 4

4

I'm late to the game. But if the Whisper tutorial is too complicated for you (like it was for me) I found a dead-simple way to transcribe a file. This is after hours fighting with ChatGPT and Co-pilot. Just drop it into the dictate function in Word, and it'll spit out a time-stamped, speaker-recognizing transcript. https://support.microsoft.com/en-us/office/transcribe-your-recordings-7fc2efec-245e-45f0-b053-2a97531ecf57

2 of 4

2

I want to use this speech to text function on chatGPT, cause I'm visually impaired. Is this method safe for writers?

Wispr Flow

wisprflow.ai

Wispr Flow | Effortless Voice Dictation

Flow makes writing quick and clear with seamless voice dictation. It is the fastest, smartest way to type with your voice.

Hugging Face

huggingface.co › spaces › openai › whisper

Whisper - a Hugging Face Space by openai

Upload or record an audio file, or provide a YouTube video link to convert it into text. Choose between transcribing or translating the audio.

vLLM

docs.vllm.ai › en › latest › serving › openai_compatible_server

OpenAI-Compatible Server - vLLM

4 days ago - Our Translation API is compatible with OpenAI's Translations API; you can use the official OpenAI Python client to interact with it. Whisper models can translate audio from one of the 55 non-English supported languages into English.

Hugging Face

huggingface.co › openai › whisper-large-v3

openai/whisper-large-v3 · Hugging Face

Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford et al. from OpenAI.

OpenAI

platform.openai.com › docs › api-reference › audio

Audio | OpenAI API Reference

5 days ago - post https://api.openai.com/v1/audio/transcriptions · Transcribes audio into the input language. ... The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
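
As a rough illustration of the format list in the snippet above, a local pre-check might compare a file's extension against those formats before uploading. This is only a sketch: the helper name `is_supported_audio` is hypothetical, and an extension check says nothing about the file's actual contents.

```python
from pathlib import Path

# Formats listed in the API reference snippet for the transcriptions endpoint.
SUPPORTED_FORMATS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}


def is_supported_audio(filename: str) -> bool:
    """Return True if the file extension matches one of the listed formats."""
    # Path.suffix includes the leading dot (".mp3"); strip it and ignore case.
    extension = Path(filename).suffix.lower().lstrip(".")
    return extension in SUPPORTED_FORMATS
```

A file such as `talk.MP3` passes the check, while `notes.txt` or a name with no extension does not.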

OpenAI

platform.openai.com › docs › models › whisper-1

Whisper Model | OpenAI API

Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio.

Wikipedia

en.wikipedia.org › wiki › Whisper_(speech_recognition_system)

Whisper (speech recognition system) - Wikipedia

August 3, 2025 - Whisper is a machine learning model for speech recognition and transcription, created by OpenAI and first released as open-source software in September 2022. It is capable of transcribing speech in English and several other languages, and is also capable of translating several non-English languages ...

replicate.com › openai › whisper

Whisper AI by Open AI - Run with an API on Replicate

Whisper uses a Transformer sequence-to-sequence model trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection.

OpenAI Developer Community

community.openai.com › t › self-hosting-open-ai-whisper › 1353303

Self hosting Open Ai Whisper - Community - OpenAI Developer Community

August 21, 2025 - Hi, I wanted to ask if I can self host Open AI whisper on a Virtual machine lets say AWS’s EC2. Any documentation on that? What steps to follow here? Thanks

Medium

medium.com › @pouyahallaj › how-to-use-openais-whisper-in-just-3-lines-of-code-for-free-7b5c5dbe4863

How to Use OpenAI’s Whisper to Transcribe in Just 3 Lines of Code — for Free! | by Pouya Hallaj | Medium

May 4, 2023 - Transcribe speech to text with OpenAI’s Whisper in just 3 lines of Python code! Learn how to use this cutting-edge technology for free.

linkedin.com › pulse › what-went-wrong-whisper-openai-jean-louis-quéguiner-qsumc

What went wrong with Whisper by OpenAI

AIxploria

aixploria.com › en › whisper-openai

Whisper OpenAI : Reviews, Price, Info & 50 Alternatives AI Tools | 2025 | AIxploria

February 23, 2024 - Whisper OpenAI · #Github Projects #Transcriber #Voice Cloning · #21 in Transcriber · "A flexible speech recognition model that excels in multilingualism and translation."

Rating: 5 - 5 votes

Moveworks

moveworks.com › home › resources › ai glossary › what is openai’s whisper model

What is OpenAI’s Whisper model?

Whisper is an AI system developed by OpenAI to perform automatic speech recognition (ASR), the task of transcribing spoken language into text.

OpenAI

platform.openai.com › docs › guides › text-to-speech

Text to speech | OpenAI API

The TTS endpoint provides 11 built‑in voices to control how speech is rendered from text. Hear and play with these voices in OpenAI.fm, our interactive demo for trying the latest text-to-speech model in the OpenAI API.

Notta

notta.ai › en › blog › how-to-use-whisper

How to Use Whisper AI: The Only Guide You Need

A step-by-step look into how to use Whisper AI from start to finish. Learn to install Whisper into your Windows device and transcribe a voice file.