🌐
OpenAI
openai.com › index › whisper
Introducing Whisper | OpenAI
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, ...
🌐
GitHub
github.com › openai › whisper
GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
Robust Speech Recognition via Large-Scale Weak Supervision - openai/whisper
Starred by 92.2K users
Forked by 11.6K users
Languages   Python
Discussions

Ways to use Whisper for speech-to-text
I'm late to the game. But if the Whisper tutorial is too complicated for you (like it was for me) I found a dead-simple way to transcribe a file. This is after hours fighting with ChatGPT and Co-pilot. Just drop it into the dictate function in Word, and it'll spit out a time-stamped, speaker-recognizing transcript. https://support.microsoft.com/en-us/office/transcribe-your-recordings-7fc2efec-245e-45f0-b053-2a97531ecf57 More on reddit.com
🌐 r/OpenAI
45
56
February 5, 2024
Self hosting Open Ai Whisper
Hi, I wanted to ask if I can self host Open AI whisper on a Virtual machine lets say AWS’s EC2. Any documentation on that? What steps to follow here? Thanks More on community.openai.com
🌐 community.openai.com
0
August 21, 2025
Is there any noob friendly way to use Whisper?
https://github.com/Purfview/whisper-standalone-win More on reddit.com
🌐 r/OpenAI
47
21
March 22, 2024
A interesting behavior of OpenAI’s whisper
A dystopian future, humanity is at war with robots that have become indistinguishable from humans. Human officers are sitting in the basement of a bar in France, drinking and playing charades. An intelligence officer sitting alone at a table in the corner notices the strange accent of one of the officers. He suspects that the officer is a robot and asks him to explain himself. Everyone tries to calm him down, and the situation is close to being resolved. The suspected officer asks the waiter to bring them more beer: "Three beers for us, please. Like and subscribe!" More on reddit.com
🌐 r/LocalLLaMA
33
204
October 6, 2024
🌐
OpenAI
platform.openai.com › docs › guides › speech-to-text
Speech to text | OpenAI API
For example, the following prompt improves the transcription of the words DALL·E and GPT-3, which were previously written as "GDP 3" and "DALI": "The transcript is about OpenAI which makes technology like DALL·E, GPT-3, and ChatGPT with the hope of one day building an AGI system that benefits all of humanity." To preserve the context of a file that was split into segments, prompt the model with the transcript of the preceding segment. The model uses relevant information from the previous audio, improving transcription accuracy. The whisper-1 model only considers the final 224 tokens of the prompt and ignores anything earlier.
🌐
WhisperAI
whisperai.com
WhisperAI
Convert speech to text online with WhisperAI. Fast, accurate AI voice transcription powered by OpenAI. Ideal for meetings, interviews, and notes.
🌐
Reddit
reddit.com › r/openai › ways to use whisper for speech-to-text
r/OpenAI on Reddit: Ways to use Whisper for speech-to-text
February 5, 2024 -

Hello all! I've been using a great speech-to-text feature on the OpenAI website. I go to this link, click on a green microphone icon, and then upload audio files from my computer. It works really well for converting speech to text. But recently, I saw a message saying that the current method I use is legacy and suggesting I use a new method at this other link. The problem is that uploading audio isn't available in the chat Playground. Thus, I'm worried that soon I won't be able to upload audio the way I do now.

Does anyone know another method to convert speech to text that doesn't use complete mode on Playground? Thanks for any help!

🌐
Wispr Flow
wisprflow.ai
Wispr Flow | Effortless Voice Dictation
Flow makes writing quick and clear with seamless voice dictation. It is the fastest, smartest way to type with your voice.
🌐
Hugging Face
huggingface.co › spaces › openai › whisper
Whisper - a Hugging Face Space by openai
Upload or record an audio file, or provide a YouTube video link to convert it into text. Choose between transcribing or translating the audio.
Find elsewhere
🌐
vLLM
docs.vllm.ai › en › latest › serving › openai_compatible_server
OpenAI-Compatible Server - vLLM
4 days ago - Our Translation API is compatible with OpenAI's Translations API; you can use the official OpenAI Python client to interact with it. Whisper models can translate audio from one of the 55 non-English supported languages into English.
🌐
Hugging Face
huggingface.co › openai › whisper-large-v3
openai/whisper-large-v3 · Hugging Face
Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford et al. from OpenAI.
🌐
OpenAI
platform.openai.com › docs › api-reference › audio
Audio | OpenAI API Reference
5 days ago - post https://api.openai.com/v1/audio/transcriptions · Transcribes audio into the input language. ... The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
🌐
OpenAI
platform.openai.com › docs › models › whisper-1
Whisper Model | OpenAI API
Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio.
🌐
Wikipedia
en.wikipedia.org › wiki › Whisper_(speech_recognition_system)
Whisper (speech recognition system) - Wikipedia
August 3, 2025 - Whisper is a machine learning model for speech recognition and transcription, created by OpenAI and first released as open-source software in September 2022. It is capable of transcribing speech in English and several other languages, and is also capable of translating several non-English languages ...
🌐
Replicate
replicate.com › openai › whisper
Whisper AI by Open AI - Run with an API on Replicate
Whisper uses a Transformer sequence-to-sequence model trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection.
🌐
OpenAI Developer Community
community.openai.com › t › self-hosting-open-ai-whisper › 1353303
Self hosting Open Ai Whisper - Community - OpenAI Developer Community
August 21, 2025 - Hi, I wanted to ask if I can self host Open AI whisper on a Virtual machine lets say AWS’s EC2. Any documentation on that? What steps to follow here? Thanks
🌐
Medium
medium.com › @pouyahallaj › how-to-use-openais-whisper-in-just-3-lines-of-code-for-free-7b5c5dbe4863
How to Use OpenAI’s Whisper to Transcribe in Just 3 Lines of Code — for Free! | by Pouya Hallaj | Medium
May 4, 2023 - Transcribe speech to text with OpenAI’s Whisper in just 3 lines of Python code! Learn how to use this cutting-edge technology for free.
🌐
AIxploria
aixploria.com › en › whisper-openai
Whisper OpenAI : Reviews, Price, Info & 50 Alternatives AI Tools | 2025 | AIxploria
February 23, 2024 - Whisper OpenAI · Whisper OpenAI · #Github Projects #Transcriber #Voice Cloning · Whisper OpenAI · #21 in Transcriber · « A flexible speech recognition model that excels in multilingualism and translation.
Rating: 5 ​ - ​ 5 votes
🌐
Moveworks
moveworks.com › home › resources › ai glossary › what is openai’s whisper model
What is OpenAI’s Whisper model?
Whisper is an AI system developed by OpenAI to perform automatic speech recognition (ASR), the task of transcribing spoken language into text.
🌐
OpenAI
platform.openai.com › docs › guides › text-to-speech
Text to speech | OpenAI API
The TTS endpoint provides 11 built‑in voices to control how speech is rendered from text. Hear and play with these voices in OpenAI.fm, our interactive demo for trying the latest text-to-speech model in the OpenAI API.
🌐
Notta
notta.ai › en › blog › how-to-use-whisper
How to Use Whisper AI: The Only Guide You Need
A step-by-step look into how to use Whisper AI from start to finish. Learn to install Whisper into your Windows device and transcribe a voice file.