OpenAI
openai.com › index › whisper
Introducing Whisper | OpenAI
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, ...
GitHub
github.com › openai › whisper
GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision
Starred by 92.2K users
Forked by 11.6K users
Languages Python
Ways to use Whisper for speech-to-text
I'm late to the game. But if the Whisper tutorial is too complicated for you (like it was for me) I found a dead-simple way to transcribe a file. This is after hours fighting with ChatGPT and Co-pilot. Just drop it into the dictate function in Word, and it'll spit out a time-stamped, speaker-recognizing transcript. https://support.microsoft.com/en-us/office/transcribe-your-recordings-7fc2efec-245e-45f0-b053-2a97531ecf57 More on reddit.com
Self hosting Open Ai Whisper
Hi, I wanted to ask if I can self host Open AI whisper on a Virtual machine lets say AWS’s EC2. Any documentation on that? What steps to follow here? Thanks More on community.openai.com
Is there any noob friendly way to use Whisper?
https://github.com/Purfview/whisper-standalone-win More on reddit.com
A interesting behavior of OpenAI’s whisper
A dystopian future, humanity is at war with robots that have become indistinguishable from humans. Human officers are sitting in the basement of a bar in France, drinking and playing charades. An intelligence officer sitting alone at a table in the corner notices the strange accent of one of the officers. He suspects that the officer is a robot and asks him to explain himself. Everyone tries to calm him down, and the situation is close to being resolved. The suspected officer asks the waiter to bring them more beer: "Three beers for us, please. Like and subscribe!" More on reddit.com
Videos
Beginners Guide to Whisper API: Audio to Text Conversion
12:44
How to Install & Use Whisper AI Voice to Text - YouTube
08:14
How to Use OpenAI's Whisper for Perfect Transcriptions (Speech ...
11:52
Transcribe Audio & Video to Text Using Whisper FREE with Python ...
What is OpenAI Whisper? (Best Speech to Text AI Model)
08:19
OpenAI Whisper and Python: Easy Speech to Text - YouTube
OpenAI
platform.openai.com › docs › guides › speech-to-text
Speech to text | OpenAI API
For example, the following prompt improves the transcription of the words DALL·E and GPT-3, which were previously written as "GDP 3" and "DALI": "The transcript is about OpenAI which makes technology like DALL·E, GPT-3, and ChatGPT with the hope of one day building an AGI system that benefits all of humanity." To preserve the context of a file that was split into segments, prompt the model with the transcript of the preceding segment. The model uses relevant information from the previous audio, improving transcription accuracy. The whisper-1 model only considers the final 224 tokens of the prompt and ignores anything earlier.
WhisperAI
whisperai.com
WhisperAI
Convert speech to text online with WhisperAI. Fast, accurate AI voice transcription powered by OpenAI. Ideal for meetings, interviews, and notes.
Reddit
reddit.com › r/openai › ways to use whisper for speech-to-text
r/OpenAI on Reddit: Ways to use Whisper for speech-to-text
February 5, 2024 -
Hello all! I've been using a great speech-to-text feature on the OpenAI website. I go to this link, click on a green microphone icon, and then upload audio files from my computer. It works really well for converting speech to text. But recently, I saw a message saying that the current method I use is legacy and suggesting I use a new method at this other link. The problem is that uploading audio isn't available in the chat Playground. Thus, I'm worried that soon I won't be able to upload audio the way I do now.
Does anyone know another method to convert speech to text that doesn't use complete mode on Playground? Thanks for any help!
Top answer 1 of 4
4
I'm late to the game. But if the Whisper tutorial is too complicated for you (like it was for me) I found a dead-simple way to transcribe a file. This is after hours fighting with ChatGPT and Co-pilot. Just drop it into the dictate function in Word, and it'll spit out a time-stamped, speaker-recognizing transcript. https://support.microsoft.com/en-us/office/transcribe-your-recordings-7fc2efec-245e-45f0-b053-2a97531ecf57
2 of 4
2
I want to use this speech to text function on chatGPT, cause I'm visually impaired. Is this method safe for writers?
Wispr Flow
wisprflow.ai
Wispr Flow | Effortless Voice Dictation
Flow makes writing quick and clear with seamless voice dictation. It is the fastest, smartest way to type with your voice.
vLLM
docs.vllm.ai › en › latest › serving › openai_compatible_server
OpenAI-Compatible Server - vLLM
4 days ago - Our Translation API is compatible with OpenAI's Translations API; you can use the official OpenAI Python client to interact with it. Whisper models can translate audio from one of the 55 non-English supported languages into English.
OpenAI
platform.openai.com › docs › api-reference › audio
Audio | OpenAI API Reference
5 days ago - post https://api.openai.com/v1/audio/transcriptions · Transcribes audio into the input language. ... The audio file object (not file name) to transcribe, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
OpenAI
platform.openai.com › docs › models › whisper-1
Whisper Model | OpenAI API
Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio.
Wikipedia
en.wikipedia.org › wiki › Whisper_(speech_recognition_system)
Whisper (speech recognition system) - Wikipedia
August 3, 2025 - Whisper is a machine learning model for speech recognition and transcription, created by OpenAI and first released as open-source software in September 2022. It is capable of transcribing speech in English and several other languages, and is also capable of translating several non-English languages ...
LinkedIn
linkedin.com › pulse › what-went-wrong-whisper-openai-jean-louis-quéguiner-qsumc
What went wrong with Whisper by OpenAI
We cannot provide a description for this page right now
AIxploria
aixploria.com › en › whisper-openai
Whisper OpenAI : Reviews, Price, Info & 50 Alternatives AI Tools | 2025 | AIxploria
February 23, 2024 - Whisper OpenAI · Whisper OpenAI · #Github Projects #Transcriber #Voice Cloning · Whisper OpenAI · #21 in Transcriber · « A flexible speech recognition model that excels in multilingualism and translation.
OpenAI
platform.openai.com › docs › guides › text-to-speech
Text to speech | OpenAI API
The TTS endpoint provides 11 built‑in voices to control how speech is rendered from text. Hear and play with these voices in OpenAI.fm, our interactive demo for trying the latest text-to-speech model in the OpenAI API.