Hello, I’m configuring Whisperai as a provider on bazarr and I cannot get it to connect. I can type in the local ip address with port 9000 and I can see the interface. How can I fix the error “ConnectionError” on bazarr?
I've got whisper working in unraid using the whisper-asr-webservice container, but I've noticed there is another whisper model 'faster-whisper', that's supposedly 4x faster than the original.. but, I can't seem to get bazarr to talk to it.
Anyone had any luck using bazarr with linuxserver/faster-whisper?
Is it any good?
Bazarr Ultimate Subtitle Generation Guide
This guide outlines how to set up Bazarr for automatic, perfectly synced subtitle generation using Whisper-ASR and Aeneas. This solution is ideal for users who want precise subtitle synchronization without manually searching for synced subtitles. Note that this process works best for English source videos and subtitles.
Overview
This guide is perfect for users who:
Want perfectly synced subtitles for their media.
Don’t mind minor literal translation inaccuracies.
Prefer an automated pipeline for subtitle generation.
The setup involves leveraging Whisper-ASR for subtitle creation and Aeneas for fine-tuned subtitle synchronization.
Prerequisites
Before you begin, ensure you have the following:
Bazarr installed and configured with ARR tools like Sonarr and Radarr.
Basic familiarity with ARR workflows.
A system capable of running Whisper-ASR (preferably with GPU support for faster processing).
Steps to Set Up
1. Enable Custom Post-Processing
In Bazarr, add the following custom post-processing command in your configuration:
/config/postproces.sh "{{episode}}" "{{subtitles}}" "{{provider}}"2. Set Language Profile
The source language for this setup is English.
Non-English source languages are not currently supported.
3. Integrate Whisper Provider with Bazarr
Modify the postproces.sh script in your Bazarr config directory. The script should handle the following:
Identify if the subtitle provider is Whisper.
Run post-processing if provider is not embedded subtitles; otherwise, exit without making changes.
How It Works
Process Flow
The Flask app interacts with the video file and subtitle file.
It converts the video to
.mp3format and syncs subtitles using Aeneas.After processing:The original subtitle is replaced with the synced version.The intermediate
.mp3file is deleted.The resulting English subtitle file is perfectly synced with the video.
You can then use Bazarr’s translate option to convert these synced subtitles into other languages.
Implementation Details
The Flask app communicates with Aeneas running on
aeneas:5000.The script sends the following parameters:
series_path: Path to the video file.subtitle_path: Path to the subtitle file.provider: The subtitle provider.
Docker and GPU Support
If using Whisper-ASR, GPU support is highly recommended for efficiency.
Modify your Docker Compose file to specify the desired Whisper model or version.
Personal Pipeline Example
Input: English video source.
Processing:Generate English subtitles using Whisper-ASR.Sync subtitles perfectly with the video using Aeneas.
Output:Use Bazarr’s mass-translate feature to generate subtitles in other languages.
Notes & Tips
This solution is tailored for English source to English subtitle workflows (for now).
GPU support for Whisper is crucial for faster processing.
Tutorials for configuring Whisper with Docker Compose can be found on the Bazarr Wiki.
Why Use This Solution?
This project was born out of the frustration of finding high-quality, perfectly synced subtitles. With this setup, you can ensure a seamless experience for all your media.
Happy subtitle syncing!
https://github.com/nik-dev-ops/bazarr-ultimate-subs
UPDATE:
Hey FYI i've been working on custom provider for Bazarr. Downside is it needs to be built from source but container will handle that by it self, this is a sneak peak. ( imgur link below )
https://imgur.com/a/c02ltaT
I have been using it for a little over 48 hours and it generated 1150 subtitles in the meantime.
Having tried Spanish, English, and French shows. I can say that they are about 90-95% accurate, which beats no subs at all for me that has hearing issues.
Complete info here!
An example of the delay between generations:
I've noticed Bazarr now includes Whisper to generate AI subtitles. I haven't tested it yet but it seems promising. They say the subs are vastly superiour to e.g. Youtube subs.
Did anyone test it?
Been trying whisper a bit. Running whisper asr docker on my unraid server as well as bazarr docker. Using base model.
I just setup whisper provider and it seems to work.
I however have some questions. Are there supposed to be some way to see the progress of whisper?
Now there is a spinning download notification. Then I get some weird error message -<h1>504 Gateway Time-out</h1></center> <hr><center>openresty</center> </body> </html> <!-- a padding to disable MSIE and Chrome friendly error page -->
However it keeps on working in the background. I have 100% cpu usage for two minutes or so. When it settles I actually have a new subtitle.
My cpu is quite slow i3-8100 but it seems to work.
Are there no way to track status of the whisper process or is it something weird with my setup?