Hugging Face
huggingface.co › pyannote › speaker-diarization
pyannote/speaker-diarization · Hugging Face
This report describes the main principles behind version 2.1 of pyannote.audio speaker diarization pipeline. It also provides recipes explaining how to adapt the pipeline to your own set of annotated data.
GitHub
github.com › huggingface › diarizers
GitHub - huggingface/diarizers
It can be used to improve performance on both English and multilingual diarization datasets with simple example scripts, with as little as ten hours of labelled diarization data and just 5 minutes of GPU compute time.
Starred by 319 users
Forked by 22 users
Languages Python 98.9% | Makefile 1.1%
Hugging Face
huggingface.co › Revai › reverb-diarization-v2
Revai/reverb-diarization-v2 · Hugging Face
Reverb diarization V2 provides a 22.25% relative improvement in WDER (Word Diarization Error Rate) compared to the baseline pyannote3.0 model, evaluated on over 1,250,000 tokens across five different test suites. # taken from https://huggingface.co/pyannote/speaker-diarization-3.1 - see for more details # instantiate the pipeline from pyannote.audio import Pipeline pipeline = Pipeline.from_pretrained( "Revai/reverb-diarization-v2", use_auth_token="HUGGINGFACE_ACCESS_TOKEN_GOES_HERE") # run the pipeline on an audio file diarization = pipeline("audio.wav") # dump the diarization output to disk using RTTM format with open("audio.rttm", "w") as rttm: diarization.write_rttm(rttm)
Hugging Face
huggingface.co › models
Speaker Diarization
Speaker Diarization · Inference Endpoints · text-generation-inference · Eval Results · Merge · 4-bit precision · custom_code · 8-bit precision · text-embeddings-inference · Mixture of Experts · Carbon Emissions · Apply filters · 1 · Full-text search Inference Available ·