Hugging Face
huggingface.co › pyannote › speaker-diarization
pyannote/speaker-diarization · Hugging Face
This report describes the main principles behind version 2.1 of pyannote.audio speaker diarization pipeline. It also provides recipes explaining how to adapt the pipeline to your own set of annotated data.
GitHub
github.com › huggingface › diarizers
GitHub - huggingface/diarizers
It can be used to improve performance on both English and multilingual diarization datasets with simple example scripts, with as little as ten hours of labelled diarization data and just 5 minutes of GPU compute time.
Starred by 319 users
Forked by 22 users
Languages Python 98.9% | Makefile 1.1%
Hugging Face
huggingface.co › Revai › reverb-diarization-v2
Revai/reverb-diarization-v2 · Hugging Face
Reverb diarization V2 provides a 22.25% relative improvement in WDER (Word Diarization Error Rate) compared to the baseline pyannote3.0 model, evaluated on over 1,250,000 tokens across five different test suites. # taken from https://huggingface.co/pyannote/speaker-diarization-3.1 - see for more details # instantiate the pipeline from pyannote.audio import Pipeline pipeline = Pipeline.from_pretrained( "Revai/reverb-diarization-v2", use_auth_token="HUGGINGFACE_ACCESS_TOKEN_GOES_HERE") # run the pipeline on an audio file diarization = pipeline("audio.wav") # dump the diarization output to disk using RTTM format with open("audio.rttm", "w") as rttm: diarization.write_rttm(rttm)
Hugging Face
huggingface.co › papers › 2401.03506
Paper page - DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
In this paper, we introduce DiarizationLM, a framework to leverage large language models (LLM) to post-process the outputs from a speaker diarization system. Various goals can be achieved with the proposed framework, such as improving the readability of the diarized transcript, or reducing the word diarization error rate (WDER).
Hugging Face
huggingface.co › models
Speaker Diarization
Speaker Diarization · Inference Endpoints · text-generation-inference · Eval Results · Merge · 4-bit precision · custom_code · 8-bit precision · text-embeddings-inference · Mixture of Experts · Carbon Emissions · Apply filters · 1 · Full-text search Inference Available ·