pyannote

17 models • 9 total models in database
Sort by:

segmentation-3.0

16,996,259
652

wespeaker-voxceleb-resnet34-LM

Using this open-source model in production? Consider switching to pyannoteAI for better and faster options. This model requires `pyannote.audio` version 3.1 or higher. This is a wrapper around WeSpeaker `wespeaker-voxceleb-resnet34-LM` pretrained speaker embedding model, for use in `pyannote.audio`. > The pretrained model in WeNet follows the license of it's corresponding dataset. For example, the pretrained model on VoxCeleb follows Creative Commons Attribution 4.0 International License., since it is used as license of the VoxCeleb dataset, see https://mm.kaist.ac.kr/datasets/voxceleb/.

13,556,928
83

speaker-diarization-3.1

12,792,608
1,286

segmentation

--- tags: - pyannote - pyannote-audio - pyannote-audio-model - audio - voice - speech - speaker - speaker-segmentation - voice-activity-detection - overlapped-speech-detection - resegmentation license: mit inference: false extra_gated_prompt: "The collected information will help acquire a better knowledge of pyannote.audio userbase and help its maintainers apply for grants to improve it further. If you are an academic researcher, please cite the relevant papers in your own publications using the

license:mit
1,766,987
655

embedding

--- tags: - pyannote - pyannote-audio - pyannote-audio-model - audio - voice - speech - speaker - speaker-recognition - speaker-verification - speaker-identification - speaker-embedding datasets: - voxceleb license: mit inference: false extra_gated_prompt: "The collected information will help acquire a better knowledge of pyannote.audio userbase and help its maintainers apply for grants to improve it further. If you are an academic researcher, please cite the relevant papers in your own publicat

license:mit
991,546
175

speaker-diarization

--- tags: - pyannote - pyannote-audio - pyannote-audio-pipeline - audio - voice - speech - speaker - speaker-diarization - speaker-change-detection - voice-activity-detection - overlapped-speech-detection - automatic-speech-recognition datasets: - ami - dihard - voxconverse - aishell - repere - voxceleb license: mit extra_gated_prompt: "The collected information will help acquire a better knowledge of pyannote.audio userbase and help its maintainers apply for grants to improve it further. If you

license:mit
811,184
1,188

voice-activity-detection

--- tags: - pyannote - pyannote-audio - pyannote-audio-pipeline - audio - voice - speech - speaker - voice-activity-detection - automatic-speech-recognition datasets: - ami - dihard - voxconverse license: mit extra_gated_prompt: "The collected information will help acquire a better knowledge of pyannote.audio userbase and help its maintainers apply for grants to improve it further. If you are an academic researcher, please cite the relevant papers in your own publications using the model. If you

license:mit
648,113
215

speaker-diarization-3.0

--- tags: - pyannote - pyannote-audio - pyannote-audio-pipeline - audio - voice - speech - speaker - speaker-diarization - speaker-change-detection - voice-activity-detection - overlapped-speech-detection - automatic-speech-recognition license: mit extra_gated_prompt: "The collected information will help acquire a better knowledge of pyannote.audio userbase and help its maintainers improve it further. Though this pipeline uses MIT license and will always remain open-source, we will occasionnally

license:mit
632,565
203

overlapped-speech-detection

license:mit
168,268
48

speaker-diarization-community-1

license:cc-by-4.0
148,371
79

brouhaha

dataset:MIT-Acoustical-Reverberation-Scene
77,140
25

speaker-diarization-precision-2

30,812
4

speech-separation-ami-1.0

license:mit
4,663
67

speaker-segmentation

license:mit
921
36

speaker-diarization-community-1-cloud

58
0

ci-segmentation

license:mit
3
1

separation-ami-1.0

license:mit
0
12