speaker-diarization

Here are 117 public repositories matching this topic...

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jan 1, 2026
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Jan 1, 2026
Python

espnet / espnet

End-to-End Speech Processing Toolkit

text-to-speech deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated Dec 16, 2025
Python

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Updated Sep 9, 2025
Python

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker cnceleb sdpn

Updated Dec 8, 2025
Python

juanmc2005 / diart

A Python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Feb 12, 2025
Python

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

machine-learning clustering supervised-learning speaker-recognition speaker-diarization supervised-clustering uis-rnn

Updated Sep 25, 2024
Python

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Updated Dec 31, 2025
Python

transcriptionstream / transcriptionstream

Turnkey self-hosted offline transcription and diarization service with LLM summary

automation speech-recognition transcription whisper speaker-diarization diarization llm whisperx ollama mistral-7b

Updated Sep 25, 2024
Python

yinruiqing / pyannote-whisper

whisper asr speaker-diarization meeting-summarization pyannote chatgpt

Updated Sep 24, 2025
Python

FunAudioLLM / Fun-ASR

Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.

audio pytorch speech-recognition speaker-diarization multimodal-large-language-models audio-understanding audio-language-model fun-asr

Updated Dec 31, 2025
Python

wq2012 / SpectralCluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

python machine-learning clustering unsupervised-learning constrained-clustering speaker-diarization spectral-clustering unsupervised-clustering auto-tune

Updated Sep 25, 2024
Python

taylorlu / Speaker-Diarization

Speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

speaker-recognition speaker-diarization uis-rnn ghostvlad vgg-speaker-recognition

Updated Jul 1, 2021
Python

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

speaker-recognition speaker-verification source-separation speaker-diarization speaker-identification

Updated Aug 12, 2025
Python

revdotcom / reverb

Open source inference code for Rev's model

docker open-source opensource neural-network canary speech-recognition deeplearning speech-to-text whisper rev asr speaker-diarization speechrecognition asr-model diarization huggingface revai pyannote wenet

Updated Apr 22, 2025
Python

hitachi-speech / EEND

End-to-End Neural Diarization

machine-learning deep-learning chainer end-to-end kaldi speaker-diarization eend

Updated Aug 30, 2021
Python

nuaazs / VAF_2

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

microservices speech-recognition speaker-recognition antifraud speaker-diarization

Updated Apr 16, 2024
Python

manojpamk / pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speaker-recognition speaker-verification speaker-diarization speaker-embeddings

Updated Nov 11, 2020
Python

NavodPeiris / speechlib

speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names.

ai automatic-speech-recognition transcription speaker-recognition speaker-verification speaker-diarization whisper-ai faster-whisper

Updated Aug 16, 2025
Python

cvqluu / TDNN

Time delay neural network (TDNN) implementation in PyTorch using the unfold method

pytorch speech-recognition speaker-recognition speaker-verification speech-processing asr speaker-diarization tdnn x-vector

Updated Nov 21, 2019
Python
