Can we run NeMo MSDD Neural Diarizer model in realtime for realtime diarization? #9439

ThiruRJST · 2024-06-11T17:08:57Z

Disclaimer
I'm only looking for input on whether the pipeline described below is feasible to build.

Requirements
I'm currently working on a project where diarization needs to run in real time. I tried the PyAnnote library, but it could not match the accuracy of MSDD, and we weren't able to run MSDD in real time. Is there an example of how to do this? We need to run MSDD alongside a Whisper model for transcription.
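To make the requirement concrete, here is a minimal sketch of one way such a pipeline is sometimes approximated: re-running an offline diarizer over a rolling window of the most recent audio. This is not online MSDD and it uses no NeMo API. `RollingDiarizer`, `diarize_fn`, and `fake_diarize` are hypothetical names introduced only for illustration, and `diarize_fn` stands in for whatever offline diarization call is actually available.

```python
from collections import deque
from typing import Callable, List, Sequence, Tuple

# (start_sec, end_sec, speaker_label)
Segment = Tuple[float, float, str]


class RollingDiarizer:
    """Hypothetical helper: re-run an offline diarizer on a rolling audio window."""

    def __init__(
        self,
        diarize_fn: Callable[[List[float], int], List[Segment]],
        sample_rate: int = 16000,
        window_sec: float = 10.0,
        hop_sec: float = 2.0,
    ) -> None:
        self.diarize_fn = diarize_fn  # assumption: any offline diarizer wrapped as a function
        self.sample_rate = sample_rate
        self.hop = int(hop_sec * sample_rate)
        # The deque drops the oldest samples once the window is full.
        self.buffer: deque = deque(maxlen=int(window_sec * sample_rate))
        self.total = 0        # samples received since the stream started
        self.since_last = 0   # samples received since the last diarization run

    def push(self, chunk: Sequence[float]) -> List[Segment]:
        """Add samples; return segments in absolute stream time, or [] if no run was due."""
        self.buffer.extend(chunk)
        self.total += len(chunk)
        self.since_last += len(chunk)
        if self.since_last < self.hop:
            return []
        self.since_last = 0
        # Time (in seconds) at which the current window starts in the stream.
        offset = (self.total - len(self.buffer)) / self.sample_rate
        local = self.diarize_fn(list(self.buffer), self.sample_rate)
        return [(start + offset, end + offset, spk) for start, end, spk in local]


def fake_diarize(samples: List[float], sample_rate: int) -> List[Segment]:
    # Placeholder only: labels the whole window as a single speaker.
    return [(0.0, len(samples) / sample_rate, "speaker_0")]
```

Two caveats with this approach: latency is bounded below by the hop size plus the time one offline run takes, and speaker labels from independent windows are not guaranteed to be consistent, so a real system would need some way to match speakers across windows. The returned time spans could then be used to attribute Whisper transcript segments to speakers.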

MahmoudAshraf97 · 2024-06-13T15:00:31Z

Check these two PRs: #5609 and #7896, although I couldn't get them to work.

nithinraok · 2024-06-14T22:32:57Z

Thanks for your interest. No, we currently don't support online speaker diarization using MSDD.

ThiruRJST · 2024-06-21T06:40:50Z

@nithinraok Thank you, Nithin, for the confirmation.

ThiruRJST assigned okuchaiev Jun 11, 2024

