Can we run NeMo MSDD Neural Diarizer model in realtime for realtime diarization? #9439

Open
ThiruRJST opened this issue Jun 11, 2024 · 3 comments

@ThiruRJST

Disclaimer
I would just like some input on whether the pipeline described below is feasible and how to develop it.

Requirements
I'm currently working on a project where I need diarization to work in real time. I tried the PyAnnote library, but it could not match the accuracy of MSDD, and we weren't able to run MSDD in real time. Is there an example of how to do this? We need to run MSDD alongside a Whisper model for transcription.

@MahmoudAshraf97

Check these two PRs: #5609 and #7896, although I couldn't get them to work.

@nithinraok
Collaborator

Thanks for your interest. No, we currently don't support online speaker diarization using MSDD.

@ThiruRJST
Author

@nithinraok Thank you, Nithin, for the confirmation.
