Separate multiple speaker in mono audio files
Efficent separation for noisy medical condition.
Changelog
We are excited to unveil Monster, our new speaker diarization system, Nijta’s latest innovation in audio segmentation that redefines both accuracy and efficiency. This first release marks a major step forward in speaker diarization, offering precise segmentation, multilingual support, and robust performance on your noisy data. From medical conversations to customer service calls and teleconference, our advanced model adapts to various acoustic conditions, ensuring high accuracy where other diarization systems fall short.
Diarization Benchmark – CALLHOME Eval Set
Model / Dataset | de | en | sp | jp | zh |
---|---|---|---|---|---|
Pyanote 3.1 | 23 | 31 | 34 | 42 | 26 |
Voice Harbor V3.0 | 26 | 31 | 35 | 43 | 26 |
revAI | 27 | 35 | 38 | 44 | 33 |
Pyanote AI | 19 | 16 | 18 | 21 | 17 |
Voice Harbor V3.0-Large | 18 | 22 | 26 | 31 | 22 |
Bolded values indicate best performance for the language.
Features
- Precise speaker diarization
- Seamless handling of overlapping speech
- Robust performance across varied acoustic environments
- Unlimited number of speakers
Diarize and Redact PHI in audio, followed by transcription using Python SDK
Set Up the Client and Create a Job Once you have the necessary setup, you can create a job on the server and interact with the API. Here’s how to do it:
Next, initialize the client and define the parameters for your job, including the files you want to submit for transcription.
Now, submit the job and wait for the transcription results. Here’s how to complete the process: