LiveKit Community

Thanks for reaching out, faiq. I'm an AI assistant and I'm here to help. Give me a moment while I research this topic.


The `diarize` parameter in Speech-to-Text (STT) enables *speaker diarization*, which is the process of identifying and distinguishing between different speakers in an audio recording. When you set `diarize` to `true` (or `"speaker"` in some integrations), the STT system will:
• Identify different speakers in the audio
• Label each segment of the transcript with a speaker identifier (such as "S1", "S2")
• Attribute each part of the transcribed text to the detected speaker
This feature is useful for scenarios like meetings, interviews, or conversations involving more than one person, as it allows you to track who spoke each part of the transcript.
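To make the speaker labeling above concrete, here is a minimal sketch of turning speaker-labeled segments into readable turns. The `DiarizedSegment` and `SpeakerTurn` types and the `groupIntoTurns` helper are hypothetical names for illustration only, not part of any LiveKit plugin API; the real event shape depends on the STT provider.

```typescript
// Hypothetical shape of a diarized transcript segment. The field names are
// assumptions for illustration and are not taken from a LiveKit plugin.
interface DiarizedSegment {
  speaker: string; // speaker identifier such as "S1" or "S2"
  text: string;
}

interface SpeakerTurn {
  speaker: string;
  text: string;
}

// Merge consecutive segments from the same speaker into a single turn.
function groupIntoTurns(segments: DiarizedSegment[]): SpeakerTurn[] {
  const turns: SpeakerTurn[] = [];
  for (const seg of segments) {
    const last = turns[turns.length - 1];
    if (last !== undefined && last.speaker === seg.speaker) {
      // Same speaker is still talking: extend the current turn.
      last.text = `${last.text} ${seg.text}`;
    } else {
      // Speaker changed (or this is the first segment): start a new turn.
      turns.push({ speaker: seg.speaker, text: seg.text });
    }
  }
  return turns;
}

const sampleSegments: DiarizedSegment[] = [
  { speaker: "S1", text: "Hi," },
  { speaker: "S1", text: "thanks for joining." },
  { speaker: "S2", text: "Happy to be here." },
  { speaker: "S1", text: "Let's get started." },
];

for (const turn of groupIntoTurns(sampleSegments)) {
  console.log(`${turn.speaker}: ${turn.text}`);
}
```

Grouping by consecutive speaker keeps the conversation order intact, which is usually what you want for meeting or interview transcripts.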
In LiveKit's Deepgram plugin, `diarize` is a boolean option (`diarize: true`). For Speechmatics, diarization is instead enabled via `diarization: "speaker"`, together with related options such as `max_speakers`. When diarization is enabled, you can also read the speaker identifier alongside each transcription event, which lets your application handle each speaker's text separately.
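As a rough sketch of how the two option styles differ, the snippet below models them with local stand-in types. These types only mirror the option names quoted above (`diarize`, `diarization`, `max_speakers`); they are assumptions for illustration and are not imported from the actual plugins, so check the linked references for the real option types. The `diarizationEnabled` helper is likewise hypothetical.

```typescript
// Local stand-in types. They mirror the option names mentioned in this
// answer and are NOT the plugins' real exported types.
interface DeepgramLikeOptions {
  diarize?: boolean;
}

interface SpeechmaticsLikeOptions {
  diarization?: "speaker";
  max_speakers?: number;
}

type SttDiarizationConfig =
  | { provider: "deepgram"; options: DeepgramLikeOptions }
  | { provider: "speechmatics"; options: SpeechmaticsLikeOptions };

// Hypothetical helper: report whether a config turns diarization on,
// regardless of which option style the provider uses.
function diarizationEnabled(config: SttDiarizationConfig): boolean {
  switch (config.provider) {
    case "deepgram":
      return config.options.diarize === true;
    case "speechmatics":
      return config.options.diarization === "speaker";
  }
}

const deepgramConfig: SttDiarizationConfig = {
  provider: "deepgram",
  options: { diarize: true },
};

const speechmaticsConfig: SttDiarizationConfig = {
  provider: "speechmatics",
  options: { diarization: "speaker", max_speakers: 4 },
};

console.log(diarizationEnabled(deepgramConfig));
console.log(diarizationEnabled(speechmaticsConfig));
```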
*References:*
<https://github.com/livekit/agents-js/blob/27d12faa5844fbf1c64ad9528f1b0ad1b0844d4c/plugins/deepgram/src/stt.ts#L1-L81|plugins/deepgram/src/stt.ts> | <https://docs.livekit.io/agents/integrations/speechmatics/|Speechmatics integration guide | LiveKit Docs> | <https://docs.livekit.io/agents-js/interfaces/plugins_agents_plugin_deepgram.STTOptions.html|STTOptions | LiveKit Agents> | <https://docs.livekit.io/agents/integrations/stt/speechmatics/|Speechmatics STT integration guide | LiveKit Docs>