square-bear-67002
05/08/2025, 6:25 PM
Can't get the user transcription when using the Gemini Live API. How to fix?
tall-airline-16756
05/08/2025, 6:25 PM
• Ensure that the user_transcription capability is enabled in the Gemini Realtime model. This can be configured through the input_audio_transcription parameter when initializing the RealtimeModel.
• Verify that the enable_user_audio_transcription option is set correctly. If enabled, a TranscriberSession should be created to handle the transcription.
• Check if the audio input is being processed correctly and that there are no issues with the audio format or quality.
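As a rough illustration of the first point, here is a minimal sketch of passing input_audio_transcription when initializing the RealtimeModel. It assumes the LiveKit Agents Google plugin (livekit-plugins-google, 1.x) and the google-genai package are installed and that a Google API key is available in the environment. The helper name build_gemini_realtime_model is hypothetical, and parameter names can differ between plugin versions, so check it against the version you have installed.

```python
def build_gemini_realtime_model(enable_user_transcription: bool = True):
    """Create a Gemini RealtimeModel with user (input) transcription on or off.

    Hypothetical helper for illustration; not part of any library.
    """
    # Imported lazily so this file still loads where the plugins are absent.
    # Assumption: livekit-plugins-google 1.x and google-genai are installed.
    from google.genai import types
    from livekit.plugins import google

    return google.beta.realtime.RealtimeModel(
        # An AudioTranscriptionConfig instance asks Gemini to transcribe the
        # user's audio; passing None is assumed to switch that off.
        input_audio_transcription=(
            types.AudioTranscriptionConfig() if enable_user_transcription else None
        ),
    )
```

Calling build_gemini_realtime_model() and passing the result as the llm of your agent session should then enable user transcription, provided the installed plugin version supports this parameter.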
If the issue persists, please provide more details about your setup and any error messages you might be encountering, so we can assist you further.
Sources: Tree View | dse-workbench
square-bear-67002
05/08/2025, 6:33 PM
tall-airline-16756
05/08/2025, 6:33 PM
When the user_transcription capability is enabled, the regular Speech-to-Text (STT) transcription is skipped. This is by design, as the Gemini transcription process takes over.
To resolve the issue, please ensure the following:
• Verify that the user_transcription capability is properly enabled and configured in your setup.
• Check if the Gemini transcription process is correctly implemented and receiving the audio input as expected.
• Ensure that there are no conflicts or errors in the configuration of the transcription settings.
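The behavior described above (STT is skipped once the user_transcription capability is on) can be pictured with a small stand-alone sketch. It is not the framework's actual code: RealtimeCapabilities and pick_transcription_source are hypothetical names used only to show the decision, and why a silent Gemini transcription leaves you with no STT fallback.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RealtimeCapabilities:
    """Hypothetical stand-in for the capability flags a realtime model reports."""

    user_transcription: bool


def pick_transcription_source(caps: RealtimeCapabilities, stt_configured: bool) -> str:
    """Return which component is expected to produce the user transcript."""
    if caps.user_transcription:
        # The realtime model transcribes the user's audio itself,
        # so any configured STT is skipped.
        return "realtime_model"
    if stt_configured:
        # Capability is off, so a regular STT handles transcription.
        return "stt"
    # Neither path is available: no user transcript will be produced.
    return "none"


if __name__ == "__main__":
    caps = RealtimeCapabilities(user_transcription=True)
    print(pick_transcription_source(caps, stt_configured=True))
```

If this logic lands on "realtime_model" in your setup but no transcript arrives, the problem is on the Gemini side (configuration or audio input) and not in your STT settings.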
If you continue to face issues, please provide additional details about your configuration and any error messages you are encountering, so we can assist you further.
Sources: Tree View: | dse-workbench