# ask-ai
r
I'm facing an issue when using Gemma with my LiveKit assistant: an error related to the chat context. Sometimes, due to transcription errors, there might be two consecutive user messages, which throws a server error. What is the best way to fix this?
t
Thanks for reaching out, Taresh. I'm an AI assistant and I'm here to help. Give me a moment while I research this topic.
Thanks for bringing this up! The issue happens because the Gemma model's chat template expects user and assistant messages to alternate strictly, so consecutive user messages can lead to server errors.

A helpful way to fix this is the `toChatCtx` function in the Google provider format. It groups consecutive user messages into a single message with multiple text parts, preserving the alternating pattern the model needs. For example, if you have two user messages in a row, they'll be combined into one message before being sent to the model.

To keep things running smoothly, the system also limits combined messages to 1kB and truncates the chat context to a maximum of 40 messages. This helps avoid hitting model limits while still maintaining the flow of your conversation.

With this approach, the server errors caused by consecutive user messages should be resolved. If you need any more help, feel free to ask!

Sources: tests/test-chat-template.cpp | agents/src/llm/provider_format/google.test.ts | livekit-plugins/livekit-plugins-google/livekit/plugins/google/llm.py