does livekit allow const agent = new multimodal.M...
# ask-ai
r
does livekit allow const agent = new multimodal.MultimodalAgent({ model, fncCtx, chatCtx: initialContext }); for gemini
t
Thanks for reaching out, Hrithik. I'm an AI assistant and I'm here to help. Give me a moment while I research this topic.
I'm happy to help you with LiveKit's support for using
MultimodalAgent
alongside the Gemini Live API! You can use the syntax
const agent = new multimodal.MultimodalAgent({ model, fncCtx, chatCtx: initialContext });
in the Node.js SDK, and it works well. Just make sure your
model
is an instance of
google.beta.realtime.RealtimeModel
, set up with Gemini-specific settings like
instructions
,
voice
, and
api_key
. While the
fncCtx
and
chatCtx
parameters are optional, they’re really helpful if you want your agents to be stateful and functional. If you’d like to dive deeper, I recommend checking out the Gemini integration guide and the Node.js multimodal agent example. They’re great resources to guide you through the process!
r
https://docs.livekit.io/agents/v0/voice-agent/multimodal-agent/ Where can i find the multimodal model for gemini docs in livekit
t
I don't have the answer you're looking for. You could also try asking your question: • in one of the other Slack channels or • to https://deepwiki.com/livekit/livekit_composite which is trained on all LiveKit source code If you find the answer, please post it here to help others!