LiveKit Community #announcements

dry-elephant-14928

11/14/2024, 6:30 AM

We have a new channel for those of you that are working with OpenAI's Realtime API: #C0816MD6LCR

🎉 7

🎉 2

victorious-nest-89511

11/14/2024, 11:43 PM

🚢 `livekit-agents@0.11.2` release 🚢

Hello LiveKit community! Go and grab `livekit-agents@0.11.2` for some new additions to the OpenAI plugin along with a handful of bug fixes:

• New features in `livekit-plugins-openai@0.10.16`
  ◦ Added usage metrics for the OpenAI Realtime API, which provide more visibility into the input/output token usage when using `MultimodalAgent` with this API.
  ◦ The `chat_ctx` is now synchronized with the OpenAI Realtime API: transcripts from the model are written back to the `chat_ctx`, while the realtime session is updated based on input from `chat_ctx`.
• Fix to correctly follow the `min_endpointing_delay` configuration
• Fix for `agent_speech_committed` not being called in certain scenarios

Please refer to the changelog for a full rundown of all updates, and thanks to @jayeshp19, @martin-purplefish, and @longcw for your contributions!

🚀 8

🔥 7

🚢 3

victorious-nest-89511

11/15/2024, 5:33 PM

🚢 Node.js `@livekit/agents-plugin-openai` 0.5.0 release 🚢

Version 0.5.0 of the OpenAI plugin for Node.js Agents is now available and includes the following updates:
• Support for OpenAI TTS and STT
• Full compatibility with `VoicePipelineAgent` as of `agents-js` version 0.4.4

🚀 8

victorious-nest-89511

11/19/2024, 2:56 PM

🚢 Agents for Node.js Release 0.4.5 🚢

We’ve just shipped a bugfix release for LiveKit Agents for Node.js (0.4.5), which you should grab at your convenience. This release also includes a new version of the Node.js OpenAI plugin, which has now been bumped to version 0.6.0. Highlights:
• Ensures all plugins are using the same version of Agents
• OpenAI function calling now allows for raw JSON arguments
• Various code quality improvements (see changelog)

A huge thanks to @astonishing-boots-14432 for your excellent contributions!

🙏 2

🎉 13

❤️ 1

victorious-nest-89511

11/25/2024, 2:44 PM

🚢 Agents for Node.js Release 0.4.6 🚢

LiveKit Agents for Node.js 0.4.6 is now available, which includes:
• Agent/user speech committed events
• New OpenAI Realtime voices
• Fixed an issue where agent state was not updating correctly
• Updated version of Node.js OpenAI plugin (0.6.1)

Thanks to @john-royal for your contributions!

👍 2

❤️ 3

🚀 6

🔥 3

victorious-nest-89511

12/03/2024, 3:00 AM

🚢 Agents for Node.js Release 0.5.0 🚢

Agents for Node.js 0.5.0 is now available, which includes:
• Native support for CommonJS, which makes it easy to use Agents in frameworks like NestJS
• Silero VAD is now compatible with Windows
• Fixes to interruption handling in `VoicePipelineAgent`

🚀 8

🔥 10

victorious-nest-89511

12/14/2024, 2:40 PM

Weekly Roundup - Dec 14 2024

Here’s a recap of a few new things that have landed this week. Check them out and let us know what you think!

== Outbound calling guide for agents ==
We’ve published a new guide on how to build an AI voice agent that can make outgoing calls to phone numbers using LiveKit. You’ll find it helpful for a variety of use cases such as returning customer calls, following up on purchased items, confirming appointments and more. The new guide walks through several new features that can be used in these scenarios, including explicit dispatch, job dispatch metadata, voicemail detection, end call intent handling, and appointment scheduling. To make it easy to start building an agent with outbound calling capabilities, we’ve also created a new bootstrap template.

== Voice agent example using Gemini 2.0 Flash ==
We’ve added a new Python example that demonstrates how to create a voice agent using the Gemini 2.0 Flash model along with Google STT and TTS. You can use this as a starting point for building your Gemini-powered agents.

== Agents for Node.js Version 0.5.1 ==
Update to the latest version for the following changes:
• Fixed LLM tool calling not executing multiple times when a question has more than one tool call
• Fixed sentence tokenizer
• Allow attributes to be set on `accept()`

== Agents for Python Version 0.12.2 ==
Update to the latest version for the following changes:
• Improvements to endpointing latency
• Improvements to end of turn plugin
• Fixed duplicate messages committed with function calls
• Fixed handling of optional func args in tool calls

🎉 12

🚀 10

❤️ 9

🙌 8

🤩 5

🚢 5

flaky-sundown-18688

12/20/2024, 7:59 PM

Hey @everyone! We’ve just released a new feature for Agents: a language-aware turn detection model. This adds a semantic layer on top of the usual voice activity detection (VAD). Instead of just listening for silence to figure out when a user’s turn ends, it analyzes the text to decide if the user has completed their thought 🤯

This makes conversations feel a lot smoother and more natural. For example, if someone says “I’d like to order a…” and pauses, the model knows they’re not done yet and keeps waiting. But if the sentence is complete, it triggers the response right away. It will make your agents sound a lot more natural in conversations with pauses.

Performance-wise, it’s lightweight and built to work seamlessly with the existing Agent pipeline. Running on a 4-core instance, inference takes ~50ms and uses about 1.5GB of RAM.

You can find the plugin and setup instructions here. It’s easy to add to your current VoicePipelineAgent: it’s a single param when you’re setting up the config for the agent.

We’re still refining it and would love your feedback. Let us know how it handles your use cases, what’s working well, and where it struggles. Your input is invaluable as we iterate on this!
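To make the idea in the message above concrete, here is a toy sketch of layering a text-based check on top of silence detection. It is not the plugin's API or model: the word list, function names, and delay values are all hypothetical stand-ins for what a trained model would decide.

```python
# Toy illustration only: a real turn detector uses a trained language model,
# not a word list. All names and thresholds here are hypothetical.
DANGLING_WORDS = {"a", "an", "the", "and", "or", "but", "to", "of", "with", "for"}


def looks_complete(transcript: str) -> bool:
    """Guess whether the transcript ends on a finished thought."""
    words = transcript.lower().rstrip(".!?… ").split()
    return bool(words) and words[-1] not in DANGLING_WORDS


def endpointing_delay(transcript: str, short_wait: float = 0.5, long_wait: float = 3.0) -> float:
    """Seconds of silence to wait before treating the turn as over."""
    return short_wait if looks_complete(transcript) else long_wait


def turn_ended(transcript: str, silence_seconds: float) -> bool:
    """Combine the VAD signal (silence so far) with the text-based guess."""
    return silence_seconds >= endpointing_delay(transcript)
```

With one second of silence, `turn_ended("I'd like to order a…", 1.0)` keeps waiting, while `turn_ended("I'd like to order a pizza.", 1.0)` ends the turn.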

🎉 17

🏓 10

👍 11

❇️ 12

🙂 4

🙌 75

🚀 44

🙏 25

🔥 42

🇦🇺 1

+29

victorious-nest-89511

12/24/2024, 2:39 AM

🚢 Agents for Node.js Release 0.6.0 🚢

Agents for Node.js 0.6.0 is now available, which includes:
• Support for performance and usage metrics logging
• Ability to use `say()` inside of a function call
• Fix multiple function calls not firing multiple times
• Parity with Python SDK in `MultimodalAgent` events
• Additional small fixes

👍 8

🚀 8

🔥 12

🙌 14

victorious-nest-89511

12/31/2024, 3:18 PM

🚢 Agents for Python 0.12.6 Release 🚢

Agents for Python 0.12.6 is now available, which includes:
• Improved interruption handling: avoids getting stuck on false positive interruptions
• Added manual interrupt method for VoicePipelineAgent
• PlayHT/PlayAI TTS plugin
• MultimodalAgent with Gemini Live API
• Bug fixes for STT plugins to address incorrect handling of retries

🎉 5

🚀 10

🔥 5

✅ 9

👀 4

🤩 3

🎉 7

🙌 7

🚢 4

victorious-nest-89511

01/18/2025, 4:00 AM

Moving RunLLM bot to dedicated channel: #C088ZNU7QQ5 Lately, we've been testing a bot from RunLLM across our product channels in order to help answer questions related to LiveKit products. It’s been trained on our documentation and code base and can answer a range of common questions that developers have when building with LiveKit. At the same time, we’ve also found (and heard!) that it can be a bit noisy to have these bot interactions directly in conversations within our product channels. In particular, we’ve seen that it's been distracting for other developers that are engaging on these conversation threads as well. To strike a better balance here, we've created a new, dedicated channel (#C088ZNU7QQ5) for developers to engage the RunLLM bot for questions and guidance related to LiveKit products. With this channel, you no longer even need to mention RunLLM - simply create a new post in the channel, and RunLLM will respond in a thread on your post. With the new channel in place, we’ll be removing the RunLLM bot from our product channels. If you want to use the RunLLM bot, you’ll need to post in the #C088ZNU7QQ5 channel moving forward. Please reach out if you have any questions or additional feedback!

🙌 15

🙌🏽 1

🙌🏼 1

🎉 4

👍 1

victorious-nest-89511

01/29/2025, 9:29 PM

New LiveKit Cloud dashboards in alpha

We’re excited to introduce early access to a new set of views for the LiveKit Cloud dashboard! We’ve overhauled several pages with improved usability and performance enhancements to handle larger loads and deliver faster load times. Everyone can opt in to this new experience to give it a try. When you visit your LiveKit Cloud dashboard, you’ll see a banner at the top of the page through which you can opt in to enable the new views for your project.

We consider this update to be a fairly stable alpha release, but there are still some things under development, so you can switch back to the legacy views at any point if needed. We wanted to share these updates early so that your feedback can directly influence the shape of things to come - please use the feedback form at the top of the dashboard to let us know what you think!

🙌🏽 1

📊 6

🎉 4

🙌🏼 1

🙌 21

📈 3

📲 1

victorious-nest-89511

01/31/2025, 9:54 PM

🚢 Agents for Python 0.12.11 + Turn Detector 0.4.0 Releases 🚢

Agents for Python 0.12.11 is now available, which includes:
• Fixed LLM `FallbackAdapter` to handle function calls correctly
• Fixed handling of `before_llm_cb` behavior for skipped inferences
• Fixed false-positive interruption handling with Anthropic and Google LLMs
• Added `generate_reply` API for `MultimodalAgent`
• Improved TTFB metrics for streaming TTS

We’ve also released a major version update to our Turn Detector (0.4.0) model:
• 3x smaller
• 5x faster inference
• 98% true positive rate (+1%)
• 97% true negative rate (+11%)

🎉 28

victorious-nest-89511

02/06/2025, 3:48 PM

📆 Realtime AI Meetup | Feb 27 in SF 📆

Join us at Founders, Inc in SF on February 27 for the Realtime AI Meetup! Hear from experts across the AI stack as they share insights into building realtime AI agents. Discussions will cover voice agent pipelines, edge vs. cloud inference, multimodal models, agentic workflows, reasoning models, and more. The event will feature speakers from Cartesia, Cerebras, and Google DeepMind, and will be moderated by our co-founder & CEO, Russ d'Sa. If you’re local to the Bay Area, we’d love to see you there! RSVP and more details here: https://lu.ma/a9xvupy4

🙌🏽 1

🙌 4

🗓️ 3

✨ 23

🗣️ 3

gifted-family-53487

02/11/2025, 7:49 PM

🚀 New feature: Text & Byte Streams

We've got a pair of new features for you that make data channels easier to use: text streams and byte streams.

• First, two simple new methods, `sendText` and `sendFile`, let you send anything of any size, and we'll handle encoding, chunking, buffering, and reconstruction for you.
• Additionally, we have added `streamText` and `streamBytes`, which let you send anything incrementally (e.g. for streaming a long LLM response to your frontend).
• Either way, receive the data in a stream with `registerTextStreamHandler` and `registerByteStreamHandler`. You can read chunks as they come in or just `readAll` if you don't need progressive rendering.
• Both features rely on `topic` so you can organize them within your application for different uses. Both also support sending to any or all participants in the room simultaneously.

These features are currently available in a preview state in the latest release of the JS (2.9.0), Python (0.20.0), and Node.js (0.13.4) SDKs. Others coming soon. We've also published new documentation to cover all the realtime text & data features. Let us know what you think, and we'd love to see what you build with it!
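For readers curious what "chunking and reconstruction" means in practice, here is a minimal, self-contained sketch of the general technique. It is not the SDK's implementation or wire format: the `Chunk` type, the function names, and the default chunk size are assumptions made up for illustration.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Chunk:
    """One piece of a larger payload (hypothetical layout, not the SDK's)."""
    stream_id: str
    topic: str
    index: int
    total: int
    payload: bytes


def split_into_chunks(data: bytes, *, stream_id: str, topic: str,
                      chunk_size: int = 15_000) -> list[Chunk]:
    """Slice data into fixed-size pieces tagged with their position."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    # An empty payload still produces one (empty) chunk so the receiver
    # learns that the stream exists and has finished.
    pieces = [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)] or [b""]
    return [Chunk(stream_id, topic, i, len(pieces), p) for i, p in enumerate(pieces)]


def read_all(chunks: Iterable[Chunk]) -> bytes:
    """Reassemble the original bytes, tolerating out-of-order arrival."""
    ordered = sorted(chunks, key=lambda c: c.index)
    if not ordered or len(ordered) != ordered[0].total:
        raise ValueError("missing chunks")
    return b"".join(c.payload for c in ordered)
```

Text is encoded to UTF-8 before splitting and decoded only after the full payload is reassembled, so a multi-byte character that straddles a chunk boundary survives the round trip.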

🙌🏽 1

🚀 14

🙌 19

🔥 19

victorious-nest-89511

02/26/2025, 4:11 AM

🚢 Agents for Python 0.12.15 + Agents for Node.js 0.7.0 Releases 🚢

We’ve recently dropped some new updates to both the Python and Node.js distributions of the Agents framework, so be sure to grab the latest for your platform of choice, including:

Agents for Python 0.12.15
• Improved performance of OpenAI TTS by switching to use Opus encoding (#1494)
• Improved exception logging (#1490)
• Fix interrupting nested speech from `before_llm_cb` (#1504)
• Add cache tokens in `CompletionUsage` dataclass (#1478)

Agents for Node.js 0.7.0
• Introduces support for Turn Detector model (#225)
• Replace transcription forwarder with synchronizer (#301)
• Gracefully fail on StreamAdapter errors (#299)
• Skip TTS on empty LLM output (#293)
• Clearer timeout handling for drain (#277)
• Fix feeding null LLM input (#296)

🚀 6

🎉 4

🔥 4

🤎 2

🙌 25

👍 1

victorious-nest-89511

03/10/2025, 2:08 PM

🚨 Action Required 🚨 Managing package requirements prior to Agents for Python 1.0 release

We’re nearing the release of Agents for Python v1.0, which will contain some breaking changes for agents that have been built with versions prior to v1.0. We’ll be publishing a detailed migration guide soon in order to help you navigate these changes as you upgrade your agents to use the new v1.0 package.

Since v1.0 contains changes that will need to be made to your agents, we strongly recommend that you proactively manage your upgrades to the Agents for Python packages (`livekit-agents` and all associated plugins) to ensure that you don’t upgrade until you’re ready. For instance, if you’re using a pip requirements file, you can add a version constraint to each package to restrict to only versions earlier than v1.0:

livekit-agents>=0.12.6,<1.0.0

We’ll share some more details on v1.0 soon - if you have any questions in the meantime, please reach out as we’re here to help!
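As a quick illustration of what the `>=0.12.6,<1.0.0` constraint admits, the sketch below checks a few final-release version strings against the same bounds. The helper names are hypothetical, and the parsing is deliberately simplified: it only understands plain dotted numbers, so pre-release tags such as `rc` are out of scope and are best left to pip itself.

```python
def parse_release(version: str) -> tuple[int, ...]:
    """Turn a plain dotted version like '0.12.6' into a comparable tuple."""
    return tuple(int(part) for part in version.split("."))


def allowed(version: str, lower: str = "0.12.6", upper: str = "1.0.0") -> bool:
    """True if lower <= version < upper, mirroring '>=0.12.6,<1.0.0'."""
    return parse_release(lower) <= parse_release(version) < parse_release(upper)


# Hypothetical candidates to check against the constraint.
for candidate in ["0.12.5", "0.12.6", "0.12.15", "1.0.0"]:
    print(candidate, "allowed" if allowed(candidate) else "blocked")
```

Comparing tuples of integers rather than raw strings matters here: as text, "0.12.15" would sort before "0.12.6", which would wrongly block a newer 0.12.x release.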

🎉 11

⏱️ 1

👍🏻 1

🙌 2

👀 3

👍 33

👍🏽 1

flaky-sundown-18688

03/18/2025, 3:05 PM

🚀 Analytics v2 for LiveKit Cloud is here!!! 🚀

We rebuilt our analytics from the ground up to support scale at any size for any use case. Agents, streaming, and now telephony metrics! Some other things to 👀 when you're checking the new dashboard:
• Faster load times: Background data processing and streaming updates make everything load a lot faster.
• Streamlined navigation: Breadcrumb system, redesigned layout, and improved search for quick session and participant insights.
• New insights: Connection minutes, quality metrics, latency stats, and 🎉 telephony call logs 🎉.
• Enhanced controls: Filter, slice, and share dashboard states easily with URL params and live updates.

We built these updates directly from your feedback. Check out the announcement and let us know what you think. More feedback = better features 😄 https://x.com/livekit/status/1902010421264937246

🤌 4

🔥 12

🎉 12

🎉 4

🙌 3

💯 2

gifted-family-53487

03/20/2025, 6:48 PM

🚀 Krisp support in Python!

You can now apply enhanced noise cancellation directly inside your agent or other Python app. This is available to all LiveKit Cloud projects at no extra cost. This has a few distinct benefits over client-side integrations:
• Support for Krisp's Background Voice Cancellation (BVC) model - filter not just noise but also background speakers that would otherwise confuse turn detection and transcription processes in voice agents.
• Universal client support - This filter works on audio coming from anywhere, including native apps, web, telephony, and even our Unity SDK.
• Simpler integration for multi-platform apps - integrate the filter in just one spot instead of adding the package to Swift, Android, Flutter, and JS apps.

Integration couldn't be simpler:

agent = VoicePipelineAgent(
    # .. STT/TTS/LLM/etc ..,
    noise_cancellation=noise_cancellation.BVC()
)

The package and documentation are available on PyPI. Node.js support is in the works and will be ready soon. Give it a try and let us know what you think! https://pypi.org/project/livekit-plugins-noise-cancellation/

🚀 6

🥳 1

🙌 20

✅ 1

👀 1

+18

refined-appointment-81829

03/21/2025, 2:06 PM

A few folks in the community have suggested the idea of open office hours, and I think it sounds like a good idea. Next Thursday at 9am US Pacific/12pm US Eastern I will host an open office hour livestream where folks can raise their hand, ask questions, and get answers. I will share a link here 10 minutes before the call.

🙌 18

👀 1

🤚 1

👍🏽 1

👍 34

dry-elephant-14928

03/27/2025, 6:41 AM

🤖 Python Agents 1.0 RC

Hey everyone! We've just rolled out a big update for Python Agents! Thanks to all your feedback over the past year, we've made the framework cleaner and more flexible. Here's what's new:

✨ Multi-Agent Support
◦ Break down complex workflows into simpler tasks.
🔄 Unified AgentSession Class
◦ We've merged VoicePipelineAgent and MultimodalAgent into a single AgentSession.
◦ Handles both pipelined models and the Realtime API.
⚙️ Flexible IO and overrides
◦ Flexible IO system to take input/direct output as you wish
◦ Pre/post-process each part of the pipeline
📡 Improved Text Stream Handling
◦ Text input/output are now fully integrated as first-class features.
🎛️ API Updates
◦ `generate_reply`: Generate dynamic responses from LLMs.
◦ `say`: Respond with static text or pre-synthesized audio.
◦ `interrupt`: Interrupt your agent whenever needed.
🐞 Lots of Bug Fixes

Check out the examples in the repo and the new docs site! To try it out:

pip install "livekit-agents[openai,silero,deepgram,cartesia]~=1.0rc"

Let us know what you think!

🎯 6

🎉 5

🎉 4

livekit logo 6

🤩 3

🙌 49

👏 2

🚀 36

🙌🏼 1

✅ 1

+16

refined-appointment-81829

03/27/2025, 1:59 PM

Reminder that in one hour we will have an open office hour (livestream). You will be able to raise your hand and ask questions. We will have several folks from the LiveKit staff on hand to answer the questions. (I will add additional details in the thread)

refined-appointment-81829

04/10/2025, 5:26 PM

🚀 LiveKit Agents 1.0 is here!

The wait is over: we’ve shipped a major upgrade to our open-source framework for building voice AI agents. Agents 1.0 introduces support for structured workflows, making it dramatically easier to build closed-loop agents for scheduling, support, and automation use cases. Key updates include:
• Unified AgentSession interface for all agent types, real-time or pipeline
• Modular pipeline nodes for easier customization
• Synchronized captioning
• Multilingual semantic turn detection
• Streamlined function tool definitions (function_tool)
• Fine-grained control over STT → LLM → TTS flows

We’re also kicking off a closed beta for Cloud Agents, our fully-managed hosting platform for deploying agents at scale.

📚 Explore what’s new → https://docs.livekit.io/agents
🔄 Migration guide → https://docs.livekit.io/agents/start/v0-migration/
⚡ Examples → https://github.com/livekit/agents/tree/main/examples/voice_agents
📝 Blog post → https://blog.livekit.io/livekits-series-b/
🤖 Cloud agents beta signup → here
💬 Let us know what you’re building, and if you’re excited, please reshare the blog post or give it a boost on social! Every bit helps 🙌

🚀 30

💥 24

🎉 55

❤️ 4

🔥 42

refined-appointment-81829

04/22/2025, 11:12 AM

Come join us. Devs building in voice and video AI, we’ve got something special cooked up for next week: on 4/30, we’re doing a fireside chat with @juberti in SF. Justin is a legend. He created the WebRTC protocol, led dev for Google Meet and Stadia, started @FixieAI, and is now the Head of... https://x.com/livekit/status/1914335043549368761

🎉 15

👍 1

🔥 1

gentle-refrigerator-18414

05/01/2025, 9:40 PM

Hi everyone! We just created a brand new #C08QMJD9J9G channel for everyone who's using LiveKit for robotics, machines, and other embedded platforms! Please drop in and say hi. We'd love to hear what you're working on and how we can make LiveKit even better for your use case! 🦾🤖🚗

🎉 12

magnificent-art-43333

05/07/2025, 5:29 PM

Hey LK community, @flaky-sundown-18688 and I worked with Andrew Ng on a free short course (< 1h) about building AI voice agents. We cover some things we hope will be helpful/interesting:
• the architecture of an AI voice agent
• voice AI network protocols (HTTP vs. WebSocket vs. WebRTC)
• building a basic voice agent with STT ⇒ LLM ⇒ TTS
• optimizing latency with semantic VAD, interruption handling, and streaming requests
• digging into performance metrics across your voice pipeline

If you take the course and have any questions or feedback, DM one of us or post in this thread! 🙏

Links
Andrew’s post: https://x.com/AndrewYNg/status/1920161212312268988
Course: https://www.deeplearning.ai/short-courses/building-ai-voice-agents-for-production/?utm_campaign=livekit-launch&utm_medium=partner&utm_source=livekit

🚀 14

🔥 9

👍 3

🎉 25

refined-appointment-81829

05/08/2025, 7:49 PM

Tavus Avatars!!! https://x.com/shayneparlo/status/1920562746069852339 https://blog.livekit.io/bringing-ai-avatars-to-voice-agents/

🔥 15

🙌 1

refined-appointment-81829

05/09/2025, 12:12 AM

Now you can code during the day! 😎 We’re excited to announce light mode for LiveKit Docs 🎉 This has been one of the most requested features from our community, and we’re thrilled to make it happen. Thanks to all of you who kept it on our radar with your feedback! And this is just the beginning: we’re planning improvements to make light mode even better and to expand it to other surfaces soon. Give it a try and enjoy a fresh look for your daytime coding sessions.

docs-lightmode.mp4

🙌 18

🎉 3

👏 2

magnificent-art-43333

05/19/2025, 5:34 PM

Hey LK devs 👋 Late last week, we crossed 10,000 members in this community. This Slack started in late 2020 with just a few close friends who’d heard we were working on a new project. Back then, LiveKit was an open source WebRTC engine for building video apps during the pandemic. Today we’re an AI infrastructure company with 50+ teammates across 14 countries. LiveKit software runs on billions of devices, and on some days, the LiveKit Cloud network sees as many concurrent sessions as Fortnite. Pretty wild. But none of this could have happened without you. Every question, suggestion, bug report, GitHub issue, PR, demo, and feature request — it’s all made LiveKit better. The time, energy, and patience you’ve poured into exploring and building with LiveKit is what keeps us going. From the whole core team: thank you. For the last four years — and for whatever we build together next. Cheers 🥂

💥 8

🔥 18

🤎 1

🎈 2

🚀 36

🫶 30

🎉 19

🥂 17

🙌 84

refined-appointment-81829

06/13/2025, 5:27 PM

Light mode in Cloud just dropped 😎 You asked, we delivered. Whether you prefer a brighter workspace or just need a break from the dark, you can switch it up anytime in User settings. Twitter https://x.com/livekit/status/1933548983403630890 LinkedIn https://www.linkedin.com/feed/update/urn:li:activity:7339315527611211776

🎉 5

🙏 2

😎 13