victorious-nest-89511
10/15/2024, 6:12 PM
livekit-agents 0.10.2 release 📢
Heads up, there's a new version of livekit-agents available! v0.10.2 includes the following updates:
• Added VAD.update_options to the Silero plugin, which you can use to update the VAD options after the VAD object has been created.
• Included various fixes to address cases where VoicePipelineAgent was getting stuck.
• Included some additional VAD-related fixes.
Please reach out if you run into any issues or have any other feedback!
flaky-sundown-18688
10/28/2024, 5:08 PM
victorious-nest-89511
10/31/2024, 4:18 AM
livekit-agents 0.11.0 release 📢
We've just released livekit-agents 0.11.0, which is full of treats! Inside you'll find the much-requested addition of more detailed timing metrics at each stage of the VoicePipelineAgent, and much more.
Highlights:
• Introduced detailed time metrics (time to first byte/token of STT/LLM/TTS, duration) and usage metrics, including an example for calculating the cost of a conversation.
• Added TTS.update_options, useful for making runtime changes such as changing voices.
• Added the ability to get the human and agent transcript for the MultimodalAgent class.
• Fixed CPU load calculation using the system usage instead of the cgroup usage, addressing autoscaling issues for some developers.
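To make the cost-calculation idea from the highlights concrete, here is a minimal sketch of turning usage totals into an estimated conversation cost. It is not the example that ships with the release: the UsageTotals class, its field names, and every price below are hypothetical placeholders, so substitute the figures from your own STT/LLM/TTS providers.

```python
from dataclasses import dataclass

# Hypothetical prices in USD; real prices depend on your providers and models.
LLM_PROMPT_PRICE_PER_1K_TOKENS = 0.0025
LLM_COMPLETION_PRICE_PER_1K_TOKENS = 0.01
TTS_PRICE_PER_1K_CHARS = 0.015
STT_PRICE_PER_MINUTE = 0.0043


@dataclass
class UsageTotals:
    """Hypothetical running totals; not the library's own metrics classes."""
    llm_prompt_tokens: int = 0
    llm_completion_tokens: int = 0
    tts_characters: int = 0
    stt_audio_seconds: float = 0.0


def conversation_cost(usage: UsageTotals) -> float:
    """Sum the per-stage costs for one conversation."""
    llm = (usage.llm_prompt_tokens / 1000) * LLM_PROMPT_PRICE_PER_1K_TOKENS
    llm += (usage.llm_completion_tokens / 1000) * LLM_COMPLETION_PRICE_PER_1K_TOKENS
    tts = (usage.tts_characters / 1000) * TTS_PRICE_PER_1K_CHARS
    stt = (usage.stt_audio_seconds / 60) * STT_PRICE_PER_MINUTE
    return llm + tts + stt


usage = UsageTotals(
    llm_prompt_tokens=2000,
    llm_completion_tokens=1000,
    tts_characters=2000,
    stt_audio_seconds=120.0,
)
total = conversation_cost(usage)
print(f"Estimated cost: ${total:.4f}")
```

In practice you would accumulate the totals from the usage metrics emitted during the session rather than hard-coding them.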
Check out the full changelog for livekit-agents and each plugin for a detailed rundown of all updates. A huge thanks to @martin-purplefish, @Tanesan, @longcw, @jayeshp19, @hari-truviz, and @samirsalman for your excellent contributions!
gifted-family-53487
11/01/2024, 6:11 PM
livekit-client@2.6.0), Python (livekit==0.18.0), Node (@livekit/rtc-node@0.11.0), and Rust (livekit = "0.7.0"). Updates for the mobile SDKs are coming over the next few weeks. Check out the full docs for a complete overview and examples.
victorious-nest-89511
11/12/2024, 6:44 PM
VoicePipelineAgent
• Silero VAD, ElevenLabs TTS, Deepgram STT, OpenAI LLM 💪
• Various fixes 🛠️
Note: We are aware of a bug concerning users of tsx under certain conditions. See this issue for more details and a fix.
dry-elephant-14928
11/14/2024, 6:30 AM
victorious-nest-89511
11/14/2024, 11:43 PM
livekit-agents@0.11.2 release 📢
Hello LiveKit community! Go and grab livekit-agents@0.11.2 for some new additions to the OpenAI plugin along with a handful of bug fixes:
• New features in livekit-plugins-openai@0.10.16
  ▪️ Added usage metrics for the OpenAI Realtime API, which provide more visibility into the input/output token usage when using MultimodalAgent with this API.
  ▪️ The chat_ctx is now synchronized with the OpenAI Realtime API: transcripts from the model are written back to the chat_ctx, while the realtime session is updated based on input from chat_ctx.
• Fix to correctly follow the min_endpointing_delay configuration
• Fix for agent_speech_committed not being called in certain scenarios
Please refer to the changelog for a full rundown of all updates, and thanks to @jayeshp19, @martin-purplefish, and @longcw for your contributions!
victorious-nest-89511
11/15/2024, 5:33 PM
@livekit/agents-plugin-openai 0.5.0 release 📢
Version 0.5.0 of the OpenAI plugin for Node.js Agents is now available and includes the following updates:
• Support for OpenAI TTS and STT
• Full compatibility with VoicePipelineAgent as of agents-js version 0.4.4
victorious-nest-89511
11/19/2024, 2:56 PM
victorious-nest-89511
11/25/2024, 2:44 PM
victorious-nest-89511
12/03/2024, 3:00 AM
VoicePipelineAgent
victorious-nest-89511
12/14/2024, 2:40 PM
accept()
== Agents for Python Version 0.12.2 ==
Update to the latest version for the following updates:
• Improvements to endpointing latency
• Improvements to the end-of-turn plugin
• Fixed duplicate messages being committed with function calls
• Fixed handling of optional function arguments in tool calls
flaky-sundown-18688
12/20/2024, 7:59 PM
victorious-nest-89511
12/24/2024, 2:39 AM
say() inside of a function call
• Fix multiple function calls not firing multiple times
• Parity with Python SDK in MultimodalAgent events
• Additional small fixes
victorious-nest-89511
12/31/2024, 3:18 PM
victorious-nest-89511
01/18/2025, 4:00 AM
victorious-nest-89511
01/29/2025, 9:29 PM
victorious-nest-89511
01/31/2025, 9:54 PM
FallbackAdapter to handle function calls correctly
• Fixed handling of before_llm_cb behavior for skipped inferences
• Fixed false-positive interruption handling with Anthropic and Google LLMs
• Added generate_reply API for MultimodalAgent
• Improved TTFB metrics for streaming TTS
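For anyone unsure what a TTFB metric captures for a streaming stage, the sketch below times the gap between starting a stream and receiving its first chunk, alongside the total duration. It is only an illustration of the measurement, not how the library computes its metrics: fake_tts_stream is a hypothetical stand-in that yields dummy bytes instead of calling a real TTS provider.

```python
import asyncio
import time
from typing import AsyncIterator, Tuple


async def fake_tts_stream(chunks: int = 3, delay: float = 0.01) -> AsyncIterator[bytes]:
    """Hypothetical stand-in for a streaming TTS; yields dummy audio chunks."""
    for _ in range(chunks):
        await asyncio.sleep(delay)  # simulate synthesis/network latency
        yield b"\x00" * 320


async def measure_ttfb(stream: AsyncIterator[bytes]) -> Tuple[float, float, int]:
    """Return (time to first chunk, total duration, chunk count) in seconds."""
    start = time.perf_counter()
    ttfb = None
    count = 0
    async for _chunk in stream:
        if ttfb is None:
            # First chunk arrived: this is the time to first byte.
            ttfb = time.perf_counter() - start
        count += 1
    duration = time.perf_counter() - start
    # If the stream produced nothing, fall back to the total duration.
    return (ttfb if ttfb is not None else duration), duration, count


ttfb, duration, count = asyncio.run(measure_ttfb(fake_tts_stream()))
print(f"ttfb={ttfb:.3f}s duration={duration:.3f}s chunks={count}")
```

The same pattern applies to time to first token for an LLM stream: record the clock when the request starts and again when the first item arrives.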
We've also released a major version update to our Turn Detector (0.4.0) model:
• 3x smaller
• 5x faster inference
• 98% true positive rate (+1%)
• 97% true negative rate (+11%)
victorious-nest-89511
02/06/2025, 3:48 PM
gifted-family-53487
02/11/2025, 7:49 PM
sendText and sendFile, which let you send anything of any size and we'll handle encoding, chunking, buffering, and reconstruction for you.
• Additionally we have added streamText and streamBytes, which let you send anything incrementally (e.g. for streaming a long LLM response to your frontend).
• Either way, receive the data in a stream with registerTextStreamHandler and registerByteStreamHandler. You can read chunks as they come in or just readAll if you don't need progressive rendering.
• Both features rely on topic so you can organize them within your application for different uses. Both also support sending to any or all participants in the room simultaneously.
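As a rough mental model of the chunking, reconstruction, and topic routing described above, here is a small in-memory sketch. It does not use the LiveKit SDK and is not how the SDK is implemented: TopicRouter, chunk_text, and the chunk size are hypothetical names and values chosen purely for illustration.

```python
from typing import Callable, Dict, Iterable, Iterator, List

CHUNK_SIZE = 8  # hypothetical chunk size in bytes, kept tiny for demonstration


def chunk_text(text: str, chunk_size: int = CHUNK_SIZE) -> Iterator[bytes]:
    """Encode text as UTF-8 and yield it in fixed-size byte chunks."""
    data = text.encode("utf-8")
    for start in range(0, len(data), chunk_size):
        yield data[start:start + chunk_size]


class TopicRouter:
    """Hypothetical in-memory stand-in for per-topic stream handlers."""

    def __init__(self) -> None:
        self._handlers: Dict[str, Callable[[str], None]] = {}

    def register(self, topic: str, handler: Callable[[str], None]) -> None:
        self._handlers[topic] = handler

    def deliver(self, topic: str, chunks: Iterable[bytes]) -> None:
        # Join all chunks before decoding, so a multi-byte character that was
        # split across two chunks is reconstructed correctly.
        text = b"".join(chunks).decode("utf-8")
        self._handlers[topic](text)


received: List[str] = []
router = TopicRouter()
router.register("chat", received.append)

message = "Streaming a long LLM response, é included."
router.deliver("chat", chunk_text(message))
print(received[0])
```

This version buffers everything before handing it to the handler, which mirrors the read-everything-at-once style; a progressive renderer would instead process chunks as they arrive and use an incremental decoder.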
These features are currently available in a preview state in the latest release of the JS (2.9.0), Python (0.20.0), and Node.js (0.13.4) SDKs. Others coming soon.
We've also published new documentation to cover all the realtime text & data features.
Let us know what you think, and we'd love to see what you build with it!
victorious-nest-89511
02/26/2025, 4:11 AM
before_llm_cb (#1504)
• Add cache tokens in CompletionUsage dataclass (#1478)
Agents for Node.js 0.7.0
• Introduces support for Turn Detector model (#225)
• Replace transcription forwarder with synchronizer (#301)
• Gracefully fail on StreamAdapter errors (#299)
• Skip TTS on empty LLM output (#293)
• Clearer timeout handling for drain (#277)
• Fix feeding null LLM input (#296)
victorious-nest-89511
03/10/2025, 2:08 PM
livekit-agents and all associated plugins) to ensure that you don't upgrade until you're ready. For instance, if you're using a pip requirements file, you can add a version constraint to each package to restrict to only versions earlier than v1.0:
livekit-agents>=0.12.6,<1.0.0
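To illustrate which versions a constraint like this admits, here is a simplified check written in plain Python. It is a sketch, not pip's actual resolver logic: parse_version and satisfies are hypothetical helpers that only understand purely numeric dotted versions, so pre-release strings such as release candidates are out of scope here.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Split a numeric dotted version such as '0.12.6' into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def satisfies(version: str, lower: str = "0.12.6", upper: str = "1.0.0") -> bool:
    """Check lower <= version < upper, mirroring '>=0.12.6,<1.0.0'."""
    return parse_version(lower) <= parse_version(version) < parse_version(upper)


for candidate in ["0.12.5", "0.12.6", "0.12.20", "1.0.0"]:
    status = "allowed" if satisfies(candidate) else "blocked"
    print(f"{candidate}: {status}")
```

Comparing integer tuples rather than raw strings matters: as strings, "0.12.20" would sort before "0.12.6".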
We'll share some more details on v1.0 soon. If you have any questions in the meantime, please reach out as we're here to help!
flaky-sundown-18688
03/18/2025, 3:05 PM
gifted-family-53487
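03/20/2025, 6:48 PM
from livekit.agents.pipeline import VoicePipelineAgent
from livekit.plugins import noise_cancellation  # provided by livekit-plugins-noise-cancellation

agent = VoicePipelineAgent(
    # .. STT/TTS/LLM/etc ..,
    noise_cancellation=noise_cancellation.BVC()
)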
The package and documentation are available on PyPI. Node.js support is in the works and will be ready soon.
Give it a try and let us know what you think!
https://pypi.org/project/livekit-plugins-noise-cancellation/
refined-appointment-81829
03/21/2025, 2:06 PM
dry-elephant-14928
03/27/2025, 6:41 AM
pip install "livekit-agents[openai,silero,deepgram,cartesia]~=1.0rc"
Let us know what you think!
refined-appointment-81829
03/27/2025, 1:59 PM
refined-appointment-81829
04/10/2025, 5:26 PM
refined-appointment-81829
04/22/2025, 11:12 AM
gentle-refrigerator-18414
05/01/2025, 9:40 PM