Sherlock Xu
07/19/2024, 12:48 PM@bentoml.task
decorator to set a task endpoint for executing long-running workloads (such as batch processing or video generation).
β¦ Added the .submit()
method to both the sync and async clients, which can submit task inputs via the task endpoint and dedicated worker processes constantly monitor task queues for new work to perform.
β¦ Full compatibility with BentoCloud to run Bentos defined with task endpoints.
β¦ See the Services and Clients doc with examples of a Service API by initializing a long running task in the Service constructor, creating clients to call the endpoint, and retrieving task status.
π Optimized the build cache to accelerate the build process
β¦ Enhanced build speed for bentoml build
& containerize
through pre-installed large packages like torch
β¦ Switch to uv
as the installer and resolver, replacing pip
π¨ Supported concurrency-based autoscaling on BentoCloud
β¦ Added the concurrency
configuration to the @bentoml.service
decorator to set the ideal number of simultaneous requests a Service is designed to handle.
β¦ Added the external_queue
configuration to the @bentoml.service
decorator to queue excess requests until they can be processed within the defined concurrency
limits.
β¦ See the documentation to configure concurrency and external queue.
π Secure data handling with secrets in BentoCloud
β¦ You can now create and manage credentials, such as HuggingFace tokens and AWS secrets, securely on BentoCloud and easily apply them across multiple Deployments.
β¦ Added secret subcommands to the BentoML CLI for secret management. Run bentoml secret -h
to learn more.
ποΈ Added streamed logs for Bento image deployment
β¦ Easier to troubleshoot build issues and enable faster development iterations
π Thank you for your continued support! Feel free to try 1.3 now!Axel Mendoza
07/22/2024, 11:08 AM