Sherlock Xu
07/19/2024, 12:48 PM@bentoml.task decorator to set a task endpoint for executing long-running workloads (such as batch processing or video generation).
β¦ Added the .submit() method to both the sync and async clients, which can submit task inputs via the task endpoint and dedicated worker processes constantly monitor task queues for new work to perform.
β¦ Full compatibility with BentoCloud to run Bentos defined with task endpoints.
β¦ See the Services and Clients doc with examples of a Service API by initializing a long running task in the Service constructor, creating clients to call the endpoint, and retrieving task status.
π Optimized the build cache to accelerate the build process
β¦ Enhanced build speed for bentoml build & containerize through pre-installed large packages like torch
β¦ Switch to uv as the installer and resolver, replacing pip
π¨ Supported concurrency-based autoscaling on BentoCloud
β¦ Added the concurrency configuration to the @bentoml.service decorator to set the ideal number of simultaneous requests a Service is designed to handle.
β¦ Added the external_queue configuration to the @bentoml.service decorator to queue excess requests until they can be processed within the defined concurrency limits.
β¦ See the documentation to configure concurrency and external queue.
π Secure data handling with secrets in BentoCloud
β¦ You can now create and manage credentials, such as HuggingFace tokens and AWS secrets, securely on BentoCloud and easily apply them across multiple Deployments.
β¦ Added secret subcommands to the BentoML CLI for secret management. Run bentoml secret -h to learn more.
ποΈ Added streamed logs for Bento image deployment
β¦ Easier to troubleshoot build issues and enable faster development iterations
π Thank you for your continued support! Feel free to try 1.3 now!Axel Mendoza
07/22/2024, 11:08 AM