Slackbot
02/28/2023, 10:20 PMJim Rohrer
02/28/2023, 10:37 PMmax_latency_ms
being too short to allow the model to respond. Increasing this to some ridiculous amount got rid of the ServiceUnavailable errors but obviously also increased latency.Jim Rohrer
02/28/2023, 10:37 PMsauyon
03/01/2023, 7:25 PMThomas Busath
03/01/2023, 8:02 PMsauyon
03/01/2023, 8:10 PMJim Rohrer
03/01/2023, 9:57 PMmax_latency_ms
was simply the window used to batch requests...."I have 100 req/s and a 100ms max latency, my batches will probably be in sizes of 10"sauyon
03/01/2023, 9:58 PM