OpenHands

:rocket: Just dropped from our team at Snowflake AI Research: *ArcticInference*, a vLLM plugin that speeds up the CodeAct Agent using *<https://arxiv.org/abs/2411.04975|SuffixDecoding>*, making end-to-end runs *1.8×–4.5× faster* :zap: with *no loss in quality*!

:bulb: Example: Solving SWE-Bench Verified on 4×H100 GPUs with openhands-lm-32b (37.2% resolve rate) now takes *5.8h instead of 10.9h* — saving both time :stopwatch: and money :moneybag:.
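
For a rough sense of what those numbers imply, here is a back-of-the-envelope sketch in Python. It uses only the figures quoted above (10.9h, 5.8h, 4 GPUs). The variable names are illustrative, and it assumes all 4 GPUs are occupied for the whole run:

```python
# Figures quoted in the message above; names are illustrative, not from ArcticInference.
baseline_hours = 10.9      # SWE-Bench Verified run without the plugin
accelerated_hours = 5.8    # same run with the plugin
num_gpus = 4               # 4xH100, assumed busy for the full run

speedup = baseline_hours / accelerated_hours
hours_saved = baseline_hours - accelerated_hours
gpu_hours_saved = hours_saved * num_gpus

print(f"speedup: {speedup:.2f}x")                 # speedup: 1.88x
print(f"wall-clock saved: {hours_saved:.1f} h")   # wall-clock saved: 5.1 h
print(f"GPU-hours saved: {gpu_hours_saved:.1f}")  # GPU-hours saved: 20.4
```

That works out to about 1.88×, which sits at the lower end of the 1.8×–4.5× range quoted above.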

:link: Dive in: <https://www.snowflake.com/en/engineering-blog/fast-speculative-decoding-vllm-arctic/>