Sammy Sidhu
05/02/2024, 1:15 AMjay
05/02/2024, 4:55 AMjay
05/02/2024, 8:57 PM@jay_chia @sammysidhu What did you do to Daft between these versions?! π Or I did something wrong? π
From time to time I play with my own benchmarks with different versions of some DataFrame libraries (Daft, Polars, DuckDB, DataFusion...).
Few days back I noticed this drastic improvement in Daft, around 55% in one of the queries on sorting parquet file by a column and writing to disk:
Here is a plot of the timings. Daft is actually the fastest engine out of all those benchmarked.
(Link to github gist)