https://www.getdaft.io logo
Join Slack
Powered by
# benchmarks
  • s

    Sammy Sidhu

    05/02/2024, 1:15 AM
    @Kiril Aleksovski Made this channel to talk about writing a blog together from this post https://dist-data.slack.com/archives/C041NA2RBFD/p1714339992634259
    πŸŽ‰ 1
  • j

    jay

    05/02/2024, 4:55 AM
    πŸš€ πŸš€ πŸš€ πŸš€ πŸš€
  • j

    jay

    05/02/2024, 8:57 PM
    @Kiril Aleksovski any thoughts on making a quick LinkedIn/Twitter post of your benchmarks? I’d hate to let you work go to waste… You should just tag us like you just did πŸ˜› it’s snarky and I love it
    @jay_chia @sammysidhu What did you do to Daft between these versions?! πŸ‘ Or I did something wrong? 😁
    From time to time I play with my own benchmarks with different versions of some DataFrame libraries (Daft, Polars, DuckDB, DataFusion...).
    Few days back I noticed this drastic improvement in Daft, around 55% in one of the queries on sorting parquet file by a column and writing to disk:
    Here is a plot of the timings. Daft is actually the fastest engine out of all those benchmarked.
    (Link to github gist)
    πŸ™Œ 1
    k
    • 2
    • 4