Aaron Wishnick
05/14/2021, 5:28 PMselect foo, percentiletdigest(bar, 0.5) from mytable group by foo
. I've got foo
in my dimensionsSplitOrder
and I've got PERCENTILE_TDIGEST__bar
as well as AVG__bar
in my functionColumnPairs
. My query takes about 700 ms but if I switch it to avg(bar)
it takes 15 ms. Is it expected that the t-digest would be that much slower? Anything I can do to speed it up?Xiang Fu
Xiang Fu
Jackie
05/14/2021, 5:36 PMJackie
05/14/2021, 5:41 PMAaron Wishnick
05/14/2021, 5:42 PMAaron Wishnick
05/14/2021, 5:42 PMMayank
Aaron Wishnick
05/14/2021, 7:36 PMnumDocsScanned
mean in the context of a star tree index?Mayank
Aaron Wishnick
05/14/2021, 7:36 PMMayank
Aaron Wishnick
05/14/2021, 7:37 PMMayank
Aaron Wishnick
05/14/2021, 7:37 PMselect foo, percentiletdigest(bar, 0.5) from mytable group by foo
is slow, select foo, avg(bar) from mytable group by foo
is fastMayank
Mayank
Mayank
Aaron Wishnick
05/14/2021, 7:40 PMJackie
05/14/2021, 7:45 PMJackie
05/14/2021, 7:47 PMpercentiletdigest
is expected to be much higher than avg
Aaron Wishnick
05/14/2021, 8:04 PMAaron Wishnick
05/14/2021, 8:05 PMAaron Wishnick
05/14/2021, 8:05 PMMayank
Mayank
Aaron Wishnick
05/14/2021, 8:06 PMAaron Wishnick
05/14/2021, 8:06 PMAaron Wishnick
05/14/2021, 8:06 PMMayank
Aaron Wishnick
05/14/2021, 8:07 PMAaron Wishnick
05/14/2021, 8:07 PMMayank
Mayank
Jackie
05/14/2021, 8:11 PMfoo
is the first dimension in the split order, then it will always use the pre-aggregate docJackie
05/14/2021, 8:12 PMfoo
? How many segments do you have right now?Aaron Wishnick
05/14/2021, 8:22 PMAaron Wishnick
05/14/2021, 8:22 PMAaron Wishnick
05/14/2021, 8:22 PMAaron Wishnick
05/14/2021, 8:33 PMJackie
05/14/2021, 9:05 PMmaxLeafRecords
threshold. While this will increase the size of the star-treeMayank
Mayank
Aaron Wishnick
05/14/2021, 9:09 PMAaron Wishnick
05/14/2021, 9:12 PMMayank
Aaron Wishnick
05/14/2021, 9:16 PMMayank
Jackie
05/14/2021, 9:18 PMmaxLeafRecords
Jackie
05/14/2021, 9:19 PM