Asking because if I add a where clause that filters out noth Apache Pinot #pinot-perf-tuning

Ken Krugler

12/29/2020, 3:17 PM

Asking because if I add a where clause (that filters out nothing) using a dimension NOT in my dimensionsSplitOrder list, I thought that the star tree wouldn’t be used, yet the query time is the same as for the case without that where clause.

Chinmay Soman

12/29/2020, 6:20 PM

You might want to ensure that the index is created properly first for those segments: https://docs.pinot.apache.org/basics/getting-started/frequent-questions/query-faq#how-do-i-verify-that-an-index-i[…]s-created-on-a-particular-column

Jackie

12/29/2020, 7:50 PM

If it filters out nothing, star-tree can still be used. You may check the numDocsScanned to see if the query is scanning the pre-aggregated records

Ken Krugler

12/29/2020, 8:00 PM

numDocsScanned = 1B (out of 1.7B) for the case where the filter matches a few records, 362M for the case where the filter matches no records, and 362M for the case of no filtering. But time taken is almost identical in all cases (27 seconds, +/- 1 second).
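
A minimal sketch of the comparison being made here, assuming the broker response has already been parsed into a dict. numDocsScanned and totalDocs are fields of the Pinot broker response; the helper name scanned_fraction and the sample dicts are illustrative only, and the numbers are the rounded figures quoted in this thread rather than real response payloads.

```python
def scanned_fraction(response: dict) -> float:
    """Fraction of the table's documents that the query had to scan."""
    return response["numDocsScanned"] / response["totalDocs"]


# Rounded figures quoted in the thread, not actual broker responses.
no_filter = {"numDocsScanned": 362_000_000, "totalDocs": 1_700_000_000}
few_match = {"numDocsScanned": 1_000_000_000, "totalDocs": 1_700_000_000}

for label, resp in [("no filter", no_filter), ("filter on extra dimension", few_match)]:
    print(f"{label}: {scanned_fraction(resp):.1%} of documents scanned")
```

A fraction well below 100% hints that pre-aggregated records are being read, but roughly a fifth of 1.7B is still hundreds of millions of records to group.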

Jackie

12/29/2020, 8:13 PM

Can you share the query and the star-tree config? It seems not many documents are pre-aggregated

Jackie

12/29/2020, 8:14 PM

Too many documents selected even with star-tree

Ken Krugler

12/29/2020, 8:14 PM

"starTreeIndexConfigs": [{
  "dimensionsSplitOrder": [
    "advertiser",
    "adHash",
    "network",
    "imageSize",
    "adType",
    "platform",
    "country",
    "crawlDays"
  ],
  "skipStarNodeCreationForDimensions": [],
  "functionColumnPairs": [
    "SUM__adSpend",
    "SUM__impressions",
    "DistinctCountHLL__adHash",
    "DistinctCountHLL__crawlDays",
    "MIN__crawlDays",
    "MAX__crawlDays"
  ],
  "maxLeafRecords": 10000
},

Ken Krugler

12/29/2020, 8:14 PM

select adHash,advertiser,sum(adSpend) from crawldata group by adHash,advertiser order by sum(adSpend) desc limit 100

Jackie

12/29/2020, 8:16 PM

Do adHash and advertiser have very high cardinality?

Ken Krugler

12/29/2020, 8:16 PM

Yes

Jackie

12/29/2020, 8:17 PM

Most of the query time should be spent on grouping by these 2 columns

Jackie

12/29/2020, 8:18 PM

Star-tree won't help much for this query as not many documents can be pre-aggregated

Jackie

12/29/2020, 8:18 PM

Only records with the same adHash and advertiser can be pre-aggregated
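
To illustrate the point about cardinality, here is a toy sketch of pre-aggregation by dimension values. It is a simplified illustration and not Pinot's star-tree implementation; the pre_aggregate helper and the four sample records are made up for the example.

```python
from collections import defaultdict


def pre_aggregate(records, dims, metric):
    """Sum `metric` over records that share the same values for `dims`."""
    totals = defaultdict(float)
    for record in records:
        key = tuple(record[d] for d in dims)
        totals[key] += record[metric]
    return dict(totals)


# Made-up records: each adHash belongs to exactly one advertiser.
records = [
    {"adHash": "h1", "advertiser": "a1", "country": "US", "adSpend": 1.0},
    {"adHash": "h1", "advertiser": "a1", "country": "DE", "adSpend": 2.0},
    {"adHash": "h2", "advertiser": "a1", "country": "US", "adSpend": 3.0},
    {"adHash": "h3", "advertiser": "a2", "country": "US", "adSpend": 4.0},
]

by_pair = pre_aggregate(records, ["adHash", "advertiser"], "adSpend")
by_advertiser = pre_aggregate(records, ["advertiser"], "adSpend")
print(len(records), len(by_pair), len(by_advertiser))
```

The number of pre-aggregated rows equals the number of distinct dimension combinations, so when adHash is nearly unique per record, grouping by it collapses very little.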

Ken Krugler

12/29/2020, 8:19 PM

There’s a one-to-many relationship from advertiser to adHash. So given an adHash, it’s always for one advertiser.

Jackie

12/29/2020, 8:20 PM

I see. Can you try

select adHash, sum(adSpend) from crawldata group by adHash order by sum(adSpend) desc limit 100

and see if it is faster?

Ken Krugler

12/29/2020, 8:20 PM

it’s slower - timed out after 30 sec

Ken Krugler

12/29/2020, 8:21 PM

Retrying with 50 sec

Jackie

12/29/2020, 8:21 PM

It should definitely be cheaper than grouping on 2 columns

Ken Krugler

12/29/2020, 8:25 PM

I’m going to have to log onto servers and check logs, as now the initial query is also timing out. So feels like something is borked…

Ken Krugler

12/29/2020, 8:25 PM

I did just run a distinct_count query on adHash, which probably blew memory somewhere 😞

Ken Krugler

12/29/2020, 8:26 PM

As there are likely > 1B unique adHashes, out of 1.7B records.

Jackie

12/29/2020, 8:30 PM

Based on the numDocsScanned for star-tree queries, maybe ~300M unique adHashes 😉

Ken Krugler

12/29/2020, 10:04 PM

I partition each month of data into 30 segments, by adHash. So likely there’s about 300M unique adHashes per segment…

Ken Krugler

12/31/2020, 12:42 AM

Hi @Jackie just saw your response to the other user, where you said:

Ken Krugler

12/31/2020, 12:42 AM

Another way is to enable the tracing for the query and see if it uses the StarTreeFilterOperator

Ken Krugler

12/31/2020, 12:42 AM

I had enabled tracing, and I didn’t see this in the trace output. So that would indicate my star tree isn’t being used for some reason, yes?
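
A rough sketch of that check, assuming the trace for one server has been unescaped into valid JSON shaped like the excerpt pasted later in this thread: a map from an id such as "0_10" to a list of single-entry objects keyed by "<Operator> Time". That shape, the uses_star_tree helper, and the sample values are assumptions for illustration, not a documented Pinot format.

```python
import json


def uses_star_tree(trace_json: str) -> bool:
    """Return True if any traced operator is a StarTreeFilterOperator."""
    trace = json.loads(trace_json)
    return any(
        name.startswith("StarTreeFilterOperator")
        for entries in trace.values()   # one list of entries per trace id
        for entry in entries            # each entry is {"<Operator> Time": ms}
        for name in entry
    )


# Hypothetical sample modeled on the excerpt in this thread.
sample = json.dumps({
    "0_10": [
        {"StarTreeFilterOperator Time": 21},
        {"DocIdSetOperator Time": 69},
        {"AggregationGroupByOrderByOperator Time": 97},
    ]
})
print(uses_star_tree(sample))
```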

Jackie

12/31/2020, 12:43 AM

Can you paste the tracing?

Ken Krugler

12/31/2020, 12:49 AM

OK, my bad - with tracing on, and the right query, I do see StarTreeFilterOperators in the JSON, thanks. For one of the many pieces, operators with time > 1ms were:

{\"0_10\":[
	{\"StarTreeFilterOperator Time\":21}
	{\"DocIdSetOperator Time\":69}
	{\"ProjectionOperator Time\":69}
	{\"TransformOperator Time\":69}
	{\"DocIdSetOperator Time\":17}
	{\"ProjectionOperator Time\":17}
	{\"TransformOperator Time\":17}
	{\"AggregationGroupByOrderByOperator Time\":97}]}

Jackie

12/31/2020, 12:58 AM

AggregationGroupByOrderByOperator time includes the time for other operators because it is nested

Ken Krugler

12/31/2020, 12:59 AM

But it’s not the sum of times, so curious what that actually means.

Jackie

12/31/2020, 1:01 AM

The operators are chained with some extra operations

AggregationGroupByOrderByOperator
TransformOperator
ProjectionOperator
DocIdSetOperator
StarTreeFilterOperator

And each one should include the time for the previous layer + time for the extra operations

Jackie

12/31/2020, 1:03 AM

E.g. with these 2 entries:

{\"TransformOperator Time\":17}
	{\"AggregationGroupByOrderByOperator Time\":97}]}

We know the engine spends 17ms running the transform operator + 80ms aggregating the records
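
The subtraction described above can be sketched as follows, assuming the entries for one chain are already known and ordered from the innermost operator outward. The exclusive_times helper is illustrative; working out which trace entries belong to the same chain is not covered here.

```python
def exclusive_times(chain):
    """Convert nested (inclusive) operator times into per-layer times.

    `chain` is a list of (operator, inclusive_ms), innermost operator first.
    The first entry is returned unchanged, so it still includes any layers
    beneath it that are not listed.
    """
    result = []
    previous = 0
    for name, inclusive_ms in chain:
        result.append((name, inclusive_ms - previous))
        previous = inclusive_ms
    return result


# The two entries from the example above.
chain = [("TransformOperator", 17), ("AggregationGroupByOrderByOperator", 97)]
for name, ms in exclusive_times(chain):
    print(f"{name}: {ms} ms")
```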

Ken Krugler

12/31/2020, 1:08 AM

So given my example, how would I know which operators were chained (and thus showing a sum)? Or is it just by increasing time, so StarTreeFilter is 21ms, then next DocIdSet is chained and thus 48ms, and no time for the next Projection or Transform, but the next DocIdSet is 17ms? Just trying to figure out how to interpret the numbers…
