Hi Community I am fairly new to pinot Does the star tree ind Apache Pinot #troubleshooting

Hi Community, I am fairly new to pinot. Does the s...

Lakshmanan Velusamy

04/05/2021, 11:01 PM

Hi Community, I am fairly new to pinot. Does the star tree index help when we have aggregate queries with time range (and exclusion) filters ?

Lakshmanan Velusamy

04/05/2021, 11:09 PM

Here is the query:

Copy code

SELECT 
  created_at_1_hour_seconds as time_col, 
  SUM(id) as total

FROM table 

WHERE 
  (created_at_seconds BETWEEN 1615924566 AND 1617134166) AND 
  ((field1 <> 'null') AND 
  (entity_id NOT IN ('uuid1', 'uuid2', 'uuid3'))

GROUP BY time_col 
ORDER BY time_col ASC

table/index config:

Copy code

indexSpec:
  starTreeIndexConfigs:
    - dimensionsSplitOrder:
      - created_at_1_hour_seconds
      - created_at_seconds
      - field1
      - entity_id
      functionColumnPairs:
        - function: SUM
          column: id
      skipStarNodeCreationForDimensions:
        - created_at_seconds
  bloomFilterColumns:
    - field1
    - entity_id

Jackie

04/05/2021, 11:11 PM

It helps if the cardinality of the columns are relatively low

Lakshmanan Velusamy

04/05/2021, 11:11 PM

Not sure if created_at_seconds should be on the star tree index dimension split order as the cardinality is very high (timestamps are millisecs granularity).

Jackie

04/05/2021, 11:12 PM

In order to use star-tree to solve queries with filter on it, it needs to be included in the split order

Jackie

04/05/2021, 11:12 PM

created_at_seconds

should be second granularity right?

Lakshmanan Velusamy

04/05/2021, 11:13 PM

my bad, yeah its seconds.

Jackie

04/05/2021, 11:13 PM

Cardinality wise it should be fine. How about

entity_id

Jackie

04/05/2021, 11:13 PM

I feel it's cardinality is going to be high

Lakshmanan Velusamy

04/05/2021, 11:13 PM

should created_at_seconds be on the dimension order given the high Cardinality ?

Lakshmanan Velusamy

04/05/2021, 11:15 PM

entity_id has high cardinality, but total_records (in millions) >> total_entities (in 10s of thousands).

Jackie

04/05/2021, 11:16 PM

Ok, so both

entity_id

and

created_at

would have cardinality of 10s thousands per segment

Jackie

04/05/2021, 11:20 PM

I would recommend not including

created_at_seconds

in the split order in this case. Segments that are fully covered in the time range will use the star-tree index. Segments that are partially covered will fall back to the non-aggregated records.

👍 1

Jackie

04/05/2021, 11:22 PM

Then splitOrder will be

created_at_1_hour_seconds, field1, entity_id

Lakshmanan Velusamy

04/05/2021, 11:25 PM

Got it, we have this in the star tree index config as well:

Copy code

skipStarNodeCreationForDimensions:
        - created_at_seconds

Lakshmanan Velusamy

04/05/2021, 11:26 PM

thanks for the reply @Jackie!

Jackie

04/05/2021, 11:26 PM

Yeah, you may remove it from the skip list as well

👍 1

Lakshmanan Velusamy

04/05/2021, 11:26 PM

We got an another query with more complex aggregations:

Copy code

SELECT 
  dimension_uuid as dimension,
  AVG(total) AS avg_total,
  SUM(total)/DistinctCount(entity_id) AS total_per_entity,
  COUNT(order_id) AS order_count,
  SUM(total) AS total_amount,
  COUNT(order_id)/DistinctCount(entity_id) AS orders_per_entity,
  DISTINCTCOUNT(entity_id) AS entity_count 

FROM table

WHERE 
  (created_at_seconds BETWEEN 1617049575 AND 1617654375) AND 
  (field1 <> 'null') AND 
  (entity_id NOT IN ('uuid1', 'uuid2', 'uuid3'))

GROUP BY dimension 
ORDER BY order_count DESC 
LIMIT 50

Lakshmanan Velusamy

04/05/2021, 11:28 PM

this one has the same set of filters, but does a bunch of aggregations, including DISTINCTCOUNT. Is there a chance to improve performance using star tree index at all for the aggregations ?

Jackie

04/05/2021, 11:30 PM

Here you can find all the supported functions: https://docs.pinot.apache.org/basics/indexing/star-tree-index

Jackie

04/05/2021, 11:31 PM

If you need accurate distinct (

distinctcount

instead of

distinctcounthll

), then it cannot be supported by star-tree due to the risk of storage explosion

Jackie

04/05/2021, 11:33 PM

For the second query, you need to put both

dimension_uuid

and

entity_id

into the split list. The performance of star-tree comes from the pre-aggregation of the records, and I'm not sure if we can get much pre-aggregation with these 2 high cardinality dimensions

Lakshmanan Velusamy

04/05/2021, 11:36 PM

Was skeptical about DISTINCTCOUNT as it mentioned that is not supported due to storage explosion problem, will look into the possibility of using distinctcounthll and also measure the performance with and without index to see if there is a difference.

Lakshmanan Velusamy

04/05/2021, 11:37 PM

Any tool/command to understand the effect of the index when processing the query ?

Jackie

04/05/2021, 11:41 PM

There is no in-built tool for that. For experiment, you can set up 2 tables with the same data, one with index and one without, and query them separately to compare the throughput and latency

Lakshmanan Velusamy

04/05/2021, 11:43 PM

Got it, thats the pretty much the setup we got, along with checking the query stats (especially

numEntriesScannedPostFilter

in the response stats to see the impact of star tree filter on aggregations) for the same query with and without index, along with latency.

Lakshmanan Velusamy

04/05/2021, 11:50 PM

Is there an accuracy (standard error) expectation on the distinctcounthll algorithm used in pinot ?

Jackie

04/06/2021, 12:12 AM

We expect ~2% standard error for HyperLogLog. Reference: https://en.wikipedia.org/wiki/HyperLogLog

Lakshmanan Velusamy

04/06/2021, 12:16 AM

is it 2% for low value metrics as well ? Any plans to implement something like HLL+ ?

Jackie

04/06/2021, 12:23 AM

Based on my understanding, it will be quite accurate with few values

Jackie

04/06/2021, 12:25 AM

The HLL Sketch from

DataSketches

seems promising: https://datasketches.apache.org/docs/HLL/HLL.html

Jackie

04/06/2021, 12:25 AM

We don't have it supported yet. It should not be hard to add. Contributions are very welcome.

👍 1

Lakshmanan Velusamy

04/06/2021, 2:43 AM

Thank you so much for the responses @Jackie !

Jackie

04/06/2021, 2:44 AM

You're welcome 😉

Lakshmanan Velusamy

04/06/2021, 2:44 AM

Does pinot use HLL from datasketches that you pointed above ?

Jackie

04/06/2021, 2:46 AM

No, Pinot leverages this implementation: https://github.com/addthis/stream-lib/blob/master/src/main/java/com/clearspring/analytics/stream/cardinality/HyperLogLog.java

👍 1

Jackie

04/06/2021, 2:48 AM

In star-tree, we use the default

log2m: 8

Lakshmanan Velusamy

04/06/2021, 2:59 AM

got it, which means per this formula, if we plug in log2m as 8, we get 1-1.04/sqrt(2^8) => 93.5% is the accuracy ?

Jackie

04/06/2021, 6:31 AM

Hmm, I think you are correct

Jackie

04/06/2021, 6:32 AM

You can easily test the error rate by comparing the

distinctcount

and

distinctcounthll

results

👍 2

🙏 1

Lakshmanan Velusamy

04/07/2021, 6:39 AM

Hi @Jackie, We have 3 servers in the tenant with replication factor of 3 (pretty much all the servers have the replica of all the segments). Like we saw above, all of our queries are filtered by timestamp. Is it possible to limit the no of segments queried by servers using segmentPartitionConfig ? Was wondering if the segment pruning for timeColumn filter works automatically or if we need to configure something.

Jackie

04/07/2021, 7:21 AM

It works automatically based on the min/max value of the segment

👍 1

Jackie

04/07/2021, 7:22 AM

No need to explicitly configure

Lakshmanan Velusamy

04/07/2021, 8:23 AM

Sounds good, thanks !

Open in Slack

Previous Next