
Aaron Wishnick

06/18/2021, 8:58 PM
Any suggestions for speeding up a query that uses REGEXP_LIKE to filter on a dimension? I see string operations being super slow. Even if I rewrite my regex as
SUBSTR(foo, ..., ...) = bar
I still see the query taking more than 10 seconds

Mayank

06/18/2021, 8:58 PM
Have you tried a text index?
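
For context, a rough sketch of what enabling a text index and querying it might look like. The column name `foo` comes from the question above; the table name `myTable` and the pattern `ab.*` are placeholders, and the config keys follow the Pinot text index documentation as I understand it, so treat them as assumptions to check against your Pinot version. Note that a text index is queried with TEXT_MATCH rather than REGEXP_LIKE, with the regex wrapped in slashes.

```python
import json

# Assumed shape of the relevant table-config fragment; verify the key
# names against the Pinot docs for the version you run.
field_config = {"name": "foo", "encodingType": "RAW", "indexType": "TEXT"}
table_config_fragment = {
    "fieldConfigList": [field_config],
    # Text-indexed columns are typically stored without a dictionary.
    "tableIndexConfig": {"noDictionaryColumns": ["foo"]},
}

# Placeholder table name and pattern, for illustration only.
query = "SELECT COUNT(*) FROM myTable WHERE TEXT_MATCH(foo, '/ab.*/')"

print(json.dumps(table_config_fragment, indent=2))
print(query)
```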

Aaron Wishnick

06/18/2021, 8:59 PM
Does it play nice with the star-tree index?
(To be more precise, I want my query to be accelerated by the star-tree index, and I also want to filter quickly by regex on one of the dimensions)

Mayank

06/18/2021, 9:00 PM
Should work

Aaron Wishnick

06/18/2021, 9:02 PM
Ok, that's really neat
How can a text index accelerate a regex to be faster than a table scan?

Mayank

06/18/2021, 9:02 PM
It leverages a Lucene index internally
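
To make the idea concrete, here is a toy sketch of why an inverted index can beat a scan. It is not Lucene's or Pinot's actual implementation (Lucene, as far as I know, compiles the regex into an automaton and walks a sorted term dictionary); the function names and sample rows are made up for illustration. The point is only that the regex runs once per distinct term instead of once per row, and that matching happens per token rather than on the whole column value.

```python
import re
from collections import defaultdict


def build_inverted_index(values):
    """Map each distinct lowercase term to the row ids that contain it."""
    index = defaultdict(list)
    for row_id, value in enumerate(values):
        # set() so a term repeated within one row is recorded once.
        for term in set(value.lower().split()):
            index[term].append(row_id)
    return index


def regex_lookup(index, pattern):
    """Run the regex over distinct terms only, then union their row ids."""
    compiled = re.compile(pattern)
    rows = set()
    for term, row_ids in index.items():
        if compiled.fullmatch(term):
            rows.update(row_ids)
    return sorted(rows)


# Made-up sample data: 5 rows, but only 7 distinct terms to test.
values = ["disk error on node7", "login ok", "network error", "login ok", "login ok"]
index = build_inverted_index(values)
print(regex_lookup(index, r"err.*"))  # [0, 2]
```

With real data the gap is much larger: millions of rows often share a comparatively small set of distinct terms, so the regex is evaluated far fewer times than in a full scan.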

Aaron Wishnick

06/18/2021, 9:03 PM
Cool