# inconsistent-perf
  • k

    Kishore G

    05/05/2020, 3:49 AM
    It's a simple fix
  • k

    Kishore G

    05/05/2020, 3:49 AM
    We will submit the fix, and it would be great if you can test it
  • s

    srisudha

    05/05/2020, 3:52 AM
    Sure we will test it out.
  • s

    srisudha

    05/05/2020, 3:52 AM
    Can the fix be submitted today?
  • k

    Kishore G

    05/05/2020, 3:54 AM
    @User can you take it up?
  • j

    Jackie

    05/05/2020, 4:48 AM
    @User Can you try with the branch in this pr: https://github.com/apache/incubator-pinot/pull/5331
  • j

    Jackie

    05/05/2020, 4:49 AM
    @User I added a threshold to switch between dictId-based and value-based evaluation, as this could have a big performance impact for low-cardinality dictionaries
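A minimal sketch of the idea in the message above, not the code from the PR. It assumes a range predicate over a dictionary-encoded column: the dictId-based path scans the dictionary once to collect matching dictIds, while the value-based path reads each document's value and compares it directly. The function names and the threshold value are hypothetical, chosen only for illustration.

```python
from typing import List, Sequence

# Hypothetical cutoff for illustration only; the real threshold is not stated in this thread.
DICT_ID_BASED_CARDINALITY_THRESHOLD = 1000


def match_dict_id_based(dictionary: Sequence[int], forward_index: Sequence[int],
                        lower: int, upper: int) -> List[int]:
    # Scan the whole (possibly unsorted) dictionary once to find matching dictIds,
    # then test each document's dictId against that set.
    matching_ids = {i for i, value in enumerate(dictionary) if lower <= value <= upper}
    return [doc for doc, dict_id in enumerate(forward_index) if dict_id in matching_ids]


def match_value_based(dictionary: Sequence[int], forward_index: Sequence[int],
                      lower: int, upper: int) -> List[int]:
    # Skip the dictionary scan: resolve each document's value and compare it directly.
    return [doc for doc, dict_id in enumerate(forward_index)
            if lower <= dictionary[dict_id] <= upper]


def match_range(dictionary: Sequence[int], forward_index: Sequence[int],
                lower: int, upper: int,
                threshold: int = DICT_ID_BASED_CARDINALITY_THRESHOLD) -> List[int]:
    # Low cardinality: the dictionary scan is cheap, so keep the dictId-based path.
    # High cardinality: the scan dominates, so fall back to value-based matching.
    if len(dictionary) <= threshold:
        return match_dict_id_based(dictionary, forward_index, lower, upper)
    return match_value_based(dictionary, forward_index, lower, upper)
```

Both paths return the same documents; the threshold only decides which one does less work for a given dictionary size.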
  • j

    Jackie

    05/05/2020, 4:49 AM
    Let's evaluate it before merging
  • k

    Kishore G

    05/05/2020, 4:50 AM
    Sweet. Yes, that's a great idea
  • s

    srisudha

    05/05/2020, 5:01 AM
    Sure, will get back
  • s

    Shounak Kulkarni

    05/05/2020, 10:56 AM
    Hey @User and @User, we tried the scenario with the mentioned branch. Now it's performing equally great with and without the time range predicate! 🎉
  • k

    Kishore G

    05/05/2020, 3:46 PM
    Thanks @User
  • s

    Subbu Subramaniam

    05/05/2020, 3:54 PM
    In case of realtime segments using offheap dictionaries, it should not take time to do a hash lookup. I suppose we were not using offheap here?
  • k

    Kishore G

    05/05/2020, 3:54 PM
    We were using the dictionary, and that was the problem
  • k

    Kishore G

    05/05/2020, 3:56 PM
    Since the dictionary is not sorted in real-time segments, we had to scan the entire dictionary to find matching dictionary IDs
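A rough illustration of why the unsorted dictionary matters for a range predicate, under stated assumptions. With a sorted dictionary (assumed here for contrast), the matching dictIds form one contiguous range that two binary searches can find. With an unsorted dictionary, every entry has to be checked. The function names are hypothetical and this is not Pinot's implementation.

```python
from bisect import bisect_left, bisect_right
from typing import List, Sequence


def dict_ids_sorted(sorted_dictionary: Sequence[int], lower: int, upper: int) -> range:
    # Sorted dictionary: matching dictIds are contiguous, found in O(log n).
    start = bisect_left(sorted_dictionary, lower)
    end = bisect_right(sorted_dictionary, upper)
    return range(start, end)


def dict_ids_unsorted(dictionary: Sequence[int], lower: int, upper: int) -> List[int]:
    # Unsorted dictionary: no ordering to exploit, so every value is compared, O(n).
    return [i for i, value in enumerate(dictionary) if lower <= value <= upper]
```

For a high-cardinality column such as a timestamp, the full scan in the unsorted case grows with the number of distinct values, which is consistent with the slowdown discussed in this thread.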
  • s

    Subbu Subramaniam

    05/05/2020, 3:57 PM
    Ah, ok. Each value in the dictionary is looked up.... got it
  • s

    srisudha

    05/05/2020, 3:58 PM
    One question: why would a query that is to be served by a completed segment also be affected by something in the consuming segment?
  • k

    Kishore G

    05/05/2020, 3:58 PM
    It does not
  • s

    Subbu Subramaniam

    05/05/2020, 3:58 PM
    @User Completed segments have sorted dictionaries
  • s

    Subbu Subramaniam

    05/05/2020, 3:58 PM
    Do you have a sorted column setting also?
  • k

    Kishore G

    05/05/2020, 3:59 PM
    Yes
  • s

    srisudha

    05/05/2020, 3:59 PM
    Yes, got that, but in our case we noticed that if there is any consuming segment, then any further queries with a timestamp range are also impacted
  • s

    srisudha

    05/05/2020, 4:00 PM
    Those further queries could be served from completed segments
  • k

    Kishore G

    05/05/2020, 4:03 PM
    @User I am not following, are you referring to a test after the latest fix by Jackie or earlier tests?
  • s

    srisudha

    05/05/2020, 4:03 PM
    Earlier tests, before the fix.
  • j

    Jackie

    05/05/2020, 6:28 PM
    @User Are you saying that with a time filter time < T, where the consuming segment should not be included, you still see the query getting impacted?
  • k

    Kishore G

    05/05/2020, 6:29 PM
    The query was > T
  • s

    srisudha

    05/09/2020, 6:05 AM
    @User Thanks for the fix. Ignore my question for now. We are performing a few tests and will get back to you in a while.
  • s

    srisudha

    05/09/2020, 6:06 AM
    Sorry, could not get back earlier
  • k

    Kishore G

    05/27/2020, 5:19 PM
    @User has left the channel