https://pinot.apache.org/ logo
Join Slack
Powered by
# general
  • s

    srisudha

    05/14/2020, 6:55 AM
    @User i am probably missing something. Want to validate my understanding.. if we have x pinot servers running, since threshold has met a new pinot server has to be added, are you saying we should not use HPA file ( horizontal auto scaling) or v will have hpa and using lifecycle hook of a container we identify the trigger and automate running of the rebalance script? Or is there any effective way to get this done ?
  • k

    Kishore G

    05/14/2020, 7:07 AM
    you can use definitely use HPA (make sure you have a good handle on the metrics based on which you are setting the auto-scaler and avoid thrashing). Rebalance is just a rest api call, you invoke this using lifecycle hook or contribute the feature to Pinot to trigger rebalance on node additions. Helix provides all the callbacks you need to implement this and can be enabled based on a cluster config
  • s

    srisudha

    05/14/2020, 7:19 AM
    Thanks @User for clarifying. Will reflect and get back
  • p

    Prasad Ballingam

    05/14/2020, 7:39 PM
    Hello all. Last meetup was my first one. There were few roadmap items towards the end and one of it was Kinesis connector. I have some experience with AWS Kinesis Client Library. I started exploring the src code. If anyone has any pointers and/or high-level requirements, please share.
  • k

    Kishore G

    05/14/2020, 7:42 PM
    @User is working on it
    👍 1
  • s

    Suraj

    05/15/2020, 11:37 PM
    Hi we realized that we didnt configure column size and it seems to be using default of 512. How can i update it ?
  • k

    Kishore G

    05/15/2020, 11:43 PM
    Column size for ?
  • m

    Mayank

    05/15/2020, 11:43 PM
    seems like String
  • m

    Mayank

    05/15/2020, 11:48 PM
    There's a
    maxLength
    field in the FieldSpec
  • s

    Suraj

    05/15/2020, 11:58 PM
    Yes string type
  • s

    Suraj

    05/15/2020, 11:59 PM
    Can i update config via REST api ?
  • m

    Mayank

    05/15/2020, 11:59 PM
    Can you elaborate the use case?
  • b

    Buchi Reddy

    05/16/2020, 12:01 AM
    @User basically, we didn't specify any size for dimensions and that ended up with default size of
    512
    for its length. Now, we're wondering if we can increase the max size for a dimension and if so what config?
  • m

    Mayank

    05/16/2020, 12:03 AM
    You can upload the schema again with the maxLength via rest api.
  • s

    Suraj

    05/16/2020, 12:14 AM
    Is there a limit on how much we can set ?
  • s

    Suraj

    05/16/2020, 12:14 AM
    For example string ~ 128KB
  • m

    Mayank

    05/16/2020, 12:15 AM
    Technically you are overwriting that limit. You definitely want to remove dictionary for this column
  • m

    Mayank

    05/16/2020, 12:27 AM
    Could you elaborate a bit on what you are storing on this column and how you plan to query it?
  • b

    Buchi Reddy

    05/16/2020, 12:51 AM
    @User right now the requirement is more like putting a blob, which will only be used in selection for display purpose. We think Pinot isn't probably optimized for this and could potentially move that out but until then, we want to have the blob not truncated to 512 chars
  • s

    Sidd

    05/16/2020, 2:59 PM
    One other thing to note is computing the space overhead. If it is standard english ASCII, then size of each value in bytes is equal to number of characters since we use UTF-8. But if the blob data has heterogeneous characters going into the higher ASCII range then each character in UTF-8 is likely to take 2 or even 3 bytes
  • k

    Kenny Bastani

    05/18/2020, 4:04 PM
    <!here> The recording of the 5/5 meetup with Uber is now available, enjoy!

    https://www.youtube.com/watch?v=3UN0ZkmydM8&amp;list=PLLEUtp5eGr7AfzOsoVhxMWUXDXhiWmGtG▾

    (recorded as separate videos)
    🎉 4
    👍 5
  • n

    Neha Pawar

    05/19/2020, 12:06 AM
    Added this to FAQs : https://docs.pinot.apache.org/basics/getting-started/frequent-questions#how-to-set-inverted-indexes
    👍 1
  • o

    Oguzhan Mangir

    05/19/2020, 3:43 PM
    Is there any way to create sql/pql query string from broker request?
  • r

    Ranveer Singh

    05/19/2020, 4:57 PM
    Need to store this JSON post flattening as separate column in table
  • r

    Ranveer Singh

    05/19/2020, 4:57 PM
    {
      "payload" : {     "tenantId" : "0",     "attributes" : {       "attr1" : "0",       "attr2" : "0"     },     "verticalId" : "",     "orderNo" : "",     "orderDate" : "",     "entityErrors" : [ ],     "orderLines" : [ {       "ft" : "",       "qty" : {         "uom" : ""       },       "lineNo" : 2,       "orderLineQuantityInfo" : [ {         "status" : "",         "statusChangeDate" : ""       } ]     } ]   } } (edited)
  • r

    Ranveer Singh

    05/19/2020, 4:58 PM
    Any suggestion will be appreciated.. columns in tables are tenantId, attr1, attr2, orderNo,statusChangeDate etc
  • m

    Mayank

    05/19/2020, 4:59 PM
    I think TransformFunction can be used here
  • k

    Kishore G

    05/19/2020, 5:00 PM
    this entire json as one column?
  • m

    Mayank

    05/19/2020, 5:00 PM
    This is probably incoming event
  • r

    Ranveer Singh

    05/19/2020, 6:53 PM
    Not in one column , each field should go to seperate columns as mentioned above. We have to flaten this before ingestion.. This Json is coming from Kafka
1...134135136...160Latest