# general
  • s

    Seunghyun

    01/22/2020, 1:30 AM
    @User Is there a particular reason why we chose MYSQL_ANSI lex over CALCITE_DEFAULT? Does this mean that if i generate SQL using calcite (relbuilder -> relnode -> sqlnode -> sql string), I need to set lex to MYSQL_ANSI in order for Pinot to be able to parse the generated query?
  • s

    Sidd

    01/22/2020, 1:43 AM
    I think the default behavior of calcite is to treat quoted and unquoted identifiers separately. So if the column name is unquoted, it will convert it into upper case silently. Using MYSQL_ANSI will preserve the case of identifiers whether or not they are quoted -- this is important since PQL was case-sensitive until a few days ago
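The casing behavior described in this message can be illustrated with a toy model. This is only a sketch and not Calcite's parser: the function `resolve_identifier` and the policy names `TO_UPPER` and `UNCHANGED` are hypothetical, and it assumes (as the message states) that the default lex upper-cases unquoted identifiers while MYSQL_ANSI leaves them as written.

```python
def resolve_identifier(name: str, quoted: bool, unquoted_casing: str) -> str:
    """Toy model of how a SQL parser might normalise an identifier.

    unquoted_casing is a hypothetical policy name:
      "TO_UPPER"  - unquoted identifiers are upper-cased
                    (the default behavior as described in the chat)
      "UNCHANGED" - identifiers keep their case
                    (the MYSQL_ANSI behavior as described in the chat)
    Quoted identifiers are always left as written in this sketch.
    """
    if quoted or unquoted_casing == "UNCHANGED":
        return name
    if unquoted_casing == "TO_UPPER":
        return name.upper()
    raise ValueError(f"unknown casing policy: {unquoted_casing}")


# The same column name under the two assumed policies.
default_unquoted = resolve_identifier("playerName", quoted=False, unquoted_casing="TO_UPPER")
default_quoted = resolve_identifier("playerName", quoted=True, unquoted_casing="TO_UPPER")
mysql_ansi_unquoted = resolve_identifier("playerName", quoted=False, unquoted_casing="UNCHANGED")
```

Under these assumptions only the unquoted identifier with the upper-casing policy changes, which is why a case-sensitive column lookup could fail for a query that did not quote its column names.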
  • x

    Xiang Fu

    01/22/2020, 1:43 AM
    I remember one reason is about case sensitivity and quoting
  • s

    Sidd

    01/22/2020, 1:44 AM
    Also, relbuilder -> relnode -> sqlnode -> sql string will probably not generate a SQL query. RelBuilder is used for building relational expression from a query
  • s

    Sidd

    01/23/2020, 4:57 PM
    @here people might want to take a look at this PR -- https://github.com/apache/incubator-pinot/pull/5006
  • s

    Subbu Subramaniam

    01/23/2020, 5:35 PM
    Please see my comment on the PR. For a change of this type (since it changes persistent data), we should at least have a doc in cwiki, wherever, and give people time to think and comment on it.
  • s

    Subbu Subramaniam

    01/24/2020, 5:34 PM
    https://www.linkedin.com/feed/update/urn:li:activity:6625265474131365888/ regarding pinot open source builds
  • s

    Sandeep Nayak

    01/27/2020, 2:40 AM
    Does anyone have a calculator for figuring out the cost of setting up an Apache Pinot cluster in AWS and/or Azure?
  • x

    Xiang Fu

    01/28/2020, 2:14 AM
    Currently there is no calculator; we are measuring it based on EC2 instances and EBS size
  • j

    James Shao

    01/29/2020, 10:00 PM
    so I am looking at pinot doc and found this line
    Results of aggregations with large amounts of group keys (>1M) are approximated
  • j

    James Shao

    01/29/2020, 10:00 PM
    do we have doc about this approximation behavior? like how much it will be approximated?
  • x

    Xiang Fu

    01/29/2020, 10:58 PM
    @User ^^
  • n

    Neha Pawar

    01/29/2020, 11:12 PM
    At every level from segment -> server -> broker, we apply some limits on the number of unique group by keys. After those limits are reached, the remaining keys are dropped. These limits are configurable
  • n

    Neha Pawar

    01/29/2020, 11:12 PM
    Screen Shot 2020-01-29 at 3.11.04 PM.png
  • n

    Neha Pawar

    01/29/2020, 11:14 PM
    with order by support in group by, this problem is slightly alleviated. We perform on-the-fly trimming. Which means, we will first order, and then trim the bottom results. This is better than the current group by approach of blindly trimming after reaching the limit.
  • n

    Neha Pawar

    01/29/2020, 11:15 PM
    however, this is only available on server level as of now. At each segment level, we still stop accepting new group by keys after reaching 100k
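The difference between the two trimming approaches described above can be sketched in a few lines. This is a simplified illustration and not Pinot's code: the function names `blind_trim` and `order_then_trim` are hypothetical, the aggregation is assumed to be a sum ordered descending, and the ordered variant sorts everything once at the end rather than trimming on the fly.

```python
from typing import Dict, Iterable, Tuple

Row = Tuple[str, int]  # (group key, value to sum)


def blind_trim(rows: Iterable[Row], limit: int) -> Dict[str, int]:
    """Stop accepting new group keys once `limit` distinct keys are held."""
    groups: Dict[str, int] = {}
    for key, value in rows:
        if key in groups:
            groups[key] += value
        elif len(groups) < limit:
            groups[key] = value
        # Otherwise the key is new and the limit is reached: it is dropped.
    return groups


def order_then_trim(rows: Iterable[Row], limit: int) -> Dict[str, int]:
    """Aggregate every key, order by the sum descending, keep the top `limit`."""
    groups: Dict[str, int] = {}
    for key, value in rows:
        groups[key] = groups.get(key, 0) + value
    top = sorted(groups.items(), key=lambda kv: kv[1], reverse=True)[:limit]
    return dict(top)


rows = [("a", 1), ("b", 3), ("c", 100), ("a", 1), ("c", 50)]
blind = blind_trim(rows, limit=2)
ordered = order_then_trim(rows, limit=2)
```

With a limit of 2, the blind approach keeps whichever keys arrived first and never sees the largest group `c`, while ordering before trimming retains it.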
  • n

    Neha Pawar

    01/29/2020, 11:15 PM
    @User
  • j

    James Shao

    01/29/2020, 11:17 PM
    sounds good, thanks for the detailed explanation. do you know how I can help put this into our https://pinot.readthedocs.io/en/latest/pql_examples.html so other users can also access this?
  • n

    Neha Pawar

    01/29/2020, 11:25 PM
    where did you read about the aggregation being approximated?
  • j

    James Shao

    01/30/2020, 4:12 AM
    from the https://pinot.readthedocs.io/en/latest/pql_examples.html
  • n

    Neha Pawar

    01/30/2020, 4:50 PM
    you could submit a PR, you'll find this in the docs folder
  • n

    Neha Pawar

    01/30/2020, 4:50 PM
    additionally/alternatively, just edit in gitbooks. i believe we're moving to gitbooks for documentation
  • n

    Neha Pawar

    01/30/2020, 4:51 PM
    https://app.gitbook.com/@apache-pinot/s/apache-pinot-cookbook/
  • n

    Neha Pawar

    01/30/2020, 4:51 PM
    i think you'll need permissions from Kishore
  • s

    Seunghyun

    01/31/2020, 1:40 AM
    I added the label called incompatible. Please add this label to your PR if you introduce any backward incompatibility. This will help when we come up with the release documentation.
    👍 2
  • s

    Sidd

    02/04/2020, 4:24 AM
    @here, I just sent an email to dev@pinot.apache.org to share a doc. I am not sure if everyone has subscribed to the mailing list. In any case, here it is:
  • s

    Sidd

    02/04/2020, 4:24 AM
    Hi All, I have shared a doc highlighting a change to TableConfig. It essentially proposes a new model for the users to specify per column encoding and index info along with cleanup in parts of IndexingConfig. As part of text search PR, we need to add another section to IndexingConfig -- something like TextIndexConfig. During discussions with Subbu and Kishore, we thought it is probably the right time to start doing the cleanup and use the new model directly for text search. It would be great if folks can review this soon as this is the last thing that text search PR is waiting on. However, if we realize that this will take some time to get consensus on, then I plan to go ahead with text search PR following the old (existing) model by introducing a new subsection inside IndexingConfig -- that works perfectly fine and in fact that is what PR currently has.  Later when this proposal is approved, we can do the cleanup for everything. https://docs.google.com/document/d/1PyULfbLOYa5OexZMqbP_MZgg-338flKETJOKWS5en3g/edit
  • h

    Harshini Elath

    02/05/2020, 4:41 PM
    @here java.lang.RuntimeException: Failed to get table config from push locations: [pinot_host:80] for table: ops_wb_metrics
  • h

    Harshini Elath

    02/05/2020, 4:42 PM
    anyone know the cause of this error?
  • m

    Mayank

    02/05/2020, 5:55 PM
    I take it you are getting this error when trying to push data?