# general
  • a

    abk

    08/22/2019, 3:30 PM
    Am going through the docs to run a service/instance and want to do some POC
  • k

    Kishore G

    08/22/2019, 3:30 PM
    The administrator for capitalone.com has disabled the ability to receive items from outside their domain. If those you're attempting to share with would like more information, they should contact their domain administrator directly.
  • a

    abk

    08/22/2019, 3:30 PM
    oh, okay thanks @User
  • k

    Kishore G

    08/28/2019, 6:18 PM
    @here We created a new meetup group for Apache Pinot. https://www.meetup.com/apache-pinot/ Please sign up and share with your network.
  • x

    Xiang Fu

    08/29/2019, 12:10 AM
    As well as our official twitter account : https://twitter.com/ApachePinot
    👍 1
  • x

    Xiang Fu

    08/29/2019, 12:12 AM
    The Twitter account was suspended and has just been unlocked, but all the followers were cleared. So please re-follow this account for news and updates.
  • t

    Thomas

    09/04/2019, 12:48 AM
    I would like to configure Pinot locally or on Azure. The available documentation doesn't provide the necessary steps. It would be helpful if someone could guide us on this; we are planning to use Pinot in our existing project and would like to test all the use cases
  • k

    Kishore G

    09/04/2019, 1:17 AM
    Are you running kubernetes on Azure?
  • m

    Mayank

    09/04/2019, 8:38 PM
    You can have multiple independent tables each of which consumes from a topic each
  • t

    The Alchemist

    09/04/2019, 9:47 PM
    @here: hey everyone! experimenting with Pinot. anyone have a quick definition for the difference between dimensions fields and metrics fields? it looks like metric fields are almost always `INT`s and the `STRING` columns are almost always dimension fields. could someone explain the difference, or link me to a doc? thanks!
  • k

    Kishore G

    09/04/2019, 9:50 PM
    metrics and dimensions can be of any data type
  • t

    The Alchemist

    09/04/2019, 9:51 PM
    i found this: https://support.google.com/datastudio/answer/6402048?hl=en but i’m not sure if it’s applicable to Pinot
  • k

    Kishore G

    09/04/2019, 9:51 PM
    metrics mostly are of INT, LONG, DOUBLE, (BYTES in special cases like tdigest/HLL)
  • t

    The Alchemist

    09/04/2019, 9:51 PM
    @User: thanks! i think i understand that, i just don’t understand how the choice of metric vs. dimension affects storage or query capabilities
  • j

    Jackie

    09/04/2019, 9:52 PM
    Metrics are mainly for aggregation purpose, and cannot be type STRING
  • j

    Jackie

    09/04/2019, 9:52 PM
    Dimensions are mainly for slice and dice (in filter clause and group by clause)
  • j

    Jackie

    09/04/2019, 9:52 PM
    Dimensions are usually not of type BYTES
  • t

    The Alchemist

    09/04/2019, 9:53 PM
    thanks, @User!
  • t

    The Alchemist

    09/04/2019, 9:53 PM
    looking at the sample baseball stats schema, `yearID` is a dimension
  • j

    Jackie

    09/04/2019, 9:53 PM
    Dimensions can be either single-valued or multi-valued, while metrics can only be single-valued
  • t

    The Alchemist

    09/04/2019, 9:54 PM
    ah, ok
  • t

    The Alchemist

    09/04/2019, 9:54 PM
    so if it needs to be aggregated, go with metrics. otherwise, dimension
    👍 1
  • j

    Jackie

    09/04/2019, 9:55 PM
    Internally, we use the same way to compress the data, so with the same config, they will have the same performance and storage
    👍 1
  • j

    Jackie

    09/04/2019, 9:56 PM
    You are correct, for example, `memberId` should be a dimension, while `revenue` or `price` which are used for aggregation should be metrics
  • t

    The Alchemist

    09/04/2019, 9:56 PM
    i think that makes sense… `yearID` is an `INT` dimension because we’d want to slice and dice (filter/group by), but it would be silly to aggregate the year field
  • t

    The Alchemist

    09/04/2019, 9:58 PM
    do you need any metrics at all? i.e., can you have a dimension-only table?
  • k

    Kishore G

    09/04/2019, 10:00 PM
    yes, you can have dimension-only table but you can only run count(*) on that table
    👍 2
  • j

    Jackie

    09/04/2019, 10:01 PM
    For certain cases, yes. For example, you only need to `COUNT` for different slice and dice
  • j

    Jackie

    09/04/2019, 10:02 PM
    We support aggregation on dimensions, but some optimizations will differentiate dimensions and metrics
  • j

    Jackie

    09/04/2019, 10:02 PM
    E.g. pre-aggregation will only happen on metrics if configured
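
To make the dimension vs. metric discussion above concrete, here is a minimal sketch of a Pinot-style schema built as a Python dict and dumped to JSON. The schema name `salesExample` and the `tags` column are made up for illustration; `memberId`, `yearID`, `revenue`, and `price` come from the thread, but their data types here are assumptions. The `check_schema` helper is hypothetical: it only encodes the rules of thumb mentioned in the conversation (metrics cannot be `STRING`, metrics must be single-valued) and is not Pinot's own validation.

```python
import json

# Illustrative schema: dimensions are used for filter/group by,
# metrics are the columns that get aggregated.
schema = {
    "schemaName": "salesExample",  # hypothetical name
    "dimensionFieldSpecs": [
        {"name": "memberId", "dataType": "LONG"},
        {"name": "yearID", "dataType": "INT"},
        # Dimensions may be multi-valued; "tags" is a made-up column.
        {"name": "tags", "dataType": "STRING", "singleValueField": False},
    ],
    "metricFieldSpecs": [
        {"name": "revenue", "dataType": "DOUBLE"},
        {"name": "price", "dataType": "DOUBLE"},
    ],
}


def check_schema(schema):
    """Return a list of problems based on the rules of thumb from the thread.

    This is a hypothetical helper, not part of Pinot.
    """
    problems = []
    for spec in schema.get("metricFieldSpecs", []):
        if spec["dataType"] == "STRING":
            problems.append(f"metric {spec['name']} cannot be STRING")
        # Fields are treated as single-valued unless marked otherwise.
        if not spec.get("singleValueField", True):
            problems.append(f"metric {spec['name']} must be single-valued")
    return problems


if __name__ == "__main__":
    print(check_schema(schema))
    print(json.dumps(schema, indent=2))
```

A dimension-only table would simply leave `metricFieldSpecs` empty, which matches the point above that such a table is mainly useful for `count(*)` over different slices.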