https://pinot.apache.org/ logo
Join Slack
Powered by
# general
  • a

    Alex

    11/12/2019, 10:56 PM
    ran out of space
  • x

    Xiang Fu

    11/12/2019, 11:04 PM
    yeah, for kube, better to have your own volume to mount or storageclass defined
  • s

    Sandeep Nayak

    11/13/2019, 4:41 AM
    Leaving a few questions here 1. During setup what creates the topic
    flights-realtime
    ? 2. I imagine I can use the same
    pinot-example-loader
    to load my sample data into the pinot cluster? 3. The
    StreamAvroIntoKafka
    reads from a provided file, how are the records written to the file? I am thinking of using the same tooling to load my data 4. If I have to change my table schema after I load data, what is the recommendation? 5. I imagine once I set the
    timeFieldSpec
    I cannot alter it? 6. Imagine metrics coming in at a min granularity, what controls roll-up? Say I want to rollup every 15 mins, would that be
    outgoingGranularitySpec
    ? 7. Should there always be a 1:1 mapping between schema -> table schema?
  • k

    Kishore G

    11/13/2019, 4:44 AM
    QuickStart setup starts a stand-alone Kafka cluster and we invoke Kafka admin api to create the topic.
  • k

    Kishore G

    11/13/2019, 4:45 AM
    2 and 3, yes that should work.
  • k

    Kishore G

    11/13/2019, 4:46 AM
    4. Invoke Pinot admin api to update schema
  • k

    Kishore G

    11/13/2019, 4:48 AM
    Run reload segment api. As long as the schema is backwards compatible, Pinot will generate default values for ew columns
  • k

    Kishore G

    11/13/2019, 4:49 AM
    5. Yes altering time column will be tricky. Retention and routing depends on that column
  • k

    Kishore G

    11/13/2019, 4:50 AM
    6. Yes.
  • k

    Kishore G

    11/13/2019, 4:51 AM
    7. Which schema are you referring to?
  • s

    Sandeep Nayak

    11/13/2019, 4:52 AM
    For 7 take for example
    airlineStats_Schema
    and the realtime table for it i.e.
    airlineStats_realtime_table_config
  • m

    Mayank

    11/13/2019, 4:53 AM
    Table config is separate from schema
  • m

    Mayank

    11/13/2019, 4:54 AM
    But yes 1-1 mapping
    👍 1
  • s

    Sandeep Nayak

    11/13/2019, 4:55 AM
    Can I have multiple
    tableIndexConfig
    point to the same topic?
  • m

    Mayank

    11/13/2019, 4:55 AM
    Yes
  • m

    Mayank

    11/13/2019, 4:56 AM
    Multiple tables can consume from the same topic.
  • s

    Sandeep Nayak

    11/13/2019, 4:56 AM
    Think of this as different messages (messages have different schemas) being fed into the same topic which then feeds into the multiple tables
  • m

    Mayank

    11/13/2019, 4:56 AM
    But one table can only consume from 1
  • s

    Sandeep Nayak

    11/13/2019, 4:57 AM
    @User I am thinking m : 1 : n where m = number of schemas and n = number of tables. Any entry in m points to exactly one entry in n
  • m

    Mayank

    11/13/2019, 4:57 AM
    Messages on one topic should have same schema right?
  • s

    Sandeep Nayak

    11/13/2019, 4:57 AM
    That was my question
  • s

    Sandeep Nayak

    11/13/2019, 4:57 AM
    Does it have to be?
  • m

    Mayank

    11/13/2019, 4:57 AM
    I think that’s what Kafka supports
  • s

    Sandeep Nayak

    11/13/2019, 4:58 AM
    Kafka does not dictate the schema of the message
  • s

    Sandeep Nayak

    11/13/2019, 4:58 AM
    Kafka is only the transport layer
  • s

    Sandeep Nayak

    11/13/2019, 4:58 AM
    I can have multiple consumers on Kafka each reading a different message and ignoring ones it does not understand
  • m

    Mayank

    11/13/2019, 4:58 AM
    Yeah you can do that
  • m

    Mayank

    11/13/2019, 4:59 AM
    Each table will have separate schema, and it will pull out the part of the message as per its schema
    👍 1
  • s

    Sandeep Nayak

    11/13/2019, 4:59 AM
    May not be the most efficient way of doing it but it would work
  • k

    Kishore G

    11/13/2019, 5:05 AM
    Interesting idea. Why would you do that?
1...939495...160Latest