# pinot-dev
  • m

    Mayank

    04/29/2022, 5:45 PM
    Are we good with this PR functionally? https://github.com/apache/pinot/pull/8314
  • a

    Atri Sharma

    05/02/2022, 5:47 PM
    Is there a way to set the number of segments required when creating a test data set for an integration test?
  • d

    Dan DC

    05/02/2022, 9:05 PM
    Hey there, can I get someone to review this PR? https://github.com/apache/pinot/pull/8625 I need this to finish implementing timestamp and JSON support in the Trino connector. I've left a comment regarding timestamps and time zones; I'll raise a separate issue for that if there isn't one yet.
  • e

    Eric Song

    05/12/2022, 1:51 AM
    @Rong R Hi Rong, sorry to bother you. I'm trying to build Pinot on branch 'pr-query-integration', following the instructions from this conversation (https://github.com/apache/pinot/pull/8662). When I query the broker, I run into problems. A query like "SELECT * FROM baseballStats_OFFLINE" returns some results, but "SELECT playerId FROM baseballStats_OFFLINE" takes almost 10 seconds and returns an empty result. The same happens when I use 'inner join': no exception or error is reported, just an empty result. I started Pinot with 'bin/pinot-admin.sh StartServiceManager -bootstrapConfigPaths conf/pinot-controller.conf conf/pinot-broker.conf conf/pinot-server.conf' (I had already started Zookeeper), with all components on the same machine (1 controller, 1 broker, 1 server). I don't know what happened and would like to know how to locate the problem. Could you please give me some advice? Thanks.
  • f

    francoisa

    05/19/2022, 3:54 PM
    Hi 🙂 I'm working on integration testing for the purgeProcess implementation and looking for a way to set my lastPurgeTime in SegmentZKMetadata during the segment build, using something like what has been done in MergeRollupMinionClusterIntegrationTest in the buildSegmentsFromAvroWithPostfix method. Is there any way to attach metadata to the generated segment in order to validate my code?
  • k

    Kriti

    05/20/2022, 5:47 PM
    In Pinot, segments belonging to a particular dataset may have different indexes. Is this correct? If so, what is the benefit, aside from not having to reload all segments to apply index changes?
  • d

    Dan DC

    05/25/2022, 11:40 AM
    Hello, I'm reading the freshness metrics design document, which is something we are thinking of using in one of our use cases. However, the freshness timestamp returned by Pinot always seems to be the Pinot indexing time. Reading through the code, it seems there isn't a row metadata implementation for Kafka. I'd like to confirm this is the case and, if so, contribute the code changes to get this working as per the design document. I can't see an open issue on GitHub related to this.
  • k

    Kishore G

    05/25/2022, 1:13 PM
    I thought it used timestamp from row metadata if it’s available
  • d

    Dan DC

    05/25/2022, 1:29 PM
    Yup, it does choose the indexing timestamp if row metadata is not available. It doesn't seem like any of the stream ingestion plugins return row metadata at the moment.
  • d

    Dan DC

    05/25/2022, 1:29 PM
    The default implementation returns null
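A minimal sketch of the fallback described in the messages above: use the timestamp from row metadata when the stream plugin provides one, otherwise fall back to the indexing time. The function and parameter names are hypothetical and are not Pinot's actual API; `None` stands in for the null returned by the default implementation.

```python
from typing import Optional


def freshness_timestamp_ms(row_metadata_ts_ms: Optional[int],
                           indexing_time_ms: int) -> int:
    """Pick the timestamp used for the freshness metric.

    row_metadata_ts_ms: timestamp carried by the stream message, or None
        when the ingestion plugin does not return row metadata
        (hypothetical stand-in for the null default).
    indexing_time_ms: time at which the row was indexed.
    """
    if row_metadata_ts_ms is not None:
        return row_metadata_ts_ms
    # No row metadata available: fall back to the indexing time.
    return indexing_time_ms
```

With this shape, a plugin that never returns row metadata always yields the indexing time, which matches the behavior reported in the thread.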
  • d

    Dan DC

    05/25/2022, 1:32 PM
    I'm going to raise an issue on GitHub and open a PR. I'll post the link here if further discussion is needed.
    👍 1
  • k

    Kishore G

    05/25/2022, 1:41 PM
    👍
  • d

    Dan DC

    05/25/2022, 3:58 PM
    This is the PR. I didn't write many details in it or in the linked GitHub issue, apologies for the rush: https://github.com/apache/pinot/pull/8774
    👍 4
  • k

    Ken Krugler

    05/25/2022, 5:51 PM
    Is anyone else getting a dependency convergence failure when building from master? Details in thread…
  • s

    Stuart Millholland

    05/26/2022, 2:04 PM
    Hi all, this may be a silly question, but are server tags and tenants synonymous? Just noticed when I created my own server tags they also show up under tenants.
  • k

    Kishore G

    05/26/2022, 2:10 PM
    Yes, server tags are a superset of tenants.
  • k

    Kishore G

    05/26/2022, 2:11 PM
    Tags can be used for anything. We use tags to implement tenant functionality.
  • s

    Stuart Millholland

    05/26/2022, 2:11 PM
    Excellent, that makes sense, thanks!
  • k

    Kishore G

    05/26/2022, 2:12 PM
    We use tags for tiering, node pools, etc.
  • k

    Kishore G

    05/26/2022, 2:12 PM
    Tags are similar to k8s labels
  • s

    Stuart Millholland

    05/26/2022, 2:15 PM
    If we are overriding server tags on our tables is it good practice to also keep the default ones there?
  • s

    Stuart Millholland

    05/26/2022, 2:15 PM
    "TAG_LIST": [
      "DefaultTenant_OFFLINE",
      "DefaultTenant_REALTIME"
    ]
  • k

    Kishore G

    05/26/2022, 2:27 PM
    It basically means that if someone creates a table without a tenant name, then these servers will be picked.
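A rough sketch of the tag-based selection described above: a table without an explicit tenant falls back to the default tenant, and servers carrying the matching tag are picked. The `<tenant>_<TYPE>` tag pattern is inferred from the TAG_LIST shown in the thread; the server names, the `myTenant` tag, and the function itself are hypothetical and do not reflect Pinot's real implementation.

```python
from typing import Dict, List


def servers_for_tenant(server_tags: Dict[str, List[str]],
                       tenant: str = "DefaultTenant",
                       table_type: str = "OFFLINE") -> List[str]:
    """Return the servers whose tag list contains '<tenant>_<table_type>'."""
    wanted = f"{tenant}_{table_type}"
    return sorted(name for name, tags in server_tags.items() if wanted in tags)


# Hypothetical cluster: one server keeps the default tags, one uses custom tags.
servers = {
    "server-1": ["DefaultTenant_OFFLINE", "DefaultTenant_REALTIME"],
    "server-2": ["myTenant_OFFLINE"],
}

default_offline = servers_for_tenant(servers)              # no tenant given
custom_offline = servers_for_tenant(servers, "myTenant")   # explicit tenant
```

In this model, removing the default tags from every server would leave a table created without a tenant name with no servers to pick from.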
  • s

    Stuart Millholland

    05/26/2022, 3:08 PM
    So will there be potential problems if I remove those tags in favor of my own?
  • k

    Kishore G

    05/26/2022, 3:25 PM
    Yes, but you can run a rebalance anytime.
    ✅ 1
  • s

    Stuart Millholland

    05/26/2022, 4:14 PM
    Another question. We are building a realtime table with tiered storage. The base storage will be on a realtime server, and we have the tiered storage configured to move segments to an offline server after 7 days. So it makes me think a realtime table can have data on an offline server. Is that correct?
  • s

    Stuart Millholland

    05/26/2022, 4:14 PM
    We do have an offline version of this same table configured, but I don't think that table will get any actual data (so it's probably not needed), since I'm just moving the realtime data to the offline servers.
  • s

    Stuart Millholland

    05/26/2022, 4:16 PM
    My tables are immutable_events_realtime and immutable_events_offline. In other words, if I query immutable_events_offline directly, I will not get any data. It all exists in immutable_events_realtime: the first 7 days in realtime server segments and the older data in offline server segments.
  • s

    Stuart Millholland

    05/26/2022, 4:16 PM
    And if that's true, I'm not sure I even need my offline table
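To make the setup in the question concrete, here is a small sketch of age-based tiering: a segment stays on servers with the base tag until it is older than 7 days, then it is placed on servers with the tier tag. It assumes, as the question suggests, that the segment remains part of the realtime table either way. The tag names, the strict "older than" comparison, and the function are all assumptions for illustration, not Pinot's actual code.

```python
DAY_MS = 24 * 60 * 60 * 1000
SEVEN_DAYS_MS = 7 * DAY_MS


def server_tag_for_segment(segment_end_time_ms: int,
                           now_ms: int,
                           base_tag: str = "myTenant_REALTIME",
                           tier_tag: str = "myTenant_OFFLINE",
                           max_age_ms: int = SEVEN_DAYS_MS) -> str:
    """Choose which server tag should host a segment, based on its age.

    Only the hosting servers change; the owning table is not modeled here
    and is assumed to stay the same.
    """
    age_ms = now_ms - segment_end_time_ms
    # Assumption: segments strictly older than the threshold move to the tier.
    return tier_tag if age_ms > max_age_ms else base_tag


now = 10 * DAY_MS
recent = server_tag_for_segment(9 * DAY_MS, now)   # 1 day old
old = server_tag_for_segment(1 * DAY_MS, now)      # 9 days old
```

Under these assumptions, the older data sits on servers tagged for offline use while still being served as part of the realtime table.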
  • s

    Stuart Millholland

    05/26/2022, 5:34 PM
    Sorry all, I think I've been asking my questions in the wrong channel, my apologies.