# pinot-dev
  • d

    Dan DC

    07/13/2022, 6:21 PM
    I ran some tests in my env, and it moves on if the message cannot be converted to a JSON object
  • d

    Dan DC

    07/13/2022, 6:22 PM
    With Avro, though, the segment gets stuck in the offline state
  • d

    Dan DC

    07/13/2022, 6:22 PM
    I believe it stays stuck until the corrupt message is purged from the Kafka logs
  • k

    Kishore G

    07/13/2022, 6:26 PM
    perfect. I am +1 on the PR
  • d

    Dan DC

    07/13/2022, 6:29 PM
    Many thanks Kishore
  • d

    Dan DC

    07/13/2022, 6:30 PM
    I'll look at the bullet points you highlighted in the parent issue and address them accordingly in a separate PR
  • d

    Dan DC

    07/13/2022, 6:31 PM
    Subbu's proposal also makes sense for retryable decoding errors
  • k

    Kishore G

    07/13/2022, 6:45 PM
    yeah, possible. I don't know what value Pinot is adding there, because the decoder has to know whether it's retryable or not
  • k

    Kishore G

    07/13/2022, 6:46 PM
    and if it knows it's retryable, it's better for the decoder to be self-contained and just return a generic row or null
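A minimal sketch of the "self-contained decoder" idea, not Pinot's actual decoder interface: `SketchDecoder`, `SketchIntDecoder`, and `SketchIngest` are hypothetical names, and the "row" is just an integer parsed from text. The decoder swallows its own decoding failure and returns null, and the consumer skips null rows so that consumption moves on.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical contract (an assumption, not Pinot's API):
// return the decoded row, or null if the payload cannot be decoded.
interface SketchDecoder {
    Integer decode(byte[] payload);
}

// Toy decoder: the "row" is a single integer parsed from UTF-8 text.
final class SketchIntDecoder implements SketchDecoder {
    @Override
    public Integer decode(byte[] payload) {
        try {
            return Integer.valueOf(new String(payload, StandardCharsets.UTF_8).trim());
        } catch (NumberFormatException e) {
            // The decoder handles the failure itself and signals "skip" with null.
            return null;
        }
    }
}

final class SketchIngest {
    // Decodes every payload in order; null rows are skipped, so one bad
    // message does not block the messages after it.
    static List<Integer> consume(List<byte[]> payloads, SketchDecoder decoder) {
        List<Integer> rows = new ArrayList<>();
        for (byte[] payload : payloads) {
            Integer row = decoder.decode(payload);
            if (row != null) {
                rows.add(row);
            }
        }
        return rows;
    }
}
```

With this shape the consumer needs no knowledge of the payload format; the trade-off raised in the thread is that each decoder then has to decide on its own what is skippable.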
  • k

    Kishore G

    07/13/2022, 6:46 PM
    we can discuss more when you get to start on that.. for now, we are good
  • s

    Subbu Subramaniam

    07/13/2022, 6:55 PM
    The value is uniform handling of exceptions from all streams. The amount of change in the code is small, I would strongly suggest going with it. @Kishore G what is the objection here?
  • k

    Kishore G

    07/13/2022, 8:10 PM
    I don't have any objection.. we were just brainstorming on a long-term solution
  • s

    Subbu Subramaniam

    07/13/2022, 8:28 PM
    ok, got it. I think we can start with what Dan has now. I have asked for a small comment to be added on the interface so that other streams can use this if they choose to.
  • c

    chandarasekaran m

    07/17/2022, 2:55 AM
    Hi Team, how can I parse a Kafka header (in bytes) and filter based on a specific field? Any code samples?
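A rough sketch of the generic part of this question, under stated assumptions: the header value is UTF-8 text, and a plain `Map<String, byte[]>` stands in for the record's headers so that the example runs without a Kafka dependency. `SketchHeaderFilter` and its methods are hypothetical names. With the Kafka Java client the bytes would typically come from `record.headers().lastHeader(key).value()`. Nothing here is Pinot-specific.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

final class SketchHeaderFilter {
    // True if the header is present and its value, decoded as UTF-8
    // (an assumption about the encoding), equals the expected string.
    static boolean headerMatches(Map<String, byte[]> headers, String key, String expected) {
        byte[] raw = headers.get(key);
        if (raw == null) {
            return false;
        }
        return new String(raw, StandardCharsets.UTF_8).equals(expected);
    }

    // Keeps only the header maps (one per record) whose header matches.
    static List<Map<String, byte[]>> filter(
            List<Map<String, byte[]>> records, String key, String expected) {
        List<Map<String, byte[]>> kept = new ArrayList<>();
        for (Map<String, byte[]> headers : records) {
            if (headerMatches(headers, key, expected)) {
                kept.add(headers);
            }
        }
        return kept;
    }
}
```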
  • d

    Dan DC

    07/21/2022, 1:06 PM
    Apologies for taking this thread to this channel. We believe this is a bug and would like to raise a github issue after confirmation
  • k

    Kishore G

    07/21/2022, 9:26 PM
    we can easily debug that
  • k

    Kishore G

    07/21/2022, 9:26 PM
    do you have upsert enabled there?
  • d

    Dan DC

    07/22/2022, 7:15 AM
    We don't have upsert enabled for these tables
  • d

    Dan DC

    07/22/2022, 7:16 AM
    The table definition is in one of the comments
  • d

    Dan DC

    07/22/2022, 7:16 AM
    I think it can be reproduced in a unit test, I'll give it a try as well
  • d

    Dan DC

    07/22/2022, 11:39 AM
    I've raised a GitHub issue for the above: https://github.com/apache/pinot/issues/9091. We can always reject it if this is proven not to be a bug
  • d

    Dan DC

    07/22/2022, 11:39 AM
    we could also provide a fix or discuss a potential solution for this
  • r

    Richard

    07/22/2022, 12:03 PM
    this bug is because time filtering and tombstone filtering are conflated, `LLRealtimeSegmentDataManager` should react to time filtered empty batches differently to batches which only contain tombstones
  • r

    Richard

    07/22/2022, 12:05 PM
    the advance condition (i.e. line 4 in your description) should depend not on the unfiltered count but on all messages having been tombstones
  • r

    Richard

    07/22/2022, 12:08 PM
    it should be easy to fix by just removing `getUnfilteredMessageCount` from `MessageBatch` and replacing it with `boolean allMessagesInvalid` or something to that effect, then track if all the messages are tombstones in the partition level consumer
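The proposal above could look roughly like the sketch below. This is an illustration with hypothetical names (`SketchBatch`, `SketchOffsets`), not Pinot's `MessageBatch` or its consumer code, and it assumes a tombstone is modelled as a null payload. The batch records whether everything fetched was a tombstone, and the offset advances past an empty batch only in that case.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a fetched batch (not Pinot's MessageBatch).
final class SketchBatch {
    final List<String> validMessages;   // payloads left after dropping tombstones
    final boolean allMessagesInvalid;   // something was fetched and all of it was tombstones
    final long offsetAfterBatch;        // offset following the last fetched message

    SketchBatch(List<String> validMessages, boolean allMessagesInvalid, long offsetAfterBatch) {
        this.validMessages = validMessages;
        this.allMessagesInvalid = allMessagesInvalid;
        this.offsetAfterBatch = offsetAfterBatch;
    }
}

final class SketchOffsets {
    // Partition-level consumer side: drop tombstones (null payloads) and
    // remember whether the whole fetch consisted of tombstones.
    static SketchBatch buildBatch(List<String> fetched, long firstOffset) {
        List<String> valid = new ArrayList<>();
        for (String payload : fetched) {
            if (payload != null) {
                valid.add(payload);
            }
        }
        boolean allInvalid = !fetched.isEmpty() && valid.isEmpty();
        return new SketchBatch(valid, allInvalid, firstOffset + fetched.size());
    }

    // Data manager side: advance when there is data to index, or when the
    // batch is empty only because every message was a tombstone. A batch
    // that is empty for any other reason leaves the offset where it was.
    static long nextOffset(SketchBatch batch, long currentOffset) {
        if (!batch.validMessages.isEmpty() || batch.allMessagesInvalid) {
            return batch.offsetAfterBatch;
        }
        return currentOffset;
    }
}
```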
  • d

    Dan DC

    07/22/2022, 12:18 PM
    agree, should be easy to fix
  • d

    Dan DC

    07/22/2022, 12:19 PM
    I'm trying to put a unit test together
  • r

    Richard

    07/22/2022, 12:30 PM
    unit testing `LLRealtimeSegmentDataManager` is going to be quite challenging
  • d

    Dan DC

    07/22/2022, 12:32 PM
    I know, I'm still willing to give it a try
  • d

    Dan DC

    07/22/2022, 12:33 PM
    there is a JUnit test for a fake subclass and that may help with reproducing this issue