# discuss-validation
  • m

    Mayank

    01/07/2021, 7:24 PM
    So this code is not ours. There are Pinot users who have written all kinds of code, scriptware, etc. to generate these, and most of the time we aren't even aware of their existence.
  • c

    Chinmay Soman

    01/07/2021, 7:41 PM
    I see
  • c

    Chinmay Soman

    01/07/2021, 7:42 PM
    well all the more reason for early validation 🙂
  • c

    Chinmay Soman

    01/07/2021, 7:42 PM
    I modified the schema to allow both 3 and "3" FYI
  • c

    Chinmay Soman

    01/07/2021, 7:43 PM
    this does mean we cannot enforce additional constraints (e.g. min / max)
  • c

    Chinmay Soman

    01/07/2021, 7:43 PM
    but I think it's ok for now
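
To illustrate the trade-off described above, here is a minimal Python sketch. It is not Pinot code and does not use a JSON Schema library; it hand-rolls the relevant behavior, and the names `LOOSE_FIELD` and `check_field` are hypothetical. In JSON Schema, `minimum` and `maximum` apply only to numeric instances, so once a field accepts both integers and strings, the string form `"3"` is never range-checked.

```python
# Hypothetical schema fragment: the field may be an integer or a string.
LOOSE_FIELD = {"type": ["integer", "string"], "minimum": 1, "maximum": 10}


def check_field(value, spec):
    """Mimic how JSON Schema combines `type` with `minimum`/`maximum`.

    Range keywords only apply to numeric instances, so string values
    pass through without any range check.
    """
    errors = []
    is_int = isinstance(value, int) and not isinstance(value, bool)
    is_str = isinstance(value, str)
    allowed = spec["type"]
    if not ((is_int and "integer" in allowed) or (is_str and "string" in allowed)):
        errors.append("wrong type")
        return errors
    if is_int:
        if value < spec["minimum"]:
            errors.append("below minimum")
        if value > spec["maximum"]:
            errors.append("above maximum")
    return errors
```

With this sketch, `check_field(99, LOOSE_FIELD)` reports a range error, while `check_field("99", LOOSE_FIELD)` reports nothing, which is the gap being accepted "for now".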
  • m

    Mayank

    01/07/2021, 7:44 PM
    I agree, we do need validation for sure. As Subbu suggested, we can allow for levels of checking (strict vs. loose) and move from loose to strict over time for deployments where that is needed.
  • c

    Chinmay Soman

    01/07/2021, 10:38 PM
    upon further reading, it doesn't look like json-schema has tunable validation levels. We might have to skip validation checks for a lot of fields (e.g. inverted indices) to account for table configs generated by custom code.
  • c

    Chinmay Soman

    01/07/2021, 10:39 PM
    at this point - I'm not sure if json-schema will work for us
  • k

    Kishore G

    01/07/2021, 10:41 PM
    then just add rules. We can still have json-schema based validation, but that's just one of the rules
  • c

    Chinmay Soman

    01/08/2021, 12:02 AM
    you mean, be able to configure json-schema as one of the validation mechanisms ?
  • c

    Chinmay Soman

    01/08/2021, 12:02 AM
    we can do that
  • k

    Kishore G

    01/08/2021, 12:57 AM
    yes
  • c

    Chinmay Soman

    01/11/2021, 6:57 PM
    @User: let's go ahead with that strategy - let's make the json-schema based validation configurable. So by default, we don't enforce it. Let's finish up the remaining items in the schema and open a PR?
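
The strategy agreed on here (schema validation as one optional rule among several, not enforced by default) could be sketched roughly as follows. This is an illustration under assumptions, not the design in any Pinot PR: `ValidatorConfig`, `validate`, and both rule functions are hypothetical names, and `schema_rule` is only a stand-in that rejects unknown top-level keys instead of running a real json-schema check.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional

# A rule takes a table config (a plain dict here) and returns error messages.
Rule = Callable[[dict], List[str]]


def require_table_name(config: dict) -> List[str]:
    """Always-on rule: the config must name its table."""
    return [] if config.get("tableName") else ["tableName is missing"]


def schema_rule(config: dict) -> List[str]:
    """Stand-in for a json-schema check: reject unknown top-level keys."""
    known = {"tableName", "tableType", "tableIndexConfig"}
    return [f"unknown key: {k}" for k in sorted(config) if k not in known]


@dataclass
class ValidatorConfig:
    # Off by default, so existing generated configs keep working.
    enable_schema_rule: bool = False


def validate(config: dict, settings: Optional[ValidatorConfig] = None) -> List[str]:
    """Run the always-on rules, plus the schema rule only when enabled."""
    settings = settings or ValidatorConfig()
    rules: List[Rule] = [require_table_name]
    if settings.enable_schema_rule:
        rules.append(schema_rule)
    errors: List[str] = []
    for rule in rules:
        errors.extend(rule(config))
    return errors
```

A config carrying an extra field emitted by custom tooling passes `validate(config)` unchanged, and is flagged only when a deployment opts in with `ValidatorConfig(enable_schema_rule=True)`.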
  • m

    Mohammed Galalen

    01/11/2021, 8:31 PM
    Ok
  • m

    Mohammed Galalen

    01/12/2021, 9:22 AM
    I've updated the schema in the doc: https://docs.google.com/document/d/1uw0lDBKDU9tM_R1mTMgQFmcAwz9pc5QJzeNSRe1GxXE/edit# If it's good, we can move forward with opening the PR for making it configurable.
  • c

    Chinmay Soman

    01/12/2021, 6:33 PM
    I'll review it today
  • c

    Chinmay Soman

    01/12/2021, 6:34 PM
    thanks Mohammed !
    👍 1
  • c

    Chinmay Soman

    02/03/2021, 10:47 PM
    FYI: Opened a new PR for some missing validation. @User might want to eyeball it - I don't think this will cause any issues in the LinkedIn integration tests, but it doesn't hurt to verify
    👍 1
  • c

    Chinmay Soman

    02/03/2021, 10:47 PM
    https://github.com/apache/incubator-pinot/pull/6541
  • c

    Chinmay Soman

    02/04/2021, 12:03 AM
    lol - it just broke a bunch of integration tests 🙂. Looks like some integration tests are specifying a range index on a non-numeric column
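
The kind of check that tripped those tests can be sketched as below. This is a hedged illustration, not the code in the PR: the function name, the `column_types` mapping, and the `NUMERIC_TYPES` set are assumptions made for the example, and the sketch takes the rule from the message above at face value (a range index is only accepted on numeric columns).

```python
# Assumed set of numeric column types, for this sketch only.
NUMERIC_TYPES = {"INT", "LONG", "FLOAT", "DOUBLE"}


def check_range_index_columns(column_types, range_index_columns):
    """Return one error per range-index column that is unknown or non-numeric.

    column_types maps column name -> data type name (e.g. "INT", "STRING").
    """
    errors = []
    for col in range_index_columns:
        data_type = column_types.get(col)
        if data_type is None:
            errors.append(f"{col}: not in schema")
        elif data_type not in NUMERIC_TYPES:
            errors.append(f"{col}: range index on non-numeric type {data_type}")
    return errors
```

Running such a check when a table config is submitted surfaces the mistake early, which is also why existing tests with a range index on a string column start failing once the check lands.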
  • c

    Chinmay Soman

    02/09/2021, 12:02 AM
    ok - ready for review now: https://github.com/apache/incubator-pinot/pull/6541
  • c

    Chinmay Soman

    02/10/2021, 5:38 PM
    @User mind reviewing before we commit ?
  • m

    Mayank

    02/10/2021, 5:39 PM
    Will do today
  • c

    Chinmay Soman

    02/10/2021, 8:10 PM
    thank you
  • c

    Chinmay Soman

    03/08/2021, 11:00 PM
    @User @User we're using this doc to track all remaining validations: https://docs.google.com/document/d/1uw0lDBKDU9tM_R1mTMgQFmcAwz9pc5QJzeNSRe1GxXE/edit. As Jackie mentioned, some of the recent validations are for newer features or were missed. I can use this to categorize these as MUST HAVE or SUGGESTED?
  • c

    Chinmay Soman

    03/08/2021, 11:01 PM
    we can also use those 2 categories for enforcing how strict the validations are
  • c

    Chinmay Soman

    03/08/2021, 11:01 PM
    one PR in progress related to this is : https://github.com/apache/incubator-pinot/pull/6485
  • c

    Chinmay Soman

    03/08/2021, 11:02 PM
    which is trying to enforce a schema on table config - this will be the most stringent check and will probably belong in the SUGGESTED category
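
One way the two categories could drive strictness is sketched below. The names `Category` and `split_findings` are hypothetical, and the policy shown (MUST HAVE always blocks, SUGGESTED blocks only in strict mode) is an assumption about how the categories might be enforced, not something decided in this thread.

```python
from enum import Enum
from typing import List, Tuple


class Category(Enum):
    MUST_HAVE = "MUST HAVE"
    SUGGESTED = "SUGGESTED"


def split_findings(
    findings: List[Tuple[Category, str]], strict: bool = False
) -> Tuple[List[str], List[str]]:
    """Split (category, message) findings into blocking errors and warnings.

    MUST_HAVE findings always block. SUGGESTED findings block only when
    strict is True; otherwise they are reported as warnings.
    """
    errors: List[str] = []
    warnings: List[str] = []
    for category, message in findings:
        if category is Category.MUST_HAVE or strict:
            errors.append(message)
        else:
            warnings.append(message)
    return errors, warnings
```

Under this policy a deployment can start in loose mode, watch the warnings, and switch to strict mode once its generated configs are clean.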