Can pinot infer partitioning field based on input ...
# general
s
Can Pinot infer a partitioning field from the input path? For example, if I have an S3 path like "s3://my-bucket/dt=2022-09-01" and a table with dt in its schema (but my actual data in S3 doesn't contain a dt column), and I run an ingestion job through Spark, can it infer that the partition is dt=2022-09-01, create a partition on that, and also populate the dt value?
k
Unfortunately, no.
Regardless of Pinot not supporting it, I strongly recommend having the date column in the data itself.
h
@Kishore G is right if we are only talking about open-source Pinot. StarTree has an extension that supports this requirement; see https://dev.startree.ai/docs/startree-extensions/file-ingestion-task#derive-columns-from-the-source-file-paths for details.
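For open-source Pinot, one way to follow the suggestion of having the date column in the data is to derive dt from the file path in a preprocessing step before the ingestion job runs. The following is only a minimal sketch in plain Python, not a Pinot or StarTree API: the helper names partition_values and add_partition_columns are hypothetical, and the sketch assumes Hive-style key=value path segments like the one in the question.

```python
import re
from typing import Dict
from urllib.parse import urlparse

# Matches one Hive-style path segment such as "dt=2022-09-01".
_PARTITION_SEGMENT = re.compile(r"^([^=/]+)=([^/]*)$")


def partition_values(path: str) -> Dict[str, str]:
    """Return the key=value segments found in the path (hypothetical helper)."""
    values: Dict[str, str] = {}
    # urlparse separates the bucket (netloc) from the object key (path).
    for segment in urlparse(path).path.split("/"):
        match = _PARTITION_SEGMENT.match(segment)
        if match:
            values[match.group(1)] = match.group(2)
    return values


def add_partition_columns(record: dict, path: str) -> dict:
    """Copy the record and fill in partition columns it does not already have."""
    enriched = dict(record)
    for key, value in partition_values(path).items():
        # setdefault keeps a value that is already present in the data.
        enriched.setdefault(key, value)
    return enriched


if __name__ == "__main__":
    row = {"user": "a"}
    print(add_partition_columns(row, "s3://my-bucket/dt=2022-09-01/part-0000.parquet"))
```

In a Spark preprocessing job the same idea could probably be expressed with input_file_name() and regexp_extract from pyspark.sql.functions, writing the enriched files out before pointing the Pinot ingestion job at them; that step is an assumption about the pipeline and not something confirmed in this thread.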