delightful-barista-90363
07/29/2022, 6:27 PM<s3://bucket_name/{table}/20220729/*.csv>
but it doesnt seem to be working. spark gets initialized but thats about it. thanks in advanced. Actually doesnt look like profiling is run at all. Spark gets initialized but isnt used 🤔
More specifically getting this error
Unable to infer schema for CSV. It must be specified manually.
Looks like in the debug logs, its only going to the {table}
when trying to open up spark
DEBUG:datahub.ingestion.source.s3.source:Opening file <s3://bucket/jordan-test/dataset_a> for profiling in spark
when the file lives 2 folders downcareful-pilot-86309
08/01/2022, 6:08 AMUnable to infer schema for CSV. It must be specified manually.
seems like issue has occurred during schema extraction. Can you confirm if the given recipe works well without profiling? Also, complete log file preferably with debug log enabled will be very helpful in pinning the issue.delightful-barista-90363
08/01/2022, 3:55 PMdelightful-barista-90363
08/01/2022, 3:56 PMdelightful-barista-90363
08/01/2022, 3:56 PMdelightful-barista-90363
08/01/2022, 3:56 PMcareful-pilot-86309
08/01/2022, 3:58 PMdelightful-barista-90363
08/01/2022, 3:59 PMdelightful-barista-90363
08/01/2022, 4:00 PMdelightful-barista-90363
08/01/2022, 4:41 PMdelightful-barista-90363
08/01/2022, 4:56 PMdelightful-barista-90363
08/01/2022, 4:56 PMdelightful-barista-90363
08/01/2022, 4:57 PM<s3://bucket-name/jordan-test/{table}>
delightful-barista-90363
08/01/2022, 5:17 PM<s3://bucket-name/jordan-test/{table}/20220715/*.csv>
but that extra folder after {table}
makes it so that spark cant read a folderdelightful-barista-90363
08/01/2022, 5:17 PM