Hi,
I wanted to enable profiling when I ingest the s3 file to datahub but I receive missing spark dependencies due to the proxies. Is there some easy way to pass to spark session parameters like this?
spark.driver.extraJavaOptions -D http.proxyHost=<my_proxy>
b
bulky-electrician-72362
11/28/2022, 12:55 PM
cc: @dazzling-judge-80093
b
breezy-portugal-43538
12/05/2022, 10:54 AM
@bulky-electrician-72362, @dazzling-judge-80093 can you help with narrowing this down? I tried to re check your repository but couldn't find any info on where this additional arg could be passed.
All help is appreciated : )
d
dazzling-judge-80093
12/05/2022, 10:55 AM
@breezy-portugal-43538 can you paste here the exact error message you get?
b
breezy-portugal-43538
12/05/2022, 11:03 AM
@dazzling-judge-80093
The issue happens when I try to enable profiling in the source file by setting profiling: enabled: true
Datahub tries to fetch the packages for spark but it throws timeout error and info regarding missing packages due to the lack of proxy issue