# getting-started
t
Hi everyone! I'm attempting to ingest some data in the quickstart guide. I have a simple CSV file and I am using the datalake files source. I am trying to use the profile feature to infer the schema and profile the data, which requires PySpark and PyDeequ. Where I am running into trouble is getting all the dependencies to versions that work together. I am on a 2020 MacBook Pro (M1 chip). I can install everything via Homebrew, and PySpark works well. However, Homebrew only installs apache-spark version 3.3.0, which doesn't currently work with PyDeequ. I have tried to install an older version of Spark manually, but I am having a lot of trouble getting everything to work together. Has anyone done this already and would be able to share the detailed steps? Thank you!
l
Hi @tall-magician-303! Just want you to know this is still on our radar; our M1 expert is currently on PTO, but we should be able to give you some direction within the next ~1 week.
t
Thanks for the update! No rush on my end. I will try again sometime this week and let you know if I can get past the issue I was running into.