Hi everyone! I'm attempting to ingest some data in the quickstart guide. I have a simple CSV file and I am using the datalake files source. I am trying to use the profile feature to infer the schema and profile the data. It needs PySpark and PyDeequ for it. Where I am running into trouble is getting all the dependencies to versions that work together. I am on a Mac Book Pro 2020 (M1 chip). I can install everything via homebrew and PySpark works well. However it only installs apach-spark version 3.3.0 which doesn't currently work with PyDeequ. I have tried to install an older version of spark manually but having lots of trouble getting everything to work together. Has anyone done this already and would be able to share the detailed steps to help?
Thank you!