Hi folks, I am running validations on several csv ...
# troubleshoot
s
Hi folks, I am running validations on several csv files via GreatExpectations operator. I set up the DataHubValidationAction, and all seems to work fine. But I don’t see the results in the datasets, in Datahhub. For completeness, I set up the lineage in the tasks as
outlets={"datasets": [Dataset("file", "AppleSchoolManager.courses_csv")]},
g
Hey @stale-jewelry-2440 we’ll take a look at this
thank you 1
h
Hey @stale-jewelry-2440, we currently do not support datasources using non-sqlalchemy execution engine. https://datahubproject.io/docs/metadata-ingestion/integration_docs/great-expectations/#limitations We log this warning message for such cases -
DataHubValidationAction does not recognize this GE data asset type
which execution engine/datasource are you using for csv files ? Please show your interest by upvoting this feature request if you would like DatahubValidationAction to work with pandas/spark execution engine. https://feature-requests.datahubproject.io/b/feedback/p/great-expectations-support-different-execution-engines
s
Just upvoted, thank you. I use Pandas, they are small-ish files