`DataHubValidationAction` in Great Expectations an...
# troubleshoot
s
DataHubValidationAction
in Great Expectations and Mixpanel Hi folks, it seems that the module sends data to mixpaned by default. Differently statetd, I never asked nor configured anything to send data to Mixpanel, but when I activate the module to send expectations results to DataHub (i.e. adding the relative action in the checkpoint yml file), I read this on the logs:
Copy code
[2022-03-16, 13:55:38 CET] {connectionpool.py:1005} DEBUG - Starting new HTTPS connection (1): <http://api.mixpanel.com:443|api.mixpanel.com:443>
[2022-03-16, 13:55:38 CET] {connectionpool.py:465} DEBUG - <https://api.mixpanel.com:443> "POST /engage HTTP/1.1" 200 25
Note that those messages disappear if I comment out the DataHubValidationAction in the checkpoint file.
h
s
you can disable via using environment variables https://datahubproject.io/docs/cli/#telemetry
plus1 1
s
Indeed it was the telemetry. Thank you! By the way, are you sure itโ€™s compliant to send stats, even anonymous, without asking for explicit consent?
h
For the European Union ๐Ÿ‡ช๐Ÿ‡บ it is definitely not compliant. ๐Ÿ˜ฟ
s
Regarding compliance @loud-island-88694 or @little-megabyte-1074 can answer. We understand that some organisations would not want telemetry to be sent. So we have added a new flag for orgs to control this recently https://github.com/datahub-project/datahub/blob/master/metadata-service/factories/src/main/resources/application.yml#L31 The CLI changes for this are already released. Once we have a new server release the CLI will contact server and update local settings to disable telemetry based on server settings. Documenting the new environment variables is on my radar. The CLI telemetry doc is already publicly available.
๐Ÿ‘ 1