Does anyone auto-classify data (e.g. pii/sensitive...
# feature-requests
a
Does anyone auto-classify data (e.g. pii/sensitive) and integrated it with DataHub? We're attempting something similar to https://datahubproject.io/docs/metadata-ingestion/source_docs/sql_profiles/ except we would 1. sample actual data 2. run it through classifier 3. produce MCEs
l
Hi Nick! This isn’t on our immediate roadmap, but it’s a great suggestion! I’ve added it to our feature request portal https://feature-requests.datahubproject.io/b/Developer-Experience/p/auto-classify-data-as-pii-sensitive
thank you 1
w
Hi Nick there is a google API if you want to use. Its called google DLP and Inspect commands can be used for automatic data classification. We can leverage that for Datahub