what kind of hashing function does Self-Managed En...
# ask-ai
t
what kind of hashing function does Self-Managed Enterprise use for PII masking? does the same input always return the same output?
u
Based on the provided knowledge sources, I can offer some information about hashing functions used in Airbyte Self-Managed Enterprise for PII masking, though the details are somewhat limited. Airbyte Self-Managed Enterprise includes a feature called "Column Hashing" which is designed to protect sensitive information by hashing personally identifiable information (PII) as it moves through data pipelines [Enterprise Setup](https://docs.airbyte.com/enterprise-setup). While the specific hashing function isn't explicitly stated for the Enterprise version, an example in the Connector Builder UI documentation shows the use of an MD5 hash function. The example demonstrates how to apply a hash function to a field, using the syntax
{{ record['from']['name'] | hash('md5') }}
[Connector Builder UI](https://docs.airbyte.com/connector-development/connector-builder-ui/record-processing#adding-fields). Regarding whether the same input always returns the same output, this is typically a characteristic of cryptographic hash functions like MD5. However, without more specific information about the implementation in Airbyte Self-Managed Enterprise, I cannot definitively confirm this behavior. For more detailed information about the specific hashing function used in Self-Managed Enterprise and its properties, you may need to consult Airbyte's enterprise documentation or reach out to their support team directly.