Hi! We created a datahub-prerequisites chart to help folks quickstart on kubernetes. We have tested that it is able to handle up to 300K records. Please refer to our kubernetes guide
https://datahubproject.io/docs/deploy/kubernetes On the components side, kafka and elasticsearch is a required component for DataHub. We have implemented our graph storage using neo4j and elasticsearch. Some of the community members wanted the extensive capabilities of neo4j to do advanced graph operations. However, DataHub internally does not use any complicated graph queries, which is why we also gave an option to use elasticsearch as the graph storage layer, reducing required components by one.
https://github.com/acryldata/datahub-helm/blob/master/charts/datahub/quickstart-values-without-neo4j.yaml#L46 Refer to this chart on how to point to elasticsearch instead of neo4j. If you do so, you can set neo4j-community.enabled to false in the values.yaml for prerequisites chart.