Hello.
Does anyone have any advice to share about real-world Elasticsearch cluster sizes and index sizes, please?
We're currently planning a deployment of DataHub from scratch, with at most a few tens of thousands of datasets from Hive, Druid, Cassandra, etc. I'm also considering hosting the graph database on Elasticsearch rather than Neo4j.
I'm looking at a 3-node Elasticsearch cluster for high availability, but I wondered if anyone could share their experience of sizing an Elasticsearch cluster for a similar workload, so I can make sure I'm not massively over- or under-speccing it.
Thanks.