thanks all; to provide a little context, we’re looking at DH to provide a key piece of the Data Mesh self-service platform, covering the discoverability of Data Products. Data Products have both a semantic schema as well as one or more syntactic schemas. The former is general, not (necessarily) specified by a specific format like Protobuf, Avro, GraphQL, SQL, etc. However, the latter, the syntactic schema, is one or more of those. A single Data Product can have multi-modal outputs, so both Avro and GraphQL for example. We’re exploring options at this point. DH could have the semantic schema, and we build support for generating syntactic schemas. This reduces the burden on the Data Product developers so they only need to know about and define one schema (format), the semantic one. Or we could just push it to the Data Products and they need to define and provide the schemas for each output type they support. But the key is that it’s not tied to a data*set* but to a data product, which could have many datasets (or none/infinite one for a streaming case). Hope that helps. 🙂