if a table exists for multiple tenants is it possible to res Apache Pinot #general

Join Slack

if a table exists for multiple tenants, is it poss...

# general

Josh Highley

07/06/2021, 6:36 PM

if a table exists for multiple tenants, is it possible to restrict query results to a single tenant?

Mayank

07/06/2021, 6:50 PM

What do you mean by tenant here?

Josh Highley

07/06/2021, 6:51 PM

the Tenant component of Pinot https://docs.pinot.apache.org/basics/components/tenant

Josh Highley

07/06/2021, 6:52 PM

we need to segregate client data

Josh Highley

07/06/2021, 6:54 PM

well, looking at docs, can I specify multiple tenants when creating a table?

Copy code

"tenants": {
      "broker": "myBrokerTenant", 
      "server": "myServerTenant"
    },

Mayank

07/06/2021, 7:15 PM

A table can only have one tenant for server and one for broker. A tenant can be shared across tables

Josh Highley

07/06/2021, 7:17 PM

well, dang. So if we need to segregate data by client (tenant) then each table requires a unique name?

Mayank

07/06/2021, 7:17 PM

No it does not

Mayank

07/06/2021, 7:18 PM

You can have a single table on single tenant and have all clients data on the same table?

Josh Highley

07/06/2021, 7:19 PM

no -- our clients don't like their data mixed.

Mayank

07/06/2021, 7:19 PM

Then have separate table per client?

Josh Highley

07/06/2021, 7:19 PM

each client needs their own 'customer' table, as an example

Mayank

07/06/2021, 7:20 PM

Yeah so 1 client - 1 table - 1 tenant if you want to complete separation

Mayank

07/06/2021, 7:20 PM

Not a scalable mode perhaps

Mayank

07/06/2021, 7:20 PM

But seems like that is what your customers are asking for

Josh Highley

07/06/2021, 7:22 PM

no, without multi-tenancy, each client would have their own environment. Each environment would have the same tables.

Mayank

07/06/2021, 7:23 PM

What’s is an environment? Helix cluster? If so, then two helix clusters are completely air gapped and you are fine

Josh Highley

07/06/2021, 7:23 PM

our hope was to have TenantA on BrokerA and ServerA with table 'Customers'. Then also, TenantB on BrokerB and ServerB with table 'Customers'...

Josh Highley

07/06/2021, 7:23 PM

I was using 'environment' in a general sense: a set of servers.

Mayank

07/06/2021, 7:23 PM

It me it sounds like separate tables? If so, why does the name of table need to be same?

Mayank

07/06/2021, 7:24 PM

Because customers may end up having their own schema as well in future?

Mayank

07/06/2021, 7:25 PM

Note that you cannot have multiple tables with same name in one cluster

Josh Highley

07/06/2021, 7:25 PM

because we have 100s of clients. Managing tables Customer_ClientA, Customer_ClientB, Customer_ClientC gets very cumbersome

Josh Highley

07/06/2021, 7:25 PM

there's lots of tables for each customer also

Mayank

07/06/2021, 7:26 PM

I think you want same table across all customers but then no two customers can share the same set of brokers/servers?

Josh Highley

07/06/2021, 7:27 PM

right. Their data needs to be kept separate

Mayank

07/06/2021, 7:27 PM

That is also not scalable if you have 100's of customers. For durability, you will end up having 3 brokers + 3 servers per customer, regardless of what amount of data they have.

Mayank

07/06/2021, 7:27 PM

One way is to partition the data on customerId. But that will segregate at partition level and not customer level.

Mayank

07/06/2021, 7:28 PM

Perhaps customers really want is customer level ACL?

Mayank

07/06/2021, 7:28 PM

If so, that can be built on a mid-tier layer on top of single table in Pinot?

Josh Highley

07/06/2021, 7:29 PM

our customers are financial companies -- mixing data across those companies isn't an option

Mayank

07/06/2021, 7:29 PM

What you are trying to use the tenant concept in Pinot is not what it is meant for, and doesn't solve your problem.

Mayank

07/06/2021, 7:29 PM

A table in Pinot can only have one tenant for server and one for broker

Open in Slack

Previous Next