# pinot-perf-tuning
k
Is there a way to have an inverted index for a column, but not store the column data? So a pure filter-only field?
m
The inv index has a dictId-to-docIds mapping. You need the dictionary to store the values. This is the current implementation
k
Not yet, but it's not hard to do this. File an issue
k
@Kishore G would it make sense to add a “noStorageColumns” config setting for tables?
k
yes, something along those lines
also, add some points on why this feature is important
is it purely about storage on disk? Because Pinot will not read the forward index if it's never accessed in a query
m
Just for my understanding, is this a request for a sparse dictionary? I am missing something: don't we need to have some storage for values to be able to reference them from queries?
Oh, so have the inv index but not the fwd index
k
@Mayank there are three things: forward index, dictionary, inverted index
m
Yeah got it
k
what @Ken Krugler is asking for is not to store the forward index
m
Yes. It would be great to see how much storage is being used for the fwd index in your case @Ken Krugler. The index_map file inside the segment dir has that info.
k
I must be missing something, given the above discussion 🙂 In Lucene, you can have a field in an index which only has the terms-to-docIdSet mappings, but without any stored data. Given what you said above, it sounds like the equivalent is to have the forward dictionary (so you have dict ids) and the inverted index (to map from a dict id to a set of doc ids), but no actual data, yes?
m
What you are referring to as actual data maps to the fwd index. That also does not hold the raw values; it is just the encoded dictIds per docId
To add more detail:
```
Dictionary: value -> id map
Fwd index:  for each docId -> dictId
Inv index:  for each dictId -> list of docIds
```
With dictionary encoding and bit encoding (10 bits can represent 2^10 unique values), you can get compression.
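To make the bit-encoding point concrete, here is a minimal sketch (plain Java, not Pinot code) of how the forward index bit width follows from the dictionary cardinality:

```java
// Minimal sketch (not Pinot code): with N unique values in the dictionary,
// each dictId in the forward index needs ceil(log2(N)) bits.
public class BitWidthSketch {
    static int bitsNeeded(int cardinality) {
        // One unique value still needs 1 bit; otherwise ceil(log2(cardinality)).
        return Math.max(1, 32 - Integer.numberOfLeadingZeros(cardinality - 1));
    }

    public static void main(String[] args) {
        System.out.println(bitsNeeded(1024));   // 10 bits, the 2^10 example above
        System.out.println(bitsNeeded(144997)); // 18 bits, matching the metadata later in this thread
    }
}
```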
k
We have a multi-valued field, so in that case the fwd index is what?
m
So the question to you is: are you trying to reduce storage cost? If so, the only thing you can eliminate is the fwd index. Let's check its size for your segments.
k
We’re blowing the 2gb limit for a column. So yes, I guess you’d call that a “storage cost” 🙂
m
for MV: you can think of docId -> [list of dictIds]
For that, you might want to reduce num docs per segment instead.
k
When our next build succeeds (where we're increasing the number of segments) I can check the fwd index sizes
m
Do you have star tree?
k
Yes, though not with that column
m
Ok, then I am curious to know the metadata for that column (cardinality, etc)
k
We’ve found that having # of segments <= number of available server threads really helps our query performance, thus the balancing act with segment size
roughly 300K unique terms
(it’s a text field that we’re tokenizing/normalizing)
m
Do you have text index for that column?
k
No - it would be huge, and all we really need is term-level filtering
m
Also, it might not make sense to have a dictionary on that column (if you have filters on other columns)
Ok, then explore no-dict index for that column
Ok, once you have the index generated, please share the metadata.properties for that column.
That will help me understand if no-dict or some other index might be better for that column
k
OK, thanks
m
For dict, we pad strings to make them the same length, and that could lead to a lot of storage wastage.
Metadata will tell us
k
wow, yes that would be an issue
I could throw in a filter to remove long terms, which would also help
m
no-dict will eliminate padding and hence reduce size. But there is no inv index for no-dict, so you would need to rely on setting indexes on other columns
for high cardinality with uneven string sizes, no-dict gives better overall size
k
Right, but sounds like what would be the best match for our use case would be a dictionary + inv index, without the forward index.
m
I think the wastage from padding in dictionary might be the root cause, and if so, removing fwd index won’t help
Let’s look at the index sizes and metadata once we have that
k
Max term length is 20, average term length is 6, so assume ~14 bytes/term of padding waste * 400K terms = 5.6MB
And yes, agree that examining the metadata is the right next step.
m
Sounds good
k
From metadata.properties:
```
column.landingPageText_terms.cardinality = 144997
column.landingPageText_terms.totalDocs = 6100482
column.landingPageText_terms.dataType = STRING
column.landingPageText_terms.bitsPerElement = 18
column.landingPageText_terms.lengthOfEachEntry = 45
column.landingPageText_terms.columnType = DIMENSION
column.landingPageText_terms.isSorted = false
column.landingPageText_terms.hasNullValue = false
column.landingPageText_terms.hasDictionary = true
column.landingPageText_terms.textIndexType = NONE
column.landingPageText_terms.hasInvertedIndex = true
column.landingPageText_terms.isSingleValues = false
column.landingPageText_terms.maxNumberOfMultiValues = 4984
column.landingPageText_terms.totalNumberOfEntries = 312834131
column.landingPageText_terms.isAutoGenerated = false
column.landingPageText_terms.maxValue = \uFF42\uFF49\uFF5A
column.landingPageText_terms.defaultNullValue = null
```
And top four columns by size:
```
landingPageText_terms.forward_index.size	743576242
destinationUrl.forward_index.size	199861484
creativeText.forward_index.size	99124657
imageUrl.forward_index.size	95375146
```
m
What about inv index and dict size for landingPageText?
Oh it is multi valued?
k
yes
m
My guess is the inv index might be even bigger
Can you share inv index and dict size?
k
So where is inv index size?
m
Index_map file
k
All I’ve got for landingPageText_terms is:
```
landingPageText_terms.dictionary.startOffset = 406482808
landingPageText_terms.dictionary.size = 6524873
landingPageText_terms.forward_index.startOffset = 413007681
landingPageText_terms.forward_index.size = 743576242
```
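Those numbers line up with the metadata above. A rough back-of-envelope check (the small leftovers are assumed, not confirmed, to be headers plus the per-doc offset/length bookkeeping an MV forward index needs):

```java
// Back-of-envelope check against the metadata and index_map numbers above.
public class SizeCheck {
    public static void main(String[] args) {
        long cardinality = 144_997;        // column.landingPageText_terms.cardinality
        long paddedEntryBytes = 45;        // lengthOfEachEntry (padded width)
        long totalEntries = 312_834_131L;  // totalNumberOfEntries (all MV values)
        long bitsPerElement = 18;

        System.out.println(cardinality * paddedEntryBytes);    // 6,524,865 vs 6,524,873 reported
        System.out.println(totalEntries * bitsPerElement / 8); // ~703.9M vs 743,576,242 reported
    }
}
```

So the dictionary padding costs only ~6.5MB in total here; the 743MB forward index is dominated by the sheer number of MV entries, not by padding.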
m
No inv index on this column?
k
Hmm, the metadata file says
`column.landingPageText_terms.hasInvertedIndex = true`
m
Unfortunately it always says that
If the index_map file does not show it and you don't have it in the indexing config, then there is no inv index
k
and that column is in the `tableIndexConfig`’s `invertedIndexColumns` list
m
Hmm
Oh, there’s this config to generate the inv index offline vs in the server during loading
k
I didn’t build these segments, someone else at the company did, but I believe the tableIndexConfig matches
m
But if the index_map does not have that info then it is not built yet
k
ah, right
"createInvertedIndexDuringSegmentGeneration": false,
m
Typically inv index size for MV columns might be bigger than fwd index
k
Should be a dict id, and a bitset, right?
(compressed bitset, like RoaringDocIdSet)
m
Yes
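A minimal sketch of that shape using RoaringBitmap (illustrative only; Pinot's actual reader/writer classes differ):

```java
import java.util.HashMap;
import java.util.Map;
import org.roaringbitmap.RoaringBitmap;

// Illustrative inverted index: dictId -> compressed set of docIds.
public class InvIndexSketch {
    private final Map<Integer, RoaringBitmap> dictIdToDocIds = new HashMap<>();

    void add(int dictId, int docId) {
        dictIdToDocIds.computeIfAbsent(dictId, k -> new RoaringBitmap()).add(docId);
    }

    // Filter: all docIds whose column contains the value behind dictId.
    RoaringBitmap matches(int dictId) {
        return dictIdToDocIds.getOrDefault(dictId, new RoaringBitmap());
    }
}
```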
I have seen this pattern in the past, where the server OOMs when building the inv index of MV columns (2GB limit)
We get around it by reducing the num docs per segment
Is adding more cores to the server not an option?
k
Adding more servers is an option, yes. Just trying to figure out bounds on what we can do here.
But forward index for this column is 750M (out of a total of 886M, for this segment), so getting rid of that would be nice.
m
Yes, agree, if you definitely need inv index on the column. Otherwise, we need to check which of the two is smaller (fwd vs inv)
k
we need to be able to filter using terms, so yes I think the inv index is a requirement
m
Well, if there are other filters in the query which eliminate a lot of rows, maybe not
k
Is there documentation on the format of the forward index? I’m also curious how that gets compressed (using Snappy?), if at all.
m
Uses min number of bits to represent dictIds
There is no additional compression on top of that for dict columns
k
yeah, just seems like you’d need an additional table to map from docId to a bit offset into the bit-packed dictIds, and a count of how many dictIds exist for that docId. The Lucene index formats deal with similar issues, and get pretty complex trying to trade off size for lookup speed.
m
For single value we don't need an offset; for MV, yes
k
yes, I’m interested in the MV case
Which source file should I look at, if there’s no documentation?
m
FixedBitMVForwardIndexWriter
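For orientation before reading that file, a simplified sketch of the layout it writes (plain int arrays here for clarity; the real writer bit-packs both the values and the bookkeeping):

```java
// Simplified sketch of a multi-value forward index, not the actual on-disk format.
public class MvFwdIndexSketch {
    private final int[] startOffsets; // startOffsets[docId] = first position in 'values'
    private final int[] numValues;    // numValues[docId] = how many dictIds this doc has
    private final int[] values;       // all dictIds, concatenated doc by doc

    MvFwdIndexSketch(int[] startOffsets, int[] numValues, int[] values) {
        this.startOffsets = startOffsets;
        this.numValues = numValues;
        this.values = values;
    }

    // Read the dictIds for one docId.
    int[] getDictIds(int docId) {
        int[] out = new int[numValues[docId]];
        System.arraycopy(values, startOffsets[docId], out, 0, out.length);
        return out;
    }
}
```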
s
We recently did this for the text index. But didn't remove the forward index completely. The raw text data was huge and was taking up a ton of storage. So we stored a dummy value in the fwd index, dictionary encoded
This was much easier than changing the semantics completely by not having the fwd index physically
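To see why that trick helps so much, a rough estimate for the column in this thread (assumptions: one dummy dictId per doc, so dictionary cardinality 1 and 1 bit per entry):

```java
// Rough estimate of fwd index size after the dummy-value trick (assumptions above).
public class DummyFwdEstimate {
    public static void main(String[] args) {
        long totalDocs = 6_100_482L;       // from the metadata earlier in the thread
        System.out.println(totalDocs / 8); // ~762KB, vs the ~743MB stored today
    }
}
```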
k
Thanks @Sidd I guess I could look into that and see how hard it would be to do the same thing for an arbitrary column, given a table config setting.
@Sidd - where exactly in the code are you writing out a dummy fwd index for text columns?
s
This doesn't go all the way in not having the fwd index physically. I am interested in seeing how we can possibly not have the fwd index at all, and whether it is worth it or not, given that with the above change the storage overhead is already significantly reduced
👍 2
k
isn't it a matter of having an empty forward index reader impl?
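A hypothetical sketch of that idea (the interface below is invented for illustration; Pinot's actual forward index reader API differs):

```java
// Hypothetical interface, for illustration only.
interface FwdReaderSketch {
    int[] getDictIds(int docId);
}

// A filter-only column could return a dummy result for every doc, so nothing
// needs to exist on disk for the forward index; filtering still goes through
// the dictionary and inverted index.
class EmptyFwdReaderSketch implements FwdReaderSketch {
    private static final int[] EMPTY = new int[0];

    @Override
    public int[] getDictIds(int docId) {
        return EMPTY;
    }
}
```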