# general
a
Hi! I’m evaluating Pinot for the following use case and want to know if it’s a good fit, or any best practices to help achieve it.
• Ingest events for ~1B total users at ~100k/second
• Run aggregation queries on events filtered on individual user IDs at ~10k/second, each query completing in < 100ms
What I understand is that the data is organized primarily by time (segments) and secondarily (within a segment) by indexes. In this case, I tried sorting by user ID. To query for a particular user ID, it seems that each segment must be queried, since the data is not consolidated by user. The runtime would be O(s log n), where s is the number of segments in a particular timeframe and n is the number of events per segment. Thus, it seems that Pinot may not scale when there are tens or hundreds of thousands of segments and may not be a good fit here. However, this use case seems similar to the use cases at LinkedIn, such as the “who’s viewed your profile” feature, which also operates on events for individual users. Is my understanding correct, and is there anything I’m missing here? Would appreciate any thoughts or resources you could point me to. Thanks!
m
Hello, if the query is for a single user, you can partition the data on the user ID and achieve very high scalability. The numbers you quoted should be easily achievable. In fact, we are powering very similar use cases with Pinot at LinkedIn.
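A minimal sketch of what user-ID partitioning buys you here (illustrative only: the hash below is a generic stand-in, not Kafka's or Pinot's actual partitioner, and `NUM_PARTITIONS` is a made-up number — in practice the Kafka producer does this when you set the record key to the user ID):

```python
import hashlib

NUM_PARTITIONS = 128  # hypothetical; must match the Kafka topic

def partition_for(user_id: str) -> int:
    # Illustrative stand-in for a keyed partitioner: any deterministic
    # hash works, as long as Pinot is configured with the same function
    # and partition count so the broker can prune to one partition.
    digest = hashlib.md5(user_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % NUM_PARTITIONS

# All events for one user land in the same partition, so a query
# filtered on that user ID only touches segments from that partition.
assert partition_for("10000070d3f29ba15aac40b1") == partition_for("10000070d3f29ba15aac40b1")
```

With this in place, a per-user query no longer scans all s segments in the time range, only the ones belonging to that user's partition.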
a
Thanks! I am reading up on partitioning. So the partitioning must be done by Kafka, and the settings in Pinot must match it, e.g. the number of partitions and the partition function? What happens when you need to repartition the data? Are there any best practices for the number of partitions?
m
Yes, partitioning must be done in Kafka as well as in offline ingestion. By repartitioning, do you mean increasing the number of partitions? That is supported.
If you mean change the partition function, it may have latency impact and we don't recommend that.
Please also read up on replica groups in pinot, that also helps with scaling.
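For reference, a hedged sketch of what partition-aware routing with replica groups can look like in a table config. Field names are taken from the Pinot documentation but may vary by version, and all numbers here are illustrative, not a recommendation:

```json
{
  "segmentsConfig": {
    "replication": "3",
    "replicaGroupStrategyConfig": {
      "partitionColumn": "userId",
      "numInstancesPerPartition": 2
    }
  },
  "routing": {
    "segmentPrunerTypes": ["partition"],
    "instanceSelectorType": "replicaGroup"
  }
}
```

The idea is that each query is routed to one replica group rather than fanning out to every server, so adding replica groups scales QPS roughly linearly.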
BTW, stay tuned for our virtual meetup (date coming soon) where we will also talk about this.
k
+1 to using partitioning and replica group placement etc
message has been deleted
@Andrew First you can see how the latency improved with partition aware + replica group implementation
a
Thanks! This is very helpful info. Yes, I mean increasing the # of partitions, not changing the function. Would you need to increase the partitions in Pinot first, and then in Kafka, since they can't be done at the same time?
k
Pinot does not have a concept of partitions as such; it just derives them from Kafka partitions
each segment's metadata is self-describing:
• total partitions
• hashing function
• partition IDs
so one segment can be (100, MOD, 5) and another can be (110, MOD, [10, 11, 15])
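A hedged sketch of what that per-segment metadata enables on the broker side (structure and names are mine, not Pinot's internals): because each segment records its own (numPartitions, function, partitionIds), segments built with different partition counts can coexist, and pruning just checks the query value against each segment's own metadata.

```python
# Each segment carries its own partition metadata, as in the
# (100, MOD, 5) vs (110, MOD, [10, 11, 15]) example above.
segments = [
    {"name": "seg_1", "numPartitions": 100, "partitionIds": {5}},
    {"name": "seg_2", "numPartitions": 110, "partitionIds": {10, 11, 15}},
]

def prune(segments, query_value: int):
    """Keep only segments that could contain the queried value."""
    kept = []
    for seg in segments:
        # MOD is applied under this segment's own partition count
        if query_value % seg["numPartitions"] in seg["partitionIds"]:
            kept.append(seg["name"])
    return kept
```

For example, `prune(segments, 205)` keeps only `seg_1` (205 % 100 = 5), while `prune(segments, 120)` keeps only `seg_2` (120 % 110 = 10), so increasing the Kafka partition count mid-stream does not break pruning for older segments.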
a
I see, so you don't need to specify any of this in the config, e.g.:
```json
"segmentPartitionConfig": {
  "columnPartitionMap": {
    "memberId": {
      "functionName": "Modulo",
      "numPartitions": 4
    }
  }
}
```
k
I think that's needed when you generate batch segments
a
got it, thanks! will play around with this for a bit
k
in the case of Kafka, we derive that automatically. I might be wrong here, but that was our initial thinking
in any case, you get the idea
a
does the partition field have to be an integer? I partitioned on userId, which is a String, and when I query on userId, e.g.
```sql
select * from events where userId = '10000070d3f29ba15aac40b1' limit 10
```
I get:
```
ProcessingException(errorCode:450, message:InternalError:
java.io.IOException: Failed : HTTP error code : 500
```
In the broker logs:
```
java.lang.NumberFormatException: For input string: "15380218d3181aa3dc2d3c05"
	at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) ~[?:1.8.0_265]
	at java.lang.Integer.parseInt(Integer.java:580) ~[?:1.8.0_265]
	at java.lang.Integer.parseInt(Integer.java:615) ~[?:1.8.0_265]
	at org.apache.pinot.core.data.partition.ModuloPartitionFunction.getPartition(ModuloPartitionFunction.java:56) ~[pinot-all-0.5.0-SNAPSHOT-jar-with-dependencies.jar:0.5.0-SNAPSHOT-1d4d47adfe7abf0c3ed8a3a14929de084e979968]
```
m
I think you need to configure the partition function
a
I configured it this way:
```json
"segmentPartitionConfig": {
  "columnPartitionMap": {
    "userId": {
      "functionName": "Modulo",
      "numPartitions": 128
    }
  }
},
```
m
We recommend MurmurPartitionFunction
Modulo needs numeric values, I think
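That matches the stack trace above: a sketch of why Modulo fails on a string user ID while a hash-based function does not. This is only an illustration (Python's `ValueError` standing in for Java's `NumberFormatException`, and md5 standing in for Pinot's actual Murmur2-based `MurmurPartitionFunction`):

```python
import hashlib

def modulo_partition(value: str, num_partitions: int) -> int:
    # Parses the value as an integer before taking the modulo, which is
    # exactly what blows up on "15380218d3181aa3dc2d3c05".
    return int(value) % num_partitions

def hash_partition(value: str, num_partitions: int) -> int:
    # A hash-based function works for arbitrary strings, since it hashes
    # the bytes instead of parsing them as a number.
    h = int.from_bytes(hashlib.md5(value.encode("utf-8")).digest()[:4], "big")
    return h % num_partitions
```

Whichever function is chosen, the Kafka producer and the Pinot `segmentPartitionConfig` must agree on it, or pruning will route queries to the wrong partition.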
Also, for the scale you mentioned, we are using 32 partitions at LinkedIn.
a
thanks!
m
Correction: 64 partitions for 100k events per second (with 4 dimensions and one metric).