# troubleshooting
h
I am facing an issue with a table. I observe that when I create a table, the data is ingested, but after 1-2 days the new data is not visible. I don't see the latest data. If I delete and re-create the table, it shows the new data. This is an upsert table.
Copy code
"routing": {
        "instanceSelectorType": "strictReplicaGroup"
    },
"query": {},
"upsertConfig": {
    "mode": "PARTIAL",
    "partialUpsertStrategies": {
        "status": "OVERWRITE",
        "tenant_name": "OVERWRITE",
        "sub_tenant_name": "OVERWRITE"
    },
    "defaultPartialUpsertStrategy": "OVERWRITE",
    "hashFunction": "NONE"
},
k
Hi, can you add the complete table config and schema here? Also, can you try querying with
option(skipUpsert=true)
h
Ok, will share the schema and table def.
I checked with option(skipUpsert=true), still I don't see data.
One more thing - I created a new table with the same settings. I am able to see the data in the new table (table_v2) but not in the old table (table_v1).
Here is the table definition:
Copy code
{
  "schemaName": "schema_v1",
  "dimensionFieldSpecs": [   
    {
      "name": "channel",
      "dataType": "STRING"
    },
    {
      "name": "pipeline",
      "dataType": "STRING"
    },    
    {
      "name": "id",
      "dataType": "STRING"
    },
    {
      "name": "id_type",
      "dataType": "STRING"
    },    
    {
      "name": "status",
      "dataType": "STRING"
    },    
    {
      "name": "lob_name",
      "dataType": "STRING"
    }
  ],
  "dateTimeFieldSpecs": [
    {
      "name": "timestamp",
      "dataType": "STRING",
      "format": "1:HOURS:SIMPLE_DATE_FORMAT:yyyy-MM-dd'T'HH:mm:ss.SSS",
      "granularity": "1:MINUTES"
    }
  ],
  "primaryKeyColumns": [
    "id",
    "id_type"
  ]
}



{
    "tableName": "table_v1",
    "tableType": "REALTIME",
    "segmentsConfig": {
        "schemaName": "schema_v1",
        "retentionTimeUnit": "DAYS",
        "retentionTimeValue": "2",
        "replication": "2",
        "timeColumnName": "timestamp",
        "allowNullTimeValue": true,
        "replicasPerPartition": "2"
    },
    "tenants": {
        "broker": "DefaultTenant",
        "server": "DefaultTenant",
        "tagOverrideConfig": {}
    },
    "tableIndexConfig": {
        "invertedIndexColumns": [
            "pipeline",
            "channel"           
        ],
        "noDictionaryColumns": [],
        "streamConfigs": {
            "streamType": "kafka",
            "stream.kafka.topic.name": "----MY TOPOOC-----",
            "stream.kafka.broker.list": "{{kafka}}",
            "stream.kafka.consumer.type": "lowlevel",
            "stream.kafka.consumer.prop.auto.offset.reset": "largest",
            "stream.kafka.consumer.factory.class.name": "org.apache.pinot.plugin.stream.kafka20.KafkaConsumerFactory",
            "stream.kafka.decoder.class.name": "org.apache.pinot.plugin.stream.kafka.KafkaJSONMessageDecoder",
            "realtime.segment.flush.threshold.rows": "0",
            "realtime.segment.flush.threshold.time": "1h",
            "realtime.segment.flush.desired.size": "100M",
            "realtime.segment.flush.autotune.initialRows": "10000"
        },
        "sortedColumn": [],
        "bloomFilterColumns": [
            "channel"
        ],
        "loadMode": "MMAP",
        "onHeapDictionaryColumns": [],
        "varLengthDictionaryColumns": [],
        "enableDefaultStarTree": false,
        "enableDynamicStarTreeCreation": false,
        "aggregateMetrics": false,
        "nullHandlingEnabled": true,
        "rangeIndexColumns": [],
        "rangeIndexVersion": 1,
        "autoGeneratedInvertedIndex": false,
        "createInvertedIndexDuringSegmentGeneration": false
    },
    "metadata": {},
    "quota": {},
    "routing": {
        "instanceSelectorType": "strictReplicaGroup"
    },
    "query": {},
    "upsertConfig": {
        "mode": "PARTIAL",
        "partialUpsertStrategies": {
            "status": "OVERWRITE",
            "lob_name": "OVERWRITE"          
        },
        "defaultPartialUpsertStrategy": "OVERWRITE",
        "hashFunction": "NONE"
    },
    "ingestionConfig": {},
    "isDimTable": false
}
k
Hi, is there a need for partial upsert in this case? With the current config, what you are doing is effectively a full upsert, since every column is getting OVERWRITE applied.
h
The status and lob_name updates can happen in different events… What I need:
1. I have a row, e.g. id=10, lob_name=abcd, status=init, pipeline=p1, channel=c1, udf_1=0
2. Event 1 -> update status=processing, udf_1=10
3. Event 2 -> update lob_name=client_1
4. Event 3 -> update status=done
Expected result: id=10, lob_name=client_1, status=done, pipeline=p1, channel=c1, udf_1=10
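To illustrate that sequence, the event payloads and the expected merged row could look like this (a hypothetical sketch - the id_type value is made up, and udf_1 comes from the example above, not from the posted schema; note that every event has to carry the full primary key, since upserts are matched on the schema's primaryKeyColumns):
Copy code
Initial row: {"id": "10", "id_type": "t1", "lob_name": "abcd", "status": "init", "pipeline": "p1", "channel": "c1", "udf_1": 0}
Event 1:     {"id": "10", "id_type": "t1", "status": "processing", "udf_1": 10}
Event 2:     {"id": "10", "id_type": "t1", "lob_name": "client_1"}
Event 3:     {"id": "10", "id_type": "t1", "status": "done"}

Merged row:  {"id": "10", "id_type": "t1", "lob_name": "client_1", "status": "done", "pipeline": "p1", "channel": "c1", "udf_1": 10}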
What settings do you suggest? The major problem I see is data not coming into the table. When this happened, I checked the server logs and did not see the Kafka consuming logs (normally I see the topic name and offset in the server logs). As I also mentioned above, I created a v2 table. The v1 table had data only till the 28th and v2 has data for the 29th. So the data is there in Kafka; not sure why the v1 table is not showing it.
k
ok. Then what you need to do is set "defaultPartialUpsertStrategy": "IGNORE" and mention only the columns that need to be updated in partialUpsertStrategies, such as status and lob_name with mode OVERWRITE - see the sketch below.
The data not getting consumed is strange though. Are there any segments created for the 29th in the v1 table? If not, can you check the status of the latest segment?
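A minimal sketch of that suggested upsertConfig (assuming status and lob_name are the only columns later events may change; any other column that events can update, such as the udf_1 from the example above, would also need an explicit strategy, because IGNORE keeps the existing value):
Copy code
"upsertConfig": {
    "mode": "PARTIAL",
    "partialUpsertStrategies": {
        "status": "OVERWRITE",
        "lob_name": "OVERWRITE"
    },
    "defaultPartialUpsertStrategy": "IGNORE",
    "hashFunction": "NONE"
},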
h
Done. Created a new table v3 with what you suggested.
No, in the v1 table I have only 10 segments (hist_v1__0__0__20220628T0808Z, hist_v1__1__0__20220628T0808Z, …). Note these tables do not have a lot of data, e.g. <100K-500K records per day for now. It will go to 1-5M per day once I have all the traffic in.
Something is wrong - none of the tables are ingesting new data. I checked the logs and I don't see consuming logs: kubectl -n pinot logs --follow pod/pinot-server-0 | grep my_topic -> no output
k
What does it show in Zookeeper Browser -> IDEAL STATES -> table name?
h
Copy code
hist_v3__0__0__20220629T0659Z": {
      "Server_pinot-server-0.pinot-server-headless.pinot.svc.cluster.local_8098": "CONSUMING",
      "Server_pinot-server-3.pinot-server-headless.pinot.svc.cluster.local_8098": "CONSUMING"
    },
k
and in EXTERNAL VIEW?
h
Same thing.
Also I am sure that Kafka has data - I can see it coming in "kafka-console-consumer".
k
@saurabh dubey can you help here
s
Could you check PROPERTYSTORE in ZK to confirm there are segments in "IN_PROGRESS" state?
h
"segment.realtime.status": "IN_PROGRESS"
Just to give more info: given below is the info about one of the partitions from the Kafka topic (the 10th partition) - this is from PROPERTYSTORE.
Copy code
Same table:

Table v1: created on the 28th:

hist_v1__9__0__20220628T0808Z
29330927


Table v2: created on the 28th (11:30 PM) - because v1 was stuck
hist_v2__9__0__20220628T1927Z
29596616


Table v3: created on the 29th - because v2 was stuck
hist_v3__9__0__20220629T0659Z
29840247

All of them have:
"segment.realtime.status": "IN_PROGRESS"
@saurabh dubey anything on this? FYI - I also created the same table in a different Pinot cluster. Same issue now - it stopped ingesting data.
k
Hi, what is the offset retention period set on the Kafka topic?
h
2 days : PartitionCount: 10 ReplicationFactor: 2 Configs: min.insync.replicas=2,retention.ms=172800000,message.format.version=2.3-IV1,unclean.leader.election.enable=false
If you are looking for the setting of "__consumer_offsets" -> it is 2 days.
k
Got it. And does the consumption stop after 24 hours, or does it happen before that as well? This can be verified by the timestamp difference between the first and last row.
h
On the 29th it stopped 2 times in a day. Both the v2 and v3 tables stopped within a single day.
k
What I wanted to know was the approximate duration in minutes. That might give us some hints as to whether it is a periodic job inside Pinot causing this issue or a random event.
h
Ok, will share the data.
FYI, I also ran the exact same table in a new Pinot cluster - same issue, it is stuck there as well.
Copy code
Table V2 
1656462921
GMT: Wednesday, June 29, 2022 0:35:21
Your time zone: Wednesday, June 29, 2022 6:05:21 GMT+05:30
Relative: 2 days ago


1656470101579
GMT: Wednesday, June 29, 2022 2:35:01.579
Your time zone: Wednesday, June 29, 2022 8:05:01.579 GMT+05:30

--------------------------------------------------------------------


Table V3 (which I created once the V2 table stopped):
1656486065
GMT: Wednesday, June 29, 2022 7:01:05
Your time zone: Wednesday, June 29, 2022 12:31:05 GMT+05:30


1656486423000
GMT: Wednesday, June 29, 2022 7:07:03
Your time zone: Wednesday, June 29, 2022 12:37:03 GMT+05:30
I do see an issue in my timestamps - some timestamps are "1656462921" (seconds) and others are "1656470101579" (3 extra digits, i.e. milliseconds).
Not sure if this is the issue - anyway, I have fixed the issue in my job and now all timestamps are in milliseconds.
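For reference, a minimal sketch of what the dateTimeFieldSpec could look like once all values are epoch milliseconds (a hypothetical replacement for the spec in the schema above, which declares the column as a SIMPLE_DATE_FORMAT string even though the values shown here are epoch numbers):
Copy code
"dateTimeFieldSpecs": [
    {
        "name": "timestamp",
        "dataType": "LONG",
        "format": "1:MILLISECONDS:EPOCH",
        "granularity": "1:MINUTES"
    }
]
Whichever format is used, it has to match what is actually ingested: Pinot derives each segment's time range from this column and the retention manager acts on that range, so mixed seconds/milliseconds values can produce segments with bogus time metadata.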