# troubleshooting
s
Hello all. We just upgraded one of our test k8s namespaces from .10 to .11 by deleting all of the statefulsets (with cascade=orphan), then deploying, then deleting the zookeeper, broker, controller, server, and minion pods in that order, and then re-deploying. Pinot was upgraded successfully, but our segments were not downloaded from the deepstore. Looks like zookeeper doesn't have knowledge of them. We know we can use LaunchDataIngestionJob to load the deepstore segments (BTW this is a hybrid table), but I'm curious what we would do in production. Was there a step we missed that would have made pinot recognize the segments in the deepstore and automatically load them?
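For reference, the orphan delete and the manual deepstore re-ingestion described above would look roughly like this (a sketch; the namespace, statefulset names, and job spec path are placeholders, not the actual values):
```
# Orphan-delete the statefulsets so the pods and their PVs survive the delete.
kubectl -n pinot-test delete statefulset pinot-zookeeper pinot-controller \
  pinot-broker pinot-server pinot-minion --cascade=orphan

# Manually (re)load deepstore segments with the admin tool.
bin/pinot-admin.sh LaunchDataIngestionJob -jobSpecFile /path/to/ingestionJobSpec.yaml
```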
k
where is the zookeeper dataDir? was it using a PV or local storage
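One quick way to check (a sketch; the pod name, config path, and label assume a typical helm zookeeper deployment and may differ in yours):
```
# Print the configured dataDir from the running zookeeper pod.
kubectl -n pinot-test exec pinot-zookeeper-0 -- grep dataDir /conf/zoo.cfg

# See whether a PVC/PV is backing it.
kubectl -n pinot-test get pvc -l app=zookeeper
```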
l
hey Stuart, we're also in the process of upgrading to .11 in our test clusters. wondering why you had to use cascade=orphan? also, according to this guide https://docs.pinot.apache.org/operators/operating-pinot/upgrading-pinot-cluster there’s no need to redeploy zk
s
@Kishore G it was using a pv
@Luis Fernandez we used that option so the PVs would not get deleted (our thinking anyway)
Yeah, maybe our problem was redeploying zookeeper in the first place
In any case this was a test env so we will use these learnings
l
you have all your old tables in your test env, just not the data?
s
Correct and the realtime table started ingesting so that part was all good
m
If zk snapshots are available on the PVs, you can restore it from there. In general though, I’d run ZK separately from Pinot.
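The restore would be roughly this shape (a sketch; names and paths are illustrative, and ZK keeps its snapshots and txn logs under dataDir/version-2):
```
# Stop zookeeper, restore the snapshot files onto the PV, then bring it back.
kubectl -n pinot-test scale statefulset pinot-zookeeper --replicas=0
# ...copy the saved version-2/ contents back into the dataDir on the PV...
kubectl -n pinot-test scale statefulset pinot-zookeeper --replicas=1
```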
s
I think we just nuked zk and that was our issue sounds like
Yeah, we feel confident we could restore the data, and it looks like we caused the issue ourselves by messing w/ zk
l
if zk gets nuked and you lose information, i would imagine you would also lose your old tables and all, which is interesting. at least that has been my experience when messing with zk 😄 always scared to do anything to it
🌟 1
(given that zk stores all that information if i’m not mistaken)
s
Well we have an init script that is idempotent that creates them 🙂
l
oooo i see i see
that explains that
s
ya lol
l
i guess there should be ways to restore this info and also make copies of it elsewhere. do you have something like that set up? cause we don’t and maybe we should 😄
m
Are you destroying and recreating tables with each deployment? That should not be necessary. You can upgrade Pinot bits in a deployment without having to recreate tables, and with zero downtime
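In practice that's just an image bump rolled one component at a time (a sketch; statefulset and container names are placeholders, and the controller-then-broker-then-server order follows the upgrade guide linked above):
```
# Roll each component to the new image and wait for it to settle before the next.
kubectl -n pinot-test set image statefulset/pinot-controller controller=apachepinot/pinot:0.11.0
kubectl -n pinot-test rollout status statefulset/pinot-controller

kubectl -n pinot-test set image statefulset/pinot-broker broker=apachepinot/pinot:0.11.0
kubectl -n pinot-test rollout status statefulset/pinot-broker

kubectl -n pinot-test set image statefulset/pinot-server server=apachepinot/pinot:0.11.0
kubectl -n pinot-test rollout status statefulset/pinot-server
```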
s
Nope, it doesn't destroy them, it's idempotent (for the most part)
Yeah I think we just made a mistake here by nuking zk, lesson learned.
m
But I am still curious about the need for an idempotent script to create tables (and applying it during deployment?).
s
Well for us it's to make local development easier
We use skaffold for local dev
We do a few things in that init script: we add cluster configs, server tags, schemas, and such
For now the table portion is very simple: just create it if it doesn't exist. We don't make any changes to an existing table yet; we may add some sort of "migration" process for that later on
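The create-if-not-exists part can be as small as a check against the controller REST API (a sketch; the controller URL, table name, and config file are placeholders, not the actual script):
```
CONTROLLER=http://pinot-controller:9000
TABLE=myTable

# GET /tables lists existing table names; only POST the config if ours is missing.
if ! curl -s "$CONTROLLER/tables" | grep -q "\"$TABLE\""; then
  curl -s -X POST "$CONTROLLER/tables" \
    -H 'Content-Type: application/json' \
    -d @table-config.json
fi
```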