ripe-eye-60209
12/07/2022, 8:56 AMincalculable-ocean-74010
12/07/2022, 9:33 AMripe-eye-60209
12/07/2022, 9:43 AMacceptable-morning-73148
12/07/2022, 9:43 AM2022-12-07T09:12:49.415Z | Args are RestoreIndicesArgs(start=1081000, batchSize=1000, numThreads=1, batchDelayMs=100, aspectName=null, urn=null, urnLike=null)
2022-12-07T09:12:49.415Z | Reading rows 1081000 through 1082000 from the aspects table started.
2022-12-07T09:12:49.415Z | Reading rows 1081000 through 1082000 from the aspects table completed.
2022-12-07T09:12:49.415Z | metrics so far RestoreIndicesResult(ignored=0, rowsMigrated=1081000, timeSqlQueryMs=641, timeGetRowMs=0, timeUrnMs=2354, timeEntityRegistryCheckMs=827, aspectCheckMs=580, createRecordMs=62871, sendMessageMs=308002)
2022-12-07T09:12:49.415Z | Successfully sent MAEs for 1081000/1282980 rows (84.26% of total). 0 rows ignored (0.00% of total)
2022-12-07T09:12:49.415Z | 1394.20 mins taken. 260.50 est. mins to completion. Total mins est. = 1654.70.ripe-eye-60209
12/07/2022, 9:44 AMincalculable-ocean-74010
12/07/2022, 10:31 AMripe-eye-60209
12/07/2022, 10:38 AMincalculable-ocean-74010
12/07/2022, 10:40 AMKafkaJob :
public class KafkaJob implements Callable<RestoreIndicesResult>
That processes a subset of the data from the DB: https://github.com/datahub-project/datahub/blob/626a06445a39457e276c59352ff58a2fd2[…]va/com/linkedin/datahub/upgrade/restoreindices/SendMAEStep.javaincalculable-ocean-74010
12/07/2022, 10:42 AMKafkaJob will call a method (restoreIndices) in GMS’s EntityService which will use a SQL connection to the DB, meaning that if the DB is configured to have say 50 connections and GMS has the same config you can only run at most 50 KafkaJobs at the same time but it would likely have an unintended side effect if GMS needed a sql connection for some other work.incalculable-ocean-74010
12/07/2022, 10:42 AMKafkaJob fairly low but this depends on your config.ripe-eye-60209
12/07/2022, 2:31 PM