acceptable-musician-1893
04/07/2022, 9:22 PMplain-napkin-77279
04/08/2022, 2:40 AMbumpy-activity-74405
04/08/2022, 7:45 AM0.8.23
-> 0.8.32.1
I’ve lost most of my looker charts (±58k to ±1k). I don’t have a deep understanding of what happens when gms is starting but I suspect that this happened, because the container was killed in the middle of bootstrapping due to misconfiguration of initialDelaySeconds
on my part. I see around ±58k chart records in mysql (aspect = chartKey
) and chartindex_v2
es index. I’ve tried reingesting but only 100-200 more charts show up after ingesting the ±58k elements. I suspect that the couple hundred more that appear in the UI are the ones that had any changes. How does one go about fixing something like this?breezy-portugal-43538
04/08/2022, 8:56 AMcuddly-lunch-28022
04/08/2022, 8:56 AMprehistoric-dawn-23569
04/08/2022, 9:00 AMCaused by: javax.net.ssl.SSLPeerUnverifiedException: Certificate for <datahub-gms-main-tls-service.datahub.svc.cluster.local> doesn't match any of the subject alternative names: [staging.svc.eqiad.wmnet, staging.svc.codfw.wmnet]
cuddly-lunch-28022
04/08/2022, 9:38 AMbusy-shampoo-12116
04/08/2022, 6:29 PMtypes.generated.ts
and get the repo to create my-entity.generated.ts
in the graphql folder? thank you in advance!adamant-magazine-62649
04/08/2022, 11:57 PMdatahub docker quickstart --quickstart-compose-file ./docker/quickstart/docker-compose.quickstart.yml
from the root of the datahub repo
Fetching docker-compose file https://raw.githubusercontent.com/datahub-project/datahub/master/docker/quickstart/docker-compose-without-neo4j.quickstart.yml from GitHub
WARNING: The HOME variable is not set. Defaulting to a blank string.
Pulling elasticsearch ... done
Pulling elasticsearch-setup ... done
Pulling mysql ... done
Pulling datahub-gms ... pulling from linkedin/datahub-gms
Pulling datahub-frontend-react ... pulling from linkedin/datahub-fro...
Pulling datahub-actions ... done
Pulling mysql-setup ... done
Pulling zookeeper ... done
Pulling broker ... done
Pulling schema-registry ... done
Pulling kafka-setup ... done
ERROR: for datahub-frontend-react no matching manifest for linux/amd64 in the manifest list entries
ERROR: for datahub-gms no matching manifest for linux/amd64 in the manifest list entries
ERROR: no matching manifest for linux/amd64 in the manifest list entriesplain-napkin-77279
04/09/2022, 12:18 AMmany-guitar-67205
04/11/2022, 8:26 AMdatahub delete
output is ambiguous, and the data is not gone:
❯ datahub delete --entity_type dataset --platform kafka --hard
This will permanently delete data from DataHub. Do you want to continue? [y/N]: y
[2022-04-11 10:17:22,059] INFO {datahub.cli.delete_cli:200} - datahub configured with <http://localhost:8080>
[2022-04-11 10:17:22,182] INFO {datahub.cli.delete_cli:212} - Filter matched 22 entities. Sample: ['urn:li:dataset:(urn:li:dataPlatform:kafka,
... (22 urns)
]
This will delete 22 entities. Are you sure? [y/N]: y
100% (22 of 22) |################################################################################################################################################################################################################| Elapsed Time: 0:00:01 Time: 0:00:01
Took 6.673 seconds to hard delete 0 rows for 22 entities
the gms debug log shows 22 successful delete actions, but the output of the command says 0 rows
The data is not deleted.
What can I do to
a. troubleshoot this further
b. actually delete the data
?creamy-van-28626
04/11/2022, 10:50 AMcreamy-van-28626
04/11/2022, 10:51 AMable-rain-74449
04/11/2022, 2:20 PMsee thread
salmon-manchester-60485
04/11/2022, 2:28 PMearly-midnight-66457
04/12/2022, 7:42 AMhundreds-ability-78888
04/12/2022, 4:43 PMquick-pizza-8906
04/13/2022, 7:31 AMThe bean 'kafkaProducerFactory', defined in class path resource [org/springframework/boot/autoconfigure/kafka/KafkaAutoConfiguration.class], could not be registered. A bean with that name has already been defined in URL [jar:file:/datahub/datahub-upgrade/bin/datahub-upgrade.jar!/BOOT-INF/lib/factories.jar!/com/linkedin/gms/factory/kafka/KafkaProducerFactory.class] and overriding is disabled.
Any ideas where might it be coming from?colossal-easter-99672
04/13/2022, 10:06 AMbusy-waiter-6669
04/13/2022, 10:08 AMstrong-kite-83354
04/13/2022, 10:41 AMhttps://datahubspace.slack.com/files/U02NY1Q2PFG/F03B8HD4ETF/image.png▾
breezy-portugal-43538
04/13/2022, 1:57 PMsymbol: class Generated
location: package javax.annotation.processing
/sharedvolume/datahub/datahub-graphql-core/src/mainGeneratedGraphQL/java/com/linkedin/datahub/graphql/generated/Filter.java:7: error: cannot find symbol
@javax.annotation.processing.Generated(
^
symbol: class Generated
location: package javax.annotation.processing
100 errors
> Task :datahub-graphql-core:compileJava FAILED
FAILURE: Build failed with an exception.
* What went wrong:
Execution failed for task ':datahub-graphql-core:compileJava'.
> Compilation failed; see the compiler error output for details.
* Try:
Run with --info or --debug option to get more log output. Run with --scan to get full insights.
* Exception is:
org.gradle.api.tasks.TaskExecutionException: Execution failed for task ':datahub-graphql-core:compileJava'.
at org.gradle.api.internal.tasks.execution.ExecuteActionsTaskExecuter.lambda$executeIfValid$3(ExecuteActionsTaskExecuter.java:186)
at org.gradle.internal.Try$Failure.ifSuccessfulOrElse(Try.java:268)
at org.gradle.api.internal.tasks.execution.ExecuteActionsTaskExecuter.executeIfValid(ExecuteActionsTaskExecuter.java:184)
at org.gradle.api.internal.tasks.execution.ExecuteActionsTaskExecuter.execute(ExecuteActionsTaskExecuter.java:173)
at org.gradle.api.internal.tasks.execution.CleanupStaleOutputsExecuter.execute(CleanupStaleOutputsExecuter.java:109)
at org.gradle.api.internal.tasks.execution.FinalizePropertiesTaskExecuter.execute(FinalizePropertiesTaskExecuter.java:46) [37/1861]
at org.gradle.api.internal.tasks.execution.ResolveTaskExecutionModeExecuter.execute(ResolveTaskExecutionModeExecuter.java:62)
at org.gradle.api.internal.tasks.execution.SkipTaskWithNoActionsExecuter.execute(SkipTaskWithNoActionsExecuter.java:57)
at org.gradle.api.internal.tasks.execution.SkipOnlyIfTaskExecuter.execute(SkipOnlyIfTaskExecuter.java:56)
at org.gradle.api.internal.tasks.execution.CatchExceptionTaskExecuter.execute(CatchExceptionTaskExecuter.java:36)
at org.gradle.api.internal.tasks.execution.EventFiringTaskExecuter$1.executeTask(EventFiringTaskExecuter.java:77)
at org.gradle.api.internal.tasks.execution.EventFiringTaskExecuter$1.call(EventFiringTaskExecuter.java:55)
at org.gradle.api.internal.tasks.execution.EventFiringTaskExecuter$1.call(EventFiringTaskExecuter.java:52)
at org.gradle.internal.operations.DefaultBuildOperationRunner$CallableBuildOperationWorker.execute(DefaultBuildOperationRunner.java:200)
at org.gradle.internal.operations.DefaultBuildOperationRunner$CallableBuildOperationWorker.execute(DefaultBuildOperationRunner.java:195)
at org.gradle.internal.operations.DefaultBuildOperationRunner$3.execute(DefaultBuildOperationRunner.java:75)
at org.gradle.internal.operations.DefaultBuildOperationRunner$3.execute(DefaultBuildOperationRunner.java:68)
at org.gradle.internal.operations.DefaultBuildOperationRunner.execute(DefaultBuildOperationRunner.java:153)
at org.gradle.internal.operations.DefaultBuildOperationRunner.execute(DefaultBuildOperationRunner.java:68)
at org.gradle.internal.operations.DefaultBuildOperationRunner.call(DefaultBuildOperationRunner.java:62)
at org.gradle.internal.operations.DefaultBuildOperationExecutor.lambda$call$2(DefaultBuildOperationExecutor.java:76)
at org.gradle.internal.operations.UnmanagedBuildOperationWrapper.callWithUnmanagedSupport(UnmanagedBuildOperationWrapper.java:54)
at org.gradle.internal.operations.DefaultBuildOperationExecutor.call(DefaultBuildOperationExecutor.java:76)
at org.gradle.api.internal.tasks.execution.EventFiringTaskExecuter.execute(EventFiringTaskExecuter.java:52)
at org.gradle.execution.plan.LocalTaskNodeExecutor.execute(LocalTaskNodeExecutor.java:41)
at org.gradle.execution.taskgraph.DefaultTaskExecutionGraph$InvokeNodeExecutorsAction.execute(DefaultTaskExecutionGraph.java:411)
at org.gradle.execution.taskgraph.DefaultTaskExecutionGraph$InvokeNodeExecutorsAction.execute(DefaultTaskExecutionGraph.java:398)
at org.gradle.execution.taskgraph.DefaultTaskExecutionGraph$BuildOperationAwareExecutionAction.execute(DefaultTaskExecutionGraph.java:391)
at org.gradle.execution.taskgraph.DefaultTaskExecutionGraph$BuildOperationAwareExecutionAction.execute(DefaultTaskExecutionGraph.java:377)
at org.gradle.execution.plan.DefaultPlanExecutor$ExecutorWorker.lambda$run$0(DefaultPlanExecutor.java:127)
at org.gradle.execution.plan.DefaultPlanExecutor$ExecutorWorker.execute(DefaultPlanExecutor.java:191)
at org.gradle.execution.plan.DefaultPlanExecutor$ExecutorWorker.executeNextNode(DefaultPlanExecutor.java:182)
at org.gradle.execution.plan.DefaultPlanExecutor$ExecutorWorker.run(DefaultPlanExecutor.java:124)
at org.gradle.internal.concurrent.ExecutorPolicy$CatchAndRecordFailures.onExecute(ExecutorPolicy.java:64)
at org.gradle.internal.concurrent.ManagedExecutorImpl$1.run(ManagedExecutorImpl.java:48)
at org.gradle.internal.concurrent.ThreadFactoryImpl$ManagedThreadRunnable.run(ThreadFactoryImpl.java:56)
Caused by: org.gradle.api.internal.tasks.compile.CompilationFailedException: Compilation failed; see the compiler error output for details.
at org.gradle.api.internal.tasks.compile.JdkJavaCompiler.execute(JdkJavaCompiler.java:57)
at org.gradle.api.internal.tasks.compile.JdkJavaCompiler.execute(JdkJavaCompiler.java:40)
at org.gradle.api.internal.tasks.compile.daemon.AbstractDaemonCompiler$CompilerWorkAction.execute(AbstractDaemonCompiler.java:135)
at org.gradle.workers.internal.DefaultWorkerServer.execute(DefaultWorkerServer.java:63)
at org.gradle.workers.internal.AbstractClassLoaderWorker$1.create(AbstractClassLoaderWorker.java:49)
at org.gradle.workers.internal.AbstractClassLoaderWorker$1.create(AbstractClassLoaderWorker.java:43)
at org.gradle.internal.classloader.ClassLoaderUtils.executeInClassloader(ClassLoaderUtils.java:97)
at org.gradle.workers.internal.AbstractClassLoaderWorker.executeInClassLoader(AbstractClassLoaderWorker.java:43)
at org.gradle.workers.internal.FlatClassLoaderWorker.run(FlatClassLoaderWorker.java:32)
at org.gradle.workers.internal.FlatClassLoaderWorker.run(FlatClassLoaderWorker.java:22)
at org.gradle.workers.internal.WorkerDaemonServer.run(WorkerDaemonServer.java:85)
at org.gradle.workers.internal.WorkerDaemonServer.run(WorkerDaemonServer.java:55)
at org.gradle.process.internal.worker.request.WorkerAction$1.call(WorkerAction.java:138)
at org.gradle.process.internal.worker.child.WorkerLogEventListener.withWorkerLoggingProtocol(WorkerLogEventListener.java:41)
at org.gradle.process.internal.worker.request.WorkerAction.run(WorkerAction.java:135)
at org.gradle.internal.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:36)
at org.gradle.internal.dispatch.ReflectionDispatch.dispatch(ReflectionDispatch.java:24)
at org.gradle.internal.remote.internal.hub.MessageHubBackedObjectConnection$DispatchWrapper.dispatch(MessageHubBackedObjectConnection.java:182)
at org.gradle.internal.remote.internal.hub.MessageHubBackedObjectConnection$DispatchWrapper.dispatch(MessageHubBackedObjectConnection.java:164)
at org.gradle.internal.remote.internal.hub.MessageHub$Handler.run(MessageHub.java:414)
at org.gradle.internal.concurrent.ExecutorPolicy$CatchAndRecordFailures.onExecute(ExecutorPolicy.java:64)
at org.gradle.internal.concurrent.ManagedExecutorImpl$1.run(ManagedExecutorImpl.java:48)
at org.gradle.internal.concurrent.ThreadFactoryImpl$ManagedThreadRunnable.run(ThreadFactoryImpl.java:56)
I'm not sure where does it come from and how to handle this, I think i did everything correctly according to the instruction here: https://datahubproject.io/docs/developers/
Also, since I am not familiar with datahub repository, how big is the scope of the change to make endpoint_url for aws changeable via the yml file? Could you provide some info on what to check and where to look and what to change in what src file?
Thank you deeply for all the help you guys provide : )red-napkin-59945
04/13/2022, 4:43 PMContainer
and Domain
?curved-crayon-1929
04/13/2022, 5:05 PMsource:
type: glue
config:
aws_region: us-east-2
aws_access_key_id: AKIA226GV
aws_secret_access_key: j4EzEH12YEQLw0p4+K
aws_session_token: null
database_pattern:
allow:
- "billing"
table_pattern:
allow:
- "billingtable"
sink:
type: datahub-rest
config:
server: '<http://localhost:8080>'
nutritious-machine-80578
04/13/2022, 6:21 PMgentle-father-80172
04/13/2022, 6:27 PMPython 3.9.2
required for Looker ingestion? Thanks
TypeError: You should use `typing_extensions.TypedDict` instead of `typing.TypedDict` with Python < 3.9.2. Without it, there is no way to differentiate required and optional fields when subclassed.
[2022-04-13 18:18:03,597] INFO {datahub.entrypoints:161} - DataHub CLI version: 0.8.32.6 at /home/ubuntu/.local/lib/python3.8/site-packages/datahub/__init__.py
[2022-04-13 18:18:03,597] INFO {datahub.entrypoints:164} - Python version: 3.8.10 (default, Mar 15 2022, 12:22:08)
[GCC 9.4.0] at /usr/bin/python3 on Linux-5.4.0-1045-aws-x86_64-with-glibc2.29
[2022-04-13 18:18:03,597] INFO {datahub.entrypoints:167} - GMS config {'models': {}, 'versions': {'linkedin/datahub': {'version': 'v0.8.32', 'commit': '7080798825c4ac696c074d335a7eab7d510346c8'}}, 'managedIngestion': {'defaultCliVersion': '0.8.32.1', 'enabled': True}, 'statefulIngestionCapable': True, 'supportsImpactAnalysis': False, 'telemetry': {'enabledCli': True, 'enabledIngestion': False}, 'retention': 'true', 'noCode': 'true'}
nutritious-bird-77396
04/13/2022, 9:51 PMv0.8.32.5
so its not blocking...able-rain-74449
04/14/2022, 8:48 AMable-rain-74449
04/14/2022, 8:50 AMnutritious-jackal-99119
04/14/2022, 9:01 AM