dazzling-appointment-34954
02/02/2022, 6:09 PM
late-bear-87552
02/02/2022, 6:11 PM
source:
  type: "bigquery"
  config:
    ## Coordinates
    project_id: adf-adfa-240416
    credential:
      project_id: adf-adfa-240416
      private_key_id: ""
      private_key: "-----BEGIN PRIVATE KEY"
      client_email: ""
      client_id: ""
    table_pattern:
      deny:
        -
sink:
  type: "datahub-rest"
  config:
    server: "http://localhost:8080"
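If the goal is to skip tables whose names start with temp_, one way to try out a candidate deny regex before putting it in the recipe is a small Python sketch like the one below. It assumes that each deny entry is a regular expression matched from the start of the fully qualified name `project.dataset.table`; the pattern, the `is_denied` helper, and the sample dataset names are hypothetical and only illustrate the idea.

```python
import re

# Assumption: each deny entry is a regex matched from the start of the
# fully qualified name "project.dataset.table".
# Hypothetical pattern: the table part (after the last dot) starts with "temp_".
DENY_PATTERNS = [r".*\.temp_[^.]*$"]


def is_denied(qualified_name, deny_patterns=DENY_PATTERNS):
    """Return True when any deny regex matches the qualified table name."""
    return any(re.match(pattern, qualified_name) for pattern in deny_patterns)


if __name__ == "__main__":
    # Sample names are made up for illustration.
    for name in (
        "adf-adfa-240416.sales.temp_orders",
        "adf-adfa-240416.sales.orders",
        "adf-adfa-240416.temp_stage.orders",
    ):
        print(name, "->", "denied" if is_denied(name) else "kept")
```

The `[^.]*$` tail keeps the pattern from also matching a dataset that happens to be called temp_something. If the regex behaves as expected here, the same string could be tried as the entry under deny.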
I want to deny tables whose names start with temp_. Can anyone help me with the YAML file?
handsome-football-66174
02/02/2022, 10:35 PM
calm-river-44367
02/03/2022, 1:59 PM
strong-iron-17184
02/03/2022, 2:40 PM
numerous-eve-42142
02/03/2022, 9:07 PM
better-orange-49102
02/04/2022, 10:34 AM
gifted-queen-61023
02/04/2022, 10:57 AM
setup.py install for sasl3 did not run successfully.
│ exit code: 1
╰─> [28 lines of output]
/.../datahub/metadata-ingestion/venv/lib/python3.8/site-packages/setuptools/dist.py:697: UserWarning: Usage of dash-separated 'description-file' will not be supported in future versions. Please use the underscore name 'description_file' instead
note: This error originates from a subprocess, and is likely not a problem with pip.
error: legacy-install-failure
Thanks in advance 🙌
sparse-planet-56664
02/04/2022, 12:19 PM
brief-toothbrush-55766
02/04/2022, 2:47 PM
DatasetAspect dataAspect = new DatasetAspect();
dataAspect.setOwnership(getOwnership(dataset));
dataAspect.setSchemaMetadata(getSchemaMetadata(dataset));
dataAspect.setInstitutionalMemory(getInstitutionalMemory(dataset));
dataAspect.setDatasetProperties(new DatasetProperties().setDescription("Gama Test description").setCustomProperties(map));
MetadataChangeProposalWrapper mcpw = MetadataChangeProposalWrapper.builder()
    .entityType("dataset")
    .entityUrn("urn:li:dataset:(urn:li:dataPlatform:s3,test,PROD)")
    //.entityUrn("urn:li:dataset:(foo,bar,PROD)")
    .upsert()
    .aspect(dataAspect).aspectName("dataset")
    .build();
ancient-author-86397
02/04/2022, 9:22 PM
compileMainGeneratedDataTemplateJava fails because it cannot find the symbol FabricType.
calm-river-44367
02/06/2022, 9:52 AM
shy-parrot-64120
02/07/2022, 9:30 AM
UnboundVariable: ': unbound variable'
--- running:
datahub ingest run -c glue.yml
shy-parrot-64120
02/07/2022, 1:06 PM
high-toothbrush-90528
02/07/2022, 1:48 PM
#9 252.3 There are 31 data schema input files. Using input root folder: /datahub-src/li-utils/src/main/pegasus
#9 252.9 [main] INFO com.linkedin.pegasus.generator.PegasusDataTemplateGenerator - Generating 32 files
#9 253.1
#9 253.1 FAILURE: Build failed with an exception.
#9 253.1
#9 253.1 * What went wrong:
#9 253.1 Execution failed for task ':metadata-models:generateGitProperties'.
#9 253.1 > gradlegitproperties.org.eclipse.jgit.errors.MissingObjectException: Missing unknown 0b1d79ea8d4295908f5f808431e7a8b5faba6759
#9 253.1
plain-farmer-27314
02/07/2022, 3:34 PM
datahub ingest list-runs is not showing all of the different ingestion jobs we run. We run 5 or so jobs daily, but the latest one it shows is from 02/05, and it's missing several jobs from each day as well. I double-checked our Airflow logs and confirmed the jobs ran successfully each day.
Wondering if this is a known issue or if I'm missing something here
calm-river-44367
02/08/2022, 7:49 AM
strong-iron-17184
02/08/2022, 2:06 PM
wooden-football-7175
02/08/2022, 2:32 PM
{datahub.py:122} ERROR - ('Unable to emit metadata to DataHub GMS', {'message': "Invalid URL '<host>/entities?action=ingest': No schema supplied. Perhaps you meant http://<host>/entities?action=ingest?"})
Does anyone have an idea about this?
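The error text itself says the URL was built without a scheme ("No schema supplied"), so the server address configured for the emitter probably lacks the leading http:// or https://. A minimal sketch of a guard, using only the Python standard library; `ensure_scheme` and its default are hypothetical helpers, not part of DataHub:

```python
from urllib.parse import urlparse


def ensure_scheme(server, default_scheme="http"):
    """Prefix the server address with a scheme when it has none."""
    server = server.strip()
    # A bare "host:port" has no http/https scheme, so it gets the default.
    if urlparse(server).scheme in ("http", "https"):
        return server
    return f"{default_scheme}://{server}"


if __name__ == "__main__":
    print(ensure_scheme("localhost:8080"))
    print(ensure_scheme("https://gms.example.com"))
```

In practice the simpler fix is likely to write the full URL, scheme included, wherever the GMS host is configured.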
Thanks
modern-monitor-81461
02/08/2022, 5:33 PM
'message': "Parameters of method 'ingest' failed validation with error 'ERROR :: "
'/entity/value/com.linkedin.metadata.snapshot.DatasetSnapshot/aspects/2/com.linkedin.schema.SchemaMetadata/fields/3/type/type/com.linkedin.schema.ArrayType/nestedType '
':: array type is not backed by a DataList\n'
I understand that there is a problem with my SchemaMetadata aspect about the 4th field. It is indeed an array:
SchemaFieldClass({"fieldPath": "[version=2.0].[type=struct].[type=struct].leaf_cert.[type=array].[type=string].all_domains", "jsonPath": None, "nullable": True, "description": None, "type": SchemaFieldDataTypeClass({"type": ArrayTypeClass({"nestedType": None})}), "nativeDataType": "list<string>", "recursive": False, "globalTags": None, "glossaryTerms": None, "isPartOfKey": False, "jsonProps": "{"native_data_type": "list<string>"}"}),
all_domains is a List<String> in Iceberg, so I'm modeling it as an Array of String in DataHub.
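One possible reading of the validation error, assuming the ArrayType model declares nestedType as an optional array of strings: the value that reached the server at nestedType was not a list (for example a bare string), and "DataList" is the name of the list container the validator expects there. The helper below is a hypothetical, standard-library-only sketch of coercing the value into a list of strings before the field is built:

```python
def normalize_nested_type(nested_type):
    """Return nested_type as a list of strings, or None when it is unset."""
    if nested_type is None:
        return None
    # A bare string such as "string" becomes a one-element list.
    if isinstance(nested_type, str):
        return [nested_type]
    # Any other iterable is converted to a list of strings.
    return [str(item) for item in nested_type]


if __name__ == "__main__":
    print(normalize_nested_type("string"))
    print(normalize_nested_type(None))
```

The result could then be passed as the nestedType argument of ArrayTypeClass, so that the field is either absent or an actual list.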
What is wrong with my code now that I'm on 0.8.25? What does "array type is not backed by a DataList" mean?
ambitious-cartoon-15344
02/09/2022, 3:27 AM
blue-plastic-11088
02/09/2022, 5:56 AM
dazzling-appointment-34954
02/09/2022, 7:33 AM
few-air-56117
02/09/2022, 1:03 PM
damp-queen-61493
02/09/2022, 3:07 PM
Failed to remove term: An unknown error occurred.
GMS log:
15:06:25.538 [Thread-9955] INFO c.l.d.g.r.mutate.RemoveTermResolver:52 - Removing Term. input: {}
15:06:25.545 [Thread-9955] ERROR c.l.d.g.r.mutate.RemoveTermResolver:63 - Failed to perform update against input com.linkedin.datahub.graphql.generated.TermAssociationInput@43af2fc3, Failed to validate record with class com.linkedin.common.GlossaryTerms: ERROR :: /editableSchemaFieldInfo :: unrecognized field found but not allowed
ERROR :: /terms :: field is required but not found and has no default value
ERROR :: /auditStamp :: field is required but not found and has no default value
15:06:25.546 [Thread-9955] ERROR c.l.d.g.e.DataHubDataFetcherExceptionHandler:21 - Failed to execute DataFetcher
java.util.concurrent.CompletionException: java.lang.RuntimeException: Failed to perform update against input com.linkedin.datahub.graphql.generated.TermAssociationInput@43af2fc3
at java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:273)
at java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:280)
at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1606)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.lang.RuntimeException: Failed to perform update against input com.linkedin.datahub.graphql.generated.TermAssociationInput@43af2fc3
at com.linkedin.datahub.graphql.resolvers.mutate.RemoveTermResolver.lambda$get$0(RemoveTermResolver.java:64)
at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1604)
... 1 common frames omitted
Caused by: com.linkedin.metadata.entity.ValidationException: Failed to validate record with class com.linkedin.common.GlossaryTerms: ERROR :: /editableSchemaFieldInfo :: unrecognized field found but not allowed
ERROR :: /terms :: field is required but not found and has no default value
ERROR :: /auditStamp :: field is required but not found and has no default value
at com.linkedin.metadata.entity.ValidationUtils.lambda$validateOrThrow$0(ValidationUtils.java:19)
at com.linkedin.metadata.entity.RecordTemplateValidator.validate(RecordTemplateValidator.java:37)
at com.linkedin.metadata.entity.ValidationUtils.validateOrThrow(ValidationUtils.java:17)
at com.linkedin.metadata.entity.EntityService.ingestProposal(EntityService.java:398)
at com.linkedin.datahub.graphql.resolvers.mutate.MutationUtils.persistAspect(MutationUtils.java:33)
at com.linkedin.datahub.graphql.resolvers.mutate.util.LabelUtils.removeTermFromTarget(LabelUtils.java:69)
at com.linkedin.datahub.graphql.resolvers.mutate.RemoveTermResolver.lambda$get$0(RemoveTermResolver.java:54)
... 2 common frames omitted
15:06:25.547 [Thread-9954] ERROR c.datahub.graphql.GraphQLController:94 - Errors while executing graphQL query: "mutation removeTerm($input: TermAssociationInput!) {\n removeTerm(input: $input)\n}\n", result: {errors=[{message=An unknown error occurred., locations=[{line=2, column=3}], path=[removeTerm], extensions={code=500, type=SERVER_ERROR, classification=DataFetchingException}}], data={removeTerm=null}}, errors: [DataHubGraphQLError{path=[removeTerm], code=SERVER_ERROR, locations=[SourceLocation{line=2, column=3}]}]
Datahub version: 0.8.25
How can I recover from this error, and what is the correct flow to remove a glossary term that is associated with a dataset and its fields?
wooden-football-7175
02/09/2022, 3:52 PM
datahub-rest config? I'm trying to execute a recipe from the web UI and I'm getting an error on publish, even though from the console it works fine!
nutritious-bird-77396
02/09/2022, 9:36 PM
0.8.26 from 0.8.24
When trying to list groups from the UI, it throws an Invalid urn error...
Error stack in the 🧵
nutritious-bird-77396
02/09/2022, 9:53 PM
metadata_index table - https://github.com/arunvasudevan/datahub/blob/master/docker/mysql-setup/init.sql#L42
2. I don't see it in postgres-setup
I am assuming it's just missed, right? https://github.com/arunvasudevan/datahub/blob/master/docker/postgres-setup/init.sql
few-air-56117
02/10/2022, 7:07 AM
helm repo update and install datahub 0.8.26
but I got these errors
14 Thu Feb 10 08:53:50 2022 superseded datahub-0.2.42 0.8.24 Upgrade complete
15 Thu Feb 10 08:58:38 2022 deployed datahub-0.2.45 0.8.26 Upgrade complete