creamy-smartphone-10810
05/12/2022, 11:25 AMrecipe.yaml
the scheduling? I’m executing it with datahub ingest -c recipe.yaml
but it performs a oneshoot ingestion, and I would to schedule them! thx in advance!agreeable-army-26750
05/12/2022, 2:18 PMpip install -e '.[dev]'
and quickstarted the system.
Is it possible to configure the UI (or the metadata-service) to call my new custom datahub cli ingestion instead of the default one?
Maybe I have to configure something on the UI?
Thanks for your answers in advance!fresh-garage-83780
05/12/2022, 5:16 PMsource:
type: trino
config:
env: CORP
platform: trino
host_port: "<http://trino.example.com:443|trino.example.com:443>"
database: dbname
username: foo
This recipe above connects via http and so times out. If I add a password
value it connects correctly via https because of this line in dialect.py, but this doesn't help as Trino throws a 401 (counter-intuitively)
curl -X POST <https://foo:bar@trino.example.com:443/v1/statement>
401 Password not allowed for insecure authentication
I can't seem to find any options for the options block that can override this. I tried using sqlalchemy_uri
too, but likewise couldn't find a way to set http_scheme through the connection string. Hope someone can point me in the right direction?handsome-football-66174
05/12/2022, 6:06 PMmillions-sundown-65420
05/12/2022, 7:38 PMchilly-gpu-46080
05/13/2022, 3:38 AMrich-policeman-92383
05/13/2022, 12:15 PMbillions-table-9927
05/13/2022, 1:47 PMsalmon-midnight-86020
05/13/2022, 7:39 PMcuddly-arm-8412
05/15/2022, 2:53 AMalert-football-80212
05/15/2022, 9:49 AMechoing-farmer-38304
05/15/2022, 2:26 PMbest-umbrella-24804
05/16/2022, 6:15 AMWARNING - DataHubValidationAction does not recognize this GE data asset type - <class 'great_expectations.validator.validator.Validator'>.
INFO - Metadata not sent to datahub. No datasets found.
My code looks like this
I'm not sure how to specify what datasets should be mapped?microscopic-mechanic-13766
05/16/2022, 8:40 AMWARNING: acryl-datahub 0.8.32.1 does not provide the extra 'presto-on-hive'
cuddly-arm-8412
05/16/2022, 10:13 AMbland-morning-36590
05/16/2022, 10:08 PMbrave-pager-62740
05/16/2022, 11:16 PMcuddly-arm-8412
05/17/2022, 2:23 AMbest-umbrella-24804
05/17/2022, 5:45 AMfrom datahub.integrations.great_expectations.action import DataHubValidationAction
Before it would stop hanging, it seems that merely importing this package causes the hangingprehistoric-salesclerk-23462
05/17/2022, 2:54 PMmillions-waiter-49836
05/17/2022, 4:40 PMalert-football-80212
05/17/2022, 7:23 PMchilly-gpu-46080
05/18/2022, 7:38 AMpolite-application-51650
05/18/2022, 12:25 PMpowerful-librarian-82760
05/18/2022, 12:42 PMpowerful-librarian-82760
05/18/2022, 5:20 PMcuddly-arm-8412
05/19/2022, 1:08 AMpolite-application-51650
05/19/2022, 5:27 AMbest-wolf-3369
05/19/2022, 9:30 AMurn:li:glossaryTerm:camelcaseobject
instead of the good one urn:li:glossaryTerm:camelCaseObject.
Could you provide some insight?
import requests
import json
url = "<http://host>:port/entities?action=ingest"
payload = json.dumps({
"entity": {
"value": {
"com.linkedin.metadata.snapshot.GlossaryTermSnapshot": {
"urn": "urn:li:glossaryTerm:camelCaseObject",
"aspects": [
{
"com.linkedin.glossary.GlossaryTermInfo": {
"definition": "Object definition",
"parentNode": "urn:li:glossaryTerm:camelCaseObjectParent",
"sourceRef": "DataHub",
"sourceUrl": "<https://github.com/linkedin/datahub/>",
"termSource": "INTERNAL"
}
}
]
}
}
}
})
headers = {
'Content-Type': 'application/json'
}
response = requests.request("POST", url, headers=headers, data=payload)
Thank you very much.great-cpu-72376
05/19/2022, 10:26 AM<http://localhost:9090/openapi/entities/v1/>
And I pasted this json (I copied what is reported in https://datahubproject.io/docs/how/add-custom-data-platform):
{
"entity": {
"value": {
"com.linkedin.metadata.snapshot.DataPlatformSnapshot": {
"aspects": [
{
"com.linkedin.dataplatform.DataPlatformInfo": {
"datasetNameDelimiter": "/",
"name": "filesystem",
"type": "FILE_SYSTEM",
"doc": "local filesystem"
}
}
],
"urn": "urn:li:dataPlatform:filesystem"
}
}
}
}
On gms I found this error:
WARN o.s.w.s.m.s.DefaultHandlerExceptionResolver:208 - Resolved [org.springframework.http.converter.HttpMessageNotReadableException: JSON parse error: Cannot deserialize value of type `java.util.ArrayList<io.datahubproject.openapi.dto.UpsertAspectRequest>` from Object value (token `JsonToken.START_OBJECT`); nested exception is com.fasterxml.jackson.databind.exc.MismatchedInputException: Cannot deserialize value of type `java.util.ArrayList<io.datahubproject.openapi.dto.UpsertAspectRequest>` from Object value (token `JsonToken.START_OBJECT`)<EOL> at [Source: (org.springframework.util.StreamUtils$NonClosingInputStream); line: 1, column: 1]]