bland-orange-13353
05/02/2023, 6:02 PMquiet-television-68466
05/03/2023, 9:17 AMurn:li:dataset:(urn:li:dataPlatform:snowflake,source.github.pull_requests,PROD)
, but its corresponding schema and database have the following urns: urn:li:container:0080ebfa374633b2294b7ff38c82923b, urn:li:container:0a6efd87d585a012e259f1457f68ce0d
The main use case for us is having them be parsable in the same way the dataset urns are, but if its not possible that’s fine!acoustic-quill-54426
05/03/2023, 3:29 PMcolossal-hairdresser-6799
05/03/2023, 3:36 PMancient-queen-15575
05/03/2023, 4:21 PMbucket_duration
variable?
An initial run of a snowflake ingestion I’m trying takes about 3 minutes. If I use stateful ingestion and remove the ignore_start_time_lineage: true
line, then a rerun takes about 30s. That seems great but what I understood from the docs is that only lineage changes from the past day will be picked up like this. It would be nice if the past few days were checked incase Datahub went down for a few days.
Is there a way to configure checking, for example, the past 3 days? I see there’s a bucket_duration
variable that’s an enum, but what are the accepted values for it? I can’t see any documentation for that.brainy-oxygen-20792
05/03/2023, 4:50 PM--select
) on their own schedule, which may be daily or hourly.
So scheduling DataHub to pull on a schedule means our assertions (run_results.json) may not be complete.
Ideas we're considering in the thread.purple-salesmen-12745
05/03/2023, 7:07 PMbland-lighter-26751
05/03/2023, 7:26 PMbland-orange-13353
05/03/2023, 7:57 PMlively-dusk-19162
05/03/2023, 8:50 PMable-evening-90828
05/04/2023, 1:57 AM0.10.2
server and 0.10.2.2
CLI for the UI ingestion.
We looked at the metadata_aspect_v2
table and noticed the dataPlatformInstance
is missing the instance
field in the metadata
column. We saw the following:
{"platform":"urn:li:dataPlatform:mysql"}
as opposed to
{"platform":"urn:li:dataPlatform:mysql","instance":"<OUR_PLATFORM_INSTANCE_URN"}
We have never had such problem before. Has anyone else seen this?steep-midnight-37232
05/04/2023, 10:49 AMadamant-honey-44884
05/04/2023, 1:24 PMloud-hospital-37195
05/04/2023, 5:38 PMloud-hospital-37195
05/04/2023, 5:39 PM~~~~ Execution Summary - RUN_INGEST ~~~~
Execution finished with errors.
{'exec_id': '7741f040-bc32-4b90-9182-bd273621ab7e',
'infos': ['2023-05-04 16:58:30.727353 INFO: Starting execution for task with name=RUN_INGEST',
"2023-05-04 16:58:34.832761 INFO: Failed to execute 'datahub ingest'",
'2023-05-04 16:58:34.832922 INFO: Caught exception EXECUTING task_id=7741f040-bc32-4b90-9182-bd273621ab7e, name=RUN_INGEST, '
'stacktrace=Traceback (most recent call last):\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/default_executor.py", line 122, in execute_task\n'
' task_event_loop.run_until_complete(task_future)\n'
' File "/usr/local/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete\n'
' return future.result()\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 231, in execute\n'
' raise TaskError("Failed to execute \'datahub ingest\'")\n'
"acryl.executor.execution.task.TaskError: Failed to execute 'datahub ingest'\n"],
'errors': []}
~~~~ Ingestion Logs ~~~~
Obtaining venv creation lock...
Acquired venv creation lock
venv setup time = 0
This version of datahub supports report-to functionality
datahub ingest run -c /tmp/datahub/ingest/7741f040-bc32-4b90-9182-bd273621ab7e/recipe.yml --report-to /tmp/datahub/ingest/7741f040-bc32-4b90-9182-bd273621ab7e/ingestion_report.json
[2023-05-04 16:58:33,369] INFO {datahub.cli.ingest_cli:165} - DataHub CLI version: 0.10.0
[2023-05-04 16:58:33,395] INFO {datahub.ingestion.run.pipeline:179} - Sink configured successfully. DataHubRestEmitter: configured to talk to <http://datahub-datahub-gms:8080>
Failed to configure the source (datahub-business-glossary): 1 validation error for BusinessGlossarySourceConfig
file
file or directory at path "cristina.narros/testterms.yml" does not exist (type=value_error.path.not_exists; path=cristina.narros/testterms.yml)
lemon-scooter-69730
05/05/2023, 9:52 AMgreat-notebook-53658
05/07/2023, 7:32 AMgreat-notebook-53658
05/08/2023, 2:00 AMbest-wire-59738
05/08/2023, 3:44 AMSSLV3_ALERT_HANDSHAKE_FAILURE
error while connecting to MariaDB using ssl account. can you please help me to overcome this issue.loud-hospital-37195
05/08/2023, 8:53 AMdelightful-painter-8227
05/08/2023, 10:21 AMacceptable-morning-73148
05/08/2023, 11:41 AMmake_container_urn
function. Is there a particular reason for that? Can we use another string instead of a UUID given the fact that our custom containers always have a unique name?fierce-finland-15121
05/08/2023, 10:58 PMdatahub-actions actions -c /etc/datahub/actions/system/conf/executor.yaml
%3|1683586489.986|FAIL|rdkafka#consumer-1| [thrd:sasl_ssl://<my broker url>/bootstr]: sasl_ssl://<my broker url>/bootstrap: SASL authentication error: Authentication failed (after 5055ms in state AUTH_REQ, 5 identical error(s) suppressed)
cool-flag-71835
05/08/2023, 11:07 PMflaky-refrigerator-97518
05/09/2023, 2:48 AMimportant-afternoon-19755
05/09/2023, 6:20 AMcolossal-hairdresser-6799
05/09/2023, 9:10 AMrapid-crowd-46218
05/09/2023, 3:08 PMprehistoric-wall-71780
05/09/2023, 6:47 PMgorgeous-psychiatrist-31553
05/10/2023, 6:09 AMGood afternoon. I have a problem with missing INGESTIONS
When creating a new one INGESTION in the DATAHUB
After saving, it has the status Pending (but it's not scary, I think it's normal)
After a while, he missing. Even if I immediately launched it disappears.
This happened after one of the INGESTION had a connection error. The server where the account was not available and was fixed later. I Made a ROLLBACK of one of the ingestion and now every new my ingestions dont work. They are missing in few seconds.
Can you help me?
I made restart dicker containers, but it did not help.