brave-room-48783
04/13/2023, 11:01 AM~~~~ Execution Summary - RUN_INGEST ~~~~
Execution finished with errors.
{'exec_id': '426ca545-5c66-4adb-98b0-7f2fffbfcd0f',
'infos': ['2023-04-14 02:35:39.958717 INFO: Starting execution for task with name=RUN_INGEST',
"2023-04-14 02:55:31.996839 INFO: Failed to execute 'datahub ingest'",
'2023-04-14 02:55:32.010133 INFO: Caught exception EXECUTING task_id=426ca545-5c66-4adb-98b0-7f2fffbfcd0f, name=RUN_INGEST, '
'stacktrace=Traceback (most recent call last):\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/default_executor.py", line 122, in execute_task\n'
' task_event_loop.run_until_complete(task_future)\n'
' File "/usr/local/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete\n'
' return future.result()\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 231, in execute\n'
' raise TaskError("Failed to execute \'datahub ingest\'")\n'
"acryl.executor.execution.task.TaskError: Failed to execute 'datahub ingest'\n"],
'errors': []}
bumpy-engineer-7375
04/13/2023, 12:39 PMelegant-salesmen-99143
04/13/2023, 7:54 PM"Validation error (WrongType@[searchAcrossEntities]) : argument 'input.orFilters[0]' with value 'ObjectValue{objectFields=[ObjectField{name='query', value=StringValue{value='*'}}, ObjectField{name='orFilters', value=ArrayValue{values=[ObjectValue{objectFields=[ObjectField{name='field', value=StringValue{value='removed'}}, ObjectField{name='condition', value=StringValue{value='EQUAL'}}, ObjectField{name='values', value=ArrayValue{values=[StringValue{value='true'}]}}, ObjectField{name='negated', value=BooleanValue{value=false}}]}]}}]}' contains a field not in 'AndFilterInput': 'field'"
But the field field
is definitely is possible in AndFilterInput. What is wrong here?
{
searchAcrossEntities (
input: {query: "*",
orFilters: [{field: "removed", condition: "EQUAL", values: ["true"], negated: false}]}
) {
start
count
total
searchResults {
entity {
type
... on Dataset {
urn
type
description
platform {
name
}
name
}
}
}
}
}
steep-alligator-93593
04/14/2023, 1:53 AMdatahub-mysql-setup:v0.10.1
image.. working with some tight permissions and need a workaround
-- create datahub database
CREATE DATABASE IF NOT EXISTS <DB> CHARACTER SET utf8mb4 COLLATE utf8mb4_bin;
USE <DB>;
incalculable-stone-67607
04/14/2023, 6:59 AMwonderful-jordan-36532
04/14/2023, 10:27 AMpython3 -m datahub user upsert -f user.yaml
Error: No such command 'user'.
fierce-electrician-85924
04/14/2023, 10:30 AM0.10.2
.
io.netty.channel.AbstractChannel$AnnotatedConnectException: Connection refused: localhost/127.0.0.1:8080
I am using k8s with helm to setup local instance. Default linkedin/datahub-gms
image works fine but if I try to build the image locally and then use it for instance setup it throws this error. (We haven't changed anything in datahub-gms)best-wire-59738
04/14/2023, 10:48 AMView Dataset Usage and View Dataset Profile
.
For more details on the Issue please follow this thread. https://datahubspace.slack.com/archives/C029A3M079U/p1680054629679729victorious-monkey-86128
04/14/2023, 3:38 PMcontainer_name.dataset_name
and containers with the name container_name
. They exist independently of each other currently. I'd like to assign a set of datasets to the Container. How would I go about doing so? Thanks!creamy-ram-28134
04/14/2023, 4:25 PMbland-orange-13353
04/14/2023, 9:00 PMsteep-alligator-93593
04/15/2023, 5:06 AMdatahub-system-update-job
2023-04-15 05:00:18.721 ERROR 1 --- [ main] i.c.k.s.client.rest.RestService : Failed to send HTTP request to endpoint: <http://prerequisites-cp-schema-registry:8081/subjects/DataHubUpgradeHistory_v1-value/versions>
252
251
java.net.UnknownHostException: prerequisites-cp-schema-registry
250
at java.base/java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:229) ~[na:na]
249
at java.base/java.net.Socket.connect(Socket.java:609) ~[na:na]
Along with
2023-04-15 05:00:18.697 ERROR 1 --- [ main] i.c.k.s.client.rest.RestService : Failed to send HTTP request to endpoint: <http://prerequisites-cp-schema-registry:8081/subjects/DataHubUpgradeHistory_v1-value/versions>
460
459
java.net.UnknownHostException: prerequisites-cp-schema-registry
I am running on kubernetes any help would be greatly appreciated, thank youred-painter-89141
04/15/2023, 3:46 PMred-painter-89141
04/15/2023, 3:47 PM$ python3 -m datahub docker quickstart
[2023-04-15 08:46:29,229] INFO {datahub.cli.quickstart_versioning:144} - Saved quickstart config to /home/tim/.datahub/quickstart/quickstart_version_mapping.yaml.
[2023-04-15 08:46:29,229] INFO {datahub.cli.docker_cli:638} - Using quickstart plan: composefile_git_ref='master' docker_tag='head'
Docker doesn't seem to be running. Did you start it?
$ docker ps -a
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
$ docker compose version
Docker Compose version v2.17.2
red-painter-89141
04/16/2023, 2:50 PM$ datahub --debug docker quickstart
[2023-04-16 07:48:08,882] DEBUG {datahub.telemetry.telemetry:219} - Sending init Telemetry
[2023-04-16 07:48:08,934] DEBUG {datahub.upgrade.upgrade:134} - Failed to get a valid server: Cannot connect to host localhost:8080 ssl:default [Connect call failed ('127.0.0.1', 8080)]
[2023-04-16 07:48:09,792] DEBUG {datahub.telemetry.telemetry:248} - Sending telemetry for function-call
[2023-04-16 07:48:11,036] INFO {datahub.cli.quickstart_versioning:144} - Saved quickstart config to /home/tim/.datahub/quickstart/quickstart_version_mapping.yaml.
[2023-04-16 07:48:11,036] INFO {datahub.cli.docker_cli:638} - Using quickstart plan: composefile_git_ref='master' docker_tag='head'
[2023-04-16 07:48:11,037] DEBUG {datahub.telemetry.telemetry:248} - Sending telemetry for function-call
[2023-04-16 07:48:11,196] DEBUG {datahub.entrypoints:189} - Error: Docker doesn't seem to be running. Did you start it?
Traceback (most recent call last):
File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 699, in urlopen
httplib_response = self._make_request(
File "/usr/lib/python3/dist-packages/urllib3/connectionpool.py", line 394, in _make_request
conn.request(method, url, **httplib_request_kw)
File "/usr/lib/python3.10/http/client.py", line 1282, in request
self._send_request(method, url, body, headers, encode_chunked)
File "/usr/lib/python3.10/http/client.py", line 1328, in _send_request
self.endheaders(body, encode_chunked=encode_chunked)
File "/usr/lib/python3.10/http/client.py", line 1277, in endheaders
self._send_output(message_body, encode_chunked=encode_chunked)
File "/usr/lib/python3.10/http/client.py", line 1037, in _send_output
self.send(msg)
File "/usr/lib/python3.10/http/client.py", line 975, in send
self.connect()
File "/home/tim/.local/lib/python3.10/site-packages/docker/transport/unixconn.py", line 30, in connect
sock.connect(self.unix_socket)
PermissionError: [Errno 13] Permission denied
rapid-zoo-88437
04/17/2023, 7:07 AMclever-magician-79463
04/17/2023, 7:24 AMExecution finished with errors.
{'exec_id': 'dc2c87c8-2590-40e6-88eb-c07e7e63adfa',
'infos': ['2023-04-17 07:10:16.517630 INFO: Starting execution for task with name=RUN_INGEST',
"2023-04-17 07:10:38.176173 INFO: Failed to execute 'datahub ingest'",
'2023-04-17 07:10:38.181265 INFO: Caught exception EXECUTING task_id=dc2c87c8-2590-40e6-88eb-c07e7e63adfa, name=RUN_INGEST, '
'stacktrace=Traceback (most recent call last):\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/default_executor.py", line 122, in execute_task\n'
' task_event_loop.run_until_complete(task_future)\n'
' File "/usr/local/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete\n'
' return future.result()\n'
' File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 231, in execute\n'
' raise TaskError("Failed to execute \'datahub ingest\'")\n'
"acryl.executor.execution.task.TaskError: Failed to execute 'datahub ingest'\n"],
'errors': []}
Hello,
I am facing this issue while trying to ingest redshift data.
Can anyone help with any fixes?bland-orange-13353
04/17/2023, 7:43 AMgentle-nest-73959
04/17/2023, 8:26 AMgroups
sync. however I couldn't find how to assign role to a group. how can I achieve this?microscopic-room-90690
04/17/2023, 8:33 AMThis entity is not discoverable via search or lineage graph. Contact your DataHub admin for more information.
brief-mechanic-70547
04/17/2023, 11:01 AMdatahub docker quickstart
Even though DataHub is running established, I got the warning below
βClient-Server Incompatibleβ Your client version 0.10.1.1 is older than your server version 0.10.2. Upgrading the cli to 0.10.2 is recommended.
Upgrade via "pip install 'acryl-datahub==0.10.2'"
2- When I follow the requested installation, I got an error: ERROR: No matching distribution found for acryl-datahub==0.10.2
Any suggestions?flat-engineer-75197
04/17/2023, 11:05 AMbrief-mechanic-70547
04/17/2023, 11:17 AM[2023-04-17 11:12:13,532] ERROR {datahub.entrypoints:188} - Command failed: (psycopg2.OperationalError) could not connect to server: Connection refused
Is the server running on host "localhost" (127.0.0.1) and accepting
TCP/IP connections on port 5432?
could not connect to server: Cannot assign requested address
Is the server running on host "localhost" (::1) and accepting
TCP/IP connections on port 5432?
On the host and port I tried various alternatives but I couldn't succeed;
β’ host.docker.internal:5432
β’ localhost:5432
β’ 127.0.0.1:5432
Could you help me with this?
Thanksdazzling-appointment-34954
04/17/2023, 12:29 PMquiet-rain-16785
04/17/2023, 1:59 PMbest-morning-7115
04/17/2023, 2:07 PMbrief-ability-41819
04/17/2023, 2:15 PM0.9.6.1
on EKS with managed storages (RDS, OpenSearch, MSK).
After an attempt to upgrade to 0.10.2
, we had to perform a disaster recovery on OpenSearch cluster, as datahub-upgrade
pod failed along with datahub-gms
pod and we couldnβt restore the functionality with rollback.
All upgrade jobs were set to true
before applying the Helm chart. Chart itself was updated before an upgrade with helm repo update
.
Questions:
β’ Is there any breaking change weβre not aware of (like too many major versions skip)? We did multiple upgrades in the past and this is the first time weβre blocked.
β’ Shall we perform a manual diff on values.yaml
to mirror GitHubβs chart to-the-letter?wide-afternoon-79955
04/17/2023, 3:35 PM0.10.1
Running Datahub on AWS EKS and we are facing very high latency (~20 seconds) in our Datahub searches. AWS's Elastic search service with version 7.10 and 3 data nodes and no dedicated master node
Cache is not enabled on Datahub.
We have are using RDS for Mysql
We are not seeing any high memory or CPU utilisation on any Datahub components and even Elastic Search data metrics seems to looks good.
Datahub GMS config 2 replicas with 16G each and the rest of pods are on 4G memory.
We have 10 - 13 k Datasets.
Can some one please guide on how to tune the Datahub search.clever-twilight-40247
04/17/2023, 8:20 PM0.10.1.1
on mac M1 and trying to ingest oracle datasource via datahub CLI(and UI), but it is failing due to lack of arm64
support for cx_oracle. Are there any alternatives? Error messages in π§΅quiet-rain-16785
04/18/2023, 8:02 AM