Ingestion via UI is not working, When I try to exe...
# ingestion
s
Ingestion via UI is not working, When I try to execute ingestion via UI, nothing happened.
e
cc @astonishing-monkey-81508 to consolidate threads. is acryl-datahub-actions container running? (or pod if you’re on k8s)
s
What version of DataHub server are you running? If deployed via helm chart what helm chart version is being used?
s
@square-activity-64562 Can you please tell me how to get the version of DataHub server. Datahub CLI version is 0.8.27.2. I am running server by docker only. @early-lamp-41924 I didn't see acryl-datahub-actions container. Please let me know how to get it run?
e
Starting with v0.8.23 we started showing the server version in the UI!
Once you upgrade, this should be much easier!
s
@early-lamp-41924 My datahub server version is v0.8.27
I just deployed it this week.
e
Ah amazing! in that case, mouse over the user avatar on the top right!
s
then?
e
You should see the backend version on the top of dropdown.
s
@early-lamp-41924 Thanks. I got the version is v0.8.27.
e
Picking up more context here. Do you deploy through docker or helm?
s
ocker only
Docker only
e
Got it. in that case, if you start up docker using the latest docker-compose, it should automatically start up the datahub-actions!
s
@early-lamp-41924
there is no datahub-actions.
e
How did you upgrade to v0.8.27? Do you have a local docker-compose.yml that you update?
s
I dint't do upgrade. I deployed at first time
just run python3 -m datahub docker quickstart
e
Ah with the following?
Copy code
datahub docker quickstart
interesting
in that case, seems like that pod died for some reason
can you try running
Copy code
docker logs datahub-actions
s
Error: No such container: datahub-actions
e
Copy code
docker logs datahub_datahub-actions_1
could you try ^?
s
2022/03/07 013639 Problem with request: Get "http://datahub-gms:8080/health": dial tcp 172.18.0.78080 connect: connection refused. Sleeping 1s 2022/03/07 013640 Problem with request: Get "http://datahub-gms:8080/health": dial tcp 172.18.0.78080 connect: connection refused. Sleeping 1s 2022/03/07 013641 Problem with request: Get "http://datahub-gms:8080/health": dial tcp 172.18.0.78080 connect: connection refused. Sleeping 1s 2022/03/07 013642 Problem with request: Get "http://datahub-gms:8080/health": dial tcp 172.18.0.78080 connect: connection refused. Sleeping 1s 2022/03/07 013643 Problem with request: Get "http://datahub-gms:8080/health": dial tcp 172.18.0.78080 connect: connection refused. Sleeping 1s 2022/03/07 013644 Problem with request: Get "http://datahub-gms:8080/health": dial tcp 172.18.0.78080 connect: connection refused. Sleeping 1s 2022/03/07 013645 Received 200 from http://datahub-gms:8080/health Traceback (most recent call last): File "/usr/local/lib/python3.9/site-packages/urllib3/connection.py", line 174, in _new_conn conn = connection.create_connection( File "/usr/local/lib/python3.9/site-packages/urllib3/util/connection.py", line 95, in create_connection raise err File "/usr/local/lib/python3.9/site-packages/urllib3/util/connection.py", line 85, in create_connection sock.connect(sa) socket.timeout: timed out During handling of the above exception, another exception occurred:
e
hmn wonder if it timed out before gms was fully alive
could you try running
datahub docker quickstart
again?
s
it done and I got the same error message for datahub action container
e
@square-activity-64562 Can you pick up for me. need to head out for the day (midnight here) For some reason, seems like the actions container cannot reach datahub-gms
s
@early-lamp-41924 Thanks for your time. Let me take some time to check. ]
s
Can you please try running the command once again?
Copy code
datahub docker quickstart
No need to nuke the deployment first. In rare cases I have seen different pods going down. If you are no linux-like environment please run the following command to see datahub containers
Copy code
docker ps -a | grep datahub
But yes we should ideally restart it ourselves in case of failure at least a few times to see if this was a networking issue. Let me see if I can send a PR
Sent a PR which should improve this quick start experience once that is merged in https://github.com/linkedin/datahub/pull/4342
s
@square-activity-64562 Below is the info on webui for 8080 ,is 8080 port normal?
s
Try
localhost:8080/config
and
localhost:9002/config
s
Copy code
{
  "supportsImpactAnalysis": true,
  "models": {},
  "versions": {
    "linkedin/datahub": {
      "version": "RC",
      "commit": "beb51ebf59a045de0aac9115f6eeecc8f7207d6b"
    }
  },
  "statefulIngestionCapable": true,
  "retention": "true",
  "noCode": "true"
}
Copy code
{
  "status": "ok",
  "config": {
    "application": "datahub-frontend",
    "appVersion": "1.0",
    "isInternal": false,
    "shouldShowDatasetLineage": true,
    "suggestionConfidenceThreshold": 50,
    "wikiLinks": {
      "appHelp": "<https://github.com/linkedin/datahub>",
      "gdprPii": "",
      "tmsSchema": "",
      "gdprTaxonomy": "",
      "staleSearchIndex": "",
      "dht": "",
      "purgePolicies": "",
      "jitAcl": "",
      "metadataCustomRegex": "",
      "exportPolicy": "",
      "metadataHealth": "",
      "purgeKey": "",
      "datasetDecommission": ""
    },
    "tracking": {
      "trackers": {
        "piwik": {
          "piwikSiteId": 0,
          "piwikUrl": ""
        }
      },
      "isEnabled": true
    },
    "isStagingBanner": false,
    "isLiveDataWarning": false,
    "showChangeManagement": false,
    "showPeople": true,
    "changeManagementLink": "",
    "isStaleSearch": true,
    "showAdvancedSearch": true,
    "useNewBrowseDataset": true,
    "showLineageGraph": true,
    "showInstitutionalMemory": true,
    "userEntityProps": {
      "aviUrlPrimary": "",
      "aviUrlFallback": ""
    }
  }
}
any issues?
2022/03/08 081838 Waiting for: http://datahub-gms:8080/health 2022/03/08 081838 Problem with request: Get "http://datahub-gms:8080/health": dial tcp xxx.xxx.0.xxx8080 connect: connection refused. Sleeping 1s
s
The configs look fine. Is the actions pod still failing for you?
Copy code
docker ps -a | grep datahub
s
error message for actions container :2022/03/08 081838 Waiting for: http://datahub-gms:8080/health 2022/03/08 081838 Problem with request: Get "http://datahub-gms:8080/health": dial tcp xxx.xxx.0.xxx8080 connect: connection refused. Sleeping 1s
97331e75015c public.ecr.aws/datahub/acryl-datahub-actions:v0.0.1-beta.11 "/bin/sh -c 'dockeri…" 5 hours ago Exited (1) 38 minutes ago datahub_datahub-actions_1 5
s
Can you please post the full logs for the actions pod? This is quickstart so there should not be any secret in there
s
You want full logs by running docker logs datahub_datahub-actions_1?
s
yes
s
@square-activity-64562 Kindly find attached log
s
Ok I see what the problem is here. Are you in an air-gapped environment where internet access is limited?
s
yws
yes
In this case what can I do? is there any specific web site need to access?
s
Mixpanel for telemetry. This is a bug in the actions pod. This is not expected behaviour. The pod should not be failing in air-gapped environment. We are aware this issue exists for air-gapped environments and are working on a fix.
s
@square-activity-64562 Thanks for your info. Please let me know when it is fixed.
s
The changes have been made. If you re-run the quickstart now then it should work
s
@square-activity-64562 Thanks a lot
@square-activity-64562 error log from action container ERROR: Could not find a version that satisfies the requirement acryl-datahub[datahub-rest,hive]==0.8.26.6 (from versions: none) ERROR: No matching distribution found for acryl-datahub[datahub-rest,hive]==0.8.26.6 /tmp/datahub/ingest/venv-2f304ec6-d8af-4df3-9237-b1c6104eae2e/bin/python3: No module named datahub
s
Please upgrade to a a newer version. The reason it is failing is probably because starting from v0.8.25 there was a temporary restriction of not publishing python packages for Python 3.9.10
If you use the latest version it should work https://github.com/acryldata/datahub/releases
m
Is there any command which I run to upgrade my existing setup ?
?
s
thanks
s
@mysterious-australia-30101
Copy code
To update server - Is DataHub working on your local computer (via quickstart) or on a k8s cluster (AWS EKS, GCP GKE etc).? If on your quickstart you can nuke it and restart it. If using K8s through helm charts update using helm charts <https://datahubproject.io/docs/deploy/gcp/#setup-datahub-using-helm>

To update CLI follow <https://datahubproject.io/docs/cli/#using-pip>

Note the server and CLI should be the same versions. if server is `0.8.28` then CLI should be `0.8.28.x`. We do not run tests of new CLI with older servers so we don't recommend you to update the CLI if your server is on an older version