# troubleshoot
p
Hi team, I’m facing the same issue as this. Executing helm on my namespace, but not sure if the upgrade is relevant here. GMS helm chart:
```yaml
apiVersion: v2
appVersion: v0.9.3
description: A Helm chart for LinkedIn DataHub's datahub-gms component
name: datahub-gms
type: application
version: 0.2.121
```
Also - how can I find, from the namespace, what the GMS version is?
m
I think there is a `/config` endpoint in GMS that tells you the info.
p
@modern-artist-55754 hi WDYM?
m
You want to know the version of GMS; there is an endpoint in GMS that returns its version. Not sure if my answer is helpful.
p
Can you elaborate @modern-artist-55754? What endpoint? Technical steps will assist 🙏
I’m running in K8s
m
Sorry, I might have misunderstood your question. Did you want to know which version your GMS is right now?
p
Yup 🙂
BTW - did you face this error too?
m
You can use Postman to request the /config endpoint from GMS: http://gms-ip:port/config
Or write python code like this
```python
import requests

res = requests.get("http://ip:port/config")
print(res.json())
```
p
Can I see this on the pod/UI/exec ?
m
I am not sure, I am not that familiar with k8s. Can you get a shell on the GMS pod and just curl http://localhost:8080/config?
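On Kubernetes, `kubectl exec` against the GMS pod is the usual way to run that curl. A minimal sketch that just assembles the command; the namespace and deployment name here are assumptions, so adjust them to your cluster:

```python
# Build (without executing) the kubectl command that curls GMS's /config
# from inside its pod. "datahub" and "deploy/datahub-gms" are assumed names.
namespace = "datahub"
target = "deploy/datahub-gms"
cmd = ["kubectl", "exec", "-n", namespace, target, "--",
       "curl", "-s", "http://localhost:8080/config"]
print(" ".join(cmd))
```

Running the printed command in a terminal should dump the `/config` JSON, from which you can read the GMS version.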
p
```json
"linkedin/datahub" : {
      "version" : "v0.9.6.1",
      "commit" : "xxxxxx"
    }
```
m
Is that the information you're after?
p
Sorry?
m
Is that the version information you need?
p
Not sure
I'm trying to understand the root cause of my error.
Based on a previous comment on this channel, the GMS version was the issue. I'm trying to figure out if my GMS Chart.yaml is correct.
m
You have duplicate ingestion?
p
Nope
I have several ingestions, but each for a different resource
m
I am not sure how the GMS version is related to the problem. He had a setup running multiple event consumers in and out of GMS.
g
@orange-night-91387 might help you
@incalculable-ocean-74010 Could you please have a look here?
a
Hello, using Kubernetes you can see the version of DataHub that got deployed by checking the resource manifest and searching for the `image` property.
If GMS is already deployed, then a curl to the GMS `/config` endpoint will return JSON akin to:
```json
{
  "models": {},
  "patchCapable": true,
  "versions": {
    "linkedin/datahub": {
      "version": "v0.10.3",
      "commit": "a29b576daa2fffcdd356250ca8a60ea9d40a4e11"
    }
  },
  "managedIngestion": {
    "defaultCliVersion": "0.10.3",
    "enabled": false
  },
  "statefulIngestionCapable": true,
  "supportsImpactAnalysis": true,
  "timeZone": "GMT",
  "telemetry": {
    "enabledCli": true,
    "enabledIngestion": false
  },
  "datasetUrnNameCasing": false,
  "retention": "true",
  "datahub": {
    "serverType": "prod"
  },
  "noCode": "true"
}
```
☝️ is for our demo
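Once you have that `/config` response, pulling out the fields people usually compare (server version vs. managed-ingestion CLI version) is straightforward. A small sketch using an abbreviated copy of the demo values above:

```python
import json

# Abbreviated /config response (values taken from the demo output above).
config = json.loads("""
{
  "versions": {
    "linkedin/datahub": {
      "version": "v0.10.3",
      "commit": "a29b576daa2fffcdd356250ca8a60ea9d40a4e11"
    }
  },
  "managedIngestion": {
    "defaultCliVersion": "0.10.3",
    "enabled": false
  }
}
""")

server_version = config["versions"]["linkedin/datahub"]["version"]
cli_version = config["managedIngestion"]["defaultCliVersion"]
print(server_version, cli_version)  # → v0.10.3 0.10.3
```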
p
@incalculable-ocean-74010 hi, this is mine:
```json
{
  "models": {},
  "patchCapable": true,
  "versions": {
    "linkedin/datahub": {
      "version": "v0.9.6.1",
      "commit": "xxxxxxx"
    }
  },
  "managedIngestion": {
    "defaultCliVersion": "0.9.6",
    "enabled": true
  },
  "statefulIngestionCapable": true,
  "supportsImpactAnalysis": true,
  "telemetry": {
    "enabledCli": true,
    "enabledIngestion": false
  },
  "datasetUrnNameCasing": false,
  "retention": "true",
  "datahub": {
    "serverType": "prod"
  },
  "noCode": "true"
}
```
@modern-garden-35830 FYI to this issue ☝️
@incalculable-ocean-74010 @astonishing-answer-96712 @orange-night-91387 can someone pls assist & tell me what & how to run the upgrade? This is our prod env & we need this resolved pls 🙏
i
What upgrade? If you are using the community Helm chart to deploy DataHub, a simple Helm upgrade should be enough.
By this I mean, update your datahub helm-chart dependency in the Chart.yaml like so:
```yaml
apiVersion: v2
name: demo-datahub
description: A Helm chart to deploy DataHub for 
type: application
version: 0.0.1
dependencies:
  - name: datahub-prerequisites
    version: 0.0.12
    repository: https://helm.datahubproject.io/
    condition: datahub-prerequisites.enabled
  - name: datahub
    version: 0.2.132  # <-- update this to whatever version you want
    repository: https://helm.datahubproject.io/
    condition: datahub.enabled
```
p
@incalculable-ocean-74010 hi, I’m using a fork of your repo & the DH community Helm chart, but it’s not upgrading my versions. Is the change you mentioned above for the prerequisites chart or the DH chart?
Ok, I’ve deployed to my staging environment with `0.2.165`. Where can I see that I’m using this version?
i
My apologies, I don’t understand the question.
p
Hi @incalculable-ocean-74010, I've set my chart with `0.2.165` but I'm still facing the same error in the ingestion.
@astonishing-answer-96712 I need inputs here pls.
g
Hi @incalculable-ocean-74010 / @dazzling-yak-93039 Could you please check this issue
d
Hi @powerful-cat-68806, I took a look and it looks like the error is
```
Provided urn urn:li:corpGroup:sccm-Amadeus Altea INV (RIT HD\\\\\\" is invalid
```
which is coming from this line of code: https://github.com/datahub-project/datahub/blob/8f9a23fb2e129cace52e7adc42978120b8[…]/src/main/javaPegasus/com/linkedin/common/urn/UrnValidator.java
I believe the validation check that is failing is this one: https://github.com/datahub-project/datahub/blob/8f9a23fb2e129cace52e7adc42978120b8[…]tils/src/main/javaPegasus/com/linkedin/common/urn/TupleKey.java
which checks for matching parentheses in a URN. Can you see where this URN is coming from and exclude it? Or make sure it is not being truncated? We use parentheses as a special character in URNs to indicate tuples of entities, so the URN you're trying to store should not contain parentheses unless it represents that concept.
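The real validator is the linked Java code; as a rough stand-in for the matching-parentheses rule it enforces (not the actual implementation), you could pre-filter URNs on the ingestion side like this:

```python
def parens_balanced(urn: str) -> bool:
    """Rough sketch of the parenthesis check described above: every ")" must
    close an earlier "(", and nothing may be left open at the end."""
    depth = 0
    for ch in urn:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:  # a ")" with no matching "("
                return False
    return depth == 0

# The truncated group URN from the error has an unclosed "(" and fails:
bad = 'urn:li:corpGroup:sccm-Amadeus Altea INV (RIT HD\\"'
good = "urn:li:corpGroup:engineering"
print(parens_balanced(bad), parens_balanced(good))  # → False True
```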
p
@dazzling-yak-93039 hi, 10x for the input. I don't fully understand how this is relevant to the issue I'm trying to resolve…
From all that I've read, this is related to the GMS version I'm running. Trying to upgrade didn't work for me.
n
For some time I've been having the same issue; I run it with docker compose. From what I see, the error appears on the first ingestions of the same source, and it stops after the first one. It throws the error when I ingest Postgres, and also a business glossary YAML file. I use Postgres as the DataHub db. Error:
```
failed to write record with workunit xxx with ('Unable to emit metadata to DataHub GMS: javax.persistence.PersistenceException: Error when batch flush on sql: update metadata_aspect_v2 set metadata=?, createdOn=?, createdBy=?, createdFor=?, systemmetadata=? where urn=? and aspect=? and version=?', {'exceptionClass': 'com.linkedin.restli.server.RestLiServiceException', 'message': 'javax.persistence.PersistenceException: Error when batch flush on sql: update metadata_aspect_v2 set metadata=?, createdOn=?, createdBy=?, createdFor=?, systemmetadata=? where urn=? and aspect=? and version=?', 'status': 500, 'id': 'urn:li:glossaryNode:pii.impact.levels'}) and info {'exceptionClass': 'com.linkedin.restli.server.RestLiServiceException', 'message': 'javax.persistence.PersistenceException: Error when batch flush on sql: update metadata_aspect_v2 set metadata=?, createdOn=?, createdBy=?, createdFor=?, systemmetadata=? where urn=? and aspect=? and version=?', 'status': 500, 'id': 'urn:li:glossaryNode:pii.impact.levels'}
```
```json
{
  "models": {},
  "patchCapable": true,
  "versions": {
    "linkedin/datahub": {
      "version": "v0.10.4",
      "commit": "3de94c52230ff2ea25de6afd1cf42c7fd85b2375"
    }
  },
  "managedIngestion": {
    "defaultCliVersion": "0.10.4",
    "enabled": true
  },
  "statefulIngestionCapable": true,
  "supportsImpactAnalysis": true,
  "timeZone": "GMT",
  "telemetry": {
    "enabledCli": true,
    "enabledIngestion": false
  },
  "datasetUrnNameCasing": false,
  "retention": "true",
  "datahub": {
    "serverType": "dev"
  },
  "noCode": "true"
}
```
p
@nutritious-yacht-6205 on my side, this error pops up when ingesting a Redshift cluster. I'm also using Postgres as the default DH db.
@astonishing-answer-96712 hi, can someone assist here pls? 
g
@orange-night-91387 Could you please check this?
o
The workaround for this is to reduce the worker count on the ingestion to 1
p
@orange-night-91387 it’s not helping