https://datahubproject.io logo
Join Slack
Powered by
# all-things-deployment
  • b

    breezy-camera-11182

    01/05/2022, 4:49 AM
    Hi team, i encountered an error
    _03:58:27.120_ [main] WARN  o.s.w.c.s.XmlWebApplicationContext:558 - Exception encountered during context initialization - cancelling refresh attempt: org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'authorizationManagerFactory': Unsatisfied dependency expressed through field 'entityClient'; nested exception is org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'javaEntityClientFactory': Unsatisfied dependency expressed through field '_entityService'; nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'ebeanAspectDao' defined in com.linkedin.gms.factory.entity.EbeanAspectDaoFactory: Bean instantiation via factory method failed; nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [com.linkedin.metadata.entity.ebean.EbeanAspectDao]: Factory method 'createInstance' threw exception; nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'ebeanServer' defined in com.linkedin.gms.factory.entity.EbeanServerFactory: Bean instantiation via factory method failed; nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [io.ebean.EbeanServer]: Factory method 'createServer' threw exception; nested exception is java.lang.NullPointerException
    when running datahub-gms in minikube. i followed the deployment guide from https://datahubproject.io/docs/deploy/kubernetes and only change the mysql to my local mysql with these configuration (
    datahub-helm/charts/prerequisites/values.yaml
    )
    Copy code
    sql:
        datasource:
          host: "host.minikube.internal:3306"
          hostForMysqlClient: "host.minikube.internal"
          port: "3306"
          url: "jdbc:<mysql://host.minikube.internal:3306/datahub?verifyServerCertificate=false&useSSL=true&useUnicode=yes&characterEncoding=UTF-8&enabledTLSProtocols=TLSv1.2>"
          driver: "com.mysql.cj.jdbc.Driver"
          username: "datahub"
          password:
            secretRef: mysql-secrets-datahub
            secretKey: mysql-root-password
    e
    b
    • 3
    • 13
  • f

    few-air-56117

    01/05/2022, 2:11 PM
    Hi guys, how can i acces the datahub CLI if a deployed datahub on kube? Some comands like deleting a dataset exist only on CLI. Thx :D
    r
    b
    • 3
    • 9
  • m

    millions-notebook-72121

    01/05/2022, 2:53 PM
    Hi guys, in terms of deployments has anyone ran into the same issue I am facing here? https://datahubspace.slack.com/archives/C029A3M079U/p1641309976204200 running out of ideas to debug it! Also, it was all working fine a month ago 😔
    h
    • 2
    • 1
  • f

    fresh-memory-20741

    01/06/2022, 7:25 AM
    How datahub ingest metadata from Hive? show tables? MetaStore? hive-hook-log? Where can i see the Source Code? thank you。
    s
    • 2
    • 3
  • a

    ancient-hair-10877

    01/06/2022, 8:29 AM
    Hi guy, where is datahub save the session id? I see the datahub not expire session id. I have used a session id to authorized since 2 month ago but until now it is not expire
    b
    • 2
    • 2
  • f

    few-air-56117

    01/10/2022, 8:30 AM
    Hi guys, its possible to install with help just a part of datahub, like de frontend or gsm (not all the components)?
    s
    e
    n
    • 4
    • 58
  • w

    wide-helicopter-97009

    01/10/2022, 3:43 PM
    Hi, had anybody tried to set up the datahub CLI in a productionized environment?
    m
    • 2
    • 2
  • b

    better-orange-49102

    01/11/2022, 7:16 AM
    noticed that the entry for datahub-gms container inside ./docker/docker-compose.yml does not have a .env file defined unlike the entry for datahub-frontend-react 🤔, shouldnt it be pointing to ./docker/dataub-gms/env/docker.env?
    e
    • 2
    • 3
  • b

    billions-receptionist-60247

    01/12/2022, 4:37 AM
    Hi Do i have to use separate elastic search for datahub or can i use the existing one already in use
    e
    • 2
    • 4
  • f

    few-air-56117

    01/12/2022, 3:47 PM
    Hi guts, i have a small question. I activated de OICD auth on gogole, and its work, but i cant login with the admin user anymore
    b
    • 2
    • 3
  • b

    billions-receptionist-60247

    01/12/2022, 7:13 PM
    Hi i'm getting this error datahub gms. Any idea Logs added in the thread
    l
    r
    • 3
    • 5
  • r

    red-window-75368

    01/17/2022, 2:18 PM
    Hi, I'll be implementing DataHub, my infrastructure will contain various data lakes, each data lake will have a different owner, I want that anyone can access the metadata from all of these data lakes (each data lake owner should be able to ingest metadata only to their own section, shouldn't be able to change other data lakes' metadata in datahub and should be able to see all the metadata from all data lakes) so I thought of a centralized DataHub, but I read about Federated Metadata Serving in DataHub website and I am trying to grasp this concept and want to know what are the advantages of implementing this option instead of just ingesting the metadata from all the data lake into one DataHub. Also want to know if there is any information on how to implement this federated metadata serving. Thank you.
    m
    • 2
    • 5
  • b

    brave-businessperson-3969

    01/18/2022, 1:16 PM
    Hi, I just had a look at the Analytics page in DataHub and the log files the system creates. In the logs I can see each and every activity/GraphQL operation. But is there any option to monitor user acceptance: which user logged in when and how many operations he/she performed / how long he stayed active?
    b
    s
    +2
    • 5
    • 10
  • h

    handsome-football-66174

    01/18/2022, 7:05 PM
    Hi , How to we enable SQL profiling for Glue ? Referring this https://datahubproject.io/docs/metadata-ingestion/source_docs/sql_profiles
    s
    b
    s
    • 4
    • 3
  • f

    few-air-56117

    01/19/2022, 11:24 AM
    Hi guys, its posible to get azure users and groups with azure OIDC integration?
    s
    q
    +4
    • 7
    • 41
  • l

    late-bear-87552

    01/21/2022, 6:08 AM
    Hi guys, which version of datahub has feature of creating recipe and scheduling through UI?
    b
    l
    b
    • 4
    • 6
  • b

    billions-receptionist-60247

    01/24/2022, 5:04 AM
    Hi i'm getting this error datahub gms
    Copy code
    ElasticsearchStatusException[method [HEAD], host [<https://xxxxxxxxxxx.es.amazonaws.com:443>], URI [/graph_service_v1?ignore_throttled=false&ignore_unavailable=false&expand_wildcards=open%2Cclosed&allow_no_indices=false], status line [HTTP/1.1 400 Bad Request]]
    any idea why i'm getting this error. i'm using aws elastic search version: 6.4.5
    e
    • 2
    • 1
  • g

    glamorous-controller-12246

    01/24/2022, 6:02 PM
    Just looking into how to properly host DataHub for our needs. I've engaged Acryl to see what a managed service costs but I'm also looking into the possibility of hosting in our AWS account. Your documentation suggests EKS but it seems that ECS on Fargate might be a better solution. Is there a specific reason EKS is the preferred method in the documentation or is it mostly personal preference due to individual requirements?
    b
    g
    • 3
    • 5
  • a

    adorable-flower-19656

    01/25/2022, 2:00 AM
    Hi guys, is there any recommended cluster spec or something recommended for a production level deployment of Neo4j in Datahub?
    b
    • 2
    • 2
  • f

    few-air-56117

    01/25/2022, 8:47 AM
    Hi, i tried to integrate AZURE OIDC with datahub, i have the users and groups, but when i want to see what users are in a group i get this error
    Copy code
    <https://graph.microsoft.com/v1.0/me/photo/$value>
    Thx 😇
    b
    c
    m
    • 4
    • 8
  • b

    bland-wolf-37286

    01/25/2022, 5:55 PM
    Hi, I need to deploy the DataHub frontend so that it is accessed via a URL with a path prefix rather than residing on the server root (e.g. http://myserver.example.com/datahub instead of http://myserver.example.com/). This is because DataHub will be one of a number of services accessed off the same DNS name, fronted by an Nginx reverse proxy to route to each service. As far as I can tell, I will need to change
    datahub-frontend/conf/routes
    ,
    datahub-web-react/src/conf/Global.ts
    and
    datahub-web-react/.env
    to add the path prefix, then rebuild and make a Docker image off that. Is that correct?
    b
    m
    +4
    • 7
    • 9
  • s

    strong-iron-17184

    01/25/2022, 6:19 PM
    No me dice ningun error, pero no entiendo porque no esta levantado
    e
    l
    b
    • 4
    • 4
  • b

    billions-twilight-48559

    01/26/2022, 11:29 AM
    Hi, which is the best/recommended method to include drivers like sql server for my crawlers when deploying with helm?
    i
    • 2
    • 2
  • h

    hallowed-airline-89779

    01/27/2022, 6:48 AM
    Hi guys, not sure if this is the best place to ask this question(Please correct me if wrong): Is there any plan/existing feature to support Flink lineage? Potentially we can build one similar to spark-lineage by ourself, but to prevent re-invent the wheel, just want to double check if someone's working on it already.
    l
    • 2
    • 5
  • f

    few-air-56117

    01/27/2022, 9:31 AM
    Hi guys, i want to make some grapql call from python. How cani get the auth token?
    h
    • 2
    • 1
  • l

    late-bear-87552

    01/27/2022, 12:22 PM
    wanted to deploy datahub on production environment in my company, can anyone help me with the minimum machine configuration??
    b
    • 2
    • 2
  • f

    few-air-56117

    01/27/2022, 2:42 PM
    Hi guysm how can i get dataset stats using graphql,i see that its required a paramtere (resource), but i cant find what it is
    b
    • 2
    • 4
  • g

    gorgeous-dinner-4055

    01/27/2022, 5:00 PM
    Hello All! We've got most things in our deployment sorted out, but stuck on one last bug with fetching AWS credentials. I know a couple other people have deployed using mostly managed AWS services, so wanted to see if anyone else has run into the same issues as us and how you solved it. What we're seeing: Calls to AWS Glue to fetch schemas seems to fail with credentials not found(See log in thread). However, if I ssh into the pod, I am able to run glue fetch schemas, get current role, etc with the CLI. So the role seems valid, and the latest version of AWS cli seems to pick up the role just fine. The Glue and AWS dependency seem pretty up to date. Any idea on what else could be causing this issue? We've seen this type of issue before when the AWS library is out of data, and our workaround was to have a background thread that refreshes credentials to a credential file. So we can use that same solution, but ofc would prefer not to.
    b
    n
    w
    • 4
    • 27
  • a

    ambitious-pharmacist-14608

    01/28/2022, 12:00 AM
    Hi all I am trying to figure out some questions from our users regarding tag and business glossary features supported by datahub. To them, they felt that they are quite similar and can be used interchangeably. Any advanced user can enlighten me when to use tags and when to use business glossary terms and why do you think so?
    m
    • 2
    • 1
  • l

    late-bear-87552

    01/28/2022, 7:00 AM
    Hi all getting this error from linkedin/datahub-gms:head , can anyone help me with this??
    Copy code
    05:24:01.419 [kafka-coordinator-heartbeat-thread | mce-consumer-job-client] WARN  o.apache.kafka.clients.NetworkClient:969 - [Consumer clientId=consumer-mce-consumer-job-client-2, groupId=mce-consumer-job-client] Error connecting to node broker:29092 (id: 1 rack: null)
    java.net.UnknownHostException: broker
    	at java.net.InetAddress.getAllByName0(InetAddress.java:1282)
    	at java.net.InetAddress.getAllByName(InetAddress.java:1194)
    	at java.net.InetAddress.getAllByName(InetAddress.java:1128)
    	at org.apache.kafka.clients.ClientUtils.resolve(ClientUtils.java:110)
    	at org.apache.kafka.clients.ClusterConnectionStates$NodeConnectionState.currentAddress(ClusterConnectionStates.java:403)
    	at org.apache.kafka.clients.ClusterConnectionStates$NodeConnectionState.access$200(ClusterConnectionStates.java:363)
    	at org.apache.kafka.clients.ClusterConnectionStates.currentAddress(ClusterConnectionStates.java:151)
    	at org.apache.kafka.clients.NetworkClient.initiateConnect(NetworkClient.java:962)
    	at org.apache.kafka.clients.NetworkClient.access$600(NetworkClient.java:74)
    	at org.apache.kafka.clients.NetworkClient$DefaultMetadataUpdater.maybeUpdate(NetworkClient.java:1135)
    	at org.apache.kafka.clients.NetworkClient$DefaultMetadataUpdater.maybeUpdate(NetworkClient.java:1023)
    	at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:548)
    	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:262)
    	at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.pollNoWakeup(ConsumerNetworkClient.java:303)
    	at org.apache.kafka.clients.consumer.internals.AbstractCoordinator$HeartbeatThread.run(AbstractCoordinator.java:1280)
    05:24:01.419 [kafka-coordinator-heartbeat-thread | mce-consumer-job-client] WARN  o.apache.kafka.clients.NetworkClient:969 - [Consumer clientId=consumer-mce-consumer-job-client-2, groupId=mce-consumer-job-client] Error connecting to node broker:29092 (id: 1 rack: null)
    java.net.UnknownHostException: broker
    d
    • 2
    • 2
1...678...53Latest