https://datahubproject.io logo
Join Slack
Powered by
# getting-started
  • s

    some-car-9623

    09/13/2022, 3:02 PM
    Hello Everyone, I am trying to understand the Deprecation process in Datahub, is there any better document available for the same and any code samples? Thanks
    b
    • 2
    • 4
  • b

    bland-sundown-49496

    09/13/2022, 4:13 PM
    hello, is there any way to change gms REST port from 8080 to something else? I dont see the option in quickstart
    b
    • 2
    • 1
  • e

    echoing-knife-86832

    09/13/2022, 4:16 PM
    Hi people, I follow th instructions from site but I have experienced some errors
  • e

    echoing-knife-86832

    09/13/2022, 4:16 PM
    Error response from daemon: invalid volume specification: 'C\Users\USER\.datahub\mysql\init.sql/docker-entrypoint-initdb.d/init.sql:rw': invalid mount config for type "bind": bind source path does not exist: c:\users\user\.datahub\mysql\init.sql
    b
    g
    • 3
    • 16
  • e

    echoing-knife-86832

    09/13/2022, 4:16 PM
    any ideas?
  • s

    swift-nail-32514

    09/13/2022, 6:08 PM
    Can anyone point me to documentation (or at least intended purpose) of the "metadata test" found in the ingest-sample-data env. I found it here: http://localhost:9002/dataset/urn:li:dataset:(urn:li:dataPlatform:hive,SampleHiveDataset,PROD)/Validation?is_lineage_mode=false I found this page while searching the docs, but I'm not sure if it's referring to the same concept, since it's very WIP 🙂
    s
    l
    s
    • 4
    • 5
  • b

    breezy-shoe-41523

    09/14/2022, 8:38 AM
    Hi team how long is session duration and how can i set it???
  • p

    polite-application-51650

    09/14/2022, 8:48 AM
    Hi team, is their a way I can use some other instance elastic search by modifying the connection URL from localhost to my hosted elastic search.
    b
    • 2
    • 2
  • w

    wonderful-scooter-67279

    09/14/2022, 10:57 AM
    Hi everyone!
  • w

    wonderful-scooter-67279

    09/14/2022, 11:00 AM
    how to enable lineage in mysql please help me to resolve this issue
    b
    • 2
    • 1
  • p

    plain-magician-50536

    09/14/2022, 5:24 PM
    We are looking for assistance with DataHub deployment and best practices in Airflow, dbt, and Snowflake setup. If anyone knows of firms/consultants that can help, please DM me.
    b
    • 2
    • 1
  • b

    bland-orange-13353

    09/14/2022, 7:06 PM
    This message was deleted.
    g
    • 2
    • 1
  • b

    breezy-shoe-41523

    09/14/2022, 8:41 AM
    And i’m also curious why there is null in right menu bar
    🤨 1
    a
    g
    • 3
    • 6
  • p

    plain-traffic-15517

    09/15/2022, 3:34 PM
    docker pull failed due to bad network several times.. any idea how to download the images?
    m
    • 2
    • 2
  • f

    fast-potato-13714

    09/15/2022, 2:34 PM
    Hello everybody! We are having the next issue while trying to capture a spark job lineage: when we read the table tmp.agus_test_4857 it captures it in datahub as a hdfs file instead of a hive table. Since we've already ingested the hive table with its metadata, it appears as another object with all its metadata are we doing something incorrectly? Thanks in advance!
    m
    • 2
    • 5
  • b

    bland-sundown-49496

    09/16/2022, 2:46 PM
    Hello, Would you please share some ideas on how to troubleshoot the error. I am running on MAC. Error connecting to node broker:29092 (id: -1 rack: null) (org.apache.kafka.clients.NetworkClient) java.net.UnknownHostException: broker: Name does not resolve
    Error while executing topic command : Call(callName=createTopics, deadlineMs=1663338172286, tries=1, nextAllowedTryMs=1663338172387) timed out at 1663338172287 after 1 attempt(s)
    [2022-09-16 142252,290] ERROR org.apache.kafka.common.errors.TimeoutException: Call(callName=createTopics, deadlineMs=1663338172286, tries=1, nextAllowedTryMs=1663338172387) timed out at 1663338172287 after 1 attempt(s) Caused by: org.apache.kafka.common.errors.TimeoutException: Timed out waiting for a node assignment. Call: createTopics (kafka.admin.TopicCommand$) WARNING: Due to limitations in metric names, topics with a period ('.') or underscore ('_') could collide. To avoid issues it is best to use either, but not both. [2022-09-16 142254,036] WARN [AdminClient clientId=adminclient-1] Connection to node -1 (broker/172.18.0.9:29092) could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient) [2022-09-16 142254,160] WARN [AdminClient clientId=adminclient-1] Connection to node -1 (broker/172.18.0.9:29092) could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient) [2022-09-16 142254,365] WARN [AdminClient clientId=adminclient-1] Connection to node -1 (broker/172.18.0.9:29092) could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient) [2022-09-16 142254,569] WARN [AdminClient clientId=adminclient-1] Connection to node -1 (broker/172.18.0.9:29092) could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient) [2022-09-16 142255,079] WARN [AdminClient clientId=adminclient-1] Connection to node -1 (broker/172.18.0.9:29092) could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient) [2022-09-16 142256,097] WARN [AdminClient clientId=adminclient-1] Connection to node -1 (broker/172.18.0.9:29092) could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient) [2022-09-16 142327,118] WARN [AdminClient clientId=adminclient-1] Error connecting to node broker:29092 (id: -1 rack: null) (org.apache.kafka.clients.NetworkClient) java.net.UnknownHostException: broker: Name does not resolve at java.base/java.net.Inet4AddressImpl.lookupAllHostAddr(Native Method) at java.base/java.net.InetAddress$PlatformNameService.lookupAllHostAddr(InetAddress.java:929) at java.base/java.net.InetAddress.getAddressesFromNameService(InetAddress.java:1529) at java.base/java.net.InetAddress$NameServiceAddresses.get(InetAddress.java:848) at java.base/java.net.InetAddress.getAllByName0(InetAddress.java:1519) at java.base/java.net.InetAddress.getAllByName(InetAddress.java:1378) at java.base/java.net.InetAddress.getAllByName(InetAddress.java:1306) at org.apache.kafka.clients.DefaultHostResolver.resolve(DefaultHostResolver.java:27) at org.apache.kafka.clients.ClientUtils.resolve(ClientUtils.java:111) at org.apache.kafka.clients.ClusterConnectionStates$NodeConnectionState.currentAddress(ClusterConnectionStates.java:513) at org.apache.kafka.clients.ClusterConnectionStates$NodeConnectionState.access$200(ClusterConnectionStates.java:467) at org.apache.kafka.clients.ClusterConnectionStates.currentAddress(ClusterConnectionStates.java:172) at org.apache.kafka.clients.NetworkClient.initiateConnect(NetworkClient.java:985) at org.apache.kafka.clients.NetworkClient.ready(NetworkClient.java:311) at org.apache.kafka.clients.admin.KafkaAdminClient$AdminClientRunnable.sendEligibleCalls(KafkaAdminClient.java:1080) at org.apache.kafka.clients.admin.KafkaAdminClient$AdminClientRunnable.processRequests(KafkaAdminClient.java:1321) at org.apache.kafka.clients.admin.KafkaAdminClient$AdminClientRunnable.run(KafkaAdminClient.java:1264) at java.base/java.lang.Thread.run(Thread.java:829)
  • e

    early-airplane-84388

    09/17/2022, 5:58 AM
    Hi Team, How can I run a DataHub Actions Framework in the DataHub running on Kubernetes (installed with helm)? I want to run a Custom Action that I'm able to run with DataHub on my localhost (with quickstart). For testing, I tried updating the URL in the connection of hello_world.yaml with the IP address of DataHub but it failed with below error. (Obfuscated the IP below)
    Copy code
    Failed to instantiate Actions Pipeline using config {'name': 'hello_world', 'source': {'type': 'kafka', 'config': {'connection': {'bootstrap': '<http://34.1XX.XX.XXX:9092>', 'schema_registry_url': '<http://34.1XX.XX.XXX:8081>'}}}, 'action': {'type': 'hello_world'}} due to
                    'Caught exception while attempting to instantiate Event Source of type kafka' due to
                            '1 validation error for KafkaEventSourceConfig
    connection -> bootstrap
      host contains bad characters, found <http://34.1XX.XX.XXX> (type=assertion_error)'.
            Run with --debug to get full stacktrace.
            e.g. 'datahub --debug actions -c hello_world.yaml'
    m
    i
    • 3
    • 15
  • m

    melodic-beach-18239

    09/19/2022, 3:56 AM
    Hi, all. I want to know if i can install datahub offline?
    b
    i
    • 3
    • 10
  • m

    melodic-beach-18239

    09/19/2022, 3:56 AM
    Because my prod env cannot connect to Internet.
  • r

    rhythmic-zoo-52859

    09/19/2022, 12:20 PM
    Hi every one, can I hide Server info from ResponseHeader
  • r

    rhythmic-zoo-52859

    09/19/2022, 12:22 PM
    Hi every one, @big-carpet-38439 can I hide
    Server
    info from ResponseHeader
  • w

    wonderful-scooter-67279

    09/19/2022, 1:32 PM
    Hii Everyone,
  • w

    wonderful-scooter-67279

    09/19/2022, 1:33 PM
    any one create lineage with mysql??
    g
    • 2
    • 1
  • f

    famous-fall-59477

    09/20/2022, 9:11 AM
    Hi, I just wanted to understand the docker build process a bit better. In particular, I noticed in a few Dockerfiles, a reference is made to an environment variable
    BUILDPLATFORM
    . For instance, here in the
    datahub-gms
    Dockerfile: https://github.com/datahub-project/datahub/blob/master/docker/datahub-gms/Dockerfile#L27 I am not sure where
    BUILDPLATFORM
    is being set initially, can anyone point me to the place where it is being set?
    d
    • 2
    • 3
  • a

    adamant-rain-51672

    09/20/2022, 3:22 PM
    Hey, is there an easy way to change admin user password (from UI or CLI)?
    g
    b
    +2
    • 5
    • 22
  • e

    echoing-pillow-41000

    09/20/2022, 7:23 PM
    Getting a 404 from helm.datahubproject.io, is something down temporarily?
    l
    m
    • 3
    • 3
  • e

    enough-monitor-24292

    09/21/2022, 8:25 AM
    Hi, Can we perform search on schema description, schema tags and schema glossary?
    m
    b
    • 3
    • 3
  • g

    glamorous-microphone-33484

    09/22/2022, 1:41 AM
    Hi saw this https://github.com/datahub-project/datahub/pull/5896 doc regarding the approval workflow. Will this feature (assume it is in the managed service) be ported over to the open source version?
    i
    l
    • 3
    • 2
  • l

    little-spring-72943

    09/22/2022, 9:49 AM
    Do we have a working example of assertions in https://demo.datahubproject.io/ ? I would like to see how assertionRunEvent output looks in UI?
    h
    • 2
    • 1
  • f

    future-smartphone-53257

    09/22/2022, 9:51 AM
    Is there some way I can use the GraphQL API to only get things in a specific environment? I have tried this but it does not work:
    Copy code
    {
      searchAcrossEntities(input:{query:"", count: 999, types: [DATASET], filters: [{field: "environment", value: "EI"}]}) {
        count
        searchResults{
          entity {
            urn
            type
          }
        }
      }
    }
    b
    • 2
    • 2
1...424344...80Latest