https://datahubproject.io logo
Join Slack
Powered by
# getting-started
  • p

    prehistoric-furniture-42991

    02/14/2023, 10:59 AM
    Hi, I'm installing Datahub on EC2 (Linux) & while starting the quickstart, getting the below error: $ datahub docker quickstart Docker doesn't seem to be running. Did you start it? And I have checked the docker status $ sudo systemctl status docker.service ● docker.service - Docker Application Container Engine Loaded: loaded (/usr/lib/systemd/system/docker.service; enabled; vendor preset: disabled) Active: active (running); 10min ago
    👀 1
    i
    • 2
    • 3
  • s

    silly-dog-87292

    02/14/2023, 3:11 PM
    Team, i have this question, I am building a local customized UI to show my catalog items, lineage, business glossary etc. Does datahub allow to expose these metadata over restapi to external applications?
    ✅ 1
    👀 1
    i
    b
    • 3
    • 3
  • w

    white-horse-97256

    02/14/2023, 5:29 PM
    Hi team, where does datahub store secrets? In Mysql or elasticsearch? Is there a way we can see these backend storage?
    ✅ 2
    👀 1
    a
    i
    b
    • 4
    • 10
  • s

    silly-helmet-2790

    02/14/2023, 9:13 PM
    Hi All, I got the datahub quickstart running. Able to logon using the datahub default user. I am trying to follow this guide to setup UI based ingestion - https://datahubproject.io/docs/ui-ingestion . But my settings and profile page do not show any of those options. I wonder if the 'datahub' is indeed a root user. Here is how my screen look like. I did CLI ingestion with a mysql source and got msg that pipeline finished successfully. But still the homepage don't show any metadata. Please guide what I am missing.
    a
    b
    e
    • 4
    • 8
  • h

    hallowed-businessperson-25451

    02/15/2023, 8:51 AM
    Hi all. I have so far been impressed with the metadata mapping of Datahub, but I have a few simple questions. We need a data catalogue that also allows users to access the data as well, such as specifing downloadable datasets or exposing APIs for certain data. Is that possible with Datahub?
    a
    • 2
    • 7
  • h

    hundreds-oxygen-82939

    02/15/2023, 11:27 AM
    Hi everyone! I’m trying to send profiling data (Stats tab) to a dataset in DataHub using Python. Does anyone have an example of this implementation?
    ✅ 1
    a
    b
    • 3
    • 4
  • f

    famous-afternoon-64386

    02/15/2023, 4:57 PM
    Hello everyone, Thanks for being such a great commnity!
    b
    • 2
    • 1
  • f

    famous-afternoon-64386

    02/15/2023, 4:57 PM
    I'm testing datahub on my infrastructure, and I was wondering if it was possible to ingest a MongoDB database via ssh tunnel from the UI. If so, what parameters should be used? Thanks in advance! :) Best regards Luis
    d
    b
    • 3
    • 5
  • w

    witty-butcher-82399

    02/15/2023, 5:28 PM
    I had the opportunity to read the blog post about PII Classification and it's really interesting 👏 Kudos for such a nice feature! https://blog.datahubproject.io/pii-classification-just-got-easier-with-datahub-6bab2b63abcb There are 4 prediction factors, one of them being "values". Is it possible to use the classifier without this predictor? I'm asking because in most of our ingestors we don't have permissions on the data, only the metadata. Thanks!
    🙌 2
    ✅ 1
    👍 1
  • r

    red-telephone-12711

    02/16/2023, 7:36 AM
    Hello there! Can I send data tests info to datahub from my own test framework based on pytest and see results on Validation tab?
    ✅ 1
    a
    • 2
    • 3
  • n

    nice-carpenter-79371

    02/16/2023, 12:45 PM
    Hello, May I know is "activity feed" supported by datahub https://docs.open-metadata.org/openmetadata/discovery-collaboration
    ✅ 1
    a
    • 2
    • 2
  • c

    calm-forest-3576

    02/17/2023, 1:19 PM
    Hi everyone, in our City, we would like to test a data catalog. Normally what we understand udner catalog is the Open data portal something similar this one https://hri.fi/en_gb/ We have decided to use CKAN as the catalog. However recently a colleague of mine showed me the DataHub. I am not quite convineced, if this is the right data catalog for what we need. Is there anyone here who can give me a bit more insight about who should use DataHub?
    ✅ 1
    a
    • 2
    • 2
  • l

    late-rocket-94535

    02/18/2023, 9:10 AM
    Hi all. Thank you for orFilters in GraphQL, it's very useful, but can I use search by null values? Something as
    Copy code
    search(
                    input: {start: 0, count: 100, query: "*", type: DATASET, 
                      orFilters: [{and: [
                        {field: "platform", values: "urn:li:dataPlatform:kafka"}, 
                        {field: "subTypes", condition: EQUAL, values: null},
                      ]}] }
                  )
    ✅ 1
    a
    b
    e
    • 4
    • 7
  • a

    alert-printer-93847

    02/19/2023, 11:34 PM
    Hi all, just started here and I have a quick question! Is it possible to use my data hub as landing page without requiring users to login? The problem im solving is I want to create a datahub where users can search and download data without logging-in !
    ✅ 1
    i
    • 2
    • 2
  • r

    ripe-eye-60209

    02/20/2023, 4:24 PM
    Hello Team, how can we extract/track who logged in when, and how many times? the data behind the weekly active users.
    👀 1
    r
    • 2
    • 3
  • p

    powerful-telephone-2424

    02/21/2023, 8:07 AM
    Hi Everyone! I’m just getting started with DataHub and I have a question around the entity model. Currently I see my datasets having URNs like:
    urn:li:dataset:(urn:li:dataPlatform:snowflake,db.schema.table,PROD)
    But this might cause issues if I have two snowflake instances with the same db, schema and table names. (We may possibly have this scenario quite soon). I am thinking this dataset URN format is not actually unique. It needs information beyond just
    snowflake
    . This led me to dig deeper into the current entity model and I found the
    dataPlatformInstance
    entity type. However I don’t see this entity being created in the mariaDb table. Are there any known reasons why this entity is not really used?
    ✅ 1
    a
    b
    • 3
    • 4
  • r

    rich-daybreak-77194

    02/21/2023, 10:06 AM
    Hi everyone i'm just getting start in datahub. I want to integrate datahub with great expectations. I want to run datahub and great expectation in one time when i command with yml file like docker-compose up -d --build Anyone can suggest or help? Thank you
    a
    o
    • 3
    • 4
  • f

    fresh-kitchen-67478

    02/21/2023, 11:14 AM
    Hi Everyone Im just starting with datahub when i started to install locally to get an understanding im getting the following error
    Traceback (most recent call last):
    File "/Users/adm-kumardine/opt/anaconda3/bin/datahub", line 5, in <module>
    from datahub.entrypoints import main
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/datahub/entrypoints.py", line 17, in <module>
    from datahub.cli.delete_cli import delete
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/datahub/cli/delete_cli.py", line 18, in <module>
    from datahub.upgrade import upgrade
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/datahub/upgrade/upgrade.py", line 25, in <module>
    class VersionStats(BaseModel, arbitrary_types_allowed=True):
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/pydantic/main.py", line 262, in __new__
    fields[ann_name] = inferred = ModelField.infer(
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/pydantic/fields.py", line 315, in infer
    return cls(
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/pydantic/fields.py", line 284, in __init__
    self.prepare()
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/pydantic/fields.py", line 362, in prepare
    self.populate_validators()
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/pydantic/fields.py", line 538, in populate_validators
    *(get_validators() if get_validators else list(find_validators(self.type_, self.model_config))),
    File "/Users/adm-kumardine/opt/anaconda3/lib/python3.9/site-packages/pydantic/validators.py", line 629, in find_validators
    `raise RuntimeError(f'no validator found for {type_}, see
    arbitrary_types_allowed
    in Config')` `RuntimeError: no validator found for <class 'packaging.version.Version'>, see
    arbitrary_types_allowed
    in Config`
    a
    o
    • 3
    • 5
  • f

    fresh-kitchen-67478

    02/21/2023, 11:15 AM
    im using a mac book m2
  • r

    ripe-eye-60209

    02/21/2023, 1:48 PM
    Hello Team, how can we retrived glossaryTerm createdAt timestamp using graphql. We are getting null, any ideas?
    a
    b
    • 3
    • 5
  • s

    steep-vr-39297

    02/21/2023, 3:10 PM
    Hi, I have a question. Elasticsearch's X-Pack cannot be used because of the license. So I can't use the Analytics and Linegae function?
    ✅ 1
    a
    o
    • 3
    • 4
  • m

    most-television-46558

    02/21/2023, 6:36 PM
    Hi Folks, looking for a bit of help on docs and links that explain how one might contrib. to the project.
    a
    • 2
    • 3
  • b

    bland-orange-13353

    02/22/2023, 3:02 AM
    This message was deleted.
    ✅ 1
    m
    b
    • 3
    • 2
  • t

    thankful-family-92426

    02/22/2023, 7:37 AM
    Hi Team, How can I start
    datahub-web-react
    without running the docker containers? I executed
    yarn isntall & yarn start:mock
    , but the process was blocked in below logs:
    Search for the keywords to learn more about each warning. To ignore, add // eslint-disable-next-line to the line before.
    and another question, may I code and test without running the docker containers in idea?
    a
    b
    • 3
    • 6
  • a

    astonishing-printer-13992

    02/22/2023, 9:51 AM
    Hi Everyone! Can I deploy datahub to openshift container platform? Did you test it on openshift?
    ✅ 1
    a
    o
    +2
    • 5
    • 7
  • t

    thankful-family-92426

    02/22/2023, 12:11 PM
    Hi Everyone, Who can help me with this error
    > Could not read '<https://api.adoptopenjdk.net/v3/binary/latest/8/ga/mac/aarch64/jdk/hotspot/normal/adoptopenjdk>' as it does not exist.
    a
    b
    • 3
    • 6
  • a

    agreeable-army-26750

    02/22/2023, 3:15 PM
    Hi everyone! I just tried to run the metadata-service build (
    ./gradlew :metadata-service:war:build --stacktrace
    ), but it fails with the following error message:
    org.gradle.api.GradleException: Failed to create Jar file /Users/marcellreti/.gradle/caches/jars-8/a42b3fb0a665a80cce3408f0f251b9e9/buildSrc.jar.
    at org.gradle.internal.classpath.ClasspathBuilder.jar(ClasspathBuilder.java:47)
    at org.gradle.internal.classpath.InstrumentingClasspathFileTransformer.instrument(InstrumentingClasspathFileTransformer.java:103)
    at org.gradle.internal.classpath.InstrumentingClasspathFileTransformer.transform(InstrumentingClasspathFileTransformer.java:98)
    at org.gradle.internal.classpath.InstrumentingClasspathFileTransformer.transform(InstrumentingClasspathFileTransformer.java:68)
    at org.gradle.internal.classpath.DefaultCachedClasspathTransformer$TransformFile.lambda$schedule$0(DefaultCachedClasspathTransformer.java:261)
    at org.gradle.internal.concurrent.ExecutorPolicy$CatchAndRecordFailures.onExecute(ExecutorPolicy.java:64)
    at org.gradle.internal.concurrent.ManagedExecutorImpl$1.run(ManagedExecutorImpl.java:48)
    at org.gradle.internal.concurrent.ThreadFactoryImpl$ManagedThreadRunnable.run(ThreadFactoryImpl.java:56)
    Caused by: java.lang.IllegalArgumentException: Unsupported class file major version 62
    at org.objectweb.asm.ClassReader.<init>(ClassReader.java:196)
    at org.objectweb.asm.ClassReader.<init>(ClassReader.java:177)
    at org.objectweb.asm.ClassReader.<init>(ClassReader.java:163)
    at org.gradle.internal.classpath.InstrumentingClasspathFileTransformer.lambda$visitEntries$1(InstrumentingClasspathFileTransformer.java:116)
    at org.gradle.internal.classpath.ClasspathWalker.visitJarContents(ClasspathWalker.java:91)
    at org.gradle.internal.classpath.ClasspathWalker.visit(ClasspathWalker.java:55)
    at org.gradle.internal.classpath.InstrumentingClasspathFileTransformer.visitEntries(InstrumentingClasspathFileTransformer.java:114)
    at org.gradle.internal.classpath.InstrumentingClasspathFileTransformer.lambda$instrument$0(InstrumentingClasspathFileTransformer.java:105)
    at org.gradle.internal.classpath.ClasspathBuilder.buildJar(ClasspathBuilder.java:58)
    at org.gradle.internal.classpath.ClasspathBuilder.jar(ClasspathBuilder.java:45)
    ... 7 more
    Could you help me find out what could be the issue? Thanks in advance! (I am on release 0.10.0)
    ✅ 1
    s
    • 2
    • 10
  • s

    strong-analyst-53140

    02/22/2023, 6:18 PM
    Hello, new here and currently looking for a product that fits our metadata & governance needs. data quality or validations an external integration is needed?
    ✅ 1
    a
    • 2
    • 4
  • l

    loud-analyst-32617

    02/22/2023, 9:02 PM
    hey all, i am actually up and running with data hub in EKS and looking to get some sample data loaded. there are lots of docs about how to do this in docker on your local machine, but everything i have tried to translate that to an instance in kubernetes does not seem to go anywhere. do you support this or do i have to hook into real sources?
    o
    • 2
    • 3
  • m

    millions-pencil-75565

    02/22/2023, 9:40 PM
    Hi everyone- I’m looking to pair Datahub with MonteCarlo in a GCP+AWS setup. Does Datahub play nice with MonteCarlo table quality stats? Rest of the environment is Airflow, Stitch and dbt. Any problems there?
    plus1 1
    a
    d
    +3
    • 6
    • 6
1...555657...80Latest