https://datahubproject.io logo
Join SlackCommunities
Powered by
# getting-started
  • s

    square-activity-64562

    07/22/2021, 9:26 AM
    How do people specify foreign keys in datahub? Adding descriptions would be too cumbersome
    m
    • 2
    • 1
  • s

    square-activity-64562

    07/23/2021, 4:51 AM
    Suggestion: Maintain a page with breaking changes like apache superset does. https://github.com/apache/superset/blob/master/UPDATING.md. While github releases are good, the way apache superset has been doing it https://github.com/apache/superset/tree/master/RELEASING/release-notes-1-2 is great
    m
    • 2
    • 1
  • f

    faint-hair-91313

    07/24/2021, 12:04 PM
    Hi guys! Any chance to see the recording of the last Townhall?
    m
    g
    • 3
    • 5
  • s

    square-activity-64562

    07/25/2021, 6:41 AM
    In the townhall new CLI commands were shown which could do rollbacks. In a kubernetes deployment where would we run those commands? Inside gms pod? Will that rollback the profiling stats? Asking because I would like to enable the profiling stats. But the
    sample values
    is something that I am not sure I would like to have in the UI. So wanted to try to ingest it once. But in case wanted to revert it should be able to do that
    m
    • 2
    • 4
  • s

    strong-restaurant-35629

    07/26/2021, 1:48 PM
    Hi, I'm new DataHub but very in its capabilities. I'm looking some guidance or support on deploying GKE. Any guidance or support much appreciated
    m
    • 2
    • 4
  • m

    millions-engineer-56536

    07/26/2021, 9:11 PM
    Not sure what's the right channel but... In

    latest Community Meetingβ–Ύ

    The super awesome feature of dataset stats seems to rely on time series based datastore. It sounds like that data is not persisted in MySQL but instead is only stored in ES... In case of some issues and a need for running reindexing of ES how is that data preserved? Or would we need to fully re-ingest this data after reindexing?
    b
    m
    +2
    • 5
    • 14
  • s

    square-activity-64562

    07/28/2021, 7:44 AM
    When is the next release going to happen? Looking forward to some of the new things which are already in.
    m
    • 2
    • 1
  • s

    square-activity-64562

    07/28/2021, 8:31 AM
    Is there any way to search for datasets with/without owners? I can see "21.94% have owners assigned!". I am trying to get a breakdown for this so I can see something like
    Copy code
    users  -> owner of number of datasets
    groups -> owner of number of datasets
    e
    b
    • 3
    • 2
  • q

    quick-restaurant-75578

    07/29/2021, 3:09 AM
    Dear Datahub team - can we have another group for installation and infrastructure setup please
    m
    • 2
    • 2
  • s

    square-activity-64562

    07/30/2021, 7:42 AM
    Is there a ping endpoint for gms? I would like to test connectivity from a different cluster to see if I able to reach gms
    m
    e
    b
    • 4
    • 10
  • b

    big-carpet-38439

    07/30/2021, 3:16 PM
    Welcome @little-smartphone-52405 and @late-pizza-91254!
    πŸ‘‹ 1
    l
    l
    • 3
    • 2
  • b

    better-orange-49102

    08/02/2021, 9:34 AM
    will the rbac feature timeline be shifted back further? i think the last ETA communicated was in mid Aug?
    m
    • 2
    • 2
  • m

    mammoth-bear-12532

    08/02/2021, 3:17 PM
    Welcome @chilly-wall-98962! You are member 1000 πŸ™‚
    πŸ“ˆ 2
    πŸ™Œ 4
    πŸŽ‰ 3
    πŸ™ŒπŸ» 1
    datahub 2
    πŸš€ 11
    c
    • 2
    • 1
  • m

    mammoth-bear-12532

    08/02/2021, 7:03 PM
    Welcome @full-refrigerator-38116! Just because you are 1001 doesn't mean you're not special for us πŸ™‚
    πŸŽ‰ 3
    f
    • 2
    • 1
  • q

    quick-restaurant-75578

    08/03/2021, 3:38 AM
    Hi … is there a documentation available detailing the differentiators from Amundsen ?
    m
    • 2
    • 2
  • s

    square-activity-64562

    08/03/2021, 8:45 AM
    Is there an example of View
    DatasetLineageTypeClass
    https://github.com/linkedin/datahub/blob/5eee818a619e27300f8ae3cf749d3f82aa23f43e/metadata-ingestion/src/datahub/metadata/schema_classes.py#L2279 Does it show up differently in the UI?
    g
    • 2
    • 3
  • a

    ambitious-airline-8020

    08/03/2021, 12:49 PM
    Dear team. Does anyone know, how to clean metadata correctly (both mysql and elastic)? Not as big as
    docker/nuke.sh
    , just drop meta P.S. It is about dev env, running from
    docker/dev.sh
    , so not related to any production cases
    g
    • 2
    • 8
  • m

    mammoth-bear-12532

    08/04/2021, 12:23 AM
    <!here> πŸ“£ The big July release is here! We released DataHub 0.8.7 earlier today along with the
    acryl_datahub
    PyPi package (version 0.8.7.0). Looking forward to hearing all the awesome things you all will do with this!
    πŸ’― 2
    πŸŽ† 3
    πŸ‘ 6
    πŸš€ 9
    s
    • 2
    • 3
  • p

    prehistoric-yak-75049

    08/05/2021, 6:41 PM
    Hi I am trying to run update
    docker pull acryldata/datahub-upgrade:head && docker run --env-file ~/setup/docker/datahub-upgrade/env/docker.env acryldata/datahub-upgrade:head -u NoCodeDataMigration
    but getting below exception to build docker image
    Copy code
    ERROR SpringApplication Application run failed
     org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'upgradeCli': Unsatisfied dependency expressed through field 'noCodeUpgrade'; nested exception is org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'com.linkedin.gms.factory.entityregistry.EntityRegistryFactory': Unsatisfied dependency expressed through field 'configEntityRegistry'; nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'configEntityRegistry' defined in class path resource [com/linkedin/gms/factory/entityregistry/ConfigEntityRegistryFactory.class]: Bean instantiation via factory method failed; nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [com.linkedin.metadata.models.registry.ConfigEntityRegistry]: Factory method 'getInstance' threw exception; nested exception is java.io.FileNotFoundException: ../../metadata-models/src/main/resources/entity-registry.yml (No such file or directory)
    	at org.springframework.beans.factory.annotation.AutowiredAnnotationBeanPostProcessor$AutowiredFieldElement.inject(AutowiredAnnotationBeanPostProcessor.java:643)
    	at org.springframework.beans.factory.annotation.InjectionMetadata.inject(InjectionMetadata.java:116)
    	at org.springframework.beans.factory.annotation.AutowiredAnnotationBeanPostProcessor.postProcessProperties(AutowiredAnnotationBeanPostProcessor.java:399)
    	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.populateBean(AbstractAutowireCapableBeanFactory.java:1422)
    	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.doCreateBean(AbstractAutowireCapableBeanFactory.java:594)
    	at org.springframework.beans.factory.support.AbstractAutowireCapableBeanFactory.createBean(AbstractAutowireCapableBeanFactory.java:517)
    	at org.springframework.beans.factory.support.AbstractBeanFactory.lambda$doGetBean$0(AbstractBeanFactory.java:323)
    	at org.springframework.beans.factory.support.DefaultSingletonBeanRegistry.getSingleton(DefaultSingletonBeanRegistry.java:222)
    	at org.springframework.beans.factory.support.AbstractBeanFactory.doGetBean(AbstractBeanFactory.java:321)
    	at org.springframework.beans.factory.support.AbstractBeanFactory.getBean(AbstractBeanFactory.java:202)
    	at org.springframework.beans.factory.support.DefaultListableBeanFactory.preInstantiateSingletons(DefaultListableBeanFactory.java:879)
    	at org.springframework.context.support.AbstractApplicationContext.finishBeanFactoryInitialization(AbstractApplicationContext.java:878)
    	at org.springframework.context.support.AbstractApplicationContext.refresh(AbstractApplicationContext.java:550)
    	at org.springframework.boot.SpringApplication.refresh(SpringApplication.java:775)
    	at org.springframework.boot.SpringApplication.refreshContext(SpringApplication.java:397)
    	at org.springframework.boot.SpringApplication.run(SpringApplication.java:316)
    	at org.springframework.boot.builder.SpringApplicationBuilder.run(SpringApplicationBuilder.java:139)
    	at com.linkedin.datahub.upgrade.UpgradeCliApplication.main(UpgradeCliApplication.java:13)
    ...
    Caused by: org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'com.linkedin.gms.factory.entityregistry.EntityRegistryFactory': Unsatisfied dependency expressed through field 'configEntityRegistry'; nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'configEntityRegistry' defined in class path resource [com/linkedin/gms/factory/entityregistry/ConfigEntityRegistryFactory.class]: Bean instantiation via factory method failed; nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [com.linkedin.metadata.models.registry.ConfigEntityRegistry]: Factory method 'getInstance' threw exception; nested exception is java.io.FileNotFoundException: ../../metadata-models/src/main/resources/entity-registry.yml (No such file or directory)
    m
    e
    • 3
    • 7
  • c

    crooked-toddler-8683

    08/06/2021, 4:48 PM
    good afternoon. Would datahub discover the SSIS packages for pipelines?
    g
    • 2
    • 1
  • w

    witty-butcher-82399

    08/09/2021, 12:29 PM
    In terms of observability, is DataHub exposing metrics such us number of entities?
    πŸ‘€ 1
    m
    b
    • 3
    • 4
  • m

    magnificent-camera-71872

    08/10/2021, 1:29 AM
    Hi folks..... I'm evaluating datahub as a potential data catalog for our org. I ingested some test metadata but now want to delete it from datahub. I trying to follow the delete instructions here: https://datahubproject.io/docs/how/delete-metadata but my datahub server is remote. How can i specify the server url using the datahub cli ? Cheers -- Simon
    g
    m
    • 3
    • 9
  • m

    magnificent-camera-71872

    08/10/2021, 3:32 AM
    Does anyone know if its possible to restrict the metadata users can see by using role based access control in datahub. Our org wants to restrict what particular users can see within the datahub catalog.
    ☝️ 1
    b
    m
    • 3
    • 6
  • b

    blue-megabyte-68048

    08/10/2021, 12:53 PM
    Hello! I've been looking at extending the built-in metadata using the [No Code Metadata](https://datahubproject.io/docs/advanced/no-code-modeling) instructions, but it seems the [MCP/MCL stuff](https://datahubproject.io/docs/advanced/mcp-mcl) might make some changes to that process. Do the steps remain the same? Also, are all the old topics (MCE, MAE, FCE) and new topics (MCP, FMCP, MCLV, MCLT) required for a new deployment? Thanks!
    m
    b
    • 3
    • 11
  • m

    mammoth-bear-12532

    08/10/2021, 4:10 PM
    PSA: github seems to be having some stability issues. Please write code slowly 🐒
    • 1
    • 1
  • b

    blue-megabyte-68048

    08/11/2021, 4:38 PM
    Are there currently any efforts or discussion to enable dynamic/runtime Entity definition?
    l
    b
    • 3
    • 4
  • m

    magnificent-camera-71872

    08/12/2021, 5:14 AM
    Hi all.... I'm trying to use curl to retrieve details on a dataset I added to datahub. No matter what I try, I always receive the message "You need to enable JavaScript to run this app". I'm running the following curl command:
    Copy code
    curl -H 'X-RestLi-Protocol-Version:2.0.0' -H 'X-RestLi-Method: get'  '<https://datahub.xxxxxxxx-nonprod-1.yyyyyyyy-datalake-nonprod.com.au/dataset/urn%3Ali%3Adataset%3A(urn%3Ali%3AdataPlatform:redshift,datalake.kayo_datalake_transform_current.ares_masterindex_daily_snapshot_view,DEV)/schema>'
    I thought maybe I'm using the wrong endpoint, but curl for the config works fine:
    Copy code
    curl '<https://datahub.xxxxxxxx-nonprod-1.yyyyyyyy-datalake-nonprod.com.au/config>'
    Does anyone have any ideas ?
    l
    g
    • 3
    • 6
  • b

    bland-orange-13353

    08/12/2021, 6:59 PM
    This message was deleted.
    g
    w
    +2
    • 5
    • 22
  • m

    mammoth-bear-12532

    08/13/2021, 5:20 AM
    <!here> πŸ“£ Release 0.8.9 just dropped! along with the companion pip package
    acryl_datahub
    v0.8.9.0. Release Highlights: β€’ Support for nested structs, union types and key-value schemas in Kafka β€’ Support for JDBC Connector based sources in Kafka Connect β€’ Support for Okta as a source for User and Group metadata β€’ Support for using AWS Glue schema registry Detailed notes are here. Enjoy! πŸ”₯
    πŸ™Œ 4
    πŸŽ‰ 3
    s
    a
    +3
    • 6
    • 10
  • m

    microscopic-musician-99632

    08/13/2021, 5:52 AM
    Is there support for classifications. Currently in the demo site can see tag "classification:confidential" , is this the structure ? Could not find much documentation regarding same or PII fields , apologize if I missed it.
    g
    s
    • 3
    • 3
1...101112...80Latest