https://datahubproject.io logo
Join Slack
Powered by
# getting-started
  • c

    calm-dog-26784

    02/26/2024, 9:52 AM
    Hello Everyone, Can someone guide me how do i get a DataHub instance provisioned? What are the steps?
    r
    • 2
    • 1
  • g

    gorgeous-tent-62316

    02/26/2024, 1:23 PM
    I don’t totally understand your question. If you are trying locally, and you want to load some initial data, you can run: datahub docker ingest-sample-data https://datahubproject.io/docs/quickstart/#ingest-sample-data
  • m

    many-hairdresser-47640

    02/26/2024, 3:58 PM
    As @gorgeous-tent-62316 mentioned I also started with the Quickstart. I was able to install on my laptop using WSL (Windows Subsystem for Linux). I am not expert in these things but it went pretty smoothly. For next step up open source I have recently learned about and am reviewing Kubernetes install at https://datahubproject.io/docs/deploy/kubernetes/.
  • r

    rich-barista-93413

    02/27/2024, 12:29 AM
    Hmm, that's a tricky one. Are you still experiencing this Philip?
  • b

    bitter-animal-44805

    02/28/2024, 4:03 PM
    Hello everyone! What would be the best place / way to learn about the default / built in analytics functionality in DataHub?
    r
    • 2
    • 5
  • a

    acceptable-portugal-56495

    02/28/2024, 8:01 PM
    When I delete platforms from Datahub using the datahub CLI, why do they still show up in the UI ? I do a hard delete, with --hard
    datahub delete --platform athena --hard
    r
    l
    • 3
    • 3
  • a

    acceptable-portugal-56495

    02/28/2024, 8:18 PM
    Or, how do I delete everything in datahub?
    g
    • 2
    • 1
  • g

    gray-gold-85760

    03/01/2024, 8:32 AM
    What all can I extract about a dataset using graphql queries I just got started query Queryname($urn: String!){dataset(urn: $urn){urn}} Apart from urn what all can I extract And can anyone guide me where can I learn more about these queries
    l
    r
    • 3
    • 8
  • b

    bright-motherboard-28717

    03/01/2024, 3:47 PM
    I have a question about the kafka-setup container. It looks like it is used to precreate topics and the default value is set to false in the docker-compose for the quickstart. In what circumstances would we want to set it to true and have it precreate topics?
    r
    • 2
    • 3
  • r

    rich-policeman-92383

    03/01/2024, 7:20 PM
    Does the release v0.13.0 support elastic8.X ?
    r
    • 2
    • 1
  • a

    abundant-garage-20831

    03/02/2024, 12:46 PM
    Hi I'm trying to create lineage via gms api of datahub, using updatelineage, that is the only one I found, there is two options edgestoadd and edgestoremove, I don't have any edge to remove, but it has to be added otherwise it is giving error. Can someone tell me the syntax to just add a lineage using curl command
    r
    • 2
    • 1
  • a

    abundant-wall-60815

    03/04/2024, 9:03 AM
    How to solve this problem of missing jar package?import com.linkedin.metadata.models.registry.config.Entities;
  • d

    damp-solstice-31196

    03/04/2024, 6:50 PM
    Hi everyone. Once data documentation and the business glossary are created, is there a way to export that information and use it in another Datahub instance? Or would the only way be copying data across separate RDS instances? Thank you!
    g
    l
    r
    • 4
    • 5
  • b

    boundless-nail-65912

    03/06/2024, 6:17 AM
    Hi Team, We are trying to implement Columnar level lineage for Vertica Source. Can anyone provide where can we get the more information about column level lineage?
    r
    • 2
    • 4
  • s

    swift-dentist-41637

    03/06/2024, 10:30 AM
    Hi folks! We've been playing with the Datahub quickstart and it's great 💪 We have a question about infrastructure for a full deployment: is it theoretically possible to bypass Kafka? We're wondering if it's mainly to enable push-based metadata changes - and so if we were willing to lose those features could we replace it with a cheaper/simpler system? Or is it quite deeply embedded and would require a lot of changes?
    b
    o
    • 3
    • 5
  • f

    flaky-raincoat-44662

    03/06/2024, 9:52 PM
    I love the data products concept and formalization of data mesh principles embedded in the product. Can a data product include predefined/calculated metrics and if so, how does grain and dimensionalization work?
    g
    • 2
    • 2
  • t

    tall-painting-67124

    03/07/2024, 12:27 AM
    Hi All,
  • t

    tall-painting-67124

    03/07/2024, 12:28 AM
    not sure if this is the right thread but just wanted to understand if we can have Apache solr as a source in datahub.. I know Elastic is supported but how about Apache solr
    r
    • 2
    • 1
  • t

    tall-answer-76571

    03/07/2024, 9:42 AM
    Hello all! Could someone please explain which PowerBI license is required for integrating dashboards with a data catalog?
  • r

    red-scientist-36390

    03/07/2024, 11:23 AM
    Hello! We’re currently evaluating our options in terms of data observability tools and are curious about the different costs & benefits of managed Datahub. Is there anywhere specific where this info could be accessed? Thank you
    b
    s
    • 3
    • 2
  • w

    wonderful-baker-8803

    03/07/2024, 10:56 PM
    Hi all, I am currently using Datahub v 0.12.1. checking how can we pass personal access token via datahub cli?
    r
    • 2
    • 2
  • l

    little-musician-10851

    03/07/2024, 11:09 PM
    Hello! I would like to evaluate and experiment with Datahub integrations with Neo4j. I know Neo4j acts as the graph database but how would i see that reflected in my local Neo4j instance? Any thoughts or additional documentation would be helpful. Thanks!
  • b

    bland-receptionist-85001

    03/08/2024, 12:16 AM
    This feature of data exploration exist?

    https://www.youtube.com/watch?v=vGGqjP-5Rms▾

    r
    • 2
    • 1
  • b

    billions-baker-82097

    03/08/2024, 12:46 PM
    Hi team, I am working on a design where I wanted to fetch the changes happening at the source side and put it the datahub without re-running the recipe file. Traditionally what we have in the datahub is like we have to re-run our recipe to get the changes from the source and it gets reflected on the datahub in a timeseries way. But can we have something where we do not require to re-run our recipe to get the changes to the datahub. Current Scene: We are fetching pull-based metadata from source ( we need to re-run the recipe ) and reflecting to the datahub UI. Required Scene: Data Source should be able to send the changes to the Datahub, whenever there is a change in their source. It's like *push-based ingestio*n from source to the Datahub. Any thoughts and inputs are welcome.
    r
    • 2
    • 1
  • w

    wonderful-solstice-50942

    03/08/2024, 3:04 PM
    Hello, how do I switch the tokenizer for ElasticSearch in a Docker deployment?
  • t

    tall-answer-76571

    03/11/2024, 9:52 AM
    Hi everyone! Is it possible to connect DataHub to PowerBI Report Server (NOT Power bi service in cloud) ?
    r
    • 2
    • 1
  • f

    fresh-rain-44904

    03/11/2024, 8:27 PM
    Hi All, We have an instance of Salesforce Marketing Cloud (SFMC); and we would like to connect SFMC with DataHub. I know DataHub has a connector for Salesforce but is this connector covering only the Salesforce Data Cloud? Also it would be great if anyone has any experience connecting SFMC with DataHub? TIA!
    r
    • 2
    • 1
  • l

    little-scooter-91144

    03/12/2024, 7:07 AM
    Hi,everyone!Can lineage relationship be established between datahub containers? I used s3 source and found that the type of folder in the ingestion result was container, but there was no blood tab on the page (just like dataset for each file).
    r
    • 2
    • 1
  • r

    red-piano-8955

    03/13/2024, 6:35 PM
    Hi All, is there a way to create a policy that allows all readers to add tags to any dataset but not delete them?
    r
    • 2
    • 1
  • t

    tall-painting-67124

    03/14/2024, 4:01 AM
    Hi All,
1...7677787980Latest