https://datahubproject.io logo
Join Slack
Powered by
# random
  • c

    clever-author-65853

    05/31/2023, 10:56 AM
    Hello! I would like to build a usage dashboard outside of analytics tab based on the usage events sent to ES. Few Questions in mind: 1. is there a session id ? 2. browser id, what exactly is it? 3. is there an event for login or signup? 4. and thought on how I can connect search query with clicking on one of the results? 5. any documentation about the types of events? Appreciate you help!
  • e

    elegant-salesmen-99143

    06/01/2023, 8:39 PM
    Hi! Doese anyone have any estimate comparison of expensivness for different profiling options? Meaning which ones take most of the resources and which ones - the least. Thanks in advance for sharing the expertise!
    m
    • 2
    • 2
  • p

    powerful-battery-5070

    06/02/2023, 5:32 PM
    Hi all, a random question not directly related to DataHub. How does everyone do storage showback/chargeback? If I have a 100G dataset stored by team A, but is used by teams B and C, who pays for the storage of this dataset? Moreover, if team B needs the data for 3 months, team C needs it for 6 and team A does not care, does team C pay for the extra 3 months while the first 3 months are split between B and C? We are working on data management in the company and one of the tasks is to be able to create a showback model. Our datasets are used by multiple teams and we are wondering if there is an industry standard to split asset/storage costs based on consumer needs. TIA!
  • b

    bright-action-35725

    06/05/2023, 12:15 PM
    @modern-carpenter-21365 What are the pros and cons of the Acryl DataHub license agreement?
    s
    d
    • 3
    • 2
  • h

    happy-helmet-66366

    06/09/2023, 12:48 PM
    Does / Could Acryl offer hosting of DataHub in AWS South Africa? With PrivateLink for secure connections to sources like Kafka?
    plus1 1
    s
    • 2
    • 3
  • b

    better-orange-49102

    06/12/2023, 10:04 AM
    im just wondering aloud... for those with "no permissions even with datahub user", would having a script to programmatically enable the privileges to datahub user help with this issue?
  • g

    glamorous-wire-83850

    06/14/2023, 11:02 AM
    Hi folks, I want to back up the glossary terms that are manually entered via the UI. Is there any possible way to back them up and insert them again with the same output?
    g
    • 2
    • 1
  • r

    red-solstice-83887

    06/15/2023, 7:44 AM
    Hey DataHub community! We’re currently working on ingesting Power BI reports and datasets into our instance of DataHub. But we’re facing the following challenge (which in our opinion is a major one): • The metadata the Power BI API provides us is severely limited! ◦ Our users want to be able to search by, not only report name, but also by the names of the charts, graphs, pages, and filters available within a report. ◦ Unfortunately we’re only able to pull name, description, url, owner, created date, modified date, and associated Power BI dataset. Has anyone managed to solve this problem and enrich the metadata for their Power BI reports using some other means than the Power BI API?
    g
    • 2
    • 6
  • m

    mysterious-application-34432

    06/19/2023, 5:00 PM
    Hi developers who are interested in data security, Cisco and Altinity are meeting over a LIVE webinar tomorrow to showcase their collaborative project on deploying Clickhouse in FedRAMP using Altinity’s FIPS-compatible stable builds. Date and Time: June 20, 10 AM PDT Speakers: • Pauline Yeung, Data Engineer & SecDevOps at Cisco Umbrella and • Robert Hodges, CEO at Altinity Tune in LIVE to learn more about: • What is Cisco Umbrella and how does it use ClickHouse? • What are the challenges of bringing up ClickHouse in a FedRAMP environment? • How are Cisco Umbrella and Altinity working together to deploy FIPS-compatible analytics? • What lessons can we share with other users on the same path? RSVP your free seat here: https://hubs.la/Q01T8qyb0
    l
    b
    • 3
    • 5
  • r

    red-solstice-83887

    06/22/2023, 10:20 AM
    Hey DH community, does anyone else here catalog their organisation’s machine learning models, features, and datasets? If so, are you facing an issue with models and features having low page views or users because they aren’t reusable? Given that most ML models are very specific to a domain or problem and there’s little that can be reused across models? I guess the broader question is…is it worth making something that isn’t reusable, easily discoverable on DataHub? CC: @boundless-student-48844
    m
    • 2
    • 4
  • f

    future-holiday-32084

    06/25/2023, 4:44 PM
    Hello everyone, in the video "Great Expectations Outcomes in DataHub" by John Joyce, he mentioned that DataHub will support other validation tools like Deequ. I'm wondering if this support is currently available, and if so, could you provide me with any relevant documentation?
    l
    • 2
    • 1
  • w

    witty-secretary-7594

    06/27/2023, 12:05 PM
    Hello DH community, I am new to the field. I’ve already had my own ETL project. Postgres-> Minio -> Postgres (as a data warehouse). Everything runs on Airflow and Docker (locally) Now, I am trying to integrate Datahub to my project. Are there any examples on how to set up Dockerfile and docker-compose? I tried to follow https://datahubproject.io/docs/lineage/airflow but still no avail 😭 this is my original docker compose https://github.com/marlovobook/data-pipeline-postgres/blob/main/docker-compose.yml Sorry if I have texted the wrong channel.
    😭 1
    • 1
    • 1
  • n

    numerous-optician-26217

    06/30/2023, 2:43 PM
    👋 Hello, team! I'm new to datahub. May I ask a question: I want to get owner of dashboard , get list of dashboard, chart by using python. so how can I do this?
    a
    • 2
    • 1
  • s

    some-flower-21264

    06/30/2023, 3:09 PM
    Hello All, I am new to Datahub and I need to upgrade datahub from 0.8 to 0.10 and create implementation plan for it. Could you please help me? Thanks in advance.
  • c

    clever-author-65853

    07/05/2023, 10:49 AM
    Does anyone knows if the chrome extension work with Tableau server? (not the saas)
    b
    • 2
    • 6
  • b

    bright-receptionist-94235

    07/18/2023, 1:56 PM
    When next version will be released?
    l
    • 2
    • 2
  • m

    mysterious-application-34432

    07/19/2023, 4:48 PM
    Tomorrow, Olga Silyutina from Sumsub will show you how Sumsub uses a schema-agnostic approach to transform different event types with ClickHouse materialized views into a flattened form that’s convenient for analysis. Tune in at 10 AM CET, Thursday 20th to join the live discussion on Zoom. Get your free ticket now: https://hubs.la/Q01Ws9SW0
  • m

    mysterious-application-34432

    07/25/2023, 5:26 PM
    Hi! As many of you have noticed ClickHouse and Kubernetes work great together. It’s easy to stand up toy applications, but what about building an entire analytic service based on ClickHouse? This coming Thursday Robert will show you how to build the full stack using Kubernetes, ArgoCD, and open source software. There’s even a GitHub project under Apache 2.0 license with the code to do it yourself. Please join us to learn more: https://hubs.la/Q01WsbS_0
  • b

    better-waiter-35215

    07/26/2023, 1:57 PM
    i am totally new at datahub so apologies for the broad discovery question. is someone working on providing ingestion from Neo4j
  • b

    best-processor-55405

    07/27/2023, 1:53 PM
    Hello Everyone, In DataHub I am not able to see any option to subscribe for specific table. So that, if there is any change happen on that table. I should get notification for the changes. Is that feature exists in DataHub, any idea?
    d
    e
    • 3
    • 2
  • e

    elegant-salesmen-99143

    07/31/2023, 6:12 PM
    Hi all. I think 0.10.5 release was said to be ready around last week or so? What is ETA for it now?
    g
    w
    • 3
    • 3
  • c

    clever-author-65853

    08/02/2023, 11:08 AM
    Hey Community… question, say I want to calculate the success rate of search results - like how many of the searches users preform, where actually ended up in a page view event …. What is the best way to do so in the Datahub usage events?
    s
    • 2
    • 1
  • b

    bland-appointment-45659

    08/02/2023, 6:47 PM
    hi, Are we expected to have profiling stats on views fetched through ingestion ? If not, is there a way to disable the stats tab for view datasets ? Any pointers ?
  • e

    elegant-student-62491

    08/03/2023, 1:23 PM
    Greetings to everyone! I hope I'm posting in the appropriate channel. The company I work for has developed an MVP ingestion module that caters to both tabular and multidimensional SSAS. We are considering contributing it to Datahub, but I have a couple of questions about the process. Firstly, do you think it would be better to create this module as a standalone, focusing solely on SSAS, or should I integrate it into the existing Mssql module? Secondly, I'm curious about whether I should follow the RFC (Request for Comments) process when contributing this module. Your insights would be greatly appreciated!
    f
    m
    • 3
    • 3
  • p

    purple-refrigerator-27989

    08/13/2023, 1:07 PM
    Hey guys!I would like to know which data sources are supported by datahub for automatic lineage extraction?😄
  • e

    early-airline-54682

    08/14/2023, 5:17 PM
    Hey team, I saw that Rest.li is no longer being actively developed by LinkedIn as they migrate to using gRPC. As DataHub currently uses rest.li, are there any current plans to peel away from it as well, or will it continue to use rest.li for the foreseeable future? Thanks
  • l

    little-megabyte-1074

    08/16/2023, 12:12 AM
    Cross-posting this thread here to see if we can get additional perspectives — what dev skills have you found to be critical to your success in implementing DataHub?
    l
    f
    p
    • 4
    • 3
  • l

    lemon-lock-89160

    08/23/2023, 3:42 PM
    Anyone know why the demo site https://demo.datahubproject.io/ does not work? Front page says plenty of data content and platform but nothings shows up when navigating around the site.
    b
    • 2
    • 1
  • r

    ripe-van-7700

    08/25/2023, 12:50 PM
    Hi all, do you know if DataHub has (or plans to have) a solution to generate business descriptions for datasets based on the SQL query by converting sql query to natural language? thx
    d
    g
    • 3
    • 2
  • t

    tall-gigabyte-99212

    08/28/2023, 2:11 AM
    Hi, I have a problem: I am trying to read the datahub-project/datahub code, but it fails when running./gradlew metadata servicewarbuild The error message is as follows: "> Task metadata servicerestli-client:mainCopyPdscSchemas SKIPPED metadata servicerestli-client:mainCopyPdscSchemas task is a NO-OP task. <=------------> 9% EXECUTING [25s]
    Task li utilsgenerateDataTemplate FAILEDerties
    There are 34 data schema input files. Using input root folder: D:\Source Code\datahub-0.10.5\li-utils\src\main\pegasus FAILURE: Build failed with an exception. * What went wrong: Execution failed for task 'li utilsgenerateDataTemplate'.
    'other' has different root"
    I would like to ask everyone, what should I do?
    d
    • 2
    • 1
1234567Latest