• b

    blue-crowd-84759

    4 months ago
    Hey all, I'm trying to ingest data from Tableau and BigQuery and so far I'm not able to see any connections between my Tableau datasets and my BQ tables, can someone help me with how/if I can map a published dataset to a BQ table so I can see the complete lineage for a Tableau chart?
    b
    o
    +1
    4 replies
    Copy to Clipboard
  • o

    orange-coat-2879

    4 months ago
    Hello, I tried to ingest data from tableau but got error below. I have installed acryl-datahub[tableau] successfully. What is the real problem? Thanks
    Installing collected packages: tableauserverclient
    Successfully installed tableauserverclient-0.18.0
    ubuntu@ip-172-31-16-11:~$ datahub ingest -c /home/ubuntu/datahub/tableau.yml
    [2022-05-12 02:07:40,616] INFO     {datahub.cli.ingest_cli:96} - DataHub CLI ver                                              sion: 0.8.34.1
    [2022-05-12 02:07:40,722] ERROR    {datahub.entrypoints:165} - tableau is disabl                                              ed; try running: pip install 'acryl-datahub[tableau]'
    [2022-05-12 02:07:40,722] INFO     {datahub.entrypoints:176} - DataHub CLI versi                                              on: 0.8.34.1 at /home/ubuntu/.local/lib/python3.8/site-packages/datahub/__init__                                              .py
    [2022-05-12 02:07:40,722] INFO     {datahub.entrypoints:179} - Python version: 3                                              .8.13 (default, Apr 19 2022, 02:32:06)
    [GCC 11.2.0] at /usr/bin/python3.8 on Linux-5.15.0-1005-aws-x86_64-with-glibc2.3                                              5
    [2022-05-12 02:07:40,722] INFO     {datahub.entrypoints:182} - GMS config {'mode                                              ls': {}, 'versions': {'linkedin/datahub': {'version': 'v0.8.34', 'commit': '5cce                                              3acddcb46443c748bf2eb0b1e5e53994d936'}}, 'managedIngestion': {'defaultCliVersion                                              ': '0.8.34.1', 'enabled': True}, 'statefulIngestionCapable': True, 'supportsImpa                                              ctAnalysis': True, 'telemetry': {'enabledCli': True, 'enabledIngestion': False},                                               'datasetUrnNameCasing': False, 'retention': 'true', 'noCode': 'true'}
    o
    h
    3 replies
    Copy to Clipboard
  • f

    fresh-napkin-5247

    4 months ago
    Hello. Anyway I can get datahub to scrape all the projects on Tableau online, instead of me having to pass a list?
    f
    h
    2 replies
    Copy to Clipboard
  • w

    wonderful-dream-38059

    3 months ago
    Hello team - the docs for the tableau integration say that
    Detect Deleted Entities
    is currently not supported. What does this mean in practice? My reading of the docs makes me think they just persist past deletion, and are never removed. If that is the case has anyone done any design work to allow removal of stale records post deletion? I'd be happy to help contribute if not.
    w
    l
    5 replies
    Copy to Clipboard
  • w

    wonderful-dream-38059

    3 months ago
    Me again 🙂. In testing the tableau connector more, I'm getting a big memory explosion. A large snowflake ingestion job or dbt ingestion job comfortably run in a container with ~1GB of memory. My Tableau ingestion job is still getting OOM Killed at 16GB of memory! Before I go down a big debugging hole - has anyone else seen very very high memory usage when running the tableau ingestion source? For any of the people who wrote the original, any hints on where the issue might be would be very helpful - otherwise I'll start diving into this one myself. (I'll work on this before I pick up any of the deleted entities stuff I mentioned above - I need to get the job to complete before I start upgrading it! 😄 ).
    w
    1 replies
    Copy to Clipboard
  • p

    purple-analyst-83660

    2 months ago
    Hi All, I am trying to ingest metadata corresponding to a project. I get NODE_LIMIT_EXCEEDED error first, when I try to include page_size: 5. I get this error. Can any body help? (Have attached the config yaml that I am using)
    p
    h
    5 replies
    Copy to Clipboard
  • c

    careful-insurance-60247

    1 month ago
    I used to be able to see lineage to my mssql boxes but now I only see tableau datasets. Did this functionality change?
    c
    h
    15 replies
    Copy to Clipboard
  • f

    faint-advantage-18690

    2 months ago
    Hi all, I am trying to get the lineage of one of my workbooks but it seems that the Tableau lineage does is not linked to the BigQuery table even though it uses one as a source. What I expect is : BigQuery table -> Published data source -> Embedded Data Source -> Charts But I get : Published data source -> Embedded Data Source -> Charts
    f
    h
    16 replies
    Copy to Clipboard
  • m

    modern-artist-55754

    3 weeks ago
    I’m facing some issues with the Node Limit exceeded. I noticed a few things: •
    PublishedDatasourcesConnection
    &
    CustomSQLTablesConnection
    doesn’t have
    page_size
    implemented like workbook. https://github.com/datahub-project/datahub/blob/7e15947a372f6f627f29f5a1c783383d49[…]daf6/metadata-ingestion/src/datahub/ingestion/source/tableau.py • The workbooksConnection is little complex ( i have some complex workbook and even with
    page_size
    =1, it still exceed the node limit), I think we can refactor the
    EmbeddedDatasourcesConnection
    to a seperate call like
    PublishedDatasourcesConnection
    (at least it seems to help with my issue, although i still have some issue that i haven’t worked out yet). https://github.com/datahub-project/datahub/blob/7e15947a372f6f627f29f5a1c783383d49[…]tadata-ingestion/src/datahub/ingestion/source/tableau_common.py
    m
    h
    6 replies
    Copy to Clipboard
  • m

    magnificent-lawyer-97772

    1 month ago
    Hi folks, I am not sure whether this is the correct channel, but with some colleagues we are thinking of implementing some improvements to the Tableau connector. Namely, we want to add the Platform Instance to the connector. Our idea would be for the platform instance to represent a Tableau site, so a 1:1 relationship between them. What do folks think?
    m
    s
    +3
    8 replies
    Copy to Clipboard