Hi, Team: I need your assistance for power BI inge...
# integrate-powerbi-datahub
s
Hi, Team: I need your assistance for power BI ingestion with Datasets that are related to the report. We have established the pipeline with "include Lineage" function on UI. We also have registered permissions on the power bi side with Admin API admin settings - Microsoft Fabric We're expecting to see the data lineage for the dataset but it appears only the name of power bi report itself. We also checked if the API account is in the workspace in Power BI but it is in there. Would you please verify and give me some advise how we can fix it?
a
I just checked ours and we are seeing the same issue. We have PowerBI being fed data from our Databricks UC but Datahub only shows the lineage up to the Query in PowerBI and doesn't connect to the source tables. I guess i could manually add them but kinda defeats the purpose of automated data cataloging.
s
@adorable-salesclerk-90917 Thank you for your info. It's very similar situation. In our case, we're ingesting the data from our redshift to power BI. @gentle-hamburger-31302 @gray-shoe-75895 If you could give us some insights for these situation, that would be great.
Data Ingestion log says the following 2 notifications:
Copy code
reports users would not get ingested as admin permission is not enabled on configured Azure AD Application
Dataset lineage can not be ingestion because this user does not have access to the PowerBI Admin API.
It seems that the user principal does not have enough privileges, but even if I give the user principal API access permission, administrator role, etc., the consequence does not change. Please kindly advise how to fix it. Thanks!
f
we see the same on our setup, lineage works between reports, pages and tables, but not to anything else.
a
Just dug into Help and saw the following:
Copy code
PowerBI Source supports M-Query expression for below listed PowerBI Data Sources
Snowflake
Oracle
PostgreSQL
Microsoft SQL Server
Google BigQuery
Native SQL query parsing is supported for Snowflake and Amazon Redshift data-sources.
f
Yep, aware of that, perhaps our source being Azure Synapse is an issue, although in Power BI its the same connector
👍 1
a
We have other dashboards that are powered from Neo4j that we are already planning to manually register so we it looks like we will have to do that for everything
g
@gentle-hamburger-31302 could you look into this? How can we make it easy for folks to register/contribute additional powerbi data source types beyond the ones we've built out ourselves?
g
One way is to document where to add things in code to get additional sources working
👍 1
s
@gentle-hamburger-31302 @gray-shoe-75895 Thank you for the message. Is there any plan to improve the power BI connector that can extract the data lineage into Datahub? We would like to eliminate the manual work as much as possible, so I would like to know. Thank you!
a
We are going to create a python script to hit the REST API on the same schedule as our powerbi ingestion run to 'manually' connect the pbi product to source from the m_query. Part of our work flow will be for the data analyst to inform data engineers of source and name of dashboard then they will add the script to etl..kinda a workaround but should work ok
s
Thank you. Understood your solution and that make sense for me. Seems like this is the most feasible solution at this moment.
a
ill share the code when we have it working if you are interested
s
Yes, I'm so interested in your proposal. Could you please share?? Thank you so much!!
Hi, Sean, Sorry for pushing you. Do you have any updates? Thank you so much!
a
@stocky-helmet-13192 are you hosting datahub locally or through acryl?
s
@adorable-salesclerk-90917 Thank you for your response. We're hosting datahub locally.\
I found out below article. In my case, Power bi connects with Redshift to create a BI report. Does the below article state that we cannot extract column level lineage? Below is the error message we got when it is activate.
Failed to configure the source (powerbi): 2 validation errors for PowerBiDashboardSourceConfig
enable_advance_lineage_sql_construct
extra fields not permitted (type=value_error.extra)
extract_column_level_lineage
extra fields not permitted (type=value_error.extra)
a
here is a python script of helper function one of my dev wrote when we were messing the with local datahub. Hope this helps. We haven't connected out lineage yet as we have other priorities for our next few sprints.
s
@adorable-salesclerk-90917 Thank you for sharing the code!! I appreciate a lot
g
@stocky-helmet-13192 the error about "extra field not permitted" likely means that your need to update your version of acryl-datahub