Hello while trying to injest with the datahub kafk...
# ingestion
l
Hello while trying to injest with the datahub kafka sink I keep getting this error
Copy code
datahub.ingestion.run.pipeline.PipelineInitError: Failed to configure the source (bigquery): Missing provider configuration.
This is what the recipe looks like
Copy code
pipeline_name: analytics
source:
    type: bigquery
    config:
        env: DEV
        include_table_lineage: true
        include_usage_statistics: true
        include_tables: true
        include_views: true
        profiling:
            enabled: true
            profile_table_level_only: false
        stateful_ingestion:
            enabled: true
        credential:
            project_id: <redacted>
            private_key: <redacted>
            private_key_id: <redacted>
            client_email: <redacted>
            client_id: <redacted>
sink:
    type: datahub-kafka
    config:
        connection:
            bootstrap: 'datahub-prerequisites-kafka:9092'
            schema_registry_url: '<http://datahub-prerequisites-cp-schema-registry:8081>'
1
Seems like it only happens with the kafka sink
m
Are you trying to ingest from Kafka to Datahub ?
l
No I'm trying to ingest from bigquery, but sink to the DB kafka sink for a higher throughput because the rest endpoint is getting hammered pretty hard, and I don't want to throttle rest API requests
m
did you install the bigquery ingest plugin ? If not, you can run the below command.
Copy code
pip install 'acryl-datahub[bigquery]'
l
That all works already... this error only comes up when I specify datahub-kafka as a sink... And I have also checked that I have the
'acryl-datahub[kafka]'
package installed
looking through the actual source code it seems like the sink section of the recipe is returning None when it's kafka
I'll have to dig a bit deeper to see why
g
Hi, @lemon-scooter-69730 you should disable stateful_ingestion according to https://datahubspace.slack.com/archives/CUMUWQU66/p1667461540670879?thread_ts=1667460445.308079&amp;cid=CUMUWQU66