quaint-barista-82836
01/23/2023, 5:52 PM'[2023-01-23 17:42:42,108] WARNING  {py.warnings:109} - '
           '/tmp/datahub/ingest/venv-bigquery-0.9.6/lib/python3.10/site-packages/datahub/ingestion/source/bigquery_v2/bigquery.py:937: '
           'DeprecationWarning: Call to deprecated function (or staticmethod) wrap_aspect_as_workunit. (use '
           'MetadataChangeProposalWrapper(...).as_workunit() instead)\n'
           '  wu = wrap_aspect_as_workunit(\n'
           '\n'
           '[2023-01-23 17:42:42,110] WARNING  {py.warnings:109} - '
           '/tmp/datahub/ingest/venv-bigquery-0.9.6/lib/python3.10/site-packages/datahub/ingestion/source/bigquery_v2/bigquery.py:957: '
           'DeprecationWarning: Call to deprecated function (or staticmethod) wrap_aspect_as_workunit. (use '
           'MetadataChangeProposalWrapper(...).as_workunit() instead)\n'
           '  wu = wrap_aspect_as_workunit("dataset", dataset_urn, "subTypes", subTypes)\n'
           '\n'
           '[2023-01-23 17:42:42,190] DEBUG    {datahub.emitter.rest_emitter:250} - Attempting to emit to DataHub GMS; using curl equivalent to:\n',
           '2023-01-23 17:42:42.336687 [exec_id=96401624-f6b0-46e7-98c9-836345181165] INFO: Caught exception EXECUTING '
           'task_id=96401624-f6b0-46e7-98c9-836345181165, name=RUN_INGEST, stacktrace=Traceback (most recent call last):\n'
           '  File "/usr/local/lib/python3.10/asyncio/streams.py", line 525, in readline\n'
           '    line = await self.readuntil(sep)\n'
           '  File "/usr/local/lib/python3.10/asyncio/streams.py", line 620, in readuntil\n'
           '    raise exceptions.LimitOverrunError(\n'
           'asyncio.exceptions.LimitOverrunError: Separator is found, but chunk is longer than limit\n'
           '\n'
           'During handling of the above exception, another exception occurred:\n'
           '\n'
           'Traceback (most recent call last):\n'
           '  File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/default_executor.py", line 123, in execute_task\n'
           '    task_event_loop.run_until_complete(task_future)\n'
           '  File "/usr/local/lib/python3.10/asyncio/base_events.py", line 646, in run_until_complete\n'
           '    return future.result()\n'
           '  File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 147, in execute\n'
           '    await tasks.gather(_read_output_lines(), _report_progress(), _process_waiter())\n'
           '  File "/usr/local/lib/python3.10/site-packages/acryl/executor/execution/sub_process_ingestion_task.py", line 99, in _read_output_lines\n'
           '    line_bytes = await ingest_process.stdout.readline()\n'
           '  File "/usr/local/lib/python3.10/asyncio/streams.py", line 534, in readline\n'
           '    raise ValueError(e.args[0])\n'
           'ValueError: Separator is found, but chunk is longer than limit\n']}
Execution finished with errors.cool-fireman-87485
01/23/2023, 5:58 PMquaint-barista-82836
01/23/2023, 10:00 PMDoes your service account has bigquery.tables.list, bigquery.routines.get, bigquery.routines.list permission, bigquery.tables.getData permission? The error was: 'type'
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO - Traceback (most recent call last):
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -   File "/tmp/venv45wzxte5/lib/python3.8/site-packages/datahub/ingestion/source/bigquery_v2/bigquery.py", line 587, in _process_project
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -     yield from self._process_schema(conn, project_id, bigquery_dataset)
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -   File "/tmp/venv45wzxte5/lib/python3.8/site-packages/datahub/ingestion/source/bigquery_v2/bigquery.py", line 702, in _process_schema
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -     yield from self._process_table(conn, table, project_id, dataset_name)
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -   File "/tmp/venv45wzxte5/lib/python3.8/site-packages/datahub/ingestion/source/bigquery_v2/bigquery.py", line 735, in _process_table
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -     yield from self.gen_table_dataset_workunits(table, project_id, schema_name)
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -   File "/tmp/venv45wzxte5/lib/python3.8/site-packages/datahub/ingestion/source/bigquery_v2/bigquery.py", line 774, in gen_table_dataset_workunits
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -     custom_properties["time_partitioning"] = str(table.time_partitioning)
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -   File "/tmp/venv45wzxte5/lib/python3.8/site-packages/google/cloud/bigquery/table.py", line 2689, in __repr__
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -     key_vals = ["{}={}".format(key, val) for key, val in self._key()]
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -   File "/tmp/venv45wzxte5/lib/python3.8/site-packages/google/cloud/bigquery/table.py", line 2665, in _key
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO -     properties["type_"] = repr(properties.pop("type"))
[2023-01-23, 21:55:49 UTC] {process_utils.py:168} INFO - KeyError: 'type'limited-library-89060
01/24/2023, 2:26 AMDatasource test_datasource is not present in platform_instance_map
argument of type 'NoneType' is not iterableexpect_table_row_count_to_equalflat-table-17463
01/24/2023, 6:54 AMgray-ocean-32209
01/24/2023, 7:18 AMSorry, you are not authorized to access this page.on all assets after upgrading to 0.9.5. all content appears to be unaccessible with a “Unauthorized” message. Even the admin user is not able to access any entities. We use OICD for authentication when we try look at policies on the
<datahub-url>/policiesUnauthorized to perform this action. Please contact your DataHub administrator. (code 403)bland-balloon-48379
01/24/2023, 4:53 PMable-evening-90828
01/24/2023, 10:43 PMquery childGlossaryTerms {
  searchAcrossEntities(input: {
    types: [GLOSSARY_TERM], 
    query: "",
    orFilters: {
      and: {
        field: "parentNodes",
        values: ["urn:li:glossaryNode:data-type"],
      }
    }
  }) {
    searchResults {
      entity {
        urn
        type
      }
    }
  }
}best-wire-59738
01/25/2023, 6:00 AMaverage-dinner-25106
01/25/2023, 7:07 AMbrief-ability-41819
01/25/2023, 10:31 AMcurl -X 'GET' '<https://DATAHUB_URL/openapi/entities/v1/latest?urns=MY_URN>' -H 'accept: application/json' --header 'Authorization: Bearer MY_TOKEN | jqdatahub --debug get --urn "urn:li:dataset:(MY_URN)" --aspect ownership404 Client Error: Not Found for url: <https://DATAHUB_URL/openapi/entitiesV2/MY_URN?aspects=List(ownership)>/entities/v1/entities/v2elegant-salesmen-99143
01/25/2023, 3:37 PMsink:
    type: datahub-rest
    config:
        server: '***'
source:
    type: hive
    config:
        host_port: '***:10000'
        env: PROD
        username: ***
        include_tables: true
        include_views: true
        stateful_ingestion:
            enabled: true
            remove_stale_metadata: true
transformers:
    -
        type: set_dataset_browse_path
        config:
            replace_existing: true
            path_templates:
                - /ENV/PLATFORM/DATASET_PARTS
pipeline_name: 'urn:li:dataHubIngestionSource:***'acceptable-restaurant-2734
01/25/2023, 7:51 PMhelpful-fish-88957
01/25/2023, 8:22 PMUnable to run quickstart - the following issues were detected:
- kafka-setup container is not presentfaint-hair-91313
01/26/2023, 8:17 AMearly-student-2446
01/26/2023, 10:28 AMerror: unknown object type *v1beta1.CronJobServer Version: <http://version.Info|version.Info>{Major:"1", Minor:"18", GitVersion:"v1.18.14", GitCommit:"89182bdd065fbcaffefec691908a739d161efc03", GitTreeState:"clean", BuildDate:"2020-12-18T12:02:35Z", GoVersion:"go1.13.15", Compiler:"gc", Platform:"linux/amd64"}echoing-needle-51090
01/26/2023, 1:48 PMancient-kite-60433
01/26/2023, 2:12 PMOops, an error occurred. This exception has been logged with id xxxxxxxxdatahub docker checkdocker logs datahub-frontend-reactplay.api.UnexpectedException: Unexpected exception[ServerResultException: HTTP 1.0 client does not support chunked response]
        at play.api.http.HttpErrorHandlerExceptions$.throwableToUsefulException(HttpErrorHandler.scala:358)
        at play.api.http.DefaultHttpErrorHandler.onServerError(HttpErrorHandler.scala:264)
        at play.core.server.common.ServerResultUtils.validateResult(ServerResultUtils.scala:69)
        at play.core.server.akkahttp.AkkaModelConversion.$anonfun$convertResult$1(AkkaModelConversion.scala:193)
        at play.core.server.common.ServerResultUtils.resultConversionWithErrorHandling(ServerResultUtils.scala:195)
        at play.core.server.akkahttp.AkkaModelConversion.convertResult(AkkaModelConversion.scala:215)
        at play.core.server.AkkaHttpServer.$anonfun$runAction$5(AkkaHttpServer.scala:440)
        at scala.concurrent.Future.$anonfun$flatMap$1(Future.scala:307)
        at scala.concurrent.impl.Promise.$anonfun$transformWith$1(Promise.scala:41)
        at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:64)
        at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:63)
        at akka.dispatch.BatchingExecutor$BlockableBatch.$anonfun$run$1(BatchingExecutor.scala:100)
        at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
        at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:85)
        at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:100)
        at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:49)
        at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:48)
        at java.base/java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290)
        at java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020)
        at java.base/java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656)
        at java.base/java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594)
        at java.base/java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183)
Caused by: play.core.server.common.ServerResultException: HTTP 1.0 client does not support chunked response
        at play.core.server.common.ServerResultUtils.validateResult(ServerResultUtils.scala:68)
        ... 19 common frames omitted
2023-01-26 13:44:29,799 [application-akka.actor.default-dispatcher-19] ERROR p.api.http.DefaultHttpErrorHandler -
! @80d92mm8g - Internal server error, for (GET) [/] ->
play.api.UnexpectedException: Unexpected exception[ServerResultException: HTTP 1.0 client does not support chunked response]
        at play.api.http.HttpErrorHandlerExceptions$.throwableToUsefulException(HttpErrorHandler.scala:358)
        at play.api.http.DefaultHttpErrorHandler.onServerError(HttpErrorHandler.scala:264)
        at play.core.server.common.ServerResultUtils.validateResult(ServerResultUtils.scala:69)
        at play.core.server.akkahttp.AkkaModelConversion.$anonfun$convertResult$1(AkkaModelConversion.scala:193)
        at play.core.server.common.ServerResultUtils.resultConversionWithErrorHandling(ServerResultUtils.scala:195)
        at play.core.server.akkahttp.AkkaModelConversion.convertResult(AkkaModelConversion.scala:215)
        at play.core.server.AkkaHttpServer.$anonfun$runAction$5(AkkaHttpServer.scala:440)
        at scala.concurrent.Future.$anonfun$flatMap$1(Future.scala:307)
        at scala.concurrent.impl.Promise.$anonfun$transformWith$1(Promise.scala:41)
        at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:64)
        at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:63)
        at akka.dispatch.BatchingExecutor$BlockableBatch.$anonfun$run$1(BatchingExecutor.scala:100)
        at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
        at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:85)
        at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:100)
        at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:49)
        at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:48)
        at java.base/java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:290)
        at java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1020)
        at java.base/java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1656)
        at java.base/java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1594)
        at java.base/java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:183)
Caused by: play.core.server.common.ServerResultException: HTTP 1.0 client does not support chunked response
        at play.core.server.common.ServerResultUtils.validateResult(ServerResultUtils.scala:68)
        ... 19 common frames omitted
2023-01-26 13:44:29,863 [application-akka.actor.default-dispatcher-19] ERROR p.api.http.DefaultHttpErrorHandler -
! @80d92mmb1 - Internal server error, for (GET) [/favicon.ico] ->bland-orange-13353
01/26/2023, 2:13 PMpython3 -m pip install --upgrade pip wheel setuptools
python3 -m pip install --upgrade acryl-datahub
datahub versionrhythmic-quill-75064
01/26/2023, 2:33 PM2023/01/26 14:22:48 Waiting for: <http://elasticsearch-master:9200>
Going to use protocol: http
Going to use default elastic headers
Create datahub_usage_event if needed against Elasticsearch at elasticsearch-master:9200
Going to use index prefix::
2023/01/26 14:22:48 Received 200 from <http://elasticsearch-master:9200>
Policy GET response code is
Got response code  while creating policy so exiting.
curl: option -k <http://elasticsearch-master:9200/_ilm/policy/datahub_usage_event_policy>: is unknown
curl: try 'curl --help' or 'curl --manual' for more information
/create-indices.sh: line 41: [: -eq: unary operator expected
/create-indices.sh: line 45: [: -eq: unary operator expected
/create-indices.sh: line 47: [: -eq: unary operator expected
2023/01/26 14:22:48 Command exited with error: exit status 1aloof-father-61672
01/26/2023, 2:47 PMdatasetdatahub.cli.cli_utils.get_urns_by_filter/entities?action=search{
  "input": "*",
  "entity": "dataflow",
  "start": 0,
  "count": 100,
  "filter": {
    "or": [
      {
        "and": [
          {
            "field": "origin",
            "value": "DEV",
            "condition": "EQUAL"
          },
          {
            "field": "platform",
            "value": "urn:li:dataPlatform:my-platform",
            "condition": "EQUAL"
          }
        ]
      }
    ]
  }
}{
  "value": {
    "numEntities": 0,
    "pageSize": 100,
    "from": 0,
    "metadata": {
      "aggregations": [
        {
          "name": "origin",
          "filterValues": [],
          "aggregations": {},
          "displayName": "origin"
        },
        {
          "name": "platform",
          "filterValues": [],
          "aggregations": {},
          "displayName": "Platform"
        }
      ]
    },
    "entities": []
  }
}quick-pizza-8906
01/26/2023, 5:29 PMRemote end closed connection without responsetableauserverclient0.19.00.23.40.19.0nutritious-bird-77396
01/26/2023, 5:42 PM0.8.430.9.6.117:30:57 [main] INFO  c.l.m.s.e.i.ESIndexBuilder - Reindexing dataset_operationaspect_v1 to dataset_operationaspect_v1_1674751305780 task has completed, will now check if reindex was successful
17:31:00 [main] INFO  c.l.m.s.e.i.ESIndexBuilder - Post-reindex document count is different, source_doc_count: 34822915 reindex_doc_count: 15463000
17:31:00 [main] WARN  o.s.w.c.s.XmlWebApplicationContext - Exception encountered during context initialization - cancelling refresh attempt: org.springframework.beans.factory.UnsatisfiedDependencyException: Error creating bean with name 'metadataChangeLogProcessor' defined in URL [jar:file:/tmp/jetty-0_0_0_0-8080-war_war-_-any-3785592998662924994/webapp/WEB-INF/lib/mae-consumer.jar!/com/linkedin/metadata/kafka/MetadataChangeLogProcessor.class]: Unsatisfied dependency expressed through constructor parameter 0; nested exception is org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'updateIndicesHook' defined in URL [jar:file:/tmp/jetty-0_0_0_0-8080-war_war-_-any-3785592998662924994/webapp/WEB-INF/lib/mae-consumer.jar!/com/linkedin/metadata/kafka/hook/UpdateIndicesHook.class]: Bean instantiation via constructor failed; nested exception is org.springframework.beans.BeanInstantiationException: Failed to instantiate [com.linkedin.metadata.kafka.hook.UpdateIndicesHook]: Constructor threw exception; nested exception is java.lang.RuntimeException: Reindex from dataset_operationaspect_v1 to dataset_operationaspect_v1_1674751305780 failed
17:31:00 [main] INFO  c.l.r.t.h.c.c.AbstractNettyClient - Shutdown requested
17:31:00 [main] INFO  c.l.r.t.h.c.c.AbstractNettyClient - Shutting downable-evening-90828
01/26/2023, 11:27 PMandFilterorFiltersSearchInputcol1: [tagA, tagB]
col2: [tagA]col2query searchDataset {
  search(input: {
    type: DATASET, 
    query: "", 
    start: 0, 
    count: 1000,
    orFilters: [
      {
        and: [
          {
            field: "fieldTags",
            values: ["urn:li:tag:tagA"]
            condition: CONTAIN
          }
          {
            field: "fieldTags",
            values: ["urn:li:tag:tagB"]
            condition: CONTAIN
            negated: true
          }
        ]
      }
    ]
  }) {
    start
    count
    total
    searchResults {
      entity {
        urn
        type
      }
    }
  
}
}bland-orange-13353
01/27/2023, 12:57 AMpython3 -m pip install --upgrade pip wheel setuptools
python3 -m pip install --upgrade acryl-datahub
datahub versionrhythmic-glass-37647
01/27/2023, 1:28 AMPipelineInitErrorbrief-ability-41819
01/27/2023, 6:50 AMhelm dep updatebest-wire-59738
01/27/2023, 7:04 AMacceptable-terabyte-34789
01/27/2023, 7:13 AMTraceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/requests/models.py", line 971, in json
    return complexjson.loads(self.text, **kwargs)
  File "/usr/local/Cellar/python@3.10/3.10.8/Frameworks/Python.framework/Versions/3.10/lib/python3.10/json/__init__.py", line 346, in loads
    return _default_decoder.decode(s)
  File "/usr/local/Cellar/python@3.10/3.10.8/Frameworks/Python.framework/Versions/3.10/lib/python3.10/json/decoder.py", line 337, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
  File "/usr/local/Cellar/python@3.10/3.10.8/Frameworks/Python.framework/Versions/3.10/lib/python3.10/json/decoder.py", line 355, in raw_decode
    raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)gray-ocean-32209
01/27/2023, 1:27 PM[lineage]
backend = datahub_provider.lineage.datahub.DatahubLineageBackend
datahub_kwargs = {
    "datahub_conn_id": "datahub_rest_default",
    "cluster": "local_airflow",
    "capture_ownership_info": true,
    "capture_tags_info": true,
    "capture_executions": true,
    "graceful_exceptions": true }"capture_executions": truedocker-compose up airflow-init....
datahub-airflow-airflow-init-1  |     _backend = get_backend()
datahub-airflow-airflow-init-1  |   File "/home/airflow/.local/lib/python3.9/site-packages/airflow/lineage/__init__.py", line 61, in get_backend
datahub-airflow-airflow-init-1  |     return clazz()
datahub-airflow-airflow-init-1  |   File "/home/airflow/.local/lib/python3.9/site-packages/datahub_provider/lineage/datahub.py", line 64, in __init__
datahub-airflow-airflow-init-1  |     _ = get_lineage_config()
datahub-airflow-airflow-init-1  |   File "/home/airflow/.local/lib/python3.9/site-packages/datahub_provider/lineage/datahub.py", line 35, in get_lineage_config
datahub-airflow-airflow-init-1  |     return DatahubLineageConfig.parse_obj(kwargs)
datahub-airflow-airflow-init-1  |   File "pydantic/main.py", line 511, in pydantic.main.BaseModel.parse_obj
datahub-airflow-airflow-init-1  |   File "pydantic/main.py", line 331, in pydantic.main.BaseModel.__init__
datahub-airflow-airflow-init-1  | pydantic.error_wrappers.ValidationError: 1 validation error for DatahubLineageConfig
datahub-airflow-airflow-init-1  | capture_executionsacryldata/airflow-datahub:latest