dazzling-book-76108
06/08/2021, 6:54 PM./gradlew clean build
then ./docker/dev.sh
and everything sounds good.
But when I try to create a new "Service entity" I got this error:
{"exceptionClass":"com.linkedin.restli.server.RestLiServiceException","stackTrace":"com.linkedin.restli.server.RestLiServiceException [HTTP Status:400]: Parameters of method 'ingest' failed validation with error 'ERROR :: /entity/value :: \"com.linkedin.metadata.snapshot.ServiceSnapshot\" is not a member type of union [ { \"type\" : \"record\", \"name\" : \"ChartSnapshot\", \"namespace\" : \"com.linkedin.metadata.snapshot\", \"doc\" : \"A metadata snapshot for a specific Chart entity.\", \"fields\" : [ { \"name\" : \"urn\", \"type\" : { \"type\" : \"typeref\", \"name\" : \"ChartUrn\", \"namespace\" : \"com.linkedin.common\", \"doc\" : \"Standardized chart identifier\", \"ref\" : \"string\", \"java\" : { \"class\" : \"com.linkedin.common.urn.ChartUrn\" }, \"validate\" : { \"com.linkedin.common.validator.TypedUrnValidator\" : { \"accessible\" : true, \"constructable\" : true, \"doc\" : \"Standardized chart identifier\", \"entityType\" : \"chart\", \"fields\" : [ { \"doc\" : \"The name of the dashboard tool such as looker, redash etc.\", \"maxLength\" : 20, \"name\" : \"dashboardTool\", \"type\" : \"string\" }, { \"doc\" : \"Unique id for the chart. This id should be globally unique for a dashboarding tool even when there are multiple deployments of it. As an example, chart URL could be used here for Looker such as '<http://looker.linkedin.com/looks/1234|looker.linkedin.com/looks/1234>'\", \"maxLength\" : 200, \"name\" : \"chartId\", \"type\" : \"string\" } ], \"maxLength\" : 236, \"name\" : \"Chart\", \"namespace\" : \"li\", \"owners\" : [ \"urn:li:corpuser:fbar\", \"urn:li:corpuser:bfoo\" ], \"owningTeam\" : \"urn:li:internalTeam:datahub\" } } }, \"doc\" : \"URN for the entity the metadata snapshot is associated with.\" }, { \"name\" : \"aspects\", \"type\" : { \"type\" : \"array\", \"items\" : { \"type\" : \"typeref\", \"name\" : \"ChartAspect\", \"namespace\" : \"com.linkedin.metadata.aspect\", \"doc\" : \"A union of all supported metadata aspects for a Chart\", \"ref\" : [ { \"type\" : \"record\", \"name\" : \"ChartKey\", \"namespace\" : \"com.linkedin.metadata.key\", \"doc\" : \"Key for a Chart\", \"fields\" : [ { \"name\" : \"dashboardTool\", \"type\" : \"string\", \"doc\" : \"The name of the dashboard tool such as looker, redash etc.\", \"Searchable\" : { \"addToFilters\" : true, \"boostScore\" : 4.0, \"fieldName\" : \"tool\", \"fieldType\" : \"TEXT_PARTIAL\" } }, { \"name\" : \"chartId\", \"type\" : \"string\", \"doc\" : \"Unique id for the chart. This id should be globally unique for a dashboarding tool even when there are multiple deployments of it. As an example, chart URL could be used here for Looker such as '<http://looker.linkedin.com/looks/1234|looker.linkedin.com/looks/1234>'\" } ], \"Aspect\" : { \"name\" : \"chartKey\" } }, { \"type\" : \"record\", \"name\" : \"ChartInfo\", \"namespace\" : \"com.linkedin.chart\", \"doc\" : \"Information about a chart\", \"include\" : [ { \"type\" : \"record\", \"name\" : \"CustomProperties\", \"namespace\" : \"com.linkedin.common\", \"doc\" : \"Misc. properties about an entity.\",
[...]
I noticed that ServiceSnapshot.pdl
(described in the docs) is not in the master at /metadata/snapshot/
(link).
Also Snapshot.pdl
is not up-to-date containing ServiceSnapshot.pdl
inside union
(link).
Any ideas? Am I forgetting something?early-lamp-41924
06/08/2021, 7:00 PMbig-carpet-38439
06/08/2021, 7:10 PMdazzling-book-76108
06/08/2021, 7:38 PMbig-carpet-38439
06/08/2021, 9:11 PMbig-carpet-38439
06/08/2021, 9:11 PMdazzling-book-76108
06/09/2021, 1:38 PM[...]
+ mypy src/ tests/
tests/unit/test_packaging.py:8: error: Library stubs not installed for "pkg_resources" (or incompatible with Python 3.8)
tests/unit/test_packaging.py:8: note: Hint: "python3 -m pip install types-pkg_resources"
src/datahub/metadata/schema_classes.py:7: error: Library stubs not installed for "six" (or incompatible with Python 3.8)
src/datahub/metadata/schema_classes.py:7: note: Hint: "python3 -m pip install types-six"
src/datahub/ingestion/source/superset.py:5: error: Library stubs not installed for "dateutil.parser" (or incompatible with Python 3.8)
src/datahub/ingestion/source/superset.py:5: note: Hint: "python3 -m pip install types-python-dateutil"
src/datahub/ingestion/source/superset.py:5: error: Library stubs not installed for "dateutil" (or incompatible with Python 3.8)
src/datahub/ingestion/source/superset.py:6: error: Library stubs not installed for "requests" (or incompatible with Python 3.8)
src/datahub/ingestion/source/kafka_connect.py:7: error: Library stubs not installed for "requests" (or incompatible with Python 3.8)
src/datahub/emitter/rest_emitter.py:8: error: Library stubs not installed for "requests" (or incompatible with Python 3.8)
src/datahub/emitter/rest_emitter.py:8: note: Hint: "python3 -m pip install types-requests"
src/datahub/emitter/rest_emitter.py:9: error: Library stubs not installed for "requests.exceptions" (or incompatible with Python 3.8)
src/datahub/configuration/toml.py:3: error: Library stubs not installed for "toml" (or incompatible with Python 3.8)
src/datahub/configuration/toml.py:3: note: Hint: "python3 -m pip install types-toml"
tests/unit/test_rest_sink.py:4: error: Library stubs not installed for "requests" (or incompatible with Python 3.8)
src/datahub/ingestion/source/mysql.py:2: error: Library stubs not installed for "pymysql" (or incompatible with Python 3.8)
src/datahub/ingestion/source/mysql.py:2: note: Hint: "python3 -m pip install types-PyMySQL"
src/datahub/configuration/yaml.py:3: error: Library stubs not installed for "yaml" (or incompatible with Python 3.8)
src/datahub/configuration/yaml.py:3: note: Hint: "python3 -m pip install types-PyYAML"
tests/unit/test_glue_source.py:5: error: Library stubs not installed for "freezegun" (or incompatible with Python 3.8)
tests/unit/test_glue_source.py:5: note: Hint: "python3 -m pip install types-freezegun"
src/datahub/ingestion/run/pipeline.py:5: error: Library stubs not installed for "click" (or incompatible with Python 3.8)
src/datahub/check/check_cli.py:3: error: Library stubs not installed for "click" (or incompatible with Python 3.8)
src/datahub/check/check_cli.py:3: note: Hint: "python3 -m pip install types-click"
src/datahub/check/check_cli.py:3: note: (or run "mypy --install-types" to install all missing stub packages)
src/datahub/check/check_cli.py:3: note: See <https://mypy.readthedocs.io/en/stable/running_mypy.html#missing-imports>
src/datahub/check/check_cli.py:12: error: Untyped decorator makes function "check" untyped
src/datahub/check/check_cli.py:17: error: Untyped decorator makes function "mce_file" untyped
src/datahub/check/check_cli.py:18: error: Untyped decorator makes function "mce_file" untyped
src/datahub/check/check_cli.py:26: error: Untyped decorator makes function "local_docker" untyped
src/datahub/check/check_cli.py:40: error: Untyped decorator makes function "plugins" untyped
src/datahub/check/check_cli.py:41: error: Untyped decorator makes function "plugins" untyped
src/datahub_provider/_lineage_core.py:4: error: Library stubs not installed for "dateutil.parser" (or incompatible with Python 3.8)
src/datahub_provider/_lineage_core.py:4: error: Library stubs not installed for "dateutil" (or incompatible with Python 3.8)
src/datahub/entrypoints.py:6: error: Library stubs not installed for "click" (or incompatible with Python 3.8)
src/datahub/entrypoints.py:28: error: Untyped decorator makes function "datahub" untyped
src/datahub/entrypoints.py:29: error: Untyped decorator makes function "datahub" untyped
src/datahub/entrypoints.py:30: error: Untyped decorator makes function "datahub" untyped
src/datahub/entrypoints.py:46: error: Untyped decorator makes function "version" untyped
src/datahub/entrypoints.py:53: error: Untyped decorator makes function "ingest" untyped
src/datahub/entrypoints.py:54: error: Untyped decorator makes function "ingest" untyped
tests/unit/test_plugin_system.py:2: error: Library stubs not installed for "click.testing" (or incompatible with Python 3.8)
tests/unit/test_check.py:1: error: Library stubs not installed for "click.testing" (or incompatible with Python 3.8)
tests/unit/serde/test_serde.py:8: error: Library stubs not installed for "click.testing" (or incompatible with Python 3.8)
tests/integration/sql_server/test_sql_server.py:5: error: Library stubs not installed for "click.testing" (or incompatible with Python 3.8)
tests/integration/mysql/test_mysql.py:2: error: Library stubs not installed for "click.testing" (or incompatible with Python 3.8)
Found 35 errors in 19 files (checked 144 source files)
> Task :metadata-ingestion:lint FAILED
dazzling-book-76108
06/09/2021, 1:38 PM./gradlew :gms:impl:build -Prest.model.compatibility=ignore
and then:
./gradlew build
dazzling-book-76108
06/09/2021, 1:38 PM./docker/dev.sh
Creating mysql ... done
Creating neo4j ... done
Creating elasticsearch ... done
Creating zookeeper ... done
Creating elasticsearch-setup ... done
Creating kibana ... done
Creating broker ... done
Creating schema-registry ... done
Creating kafka-rest-proxy ... done
Creating kafka-setup ... done
Creating schema-registry-ui ... done
Creating datahub-mae-consumer ... done
Creating datahub-gms ... done
Creating kafka-topics-ui ... done
Creating datahub-mce-consumer ... done
Creating datahub-frontend-react ... error
ERROR: for datahub-frontend-react Cannot start service datahub-frontend-react: OCI runtime create failed: container_linux.go:380: starting container process caused: exec: "datahub-frontend/bin/playBinary": stat datahub-frontend/bin/playBinary: no such file or directory: unknown
ERROR: for datahub-frontend-react Cannot start service datahub-frontend-react: OCI runtime create failed: container_linux.go:380: starting container process caused: exec: "datahub-frontend/bin/playBinary": stat datahub-frontend/bin/playBinary: no such file or directory: unknown
ERROR: Encountered errors while bringing up the project.
How can I fix this? 😆big-carpet-38439
06/09/2021, 3:20 PMdazzling-book-76108
06/09/2021, 3:21 PM./gradlew clean
didn't work for me in this case.
After I removed the build
folders by my own everything worked according to No Code Metadata Modeling.
So weird... 😄big-carpet-38439
06/09/2021, 3:26 PMgray-shoe-75895
06/09/2021, 4:38 PMcolossal-account-65055
07/01/2021, 7:23 PM./docker/dev.sh
. What build
folders did you have to remove in order to get this working?colossal-account-65055
07/01/2021, 7:25 PMgit checkout master # up to date with upstream
./gradlew clean build # output says "Task :metadata-ingestion:test FAILED", but otherwise seems OK
datahub docker nuke
./docker/dev.sh
output from dev.sh
with similar error that Matheus mentioned above:
Creating mysql ... done
Creating zookeeper ... done
Creating elasticsearch ... done
Creating neo4j ... done
Recreating elasticsearch-setup ... done
Creating broker ... done
Recreating mysql-setup ... done
Creating schema-registry ... done
Recreating kafka-setup ... done
Creating datahub-gms ... done
Recreating 16d2c085f9f3_datahub-frontend-react ... error
ERROR: for 16d2c085f9f3_datahub-frontend-react Cannot start service datahub-frontend-react: OCI runtime create failed: container_linux.go:367: starting container process caused: exec: "datahub-frontend/bin/playBinary": stat datahub-frontend/bin/playBinary: no such file or directory: unknown
ERROR: for datahub-frontend-react Cannot start service datahub-frontend-react: OCI runtime create failed: container_linux.go:367: starting container process caused: exec: "datahub-frontend/bin/playBinary": stat datahub-frontend/bin/playBinary: no such file or directory: unknown
ERROR: Encountered errors while bringing up the project.
big-carpet-38439
07/01/2021, 8:19 PM./gradlew datahub-frontend:build
and then dev.sh again?big-carpet-38439
07/01/2021, 8:20 PMgray-shoe-75895
07/01/2021, 8:22 PMcolossal-account-65055
07/02/2021, 3:23 PMcolossal-account-65055
07/02/2021, 3:23 PMcolossal-account-65055
07/02/2021, 3:37 PMmaster
branchgray-shoe-75895
07/02/2021, 5:29 PMcolossal-account-65055
07/06/2021, 5:26 PMrm ~/airflow/airflow.cfg
.colossal-account-65055
07/06/2021, 5:29 PM$ docusaurus build
[en] Creating an optimized production build...
Loading of version failed for version "current"
Unable to build website for locale "en".
Error: Invalid sidebar file at "sidebars.js".
These sidebar document ids do not exist:
- releases
Available document ids are:
- README
- datahub-frontend/README
...
gray-shoe-75895
07/06/2021, 11:49 PMcolossal-account-65055
07/07/2021, 2:56 PMcolossal-account-65055
07/07/2021, 2:57 PMtest_package_discovery()
colossal-account-65055
07/07/2021, 2:57 PMcolossal-account-65055
07/07/2021, 2:57 PMgray-shoe-75895
07/07/2021, 5:21 PMfind . -type f -name '*.py[co]' -delete -o -type d -name __pycache__ -delete
will fix that issuecolossal-account-65055
07/07/2021, 8:12 PMcolossal-account-65055
07/07/2021, 8:47 PM./gradlew clean # output is green
find . -type f -name '*.py[co]' -delete -o -type d -name __pycache__ -delete
./gradlew build # Task :docs-website:yarnBuild fails with the same sidebar document ids error I sent before
big-carpet-38439
07/07/2021, 9:19 PMreleases
file?big-carpet-38439
07/07/2021, 9:20 PMreleases.md
under docs-website/genDocs/
colossal-account-65055
07/07/2021, 9:22 PMls docs-website/genDocs
README.md datahub-graphql-core docker metadata-ingestion
datahub-frontend datahub-kubernetes docs metadata-jobs
datahub-gms-graphql-service datahub-web-react gms
colossal-account-65055
07/07/2021, 9:25 PMdocs-website:yarnBuild
step but not related to releases anymore. I'm not 100% sure I am parsing the output right but it looks like maybe the yarnBuild failure is related to an earlier yarnGenerate error that has to do with throttling for a rate limit? Let me send you the whole output:colossal-account-65055
07/07/2021, 9:26 PMgray-shoe-75895
07/08/2021, 1:38 AMcolossal-account-65055
07/08/2021, 1:31 PM:metadata-ingestion:testQuick
step fails, even after running the find
command you sent earlier which initially seemed to fix the problem.gray-shoe-75895
07/08/2021, 7:44 PM*git clean* -*fdx*
from the metadata-ingestion directory to restore it to a “clean checkout”?ambitious-lifeguard-64025
07/08/2021, 9:21 PM--stacktrace
ambitious-lifeguard-64025
07/08/2021, 9:21 PMambitious-lifeguard-64025
07/08/2021, 9:23 PMdocker/dev.sh
also fails not sure if related
👉 docker/dev.sh
[+] Running 5/11
⠿ broker Pulled 0.8s
⠿ elasticsearch-setup Error 0.8s
⠿ elasticsearch Pulled 0.8s
⠿ schema-registry Pulled 0.7s
⠿ mysql-setup Pulled 0.7s
⠿ mysql Error 0.8s
⠿ kafka-setup Error 0.8s
⠿ datahub-gms Error 0.8s
⠿ datahub-frontend-react Error 0.8s
⠿ zookeeper Pulled 0.8s
⠇ neo4j Pulling 0.8s
Error response from daemon: manifest for linkedin/datahub-gms:debug not found: manifest unknown: manifest unknown
ambitious-lifeguard-64025
07/08/2021, 9:26 PM./gradlew build -x check
which succeeds but docker/dev.sh
still fails with the same error as abovegray-shoe-75895
07/08/2021, 11:10 PMgray-shoe-75895
07/08/2021, 11:14 PMtests/unit/test_airflow.py:7: in <module>
import airflow.configuration
venv/lib/python3.8/site-packages/airflow/__init__.py:50: in <module>
from airflow.models import DAG # noqa: E402
venv/lib/python3.8/site-packages/airflow/models/__init__.py:21: in <module>
from airflow.models.baseoperator import BaseOperator, BaseOperatorLink # noqa: F401
venv/lib/python3.8/site-packages/airflow/models/baseoperator.py:43: in <module>
from airflow.models.dag import DAG
venv/lib/python3.8/site-packages/airflow/models/dag.py:47: in <module>
from airflow.executors import LocalExecutor, get_default_executor
venv/lib/python3.8/site-packages/airflow/executors/__init__.py:23: in <module>
from airflow.executors.base_executor import BaseExecutor # noqa
venv/lib/python3.8/site-packages/airflow/executors/base_executor.py:24: in <module>
import airflow.utils.dag_processing
venv/lib/python3.8/site-packages/airflow/utils/dag_processing.py:40: in <module>
from setproctitle import setproctitle
E ImportError: dlopen(/Users/szeng/Documents/Learn/datahub/metadata-ingestion/venv/lib/python3.8/site-packages/setproctitle.cpython-38-darwin.so, 2): Symbol not found: _Py_GetArgcArgv
E Referenced from: /Users/szeng/Documents/Learn/datahub/metadata-ingestion/venv/lib/python3.8/site-packages/setproctitle.cpython-38-darwin.so
E Expected in: flat namespace
E in /Users/szeng/Documents/Learn/datahub/metadata-ingestion/venv/lib/python3.8/site-packages/setproctitle.cpython-38-darwin.so
This seems to be related to this stackoverflow question: https://stackoverflow.com/questions/60684146/airflow-initdb-undefined-symbol-py-getargcargvcolossal-account-65055
07/09/2021, 5:15 PMgray-shoe-75895
07/09/2021, 6:04 PMambitious-lifeguard-64025
07/09/2021, 7:54 PMpyenv
fixed the airflow tests, however i still get manifest unknown error when I run dev.sh
👉 ./dev.sh
[+] Running 2/11
⠿ broker Error 0.8s
⠿ elasticsearch-setup Error 0.8s
⠿ elasticsearch Error 0.8s
⠇ mysql Pulling 0.8s
⠿ datahub-frontend-react Error 0.8s
⠇ neo4j Pulling 0.8s
⠿ mysql-setup Pulled 0.8s
⠿ datahub-gms Error 0.8s
⠿ zookeeper Pulled 0.8s
⠿ schema-registry Error 0.8s
⠿ kafka-setup Error 0.8s
Error response from daemon: manifest for linkedin/datahub-frontend-react:debug not found: manifest unknown: manifest unknown