what is the easiest way to delete all data from da...
# all-things-deployment
r
what is the easiest way to delete all data from datahub?
f
Did you
datahub docker nuke
?
r
not yet, but lets say I want to delete a group of datasets (for e:g everything under the prod group, what would I do?)
f
r
maybe i dont completely understand how urns are interpreted, but if i just want to delete a group of datasets, can i do something like datahub delete -urn "urnlidev:*" ?
b
cc @mammoth-bear-12532 who has been thinking about the delete CLI recently.. Currently you have 2 options: Delete assets by primary key (urn) or by ingestion run (rollback)
f
@big-carpet-38439 Hello, I figured out about deleting by URN or Rollback on my UAT server. It send the rollback request to the Datahub-GMS (default port: 8080). But on my production server I got few services running on the Docker. On port 8080 there is a Airflow running. My datahub-gms is configured on 6080:8080, when the GMS container starts it uses 8080 port on the container. The issue is when i try to rollback or delete by "datahub delete --urn urnliPROD:" or "datahub ingest rollback --run-id 'something'" it gives me the error that some kinda other service is running on 8080. I mean the "datahub delete" or "datahub ingest rollback" commands are directly going into the server's 8080 port.
m
There are a couple of ways of overriding the host and port for the datahub cli • run
datahub init
• Set env variables (
DATAHUB_GMS_PORT
and
DATAHUB_GMS_URL
). Check the docs for the actual names. https://datahubproject.io/docs/cli/