limited-library-89060
01/11/2023, 9:20 AMworried-branch-76677
01/11/2023, 10:46 AMboundless-nail-65912
01/11/2023, 3:57 PMagreeable-dentist-42022
01/13/2023, 1:43 PMelegant-state-4
01/20/2023, 2:10 PMgentle-camera-33498
01/25/2023, 2:36 PMcurved-planet-99787
01/26/2023, 6:34 AMwitty-butcher-82399
01/30/2023, 7:31 PMfast-barista-4910
02/02/2023, 10:35 AMWITH NO SCHEMA BINDING
in the SQL and the query for lineage building was case sensitive and lower case. I changed the like
to ilike
and got it working normally. PR here: https://github.com/datahub-project/datahub/pull/7223gentle-lifeguard-88494
02/04/2023, 6:01 PM- curl -sS --fail '<http://localhost:9200/_cluster/health?wait_for_status=yellow&timeout=0s>'
|| exit 1
It's just a formatting issue - the '|| exit 1' is on a new line instead of being on the same line , here is the Github issue: https://github.com/datahub-project/datahub/issues/7255adventurous-nightfall-90271
02/13/2023, 12:17 AMgentle-lifeguard-88494
02/15/2023, 11:55 PM@_run_with_query_combiner
def _get_dataset_column_distinct_values(
self, column_profile: DatasetFieldProfileClass, column: str, unique_count: int, nonnull_count: int
) -> None:
if not self.config.include_field_distinct_values or unique_count > 25:
return
try:
# TODO do this without GE
self.dataset.set_config_value("interactive_evaluation", True)
# Check for distinct values in ever larger increments
pct_dataset = [.01,.05,.10,.25,.5,1]
for pct in pct_dataset:
samples_to_check = nonnull_count * pct
res = self.dataset.expect_column_values_to_be_in_set(
column,
[],
result_format={
"result_format": "SUMMARY",
"partial_unexpected_count": samples_to_check,
},
).result
# Get the distinct values
distinct_values = [*set(res["partial_unexpected_list"])]
if len(distinct_values) == unique_count:
column_profile.distinctValues = [
str(v) for v in res["partial_unexpected_list"]
]
# Exit loop if the distinct values are all captured
break
except Exception as e:
logger.debug(
f"Caught exception while attempting to get distinct values for column {column}. {e}"
)
self.report.report_warning(
"Profiling - Unable to get column distinct values",
f"{self.dataset_name}.{column}",
)
refined-energy-76018
02/16/2023, 12:46 AMbest-umbrella-88325
02/23/2023, 6:36 AMbig-postman-38407
02/28/2023, 9:52 AMhallowed-lizard-92381
03/01/2023, 8:58 PMquiet-jelly-11365
03/03/2023, 11:40 AMnutritious-bird-77396
03/03/2023, 11:50 PMshy-keyboard-55519
03/06/2023, 12:16 PMblue-engineer-74605
03/06/2023, 5:38 PMcold-book-93720
03/07/2023, 3:22 PMv0.10.0
https://github.com/datahub-project/datahub/pull/7512
more on that herefancy-oil-68203
03/07/2023, 11:43 PMworried-branch-76677
03/09/2023, 10:58 AMsilly-fish-85029
03/10/2023, 9:53 AMadamant-article-76582
03/13/2023, 4:02 PMacoustic-quill-54426
03/13/2023, 4:05 PMshy-dog-84302
03/15/2023, 5:57 PMastonishing-cartoon-6079
03/16/2023, 6:25 AMmysterious-monkey-71931
03/22/2023, 11:46 AMmapping types
is completely removed in OpenSearch 2.0 and ElasticSearch 8.0 as well.
So my question is there any plan to support OpenSearch and ElasticSearch 8.xflat-engineer-75197
03/22/2023, 1:53 PMflat-engineer-75197
03/22/2023, 1:53 PM