wooden-glass-33314
11/28/2023, 10:33 PMbrainy-addition-90818
11/28/2023, 10:38 PMimport whylogs as why
row = {"a": 1, "b": 2, "c": 3}
result_set = why.log(row)
print(result_set.view().serialize())
brainy-addition-90818
11/28/2023, 10:39 PMbytes
that you can store and load back up later into a DatasetProfileView. The biggest caveat is that you're going to be using a DatasetProfileView
and not a DatasetProfile
at that point, which means you can only merge it with other things, you can't add individual datapoints directly.wooden-glass-33314
11/28/2023, 10:40 PMbrainy-addition-90818
11/28/2023, 10:41 PM._columns
. It doesn't have a convenient .equals()
function sadlybrainy-addition-90818
11/28/2023, 10:42 PMimport whylogs as why
row = {"a": 1, "b": 2, "c": 3}
result_set = why.log(row)
ser = result_set.view().serialize()
view = why.DatasetProfileView.deserialize(ser)
print(view._columns)
brainy-addition-90818
11/28/2023, 10:43 PMview._columns
view._dataset_timestamp
view._creation_timestamp
view._metrics
view._metadata: Dict[str, str]
brainy-addition-90818
11/28/2023, 10:44 PMwooden-glass-33314
11/28/2023, 10:46 PMmysterious-solstice-25388
11/28/2023, 10:47 PMwooden-glass-33314
11/28/2023, 10:47 PMwooden-glass-33314
11/28/2023, 10:50 PMmysterious-solstice-25388
11/28/2023, 10:53 PMmysterious-solstice-25388
11/28/2023, 10:54 PMwooden-glass-33314
11/28/2023, 10:55 PMwooden-glass-33314
11/28/2023, 10:55 PMmysterious-solstice-25388
11/28/2023, 11:01 PMNotebookProfileVisualizer
thats right those require two profiles, sorry I thought you were referencing a "summary" method we have for displaying specific column profile viewswooden-glass-33314
11/28/2023, 11:03 PMmysterious-solstice-25388
11/28/2023, 11:05 PMNotebookProfileVisualizer
or you can call methods directly to `calculate_drift_scores`: https://github.com/whylabs/whylogs/blob/mainline/python/examples/advanced/Drift_Algorithm_Configuration.ipynbwooden-glass-33314
11/28/2023, 11:08 PMmysterious-solstice-25388
11/28/2023, 11:09 PMmysterious-solstice-25388
11/28/2023, 11:11 PMwooden-glass-33314
11/28/2023, 11:13 PMcalculate_drift_scores
gets me that data structure i was looking for, so i can at least test out my idea from here. thanks for some alternative ideas on how to look at the problem. we’re looking to automate all of this for any model we train and do it in a uniform way, with a giant list of assumptions that must be true for it to be repeatable across models. we’ll be testing those assumptions toomysterious-solstice-25388
11/28/2023, 11:15 PMwooden-glass-33314
11/28/2023, 11:15 PMmysterious-solstice-25388
11/28/2023, 11:16 PMmysterious-solstice-25388
11/28/2023, 11:17 PMwooden-glass-33314
11/28/2023, 11:20 PMrelations
package are also serializable in some form and i think we could make some use out of thiswooden-glass-33314
11/28/2023, 11:46 PMmysterious-solstice-25388
11/28/2023, 11:51 PM