Jonas Meyer-Ohle
10/16/2024, 3:20 PM
Actual error: No registered subclass of BaseDefaultValidator is available for arg: schema and type <class 'polars.dataframe.frame.DataFrame'>. This either means (a) this arg-type contribution isn't supported or (b) this has not been added yet (but should be). In the case of (b), we welcome contributions. Get started at github.com/dagworks-inc/hamilton.
I stepped through the following file a bit: https://github.com/DAGWorks-Inc/hamilton/blob/main/hamilton/data_quality/pandera_validators.py#L9
And it seems like the polars plugin isn't part of the supported extensions, I'm assuming this is the issue?
Thanks!
Thierry Jean
10/16/2024, 3:23 PM
Jonas Meyer-Ohle
10/16/2024, 3:26 PM
Thierry Jean
10/16/2024, 3:39 PM
Adding "polars" to the list of extensions seems to fix the issue. We'll be releasing a patch soon.
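To illustrate why a missing entry in a list of supported extensions can surface as a "No registered subclass" error, here is a minimal sketch. It is a simplified stand-in, not Hamilton's actual code: `PandasFrame`, `PolarsFrame`, `DF_TYPES`, `applies_to`, and `resolve_validator` are all hypothetical names made up for this example.

```python
# Simplified, hypothetical stand-in for an extension-gated validator lookup.
# None of these names are Hamilton's real API.


class PandasFrame:
    """Stand-in for a pandas DataFrame type."""


class PolarsFrame:
    """Stand-in for a polars DataFrame type."""


# Hypothetical registry: extension name -> dataframe type it contributes.
DF_TYPES = {"pandas": PandasFrame, "polars": PolarsFrame}


def applies_to(datatype, supported_extensions):
    """Return True if datatype belongs to one of the supported extensions."""
    return any(
        name in DF_TYPES and issubclass(datatype, DF_TYPES[name])
        for name in supported_extensions
    )


def resolve_validator(arg, datatype, supported_extensions):
    """Return a validator name, or raise if no extension covers the type."""
    if not applies_to(datatype, supported_extensions):
        raise ValueError(
            f"No registered validator for arg: {arg} and type {datatype}"
        )
    return "dataframe_schema_validator"  # placeholder for a validator class


# The type is registered, but its extension is not in the supported list.
before_fix = frozenset(["pandas"])
# Adding "polars" to the list is enough for the lookup to succeed.
after_fix = before_fix | {"polars"}
```

With `before_fix`, `resolve_validator("schema", PolarsFrame, before_fix)` raises even though the polars type itself is known. With `after_fix`, the same call succeeds.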
I also noticed that you were using the class-based API with pandera.polars.DataFrameModel and pandera.polars.Field. It didn't exist when we first built the pandera plugin. Instead, you would create pandera.DataFrameSchema(...) objects with column specifications via pa.Column(...).
It seems to be fully compatible, but let us know if you find some rough edges!
Jonas Meyer-Ohle
10/16/2024, 3:42 PM
Thierry Jean
10/16/2024, 4:06 PM
Thierry Jean
10/16/2024, 5:10 PM