Tom Barber
03/18/2024, 5:46 PMStefan Krawczyk
03/18/2024, 5:47 PMStefan Krawczyk
03/18/2024, 5:47 PMTom Barber
03/18/2024, 5:48 PMStefan Krawczyk
03/18/2024, 5:48 PMStefan Krawczyk
03/18/2024, 5:48 PMStefan Krawczyk
03/18/2024, 5:50 PMStefan Krawczyk
03/18/2024, 5:51 PMTom Barber
03/18/2024, 5:51 PMStefan Krawczyk
03/18/2024, 5:52 PMTom Barber
03/18/2024, 5:52 PMTom Barber
03/18/2024, 5:53 PMStefan Krawczyk
03/18/2024, 5:55 PMStefan Krawczyk
03/18/2024, 5:55 PMTom Barber
03/18/2024, 5:56 PM@extract_columns('transaction_id', 'originator.account_number', 'beneficiary.account_number',
'transaction_sub_type', 'credit_or_debit')
def read_transaction_input() -> pl.DataFrame:
return pl.read_parquet("/tmp/hive/data/hive/warehouse/consilient.db/new_table_name/part-00000-d9898992-0d9d-418c-ba9c-ef6923e81976-c000.snappy.parquet")
Stefan Krawczyk
03/18/2024, 5:58 PMStefan Krawczyk
03/18/2024, 6:03 PMStefan Krawczyk
03/18/2024, 6:05 PM@with_columns
.Stefan Krawczyk
03/18/2024, 6:06 PM@extract_columns
. But you can still use Hamilton!
def transaction_df() -> pl.LazyFrame:
return pl.scan_parquet("/tmp/hive/data/hive/warehouse/consilient.db/new_table_name/part-00000-d9898992-0d9d-418c-ba9c-ef6923e81976-c000.snappy.parquet")
Tom Barber
03/18/2024, 6:07 PMStefan Krawczyk
03/18/2024, 6:08 PMStefan Krawczyk
03/18/2024, 6:10 PMTom Barber
03/18/2024, 6:18 PM