Rudhra Raveendran
05/22/2023, 4:51 PM
TD
05/22/2023, 4:53 PM
unified_col = coalesce(new_col, some_fixing_expression(old_col))
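A minimal sketch of what that coalesce line does row by row, written in plain Python rather than Spark so it runs anywhere. The rows and the body of some_fixing_expression are hypothetical (the thread does not say what the real fix-up is); only the names unified_col, new_col, and old_col come from the message above.

```python
from typing import Optional


def some_fixing_expression(old_value: Optional[str]) -> Optional[str]:
    """Hypothetical fix-up; the thread does not define the real expression.

    Here it only trims whitespace, standing in for whatever conversion
    maps the old column's format onto the new one.
    """
    if old_value is None:
        return None
    return old_value.strip()


def coalesce(*values):
    """Return the first argument that is not None (SQL COALESCE semantics)."""
    for value in values:
        if value is not None:
            return value
    return None


# Hypothetical rows: older rows only populate old_col, newer rows populate new_col.
rows = [
    {"old_col": "  a1 ", "new_col": None},
    {"old_col": None, "new_col": "b2"},
    {"old_col": " c3", "new_col": "c3-new"},
    {"old_col": None, "new_col": None},
]

# new_col wins whenever it is present; otherwise fall back to the fixed old_col.
unified = [
    coalesce(r["new_col"], some_fixing_expression(r["old_col"]))
    for r in rows
]
print(unified)  # ['a1', 'b2', 'c3-new', None]
```

In PySpark the same idea would typically go through pyspark.sql.functions.coalesce inside a withColumn call, with the fix-up written as a column expression instead of a Python function.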
Rudhra Raveendran
05/22/2023, 5:21 PM
df = spark.read.format("delta").load(path) but I can still see the column with spark.read.parquet(path), so would it make sense to use different functions, one for the current schema and one for historical data (until retention expiry)?
Kees Duvekot
05/22/2023, 11:10 PM
Gerhard Brueckl
05/23/2023, 6:57 AM
Chain Hermanson
06/03/2023, 7:48 AM
Kees Duvekot
06/03/2023, 7:50 AM
Chain Hermanson
06/03/2023, 7:58 AM
Kees Duvekot
06/03/2023, 8:17 AM
Chain Hermanson
06/03/2023, 8:33 AM
Kees Duvekot
06/03/2023, 8:37 AM
Chain Hermanson
06/03/2023, 8:52 AM
Kees Duvekot
06/03/2023, 8:53 AM
Chain Hermanson
06/03/2023, 8:58 AM