Hi guys,
I am working on the medallion architecture of delta but there is limited resources to read. Databrick says the gold layer requires fewer join and de-normallized while silver layer has the 3rd normal form. So
1. The silver is the dim of star-schema, and we can join all the dim to the universal set (as the gold table)
2. The silver
A . If Join: silver can only be generated from join bronzes on primary key
B. For other operation: mainly support clean, filter, augment, create new column
3. After the gold table is generated, which kind of data should it hold:
A .The fully joined table of star-schema
B. summarized data after groupby and agg (groupby zone, agg count*)
Can anyone have experience help me to clarify these concern?