Vibhor Gupta

03/09/2023, 5:36 PM
Is it possible to incrementally deep clone/replicate a Delta table?
We have a Delta Lake table on Azure and we need to replicate it to GCP on a daily basis. Partitions/rows are added to and deleted from the Azure table every day.

Kashyap Bhatt

03/09/2023, 8:14 PM
Seems like a simple case of streaming. Have you already considered it? What's the problem?
# PySpark: stream the Azure table into the GCP table
(
    spark.readStream.format("delta")
    .load("path/to/azure-table")
    .writeStream
    .trigger(...)  # trigger left unspecified
    .format("delta")
    .option("checkpointLocation", "path/to/store/checkpoint")
    .start("path/to/gcp-table")
)
👍 1
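
One caveat for this case, since rows are deleted from the Azure table every day: a Delta streaming read fails by default when the source has commits that delete or rewrite data. The `ignoreDeletes` and `ignoreChanges` reader options let the stream continue, but neither one propagates the deletes to the target, so the copy can drift from the source. A minimal sketch, assuming PySpark 3.3+ for `trigger(availableNow=True)`; the function names and the `partition_deletes_only` flag are hypothetical, not something from this thread:

```python
def source_read_options(partition_deletes_only: bool) -> dict:
    """Pick a Delta streaming-source option for a table that has deletes.

    ignoreDeletes covers deletes that drop whole partitions.
    ignoreChanges also covers rewritten files, but rows that did not change
    may be emitted again, so the target can end up with duplicates.
    """
    if partition_deletes_only:
        return {"ignoreDeletes": "true"}
    return {"ignoreChanges": "true"}


def start_append_stream(spark, source_path, target_path, checkpoint_path,
                        partition_deletes_only=False):
    """Start an append-only stream from source to target (hypothetical helper).

    Deletes on the source are NOT replayed on the target by this stream.
    """
    reader = (
        spark.readStream.format("delta")
        .options(**source_read_options(partition_deletes_only))
    )
    return (
        reader.load(source_path)
        .writeStream.format("delta")
        .option("checkpointLocation", checkpoint_path)
        # Assumption: process what is available, then stop (fits a daily job).
        .trigger(availableNow=True)
        .start(target_path)
    )
```

This is only enough if an append-only copy is acceptable; if the GCP table has to mirror the deletes as well, the stream alone will not do it.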

Nick Karpov

03/09/2023, 9:33 PM
Consider using Change Data Feed if your Azure table has UPSERTs.
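
A rough sketch of what that could look like as a daily batch job, assuming `delta.enableChangeDataFeed = true` is set on the Azure table, the table has key columns that identify a row, and the job records the last source version it replicated. `apply_change_feed`, `key_columns`, and `starting_version` are hypothetical names for illustration, not part of Delta's API:

```python
CDF_METADATA_COLUMNS = {"_change_type", "_commit_version", "_commit_timestamp"}


def merge_condition(key_columns):
    """Build the MERGE join condition between target (t) and changes (s)."""
    return " AND ".join(f"t.{k} = s.{k}" for k in key_columns)


def data_column_assignments(columns):
    """Map each data column to its source value, skipping CDF metadata."""
    return {c: f"s.{c}" for c in columns if c not in CDF_METADATA_COLUMNS}


def apply_change_feed(spark, source_path, target_path, key_columns,
                      starting_version):
    """Replay source changes since starting_version onto the target table."""
    # Imported here so the helpers above can be used without Spark installed.
    from pyspark.sql import Window, functions as F
    from delta.tables import DeltaTable

    changes = (
        spark.read.format("delta")
        .option("readChangeFeed", "true")
        .option("startingVersion", starting_version)
        .load(source_path)
        # The pre-update image is not needed to reproduce the final state.
        .filter(F.col("_change_type") != "update_preimage")
    )

    # Keep only the newest change per key across the version range.
    # Assumption: a key changes at most once within a single commit.
    newest_first = Window.partitionBy(*key_columns).orderBy(
        F.col("_commit_version").desc()
    )
    latest = (
        changes.withColumn("_rank", F.row_number().over(newest_first))
        .filter(F.col("_rank") == 1)
        .drop("_rank")
    )

    assignments = data_column_assignments(latest.columns)
    (
        DeltaTable.forPath(spark, target_path).alias("t")
        .merge(latest.alias("s"), merge_condition(key_columns))
        .whenMatchedDelete(condition="s._change_type = 'delete'")
        .whenMatchedUpdate(set=assignments)
        .whenNotMatchedInsert(
            condition="s._change_type != 'delete'", values=assignments
        )
        .execute()
    )
```

Unlike a plain append stream, this replays deletes and updates on the GCP side. It assumes the target already holds a full copy of the source as of the version just before `starting_version`.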