Gal Stainfeld
08/21/2023, 8:40 PMdate
and inside it again by normalized_at
date.
Is there a way to tell the stream to only look for new data each interval from the normalized_at
partition that suits today and yesterday? Not sure how exactly how it knows from where on the table to take the new events from the last checkpoint so maybe by utilizing the table’s partitions can help it.
Thanks.Nick Karpov
08/22/2023, 4:47 PMforeachBatch
Gal Stainfeld
08/22/2023, 4:53 PMtoday|yesterday.
So the question is if putting such filter on the foreachBatch
will help the the stream in any way (because it will focus on reading two partitions only when getting the new data for the batch) or it won’t matter at all.Nick Karpov
08/22/2023, 5:03 PM