ritwik singh
05/31/2023, 3:11 AMDivyansh Jain
05/31/2023, 5:40 AMSuraj Malthumkar
05/31/2023, 8:29 AMRobin Moffatt
05/31/2023, 12:03 PMAlessandro Biagi
05/31/2023, 4:04 PMdelta merge
operation. What does it happen if during a merge operation the underlying data is changing? Can this lead to data inconsistencies? This would explain some issues that we're facing..Anton Abilov
05/31/2023, 4:24 PMmaxBytesPerTrigger
dynamically?
I would like for my Spark streaming job to process large batches (i.e. 50GB) when there is a lot of data to backfill, however when it’s caught up it should process smaller batches.Alessandro Biagi
05/31/2023, 4:39 PMIván RamĂrez
05/31/2023, 6:53 PMKiran Kondamadugula
05/31/2023, 7:35 PMKenny Ma
06/01/2023, 12:17 AMsharath
06/01/2023, 8:20 AMIris Shainsky
06/01/2023, 9:13 AMDominic Copeland
06/01/2023, 9:53 AMGal Stainfeld
06/01/2023, 11:40 AMOPTIMIZE
operation with operationParameters.zOrderBy
.
The above method worked for me locally after i z-ordered a table, but i was trying to use it on a table i know for sure was z-ordered in the past (half a year ago) and it returned empty results.
Is it possible for a table to just lose it’s z-ordered when the only operations made on it are add and update ones ?
ThanksBhupendra Singh Chauhan(TECH-BLR)
06/01/2023, 12:23 PMBhupendra Singh Chauhan(TECH-BLR)
06/01/2023, 3:01 PMSujith Kumar.S
06/02/2023, 6:50 AMRobin Kraft
06/02/2023, 9:37 AMDivyansh Jain
06/02/2023, 1:21 PMBhupendra Singh Chauhan(TECH-BLR)
06/02/2023, 2:32 PMAlber Tadrous
06/05/2023, 3:34 PMMohanraj M
06/05/2023, 4:13 PMCam Gallucci
06/05/2023, 5:51 PMdelta_sharing.load_as_spark(table_url)
function.
2. An update to record x occurs in version 6
3. Another update to record x occurs in version 7
4. I load the data from version 5 to 7 to get the most recent updates with delta_sharing.load_table_changes_as_spark(table_url, starting_version=5, ending_version=7)
Will step 4 give me one combined update for record x (with the update_preimage
from version 5), or will it have both updates (two update_preimage
, one from version 5 and one from version 6)? And then say in step 3 record x was deleted instead of updated, would only the delete come?
Appreciate any clarification on this, thanks!Christina
06/05/2023, 11:54 PMCREATE TABLE USING DELTA LOCATION …
creates external tables. Is there a way to create a managed delta table with an existing dataset?Ajex
06/06/2023, 4:13 AM"File does not exist: /delta_path/part-00068-23256f70-a297-416f-8c5d-3650343929b3.c000.zstd.parquet"
Please let me know if you have any suggestions or solutions to resolve this issue (execute 2 job at the same time without conflict).Rahul Sharma
06/06/2023, 6:42 AMGapyNi
06/06/2023, 12:24 PMset spark.sql.streaming.stateStore.stateSchemaCheck to false
? Main goal would be to merge schemas while streaming (adding new columns, ...).
When running this code in Notebook (12.2.x-scala2.12) it works as opposed to Job (10.4.x-scala2.12). Was it some change done?
Thanks and regards,
GapyNiDinesh N
06/06/2023, 1:05 PMAlber Tadrous
06/06/2023, 8:08 PMJohn Darrington
06/06/2023, 8:09 PM