nikhil raman
03/23/2023, 3:57 AM
Yuval Itzchakov
03/23/2023, 9:00 AM
Starkadur Hrobjartsson
03/23/2023, 10:06 AM
ALTER TABLE <table> SET TBLPROPERTIES (
  'delta.minReaderVersion' = '2',
  'delta.minWriterVersion' = '5',
  'delta.columnMapping.mode' = 'name'
)
This resulted in new files being written to the storage account in directories with random two-letter names, e.g. 'pi', 'Wo', '32', instead of the configured partition directories used before the change.
My question is: why does this happen, and is this something I need to care about?
Borislav Blagoev
03/23/2023, 3:21 PM
S3 Multi-Cluster Writes in my Databricks Workspace. I have confirmed that I have read/write permissions to the DynamoDB table, and have successfully tested the permissions using boto3 in a Databricks notebook.
I have also set the Spark configs as per step 4 of the documentation. The runtime version on my cluster is 11.3 LTS, and I have installed the following dependencies: io.delta:delta-storage-s3-dynamodb:2.1.0 and com.amazonaws:aws-java-sdk:1.12.427.
I ran a script that reads a delta table and saves it to a different location, but there are no records in the DynamoDB table. I'm wondering if I'm missing something or if there's a step I haven't completed.
I would appreciate any insights or advice on how to successfully enable S3 Multi-Cluster Writes. Here is the link to the documentation that I'm following: https://delta.io/blog/2022-05-18-multi-cluster-writes-to-delta-lake-storage-in-s3/
sabari dass
03/23/2023, 7:45 PM
William Wu
03/23/2023, 7:57 PM
deep clone. Looking at the open source docs, I don’t see any mentions of cloning but looking at github, I do see tests running clone in CloneTableSqlSuite.scala. It appears to have been added pretty recently (3 months ago).
Is there an estimated timeline of when clone functionality will be released?
Chandra
03/23/2023, 8:11 PM
Chandra
03/23/2023, 9:04 PM
Dhruvil Shah
03/24/2023, 12:16 AM
Per Tengkvist
03/24/2023, 11:05 AM
Martin Beaussart
03/24/2023, 11:59 AM
Joydeep Banik Roy
03/24/2023, 2:23 PM
Joydeep Banik Roy
03/24/2023, 2:23 PM
Vinodh Thiagarajan
03/24/2023, 5:49 PM
sabari dass
03/24/2023, 9:30 PM
Shubham Goyal
03/25/2023, 9:48 AM
Christina
03/25/2023, 10:07 PM
Christina
03/25/2023, 10:08 PM
Rusty Conover
03/25/2023, 11:53 PM
Abhinav Raghuvanshi
03/26/2023, 5:06 AM
Tim Burns
03/26/2023, 7:50 PM
# Load the Delta Lake table
delta_table = DeltaTable.forPath(spark, "/path/to/delta_table")
# Get the metadata of the table
metadata = delta_table.metadata
# Print the metadata
print(metadata)
However, I find that this doesn't work, even though the metadata JSON is right there, so I'm now writing code like this to find a simple table schema on a metadata table.
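import json
import os
from os.path import join

# Fragment of a method: `self.catalog_metadata` and `table_path` come from the enclosing class.
delta_dir = join(table_path, "_delta_log")
# Commit file names are zero-padded, so sorting visits them oldest to newest.
for file in sorted(os.listdir(delta_dir)):
    if file.endswith(".json"):
        with open(join(delta_dir, file)) as json_file:
            for json_line in json_file:
                json_obj = json.loads(json_line)
                if "metaData" in json_obj:
                    # A later commit overwrites an earlier one, so the newest metaData wins.
                    self.catalog_metadata[table_path] = json_obj["metaData"]
metadata = self.catalog_metadata[table_path]
schema_string = metadata["schemaString"]
result = json.loads(schema_string)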
It seems like such a simple, basic question to ask of the delta table: what is your schema? Why isn't it baked into the API? Or if it is, how do I get at it without resorting to writing code?
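For reference, a minimal sketch of two routes to the schema. In PySpark, a `DeltaTable` can be converted to a DataFrame with `toDF()`, and that DataFrame's `schema` attribute holds the table schema, e.g. `DeltaTable.forPath(spark, "/path/to/delta_table").toDF().schema`. Where no Spark session is at hand, the same information can be read from the transaction log, as the code above does. The helper below, `read_delta_schema`, is a hypothetical name and not part of any Delta Lake API; it assumes JSON commit files on a local filesystem and does not read Parquet checkpoints.

```python
import json
import os


def read_delta_schema(table_path):
    """Return the schema dict from the newest metaData action in the log.

    Hypothetical helper: assumes JSON commit files on a local filesystem and
    ignores Parquet checkpoints, so it can miss the schema on tables whose
    early commits have been cleaned up.
    """
    log_dir = os.path.join(table_path, "_delta_log")
    latest = None
    # Commit file names are zero-padded version numbers, so a lexical sort
    # walks them from oldest to newest.
    for name in sorted(os.listdir(log_dir)):
        if not name.endswith(".json"):
            continue
        with open(os.path.join(log_dir, name)) as commit:
            for line in commit:
                if not line.strip():
                    continue
                action = json.loads(line)
                if "metaData" in action:
                    latest = action["metaData"]
    if latest is None:
        raise ValueError(f"no metaData action found under {log_dir}")
    # schemaString is itself a JSON document describing a struct type.
    return json.loads(latest["schemaString"])
```

Calling `read_delta_schema("/path/to/delta_table")` returns a dict whose `fields` list holds each column's `name`, `type`, and `nullable` flag.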
Thanks, Tim
orsagiv
03/27/2023, 8:45 AM
Lucas Zago
03/27/2023, 6:05 PM
Ian
03/28/2023, 5:40 AM
"schema": {
  "date": {
    "data_type": "string",
    "partition_column": true,
    "nullable": false
  },
  "shift": {
    "data_type": "string",
    "partition_column": false,
    "nullable": false
  }
}
The above is the schema of the table that we created, but deltaTable.schema() returns the schema below:
"schema": {
  "date": {
    "data_type": "string",
    "partition_column": true,
    "nullable": true
  },
  "shift": {
    "data_type": "string",
    "partition_column": false,
    "nullable": true
  }
}
nullable is returned as true even though we set it to false.
We are using Spark to create the Delta table.
Ian
03/28/2023, 11:31 AM
Trey Yi
03/29/2023, 11:16 AM
Failed to merge decimal types with incompatible scale 4 to 8
Diogo Cunha
03/29/2023, 4:48 PM
Lucas Zago
03/29/2023, 6:21 PM
Chandra
03/29/2023, 6:56 PM
Chandra
03/29/2023, 7:14 PM