jaiminks
01/09/2023, 10:07 PM
Ryan Zhu
01/09/2023, 10:56 PM
Kashyap Bhatt
01/09/2023, 11:09 PM
Create a virtual environment with virtualenv, conda, poetry, or something similar; add pyspark and delta as dependencies, activate the environment, and then create a Spark session as described here.
Something like:
import pyspark
from delta import configure_spark_with_delta_pip

# Enable Delta Lake's SQL extension and catalog on the session builder
builder = pyspark.sql.SparkSession.builder.appName("MyApp") \
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension") \
    .config("spark.sql.catalog.spark_catalog", "org.apache.spark.sql.delta.catalog.DeltaCatalog")

# configure_spark_with_delta_pip adds the delta-spark package to the session's dependencies
spark = configure_spark_with_delta_pip(builder).getOrCreate()
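A quick way to sanity-check that the session is Delta-enabled (assuming pyspark and delta-spark are already installed in the environment, e.g. via pip; the /tmp path below is just an example):

# Write a small Delta table and read it back (example path, adjust as needed)
spark.range(5).write.format("delta").mode("overwrite").save("/tmp/delta-test")
spark.read.format("delta").load("/tmp/delta-test").show()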
Denny Lee
01/10/2023, 5:59 AM