jaiminks

01/09/2023, 10:07 PM
Hi Team, is it possible to deploy Delta Lake OSS using Vagrant and VirtualBox for basic development and testing purposes?

Ryan Zhu

01/09/2023, 10:56 PM
Delta is just an open source library, so you can include it in your own project. For example, the entire open source Delta Lake on Spark project doesn't use any virtual machines. It's just a Scala project: https://github.com/delta-io/delta

Kashyap Bhatt

01/09/2023, 11:09 PM
Yes, just create a virtual env using virtualenv or conda or poetry or something, add pyspark and delta (the delta-spark package on PyPI) as dependencies, activate the environment, and then create a Spark session as described here. Something like:
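import pyspark
from delta import configure_spark_with_delta_pip

# Enable the Delta SQL extension and the Delta catalog on the session builder.
builder = pyspark.sql.SparkSession.builder.appName("MyApp") \
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension") \
    .config("spark.sql.catalog.spark_catalog", "org.apache.spark.sql.delta.catalog.DeltaCatalog")

# Adds the Delta Lake jars matching the pip-installed delta package, then starts Spark.
spark = configure_spark_with_delta_pip(builder).getOrCreate()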
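
To check that the environment actually works, a small write/read round trip is probably enough. This is only a rough sketch: the names DELTA_CONF, build_session and roundtrip are made up for illustration, it assumes pyspark and a matching delta-spark version are installed in the active environment, and the local[*] master is my assumption for a laptop setup.

```python
# Delta settings from the snippet above, kept in one place (hypothetical helper name).
DELTA_CONF = {
    "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
    "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
}


def build_session(app_name="MyApp"):
    """Create a local SparkSession with Delta enabled (hypothetical helper)."""
    # Imported lazily so this file still loads where Spark is not installed.
    from pyspark.sql import SparkSession
    from delta import configure_spark_with_delta_pip

    # local[*] is an assumption: run Spark in-process using all local cores.
    builder = SparkSession.builder.appName(app_name).master("local[*]")
    for key, value in DELTA_CONF.items():
        builder = builder.config(key, value)
    return configure_spark_with_delta_pip(builder).getOrCreate()


def roundtrip(spark, path):
    """Write five rows as a Delta table at `path`, read them back, return the row count."""
    spark.range(5).write.format("delta").mode("overwrite").save(path)
    return spark.read.format("delta").load(path).count()
```

Calling roundtrip(build_session(), some_path) with any writable local path should return 5 if the setup is right. No VM needed for that, just the Python environment and a JVM.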
Denny Lee

01/10/2023, 5:59 AM
Also note, you can deploy the Delta Lake Docker image via https://go.delta.io/docker, which includes the Delta Rust Python bindings, PySpark, Spark, Jupyter notebook, Rust, and ROAPI. HTH!