βInstall on Spark
Option 1: With package installation and code changes (Scala only)
libraryDependencies += "io.dataflint" %% "spark_2.12" % "0.8.2"implementation 'io.dataflint:spark_2.12:0.8.2'<dependency>
<groupId>io.dataflint</groupId>
<artifactId>spark_2.12</artifactId>
<version>0.8.2</version>
</dependency> val spark = SparkSession
.builder
.appName("MyApp")
.config("spark.plugins", "io.dataflint.spark.SparkDataflintPlugin")
...
.getOrCreate()Option 2: No-code via spark-submit or spark-properties.conf (python & scala)
Option 3: With only code changes (python only)
Option 4: download jar manually and add it to class path
Option 5: k8s Spark Operator
Option 6: EMR

Last updated