Skip SortExec for partitioning columns in OPTIMIZE #1166


Open · wants to merge 1 commit into base: master

Conversation

sezruby (Contributor) commented Jun 1, 2022

Description

OPTIMIZE compacts the files of a single partition directory at a time, so the SortExec that Spark inserts before partitioned writes is unnecessary.
We can skip it by passing the writer a plan whose SortOrder already covers the partition columns.

Fixes #948
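The idea can be sketched outside Spark. This is an illustrative, self-contained Scala model, not Delta Lake or Spark code: `SortKey`, `orderingSatisfied`, and `needsSort` are hypothetical names. It models the rule this PR exploits: the writer requires its input sorted by the partition columns, and inserts a sort only when the plan's reported output ordering does not already satisfy that requirement.

```scala
// Illustrative sketch only: SortKey, orderingSatisfied, and needsSort are
// hypothetical names, not Spark or Delta Lake APIs. They model the rule
// "insert a sort before writing partitioned data unless the plan already
// reports a compatible output ordering". OPTIMIZE reads one partition
// directory per bin, so its plan can safely claim an ordering on the
// partition columns and the sort is skipped.
case class SortKey(column: String, ascending: Boolean = true)

// The reported ordering satisfies the requirement when the required keys
// form a prefix of the keys the plan claims to emit.
def orderingSatisfied(required: Seq[SortKey], reported: Seq[SortKey]): Boolean =
  required.length <= reported.length &&
    required.zip(reported).forall { case (want, have) => want == have }

// A SortExec-like node is needed only when the requirement is unmet.
def needsSort(partitionColumns: Seq[String], reported: Seq[SortKey]): Boolean =
  !orderingSatisfied(partitionColumns.map(SortKey(_)), reported)
```

Under this model, a non-partitioned write needs no sort (empty requirement), and a partitioned OPTIMIZE whose plan declares a SortOrder on `colC` needs none either; only a plan with no claimed ordering would still trigger the sort.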

How was this patch tested?

Test data: 860 MB of parquet in total, 100 files

Non partitioned data

spark.range(200000000).map { _ =>
    (scala.util.Random.nextInt(10).toLong, scala.util.Random.nextInt(1000000000), scala.util.Random.nextInt(1))
}.toDF("colA", "colB", "colC").repartition(100).write.mode("overwrite").format("delta").save(dataPath)

Partitioned data, but all values are the same (colC = 0)

spark.range(200000000).map { _ =>
    (scala.util.Random.nextInt(10).toLong, scala.util.Random.nextInt(1000000000), scala.util.Random.nextInt(1))
}.toDF("colA", "colB", "colC").repartition(100).write.partitionBy("colC").mode("overwrite").format("delta").save(dataPath + "1")

E2E duration of OPTIMIZE with master + the PR

  • non-partitioned: 2 min 28 sec
  • partitioned: 2 min 24 sec

E2E duration of OPTIMIZE with master

  • non-partitioned: 2 min 30 sec
  • partitioned: 3 min 4 sec

Does this PR introduce any user-facing changes?

No

sezruby (Contributor, Author) commented Jun 2, 2022

@zsxwing Could you review the PR? Thanks!

@tdas tdas requested a review from vkorukanti June 2, 2022 18:10
@@ -237,7 +237,7 @@ class OptimizeExecutor(
sparkSession.sparkContext.getLocalProperty(SPARK_JOB_GROUP_ID),
description)

-    val addFiles = txn.writeFiles(repartitionDF).collect {
+    val addFiles = txn.writeFiles(repartitionDF, actionType = "Optimize").collect {
Collaborator:

I haven't tried it, but I wonder if there is a way we could specify the SortOrder on the repartitionDF here itself, so that we can avoid passing an action type to txn.writeFiles?

Contributor (Author):

I think there is no way to do that, as the DataFrame is not a Spark plan. Also, the Spark optimizer might build an incorrect plan from a fake DataFrame, so it's better to attach the ordering after all rules have been applied.
We may need to keep the actionType for OptimizeWrite anyway, to exclude OPTIMIZE/ZORDER/auto compaction from the OptimizeWrite feature.

Contributor (Author):

Maybe we can read the actionType from TahoeBatchFileIndex.
Let me try it.

Collaborator:

How about this:

    val input = txn.deltaLog.createDataFrame(txn.snapshot, bin, actionTypeOpt = Some("Optimize"))
    val repartitionDF = input.coalesce(numPartitions = 1)

    val sortOrder =
      partitionColumns.map(p => SortOrder(p, Ascending, Seq.empty[Expression]))
    val df = LogicalRDD(
      outputAttributes,
      repartitionDF.queryExecution.toRdd,
      outputOrdering = sortOrder)

Contributor (Author):

I tried this approach, but it requires some extra code in OptimizeTableCommand.scala: extracting the physical plan for partitionColumns/outputAttributes, etc.
In addition, I'd like to minimize any side effects the fake outputOrdering might cause during Spark optimization, though I know the plan for compaction is not that complicated.

How about keeping the current PR?
In any case, I need to use TransactionalWrite.isOptimizeCommand for the OptimizeWrite functionality.

@sezruby sezruby force-pushed the optsortexec branch 4 times, most recently from 46c747d to 46c2a55 Compare June 10, 2022 23:29
@scottsand-db scottsand-db self-requested a review August 1, 2022 18:22
sezruby (Contributor, Author) commented Oct 24, 2022

Hi @scottsand-db, can someone review the PR? Thanks!

Successfully merging this pull request may close these issues.

Eliminate the unnecessary sort in optimize (file compaction)