[spark] Support merge schema in MERGE INTO by Zouxxyy · Pull Request #7789 · apache/paimon

Zouxxyy · 2026-05-08T14:52:52Z

Purpose

Add schema evolution support for MERGE INTO and fix nested-field alignment.

With spark.paimon.write.merge-schema=true, UPDATE * / INSERT * evolve the target schema to include new source columns. Star clauses pull values for those columns from the source by name; explicit clauses fill them with NULL.
A FROM_STAR TreeNodeTag preserves the original star intent, so a fully-listed explicit clause is not mistaken for *.
AssignmentAlignmentHelper now reorders nested struct / array / map fields by name.

Scope

UPDATE * / INSERT * → evolve
Explicit clauses → no evolve
Mixed → evolve, star pulls source, explicit fills NULL
Nested struct / array new fields
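
The scope rules above can be read as a small decision table. The Scala sketch below is a toy model of that table, not Paimon's implementation: `MergeSchemaScopeSketch`, `Clause`, `evolvedColumns`, and `newColumnValue` are hypothetical names, and it assumes exact-match column names and that new columns are appended in source order.

```scala
object MergeSchemaScopeSketch {
  // Hypothetical model of a MERGE action's assignment style.
  sealed trait Clause
  case object Star extends Clause     // UPDATE * / INSERT *
  case object Explicit extends Clause // UPDATE SET a = ... / INSERT (a, ...) VALUES (...)

  /** Target columns after the merge: evolve only if merge-schema is on and some clause is a star. */
  def evolvedColumns(
      target: Seq[String],
      source: Seq[String],
      clauses: Seq[Clause],
      mergeSchema: Boolean): Seq[String] = {
    val evolve = mergeSchema && clauses.contains(Star)
    // Assumption: new source columns are appended after the existing target columns.
    if (evolve) target ++ source.filterNot(c => target.contains(c)) else target
  }

  /** Value a clause writes into a newly added column. */
  def newColumnValue(clause: Clause, column: String): String = clause match {
    case Star     => s"source.$column" // star pulls from the source by name
    case Explicit => "NULL"            // explicit clause never mentioned the column
  }
}
```

In this model a mixed statement (one star clause, one explicit clause) evolves the schema, and each clause then decides independently how the new column is filled.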

Tests

13 new cases in MergeIntoTableTestBase plus WHEN NOT MATCHED BY SOURCE coverage in MergeIntoNotMatchedBySourceTest.

JingsongLi · 2026-05-09T06:22:11Z

+  }
+
+  /** Reorder source struct fields to match target field order by name, recursing into nested types. */
+  private def reorderStructByName(


reorderStructByName crashes when target struct has fields absent from source

Should we support this?

The same issue applies to MapType value reordering in reorderFieldsByName.
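
One way the reorder could tolerate target-only fields is to emit a null for every target field the source lacks instead of failing the lookup. The sketch below illustrates that idea over a hypothetical mini type model; `DType`, `Expr`, and `align` are illustrative names, not Spark or Paimon classes, and whether the PR should adopt this behavior is exactly the open question in this comment.

```scala
object NullFillReorderSketch {
  // Hypothetical stand-ins for Spark's DataType and Expression hierarchies.
  sealed trait DType
  case object Atom extends DType
  final case class Struct(fields: Seq[(String, DType)]) extends DType

  sealed trait Expr
  final case class GetField(path: Seq[String]) extends Expr // read a source field by path
  case object NullLit extends Expr                           // null for a missing field
  final case class MakeStruct(fields: Seq[(String, Expr)]) extends Expr

  /** Build an expression shaped like `target`, reading from `source` by field name. */
  def align(target: DType, source: DType, path: Seq[String]): Expr =
    (target, source) match {
      case (Struct(tFields), Struct(sFields)) =>
        val sourceTypes = sFields.toMap
        MakeStruct(tFields.map { case (name, tType) =>
          val value: Expr = sourceTypes.get(name)
            .map(sType => align(tType, sType, path :+ name))
            .getOrElse(NullLit) // target-only field: fill with null instead of crashing
          name -> value
        })
      case _ =>
        GetField(path) // leaf (or non-struct pair): pass the source value through
    }
}
```

Source fields that do not exist in the target are simply dropped here; a real implementation would also need a typed null that matches the target field's type.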

JingsongLi · 2026-05-09T06:25:06Z

   * reorder and fill nulls for missing sub-fields.
   */
-  private def alignColumns(
+  def alignColumns(


SchemaHelper.scala now handles ArrayType alignment via transform, but MapType is not handled. Meanwhile, AssignmentAlignmentHelper.reorderFieldsByName does handle MapType. This inconsistency means the DataFrame write path won't align map values while the MERGE path will.

Is this a problem?
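
To make the gap concrete, here is a rough sketch of what treating map values the same way as array elements could look like. It renders Spark SQL expression strings with the built-in `transform`, `transform_values`, and `named_struct` functions; the type model and `alignSql` are hypothetical and are not the code in SchemaHelper.scala or AssignmentAlignmentHelper. Map keys are left untouched and nulls are untyped, both simplifications.

```scala
object MapValueAlignSketch {
  // Hypothetical stand-ins for Spark's DataType hierarchy.
  sealed trait DType
  case object Atom extends DType
  final case class Struct(fields: Seq[(String, DType)]) extends DType
  final case class Arr(element: DType) extends DType
  final case class MapT(key: DType, value: DType) extends DType

  /** Render a SQL expression that reshapes `expr` (typed `source`) into `target` field order. */
  def alignSql(expr: String, target: DType, source: DType, depth: Int = 0): String =
    (target, source) match {
      case (t, s) if t == s => expr // already aligned, no rewrite needed
      case (Struct(tFields), Struct(sFields)) =>
        val sourceTypes = sFields.toMap
        val args = tFields.map { case (name, tType) =>
          val value = sourceTypes.get(name)
            .map(sType => alignSql(s"$expr.$name", tType, sType, depth))
            .getOrElse("NULL") // missing sub-field; a real version needs a typed null
          s"'$name', $value"
        }
        args.mkString("named_struct(", ", ", ")")
      case (Arr(tElem), Arr(sElem)) =>
        val x = s"x$depth" // depth keeps nested lambda variables distinct
        s"transform($expr, $x -> ${alignSql(x, tElem, sElem, depth + 1)})"
      case (MapT(_, tVal), MapT(_, sVal)) =>
        val (k, v) = (s"k$depth", s"v$depth")
        s"transform_values($expr, ($k, $v) -> ${alignSql(v, tVal, sVal, depth + 1)})"
      case _ => expr
    }
}
```

For a map column `m` whose struct values have fields in the order (b, a) while the target expects (a, b), this model yields `transform_values(m, (k0, v0) -> named_struct('a', v0.a, 'b', v0.b))`. A write path that recursed only into arrays would return `m` unchanged for the same input, which is the inconsistency described above.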

Zouxxyy marked this pull request as draft May 8, 2026 15:04

[spark] Update merge into

aeb72e2

Zouxxyy force-pushed the dev/merge-update branch from 688a4bd to aeb72e2 on May 8, 2026 16:54

update

6834b05

Zouxxyy marked this pull request as ready for review May 9, 2026 00:40

JingsongLi reviewed May 9, 2026

Zouxxyy wants to merge 2 commits into apache:master from Zouxxyy:dev/merge-update
