feat(write): add MERGE & UPDATE with DataEvolutionWriter by JingsongLi · Pull Request #241 · apache/paimon-rust

JingsongLi · 2026-04-13T08:59:48Z

Purpose

Add shared DataFileWriter extracted from TableWrite
Add DataEvolutionWriter: engine-agnostic Table-layer API for partial-column updates via _ROW_ID
Add MERGE INTO execution in DataFusion
Add UPDATE execution in DataFusion

Brief change log

Tests

API and Format

Documentation

littlecoder04 · 2026-04-14T15:34:06Z

crates/paimon/src/table/row_id_update_write.rs

+                .with_bucket(file_range.bucket)
+                .with_bucket_path(file_range.bucket_path.clone())
+                .with_total_buckets(file_range.total_buckets)
+                .with_data_files(vec![file.clone()])


At this point, _ROW_ID has already been narrowed to a single DataFileMeta, and this read only loads that file back. However, the current logical row range may span multiple files with the same first_row_id (base file + partial-column files). Upstream Java/Python paths seem to read the whole first_row_id group.
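
To illustrate the concern, here is a minimal sketch of reading back the whole first_row_id group instead of a single file. `FileMeta`, `group_by_first_row_id`, and `files_for_row_id` are hypothetical stand-ins made up for this example; they are not the real `DataFileMeta` or any API in paimon-rust, and the assumption that a group covers `first_row_id .. first_row_id + row_count` is for illustration only.

```rust
use std::collections::BTreeMap;

/// Hypothetical stand-in for a data file's metadata (not the real `DataFileMeta`).
#[derive(Debug, Clone, PartialEq)]
struct FileMeta {
    file_name: String,
    first_row_id: i64,
    row_count: i64,
}

/// Group files by `first_row_id`, so a base file and the partial-column
/// files written on top of it end up in the same group.
fn group_by_first_row_id(files: &[FileMeta]) -> BTreeMap<i64, Vec<FileMeta>> {
    let mut groups: BTreeMap<i64, Vec<FileMeta>> = BTreeMap::new();
    for f in files {
        groups.entry(f.first_row_id).or_default().push(f.clone());
    }
    groups
}

/// Return every file in the group whose row range contains `row_id`.
/// Assumption: a group covers [first_row_id, first_row_id + max row_count).
fn files_for_row_id(files: &[FileMeta], row_id: i64) -> Vec<FileMeta> {
    group_by_first_row_id(files)
        .into_iter()
        .find(|(first, group)| {
            let rows = group.iter().map(|f| f.row_count).max().unwrap_or(0);
            row_id >= *first && row_id < *first + rows
        })
        .map(|(_, group)| group)
        .unwrap_or_default()
}
```

With a base file and a partial-column file that both start at row ID 0, a lookup for any row in that range returns both files, so the read sees the latest value of every column rather than only the columns stored in one file.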

littlecoder04 · 2026-04-14T15:56:06Z

crates/integrations/datafusion/src/merge_into.rs

+        ins.columns
+            .iter()
+            .zip(ins.value_exprs.iter())
+            .map(|(col, expr)| format!("{expr} AS {col}"))


This keeps the generated batch in INSERT (...) order. But the write path later reads partition/bucket fields by target schema index, not by column name, so a reordered insert list can mis-map columns on partitioned / fixed-bucket tables.
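
One way to avoid the mismatch is to emit the projection in target schema order and look up each expression by column name. The sketch below is an illustration only, not the fix that landed in this PR: `projection_in_schema_order` is a hypothetical helper, column names are matched exactly (case-sensitive), and filling unlisted columns with `NULL` is an assumption.

```rust
use std::collections::HashMap;

/// Build `expr AS col` items in target schema order rather than in
/// INSERT (...) order. Hypothetical helper, not part of paimon-rust.
fn projection_in_schema_order(
    target_schema: &[&str],
    insert_columns: &[&str],
    value_exprs: &[&str],
) -> Result<Vec<String>, String> {
    if insert_columns.len() != value_exprs.len() {
        return Err(format!(
            "{} columns but {} value expressions",
            insert_columns.len(),
            value_exprs.len()
        ));
    }

    // Index the value expressions by the column name they were written for.
    let mut by_name: HashMap<&str, &str> = HashMap::new();
    for (col, expr) in insert_columns.iter().zip(value_exprs.iter()) {
        if !target_schema.contains(col) {
            return Err(format!("unknown column {col}"));
        }
        if by_name.insert(*col, *expr).is_some() {
            return Err(format!("duplicate column {col}"));
        }
    }

    // Walk the target schema so positional reads downstream line up.
    // Assumption: columns missing from the insert list become NULL.
    Ok(target_schema
        .iter()
        .map(|col| match by_name.get(col) {
            Some(expr) => format!("{expr} AS {col}"),
            None => format!("NULL AS {col}"),
        })
        .collect())
}
```

For a target schema `(id, dt, name)` and an insert list `(name, id, dt)`, the generated items come out as `id`, `dt`, `name`, so a writer that reads the partition or bucket field by schema index picks up the intended column.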

JingsongLi · 2026-04-15T01:14:55Z

@littlecoder04 Thanks for the review. I have addressed the comments and added e2e tests.

littlecoder04 · 2026-04-15T02:42:35Z

+1

luoyuxia

+1

jerry-024

+1

JingsongLi force-pushed the merge-into branch 2 times, most recently from de603ad to 1c52144 Compare April 13, 2026 10:04

JingsongLi changed the title ~~[WIP] feat(write): add MERGE INTO support with RowIdUpdateWriter~~ feat(write): add MERGE INTO support with RowIdUpdateWriter Apr 13, 2026

JingsongLi force-pushed the merge-into branch from 1c52144 to 9d4ef64 Compare April 13, 2026 10:52

JingsongLi changed the title ~~feat(write): add MERGE INTO support with RowIdUpdateWriter~~ feat(write): add MERGE & UPDATE support with RowIdUpdateWriter Apr 13, 2026

JingsongLi force-pushed the merge-into branch 2 times, most recently from 0c496bc to 16c7d0a Compare April 13, 2026 15:58

littlecoder04 reviewed Apr 14, 2026

feat(write): add MERGE & UPDATE support with RowIdUpdateWriter

0531535

JingsongLi changed the title ~~feat(write): add MERGE & UPDATE support with RowIdUpdateWriter~~ feat(write): add MERGE & UPDATE support with DataEvolutionWriter Apr 15, 2026

JingsongLi changed the title ~~feat(write): add MERGE & UPDATE support with DataEvolutionWriter~~ feat(write): add MERGE & UPDATE with DataEvolutionWriter Apr 15, 2026

JingsongLi force-pushed the merge-into branch from a12550f to 6827fe9 Compare April 15, 2026 01:48

Fix comments

9488c3b

JingsongLi force-pushed the merge-into branch from 6827fe9 to 9488c3b Compare April 15, 2026 01:49

luoyuxia approved these changes Apr 15, 2026

jerry-024 approved these changes Apr 15, 2026

JingsongLi merged commit f424ded into apache:main Apr 15, 2026
8 checks passed

JingsongLi merged 2 commits into apache:main from JingsongLi:merge-into
