[Spark] Support in-place migration from unpartitioned table to clustered table #3174

Merged
merged 5 commits into delta-io:master from in-place-migration
May 31, 2024

Conversation

zedtang
Collaborator

@zedtang zedtang commented May 29, 2024

Which Delta project/connector is this regarding?

  • Spark
  • Standalone
  • Flink
  • Kernel
  • Other (fill in here)

Description

Support in-place migration from an unpartitioned table to a clustered table. If users run `ALTER TABLE CLUSTER BY` on an unpartitioned table, the command now turns it into a clustered table with ClusteringMetadataDomain.
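To make the migration command concrete, here is a minimal sketch of how a caller might build the `ALTER TABLE ... CLUSTER BY (...)` statement before handing it to Spark. The helper names (`quote_ident`, `cluster_by_statement`) and the table and column names are hypothetical and not part of this PR; the sketch only assembles the SQL text and assumes flat (non-nested) column names.

```python
from typing import List


def quote_ident(name: str) -> str:
    """Backtick-quote one identifier part, doubling any embedded backticks."""
    return "`" + name.replace("`", "``") + "`"


def cluster_by_statement(table: str, columns: List[str]) -> str:
    """Build an ALTER TABLE ... CLUSTER BY statement (hypothetical helper).

    `table` may be qualified with dots (e.g. "db.events"); each part is
    quoted separately. Column names are quoted whole, so nested columns
    such as "a.b" are NOT handled by this sketch.
    """
    if not columns:
        raise ValueError("at least one clustering column is required")
    qualified = ".".join(quote_ident(part) for part in table.split("."))
    cols = ", ".join(quote_ident(col) for col in columns)
    return f"ALTER TABLE {qualified} CLUSTER BY ({cols})"


if __name__ == "__main__":
    # Hypothetical unpartitioned Delta table and clustering columns.
    print(cluster_by_statement("events", ["event_date", "user_id"]))
    # With a SparkSession configured for Delta (assumed, not shown here):
    #   spark.sql(cluster_by_statement("events", ["event_date", "user_id"]))
```

With a Delta-enabled `SparkSession`, the resulting string would be passed to `spark.sql(...)`; per the description above, running it against an unpartitioned table is what triggers the in-place migration.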

Resolves #2460

How was this patch tested?

New unit tests.

Does this PR introduce any user-facing changes?

No

@zedtang zedtang changed the title Support in-place migration from unpartitioned table to clustered table. [Spark] Support in-place migration from unpartitioned table to clustered table May 29, 2024
@zedtang zedtang self-assigned this May 29, 2024
Contributor

@dabao521 dabao521 left a comment

Thanks for the work! Left a few comments, and let's add more validations in the test.

@zedtang zedtang requested a review from dabao521 May 30, 2024 19:31
Contributor

@dabao521 dabao521 left a comment

LGTM!

@vkorukanti vkorukanti merged commit 56ce212 into delta-io:master May 31, 2024
10 checks passed
@zedtang zedtang deleted the in-place-migration branch May 31, 2024 15:33
richardc-db pushed a commit to richardc-db/delta that referenced this pull request Jun 5, 2024
…red table (delta-io#3174)
Successfully merging this pull request may close these issues.

[Feature Request] In-place migration from unpartitioned table to clustered table
3 participants