feat: First version of rest catalog. #78

liurenjie1024 · 2023-10-13T03:29:29Z

In this PR we add initial support for the REST catalog, implementing the simple REST APIs.

Complex APIs such as create table, update table, and commits will be added in following PRs so that each PR stays a reasonable size.

related: #60

liurenjie1024 · 2023-10-13T03:30:10Z

cc @JanKaul @Xuanwo @Fokko @ZENOTME PTAL

crates/iceberg/Cargo.toml

crates/iceberg/src/catalog/rest.rs

crates/iceberg-rest/Cargo.toml

Xuanwo

Looks good to me!

crates/catalog/rest/src/catalog.rs

Fokko · 2023-10-18T11:12:19Z

crates/catalog/rest/src/catalog.rs

+    /// Update a table to the catalog.
+    async fn update_table(&self, _table: &TableIdent, _commit: TableCommit) -> Result<Table> {
+        todo!()
+    }
+
+    /// Update multiple tables to the catalog as an atomic operation.
+    async fn update_tables(&self, _tables: &[(TableIdent, TableCommit)]) -> Result<()> {
+        todo!()
+    }


I'm not sure if this is the way that we want to expose update tables. It is important that the right updates and requirements are set, otherwise, race conditions might occur. Also, there is little validation here, for example, can you do two distinct schema changes (so two updates), in a single commit? I would maybe leave this out for now so we can decide later on.

We might want to introduce an API similar to PyIceberg's (which is inspired by Java): https://py.iceberg.apache.org/api/#schema-evolution
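To make the concern about updates and requirements concrete, here is a minimal sketch of a commit that is only applied when its preconditions still hold. It is modeled loosely on the `assert-current-schema-id` style of requirement in the Iceberg REST specification; `TableState`, `Requirement`, `Update`, and `commit` are hypothetical names for illustration and are not the types in this PR.

```rust
/// Hypothetical, simplified view of the table state held by a catalog.
#[derive(Debug)]
struct TableState {
    current_schema_id: i32,
}

/// A precondition that must hold before a commit is applied (illustrative only).
#[derive(Debug, Clone, Copy)]
enum Requirement {
    CurrentSchemaIdIs(i32),
}

/// A change to apply once every requirement has passed (illustrative only).
#[derive(Debug, Clone, Copy)]
enum Update {
    SetCurrentSchema(i32),
}

/// Checks all requirements first, then applies the updates.
/// A stale writer fails the check instead of silently overwriting newer state.
fn commit(
    state: &mut TableState,
    requirements: &[Requirement],
    updates: &[Update],
) -> Result<(), String> {
    for req in requirements {
        match *req {
            Requirement::CurrentSchemaIdIs(expected) => {
                if state.current_schema_id != expected {
                    return Err(format!(
                        "requirement failed: expected schema id {expected}, found {}",
                        state.current_schema_id
                    ));
                }
            }
        }
    }
    for update in updates {
        match *update {
            Update::SetCurrentSchema(id) => state.current_schema_id = id,
        }
    }
    Ok(())
}
```

If two writers both start from schema id 0, the first commit succeeds and the second is rejected because its requirement no longer matches. A commit sent without the requirement would be applied unconditionally, which is the race described above.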

liurenjie1024 · 2023-10-19

Yes, I agree. It should be called by the transaction API rather than by users directly. The main challenge is limiting its visibility rather than exposing it directly to users. I will figure out a way when actually implementing it.

As I said in the PR description, the goal of this PR is to implement the simple APIs. The others will be left to following PRs.
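One possible way to limit visibility, sketched under assumptions rather than taken from this PR: keep the commit payload's fields private so that only a transaction-style builder in the same module can construct it. The module `transaction_sketch`, the `Transaction` builder, and its `add_column` method are hypothetical; only the name `TableCommit` appears in the diff above, and its real definition may differ.

```rust
mod transaction_sketch {
    /// Hypothetical commit payload. The field is private, so code outside
    /// this module cannot assemble a commit by hand.
    pub struct TableCommit {
        updates: Vec<String>,
    }

    impl TableCommit {
        /// Read-only access for whoever applies the commit.
        pub fn updates(&self) -> &[String] {
            &self.updates
        }
    }

    /// Hypothetical user-facing builder; the only way to obtain a `TableCommit`.
    pub struct Transaction {
        updates: Vec<String>,
    }

    impl Transaction {
        pub fn new() -> Self {
            Self { updates: Vec::new() }
        }

        /// Records an intent; a real API would validate it and attach requirements here.
        pub fn add_column(mut self, name: &str) -> Self {
            self.updates.push(format!("add-column:{name}"));
            self
        }

        /// Consumes the builder and produces the commit payload.
        pub fn into_commit(self) -> TableCommit {
            TableCommit { updates: self.updates }
        }
    }
}
```

With this shape, callers write something like `Transaction::new().add_column("dt").into_commit()` and never touch the raw update list, which leaves room to add validation in one place later.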

Fokko · 2023-10-28

Thank you! Just some context around the concern: within Iceberg, when a new table is created, the field IDs are reassigned (python, java). This can lead to confusion and potentially also to errors. Therefore we discourage users from assigning (or reassigning) IDs as much as possible.

tbl = table.create_table(
    Schema(
        Field(1, "id", IntegerType()),
        Field(2, "str", StringType())
    )
)
tbl.set_schema(
    Schema(
        Field(1, "id", IntegerType()),
        Field(3, "dt", DateType()),
        Field(2, "str", StringType())
    )
)

Because the IDs are re-assigned, it will evolve from:

Schema {
    1: id
    2: str
}

to:

Schema {
    1: id
    2: dt
    3: str
}

And this will corrupt the table since there is no valid promotion from string to date. The general consensus is that users shouldn't care about the IDs themselves, and this should not be exposed to users through APIs. The API should handle it.
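The failure mode can be reproduced with a small Rust sketch. It assumes IDs are reassigned sequentially in declaration order, which matches the example above but simplifies what the real implementations do. `Field`, `field`, `reassign_ids`, and `changed_type_ids` are hypothetical helpers, not iceberg-rust APIs, and the check treats any type change as a conflict, ignoring the promotions Iceberg does allow.

```rust
/// Simplified stand-in for a schema field (not the iceberg-rust type).
#[derive(Debug, Clone, PartialEq)]
struct Field {
    id: i32,
    name: &'static str,
    ty: &'static str,
}

/// Convenience constructor for the sketch.
fn field(id: i32, name: &'static str, ty: &'static str) -> Field {
    Field { id, name, ty }
}

/// Assumption: fresh IDs are handed out 1, 2, 3, ... in declaration order,
/// ignoring whatever IDs the caller supplied.
fn reassign_ids(fields: &[Field]) -> Vec<Field> {
    fields
        .iter()
        .enumerate()
        .map(|(i, f)| Field { id: i as i32 + 1, ..f.clone() })
        .collect()
}

/// Returns the IDs present in both schemas whose type differs.
fn changed_type_ids(old: &[Field], new: &[Field]) -> Vec<i32> {
    old.iter()
        .filter(|o| new.iter().any(|n| n.id == o.id && n.ty != o.ty))
        .map(|o| o.id)
        .collect()
}
```

Feeding in the two schemas from the example, the user-supplied `dt` field (ID 3) ends up with ID 2 after reassignment, so `changed_type_ids` reports ID 2 as changing from `string` to `date`. Readers resolve columns by ID, so existing string data would then be interpreted as dates.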

crates/catalog/rest/src/catalog.rs

liurenjie1024 · 2023-10-20T14:54:11Z

cc @Fokko Any other comments?

Fokko

I think this is a great start. Thanks for working on it @liurenjie1024, and @Xuanwo @ZENOTME for the review!

liurenjie1024 added 3 commits October 7, 2023 11:15

feat: Add rest catalog

3ad5a4b

feat: Initial checkin of rest catalog

94a90b4

Add tests

68f82f2

liurenjie1024 requested review from Xuanwo, Fokko and JanKaul October 13, 2023 03:29

liurenjie1024 added 2 commits October 13, 2023 11:30

Fix typo

c7ba6e8

Fix

c3708bc

Xuanwo reviewed Oct 13, 2023

crates/iceberg/Cargo.toml (resolved)

crates/iceberg/src/catalog/rest.rs (outdated, resolved)

crates/iceberg/src/catalog/rest.rs (outdated, resolved)

crates/iceberg/src/catalog/rest.rs (outdated, resolved)

Fix comments

8d3fc97

ZENOTME reviewed Oct 13, 2023

crates/iceberg/src/catalog/rest.rs (outdated, resolved)

liurenjie1024 added 3 commits October 13, 2023 15:00

Fix comment

cb318a9

Move rest catalog to another crate

3a4f050

Remove unused deps

2165c89

liurenjie1024 mentioned this pull request Oct 16, 2023

Manage dependencies using workspace. #24

Closed

Xuanwo reviewed Oct 16, 2023

crates/iceberg-rest/Cargo.toml (outdated, resolved)

liurenjie1024 added 2 commits October 16, 2023 17:22

Rename to iceberg-catalog-rest

6e51f1d

Fix

91ea876

Xuanwo approved these changes Oct 16, 2023

ZENOTME approved these changes Oct 17, 2023

Fokko reviewed Oct 18, 2023

liurenjie1024 added 2 commits October 19, 2023 10:53

Fix comments

d42920d

Fix comments

1c2bac8

Fokko approved these changes Oct 28, 2023

Fokko merged commit f17bf30 into apache:main Oct 28, 2023
6 checks passed

Fokko mentioned this pull request Apr 24, 2024

Tracking issues of iceberg-rust v0.3.0 #348

Closed

73 tasks
