
GPU accelerate Apache Iceberg reads #5941

Merged Jul 26, 2022 (45 commits)

Conversation

@jlowe (Contributor) commented Jul 1, 2022

Closes #4817, closes #5453, and closes #5510.

Adds basic support for GPU acceleration of Apache Iceberg table reads, along with a document detailing the limitations of that support, and tests. The tests exercise the usual, generic table-reading paths but also cover features more specific to Apache Iceberg, such as time-travel reads, incremental snapshot reads, partitioning schema evolution, and row deletions and updates.
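
As an illustration of the Iceberg-specific behavior the tests exercise, a time-travel read with Iceberg 0.13.x can be expressed through the DataFrame reader's snapshot options. This is a hedged sketch, not code from this PR; the table name and id/timestamp values are placeholders.

```scala
// Assumes an existing SparkSession `spark` with the Iceberg runtime and the
// RAPIDS Accelerator on the classpath; "db.my_table" and the literal values
// below are placeholders.
val latest = spark.read.format("iceberg").load("db.my_table")

// Time-travel to a specific snapshot id.
val asOfSnapshot = spark.read
  .format("iceberg")
  .option("snapshot-id", 10963874102873L)
  .load("db.my_table")

// Time-travel to the snapshot current as of a timestamp (millis since epoch).
val asOfTime = spark.read
  .format("iceberg")
  .option("as-of-timestamp", 1656633600000L)
  .load("db.my_table")
```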

Only the Parquet data format is supported in this initial version, and it provides only a per-file reader strategy for GPU acceleration. Multi-threaded and coalescing reader strategies are planned for the future.
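
For context, a rough sketch of how a user might opt in. The Iceberg config names here are my assumption, based on the plugin's existing spark.rapids.sql.format.* convention; check the documentation added by this PR for the authoritative names.

```scala
// Assumed config names following the spark.rapids.sql.format.* pattern.
spark.conf.set("spark.rapids.sql.format.iceberg.enabled", "true")
spark.conf.set("spark.rapids.sql.format.iceberg.read.enabled", "true")

// The analogous Parquet setting that selects the per-file reader strategy
// (this one appears in the premerge test log later in this thread).
spark.conf.set("spark.rapids.sql.format.parquet.reader.type", "PERFILE")
```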

This supports Apache Iceberg 0.13.x and leverages the Iceberg API and core code provided by whatever Iceberg jar the user supplies, on the assumption that those APIs are relatively stable over time. Related Apache Iceberg code for Parquet and Spark has been adapted for use within the RAPIDS Accelerator, as those interfaces are less likely to remain stable across Apache Iceberg versions. Reflection is used to port the relevant CPU scan state into an equivalent GPU-accelerated scan.
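
A minimal sketch, not the PR's actual code, of the reflection approach described above: pulling private state out of the CPU scan so an equivalent GPU scan can be constructed. The class and field names in the usage comment are illustrative assumptions.

```scala
import java.lang.reflect.Field

object ScanReflection {
  // Walk the class hierarchy to find a declared (possibly private) field and
  // return its value from the given object.
  def getField[T](obj: AnyRef, name: String): T = {
    var clazz: Class[_] = obj.getClass
    while (clazz != null) {
      try {
        val field: Field = clazz.getDeclaredField(name)
        field.setAccessible(true)
        return field.get(obj).asInstanceOf[T]
      } catch {
        case _: NoSuchFieldException => clazz = clazz.getSuperclass
      }
    }
    throw new NoSuchFieldException(s"$name not found on ${obj.getClass}")
  }
}

// Hypothetical usage: recover the Iceberg Table and pushed-down filters from a
// CPU scan node ("table" and "filterExpressions" are assumed field names).
// val table   = ScanReflection.getField[org.apache.iceberg.Table](cpuScan, "table")
// val filters = ScanReflection.getField[java.util.List[_]](cpuScan, "filterExpressions")
```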

Metadata queries and processing remain on the CPU, as they involve parsing relatively tiny JSON files that are well suited to CPU consumption. The data is read via the existing Parquet partition reader after row-group filtering and predicate pushdown have been applied.

@abellina (Collaborator) previously approved these changes Jul 18, 2022


This looks good. The new fallback code looks good as well. +1

@firestarman (Collaborator)

LGTM

@jlowe (Contributor, Author) commented Jul 20, 2022

Rebuilding CI to ensure Iceberg tests run after #6020.

@jlowe (Contributor, Author) commented Jul 20, 2022

build

@jlowe (Contributor, Author) commented Jul 21, 2022

build

@firestarman (Collaborator)

A strange error failed premerge; retrying it.

[2022-07-22T00:28:03.383Z] >               raise converted from None
[2022-07-22T00:28:03.384Z] E               pyspark.sql.utils.AnalysisException: java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwxr-xr-x
[2022-07-22T00:28:03.384Z] 
[2022-07-22T00:28:03.384Z] ../../../.download/spark-3.1.1-bin-hadoop3.2/python/pyspark/sql/utils.py:117: AnalysisException

@firestarman (Collaborator)

build

@pxLi (Member) commented Jul 22, 2022

The last premerge run failed in orc_test:

pyspark.sql.utils.AnalysisException: java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwxr-xr-x

There should be no permission issue in the premerge environment (Docker); not sure if this is a side effect of something else.

[2022-07-22T00:28:03.383Z] _ test_read_round_trip[-{'spark.rapids.sql.format.orc.reader.type': 'PERFILE'}-read_orc_sql-[Byte, Short, Integer, Long, Float, Double, String, Boolean, Date, Timestamp, Decimal(7,3), Decimal(12,2), Decimal(20,2)]] _
[2022-07-22T00:28:03.383Z] [gw1] linux -- Python 3.8.13 /usr/bin/python
[2022-07-22T00:28:03.383Z] 
[2022-07-22T00:28:03.383Z] spark_tmp_path = '/tmp/pyspark_tests//premerge-ci-2-jenkins-rapids-premerge-github-5165-79f8r-hwhvf-gw1-21368-140972398/'
[2022-07-22T00:28:03.383Z] orc_gens = [Byte, Short, Integer, Long, Float, Double, ...]
[2022-07-22T00:28:03.383Z] read_func = <function read_orc_sql at 0x7f00f4971d30>
[2022-07-22T00:28:03.383Z] reader_confs = {'spark.rapids.sql.format.orc.reader.type': 'PERFILE'}
[2022-07-22T00:28:03.383Z] v1_enabled_list = ''
[2022-07-22T00:28:03.383Z] 
[2022-07-22T00:28:03.383Z]     @pytest.mark.order(2)
[2022-07-22T00:28:03.383Z]     @pytest.mark.parametrize('orc_gens', orc_gens_list, ids=idfn)
[2022-07-22T00:28:03.383Z]     @pytest.mark.parametrize('read_func', [read_orc_df, read_orc_sql])
[2022-07-22T00:28:03.383Z]     @pytest.mark.parametrize('reader_confs', reader_opt_confs, ids=idfn)
[2022-07-22T00:28:03.383Z]     @pytest.mark.parametrize('v1_enabled_list', ["", "orc"])
[2022-07-22T00:28:03.383Z]     def test_read_round_trip(spark_tmp_path, orc_gens, read_func, reader_confs, v1_enabled_list):
[2022-07-22T00:28:03.383Z]         gen_list = [('_c' + str(i), gen) for i, gen in enumerate(orc_gens)]
[2022-07-22T00:28:03.383Z]         data_path = spark_tmp_path + '/ORC_DATA'
[2022-07-22T00:28:03.383Z]         with_cpu_session(
[2022-07-22T00:28:03.383Z]                 lambda spark : gen_df(spark, gen_list).write.orc(data_path))
[2022-07-22T00:28:03.383Z]         all_confs = copy_and_update(reader_confs, {'spark.sql.sources.useV1SourceList': v1_enabled_list})
[2022-07-22T00:28:03.383Z] >       assert_gpu_and_cpu_are_equal_collect(
[2022-07-22T00:28:03.383Z]                 read_func(data_path),
[2022-07-22T00:28:03.383Z]                 conf=all_confs)
[2022-07-22T00:28:03.383Z] 
[2022-07-22T00:28:03.383Z] ../../src/main/python/orc_test.py:142: 
[2022-07-22T00:28:03.383Z] _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
[2022-07-22T00:28:03.383Z] ../../src/main/python/asserts.py:508: in assert_gpu_and_cpu_are_equal_collect
[2022-07-22T00:28:03.383Z]     _assert_gpu_and_cpu_are_equal(func, 'COLLECT', conf=conf, is_cpu_first=is_cpu_first)
[2022-07-22T00:28:03.383Z] ../../src/main/python/asserts.py:427: in _assert_gpu_and_cpu_are_equal
[2022-07-22T00:28:03.383Z]     run_on_cpu()
[2022-07-22T00:28:03.383Z] ../../src/main/python/asserts.py:413: in run_on_cpu
[2022-07-22T00:28:03.383Z]     from_cpu = with_cpu_session(bring_back, conf=conf)
[2022-07-22T00:28:03.383Z] ../../src/main/python/spark_session.py:115: in with_cpu_session
[2022-07-22T00:28:03.383Z]     return with_spark_session(func, conf=copy)
[2022-07-22T00:28:03.383Z] ../../src/main/python/spark_session.py:99: in with_spark_session
[2022-07-22T00:28:03.383Z]     ret = func(_spark)
[2022-07-22T00:28:03.383Z] ../../src/main/python/asserts.py:201: in <lambda>
[2022-07-22T00:28:03.383Z]     bring_back = lambda spark: limit_func(spark).collect()
[2022-07-22T00:28:03.383Z] ../../src/main/python/orc_test.py:31: in <lambda>
[2022-07-22T00:28:03.383Z]     return lambda spark : spark.sql('select * from orc.`{}`'.format(data_path))
[2022-07-22T00:28:03.383Z] ../../../.download/spark-3.1.1-bin-hadoop3.2/python/pyspark/sql/session.py:723: in sql
[2022-07-22T00:28:03.383Z]     return DataFrame(self._jsparkSession.sql(sqlQuery), self._wrapped)
[2022-07-22T00:28:03.383Z] /home/jenkins/agent/workspace/jenkins-rapids_premerge-github-5165-ci-2/.download/spark-3.1.1-bin-hadoop3.2/python/lib/py4j-0.10.9-src.zip/py4j/java_gateway.py:1304: in __call__
[2022-07-22T00:28:03.383Z]     return_value = get_return_value(
[2022-07-22T00:28:03.383Z] _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
[2022-07-22T00:28:03.383Z] 
[2022-07-22T00:28:03.383Z] a = ('xro13549', <py4j.java_gateway.GatewayClient object at 0x7f00cb751430>, 'o62', 'sql')
[2022-07-22T00:28:03.383Z] kw = {}
[2022-07-22T00:28:03.383Z] converted = AnalysisException('java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable. Current per...e.spark.sql.hive.HiveExternalCatalog.withClient(HiveExternalCatalog.scala:102)\n\t... 74 more\n', JavaObject id=o13550)
[2022-07-22T00:28:03.383Z] 
[2022-07-22T00:28:03.383Z]     def deco(*a, **kw):
[2022-07-22T00:28:03.383Z]         try:
[2022-07-22T00:28:03.383Z]             return f(*a, **kw)
[2022-07-22T00:28:03.383Z]         except py4j.protocol.Py4JJavaError as e:
[2022-07-22T00:28:03.383Z]             converted = convert_exception(e.java_exception)
[2022-07-22T00:28:03.383Z]             if not isinstance(converted, UnknownException):
[2022-07-22T00:28:03.383Z]                 # Hide where the exception came from that shows a non-Pythonic
[2022-07-22T00:28:03.383Z]                 # JVM exception message.
[2022-07-22T00:28:03.383Z] >               raise converted from None
[2022-07-22T00:28:03.384Z] E               pyspark.sql.utils.AnalysisException: java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable. Current permissions are: rwxr-xr-x

@firestarman (Collaborator) commented Jul 25, 2022

Hi @jlowe,
It seems the Iceberg tests do not run in premerge. Do we need them to?

@jlowe (Contributor, Author) commented Jul 25, 2022

It seems the Iceberg tests do not run in premerge. Do we need them to?

I avoided adding them to premerge since the Iceberg tests currently run serially and would slow down premerge. I was going to file a followup to address this, but if you feel it should be part of premerge, I'm happy to update the PR.

@jlowe (Contributor, Author) commented Jul 25, 2022

build

@jlowe (Contributor, Author) commented Jul 25, 2022

build

@jlowe (Contributor, Author) commented Jul 25, 2022

build

@firestarman (Collaborator) commented Jul 26, 2022

if you feel it should be part of premerge, I'm happy to update the PR

A separate issue is good for me, and we can have more discussion about whether to add it.

Labels: feature request (New feature or request)
Projects: None yet
7 participants