Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Disable write/read Parquet when Parquet field IDs are used #4882

Merged
merged 5 commits into from
Mar 7, 2022

Conversation

res-life
Copy link
Collaborator

@res-life res-life commented Mar 1, 2022

Temporarily disable write/read parquet when schema has a specified Parquet field ID
Solution is: Fallback to CPU when writing Parquet with field id column. Fallback to CPU if
"spark.sql.parquet.fieldId.read.enabled" is true when reading Parquet.
Contributes #4846

Signed-off-by: Chong Gao res_life@163.com

…rquet field ID

Signed-off-by: Chong Gao <res_life@163.com>
@jlowe jlowe changed the title Temporarily disable write/read parquet when schema has a specified Pa… Disable write/read Parquet when Parquet field IDs are used Mar 1, 2022
@jlowe jlowe added this to the Feb 28 - Mar 18 milestone Mar 1, 2022
@sameerz sameerz added the audit_3.3.0 Audit related tasks for 3.3.0 label Mar 1, 2022
@firestarman
Copy link
Collaborator

build

1 similar comment
@sameerz
Copy link
Collaborator

sameerz commented Mar 4, 2022

build

@res-life res-life merged commit d30adec into NVIDIA:branch-22.04 Mar 7, 2022
@res-life res-life deleted the tag-parquet-field-id branch March 13, 2022 05:07
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
audit_3.3.0 Audit related tasks for 3.3.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants