
[FEA] Finish the JSON test matrix #10491

Open
6 of 41 tasks
revans2 opened this issue Feb 23, 2024 · 3 comments
Labels
test Only impacts tests

Comments

@revans2
Collaborator

revans2 commented Feb 23, 2024

Is your feature request related to a problem? Please describe.

#10490 adds the beginnings of a test matrix for all JSON processing. It is a good start, but it is not done. We are still missing a number of things.

Confs/Options

  • maxNestingDepth - added in 3.5.0, this sets the maximum nesting depth of JSON before it is considered invalid. The default value is 1000, but in my tests with from_json it started to fail at a depth of 255. I don't know what the limit is for get_json_object. I also don't know if we want to go for the full 1000, or if we are okay with a smaller depth. (A small probe sketch follows this list.)
  • maxNumLen - also added in 3.5.0, this is the maximum length of a number. It looks like we go way beyond the default of 1000, so we are probably okay, but it would still be good to have some kind of test.
  • maxStringLen - also added in 3.5.0. The default here is 20,000,000 (and I think that is chars, not bytes). In my tests we go way beyond this, but it would be good to add some tests here too.
  • locale - locale appears to impact date, timestamp, and decimal parsing. We have tests written and issues filed around decimal. We have not done the same for date/timestamp because there are issues with the date/timestamp format settings, which when fixed should hopefully mask this (it really only impacts formats that we do not support). Tests that we fall back would still be good, though.
  • parseMode - we only support permissive mode, but we should have unified tests for this.
  • columnNameOfCorruptRecord - this is for capturing input that didn't parse properly. It only matters if a column with this name shows up in the read schema. We have some tests, but we should have unified tests.
  • timeZone - we have some tests, but we should look at putting them in the test matrix.
  • dateFormat - again we have some tests, but they are not unified.
  • timestampFormat - again we have some tests, but they are not unified.
  • timestampNTZFormat - again we have some tests, but they are not unified.
  • enableDateTimeParsingFallback - this also needs to be combined with testing of the LEGACY parsing configs.
  • lineSep/encoding - we have very little testing here, but for the most part it is ignored except for ScanJson, so it is probably okay.
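
As a reference point for the maxNestingDepth item above, here is a minimal pyspark sketch (plain Spark, not the plugin's integration-test harness; the depths and the shallow read schema are just illustrative choices) of the kind of probe used to find where from_json stops parsing:

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import from_json, col

spark = SparkSession.builder.getOrCreate()

def nested_json(depth):
    # Builds {"a":{"a": ... {"a":1} ... }} nested `depth` levels deep.
    return '{"a":' * depth + "1" + "}" * depth

# In PERMISSIVE mode an unparseable record comes back as nulls, so a null result
# at a given depth tells us the parser gave up there.
for depth in (10, 100, 255, 1000):
    df = spark.createDataFrame([(nested_json(depth),)], "json_str string")
    parsed = df.select(from_json(col("json_str"), "a string").alias("p")).head().p
    print(depth, parsed)
```

The same shape of probe should work for get_json_object by swapping the select expression.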

Input Data Types/Formats:

  • dictionary of double quoted strings
  • dictionary of single quoted strings
  • dictionary of ints
  • array of double quoted strings i.e. {"data":[...]}
  • array of single quoted strings
  • array of integer format
  • top level array of double quoted strings i.e. [...]
  • top level array of single quoted strings
  • top level array of integer format
  • garbage at the end of the input lines
  • date formatted string for different locales
  • timestamp formatted string for different locales
  • extra white space in between fields
  • leading zeros on numbers
  • repeated columns (Add tests for repeated JSON columns/keys #11362)
  • missing columns
  • \r\n whitespace in strings
  • escaped chars in strings (need to test chars that can be encoded 3 ways: a \uXXXX escape, a backslash escape, and the regular character); a small sketch of generating a few of these inputs follows this list
  • deeply nested/mixed data. These should include things that would map to the various map types
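
To make a few of the formats above concrete, here is a hedged pyspark-only sketch (the file path, column names, and specific lines are illustrative, and this is not the plugin's data generator) that writes a handful of these shapes to a file and reads them back with an explicit schema so the corrupt-record handling is visible:

```python
import os
import tempfile
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

lines = [
    '{"data": "plain double quoted"}',
    "{'data': 'single quoted'}",                  # allowSingleQuotes is on by default
    '{"data": 007}',                              # leading zeros on numbers
    '{"data": "\\u0041 vs \\" vs A"}',            # \uXXXX escape, backslash escape, regular char
    '{"data": "trailing garbage"} extra tokens',  # garbage at the end of the input line
]

path = os.path.join(tempfile.mkdtemp(), "inputs.json")
with open(path, "w") as f:
    f.write("\n".join(lines))

# Explicit read schema including the default corrupt-record column name.
df = spark.read.schema("data string, _corrupt_record string").json(path)
df.show(truncate=False)
```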

Data Types (note that these only apply to from_json and ScanJson because the others don't take a full read schema):

All nested types should be tested both at the top level and as a child (data) column. If we don't support those types yet, then we should verify that we fall back to the CPU as expected. A short sketch of what that looks like follows the list.

  • date
  • timestamp
  • map<string,string>
  • map<string,int>
  • map<string,decimal(38,0)>
  • array<string>
  • array<long>
  • array<decimal(38,0)>
  • map<string,map<string,string>>
  • map<string,array<string>>
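
A minimal sketch, again assuming plain pyspark and illustrative column names, of exercising one of the nested types from the list both as the entire read schema and as a child data column through from_json:

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import from_json, col

spark = SparkSession.builder.getOrCreate()

top_level_json = '{"k1": "v1", "k2": "v2"}'
child_json = '{"data": {"k1": "v1", "k2": "v2"}}'

df = spark.createDataFrame([(top_level_json, child_json)], "top string, child string")

result = df.select(
    # map<string,string> as the entire read schema
    from_json(col("top"), "map<string,string>").alias("top_map"),
    # the same map nested under a `data` column
    from_json(col("child"), "data map<string,string>").alias("nested_map"),
)
result.show(truncate=False)
```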
@jihoonson
Collaborator

@revans2, this is a great list. I want to clarify a couple of things, though. As for the local config, did you mean locale? I don't see a local config in JSONOptions. I don't see corruptedColumnName in either the Spark or the plugin repo. Where do you see this config?

@revans2
Collaborator Author

revans2 commented Feb 6, 2025

@jihoonson yes, I updated local to be locale.

corruptedColumnName was me misremembering the proper config name; the actual option is columnNameOfCorruptRecord: https://github.com/apache/spark/blob/e89b19f0d162ace3cda0fc4d05de0771216b69ad/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/json/JSONOptions.scala#L287

Thanks for catching my errors. I think I have updated this to be correct now.

@jihoonson
Collaborator

@revans2 I see. Thank you for updating those config names!
