[BUG] sort does not match spark for -0.0 and 0.0 #84

revans2 · 2020-05-29T19:45:58Z

Describe the bug
This is a really odd corner case, but spark will place all -0.0 entries as less than 0.0 entries. This should almost never show up in practice, but because our tests hit corner cases we end up seeing it.

cudf does not even expose a good way for us to extract the sign bit from a -0.0 to try and use that as a second order sort column. So this one might be fun to try and fix.

Steps/Code to reproduce bug
Create a dataframe that has both 0.0 and -0.0 values in it. The sort it.

Expected behavior
-0.0 values come before 0.0 values

revans2 · 2020-12-17T13:47:27Z

Spark actually changed behavior in 3.1.0

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>

revans2 added bug Something isn't working ? - Needs Triage Need team to review and classify SQL part of the SQL/Dataframe plugin labels May 29, 2020

revans2 removed the ? - Needs Triage Need team to review and classify label Jun 2, 2020

revans2 mentioned this issue Jun 2, 2020

[FEA] order by integration tests #100

Closed

revans2 mentioned this issue Jun 15, 2020

Updated join tests to cover more data. #176

Merged

revans2 mentioned this issue Jun 26, 2020

[BUG] -0.0 vs 0.0 is a hot mess #294

Open

revans2 mentioned this issue Jul 7, 2020

[REVIEW] Updated join tests for cache #286

Merged

5 tasks

sameerz added the P2 Not required for release label Aug 25, 2020

revans2 mentioned this issue Sep 17, 2020

Benchmark utility to perform diff of output from benchmark runs, allowing for precision differences #782

Merged

revans2 mentioned this issue Dec 16, 2020

Fix a lot of tests marked with xfail for Spark 3.1.0 that no longer fail #1412

Merged

revans2 closed this as completed in #1412 Dec 17, 2020

revans2 added this to the Dec 7 - Dec 18 milestone Dec 17, 2020

tgravescs pushed a commit to tgravescs/spark-rapids that referenced this issue Nov 30, 2023

Update submodule cudf to 3f175ce (NVIDIA#84)

421c46b

Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] sort does not match spark for -0.0 and 0.0 #84

[BUG] sort does not match spark for -0.0 and 0.0 #84

revans2 commented May 29, 2020

revans2 commented Dec 17, 2020

[BUG] sort does not match spark for -0.0 and 0.0 #84

[BUG] sort does not match spark for -0.0 and 0.0 #84

Comments

revans2 commented May 29, 2020

revans2 commented Dec 17, 2020