[BUG] test_cast_decimal_to_decimal[to:DecimalType(1,-1)-from:Decimal(5,-3)] fails with DATAGEN_SEED=1702439569 #10050

Closed
NVnavkumar opened this issue Dec 14, 2023 · 7 comments
Labels: bug (Something isn't working)

NVnavkumar (Collaborator) commented Dec 14, 2023

Describe the bug

```
FAILED ../../src/main/python/cast_test.py::test_cast_decimal_to_decimal[to:DecimalType(1,-1)-from:Decimal(5,-3)][DATAGEN_SEED=1702439569] - AssertionError: GPU and CPU are not both null at [994, 'a']
```

This was originally detected in the Scala 2.13 pre-merge check against Spark 3.3.0, but I was also able to reproduce it against Spark 3.1.1 with Scala 2.12 using the same seed value.
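
For reference, the CPU side of this can be reproduced directly against Spark's Decimal API; a minimal sketch (not part of the test itself, just the cast semantics) in a spark-shell:

```scala
// Sketch of the failing cast on CPU Spark: a Decimal(5,-3) value of -1E+3
// cast to DecimalType(1,-1). changePrecision reports whether the value fits
// the target type; a non-ANSI cast turns a failure into null.
import org.apache.spark.sql.types.Decimal

val d = Decimal(new java.math.BigDecimal("-1E+3"))
// Rescaling to scale -1 needs unscaled value -100 (3 digits), which
// exceeds the target precision of 1, so this returns false.
println(d.changePrecision(1, -1)) // false => the CPU cast produces null
```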

NVnavkumar (Collaborator, Author) commented Dec 14, 2023

The failure also showed up in the Databricks 11.3 nightly integration-test run, so this very likely affects all shims.

andygrove (Contributor) commented:

Input is -1E+3: the CPU produces None while the GPU produces Decimal('-1.00E+3').

```diff
-Row(a=None, a=Decimal('-1E+3'))
+Row(a=Decimal('-1.00E+3'), a=Decimal('-1E+3'))
```

mattahrens removed the ? - Needs Triage label Dec 18, 2023
razajafri (Collaborator) commented:

We are returning what cudf returns:

```scala
scala> import ai.rapids.cudf.{ColumnVector, DType}

// cudf's decimal scale is the base-10 exponent of the unscaled value, so
// DECIMAL32 with scale 1 corresponds to Spark's DecimalType scale -1.
scala> val vec = ColumnVector.fromDecimals(new java.math.BigDecimal("-1E+3"))
scala> val casted = vec.castTo(DType.create(DType.DTypeEnum.DECIMAL32, 1))
casted: ai.rapids.cudf.ColumnVector = ColumnVector{rows=1, type=DECIMAL32 scale:1, nullCount=Optional.empty, offHeap=(ID: 88 7f36afa50c60)}
scala> println(casted.copyToHost.getBigDecimal(0))
-1.00E+3
```

This makes it obvious that the part of our code that determines whether a value is out of bounds is broken. I believe that is the checkNFixDecimalBounds method.
Spark thinks we are overflowing but cudf thinks we aren't. I would look at that part of Spark's code and mirror it on our side to make the behavior compatible.
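
For illustration, a hedged sketch of the kind of precision check Spark performs and that the plugin-side bounds check needs to mirror (the helper name is made up; this is not the actual checkNFixDecimalBounds code):

```scala
import java.math.{BigDecimal => JBigDecimal, RoundingMode}

// Illustrative helper: true only if `value` fits DECIMAL(precision, scale).
// Rescale to the target scale first (Spark rounds HALF_UP), then check the
// digit count against the declared precision -- the step cudf has no notion of.
def fitsInDecimal(value: JBigDecimal, precision: Int, scale: Int): Boolean = {
  val rescaled = value.setScale(scale, RoundingMode.HALF_UP)
  rescaled.precision <= precision
}

// For the failing row: fitsInDecimal(new JBigDecimal("-1E+3"), 1, -1)
// is false, since the rescaled unscaled value -100 has 3 digits.
```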

razajafri added the ? - Needs Triage label Dec 20, 2023
andygrove removed their assignment Dec 20, 2023
razajafri self-assigned this Dec 20, 2023
razajafri removed the ? - Needs Triage label Dec 20, 2023
razajafri (Collaborator) commented Dec 20, 2023

I know what's happening. The precision required to represent the value correctly at the target scale (3) is greater than the target precision (1).
cudf has no concept of precision, so this isn't a problem there, but Spark does not allow a value to exceed its declared precision.

I will put up a PR for this.
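
A worked illustration of that precision arithmetic with plain java.math.BigDecimal (values taken from the failing row):

```scala
val v = new java.math.BigDecimal("-1E+3")
println(v.precision)   // 1: unscaled value -1 at scale -3
val r = v.setScale(-1) // exact rescale, no rounding needed
println(r)             // -1.00E+3, the value cudf (and the GPU) returned
println(r.precision)   // 3: unscaled value -100 needs 3 digits > target precision 1
```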

razajafri (Collaborator) commented Jan 10, 2024

Will be fixed by rapidsai/cudf#14731

razajafri (Collaborator) commented:

Will test tomorrow and then close this.

razajafri (Collaborator) commented:

Will have to hold off on testing this until the JNI nightly build is kicked off again, since it doesn't yet include the changes from rapidsai/cudf#14731.
