[BUG] Poor handling of boolean expressions in WHERE clauses #3266

Swiddis · 2025-01-27T20:17:35Z

What is the bug?

This issue covers three related problems with booleans in WHERE clauses. I suspect, without proof, that they share a root cause. All examples use the ecommerce sample dataset as the index, but they should be reproducible with any index.

How can one reproduce the bug?

WHERE FALSE matches all records.

Query:

POST _plugins/_sql
{
  "query": "SELECT COUNT(*) FROM opensearch_dashboards_sample_data_ecommerce WHERE FALSE"
}

Result:

{
  "schema": [
    {
      "name": "COUNT(*)",
      "type": "integer"
    }
  ],
  "datarows": [
    [
      4675 // should be 0
    ]
  ],
  "total": 1,
  "size": 1,
  "status": 200
}

WHERE NOT(x) causes an error for any constant boolean.

Query:

POST _plugins/_sql
{
  "query": "SELECT * FROM opensearch_dashboards_sample_data_ecommerce WHERE NOT(FALSE)"
}

Result:

{
  "error": {
    "reason": "Invalid SQL query",
    "details": "inner bool query clause cannot be null",
    "type": "IllegalArgumentException"
  },
  "status": 400
}

The same happens with WHERE NOT(TRUE), WHERE NOT(NOT(FALSE)), WHERE NOT(FALSE OR TRUE), and similar.

Internal Druid exceptions are raised when building expressions with NOT(NULL).

Query:

POST _plugins/_sql
{
  "query": "SELECT * FROM opensearch_dashboards_sample_data_ecommerce WHERE TRUE = NOT(NULL)"
}

Result:

{
  "error": {
    "reason": "Invalid SQL query",
    "details": "err find condition class com.alibaba.druid.sql.ast.expr.SQLBooleanExpr",
    "type": "SqlParseException"
  },
  "status": 400
}

NOT(NULL) is equivalent to NULL, so this whole expression should be equivalent to WHERE NULL and return no records¹.

What is the expected behavior?
These expressions should be correctly evaluated and applied. They're somewhat weird examples for human-written queries, but automatic query builders may produce queries like this, especially for WHERE FALSE.
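As a rough cross-check of the expected results, the same constant predicates can be run against a different SQL engine. The sketch below uses Python's built-in `sqlite3` module as a stand-in reference; the `orders` table and its three rows are hypothetical, and SQLite is only an assumption about what "correct" looks like, not the plugin's own engine. It also assumes a SQLite build that recognizes the `TRUE` and `FALSE` keywords (3.23.0 or later).

```python
import sqlite3

# In-memory stand-in table; the name and rows are hypothetical,
# not the ecommerce index from the report.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY)")
conn.executemany("INSERT INTO orders (id) VALUES (?)", [(1,), (2,), (3,)])


def count_where(predicate: str) -> int:
    """Count rows matching a predicate (trusted literal strings only)."""
    query = f"SELECT COUNT(*) FROM orders WHERE {predicate}"
    return conn.execute(query).fetchone()[0]


# The constant predicates from the reproduction steps above.
predicates = ("FALSE", "NOT(FALSE)", "NOT(TRUE)", "TRUE = NOT(NULL)")
results = {p: count_where(p) for p in predicates}

for predicate, count in results.items():
    print(f"WHERE {predicate}: {count} of 3 rows")
```

Under those assumptions, `WHERE FALSE`, `WHERE NOT(TRUE)`, and `WHERE TRUE = NOT(NULL)` match no rows, and `WHERE NOT(FALSE)` matches all three.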

What is your host/environment?

SQL: 0e61d20

Do you have any screenshots?
N/A

Do you have any additional context?
Found by the WIP distributed-testing suite. See: #3220

¹ It's worth noting that SQL really uses ternary logic: NULL sits alongside TRUE and FALSE, and the three values generate their own truth tables. This is (for better or for worse) pretty fundamental to SQL's operation and is the principle that TLP (Ternary Logic Partitioning) is built on. As such, in boolean handling, we should really treat NULL as a bona fide boolean.
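The ternary truth tables can be written out in a few lines. This is a minimal sketch that models SQL NULL as Python's `None`; the helper names (`not3`, `and3`, `or3`, `eq3`, `where_keeps`) are made up for illustration and are not part of the plugin.

```python
from typing import Optional

Tri = Optional[bool]  # None stands in for SQL NULL


def not3(a: Tri) -> Tri:
    """NOT: NULL stays NULL."""
    return None if a is None else not a


def and3(a: Tri, b: Tri) -> Tri:
    """AND: FALSE dominates, then NULL."""
    if a is False or b is False:
        return False
    if a is None or b is None:
        return None
    return True


def or3(a: Tri, b: Tri) -> Tri:
    """OR: TRUE dominates, then NULL."""
    if a is True or b is True:
        return True
    if a is None or b is None:
        return None
    return False


def eq3(a: Tri, b: Tri) -> Tri:
    """Equality: any NULL operand makes the result NULL."""
    return None if a is None or b is None else a == b


def where_keeps(p: Tri) -> bool:
    """A WHERE clause keeps a row only when the predicate is TRUE."""
    return p is True


# TRUE = NOT(NULL) evaluates to NULL, so WHERE drops the row.
null_case = eq3(True, not3(None))
print(null_case, where_keeps(null_case))

# The partition TLP relies on: for any predicate value p, exactly one of
# "p", "NOT p", and "p IS NULL" selects the row.
for p in (True, False, None):
    assert sum([where_keeps(p), where_keeps(not3(p)), p is None]) == 1
```

The final loop is the property that lets a query be split into three partitions whose results add up to the unfiltered result.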

Swiddis · 2025-01-27T21:45:02Z

Another parsing exception happens if you compare a non-constant expression with NOT(_). Here's a query that uses a slightly roundabout way of expressing "find all orders that are either bulk orders of cheap products or single orders of expensive products" (there is probably another context where this kind of construction is more natural):

POST _plugins/_sql
{
  "query": "SELECT * FROM opensearch_dashboards_sample_data_ecommerce WHERE (total_quantity > 1) = NOT(products.base_price > 10.0)"
}

Result:

{
  "error": {
    "reason": "Invalid SQL query",
    "details": "No enum constant org.opensearch.sql.legacy.domain.Where.CONN.=",
    "type": "IllegalArgumentException"
  },
  "status": 400
}
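To make the intended semantics of that predicate concrete, here is a small sketch that runs it in SQLite (via Python's `sqlite3`) against five hypothetical rows, next to an equivalent form that uses only AND, OR, and NOT. The table, the flat `base_price` column (standing in for `products.base_price`), and the row values are all assumptions for illustration. Whether the rewritten form avoids the error in the plugin is not something this sketch tests.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER, total_quantity INTEGER, base_price REAL)"
)
rows = [
    (1, 4, 5.0),   # bulk order, cheap product
    (2, 1, 50.0),  # single order, expensive product
    (3, 4, 50.0),  # bulk order, expensive product
    (4, 1, 5.0),   # single order, cheap product
    (5, 2, None),  # unknown price: the comparison evaluates to NULL
]
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)


def matching_ids(predicate: str) -> list:
    """Return the ids of rows kept by the predicate (trusted strings only)."""
    cur = conn.execute(f"SELECT id FROM orders WHERE {predicate} ORDER BY id")
    return [row[0] for row in cur]


# The predicate from the report, with a flat column name.
original = matching_ids("(total_quantity > 1) = NOT(base_price > 10.0)")

# The same condition spelled out with AND/OR/NOT only.
rewritten = matching_ids(
    "((total_quantity > 1) AND NOT (base_price > 10.0)) "
    "OR (NOT (total_quantity > 1) AND (base_price > 10.0))"
)

print(original, rewritten)
```

Both forms keep rows 1 and 2 and drop row 5, whose NULL price makes the whole predicate NULL.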

It's annoying to have tests repeatedly output the same issues. An example bug that was found after adding the disableConfigExprs option: opensearch-project/sql#3266 (comment) Signed-off-by: Simeon Widdis <sawiddis@amazon.com>

Swiddis · 2025-01-27T22:12:30Z

Ended up disabling NOT in distributed-testing since it seems like it's fundamentally broken if you breathe on it the wrong way.

Swiddis · 2025-01-28T02:08:11Z

Another one: if you create an index with a boolean field, then WHERE bool_field = NULL also throws an exception (should evaluate to NULL for all values of bool_field). So in disttest we also need to disable generating NULL literals.
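For comparison, this is how the same comparison behaves in SQLite, used here only as an assumed reference engine. The `flags` table and its three rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE flags (bool_field BOOLEAN)")
# One TRUE, one FALSE, and one NULL value.
conn.executemany("INSERT INTO flags VALUES (?)", [(1,), (0,), (None,)])


def count_flags(predicate: str) -> int:
    """Count rows kept by the predicate (trusted literal strings only)."""
    query = f"SELECT COUNT(*) FROM flags WHERE {predicate}"
    return conn.execute(query).fetchone()[0]


eq_null = count_flags("bool_field = NULL")   # NULL for every row, so none kept
is_null = count_flags("bool_field IS NULL")  # the explicit NULL test
print(eq_null, is_null)
```

The `= NULL` comparison keeps no rows without raising an error, while `IS NULL` keeps the single NULL row.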

Swiddis added the bug (Something isn't working) and untriaged labels Jan 27, 2025

RyanL1997 removed the untriaged label Jan 28, 2025

Swiddis added the dynamic-test (Issues found by or related to Dynamic Testing) label Jan 28, 2025

penghuo added the calcite (calcite migration related) label Jan 28, 2025

Swiddis mentioned this issue Jan 28, 2025

[RFC] Dynamic Correctness Testing Framework #3220

Open

Swiddis added the SQL label Jan 29, 2025

Swiddis changed the title ~~[BUG] Poor handling of constant booleans in WHERE clauses~~ [BUG] Poor handling of boolean expressions in WHERE clauses Jan 29, 2025

Swiddis self-assigned this Jan 29, 2025
