bugfix `has_columns()` passing character vector of non-existing columns #540

yjunechoe · 2024-06-15T07:16:22Z

I introduced a regression in #539 when I let hard-coded character vectors bypass the tidyselect-based column resolving mechanism (in resolve_columns()). This introduced a side-effect in has_columns() (which shares the same mechanism), making it pass character vectors through without checking for existence:

library(pointblank)

# Thinks that it "found" the column `y`
data.frame(x = 1) %>% has_columns("y")
#> [1] TRUE

This happened because a column selecting expression may successfully resolve to a character vector, but the columns in the vector themselves may not exist in the data. has_columns() was missing a check for the latter.

The PR ensures that has_columns() does an additional non-lazy check for existence of the resolved columns from the data - since active = ... has_columns() ... is called during interrogate, the tbl is resolved to an actual dataframe by then.

Now all variants align in producing the expected behavior when checking for a non-existent column:

devtools::load_all()
any(c(
  data.frame(x = 1) %>% has_columns(y),
  data.frame(x = 1) %>% has_columns("y"),
  data.frame(x = 1) %>% has_columns(vars(y)),
  data.frame(x = 1) %>% has_columns(vars("y"))
))
#> [1] FALSE

I've added more unit tests to cover this nuance specific to has_columns() (so this shouldn't accidentally regress again!)

rich-iannone

LGTM!

yjunechoe added 2 commits June 15, 2024 15:47

ensure columns resolved to chr do exist in data

274d7b3

add tests for bad character vector input

9c2606a

yjunechoe added the Type: ☹︎ Bug label Jun 15, 2024

rich-iannone approved these changes Jun 15, 2024

View reviewed changes

yjunechoe merged commit 21b3228 into rstudio:main Jun 15, 2024
12 of 13 checks passed

yjunechoe deleted the has_columns-character-bugfix branch June 15, 2024 13:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bugfix `has_columns()` passing character vector of non-existing columns #540

bugfix `has_columns()` passing character vector of non-existing columns #540

yjunechoe commented Jun 15, 2024 •

edited

Loading

rich-iannone left a comment

bugfix has_columns() passing character vector of non-existing columns #540

bugfix has_columns() passing character vector of non-existing columns #540

Conversation

yjunechoe commented Jun 15, 2024 • edited Loading

rich-iannone left a comment

Choose a reason for hiding this comment

bugfix `has_columns()` passing character vector of non-existing columns #540

bugfix `has_columns()` passing character vector of non-existing columns #540

yjunechoe commented Jun 15, 2024 •

edited

Loading