[FEA] csv_reader_options
to read empty strings as blank (i.e. ""
), not null
.
#12145
Labels
0 - Backlog
In queue waiting for assignment
cuIO
cuIO issue
feature request
New feature or request
libcudf
Affects libcudf (C++/CUDA) code.
Milestone
When
cudf::io::read_csv()
encounters two consecutive field delimiters within a row, it deems the corresponding string column value as null. E.g.:Reading the input above via
read_csv()
produces rows{a,null,c}
and{d,null,f}
. This is conformant with Spark's CSV reader (and presumably Pandas).It would be useful if the column value could be optionally interpreted as an empty string (
""
) instead. This would permit support for reading Hive delimited text, where empty strings are empty by default.The text was updated successfully, but these errors were encountered: