We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
cudf::io::json::detail::normalize_single_quotes
\n
Reproducible with this input:
{\"a\": \"1\n2\"} {\'a\': 12}
The output tokens, generated by cudf::io::json::detail::get_token_stream after preprocessing with cudf::io::json::detail::normalize_single_quotes are:
cudf::io::json::detail::get_token_stream
Input: {"a": "1 2"}{'a': 12} Tokens: 0, 4, 6, 7, 8, 9, 5, 1, 0, 1 Token indices: 0, 1, 1, 3, 6, 10, 11, 11, 0, 0
If remove the \n character then the output is correct:
Input: {"a": "12"}{"a": 12} Tokens: 0, 4, 6, 7, 8, 9, 5, 1, 0, 4, 6, 7, 10, 11, 5, 1 Token indices: 0, 1, 1, 3, 6, 9, 10, 10, 12, 13, 13, 15, 18, 20, 20, 20
Note:
\0
allow_unquoted_control
true
static_cast
enum token_t
cudf/cpp/include/cudf/io/detail/tokenize_json.hpp
Line 88 in 06b3f83
I suspect that it is due to the leftover character \n in
cudf/cpp/src/io/json/json_normalization.cu
Lines 76 to 77 in 06b3f83
The text was updated successfully, but these errors were encountered:
5cbdcd0
shrshi
Successfully merging a pull request may close this issue.
Reproducible with this input:
The output tokens, generated by
cudf::io::json::detail::get_token_stream
after preprocessing withcudf::io::json::detail::normalize_single_quotes
are:If remove the
\n
character then the output is correct:Note:
\0
, not\n
.allow_unquoted_control
is set totrue
.static_cast
fromenum token_t
atcudf/cpp/include/cudf/io/detail/tokenize_json.hpp
Line 88 in 06b3f83
I suspect that it is due to the leftover character
\n
incudf/cpp/src/io/json/json_normalization.cu
Lines 76 to 77 in 06b3f83
The text was updated successfully, but these errors were encountered: