-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Access to Undefined Reference #122
Comments
This issue could potentially be related to Issue #120. There could potentially be a string column that fails to parse a delimiter, putting a whole whack of data into a single cell, causing the rest of the columns. |
If I iterate through the file with no batch size specified, I get an inexact error (trying to convert a NaN to Int32)
|
I just verified the fix to Issue #120, that fix doesn't fix this problem unfortunately so this issue is still open. |
i also get exactly this |
I can only partially iterate from a file created by parquet-mr. I can iterate through it once, but trying to do this a second time yields
Note that NamedTuple{...} is abridged becasue the actual tuple is a massive long list of the entire file schema. I can't give you the original file for this one, but I wouldn't be surprised if it has something to do with initializing a mutable type with #undef and failing to populate it. There could be sizable gaps in data for some of the columns. Note that it was created by parquet-mr
Parquet file: Input/input_data.parquet
version: 1
nrows: 4887400
created by: parquet-mr version 1.9.0 (build 38262e2c80015d0935dad20f8e18f2d6f9fbd03c)
cached: 157 column chunks
The text was updated successfully, but these errors were encountered: