Don't convert the index columns produced by Pandas. #9

matz-e · 2024-11-15T14:03:15Z

If scientists write Parquet files with Pandas via
`DataFrame.to_parquet`, they have to pass `index=False` to keep the
index from being written to disk as well. In the simplest and most common
case, this index shows up as an additional column, `__index_level_0__`,
and ends up in the edge files.

Make scientists' lives and ours a little simpler and skip converting this
index column.
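
As an illustration only, here is a minimal sketch of how such a filter could look. It is not the code merged in this PR: the function name `columns_to_convert` and the example column names are hypothetical. It assumes that unnamed index levels are written as `__index_level_<n>__`; a named index is stored under its own name and would not be caught by this pattern.

```python
import re

# Assumption: unnamed Pandas index levels end up in the Parquet file as
# columns named "__index_level_<n>__" (e.g. "__index_level_0__").
PANDAS_INDEX_COLUMN = re.compile(r"__index_level_\d+__")


def columns_to_convert(column_names):
    """Return the column names to convert, skipping Pandas index columns.

    Hypothetical helper for illustration; the order of the remaining
    columns is preserved.
    """
    return [
        name
        for name in column_names
        if PANDAS_INDEX_COLUMN.fullmatch(name) is None
    ]


if __name__ == "__main__":
    # Hypothetical edge-file columns plus the index column added by Pandas.
    columns = ["source_node_id", "target_node_id", "weight", "__index_level_0__"]
    print(columns_to_convert(columns))
```

Matching on the full column name keeps ordinary columns that merely contain the word "index" untouched.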

matz-e requested a review from 1uc November 15, 2024 14:03

1uc approved these changes Nov 15, 2024

matz-e merged commit 68a5788 into main Nov 15, 2024
2 checks passed

matz-e deleted the matz-e/block-pandas-index branch November 15, 2024 14:30
