This repository was archived by the owner on Feb 26, 2025. It is now read-only.

Don't convert the index columns produced by Pandas. #9

Merged
matz-e merged 1 commit into main from matz-e/block-pandas-index
Nov 15, 2024

Conversation

matz-e
Member

@matz-e matz-e commented Nov 15, 2024

When scientists write Parquet files with Pandas via
`DataFrame.to_parquet`, they have to pass `index=False` to keep the
index from being written to disk as well. In the simplest and most
common case, this index shows up as an additional column
`__index_level_0__` and ends up in the edge files.

Make scientists' lives and ours a little simpler by skipping the
conversion of this index column.

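
To illustrate the idea, here is a minimal sketch of filtering out Pandas index columns before conversion. It is not the code from this commit: the function name `columns_to_convert` and the example column names are hypothetical, and matching any `__index_level_<n>__` level (rather than only `__index_level_0__`, the one case mentioned above) is an assumption.

```python
import re

# Pandas (through pyarrow) stores unnamed index levels as columns called
# "__index_level_<n>__". The description above only mentions level 0;
# matching every level is an assumption made for this sketch.
PANDAS_INDEX_COLUMN = re.compile(r"__index_level_\d+__")


def columns_to_convert(column_names):
    """Return the column names with Pandas index columns filtered out.

    Hypothetical helper, not part of the actual converter.
    """
    return [
        name for name in column_names
        if not PANDAS_INDEX_COLUMN.fullmatch(name)
    ]


# Hypothetical schema of a Parquet file written by DataFrame.to_parquet
# without index=False.
columns = ["source_node_id", "target_node_id", "weight", "__index_level_0__"]
print(columns_to_convert(columns))
```

With a filter like this, a file written without `index=False` would produce the same set of edge attributes as one written with it.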
@matz-e matz-e requested a review from 1uc November 15, 2024 14:03
@matz-e matz-e merged commit 68a5788 into main Nov 15, 2024
2 checks passed
@matz-e matz-e deleted the matz-e/block-pandas-index branch November 15, 2024 14:30
2 participants