Efficient DataTable::sort shared with DataStore #1981

Closed
Tracked by #1898
teh-cmc opened this issue Apr 26, 2023 · 0 comments · Fixed by #6438
Labels
🏹 arrow Apache Arrow 🚀 performance Optimization, memory use, etc ⛃ re_datastore affects the datastore itself

Comments

@teh-cmc
Member

teh-cmc commented Apr 26, 2023

  • Implement an efficient multi-column sorting routine for DataTable
  • Make sure it follows the bucketing semantics of the store so the data is already sorted even post-bucketization
  • Share as much of the code as possible with the in-bucket sort routine
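
For illustration only, here is a minimal sketch of what a shared multi-column sort could look like. It is not the implementation that landed. The `Table` struct, its columns, and the key order (entity, then time, then row id) are all assumptions made up for this example; the real `DataTable` and store bucketing keys may differ. The idea is to compute a single row permutation from the key columns, then apply that same permutation to every column, so the permutation helpers can be reused by any other sort routine with a different comparator.

```rust
use std::cmp::Ordering;

/// Hypothetical stand-in for a table: three key columns plus one payload column.
/// These names are illustrative and are not taken from the real `DataTable`.
#[derive(Debug, Clone, PartialEq)]
pub struct Table {
    pub entity: Vec<String>,
    pub time: Vec<i64>,
    pub row_id: Vec<u64>,
    pub payload: Vec<f32>,
}

/// Lexicographic comparison of rows `a` and `b` over the key columns.
/// The key order used here is an assumption about the bucketing semantics.
pub fn compare_rows(t: &Table, a: usize, b: usize) -> Ordering {
    t.entity[a]
        .cmp(&t.entity[b])
        .then_with(|| t.time[a].cmp(&t.time[b]))
        .then_with(|| t.row_id[a].cmp(&t.row_id[b]))
}

/// Shared helper: returns the row permutation that sorts `len` rows under `cmp`.
/// `sort_by` is stable, so rows that compare equal keep their original order.
pub fn sort_permutation(
    len: usize,
    mut cmp: impl FnMut(usize, usize) -> Ordering,
) -> Vec<usize> {
    let mut perm: Vec<usize> = (0..len).collect();
    perm.sort_by(|&a, &b| cmp(a, b));
    perm
}

/// Shared helper: builds a new column by gathering `col` in `perm` order.
pub fn apply_permutation<T: Clone>(col: &[T], perm: &[usize]) -> Vec<T> {
    perm.iter().map(|&i| col[i].clone()).collect()
}

/// Sorts the whole table: one permutation, applied to every column.
pub fn sort_table(t: &Table) -> Table {
    let perm = sort_permutation(t.time.len(), |a, b| compare_rows(t, a, b));
    Table {
        entity: apply_permutation(&t.entity, &perm),
        time: apply_permutation(&t.time, &perm),
        row_id: apply_permutation(&t.row_id, &perm),
        payload: apply_permutation(&t.payload, &perm),
    }
}
```

An in-bucket sort could, under the same assumptions, call `sort_permutation` with a comparator over the time column only and reuse `apply_permutation` unchanged, which is one way to keep the two code paths aligned.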

Related:
