Store a compressed bitmap alongside the index pages #1552

westonpace · 2023-11-07T23:21:56Z

When we match a page in the btree search, we delegate to the sub-index to search that page. This works well for high-cardinality data and for equality / inequality searches. However, it can be inefficient when the data is low cardinality, since we have to search a very large page (or many small pages).

Luckily, when we search the btree, we can tell whether a page is entirely included in the result. In other words, if a page has min: 5, max: 10 and the query is value < 20, then we don't need to search the page because we know we want all of the rows in the page. If we had stored a compressed bitmap of all the row ids in the page, then all we would need to do is load that bitmap.
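As a rough illustration of the containment check described above, here is a minimal Python sketch. The names `PageStats`, `Overlap`, and `classify_less_than` are hypothetical and are not part of the existing index code; the sketch also assumes integer values and ignores nulls.

```python
from dataclasses import dataclass
from enum import Enum


class Overlap(Enum):
    NONE = "none"        # page cannot contain matches; skip it
    PARTIAL = "partial"  # page may contain matches; search the sub-index
    FULL = "full"        # every row matches; the page bitmap is the answer


@dataclass
class PageStats:
    # Hypothetical per-page statistics kept in the btree.
    min_value: int
    max_value: int


def classify_less_than(page: PageStats, bound: int) -> Overlap:
    """Classify a page against the predicate `value < bound`."""
    if page.max_value < bound:
        # Even the largest value qualifies, so all rows qualify.
        return Overlap.FULL
    if page.min_value >= bound:
        # Even the smallest value fails, so no row qualifies.
        return Overlap.NONE
    return Overlap.PARTIAL


# The example from the issue: min 5, max 10, query value < 20.
overlap = classify_less_than(PageStats(min_value=5, max_value=10), 20)
```

Only pages classified as `PARTIAL` would still need a sub-index search; `FULL` pages could be answered by loading the stored bitmap alone.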


westonpace · 2023-11-07T23:24:04Z

This issue has high synergy with #1551. For example, consider a low cardinality column with only 5 choices and 100 million rows. The query is then something like color == 'blue'. Satisfying this query today is pretty expensive, because we need to search all of the tiny pages that match that query.

On the other hand, if we only had five pages, and each page had its own compressed bitmap, then we could satisfy the query very quickly with a single IOP to read the compressed bitmap for the correct page.
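A toy sketch of that five-page scenario, assuming one page per distinct value. `Page` and `equality_lookup` are hypothetical names, a `frozenset` stands in for a real compressed bitmap, and 20 rows stand in for the 100 million:

```python
from dataclasses import dataclass


@dataclass
class Page:
    min_value: str
    max_value: str
    row_ids: frozenset  # stand-in for a compressed bitmap of row ids


def equality_lookup(pages, value):
    """Return (row_ids, bitmap_reads) for the predicate `column == value`.

    Only handles pages holding a single distinct value; a page with
    min != max would still need a sub-index search (not modeled here).
    """
    result = set()
    bitmap_reads = 0
    for page in pages:
        if page.min_value == page.max_value == value:
            result |= page.row_ids  # one read of the page's bitmap
            bitmap_reads += 1
    return result, bitmap_reads


colors = ["blue", "green", "red", "white", "yellow"]
# Row i is assigned colors[i % 5], so each color gets its own page.
pages = [Page(c, c, frozenset(range(i, 20, 5))) for i, c in enumerate(colors)]

rows, reads = equality_lookup(pages, "blue")
```

In this sketch the `blue` query touches exactly one bitmap, which mirrors the single-IOP read described above.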

This is basically giving us the benefits of both btree indices AND bitmap indices in one combined structure.

westonpace · 2024-05-08T15:46:11Z

Note that this is likely going to be related to #2307 as well.

westonpace · 2024-05-08T15:47:19Z

And #1887 suggests we should probably just avoid this scenario in most situations.

westonpace mentioned this issue Nov 7, 2023: [EPIC]: Scalar index follow-up issues tracker #1553 (Open, 10 tasks)

wjones127 mentioned this issue Mar 15, 2024: Roadmap 2024 #2079 (Open, 20 tasks)
