Plan for rewrite branch #50

gingerwizard · 2022-09-27T14:03:43Z

Is https://github.com/ulikunitz/xz/tree/rewrite production ready? When do you anticipate this being promoted to main?

Thanks

ulikunitz · 2022-09-28T05:45:20Z

ulikunitz · 2022-12-11T22:33:02Z

Update: The rewrite branch is now working. Using multiple threads, I have achieved write rates of over 150 MByte/s, but the compression ratio is worse: the output is 39% of the original size vs. 33%. I have not done any work on the defaults yet. Streams encoded in parallel can also be read using multiple threads, and there I achieve read rates of over 190 MByte/s.

There are still some bug fixes required. I need to make the xz Reader a ReadCloser so that the threads can be stopped if the stream is not read completely, but so far it looks promising.

ulikunitz · 2023-04-16T07:34:19Z

Just an update.

I have done optimization work and found that I have very fast compressors, but they cannot bring the compression ratio below 29%, measured on the Silesia corpus. The bt4 match finder mode in xz achieves a compression ratio of 23% on the same corpus. So I am currently writing a tree-based match finder to achieve the same results. I have updated the task list above to reflect the activity.

ulikunitz · 2023-06-12T19:59:04Z

I now have a very slow parser (ca. 1 MiB/s) that reaches 26% on the Silesia corpus, but the code now supports multithreaded compression and decompression. I have published an alpha release, v0.6.0-alpha.3. The new lz module with the Lempel-Ziv parsers is published as well, so you can actually test it.

wagoodman · 2024-09-16T18:50:06Z

It looks like your list is a little outdated -- it appears that you're ahead of what's still left (🎉 ). What additional tasks are really left? Would you like any help with some of these tasks?

ulikunitz · 2024-09-18T18:25:46Z

Sorry, there has been a lot of work in my day job. There is a v0.6.0-alpha.3 you can experiment with; it supports the parallel modes. I would be interested in feedback on it. Compression ratios are still 2% behind the original xz, but encoding is much faster, especially using the parallel modes.
