Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

SHA1 Payload-Digest should use base 32 and not base 16 #213

Open
thomasegense opened this issue Nov 1, 2020 · 0 comments
Open

SHA1 Payload-Digest should use base 32 and not base 16 #213

thomasegense opened this issue Nov 1, 2020 · 0 comments

Comments

@thomasegense
Copy link

I know that the Warc specification do allow base 16 (https://iipc.github.io/warc-specifications/specifications/warc-format/warc-1.1/#warc-payload-digest). But since most other Warc tools all produce base 32, SHA1, it would be beneficial for the community to use the same standard.

The following tools all produce base32:
Heritrix
wget
Webrecorder

It was discovered when the warc-indexer in the webarchive-discovery project (https://github.com/ukwa/webarchive-discovery/tree/master/warc-indexer) reported errors with the payload when indexing warc-files generated woth Brozzler.

@thomasegense thomasegense changed the title SHA1 Payload-Digest should use base 16 and not base 32 SHA1 Payload-Digest should use base 32 and not base 16 Nov 1, 2020
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant