Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Some *.pdf files are HTML documents / 302 redirections or 403 forbidden #2

Open
WildCryptoFox opened this issue Mar 14, 2021 · 0 comments

Comments

@WildCryptoFox
Copy link

WildCryptoFox commented Mar 14, 2021

(Not an issue for me. Just a notification.)

While searching for redirection and forbidden pages in my own archive directory (which transitively includes your library), I noticed you've hit the same annoying issue.

$ find -type f -name '*.pdf' -exec file {} + | grep -v PDF
./machine learning/Learning to Protect Communications Using Adverarial Neural Cryptography (2016) - Abadi, Andersen.pdf:                                                                                                                                   HTML document, ASCII text
./music theory & sheet music/Op. 27 No. 2: Sonata quasi una Fantasia - Beethoven.pdf:                                                                                                                                                                      HTML document, UTF-8 Unicode text
./maps/cards/amharic.pdf:                                                                                                                                                                                                                                  HTML document, ASCII text
./maps/cards/georgia.pdf:                                                                                                                                                                                                                                  HTML document, ASCII text
./maps/cards/nepali.pdf:                                                                                                                                                                                                                                   HTML document, ASCII text
./maps/cards/ukranian.pdf:                                                                                                                                                                                                                                 HTML document, ASCII text
./physics/HEP theory/Can We Mine a Black Hole? (2015) [SciAm] - Brown.pdf:                                                                                                                                                                                 HTML document, UTF-8 Unicode text, with very long lines
./cryptography & mathematics/quantum algorithms & cryptanalysis/Breaking Symmetric Cryptosystems using Quantum Period Finding (2016) - Kaplan, Leurent, Leverrier, Naya-Placencia.pdf:                                                                     HTML document, ASCII text
./cryptography & mathematics/hashes/keccak/First Analysis of Keccak (2009) - Aumasson, Khovratovich.pdf:                                                                                                                                                   HTML document, ASCII text
./cryptography & mathematics/stream ciphers/All Your Biases Belong To Us: Breaking RC4 in WPA-TKIP and TLS (2015) - Vanhoef, Piessens.pdf:                                                                                                                 HTML document, ASCII text, with very long lines
./cryptography & mathematics/isogeny-based cryptography/Constructing elliptic curve isogenies in quantum subexponential time (2011) - Childs, Jao, Soukharev.pdf:                                                                                          HTML document, ASCII text
./cryptography & mathematics/zero knowledge/Proof Systems for General Statements about Discrete Logarithms (1997) - Camenisch, Stadler.pdf:                                                                                                                HTML document, ASCII text, with CRLF, LF line terminators
./anonymity & circumvention/bitcoin/Mixcoin: Anonymity for Bitcoin with accountable mixes (2014) - Bonneau, Narayanan, Miller, Clark, Kroll, Felten.pdf:                                                                                                   HTML document, ASCII text
./anonymity & circumvention/dpi/Empirical Study of a National-Scale Distributed IDS: Backbone-Level Filtering of HTML Responses in China (2010) - Park, Crandall.pdf:                                                                                      PDF document, version 1.4

The following are all arxiv.org 302 redirections, thus easy fixes. The maps/cards/*.pdf appear to be dead links that not even archive.org caught. I didn't investigate the rest further.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant