Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Investigate/build support for separate text file #1446

Closed
5 tasks
mbakeryo opened this issue Dec 20, 2017 · 1 comment
Closed
5 tasks

Investigate/build support for separate text file #1446

mbakeryo opened this issue Dec 20, 2017 · 1 comment

Comments

@mbakeryo
Copy link

mbakeryo commented Dec 20, 2017

As a publisher, I may provide a separate text file that is more accessible than the original file. E.g. I provide PDFs that are made up of page scans and bundled into a PDF that cannot be made accessible. The generated OCR derived from the PDF is not accessible.

I need to be able to upload a separate, cleaned up/rekeyed and accessible text file to the platform.

  • This file would be associated with the original PDF file (not replacing it).

  • This file would be downloadable.

Questions

  • Need to determine if the download button should say something different than Download OCR Text.

  • Need to determine if the disclaimer message would appear (see ticket Add disclaimer for OCR text/accessibility #1429).

  • Is it possible to include this text in the PDF that could then be part of the auto-generated OCR text?

For Turner, we need to get the rekeyed or better OCR-generated (through Prime OCR) file.

@mbakeryo
Copy link
Author

We are waiting to get the rekeyed text file. The ability to replace the Extracted File text file should be ready(ish).

@mbakeryo mbakeryo modified the milestones: Winter, Beyond Grant Feb 15, 2018
@mbakeryo mbakeryo removed the blocked label Apr 30, 2018
@jmcglone jmcglone closed this as completed Dec 4, 2024
# for free to join this conversation on GitHub. Already have an account? # to comment
Projects
None yet
Development

No branches or pull requests

2 participants