You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I wanted to extract text from an URL, URL contains a pdf file that is hosted on firebase.
Now I'm facing this issue with URL only it is working correctly with local pdf file.
Here are the logs: 2022-02-19T16:31:26+0530 <error> processing.js:97 () Error: Error for type: [[ application/octet-stream ]], file: [[ C:\Users\lenovo\AppData\Local\Temp\60980467322.pdf ]] at extract (C:\Users\lenovo\Desktop\React\Open-Source\linkin_port-\node_modules\textract\lib\extract.js:151:15) at Timeout._onTimeout (C:\Users\lenovo\Desktop\React\Open-Source\linkin_port-\node_modules\textract\lib\extract.js:159:7) at listOnTimeout (node:internal/timers:557:17) at processTimers (node:internal/timers:500:7) { typeNotFound: true }
@nosferatu500 nope it has the same error in the original also, I tried your version to tackle that error but it still persists
If you could tell me what this error means, I can try debugging it somehow
@nosferatu500 I debugged into it and found that it was passing type as binary file but actually the file type is pdf. So for URLs we have to pass typeoverride option and provide type of the file, like this:
I wanted to extract text from an URL, URL contains a pdf file that is hosted on firebase.
Now I'm facing this issue with URL only it is working correctly with local pdf file.
Here are the logs:
2022-02-19T16:31:26+0530 <error> processing.js:97 () Error: Error for type: [[ application/octet-stream ]], file: [[ C:\Users\lenovo\AppData\Local\Temp\60980467322.pdf ]] at extract (C:\Users\lenovo\Desktop\React\Open-Source\linkin_port-\node_modules\textract\lib\extract.js:151:15) at Timeout._onTimeout (C:\Users\lenovo\Desktop\React\Open-Source\linkin_port-\node_modules\textract\lib\extract.js:159:7) at listOnTimeout (node:internal/timers:557:17) at processTimers (node:internal/timers:500:7) { typeNotFound: true }
version : "@nosferatu500/textract": "^3.0.3"
Please suggest what can be done @nosferatu500
The text was updated successfully, but these errors were encountered: