ocr quality on chi_sim
#148
Comments
I believe we need to train the Chinese language model again. I know you've only got a reply after so long. Have you tried to fix it?
I am trying to download a few different OCR models to analyse https://gd-pub.jinshujufiles.com/di/20180308130431_f4fead, but how do I download them?
## https://github.com/tesseract-ocr/tessdata
if(is.na(match('chi_sim.traineddata', tesseract_info()$available)))
  tesseract_download('tesseract-ocr/tessdata/chi_sim.traineddata')
Downloaded: 0.10 MB
Error: Download failed: HTTP 404
if(is.na(match('chi_sim_vert.traineddata', tesseract_info()$available)))
  tesseract_download('tesseract-ocr/tessdata/chi_sim_vert.traineddata')
Downloaded: 0.10 MB
Error: Download failed: HTTP 404
## https://github.com/tesseract-ocr/tessdata_best
if(is.na(match('chi_sim.traineddata', tesseract_info()$available)))
  tesseract_download('tesseract-ocr/tessdata_best/chi_sim.traineddata')
Downloaded: 0.10 MB
Error: Download failed: HTTP 404
if(is.na(match('chi_sim_vert.traineddata', tesseract_info()$available)))
  tesseract_download('tesseract-ocr/tessdata_best/chi_sim_vert.traineddata')
Downloaded: 0.10 MB
Error: Download failed: HTTP 404
## https://github.com/tesseract-ocr/tessdata_fast
if(is.na(match('chi_sim.traineddata', tesseract_info()$available)))
  tesseract_download('tesseract-ocr/tessdata_fast/chi_sim.traineddata')
Downloaded: 0.10 MB
Error: Download failed: HTTP 404
if(is.na(match('chi_sim_vert.traineddata', tesseract_info()$available)))
  tesseract_download('tesseract-ocr/tessdata_fast/chi_sim_vert.traineddata')
Downloaded: 0.10 MB
Error: Download failed: HTTP 404
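For reference, the 404 responses are consistent with `tesseract_download()` receiving a repository path where it expects a bare language code. Below is a minimal sketch, assuming the first argument should be a code such as `chi_sim`, and that `tesseract_info()$available` lists installed language codes rather than `.traineddata` file names. Which tessdata repository the files come from depends on the installed package version, so this does not pick between tessdata, tessdata_best and tessdata_fast. The image path is a placeholder, not a file from this thread.

```r
library(tesseract)

# Bare language codes only (assumption: no repository path, no ".traineddata" suffix).
langs <- c("chi_sim", "chi_sim_vert")

for (lang in langs) {
  # tesseract_info()$available holds the codes of installed languages, e.g. "eng".
  if (!(lang %in% tesseract_info()$available)) {
    tesseract_download(lang)
  }
}

# Build an engine for Simplified Chinese and run OCR with it.
# "image.png" is a hypothetical placeholder path.
chi_sim <- tesseract("chi_sim")
text <- ocr("image.png", engine = chi_sim)
cat(text)
```

If the download still fails, checking `tesseract_info()$datapath` shows where the package looks for training data, which helps when placing a `.traineddata` file there by hand.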
I tried to OCR an image in chi_sim, but the quality is not very good; some characters are not recognized. Is there any way to improve accuracy?
Originally posted by @englianhu in #146 (comment)