pytesseract-ocr

Here are 140 public repositories matching this topic...

NanoNets / ocr-python

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

python pdf ocr tesseract pdf-to-text image-to-text textract pdf-to-csv pdf-to-json searchable-pdf pytesseract-ocr extract-table table-extract image-to-text-converter extract-text-from-image extract-text-from-pdf

Updated Dec 2, 2022
Jupyter Notebook

shayanalibhatti / Designing-a-PDF-Audiobook-using-Python

A simple implementation of a PDF-to-audio converter

python python3 pdf-reader audio-converter gtts pytesseract pymupdf pdf-to-audio pdf-text pytesseract-ocr

Updated Mar 30, 2021
Python

lamnguyenkhoa / container-code-recognition

Detect and extract container codes in a video.

recognition computer-vision deep-learning pycharm object-detection truck darknet opencv-python pytesseract-ocr yolov4

Updated Oct 15, 2022
Python

bhavita / Auto-Audio-Books

Convert PDFs to audiobooks 📚

pdf google-text-to-speech audiobooks pytesseract-ocr pdf-to-audiobook

Updated Sep 13, 2020
Jupyter Notebook

Team-Cornflakes / VitaFile

Google Solution Challenge 2024. Team Cornflakes VIT Chennai

react django bert palm translate-api pytesseract-ocr gemini-pro-vision gemini-pro

Updated Feb 25, 2024
JavaScript

icaropires / pdf2dataset

Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features

python pdf distributed-systems data-science ocr pandas-dataframe parallel distributed-computing tesseract python3 tesseract-ocr parquet ray pdftotext pytesseract pdf2image pyarrow pytesseract-ocr

Updated Jan 9, 2025
Python

radioactive11 / ALPR-India

Detect and scan the license plate number from vehicle images

opencv cnn pytesseract-ocr

Updated Jun 16, 2021
Python

prathyyyyy / Medical-Data-Extraction

Medical data extraction using Pytesseract (a Python wrapper for Google's Tesseract OCR engine) and computer vision

python computer-vision pytest pytesseract pdf2image fastapi pytesseract-ocr

Updated Feb 11, 2023
Jupyter Notebook

goldenryu2000 / Discord-OCR-Bot

This is an OCR Bot for Discord made using OpenCV and Pytesseract

python heroku bot opencv ocr discord discord-bot python3 hacktoberfest heroku-deployment heroku-app ocr-recognition pytesseract ocr-python pytesseract-ocr ocr-bot ocr-discord-bot

Updated Jul 18, 2022
Python

anilsathyan7 / AI-Sudoku-Solver

Solving Sudoku Puzzles With Computer Vision And Neural Networks

computer-vision tensorflow cnn pytorch recurrent-neural-networks neural-networks sudoku-solver digit-recognition sudoku-puzzles opencv-python keras-tensorflow pytesseract-ocr py-sudoku

Updated Jan 17, 2021
Jupyter Notebook

deepshig / Textual-Video-to-Speech-Interface

An interface to extract text from a video and convert it to speech

python text-to-speech image text-analysis video-processing google-text-to-speech optical-character-recognition mosaic-images mosaic pytesseract btech-project-proposal computer-science-project image-binarization undergraduate-project pytesseract-ocr btech-project python-mosaic

Updated May 29, 2020
Python

SanjinKurelic / FlaskALPR

Flask ALPR is a web service for automatic license plate recognition (ALPR). The web service is written in Python using Flask for the REST API and OpenCV with PyTesseract for plate recognition. The service offers two REST APIs: one for checking whether a licence plate is detected and one for detecting the licence plate from a camera image. All detected licence p…

opencv flask sqlalchemy numpy tesseract tesseract-ocr sqlite3 opencv-python flask-sqlalchemy alpr python-ocr pytesseract python-dateutil imutils sqlite-python pytesseract-ocr

Updated Feb 13, 2022
Python

moebius-analitica / meetup-webscraping

Talk on web scraping public data from Chile

python beautifulsoup selenium-webdriver selenium-python tabula-py pytesseract-ocr

Updated Dec 8, 2022
Python

Jishnnu / InvoiceAI-Document-Parser

Simple Streamlit application that parses the data from Invoice images and returns it in JSON format

machine-learning numpy matplotlib opencv-python imutils kor doctr keras-ocr pytesseract-ocr streamlit-webapp mindee langchain jina-chat

Updated Aug 19, 2023
Jupyter Notebook

pavtiger / Parse-tables-from-PDF

A tool that automates extracting data tables from scanned PDF documents

python pdf opencv webserver socketio pytesseract pytesseract-ocr

Updated Jun 20, 2023
Python

7410abhi / Image_detector-using-python-libraries

Image detector project using Python libraries

pillow pil python3 python-programming kraken opencv-python pytesseract python-project python-libraries pytesseract-ocr

Updated May 12, 2020
Python

Sweatnessstrong / pdf-to-word-converter

This Python script converts a PDF file to Word format using OCR (Optical Character Recognition). It extracts text from each page of the PDF, converts the pages to images, performs OCR on the images, and saves the extracted text to text files.

python pypdf2 pytesseract pdf2image pytesseract-ocr

Updated May 26, 2024
Python

bhattbhavesh91 / pytesseract-demo

A simple demo to show the power of PyTesseract: Simple Python Optical Character Recognition

python demo ocr optical-character-recognition pytesseract python-tesseract pytesseract-ocr

Updated Jun 1, 2021
Jupyter Notebook

ScottStevenWhite / DocsInARow

"Docs in a Row" is an automated script designed to handle image data extraction, correction, categorization, and storage. It utilizes a variety of technologies including OpenAI, Google Cloud Vision, pytesseract, and PIL to extract and correct text from images, categorize the content, and store useful metadata.

openai vision-api good-first-issue pytesseract-ocr openai-api

Updated May 31, 2023
Python

MvMukesh / autoKYC

Named Entity Extraction with OpenCV, Pytesseract, Spacy (OCR + NER), BIO Labelling

nlp opencv computer-vision deep-learning ocr-service flask-application labelling regular-expressions ner pytesseract spacy-nlp datapreprocessing bert-ner pytesseract-ocr bio-tagging

Updated Jan 7, 2025
