Trained using YOLOv8
Check out the Github Repo for more info
Note : Requires external installation of tesseract , and setting the environment variable of the same
from Aadhaar_extractor.Extractor import AadhaarExtractor
#create a object of the AadhaarExtractor class by passing the image of the aadhaar card as argument
obj = AadhaarExtractor("aadhaar.jpg")
#use obj.extract() for extracting the data
extractedData = obj.extract()
The extract() method will return , a list of all the fields detected .
This module currently detects five fields:
- Aadhaar Number (aadhaar_no)
- Date of Birth (dob)
- Gender (gender)
- Name (name)
- Address (address)
Each field is member of the list(detected field), has four fields in the following order
-
The coordinates of the detected field in x1,y1,x2,y2 format (type=list)
-
The name of the field
-
Confidence of detection
-
Extracted text from the detected field
Note:All the data used to train , were publicly available in the internet
This project is far from complete, and can be made much better, feel free to contribute