To the best of the abilities of the underlying OCR engine (Google Tesseract). Not perfect, yet seemingly decent as far as test documents available to the developer were concerned - inĪround 80% of the cases, whenever there is a clearly visible MRZ on a page, the system will recognize it and extract the text
The recognition procedure may be rather slow - around 10 or more seconds for some documents. The documents may be located rather arbitrarily on the page - the code tries to find anything resembling a MRZ The package provides tools for recognizing machine readable zones (MRZ) from scanned identification documents. PassportEye: Python tools for image processing of identification documents