A simple, effective post-processing OCR improvement.

01 December 2016

High-quality optical character recognition (OCR) is essential for search and retrieval of scanned materials. This paper presents a simple, low-cost post-processing method that significantly improves OCR output. Specifically, it repairs words that the OCR engine has broken apart by incorrectly inserting spaces between their letters, which improves the text and, in turn, the retrieval results.
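
The abstract does not spell out the repair procedure, so the following is only an illustrative sketch of one way such a repair could work, not the paper's method. It assumes a word list (`lexicon`) is available and greedily rejoins adjacent fragments when their concatenation is a known word and at least one fragment is not itself a word. The function name `merge_broken_words`, the `max_fragments` limit, and the sample lexicon are all hypothetical.

```python
def merge_broken_words(text, lexicon, max_fragments=4):
    """Rejoin word fragments separated by spurious spaces.

    Illustrative assumption: a run of 2..max_fragments adjacent tokens is
    merged when the joined string is in `lexicon` and at least one of the
    fragments is not a lexicon word on its own. The second condition keeps
    legitimate pairs such as "in to" from collapsing into "into".
    """
    tokens = text.split()
    repaired = []
    i = 0
    while i < len(tokens):
        merged = False
        # Try the longest candidate run first, then shorter ones.
        for n in range(min(max_fragments, len(tokens) - i), 1, -1):
            fragments = tokens[i:i + n]
            candidate = "".join(fragments)
            has_non_word = any(f.lower() not in lexicon for f in fragments)
            if candidate.lower() in lexicon and has_non_word:
                repaired.append(candidate)
                i += n
                merged = True
                break
        if not merged:
            # No valid merge starts here; keep the token unchanged.
            repaired.append(tokens[i])
            i += 1
    return " ".join(repaired)


if __name__ == "__main__":
    # Hypothetical lexicon; a real one would be a full dictionary.
    sample_lexicon = {"optical", "character", "recognition", "is", "important"}
    print(merge_broken_words("opti cal char acter recog nition is im port ant",
                             sample_lexicon))
```

This sketch ignores punctuation attached to tokens and normalizes all whitespace to single spaces; a practical version would need to handle both, along with fragments that happen to be valid words on their own.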