OCR Software: Optical Character Recognition or Optical Crud Recognition?

Is it really possible to get high OCR accuracy from poor quality documents?

Optical Character Recognition (OCR) refers to the software technology and processes that translate printed text into computer-searchable text.

Done correctly, OCR enables users to search for and retrieve individual words contained within a file or page. In addition, when a set of files is indexed, users are able to search for keywords across an entire document library and retrieve each matching page precisely. OCR enables users to execute in seconds searches that once could take several hours or days to complete.

However, this technology did not work well on older or poor quality documents that contained mixed fonts or combinations of text and graphics. Until now!

Due to several recent technology advances, it is now possible to obtain six-sigma level character accuracy from these types of document collections.

Although it is important to keep in mind that the quality and condition of the paper documents are still key factors in a successful OCR conversion, dramatically improved results can be obtained by enhancing the quality of the scanned image prior to processing.

Removal of noise such as borders and speckles, along with skew correction, is now common on the more advanced document scanners. Furthermore, advanced color filter technologies may be used to reduce any page background colors, in conjunction with multi-light image capture technologies to remove any shadows cast by page creases that could impact image quality or recognition accuracy.

Once document scanning and processing are complete, an OCR text layer can be added and hidden behind each image. An additional orientation filter can be used to ensure that the best image is presented to the OCR engines.

To achieve the highest conversion accuracy possible, the characters in the image can be processed using multi-engine OCR voting technologies that rank each character to determine the best text recognition fit. Then, once a word is generated, it is filtered through a proprietary lexicon to ensure the highest quality results.

Finally, this text can be processed using sophisticated layout retention technologies to represent the image text layout, providing the best possible text representation for precise search and retrieval. After all, isn't that why they call it Optical Character Recognition?
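
To make the multi-engine voting and lexicon steps described above more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the proprietary method the article refers to. It assumes each engine returns one (character, confidence) pair per position and that the engine outputs are already aligned to the same length. The function names vote_characters and lexicon_filter are hypothetical, and the standard-library difflib matcher stands in for a real lexicon lookup.

```python
from collections import defaultdict
from difflib import get_close_matches


def vote_characters(readings):
    """Pick one character per position by summing engine confidences.

    readings: one list per engine, each holding (char, confidence) pairs.
    Assumption: all engines produced the same number of positions.
    """
    length = len(readings[0])
    if any(len(r) != length for r in readings):
        raise ValueError("engine outputs must be aligned to the same length")

    word = []
    for position in zip(*readings):
        scores = defaultdict(float)
        for char, confidence in position:
            scores[char] += confidence  # confidence-weighted vote
        word.append(max(scores, key=scores.get))
    return "".join(word)


def lexicon_filter(word, lexicon, cutoff=0.8):
    """Return the word if known, else its closest lexicon entry, else unchanged."""
    entries = sorted(lexicon)
    if word.lower() in entries:
        return word
    matches = get_close_matches(word.lower(), entries, n=1, cutoff=cutoff)
    return matches[0] if matches else word


# Three hypothetical engines disagree on two characters of the word "clear".
engine_a = [("c", 0.9), ("1", 0.6), ("e", 0.9), ("a", 0.9), ("r", 0.9)]
engine_b = [("c", 0.9), ("l", 0.7), ("e", 0.8), ("a", 0.9), ("r", 0.8)]
engine_c = [("c", 0.8), ("l", 0.5), ("c", 0.4), ("a", 0.9), ("r", 0.9)]

voted = vote_characters([engine_a, engine_b, engine_c])
print(voted)  # clear
print(lexicon_filter("docurnent", ["document", "scanner"]))
```

Summing confidences is only one simple way to rank candidate characters. A production system would also need to align outputs of different lengths, for example when one engine reads "rn" where another reads "m", which this sketch leaves to the lexicon step.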