Xerox M118i OmniPage SE User Guide - Page 93

Text does not get recognized properly, Turn IntelliTrain on and make some proofing corrections.

Page 93 highlights

Chapter 6 Text does not get recognized properly Try these solutions if any part of the original document is not converted to text properly during OCR: X Look at the original page image and ensure that all text areas are enclosed by text zones. If an area is not enclosed by a zone, it is generally ignored during OCR. See the section on creating and modifying zones, "Working with zones" on page 59. X Make sure text zones are identified correctly. Reidentify zone types and contents, if necessary, and perform OCR on the document again. See "Zone types and properties" on page 57. X Be sure you do not have an unsuitable template loaded by mistake. If zone borders cut through text, recognition is impaired. X Adjust the brightness and contrast sliders in the Scanner panel of the Options dialog box. You may need to experiment with different settings combinations to get the desired results. X Check the resolution of the original image. Hover the cursor over a page thumbnail for a popup display. If the resolution is significantly above or below 300 dpi, recognition is likely to suffer. X Make sure the correct document languages are selected in the OCR panel of the Options dialog box. Only languages included in the document should be selected. X Turn IntelliTrain on and make some proofing corrections. This is most likely to help with stylized fonts or uniformly degraded documents. Do some manual training, or edit existing training to remove unsuccessful training. The references to training do not apply to OmniPage SE. X If you use True Page as the Text Editor view or for export, recognized text is put into text boxes or frames. Some text may be hidden if a text box is too small. To view the text, place the cursor in the text box and use the arrow keys on your keyboard to scroll to the top, bottom, left, or right of the box. X Check the glass, mirrors, and lenses on your scanner for dust, smudges or scratches. Clean if necessary. Troubleshooting 93

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102

Chapter 6
Troubleshooting
93
Text does not get recognized properly
Try these solutions if any part of the original document is not converted
to text properly during OCR:
X
Look at the original page image and ensure that all text areas are
enclosed by text zones. If an area is not enclosed by a zone, it is
generally ignored during OCR. See the section on creating and
modifying zones,
Working with zones
on page 59.
X
Make sure text zones are identified correctly. Reidentify zone
types and contents, if necessary, and perform OCR on the
document again. See
Zone types and properties
on page 57.
X
Be sure you do not have an unsuitable template loaded by
mistake. If zone borders cut through text, recognition is
impaired.
X
Adjust the brightness and contrast sliders in the Scanner panel of
the Options dialog box. You may need to experiment with
different settings combinations to get the desired results.
X
Check the resolution of the original image. Hover the cursor
over a page thumbnail for a popup display. If the resolution is
significantly above or below 300 dpi, recognition is likely to
suffer.
X
Make sure the correct document languages are selected in the
OCR panel of the Options dialog box. Only languages included
in the document should be selected.
X
Turn IntelliTrain on and make some proofing corrections. This
is most likely to help with stylized fonts or uniformly degraded
documents. Do some manual training, or edit existing training
to remove unsuccessful training. The references to training do
not apply to OmniPage SE.
X
If you use True Page as the Text Editor view or for export,
recognized text is put into text boxes or frames. Some text may
be hidden if a text box is too small. To view the text, place the
cursor in the text box and use the arrow keys on your keyboard
to scroll to the top, bottom, left, or right of the box.
X
Check the glass, mirrors, and lenses on your scanner for dust,
smudges or scratches. Clean if necessary.