Xerox M118i OmniPage SE User Guide - Page 22
What is optical character recognition, OmniPage SE’s OCR capabilities
UPC - 095205219265
Page 22 highlights
What is optical character recognition

Optical character recognition (OCR) is the process of extracting text from an image. This image can result from scanning a paper document or opening an electronic image file. Images do not contain editable text characters; they contain many tiny dots (pixels) that together form character shapes. These present a picture of the text on a page. During OCR, OmniPage SE analyzes the character shapes in an image and determines which characters they represent in order to produce editable text. After OCR, you can save the resulting text to a variety of word-processing, desktop publishing or spreadsheet applications.

OmniPage SE's OCR capabilities

In addition to text recognition, OmniPage SE can retain the following elements of a document through the OCR process.

Graphics
Photos, logos, and drawings are examples of graphics.

Text formatting
Font types, sizes and styles (such as bold, italic and underline) are examples of character formatting. Indents, tabs, margins and line spacing are examples of paragraph formatting.

Page formatting
Column structure, table formats, and placement of graphics and headings are examples of page formatting.

The graphics, text and page formatting elements that OmniPage SE retains are determined by the settings you select. Refer to the Settings Guidelines in the online Help for more information about selecting settings.

OmniPage SE recognizes only machine-generated characters, such as offset-printed, laser-printed or typewritten text. However, it can retain handwritten text, such as a signature, as a graphic.