Adobe 22020737 Acrobat X Pro Manual - Page 56

Recognize text in scanned documents, Recognize Text - General Settings dialog box

Page 56 highlights

USING ACROBAT X PRO 50 Creating PDFs • Avoid dithering or halftone scanner settings. These settings can improve the appearance of photographs, but they make it difficult to recognize text. • For text printed on colored paper, try increasing the brightness and contrast by about 10%. If your scanner has color-filtering capability, consider using a filter or lamp that drops out the background color. Or if the text isn't crisp or drops out, try adjusting scanner contrast and brightness to clarify the scan. • If your scanner has a manual brightness control, adjust it so that characters are clean and well formed. If characters are touching, use a higher (brighter) setting. If characters are separated, use a lower (darker) setting. Recognize text in scanned documents You can use Acrobat to recognize text in previously scanned documents that have already been converted to PDF. Optical character recognition (OCR) software enables you to search, correct, and copy the text in a scanned PDF. To apply OCR to a PDF, the original scanner resolution must have been set at 72 dpi or higher. For more information on text recognition, see these videos: • Recognizing Text in Scanned PDF Documents: www.adobe.com/go/lrvid_025_acrx_en • How to Edit a Scanned PDF: www.adobe.com/go/learn_acr_edit_scans_en Note: Scanning at 300 dpi produces the best text for conversion. At 150 dpi, OCR accuracy is slightly lower. More Help topics "Adding unifying page elements" on page 110 Converting Scanned PDF Files to Other File Formats Recognize text in a single document 1 Open the scanned PDF. 2 Choose Tools > Recognize Text > In This File. 3 In the Recognize Text dialog box, select an option under Pages. 4 Optionally, click Edit to open the Recognize Text - General Settings dialog box, and specify the options as needed. Recognize text in multiple documents 1 In Acrobat, choose Tools > Recognize Text > In Multiple Files. 2 In the Recognize Text dialog box, click Add Files, and choose Add Files, Add Folders, or Add Open Files. Then select the files or folder. 3 In the Output Options dialog box, specify a target folder for output files, and filename preferences. 4 In the Recognize Text - General Settings dialog box, specify the options, and then click OK. Recognize Text - General Settings dialog box Primary OCR Language Specifies the language for the OCR engine to use to identify the characters. PDF Output Style Determines the type of PDF to produce. All options require an input resolution of 72 dpi or higher (recommended). All formats apply OCR and font and page recognition to the text images and convert them to normal text. • Searchable Image Ensures that text is searchable and selectable. This option keeps the original image, deskews it as needed, and places an invisible text layer over it. The selection for Downsample Images in this same dialog box determines whether the image is downsampled and to what extent. Last updated 10/11/2011

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242
  • 243
  • 244
  • 245
  • 246
  • 247
  • 248
  • 249
  • 250
  • 251
  • 252
  • 253
  • 254
  • 255
  • 256
  • 257
  • 258
  • 259
  • 260
  • 261
  • 262
  • 263
  • 264
  • 265
  • 266
  • 267
  • 268
  • 269
  • 270
  • 271
  • 272
  • 273
  • 274
  • 275
  • 276
  • 277
  • 278
  • 279
  • 280
  • 281
  • 282
  • 283
  • 284
  • 285
  • 286
  • 287
  • 288
  • 289
  • 290
  • 291
  • 292
  • 293
  • 294
  • 295
  • 296
  • 297
  • 298
  • 299
  • 300
  • 301
  • 302
  • 303
  • 304
  • 305
  • 306
  • 307
  • 308
  • 309
  • 310
  • 311
  • 312
  • 313
  • 314
  • 315
  • 316
  • 317
  • 318
  • 319
  • 320
  • 321
  • 322
  • 323
  • 324
  • 325
  • 326
  • 327
  • 328
  • 329
  • 330
  • 331
  • 332
  • 333
  • 334
  • 335
  • 336
  • 337
  • 338
  • 339
  • 340
  • 341
  • 342
  • 343
  • 344
  • 345
  • 346
  • 347
  • 348
  • 349
  • 350
  • 351
  • 352
  • 353
  • 354
  • 355
  • 356
  • 357
  • 358
  • 359
  • 360
  • 361
  • 362
  • 363
  • 364
  • 365
  • 366
  • 367
  • 368
  • 369
  • 370
  • 371
  • 372
  • 373
  • 374
  • 375
  • 376
  • 377
  • 378
  • 379
  • 380
  • 381
  • 382
  • 383
  • 384
  • 385
  • 386
  • 387
  • 388
  • 389
  • 390
  • 391
  • 392
  • 393
  • 394
  • 395
  • 396
  • 397
  • 398
  • 399
  • 400
  • 401
  • 402
  • 403
  • 404
  • 405
  • 406
  • 407
  • 408
  • 409
  • 410
  • 411
  • 412
  • 413
  • 414
  • 415
  • 416
  • 417
  • 418
  • 419
  • 420
  • 421
  • 422
  • 423
  • 424
  • 425
  • 426
  • 427
  • 428
  • 429
  • 430
  • 431
  • 432
  • 433
  • 434
  • 435
  • 436
  • 437
  • 438
  • 439
  • 440
  • 441
  • 442
  • 443
  • 444
  • 445
  • 446
  • 447
  • 448
  • 449
  • 450
  • 451
  • 452
  • 453
  • 454
  • 455
  • 456
  • 457
  • 458
  • 459
  • 460
  • 461
  • 462
  • 463
  • 464
  • 465
  • 466
  • 467
  • 468
  • 469
  • 470
  • 471
  • 472
  • 473
  • 474
  • 475
  • 476
  • 477
  • 478
  • 479
  • 480
  • 481
  • 482
  • 483
  • 484
  • 485
  • 486
  • 487
  • 488
  • 489
  • 490
  • 491
  • 492
  • 493
  • 494
  • 495
  • 496

50
USING ACROBAT X PRO
Creating PDFs
Last updated 10/11/2011
Avoid dithering or halftone scanner settings. These settings can improve the appearance of photographs, but they
make it difficult to recognize text.
For text printed on colored paper, try increasing the brightness and contrast by about 10%. If your scanner has
color-filtering capability, consider using a filter or lamp that drops out the background color. Or if the text isn’t
crisp or drops out, try adjusting scanner contrast and brightness to clarify the scan.
If your scanner has a manual brightness control, adjust it so that characters are clean and well formed. If characters
are touching, use a higher (brighter) setting. If characters are separated, use a lower (darker) setting.
Recognize text in scanned documents
You can use Acrobat to recognize text in previously scanned documents that have already been converted to PDF.
Optical character recognition (OCR) software enables you to search, correct, and copy the text in a scanned PDF. To
apply OCR to a PDF, the original scanner resolution must have been set at 72 dpi or higher.
For more information on text recognition, see these videos:
Recognizing Text in Scanned PDF Documents:
www.adobe.com/go/lrvid_025_acrx_en
How to Edit a Scanned PDF:
www.adobe.com/go/learn_acr_edit_scans_en
Note:
Scanning at 300 dpi produces the best text for conversion. At 150 dpi, OCR accuracy is slightly lower.
More Help topics
Adding unifying page elements
” on page
110
Converting Scanned PDF Files to Other File Formats
Recognize text in a single document
1
Open the scanned PDF.
2
Choose Tools > Recognize Text > In This File.
3
In the Recognize Text dialog box, select an option under Pages.
4
Optionally, click Edit to open the Recognize Text - General Settings dialog box, and specify the options as needed.
Recognize text in multiple documents
1
In Acrobat, choose Tools > Recognize Text > In Multiple Files.
2
In the Recognize Text dialog box, click Add Files, and choose Add Files, Add Folders, or Add Open Files. Then
select the files or folder.
3
In the Output Options dialog box, specify a target folder for output files, and filename preferences.
4
In the Recognize Text - General Settings dialog box, specify the options, and then click OK.
Recognize Text - General Settings dialog box
Primary OCR Language
Specifies the language for the OCR engine to use to identify the characters.
PDF Output Style
Determines the type of PDF to produce. All options require an input resolution of 72 dpi or higher
(recommended). All formats apply OCR and font and page recognition to the text images and convert them to normal text.
Searchable Image
Ensures that text is searchable and selectable. This option keeps the original image, deskews it as
needed, and places an invisible text layer over it. The selection for Downsample Images in this same dialog box
determines whether the image is downsampled and to what extent.