Adobe 62000112DM User Guide - Page 76

Correct OCR text in PDFs, Enable Fast Web View in a PDF



ADOBE ACROBAT 3D VERSION 8
User Guide
Black-and-white scanning at 300 ppi produces the best text for conversion. At 150 ppi, OCR accuracy is slightly lower,
and more font-recognition errors occur. For text printed on colored paper, try increasing the brightness and contrast
by about 10%. If your scanner has color-filtering capability, consider using a filter or lamp that drops out the background
color.
Downsample Images
Decreases the number of pixels in color, grayscale, and monochrome images after OCR is
complete. Choose the degree of downsampling that you want to apply. Higher-numbered options do less
downsampling, producing higher-resolution PDFs.
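To illustrate what downsampling does to pixel counts, here is a minimal Python sketch. It is not Acrobat's algorithm, which this guide does not describe; it assumes a simple box filter (averaging each block of pixels) on a grayscale image stored as a list of rows. The function names `downsample` and `pixel_count` are hypothetical.

```python
def downsample(pixels, factor):
    """Box-filter downsample of a grayscale image given as a list of rows.

    Illustrative only: each factor x factor block is replaced by its
    integer average. Edge pixels that do not fill a block are dropped.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    height = len(pixels) // factor
    width = len(pixels[0]) // factor if pixels else 0
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            block = [
                pixels[y * factor + dy][x * factor + dx]
                for dy in range(factor)
                for dx in range(factor)
            ]
            row.append(sum(block) // len(block))
        out.append(row)
    return out


def pixel_count(width_in, height_in, ppi):
    """Total pixels for a page of the given size (inches) at a resolution."""
    return round(width_in * ppi) * round(height_in * ppi)


image = [
    [0, 2, 4, 6],
    [2, 4, 6, 8],
    [10, 10, 20, 20],
    [10, 10, 20, 20],
]
small = downsample(image, 2)  # 4x4 image becomes 2x2
```

Halving the resolution of a letter-size page from 300 ppi to 150 ppi cuts the pixel count to one quarter, because both dimensions shrink.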
Correct OCR text in PDFs
When you scan to Formatted Text & Graphics output, Acrobat analyzes bitmaps of text and substitutes words and
characters for those bitmap areas. If the ideal substitution is uncertain, Acrobat marks the word as suspect. Suspects
appear in the PDF as the original bitmap of the word, but the text is included on an invisible layer behind the bitmap
of the word. This makes the word searchable even though it is displayed as a bitmap. You can accept these suspects
as they are, or you can use the TouchUp Text tool to correct them.
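The relationship between the visible bitmap and the hidden text layer can be modeled with a small Python sketch. This is a conceptual model only, not Acrobat's internal representation: the `RecognizedWord` class, the numeric confidence score, and the `SUSPECT_THRESHOLD` value are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical cutoff; the guide does not say how Acrobat decides
# that a substitution is uncertain.
SUSPECT_THRESHOLD = 0.80


@dataclass
class RecognizedWord:
    text: str          # best-guess substitution, kept in the hidden layer
    confidence: float  # assumed score between 0.0 and 1.0

    @property
    def is_suspect(self):
        return self.confidence < SUSPECT_THRESHOLD

    @property
    def displayed_as(self):
        # Suspects keep showing the original bitmap; other words show text.
        return "bitmap" if self.is_suspect else "text"


def search(words, query):
    """Return indexes of matching words; suspects match via hidden text."""
    return [i for i, w in enumerate(words) if w.text.lower() == query.lower()]


words = [RecognizedWord("Acrobat", 0.99), RecognizedWord("modem", 0.42)]
hits = search(words, "modem")
```

In this model, the second word is displayed as a bitmap yet is still found by the search, which mirrors the behavior described above.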
Note: If you try to select text in a scanned PDF that does not have OCR applied, or try to perform a Read Out Loud
operation on an image file, Acrobat asks if you want to run OCR. If you click OK, the Recognize Text dialog box opens
and you can select options, which are described in detail under the previous topic.
1 Do one of the following:
• Choose Document > OCR Text Recognition > Find All OCR Suspects. All suspect words on the page are enclosed
in boxes. Click any suspect word to show the suspect text in the Find Element dialog box.
• Choose Document > OCR Text Recognition > Find First OCR Suspect.
Note: If you close the Find Element window before correcting all suspect words, you can return to the process by choosing
Document > OCR Text Recognition > Find First OCR Suspect, or by clicking any suspect word with the TouchUp Text tool.
2 In the Find option, choose OCR Suspects.
3 Compare the word in the Suspect text box with the actual word in the scanned document, and accept, correct, or
ignore the word. If the suspect was incorrectly identified as text, click the Not Text button.
4 Review and correct the remaining suspect words, and then close the Find Element dialog box.
Enable Fast Web View in a PDF
Fast Web View restructures a PDF document for page-at-a-time downloading (byte-serving) from web servers. With
Fast Web View, the web server sends only the requested page, rather than the entire PDF. This is especially important
with large documents that can take a long time to download from a server.
Check with your webmaster to make sure that the web server software you use supports page-at-a-time
downloading. To ensure that the PDF documents on your website appear in older browsers, you may also want to
create HTML links (versus ASP scripts or the POST method) to the PDF documents and use relatively short path
names (256 characters or fewer).
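As a rough way to check these two points yourself, the following Python sketch inspects a server's response headers for the standard HTTP `Accept-Ranges: bytes` header and measures the length of a link's path. It rests on assumptions: the helper names are hypothetical, the 256-character limit is applied to the path component of the URL only, and a server that omits `Accept-Ranges` may still honor range requests, so a negative result is not conclusive.

```python
from urllib.parse import urlsplit
from urllib.request import Request, urlopen

MAX_PATH_CHARS = 256  # limit suggested in the text above


def supports_byte_serving(headers):
    """True if the headers advertise byte-range support."""
    normalized = {k.lower(): v for k, v in headers.items()}
    units = normalized.get("accept-ranges", "")
    return "bytes" in [u.strip().lower() for u in units.split(",")]


def path_is_short_enough(url):
    """True if the URL's path component is within the suggested limit."""
    return len(urlsplit(url).path) <= MAX_PATH_CHARS


def fetch_headers(url):
    """Send a HEAD request and return the response headers as a dict.

    Requires network access; not called automatically.
    """
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=10) as response:
        return dict(response.headers.items())
```

For example, `supports_byte_serving(fetch_headers(url))` could be run against the address of a PDF on your own site; your webmaster remains the authoritative source.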
Verify that an existing PDF is enabled for Fast Web View
Do one of the following:
• Open the PDF in Acrobat, and choose File > Properties. Look in the lower right area of the Description panel of
the dialog box for the Fast Web View setting (Yes or No).
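Outside Acrobat, a quick heuristic is possible because Fast Web View corresponds to what the PDF specification calls a linearized file, which carries a `/Linearized` entry in a dictionary near the start of the file. The Python sketch below looks for that marker in the first 1024 bytes. It is an approximation, not the check Acrobat performs: a file that was linearized and later changed by an incremental save may still contain the marker. The function names are hypothetical.

```python
def looks_linearized(data):
    """Heuristic: PDF header and /Linearized key in the first 1024 bytes."""
    head = data[:1024]
    return b"%PDF-" in head and b"/Linearized" in head


def file_looks_linearized(path):
    """Apply the heuristic to a file on disk, reading only its start."""
    with open(path, "rb") as f:
        return looks_linearized(f.read(1024))


linearized_sample = b"%PDF-1.6\n1 0 obj\n<< /Linearized 1 /L 1234 >>\nendobj\n"
plain_sample = b"%PDF-1.6\n1 0 obj\n<< /Type /Catalog >>\nendobj\n"
```

Treat a positive result as a hint and confirm it in the Properties dialog box when the answer matters.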