PDF Research Adobe Acrobat Components...
Capture Plug-in...

  1. Required Components
  2. Making a PDF file from a Paper Document


Many of a company's most important documents are not yet digital files. These documents are company assets, representing the final packaged results of countless man-hours of labor.

There are two ways to convert these paper documents into digital files:

  1. Scanning the document for a bitmap rendition saved as a digital file
  2. Capturing the document for content with OCR software or Adobe Capture®. Here the text of the document is "read" and converted into word processing text.

Scanning for a bitmap usually precedes character recognition. For some organizations it makes sense to convert huge reams of paper to bitmaps without capturing the content. Scanning is easy; indexing is hard and labor intensive.

Virtually every organization has documents for which capturing the content is desirable, because the text needs to be searchable and possibly cross-referenced to other documents in more detail than manual indexing can accomplish. In addition, text files take up much less storage than their bitmap renditions.

OCR scanning reads the document for content and saves the text as a word processing or text file.

Capture® software goes much further. It reads the page for content but also records the fonts, bold and italic styling, the layout of the document, and bitmap images. It then saves the result as a PDF document.

Adobe Acrobat Exchange® features a Capture plug-in that can be used for small capturing projects. It is a memory hog, so if you choose to use it, be sure to allocate at least 64 MB of RAM to Exchange before starting. This plug-in is not to be confused with the Adobe Capture® program, which is designed as an industrial-capacity utility for converting reams of documents to PDF. Its features are described in a different section of this website.

1. Required Components

  • A scanner that can make 200 dpi scans
  • Minimum system requirements
  • Acrobat Exchange® software (which is a program module of the Adobe Acrobat software suite)
  • Capture Plug-in is a part of Acrobat Exchange (NOTE: downloadable versions of this plug-in are available from the Adobe Acrobat site to registered owners of the program if they purchased the 3.x version before the plug-in was available for shipment).
  • 64 MB of RAM allocated to Exchange
  • Acrobat Catalog® for cataloging and indexing files
  • Acrobat Reader® for opening converted files
  • Acrobat Exchange® for conducting cross document indexed searches and for recomposing pages from different sources into new documents

NOTE: The Capture Plug-in described here is not to be confused with the full program sold by Adobe called Capture®. Its features and utility are described in a different section of this website.


Component Availability

PDFWriter, Acrobat Distiller®, Catalog, Reader, and Exchange are modules of the Adobe Acrobat package that can be purchased directly via mail order for about $200.

2. Making a PDF file from a Paper Document

Verify that you have the Capture plug-in installed under the Document menu of Acrobat Exchange®.

  • If not, it may not have been available at the time you purchased Acrobat 3.x. Check the Adobe Acrobat website for a downloadable version.
  • Install the downloaded plug-in per the attached instructions.

Make a grayscale scan of your target document at 200-600 dpi; 200 dpi is standard. Save it as a TIFF file (at 300 dpi with 256 grays, the resulting letter-size page is about 6.9 MB).
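The file size of a scan follows directly from its pixel dimensions and bit depth. The sketch below is only an illustration of that arithmetic: the function name and the 8 x 10 inch scan area are assumptions, not something stated here. A full 8.5 x 11 inch page at 300 dpi comes out somewhat larger than the 6.9 MB quoted above, while a slightly cropped 8 x 10 inch scan area happens to reproduce that figure.

```python
def uncompressed_scan_bytes(width_in: float, height_in: float,
                            dpi: int, bits_per_pixel: int = 8) -> int:
    """Raw image data size in bytes (no compression, no file header)."""
    width_px = round(width_in * dpi)
    height_px = round(height_in * dpi)
    # Each row of pixels is rounded up to a whole number of bytes.
    row_bytes = (width_px * bits_per_pixel + 7) // 8
    return row_bytes * height_px


MIB = 1024 * 1024

# 256 grays = 8 bits per pixel.
full_letter = uncompressed_scan_bytes(8.5, 11, 300)  # 2550 x 3300 pixels
cropped = uncompressed_scan_bytes(8, 10, 300)        # assumed scan area: 2400 x 3000 pixels

print(f"8.5 x 11 in at 300 dpi: {full_letter / MIB:.1f} MiB")
print(f"8 x 10 in at 300 dpi:   {cropped / MIB:.1f} MiB")
```

Doubling the resolution quadruples the size, which is why 200 dpi is a sensible default when storage or memory is tight.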

If you have Photoshop, save the TIFF as a PDF and open it in Exchange.

Otherwise, open a PDF document in Exchange. Select Import/Image in the File menu and choose the TIFF format file. At this point you can elect either to append the scanned page to the current PDF document or to create a new PDF document.

Select DOCUMENT/CAPTURE PAGES and the program will perform a complicated set of tasks to produce the final PDF file. (The resulting PDF file, depending upon the content of the original, can shrink to as little as 27 KB.)
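To put the two sizes quoted in this section side by side, here is a rough calculation of the best-case reduction and of what it would mean for a larger archive. It assumes the figures are in binary megabytes and kilobytes, and the 10,000-page archive is a hypothetical example. Real captured pages vary with content, so treat the result as an upper bound on the savings.

```python
SCAN_BYTES = 6.9 * 1024 * 1024  # grayscale TIFF size quoted in this section
CAPTURED_BYTES = 27 * 1024      # best-case captured PDF size quoted in this section


def reduction_ratio(before: float, after: float) -> float:
    """How many times smaller the 'after' file is than the 'before' file."""
    return before / after


def archive_size_gib(pages: int, bytes_per_page: float) -> float:
    """Total storage in GiB for a given number of equally sized pages."""
    return pages * bytes_per_page / 1024 ** 3


ratio = reduction_ratio(SCAN_BYTES, CAPTURED_BYTES)
pages = 10_000  # hypothetical archive size

print(f"Best-case reduction: about {ratio:.0f} to 1")
print(f"{pages} scanned pages:  {archive_size_gib(pages, SCAN_BYTES):.1f} GiB")
print(f"{pages} captured pages: {archive_size_gib(pages, CAPTURED_BYTES):.2f} GiB")
```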

The images of "suspects" will be retained along with the program's best guess concerning the character recognition of the source document. Go to Edit/Find First Suspect to initiate finding and correcting suspects.

Modify the file in Adobe Acrobat Exchange®:

  1. Open Exchange and edit your files - add other pages, crop, rotate, create links and bookmarks, append notes, etc.
  2. "Save As" an optimized file and add security passwords if desired. Optimizing reduces the size of the PDF file and enables byteserving (a.k.a. linearization), which means that the end user can view the document one page at a time while the rest of it downloads in the background.
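One way to confirm that a saved file was actually optimized is to look for the linearization dictionary, which the PDF format places near the very start of a linearized file. The sketch below is a simple heuristic rather than a full parser. The function names and sample byte strings are illustrative assumptions, and byteserving also depends on the web server supporting byte-range requests, which this check cannot verify.

```python
def looks_linearized(pdf_bytes: bytes) -> bool:
    """Heuristic: a linearized PDF carries a /Linearized key in its first 1024 bytes."""
    head = pdf_bytes[:1024]
    return b"%PDF-" in head and b"/Linearized" in head


def file_looks_linearized(path: str) -> bool:
    """Apply the same check to a file on disk, reading only its first 1024 bytes."""
    with open(path, "rb") as handle:
        return looks_linearized(handle.read(1024))


# Minimal made-up samples, just enough to exercise the check.
linearized_sample = b"%PDF-1.2\n1 0 obj\n<< /Linearized 1 /L 27648 /N 3 >>\nendobj\n"
plain_sample = b"%PDF-1.2\n1 0 obj\n<< /Type /Catalog >>\nendobj\n"

print(looks_linearized(linearized_sample))
print(looks_linearized(plain_sample))
```

Editing a file and saving it incrementally can leave the marker in place while invalidating the page-at-a-time layout, so re-optimize after the last edit.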

Catalog the files using Adobe Acrobat Catalog®.

The end user can open the individual files in Acrobat Reader® (free) or in Acrobat Exchange®.


a production of Performance Graphics
©1998 The Miller De Wulf Corporation