Document Format Information

Below is an outline of the most common data formats for scanned documents.
This information is provided purely to provide a basic explanation
of the type of data services that we provide.
For more detailed information regarding the best method for your purposes click here.
 
FORMAT
FEATURES AND USES
OCR
Optical Character Recognition
  • Keeps documents in a format that allows "one touch" import into most word processors.
  • Allows you to use Digital Business Forms within word processors as you would with pen and paper.
  • Easy portability between system platforms - Mac, PC, UNIX, XENIX, LINUX
  • Easy to import into other document formats or for internet and intranet use - see PDF
  • Very easy to change or update the information.
  • Relatively small file sizes for easy storage. One 8½ x 11 document = 30K approx.
  • Document are grayscale - no colour, but the image quality is crisp.
PDF
Portable Document Format
  • PDF files can be used on any system platform - Mac, PC, UNIX, XENIX, LINUX
  • Documents can be easily converted for use in internet, intranet, HTML and Java applications.
  • Easy use of Digital Business Forms.
  • Easy to export for use in most word processing applications.
  • Allows easy use and linking of mixed file types such as text, image, movie, sound and other multi-media files.
  • Allows inclusion of actions with all types data files:
    • Automatically print, fax, or e-mail digital forms once completed.
    • Click an image or link to start a multi-media file.
    • Turn your company catalogue into a multi use digital presentation:
      • Put your catalogue on CD
      • Put the files on the internet
  • PDF has larger file sizes than OCR as a rule ( one 8½ x 11 page = 4 - 6 Megabytes approx.), but that is generally when documents are scanned full colour.
  • Easy to change and update the information on any page, form or document.
  • You can use OCR files within PDF.
  • Crisp images in grayscale or colour.
HTML
Hyper Text Mark-up Language
  • Can be used on any platform in any internet browser - e.g. Netscape, Internet Explorer
  • Relatively small file sizes.
  • Can be used on any computer without an internet connection.
  • Easy to share files among all computers in a networked environment.
  • Files are easy to link, so it works well as a presentation source.
  • Links on disk files can be used on the internet for customer orders, product updates, e-mail and various other business purposes.
Grayscale Images
&
Line Art
  • Used primarily to scan plain text documents into an image format.
  • Grayscale images, like other images, are usually scanned for and used on one platform: however, your software may allow portability.
  • Larger file sizes than text or OCR, but smaller than colour.
  • Easy to convert between the various image formats.
  • Good format to use for documents that will be faxed repeatedly, or a backup of non-essential images or hard copy catalogues.
Colour Images
  • Large file sizes.
  • Generally used for scanning images to be used for graphic applications.
  • Easy to convert between image file formats.
  • Easy to import into PDF format for backup, presentations, or other dynamic digital purposes.
  • Usable in most popular imaging software.