Fonts, Formats, and Text Conexiom Reads

Conexiom applications read a wide variety of formats. However, the text must meet a narrower standard, as explained below.

Formats

Conexiom supports data from a large number of document formats:

PDF

Email

Word

XML

Excel/CSV

Text HTML

EDI

TXT

 
Fonts
Processing through Conexiom’s Touchless Automation application provides 100% accuracy for clear-text documents, regardless of document format. “Clear text” means that the document cannot be a picture of text, as a scanner or fax machine would provide. That type of image is a raster image and is more like a photograph than a typeface. Encrypted text presents a similar problem. It can be copied, but once pasted, it is no longer readable. Clear text is almost always from an established digital font. Think of the typefaces your word processing program produces, such as Times New Roman, Arial, Helvetica, etc. (See also Clear Text vs. Images of Text).

Verifying Text

Before sending in test documents, check them to see if they will be viable as samples. To check if a document is likely to be acceptable, follow these steps:

  1. After opening a PDF document, right-click in empty space on the document and from the menu that displays, select Document Properties.

  2. On the Description tab, look for the PDF Producer field. If it refers to the term “Ghost Script” in this field, the document will probably not work.

  3. Go to the Fonts tab and check the Type. If it shows any of the fonts are embedded, it will probably not work.

  4. If both of the above properties appear acceptable, close the Document Properties window.

  5. To see if the text is clear text, do the following:

    1. Press Ctrl-a to Select All of the text in the document.

    2. Press Ctrl-c Copy the text.

    3. Press Ctrl-v to Paste it into a blank document in a text-edit program such as Notepad (Windows) or TextEdit (Apple IOS). The text should be readable without any strange characters or large spaces.

    4. Zoom in on the text.
      If the edges of individual characters remain crisp and clean, the document is likely a good candidate for use.
      If the text pixelates or gets jagged edges, then the document is likely not a good candidate for use in the Touchless Automation or Base Automation applications. (See Clear Text vs. Images of Text).