Fonts and Formats
Conexiom applications read what is called clear text.
Conexiom applications read a wide variety of text formats. However, for optimal use, the text must meet a narrower standard, as explained below.
Formats
Conexiom supports data from a large number of document formats:
Word | XML | ||
Excel/CSV | Text HTML | EDI | TXT |
Fonts
Processing through Conexiom’s Premier and Express applications provide an extremely high degree of accuracy for clear-text documents, regardless of document format. “Clear text” means that the document cannot be a picture of text, as a scanner or fax machine would provide. That type of image is a raster image and is more like a photograph than a typeface. Encrypted text presents a similar problem. It can be copied, but once pasted, it is no longer readable. Clear text is almost always from an established digital font. Think of the typefaces your word processing program produces, such as Times New Roman, Arial, Helvetica, etc. (See also Clear Text vs. Images of Text).
Verifying Text
Before sending in test documents, check them to see if they will be viable as samples. To check if a document is likely to be acceptable, follow these steps:
After opening a PDF document, right-click in empty space on the document and from the menu that displays, select Document Properties.
On the Description tab, look for the PDF Producer field. If it refers to the term “Ghost Script” in this field, the document will probably not work.
Go to the Fonts tab and check the Type. If it shows any of the fonts are embedded, it will probably not work.
If both of the above properties appear acceptable, close the Document Properties window.
To see if the text is clear text, do the following:
Press Ctrl-a to Select All of the text in the document.
Press Ctrl-c Copy the text.
Press Ctrl-v to Paste it into a blank document in a text-edit program such as Notepad (Windows) or TextEdit (Apple IOS). The text should be readable without any strange characters or large spaces.
Zoom in on the text.
If the edges of individual characters remain crisp and clean, the document is likely a good candidate for use.
If the text pixelates or gets jagged edges, then the document is likely not a good candidate for use in the Touchless Automation or Base Automation applications. (See Clear Text vs. Images of Text).