Glossary

A

ABBYY Business Card Reader is a handy application that enables users to scan business cards, capture the contact data, and export the captured contacts into various electronic formats. The application can also handle scans and photos of business cards stored on users' computers.

ABBYY FineReader document is an object created by ABBYY FineReader to process a paper document. It contains the images of the document pages, their recognized text (if any), and the program settings.

ABBYY Hot Folder is a scheduling agent which allows you to select a folder with images and specify the time for processing the images in this folder. At the specified time, the images from the selected folder will be processed automatically.

ABBYY Screenshot Reader is an application that enables you to create screenshots and recognize texts on them.

Abbreviation is a shortened form of a word or phrase used to represent the whole, for example MS-DOS (for Microsoft Disk Operating System) or UN (for United Nations).

Activation is the process of obtaining a special code from ABBYY which allows users to run their copy of the software in full mode on a given computer.

Activation code is a code that is issued by ABBYY to each user of ABBYY FineReader 11 during the activation procedure. An activation code is used to activate ABBYY FineReader on the computer that generated the Product ID.

Active area is a selected area on an image that can be deleted, moved or modified. To make an area active, click it. The frame enclosing an active area is bold and has small squares that can be dragged to change the size of the area.

Automatic Document Feeder (ADF) is a device that automatically feeds documents to a scanner. A scanner with an ADF can scan multiple pages without manual intervention. ABBYY FineReader supports multi-page documents.

ADRT® (Adaptive Document Recognition Technology) is a technology that increases the quality of conversion of multi-page documents. For example, it can recognize such structural elements as headings, headers and footers, footnotes, page numbering, and signatures.

Area is a section of an image enclosed by a frame and containing a certain type of data. Before performing OCR, ABBYY FineReader detects text, picture, table, and barcode areas in order to determine which sections of the image should be recognized and in what order.

Area template is a template that contains information about the size and location of the areas for a set of similar-looking documents.

B

Background picture area is an image area that contains a picture with text printed over it.

Barcode area is an image area that contains a barcode.

Brightness is a scanning parameter that indicates the contrast between the black and white areas on an image. Setting the correct brightness value increases recognition quality.

C

Code page is a table that establishes correspondences between characters and their codes. Users can select the characters they need from those available in a code page.
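
As a minimal illustration of such a table, the sketch below uses Python's built-in codecs, not anything taken from ABBYY FineReader. It looks up the same code in two standard Windows code pages; the helper name decode_byte is hypothetical.

```python
def decode_byte(value: int, code_page: str) -> str:
    """Look up the character that a code page assigns to one byte value."""
    return bytes([value]).decode(code_page)


# The same code (0xE4) corresponds to different characters in different code pages.
latin = decode_byte(0xE4, "cp1252")     # Western European code page
cyrillic = decode_byte(0xE4, "cp1251")  # Cyrillic code page
print(latin, cyrillic)
```

This is why a text can only be read correctly with the code page under which it was saved.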

Color mode determines whether document colors are to be retained. Black-and-white images produce smaller FineReader documents and are faster to process.

Compound word is a word made up of two or more stems (general meaning). In ABBYY FineReader, a compound word is a word which is not in the dictionary but which the program thinks may be made up of two or more dictionary words.
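
The idea of a word that is absent from the dictionary but can be assembled from dictionary words can be sketched as follows. This is an illustrative check only and is not ABBYY FineReader's actual method; the function split_compound and the sample dictionary are assumptions made for this example.

```python
from typing import List, Optional, Set


def split_compound(word: str, dictionary: Set[str]) -> Optional[List[str]]:
    """Return dictionary words that concatenate to `word`, or None."""
    if word in dictionary:
        return None  # a dictionary word is not treated as a compound here
    # best[i] holds one way to split word[:i] into dictionary words.
    best: List[Optional[List[str]]] = [None] * (len(word) + 1)
    best[0] = []
    for end in range(1, len(word) + 1):
        for start in range(end):
            part = word[start:end]
            if best[start] is not None and part in dictionary:
                best[end] = best[start] + [part]
                break
    parts = best[len(word)]
    return parts if parts and len(parts) >= 2 else None


sample_dictionary = {"book", "shelf", "note"}  # hypothetical word list
print(split_compound("bookshelf", sample_dictionary))
```

A word that cannot be split completely into dictionary words yields None, so it would not be treated as a compound under this sketch.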

D

Document analysis is a process of identifying the elements of the logical structure of a document and areas with different types of data. Document analysis can be carried out automatically or manually.

Document open password is a password which prevents users from opening a PDF document unless they type the password specified by the author.

Document options is the set of options that can be selected in the Options dialog box (Tools>Options). Document options also include user languages and patterns. Document options can be saved and then used in other ABBYY FineReader documents.

Dots per inch (dpi) is a measure of image resolution.

Driver is a software program that controls a computer peripheral (e.g. a scanner or a monitor).

F

Font effects are attributes that change the appearance of a font (i.e. bold, italic, underlined, strikethrough, subscript, superscript, small caps).

I

Ignored characters are any non-letter characters found in words (e.g. syllable characters or stress marks). These characters are ignored during the spell check.

Inverted image is an image with white characters against a dark background.

L

License Manager is a utility used for managing ABBYY FineReader licenses and activating ABBYY FineReader 11 Corporate Edition.

Ligature is a combination of two or more characters which are "glued together" (e.g. fi, fl, ffi). These characters are difficult for ABBYY FineReader to separate. Treating them as a single compound character improves OCR accuracy.
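
Unicode encodes a few common ligatures as single characters, which gives a small way to see the "single compound character" idea in practice. The sketch below uses Python's standard unicodedata module and says nothing about how ABBYY FineReader handles ligatures internally; the helper name expand_ligatures is hypothetical.

```python
import unicodedata


def expand_ligatures(text: str) -> str:
    """Replace compatibility characters such as U+FB01 with plain letters."""
    return unicodedata.normalize("NFKC", text)


single = "\ufb01"  # LATIN SMALL LIGATURE FI, stored as one character
print(len(single))
print(expand_ligatures("\ufb01le"))
```

Note that NFKC normalization also rewrites other compatibility characters, so it is broader than a ligature-only replacement.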

M

Monospaced font is a font (such as Courier New) in which all characters are equally spaced. For better OCR results on monospaced fonts, select Tools>Options..., click the Document tab, and select Typewriter under Document print type.

O

Omnifont system is a recognition system that recognizes characters set in any font and font size without prior training.

Optional hyphen is a hyphen (¬) that indicates exactly where a word or word combination should be split if it occurs at the end of a line (e.g. "autoformat" should be split into "auto-format"). ABBYY FineReader replaces all hyphens found in dictionary words with optional hyphens.

P

Page layout is the arrangement of text, tables, pictures, paragraphs, and columns on a page. The fonts, font sizes, font colors, text background, and text orientation are also part of the page layout.

Page layout analysis is the process of detecting areas on a page image. Areas can be of six types: text, picture, table, barcode, background picture, and recognition area. Page layout analysis can be performed automatically when the user clicks the Read button, or manually by the user prior to OCR.

Paradigm is the set of all grammatical forms of a word.

Pattern is a set of pairs, each linking a character image to the actual character it represents.

PDF security settings are restrictions that prevent a PDF document from being opened, edited, copied or printed. These settings include Document Open Passwords, Permissions Passwords, and encryption levels.

Permissions Password is a password which prevents other users from printing and editing a PDF document unless they type the password specified by the author. If some security settings are selected for the document, other users will not be able to change these settings until they type the password.

Picture area is an image area that contains a picture. This type of area may enclose an actual picture or any other object that should be displayed as a picture (e.g. a section of text).

Primary form is the "dictionary" form of a word (headwords of dictionary entries are usually given in their primary forms).

Print type is a parameter reflecting how the source text was printed (on a laser printer or a similar device, on a typewriter, etc.). For laser-printed texts, select Auto; for typewritten texts, select Typewriter; for faxes, select Fax.

Product ID is a parameter that is automatically generated on the basis of the hardware configuration when activating ABBYY FineReader on a given computer.

Prohibited characters are characters that will never occur in a text to be recognized. Such characters may be included in a list of prohibited characters. Specifying prohibited characters increases the speed and quality of OCR.

R

Resolution is a scanning parameter measured in dots per inch (dpi). A resolution of 300 dpi should be used for texts set in 10 pt fonts and larger; 400 to 600 dpi is preferable for texts set in smaller fonts (9 pt and less).
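
The arithmetic behind this recommendation can be sketched as follows. One typographic point is 1/72 inch, so smaller fonts need a higher resolution to cover a similar number of pixels. The function names are hypothetical, and treating sizes between 9 and 10 pt as "small" is an assumption of this sketch, not a statement from ABBYY.

```python
POINTS_PER_INCH = 72  # one typographic point is 1/72 inch


def font_height_in_pixels(font_size_pt: float, dpi: int) -> float:
    """Nominal font size converted to pixels at a given scan resolution."""
    return font_size_pt / POINTS_PER_INCH * dpi


def suggested_dpi(font_size_pt: float) -> int:
    """300 dpi for 10 pt and larger, otherwise 400 dpi.

    400 is only the lower end of the 400 to 600 dpi range; sizes between
    9 and 10 pt are treated as small here, which is an assumption.
    """
    return 300 if font_size_pt >= 10 else 400


for size in (12, 10, 8):
    dpi = suggested_dpi(size)
    print(size, dpi, round(font_height_in_pixels(size, dpi), 1))
```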

Recognition area is an image area that ABBYY FineReader should analyze automatically.

S

Scanner is a device for inputting images into a computer.

Scanning mode is a scanning parameter that determines whether an image must be scanned in black and white, grayscale, or color.

Separators are symbols that can separate words (e.g. /, \, dash) and that are separated by spaces from the words themselves.

Support ID is a unique identifier of a serial number with information about the license and the computer on which it is used. A support ID provides additional protection and is checked by the technical support service before providing technical support.

T

Table area is an image area that contains data in table form. When the application reads this type of area, it draws vertical and horizontal separators inside the area to form a table. This area is then rendered as a table in the output text.

Tagged PDF is a PDF document which contains information about the document structure, such as its logical parts, pictures, and tables. The structure of a document is encoded in PDF tags. A PDF file with such tags may be reflowed to fit different screen sizes and will display well on handheld devices.

Task Manager is an ABBYY FineReader feature that allows you to run an automated task, create and modify automated tasks, and delete custom automated tasks which you no longer use.

Text area is an image area that contains text. Note that text areas should only contain single-column text.

Training is establishing a correspondence between a character image and the character itself. (For details, see the Recognition with Training section.)

U

Uncertain characters are characters that may have been recognized by the program incorrectly. ABBYY FineReader highlights uncertain characters.

Uncertain words are words containing one or more uncertain characters.
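
The relationship between uncertain characters and uncertain words can be sketched as follows. The data layout (each character paired with an "uncertain" flag) and the function uncertain_words are assumptions made for illustration; they do not describe ABBYY FineReader's internal representation.

```python
from typing import List, Tuple

# Each recognized character is paired with a flag: True means "uncertain".
RecognizedChar = Tuple[str, bool]


def uncertain_words(chars: List[RecognizedChar]) -> List[str]:
    """Return the words that contain at least one uncertain character."""
    result: List[str] = []
    word, flagged = "", False
    # A trailing space acts as a sentinel so the last word is also checked.
    for ch, is_uncertain in chars + [(" ", False)]:
        if ch.isspace():
            if word and flagged:
                result.append(word)
            word, flagged = "", False
        else:
            word += ch
            flagged = flagged or is_uncertain
    return result


recognized = [(c, False) for c in "scan the page"]
recognized[6] = ("h", True)  # mark the "h" in "the" as uncertain
print(uncertain_words(recognized))
```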

Unicode is a standard developed by the Unicode Consortium (Unicode, Inc.). The standard is an international encoding system for processing texts. It was originally designed around 16-bit character codes and now defines code points from U+0000 to U+10FFFF. The standard is easily extended. The Unicode Standard defines the character encoding, the character properties, and the procedures used in processing texts written in a given language.
