The HTML Tab in the Formats Settings Dialog

Set parameters for saving the recognized text into an HTML file:

Option name

Option description

Retain layout

Set the layout retention. The following choices are available:

  • Original layout
    Select this option if you want the recognition results to look exactly like the original document.
    Note: This option leaves little room for editing the recognized text. It is most suitable for short artistic or brochure-like documents.
  • Remove all formatting
    Only the table structure and the arrangement into paragraphs are retained. All text is set in a single font at a single size.
  • Retain font and font size
    The table structure, the arrangement into paragraphs, the fonts, and the font sizes are retained in this mode.

Tip. If you wish to retain pictures in the recognition results, make sure that the Keep pictures option is selected in the Picture settings group.

Save mode

Set the saving mode. The following choices are available:
  • Full
    If you set this option, the new HTML format, HTML 4, is used. It supports every type of document layout retention (the type actually used depends on the option selected in the Retain layout drop-down list), and the built-in style sheet is used.
    Note: Internet Explorer 4.0 or later is required for viewing a document saved in this mode.
  • Simple
    This saving mode produces documents compatible with all browsers, including out-of-date versions such as Internet Explorer 3.0. If you set this option, the HTML 3 format is used. The document layout is retained only approximately: first-line indents and indents in tables are not retained.
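
The difference between the two saving modes can be pictured with a small sketch. It is not ABBYY FineReader's actual output: the function name `render_paragraph`, the inline `style` attribute (used here for brevity in place of a built-in style sheet), and the point-based indent are all assumptions made for illustration. It only shows why a first-line indent can survive in a style-based mode and is dropped in a plain-markup mode.

```python
from html import escape


def render_paragraph(text, first_line_indent_pt, mode="Full"):
    """Hypothetical helper: render one paragraph in a Full-like or Simple-like mode."""
    safe = escape(text)
    if mode == "Full":
        # Assumption: style information carries the first-line indent.
        return f'<p style="text-indent: {first_line_indent_pt}pt">{safe}</p>'
    if mode == "Simple":
        # Plain markup only, so the first-line indent is not retained.
        return f"<p>{safe}</p>"
    raise ValueError(f"unknown save mode: {mode}")


print(render_paragraph("Recognized text", 18, mode="Full"))
print(render_paragraph("Recognized text", 18, mode="Simple"))
```

In this sketch the Simple branch ignores the indent entirely, which mirrors the approximate layout retention described for that mode.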
Text settings group
Keep line breaks

Set this option if you want the original arrangement into lines to be retained in the HTML file; otherwise, the text in the HTML file will be formatted as a single line.

Retain text color

Set this option if you want the original character color to be retained.

Use solid line as page break

Set this option if you want the original arrangement into pages to be retained; the pages will be separated by a solid line.
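
As a rough illustration of the Keep line breaks and Use solid line as page break options, the sketch below builds HTML from recognized lines grouped by page. The helper `render_pages`, its parameter names, and the choice of `<br>` and `<hr>` tags are assumptions for illustration only; the markup the program really writes may differ.

```python
from html import escape


def render_pages(pages, keep_line_breaks=True, solid_line_page_break=True):
    """Hypothetical helper: pages is a list of pages, each a list of text lines."""
    rendered = []
    for lines in pages:
        safe = [escape(line) for line in lines]
        if keep_line_breaks:
            # Keep the original arrangement into lines.
            body = "<br>\n".join(safe)
        else:
            # Without the option, the text runs on as a single line.
            body = " ".join(safe)
        rendered.append(f"<p>{body}</p>")
    # Assumption: a horizontal rule stands in for the solid line between pages.
    separator = "\n<hr>\n" if solid_line_page_break else "\n"
    return separator.join(rendered)


pages = [["First line", "second line"], ["Page two & more"]]
print(render_pages(pages))
print(render_pages(pages, keep_line_breaks=False, solid_line_page_break=False))
```

Calling the helper with both flags turned off shows the other extreme: each page collapses to one line and nothing marks the page boundary.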
Picture settings group
Keep pictures

Set this option to keep pictures in the recognized text. This option is set by default.

Quality

Set the quality for saving pictures. Select one of the following values:
  • High if you are planning to print the recognition results.
  • Medium if the recognition results are intended for viewing on the screen. This value is set by default.
  • Low if you are planning to place the recognition results on the Web.

The higher the value you specify in this field, the higher the quality of the saved pictures and the larger the resulting file.
Tip. To find the best balance between size and quality, try saving the recognition results with different Quality values and then open them in an image viewing application.
Note: This option is disabled if the Keep pictures option is not selected.

Format

Specify the format for saving pictures. The following choices are available:

  • Automatic
    If this option is selected, ABBYY FineReader will automatically select the most suitable format. This value is set by default.
  • JPEG, Color (for photos)
  • JPEG, Gray (for photos)
  • PNG, Color (for charts, diagrams)
  • PNG, Gray (for charts, diagrams)
  • PNG, Black and white

Note: This option is disabled if the Keep pictures option is not selected.

Character encoding
Code page

By default, the code page is detected automatically; select the Automatic value to use automatic detection. If necessary, you can select the code page manually by choosing the value you need from the list.

Code page type

Specifies the code page type (Windows, DOS, Macintosh, ISO).
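
To see why the code page type matters, the sketch below encodes the same character under one example code page from each family. The mapping `CODE_PAGES` and the helper `encode_for` are hypothetical and are not taken from the program; the codec names are standard Python codecs for Western European text, chosen as an assumption about what a typical list entry might correspond to.

```python
# Hypothetical mapping: one Western European example per code page type.
CODE_PAGES = {
    "Windows": "cp1252",
    "DOS": "cp850",
    "Macintosh": "mac_roman",
    "ISO": "iso8859_1",
}


def encode_for(code_page_type, text):
    """Encode text with the example code page of the given type."""
    return text.encode(CODE_PAGES[code_page_type])


# The same character can map to a different byte under each code page type.
for kind in CODE_PAGES:
    print(kind, encode_for(kind, "\u00e9").hex())
```

A file saved with one code page and read back with another would show the wrong character wherever the byte values differ, which is the case in which selecting the code page manually is useful.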