Saving Recognized Text in HTML Format

All saving options for HTML format are set on the HTML tab in the Formats Settings dialog. To open this dialog, click the Formats Settings button on the Save tab of the Options dialog or press CTRL+SHIFT+X.

Note: When saving text in HTML format, ABBYY FineReader uses the fonts set on the Save tab in the Options dialog (Tools>Options menu) or those you set during text editing in the Text window.

The following options enable you to customize the saving mode so that the resulting document is most suitable for later retrieval and processing:

Retaining page layout

Layout retention modes are set in the Retain layout group. The following choices are available:

Format options

HTML formats available:

  1. Full (uses CSS and requires Internet Explorer 4.0 or later) - the latest HTML format, HTML 4, is used. HTML 4 supports all document layout retention types (the retention type actually used depends on the options set in the Retain layout group on the Save tab). The built-in style sheet is used.
    Note: Internet Explorer 4.0 or later is required for viewing a document saved in this mode.
  2. Simple (compatible with all (Internet-) browsers) - HTML 3 format is used. The document layout is retained only approximately: the first line indent is not retained, but the approximate font size is. HTML 3 supports only a limited number of font sizes, so ABBYY FineReader chooses the HTML 3 font size that corresponds most closely to the actual font size of your text. This HTML format is supported by all browsers (Netscape Navigator, Internet Explorer 3.0 and later).
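
The font size substitution described for the Simple mode can be pictured as a nearest-match lookup. The Python sketch below is only an illustration: the point values assigned to the seven HTML 3 font sizes follow a common browser convention and are an assumption, not a table documented by ABBYY FineReader, and the name nearest_html_font_size is hypothetical.

```python
# Assumed point sizes for HTML <font size="1"> .. <font size="7">.
# This is a common browser convention, not an ABBYY FineReader table.
HTML_FONT_SIZES_PT = {1: 8, 2: 10, 3: 12, 4: 14, 5: 18, 6: 24, 7: 36}


def nearest_html_font_size(actual_pt: float) -> int:
    """Return the HTML font size (1-7) whose assumed point size is closest."""
    return min(
        HTML_FONT_SIZES_PT,
        key=lambda size: abs(HTML_FONT_SIZES_PT[size] - actual_pt),
    )


if __name__ == "__main__":
    # Print the substituted HTML size for a few sample point sizes.
    for pt in (7.5, 12, 20, 48):
        print(f"{pt} pt -> HTML font size {nearest_html_font_size(pt)}")
```

Under this assumed table, any size between two steps is rounded to the nearer step, which is why only the approximate font size survives in the Simple mode.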

Text settings

Note that the default state of each Text settings option (set or cleared) depends on the page layout retention mode selected above.

Picture Settings

If you wish to keep pictures in the recognized text, make sure that the Keep pictures option is set in the Picture settings group.

If the recognized document contains many pictures, you can reduce the size of the resulting file: select the desired picture quality and format in the Picture settings group.

Quality

Three quality levels are available in the Quality drop-down list. Select:

The higher the value you select in the Quality drop-down list, the higher the quality of the saved pictures. This value also affects the file size: the higher the value, the larger the resulting file.

Tip. To find the best size/quality balance, save the recognition results with different Quality values, and then open them in an image viewing application.
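
To put numbers next to the visual comparison, the total size of the pictures in each saved result can be measured. The Python sketch below assumes that each trial was saved to its own folder; the folder names, the list of picture extensions, and the function names are hypothetical and are not produced or required by ABBYY FineReader.

```python
from pathlib import Path

# File extensions treated as pictures (an assumption for this sketch).
PICTURE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".gif", ".bmp"}


def picture_bytes(folder: Path) -> int:
    """Total size in bytes of the picture files found under `folder`."""
    return sum(
        p.stat().st_size
        for p in folder.rglob("*")
        if p.is_file() and p.suffix.lower() in PICTURE_EXTENSIONS
    )


def compare_saves(folders):
    """Return (folder name, total picture bytes) pairs, smallest first."""
    sizes = [(Path(f).name, picture_bytes(Path(f))) for f in folders]
    return sorted(sizes, key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical folders, one per Quality value used when saving.
    for name, size in compare_saves(["save_a", "save_b", "save_c"]):
        print(f"{name}: {size} bytes")
```

The smallest result whose pictures still look acceptable in the image viewer is a reasonable choice.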

Format

As a rule, ABBYY FineReader selects the picture format automatically. To confirm this, make sure that the (Automatic) item is selected in the Format drop-down list.

If you wish to set up the format manually, select one of the following items:

Character encoding options

ABBYY FineReader detects the code page automatically. To change the code page, select the code page of your choice or the code page type in the Character encoding group.
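
The following Python sketch illustrates why the code page matters for a saved HTML file: the same text is stored as different bytes under different code pages, and reading the bytes with the wrong code page produces unreadable characters. The sample text and the code pages used (Windows-1251, Windows-1252, UTF-8) are chosen only for illustration and are not a statement about which code pages ABBYY FineReader offers.

```python
def decode_with(data: bytes, code_page: str) -> str:
    """Decode bytes using the given code page, replacing undecodable bytes."""
    return data.decode(code_page, errors="replace")


text = "Привет"  # sample Cyrillic text

cp1251_bytes = text.encode("cp1251")  # one byte per character
utf8_bytes = text.encode("utf-8")     # two bytes per Cyrillic character

# Decoding with the code page used for saving restores the text.
restored = decode_with(cp1251_bytes, "cp1251")

# Decoding with a different code page yields the wrong characters.
garbled = decode_with(cp1251_bytes, "cp1252")

if __name__ == "__main__":
    print(len(cp1251_bytes), len(utf8_bytes))
    print(restored == text, garbled == text)
```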