Saving in HTML

To save your text in HTML:

Tip. Additional saving options are available in the Options dialog box: select Tools>Options…, click the Save tab, and then click the HTML tab.
The saving options on this tab are grouped into the following categories:

  • Retain layout
  • Save mode
  • Picture settings
  • Character encoding
  • Book settings

Depending on how you plan to use your electronic document, select the appropriate Retain layout option:

  • Flexible layout
    Produces a document that maintains the formatting of the original. The output document can be easily edited.
  • Formatted text
    Retains fonts, font sizes, and paragraphs, but does not retain the exact locations of the objects on the page or the spacing. The resulting text will be left-aligned. (Right-to-left texts will be right-aligned.)

    Note: Vertical texts will be changed to horizontal in this mode.

  • Plain text
    This mode does not retain formatting.

Select this option to save the file in HTML 4 format, which preserves the document layout by means of a style sheet embedded in the HTML file.

  • Keep line breaks
    Retains the original arrangement of lines.
  • Retain text and background colors
    Retains the original colors of the text and its background.
  • Keep headers and footers
    Preserves headers and footers in the output document.

Documents that contain many pictures can produce very large files. To reduce the file size, select the desired option in the Image quality group.


  • To change the picture saving parameters, click Custom…. In the Custom Picture Settings dialog box, select the desired parameters and click OK.
  • If you don't want to keep pictures in the recognized text, make sure the Keep pictures option is cleared.

ABBYY FineReader detects the code page automatically. To change the code page or its type, select the desired code page or code page type from the drop-down lists under Character encoding.
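
To illustrate what the choice of code page means for the saved file, here is a minimal Python sketch. It is not part of ABBYY FineReader and does not reflect how the program works internally; the function name `encode_html` and the sample text are assumptions made for the example. It writes the same text under three code pages and declares the code page in the page header so that a browser decodes it correctly.

```python
from html import escape


def encode_html(body_text: str, encoding: str = "utf-8") -> bytes:
    """Wrap text in a minimal HTML page and encode it with the given code page.

    Hypothetical helper for illustration only.
    """
    page = (
        "<!DOCTYPE html>\n"
        f'<html><head><meta charset="{encoding}"><title>Sample</title></head>\n'
        f"<body><p>{escape(body_text)}</p></body></html>\n"
    )
    # Characters that the chosen code page cannot represent are written as
    # numeric character references instead of raising an error.
    return page.encode(encoding, errors="xmlcharrefreplace")


utf8_page = encode_html("Привет", "utf-8")
cyrillic_page = encode_html("Привет", "windows-1251")
latin_page = encode_html("Привет", "windows-1252")
```

Both `utf8_page` and `cyrillic_page` store the Cyrillic letters directly, each in its own byte representation. `latin_page` cannot, because the Western European code page has no Cyrillic letters, so each letter is stored as a numeric reference. This is why the code page should match the languages in the document.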

Adjust Book settings if you are converting a printed book into electronic form. ABBYY FineReader can automatically save book chapters into separate HTML files and recreate links to them in the table of contents. You can also split the document into separate HTML files based on level 1 or level 2 headings.
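
The idea of splitting a document at headings and linking the parts from a table of contents can be sketched in Python as follows. This only illustrates the concept and is not ABBYY FineReader's implementation; the function `split_into_chapters`, the `chapterN.html` and `index.html` file names, and the sample input are all assumptions.

```python
import re


def split_into_chapters(html_body: str, level: int = 1) -> dict[str, str]:
    """Split an HTML body at headings of the given level.

    Returns a mapping of file name to file content: one file per chapter
    plus an index.html table of contents that links to each chapter.
    Any content before the first matching heading is ignored in this sketch.
    """
    heading = re.compile(
        rf"<h{level}\b[^>]*>(.*?)</h{level}>", re.IGNORECASE | re.DOTALL
    )
    matches = list(heading.finditer(html_body))
    files: dict[str, str] = {}
    toc_items = []
    for number, match in enumerate(matches, start=1):
        # A chapter runs from its heading to the start of the next heading.
        end = matches[number].start() if number < len(matches) else len(html_body)
        name = f"chapter{number}.html"  # hypothetical naming scheme
        # Strip nested tags so the link text is the plain heading title.
        title = re.sub(r"<[^>]+>", "", match.group(1)).strip()
        chapter = html_body[match.start():end].strip()
        files[name] = f"<html><body>\n{chapter}\n</body></html>\n"
        toc_items.append(f'<li><a href="{name}">{title}</a></li>')
    files["index.html"] = (
        "<html><body>\n<ul>\n" + "\n".join(toc_items) + "\n</ul>\n</body></html>\n"
    )
    return files


sample = (
    "<h1>Intro</h1><p>One</p><h2>Detail</h2><p>Two</p>"
    "<h1>Methods</h1><p>Three</p>"
)
pages = split_into_chapters(sample, level=1)
```

With `level=1`, the sample yields two chapter files and an index; the level 2 heading stays inside the first chapter. Passing `level=2` would split at the level 2 headings instead.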