Saving the Recognized Text in RTF, DOC and Word XML
Formats
Important! The option of saving in Word XML is only
available for Microsoft Word 2003.
All saving options for RTF, DOC and Word
XML formats are set on the RTF/DOC/Word XML tab in the
Formats Settings dialog. To open this dialog, click the
Formats Settings button on the Save tab of the
Options dialog or press CTRL+SHIFT+X.
Note: When saving text in RTF, DOC and Word XML
formats, ABBYY FineReader uses the fonts set on the Save tab in the
Options dialog (Tools>Options menu) or those you
set during text editing in the Text window.
The following options enable you to customize
the saving mode so that the resulting document is most suitable for
later retrieval and processing:
Layout retention modes are set in the Retain
layout group. The following choices are available:
Original layout Select this option if you wish the
recognition results to look exactly like the original document. Note: This option will not allow a lot of editing in the
recognized text. It is most suitable for short artistic or
brochure-like documents.
Columns, tables, paragraphs, fonts This option will
retain the original layout in full, but in some cases there might
be a slight difference from the original layout. Select this option
if you are planning a lot of editing and re-formatting in the
recognized text.
Tables, paragraphs, fonts Select this option if you
need the content of the original document, but do not need to
retain the exact layout of the document.
You may select the default paper size that will be used for
saving in RTF, DOC or Word XML format. To do this, specify the
required paper size in the Default paper size drop-down
list.
Tips.
If you do not find a suitable paper size in the list, you can
add your own - custom - paper size. In order to do this, select the
Add custom paper size item from the list and in the dialog
that appears specify the name, height and width for the custom
paper size.
To ensure the recognition results fit the paper size, select
the Increase paper size if content does not fit option.
ABBYY FineReader will automatically select the most suitable paper
size when saving the recognized text and pictures.
Note that the default values of Text settings (an option
is set or not) depend on the page layout retention mentioned
above.
Keep line breaks This option saves the the original
arrangement into lines to be retained the RTF/DOC/Word XML
format.
Keep page breaks This option saves the original
document page arrangement to be retained in RTF/DOC/Word XML
format.
Retain text color This option saves the original
character color to be retained. Note: Word 6.0, 7.0 and 97 (8.0) have a limited text and
background color palette. The original document colors may be
replaced with the ones from the Word palette. Word 2000 (9.0) or
later, on the contrary, retains the source document colors in
full.
Remove optional hyphens This option removes the
optional hyphen sign (¬) from the recognized text. If the Keep
line breaks option is set, the optional hyphen signs will be
replaced with the hyphen signs (-).
Highlight uncertain characters Select this options
if you wish to edit the recognized text in Microsoft Word rather
than in the ABBYY FineReader Text window. If this option is
set all uncertain characters will be highlighted in Microsoft Word
window. Tip. You may change the color of uncertain characters in the
View tab of the
Options dialog (Tools>Options menu).
Enable compatibility with Microsoft Word 95 This
option allows the recognition results to be saved in Microsoft Word
95. Note: When saving in Microsoft Word 95, only the BMP image
format is available for saving pictures.
Enable ABBYY FineReader's Zoom window in Microsoft Word
2003 This option enables displaying ABBYY FineReader's
Zoom window in Microsoft Word 2003. When saving results in
Word XML, the recognized image can be viewed in the Zoom
window integrated into Microsoft Word. This window presents the
magnified image of the current line or portion of the
document.
If you wish to keep pictures in the recognized text, make sure
that the Keep pictures option is set in the Picture
settings group.
If the recognized document contains many pictures, you can
reduce the size of the resulting file: select the desired picture
quality and format in the Picture settings group.
Quality
Three quality levels are available in the Quality
drop-down list. Select:
High if you are planning to print the recognition
results.
Medium if the recognition results are intended for
viewing on the screen.
Low if you are planning to place the recognition results
on the Web.
The higher the value you choose from the Quality
drop-down list, the higher will be the quality of the pictures you
save. The size of the file is also affected by this value: the
higher the value, the larger the file you get.
Tip. In order to tune the best 'size/quality' ratio , try
to save the recognition results with different Quality
values, and then open them in an image viewing application.
Format
As a rule, ABBYY FineReader selects the picture format
automatically. To ensure that this is the case, make sure that the
(Automatic) item is selected from the Format
drop-down list.
If you wish to set up the format manually, select one of the
following items:
JPEG, Color (for photos),
This option is suitable for documents containing color scanned or
digital photos.
JPEG, Gray (for photos),
This option is suitable for scanned or digital photos saved in
gray-scale mode.
PNG, Color (for charts, diagrams),
This option allows you to save charts, diagrams or drawings while
retaining their colors.
PNG, Gray (for charts, diagrams),
This option is suitable for saving charts and diagrams in
gray-scale mode.
PNG, Black and white.
This option allows you to save pictures in black-and-white
mode.