RecAPI
PLUS2W and PLUS3W omnifont recognition modules
Module name: PLUS2W and PLUS3W
Module identifier: RM_OMNIFONT_PLUS2W and RM_OMNIFONT_PLUS3W
Filling methods supported: FM_OMNIFONT
Filters supported: FILTER_ALL, FILTER_DIGIT and FILTER_ALPHA
Trade-off supported: TO_FAST, TO_BALANCED, TO_ACCURATE
Knowledge base file: RECOGN.BCT, RECOGN24.BCT
Training file supported: yes (supported on: Windows, Linux)

PLUS2W and PLUS3W engines are voting engines combining the results of other OMNIFONT OCR engines of the CSDK. In different trade-off modes they use different engine combination. These modules are supplied in both the Professional Recognition Kit and the OCR Kit. Their inclusion in your application must be covered by your distribution licensing. See the topic on Licensing in the General Information help system.

PLUS3W module is supported on: Windows, Linux. The FAST mode of PLUS2W and PLUS3W modules is supported on: Windows.

IMPORTANT NOTES

The default settings of OmniPage 20 (Nuance's desktop application) and OmniPage Capture SDK 20 are not the same. In default, RecAPI of the CSDK does not run in the most accurate mode, but in a less accurate and faster mode, which is a good compromise between the speed and the accuracy. But it can be easily switched into the most accurate mode modifying the value of the setting Kernel.OcrMgr.PreferAccurateEngine to true. This most accurate mode of the CSDK is equivalent to the default of the desktop application. See also kRecSetDefaultRecognitionModule and its notes.

Application areas

This recognition module recognizes machine printed text; i.e. from printed publications, laser or ink-jet printers and electric typewriters. Output from mechanical typewriters in good condition may also be acceptable.

Range of characters

This module supports the same set of characters as the RM_OMNIFONT_MOR module.

Accuracy issues

The PLUS2W and PLUS3W modules use voting technology to provide improved recognition results. The PLUS2W and PLUS3W modules use the results from one or more of FRX, MOR and MTX modules according to the trade-off. With either of these two voting modules, the accuracy is considerably better, but the recognition may need significantly more time than any single module.

Please consult the topic Performance comparison for information on the balance between speed and accuracy for the most common engine combinations and trade-off settings.

Suspicious marking

With these modules, the suspicious character and word marking feature is different from that used in MOR, MTX or FRX. These modules do not mark characters as suspicious if all the voting modules provided the same recognition result, even if they were suspiciously recognized in any of them. Consequently, there are likely to be fewer words marked as non-dictionary.

Character attributes

The omnifont recognition module can detect and transmit character attributes: bold, italic or underlined text (or any combination of them). It can also detect and transmit character size, and can classify font types into three broad categories: serif, sans serif and monospaced.