RecAPI
MTX omnifont recognition module
Module name: MTX
Module identifier: RM_OMNIFONT_MTX
Filling methods supported: FM_OMNIFONT, FM_DRAFTDOT9, FM_DRAFTDOT24, FM_OCRA, FM_OCRB
Filters supported: FILTER_ALL, FILTER_DIGIT and FILTER_ALPHA
Trade-off supported: TO_FAST, TO_ACCURATE (TO_BALANCED is equal to this)
Knowledge base file: N / A
Training file supported: yes

This recognition module is supported on: Windows.

The PLUS2W and PLUS3W recognition modules also require the presence of this module. This module is supplied in both the Professional Recognition Kit and the OCR Kit. Its inclusion in your application must be covered by a distribution license. See the topic on Licensing in the General Information help system.

Recognition module language binaries are xi*.bin, as follows:

  • For recognizing an English document, the filenames include the identifier ENG:
    • xiengb.bin
    • xiengc.bin
    • xiengd.bin
    • xienge.bin
    • xiengf.bin (used unaltered for all languages)
    • xiengl.bin (used unaltered for all languages)
    • xiengp.bin
    • xiengs.bin
    • xiengz.bin

The files xiengf.bin and xiengl.bin are required, unaltered, for all languages. All other languages have language-specific equivalents of the remaining seven files. The identifier eng is changed as follows:

  • French: frn
  • Italian: itl
  • German: grm
  • Dutch: dut
  • Spanish: spn
  • Portuguese: prt
  • Swedish: swd
  • Norwegian: nrw
  • Danish: dan
  • Finnish: fin
  • Portuguese (Brazilian): brz

Application areas

This recognition module recognizes machine printed text; i.e. from printed publications, laser or ink-jet printers and electric typewriters. Output from mechanical typewriters in good condition may also be acceptable. It should also be used for Letter or Near Letter Quality output from dot-matrix printers, and can also be used for Draft Quality.

Range of characters

This module supports the characters of the following languages

Language Language identifier
English LANG_ENG
French LANG_FRE
Spanish LANG_SPA
Italian LANG_ITA
German LANG_GER
Norwegian LANG_NOR
Portuguese LANG_POR
Danish LANG_DAN
Dutch LANG_DUT
Finnish LANG_FIN
Swedish LANG_SWE
Brazilian LANG_BRA

Any of these languages can be combined.

Accuracy issues

This module is influenced by the page-level trade-off setting, but reduces the three settings to two: TO_FAST is respected, while TO_BALANCED and TO_ACCURATE are merged to one value.

Character attributes

The omnifont recognition module can detect and transmit character attributes: bold, italic or underlined text (or any combination of them). It can also detect and transmit character size, and can classify font types into three broad categories: serif, sans serif and monospaced.