NHocr is a command line OCR (Optical Character Recognition) program for Japanese language, etc. It has been designed to recognize machine-printed Japanese characters and some ASCII characters/symbols in an image. NHocr is probably the first Open Source Japanese OCR software (offline, machine-printed), except some experimental, partial codes open to academic communities.
NHocr ver 0.22 source code distribution
* Some code fixes have been made.
* A part of the image manipulation library O2-tools has been included in the source tree.
* Vertical writing has been supported. However, the accompanying dictionaries in this release lack vertical fonts, especially in symbols. Many characters are compatible and could be recognized well also in vertical documents.
The dictionaries with vertical fonts will be available in future releases.