Project Description

mpaligner is a many-to-many string alignment tool. It is based on a generative model that has been modified to find a minimum mapping between two strings, such as a notation (spelling) and its pronunciation. It also provides two supporting functions. The partial annotation function lets you supply the correct alignment for part of the data by hand, which enables semi-supervised training. The special data detection function flags data that is difficult to align (for example, tri'plei). The aligned data can then be used as training data. For example, when the two aligned strings are a notation and a pronunciation, the output can be used to build a model for grapheme-to-phoneme conversion (g2p conversion). mpaligner is licensed under the GNU GPL.
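
To make the term "many-to-many alignment" concrete, here is a minimal Python sketch that lists every way two strings can be split into the same number of paired, non-empty chunks. It is only an illustration of the search space and is not mpaligner's generative model or its training procedure. The function name enumerate_alignments and the parameters max_x and max_y (the maximum chunk length on each side) are hypothetical names introduced for this example.

```python
from typing import Iterator, List, Tuple

# One alignment is a list of (chunk of x, chunk of y) pairs.
Alignment = List[Tuple[str, str]]


def enumerate_alignments(
    x: str, y: str, max_x: int = 2, max_y: int = 2
) -> Iterator[Alignment]:
    """Yield every pairing of non-empty chunks that covers both x and y.

    Each chunk of x has at most max_x characters and each chunk of y
    has at most max_y characters. Deletions (empty chunks) are not allowed.
    """
    if not x and not y:
        # Both strings are fully consumed: one valid (empty) continuation.
        yield []
        return
    # If only one string is empty, both ranges below cannot be non-empty
    # at the same time, so nothing is yielded for that dead end.
    for i in range(1, min(max_x, len(x)) + 1):
        for j in range(1, min(max_y, len(y)) + 1):
            for rest in enumerate_alignments(x[i:], y[j:], max_x, max_y):
                yield [(x[:i], y[:j])] + rest


if __name__ == "__main__":
    # Toy notation/pronunciation pair; one character stands for one phoneme.
    for alignment in enumerate_alignments("phone", "fon", max_x=2, max_y=1):
        print(" ".join(f"{a}:{b}" for a, b in alignment))
```

A real aligner scores these candidates with a probabilistic model and keeps the best one, rather than listing all of them; the number of candidates grows quickly with string length, so this sketch is suitable only for short inputs.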

If you want to train a model on aligned data produced by mpaligner and then use that model to estimate pronunciations, please use slearp ( http://sourceforge.jp/projects/slearp/ ), which implements the training methods for the model and a prediction function for estimating pronunciations.

mpaligner is developed by:

NAIST (Nara Institute of Science and Technology)
Graduate School of Information Science
Augmented Human Communication Laboratory
The Doctoral Program
Keigo Kubo

System Requirements

Linux and Cygwin

Install

Install mpaligner as follows:

    $ tar xvfz mpaligner_<version>.tar.gz
    $ cd mpaligner_<version>
    $ make
    $ cp mpaligner <a directory on your PATH>

Usage

Usage example:

    $ cat source/test.utf8.txt | ./script/separate_for_char.pl utf8 \
        source/joint_chars.utf8.txt > source/test.utf8.char_unit
    $ mpaligner -i source/test.utf8.char_unit

With this command, ...
