Develop and Download Open Source Software

Uplug

We call for your help with the Japanese translations for project summaries. The submitted translations will be posted after simple evaluation. (Not only the translation itself is going to be checked, but we also make sure it's clear of wiki format problems. Especially problems that are related to the layout, such as line breaks and links, will be checked, so please give heed. When there appears to be a problem with the translation itself or the layout, we may not be able to post your translation. We often see problematic cases related to links. When there's going to be a line of Japanese text right after the URL, please make sure to leave a space to separate.) Also, even with projects that may already have their summaries translated, if you find an outdated translation or typographical errors, please update. Login to start translating. When you are done, you will find your name at the bottom of the summary as the translator.

You are not Logged in. If you submit this translation, your name will not show on the this project page (will be logged as "translated by an anonymous user").

Project Description:

English:
(Last Update 2009-06-12 15:46)

Uplug is a collection of tools for linguistic
corpus processing, word alignment, and term
extraction from parallel corpora. Several tools
have been integrated in Uplug. Pre-processing
tools include a sentence splitter, tokenizer, and
external part-of-speech tagger and shallow
parsers. The following external tools are used:
the Grok system for English (tagging and chunking)
and the morphological analyzer ChaSen for
Japanese. Other tools such as the TreeTagger can
easily be added. Translated documents can be
sentence aligned using the length-based approach
by Gale & Church. Words and phrases can be aligned
using the clue alignment approach and the toolbox
for training statistical alignment models GIZA++.

No Japanese Translated Data
Japanese: