Yasuhiro Horimoto 2019-01-04 14:19:02 +0900 (Fri, 04 Jan 2019) Revision: 2e539e3b9350472bef71f909735309e5a5064e39 https://github.com/groonga/groonga/commit/2e539e3b9350472bef71f909735309e5a5064e39 Message: doc: Separate from tokenizers page Added files: doc/source/reference/tokenizers/token_trigram.rst Modified files: doc/locale/ja/LC_MESSAGES/reference.po doc/source/reference/tokenizers.rst Modified: doc/locale/ja/LC_MESSAGES/reference.po (+5 -9) =================================================================== --- doc/locale/ja/LC_MESSAGES/reference.po 2019-01-04 14:12:16 +0900 (ab7923212) +++ doc/locale/ja/LC_MESSAGES/reference.po 2019-01-04 14:19:02 +0900 (b54322a95) @@ -27776,26 +27776,22 @@ msgstr "" msgid "Outputs reading of token." msgstr "トークンの読みがなを出力します。" -#, fuzzy msgid "" "``TokenTrigram`` is similar to :ref:`token-bigram`. The differences between " "them is token unit." msgstr "" -"``TokenBigramSplitSymbol`` は :ref:`token-bigram` と似ています。違いは記号の" -"扱いです。" +"``TokenTrigram`` は :ref:`token-bigram` と似ています。違いはトークンの単位で" +"す。" -#, fuzzy msgid "``TokenTrigram`` hasn't parameter::" -msgstr "``TokenBigram`` には、引数がありません。" +msgstr "``TokenTrigram`` には、引数がありません。" -#, fuzzy msgid "" ":ref:`token-bigram` uses 2 characters per token. ``TokenTrigram`` uses 3 " "characters per token as below example." msgstr "" -"``TokenTrigram`` は :ref:`token-bigram` に似ています。違いはトークンの単位で" -"す。 :ref:`token-bigram` は各トークンが2文字ですが、 ``TokenTrigram`` は各" -"トークンが3文字です。" +":ref:`token-bigram` は各トークンが2文字ですが、以下の例のように " +"``TokenTrigram`` は各トークンが3文字です。" msgid "``TokenUnigram``" msgstr "" Modified: doc/source/reference/tokenizers.rst (+0 -16) =================================================================== --- doc/source/reference/tokenizers.rst 2019-01-04 14:12:16 +0900 (5d3dda525) +++ doc/source/reference/tokenizers.rst 2019-01-04 14:19:02 +0900 (3abf02d04) @@ -107,10 +107,7 @@ Built-in tokenizsers Here is a list of built-in tokenizers: - * ``TokenTrigram`` - * ``TokenDelimit`` * ``TokenDelimitNull`` - * ``TokenMecab`` * ``TokenRegexp`` .. toctree:: @@ -119,19 +116,6 @@ Here is a list of built-in tokenizers: tokenizers/* -.. _token-trigram: - -``TokenTrigram`` -^^^^^^^^^^^^^^^^ - -``TokenTrigram`` is similar to :ref:`token-bigram`. The differences -between them is token unit. :ref:`token-bigram` uses 2 characters per -token. ``TokenTrigram`` uses 3 characters per token. - -.. groonga-command -.. include:: ../example/reference/tokenizers/token-trigram.log -.. tokenize TokenTrigram "10000cents!!!!!" NormalizerAuto - .. _token-delimit-null: ``TokenDelimitNull`` Added: doc/source/reference/tokenizers/token_trigram.rst (+34 -0) 100644 =================================================================== --- /dev/null +++ doc/source/reference/tokenizers/token_trigram.rst 2019-01-04 14:19:02 +0900 (18a4545d0) @@ -0,0 +1,34 @@ +.. -*- rst -*- + +.. highlightlang:: none + +.. groonga-command +.. database: tokenizers + +.. _token-trigram: + +``TokenTrigram`` +================ + +Summary +------- + +``TokenTrigram`` is similar to :ref:`token-bigram`. The differences +between them is token unit. + +Syntax +------ + +``TokenTrigram`` hasn't parameter:: + + TokenTrigram + +Usage +----- + +:ref:`token-bigram` uses 2 characters per +token. ``TokenTrigram`` uses 3 characters per token as below example. + +.. groonga-command +.. include:: ../../example/reference/tokenizers/token-trigram.log +.. tokenize TokenTrigram "10000cents!!!!!" NormalizerAuto -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://lists.osdn.me/mailman/archives/groonga-commit/attachments/20190104/be841b87/attachment-0001.html>