[Groonga-commit] groonga/groonga [master] kytea: use normalized string


Kouhei Sutou null+****@clear*****
Tue Nov 13 14:03:59 JST 2012


Kouhei Sutou	2012-11-13 14:03:59 +0900 (Tue, 13 Nov 2012)

  New Revision: 958d5107cda39d2e9388e4eac098dd66cbc5cd0d
  https://github.com/groonga/groonga/commit/958d5107cda39d2e9388e4eac098dd66cbc5cd0d

  Log:
    kytea: use normalized string

  Modified files:
    plugins/tokenizers/kytea.cpp

  Modified: plugins/tokenizers/kytea.cpp (+10 -1)
===================================================================
--- plugins/tokenizers/kytea.cpp    2012-11-13 13:05:22 +0900 (d90d77a)
+++ plugins/tokenizers/kytea.cpp    2012-11-13 14:03:59 +0900 (9c05db1)
@@ -187,9 +187,18 @@ grn_obj *grn_kytea_init(grn_ctx *ctx, int num_args, grn_obj **args,
 
   tokenizer->query = query;
 
+  grn_obj *normalized_query = query->normalized_query;
+  const char *normalized_string;
+  unsigned int normalized_string_length;
+  grn_string_get_normalized(ctx,
+                            normalized_query,
+                            &normalized_string,
+                            &normalized_string_length,
+                            NULL);
+
   grn_plugin_mutex_lock(ctx, kytea_mutex);
   try {
-    const std::string str(query->ptr, query->length);
+    const std::string str(normalized_string, normalized_string_length);
     const kytea::KyteaString &surface_str = kytea_util->mapString(str);
     const kytea::KyteaString &normalized_str = kytea_util->normalize(surface_str);
     tokenizer->sentence = kytea::KyteaSentence(surface_str, normalized_str);
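
The change above swaps the raw query bytes for the normalized string, which is read through out-parameters as a pointer plus a length and then copied into a std::string. The following is a minimal sketch of that pointer-plus-length pattern only. FakeNormalized, fake_get_normalized, and copy_normalized are hypothetical stand-ins invented for illustration; they are not Groonga's grn_obj or grn_string_get_normalized, and the sketch assumes nothing about whether the real buffer is NUL-terminated.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a normalized-string object (NOT Groonga's grn_obj).
struct FakeNormalized {
  const char *data;              // may or may not be NUL-terminated
  unsigned int length_in_bytes;  // number of valid bytes in data
};

// Hypothetical analogue of an out-parameter getter: each output pointer
// is filled only when it is non-null, so callers can skip unneeded values.
void fake_get_normalized(const FakeNormalized &source,
                         const char **out_string,
                         unsigned int *out_length) {
  if (out_string) {
    *out_string = source.data;
  }
  if (out_length) {
    *out_length = source.length_in_bytes;
  }
}

// Copy exactly length_in_bytes bytes into an owning std::string.
// The (pointer, count) constructor does not look for a terminating NUL.
std::string copy_normalized(const FakeNormalized &source) {
  const char *normalized_string = "";
  unsigned int normalized_string_length = 0;
  fake_get_normalized(source, &normalized_string, &normalized_string_length);
  return std::string(normalized_string, normalized_string_length);
}

// Usage: only the first three bytes of the buffer are treated as valid.
void usage_example() {
  const char buffer[] = {'a', 'b', 'c', 'X', 'Y'};
  const FakeNormalized source = {buffer, 3};
  const std::string str = copy_normalized(source);
  assert(str.size() == 3);
  assert(str == "abc");
}
```

Passing the length explicitly means the copy stops at the reported byte count instead of depending on a terminator inside the buffer.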