kazusa@host01[39]% cat ~/.bsfilter/bsfilter.conf
jtokenizer MeCab
db gdbm
kazusa@host01[40]% bsfilter tmp/test-message.jpn.txt
/usr/local/bin/bsfilter:1105:in `=~': incompatible encoding regexp match (EUC-JP regexp with ASCII-8BIT string) (Encoding::CompatibilityError)
from /usr/local/bin/bsfilter:1105:in `mecab'
from /usr/local/bin/bsfilter:1043:in `block in initialize'
from /usr/local/bin/bsfilter:1060:in `call'
from /usr/local/bin/bsfilter:1060:in `split'
from /usr/local/bin/bsfilter:1384:in `block in tokenize_str'
from /usr/local/bin/bsfilter:1383:in `each'
from /usr/local/bin/bsfilter:1383:in `tokenize_str'
from /usr/local/bin/bsfilter:1547:in `tokenize_body'
from /usr/local/bin/bsfilter:1260:in `tokenize_buf'
from /usr/local/bin/bsfilter:3404:in `block (2 levels) in run'
from /usr/local/bin/bsfilter:330:in `open_ro'
from /usr/local/bin/bsfilter:3400:in `block in run'
from /usr/local/bin/bsfilter:3399:in `each'
from /usr/local/bin/bsfilter:3399:in `run'
from /usr/local/bin/bsfilter:3445:in `<main>'
kazusa@host01[41]%
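For readers unfamiliar with this class of error, here is a minimal sketch of how Ruby 1.9 and later can raise the same Encoding::CompatibilityError. It is not taken from bsfilter; the method names (raw_euc_bytes, euc_regexp, try_match) are made up for illustration, and the assumption is that the failing line matches a regexp with a fixed EUC-JP encoding against message bytes still tagged ASCII-8BIT.

```ruby
# Bytes A4 A2 are the EUC-JP encoding of the hiragana "あ".
# Array#pack returns a string tagged ASCII-8BIT (binary), much like raw
# message bytes that have not been given an encoding yet.
def raw_euc_bytes
  [0xA4, 0xA2].pack("C*")
end

# A regexp built from a non-ASCII EUC-JP string has a fixed EUC-JP encoding.
def euc_regexp
  Regexp.new(raw_euc_bytes.force_encoding(Encoding::EUC_JP))
end

# Returns the match position, or the exception if the encodings clash.
def try_match(str, regexp)
  str =~ regexp
rescue Encoding::CompatibilityError => e
  e
end

# Binary string with non-ASCII bytes against an EUC-JP regexp: rejected.
binary_result = try_match(raw_euc_bytes, euc_regexp)

# Same bytes retagged as EUC-JP (no transcoding): the match can run.
tagged_result = try_match(raw_euc_bytes.force_encoding(Encoding::EUC_JP), euc_regexp)

puts binary_result.class
puts tagged_result.inspect
```

In this sketch only the encoding tag differs between the two calls; the bytes are identical. Whether retagging is the right fix inside bsfilter depends on what encoding the input really is at that point, which the backtrace alone does not show.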
kazusa@host01[14]% cat ~/.bsfilter/bsfilter.conf
jtokenizer MeCab
db gdbm
kazusa@host01[15]% bsfilter spam.jpn.01.txt
/usr/local/bin/bsfilter:1102:in `split': invalid byte sequence in UTF-8 (ArgumentError)
from /usr/local/bin/bsfilter:1102:in `mecab'
from /usr/local/bin/bsfilter:1043:in `block in initialize'
from /usr/local/bin/bsfilter:1060:in `call'
from /usr/local/bin/bsfilter:1060:in `split'
from /usr/local/bin/bsfilter:1223:in `block in tokenize_headers'
from /usr/local/bin/bsfilter:1197:in `each'
from /usr/local/bin/bsfilter:1197:in `tokenize_headers'
from /usr/local/bin/bsfilter:1251:in `tokenize_buf'
from /usr/local/bin/bsfilter:3408:in `block (2 levels) in run'
from /usr/local/bin/bsfilter:330:in `open_ro'
from /usr/local/bin/bsfilter:3404:in `block in run'
from /usr/local/bin/bsfilter:3403:in `each'
from /usr/local/bin/bsfilter:3403:in `run'
from /usr/local/bin/bsfilter:3449:in `<main>'
kazusa@host01[16]%
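The "invalid byte sequence in UTF-8" ArgumentError can be reproduced in isolation as well. The sketch below is an assumption about the cause, not bsfilter code: it supposes the string is tagged UTF-8 while actually holding EUC-JP bytes. The method names are hypothetical. It uses a regexp match rather than split, since a regexp match on a broken string is the case that reliably raises.

```ruby
# EUC-JP bytes for "あい" (A4 A2 A4 A4), deliberately mislabeled as UTF-8.
def mislabeled_text
  [0xA4, 0xA2, 0xA4, 0xA4].pack("C*").force_encoding(Encoding::UTF_8)
end

# Returns the match result, or the ArgumentError raised for broken input.
def try_whitespace_match(str)
  str =~ /\s/
rescue ArgumentError => e
  e
end

# Relabel the bytes with the encoding they were actually written in, then
# transcode to UTF-8. force_encoding changes only the tag; encode converts.
def relabel_and_convert(str, actual_encoding)
  str.dup.force_encoding(actual_encoding).encode(Encoding::UTF_8)
end

broken = mislabeled_text
puts broken.valid_encoding?              # false: not well-formed UTF-8
puts try_whitespace_match(broken).class  # the regexp match is refused

fixed = relabel_and_convert(broken, Encoding::EUC_JP)
puts fixed.valid_encoding?
```

Checking valid_encoding? before handing a string to a regexp is one cheap way to find where the wrong tag gets attached.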
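RE: About Ruby 1.9 support (2013-07-04 10:07 by kazusaoda #69383)
I am not sure when this started, but when 1.0.18.ruby1.9.rc5 is fed a message that meets both of these conditions:
(1) its character encoding is something other than ASCII
(2) its content is classified as spam
it now fails with an error like the following.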
kazusa@host01[14]% cat ~/.bsfilter/bsfilter.conf
jtokenizer MeCab
db gdbm
kazusa@host01[15]% bsfilter spam.jpn.01.txt
/usr/local/bin/bsfilter:937:in `=~': invalid byte sequence in UTF-8 (ArgumentError)
from /usr/local/bin/bsfilter:937:in `block in get_lang'
from /usr/local/bin/bsfilter:929:in `each'
from /usr/local/bin/bsfilter:929:in `get_lang'
from /usr/local/bin/bsfilter:908:in `get_lang_from_buf'
from /usr/local/bin/bsfilter:1464:in `tokenize_body'
from /usr/local/bin/bsfilter:1265:in `tokenize_buf'
from /usr/local/bin/bsfilter:3410:in `block (2 levels) in run'
from /usr/local/bin/bsfilter:330:in `open_ro'
from /usr/local/bin/bsfilter:3406:in `block in run'
from /usr/local/bin/bsfilter:3405:in `each'
from /usr/local/bin/bsfilter:3405:in `run'
from /usr/local/bin/bsfilter:3451:in `<main>'
kazusa@host01[16]%
/usr/local/bin/bsfilter:1403:in `scan': invalid byte sequence in EUC-JP (ArgumentError)
from /usr/local/bin/bsfilter:1403:in `tokenize_str'
from /usr/local/bin/bsfilter:1591:in `tokenize_body'
from /usr/local/bin/bsfilter:1303:in `tokenize_buf'
from /usr/local/bin/bsfilter:3464:in `block (2 levels) in run'
from /usr/local/bin/bsfilter:330:in `open_ro'
from /usr/local/bin/bsfilter:3460:in `block in run'
from /usr/local/bin/bsfilter:3459:in `each'
from /usr/local/bin/bsfilter:3459:in `run'
from /usr/local/bin/bsfilter:3505:in `<main>'
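The EUC-JP variant of the error suggests the opposite mismatch: bytes tagged EUC-JP that are not valid EUC-JP. As a rough sketch of one defensive approach (an assumption, not how bsfilter decides encodings), a string can be tagged with the first candidate encoding under which its bytes are well-formed. The helper tag_with_valid_encoding and the CANDIDATES list are hypothetical, and validity is only a heuristic: a byte sequence can be well-formed in more than one encoding.

```ruby
# Candidate encodings, tried in order. The order here is an arbitrary choice.
CANDIDATES = [Encoding::UTF_8, Encoding::EUC_JP, Encoding::Shift_JIS].freeze

# Hypothetical helper: return a copy of bytes tagged with the first candidate
# encoding under which the byte sequence is valid, or as binary otherwise.
def tag_with_valid_encoding(bytes, candidates = CANDIDATES)
  candidates.each do |enc|
    tagged = bytes.dup.force_encoding(enc)
    return tagged if tagged.valid_encoding?
  end
  bytes.dup.force_encoding(Encoding::ASCII_8BIT)
end

# Returns the scanned characters, or the ArgumentError for broken input.
def try_scan_chars(str)
  str.scan(/./m)
rescue ArgumentError => e
  e
end

# UTF-8 bytes for "あ" (E3 81 82), wrongly tagged as EUC-JP: scan refuses them.
utf8_bytes = "\u3042".b
wrongly_tagged = utf8_bytes.dup.force_encoding(Encoding::EUC_JP)
puts try_scan_chars(wrongly_tagged).class

# After picking a tag that fits the bytes, scan works.
retagged = tag_with_valid_encoding(utf8_bytes)
puts retagged.encoding
puts try_scan_chars(retagged).length
```

A charset taken from the message headers, when one is present, would normally be a better source of truth than this kind of guessing.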