Ticket #36402

Open Date: 2016-06-19 22:54

Last Update: 2016-08-24 11:09

絵文字の辞書整備

Reporter:nishimotoOwner:(None)
Priority:5 - MediumMileStone:2016.3jp (closed)
Type:PatchesSeverity:5 - Medium
Component:(None)Status:Closed
ResolutionNone

Details

#30841 サロゲートペア対応で作成した絵文字の辞書を試してみると、 Windows 10 + ATOK 「えもじ」で変換して出てくる文字をあまりカバーできていない。 文字コード 26xx や 27xx あたりに入っている絵文字の定義が抜けていると思われる。

Attachment File

Attachment File ListNo attachments
Add New attachment
Add attachment filesPlease login to add new attachment

Ticket History - 3/4 Histories [Show all old Histories]

2016-06-19 22:54 Updated by: nishimoto

  • New Ticket "絵文字の辞書整備" created

2016-06-20 17:14 Updated by: nishimoto

2016-06-20 17:44 Updated by: nishimoto

Comment

mecab-ipadic-neologd から Unicode 2xxx の絵文字っぽいものを探すスクリプト:

$ xzcat seed/mecab-user-dict-seed.20160526.csv.xz | tse -F "," -s ".*" "if len(L1) == 1 and 0x2000 <= ord(L1) <= 0x2fff: print(L1 + ',' + repr(L1) + ',' + L11 + ',' + L12)" |uniq > emoji2.txt

https://github.com/nvdajp/nvdajp/issues/7

2016-08-24 11:09 Updated by: nishimoto

  • Ticket Close date is changed to 2016-08-24 11:09
  • Status Update from Open to Closed

Add Comment/Update #36402 (絵文字の辞書整備)

You are not logged in. I you are not logged in, your comment will be treated as an anonymous post. » Login