MeCab user dictionary: JST Thesaurus Headwords and Synonyms
| Data description | |||||||||||||||||||||||||||||||||||||||||||||
|
|
MeCab user dictionary: JST Thesaurus Headwords and Synonyms | ||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
10.18908/lsdba.nbdc02358-001.V002 |
||||||||||||||||||||||||||||||||||||||||||||
|
|
We have made a user dictionary of morphological analysis engine MeCab (<a href="http://taku910.github.io/mecab/" target="_blank">http://taku910.github.io/mecab/</a>) headwords and synonyms of JST Thesaurus (2015 edition) . As no reading was given to synonyms (Headword Flag: 'V') in the original thesaurus, NBDC had given natural reading to synonyms in life science (Category code: 'LSxx', where xx is a two-digit number) and computer science (Category code: 'EG01') and reading of base form to synonyms in other categories. The dictionary items are based on IPA dictionary. Csv file is encoded in Shift-JIS and dic file is encoded in UTF-8. Entries with zenkaku alphabets, numerals and symbols converted into corresponding hankaku characters are also included. Please note that this dictionary can not be used as a thesaurus because information on relations between words is not included in the dictionary. | ||||||||||||||||||||||||||||||||||||||||||||
|
|
File name :
Thesaurus2015.dic.zip (MeCab dic format)
File URL :
File size :
7.4 MB
|
||||||||||||||||||||||||||||||||||||||||||||
|
|
http://togodb.biosciencedbc.jp/togodb/view/mecab_thesaurus#en | ||||||||||||||||||||||||||||||||||||||||||||
|
|
IPA dictionary (mecab-ipadic-2.7.0-20070801 downloaded from MeCab's site [see above]), JST Science and Technology Thesaurus (2015 edition) |
||||||||||||||||||||||||||||||||||||||||||||
|
|
- |
||||||||||||||||||||||||||||||||||||||||||||
|
|
127,214 entries |
||||||||||||||||||||||||||||||||||||||||||||
| Data detail | |||||||||||||||||||||||||||||||||||||||||||||
|
|||||||||||||||||||||||||||||||||||||||||||||