• 検索結果がありません。

近世後期口語資料の形態素解析―ルビ情報を利用した精度向上の試み―

N/A
N/A
Protected

Academic year: 2021

シェア "近世後期口語資料の形態素解析―ルビ情報を利用した精度向上の試み―"

Copied!
6
0
0

読み込み中.... (全文を見る)

全文

(1)「人文科学とコンピュータシンポジウム」 2016 年 12 月. ㏆ୡᚋᮇཱྀㄒ㈨ᩱࡢᙧែ⣲ゎᯒ ̿ࣝࣅ᝟ሗࢆ฼⏝ࡋࡓ⢭ᗘྥୖࡢヨࡳ̿ ᮧᒣ ᐇ࿴Ꮚ㸦஑ᕞ኱Ꮫ኱Ꮫ㝔⏕㸪ᅜ❧ᅜㄒ◊✲ᡤ㸧 㖹㇂ ┿ே㸦᪩✄⏣኱Ꮫ኱Ꮫ㝔⏕㸪ᅜ❧ᅜㄒ◊✲ᡤ㸧 ⸨ᮏ ⅉ㸦ᅜ❧ᅜㄒ◊✲ᡤ㸧 ᒸ ↷᫭㸦ᅜ❧ᅜㄒ◊✲ᡤ㸧 ㏆ୡᚋᮇࡢཱྀㄒ㈨ᩱ࡛ࡣࠕᚰ㦫㸦ࡧࡘࡃࡾ㸧ࠖࠕㄏᘬ㸦ࡉࡑࡣ|ࢀ㸧ࠖࡢࡼ࠺࡞≉Ṧ࠿ࡘ౑⏝㢖ᗘࡢ 㝈ࡽࢀࡓ₎Ꮠ⾲グࡀከࡃ㸪ᙧែㄽ᝟ሗ௜ࡁࢥ࣮ࣃࢫࢆᵓ⠏ࡍࡿ㝿ࡢ⮬ືᙧែ⣲ゎᯒࡢ⢭ᗘྥୖࡢጉࡆ ࡜࡞ࡗ࡚࠸ࡿ㸬ᚑ᮶ࡢᡭἲ࡛ࡣ᣺ࡾ௬ྡࡣゎᯒ࡟ࡣ୍ษ౑⏝ࡉࢀ࡚ࡇ࡞࠿ࡗࡓ㸬ࡑࡢࡓࡵ≉Ṧ࡞₎Ꮠ ⾲グࢆゎᯒࡋ࡚ࡶ㸪⾲グ࡜᣺ࡾ௬ྡ࡜ࡢ஋㞳࡟ࡼࡗ࡚㸪᣺ࡾ௬ྡ࡛♧ࡉࢀࡿㄒ࡜ࡋ࡚ゎᯒ⤖ᯝࡀᚓࡽ ࢀ࡞࠸ၥ㢟ࡀ࠶ࡗࡓ㸬ࡇࡢၥ㢟࡟ᑐࡋᮏⓎ⾲࡛ࡣ㸪୺࡟௨ୗࡢ㸰ࡘࡢ᪉ἲࢆ⏝࠸ࡓᑐฎ࡟ࡘ࠸࡚㏙࡭ ࡿ㸬ࡲࡎ⮬ືᙧែ⣲ゎᯒࡢ๓ฎ⌮࡜ࡋ࡚㸪≉Ṧ࡞₎Ꮠ⾲グࢆᖹ௬ྡ࡛⾲グࡉࢀࡓ᣺ࡾ௬ྡ࡟⨨᥮ࡍࡿ㸬 ࡑࡋ࡚⮬ືᙧែ⣲ゎᯒ⏝㎡᭩࡟௬ྡᙧࡢࣇ࢕࣮ࣝࢻࢆᇶ࡟ࡋࡓᖹ௬ྡࡢ᭩Ꮠᙧࢆ㏣ຍࡍࡿ㸬ࡇࡢᡭἲ ࢆ⏝࠸ࡓ⤖ᯝ㸪㎡᭩ᮍⓏ㘓ࡢ≉Ṧ࡞₎Ꮠ⾲グࡔࡅ࡛࡞ࡃ㸪ࠕㄏᘬ㸦ࡉࡑࡣ|ࢀ㸧ࠖࡢࡼ࠺࡞」ᩘㄒ࡟ࡲ ࡓࡀࡿ₎Ꮠ⾲グࡶṇࡋࡃゎᯒ࡛ࡁࡿࡼ࠺࡟࡞ࡗࡓ㸬. Morphological Analysis of Early Modern Japanese: An Approach to Improve the Analysis Precision Using the Ruby Information Miwako MURAYAMA (Kyushu University, National Institute for Japanese Language and Linguistics) Masato ZENIYA (Waseda University, National Institute for Japanese Language and Linguistics) Akari FUJIMOTO (National Institute for Japanese Language and Linguistics) Teruaki OKA (National Institute for Japanese Language and Linguistics) We can see many special and rare usages of Kanji characters in the historical documents written in spoken language of Edo era, such as "ᚰ㦫 (bikkuri) ", "ㄏᘬ㸦sasowa-re㸧". Although we use an automatic morphological analyzer for creating a word-segmented and pos-tagged corpus, such notations prevent the analyzer from improving its performance. Ordinal automatic analysis does not use information of ruby. Therefore it does not treat the special notations; it does not understand the meaning of which ruby has from the Kanji notation. In this presentation, for dealing with this problem, we replace the special Kanji notations with its ruby characters and analyze by using morphological analysis dictionaries which has surface-forms created from kana-fields in the original dictionaries. As some results of these methods, we can correctly analysis not only special notations that are not resisted our dictionaries, but also notations that are across several word boundaries, such as "ㄏᘬ㸦sasowa-re㸧".. 1㸬ࡣࡌࡵ࡟ ᅜ❧ᅜㄒ◊✲ᡤ࡛ࡣ⌧ᅾ㸪᪥ᮏㄒṔྐࢥ࣮ࣃࢫ 㸦௨ୗ㸪CHJ㸧ࡢᵓ⠏࡟ྲྀࡾ⤌ࢇ࡛࠸ࡿ㸬CHJ ࡛ࡣྂ௦ㄒ㹼㏆௦ㄒ࡟ࡘ࠸࡚ࡢ௦⾲ⓗ࡞㈨ᩱࢆ 㞟ࡵ㸪᪥ᮏㄒࡢ㏻᫬ⓗ◊✲࡟฼⏝ྍ⬟࡞ࢥ࣮ࣃࢫ ࡜ࡋ࡚㡰ḟබ㛤ࢆ㐍ࡵ࡚࠸ࡿ㸬ࡇࡢ࠺ࡕࠕỤᡞ᫬ ௦⦅Ϩ࣭ϩࠖ࡜ࡋ࡚ࢥ࣮ࣃࢫ໬ࡢᑐ㇟࡜ࡋ࡚࠸ࡿ ὗⴠᮏ࡜ே᝟ᮏࡣ㸪Ⓨヰ㒊ศ࡟ᙜ᫬ࡢヰࡋゝⴥࡀ ཯ᫎࡉࢀ࡚࠾ࡾ㸪㏆ୡㄒࡢᐇែࢆ᫂ࡽ࠿࡟ࡋ㸪୰ ୡ࠿ࡽ㏆⌧௦࡟࠿ࡅ࡚ࡢ᪥ᮏㄒࡢኚ໬ࢆ᥈ࡿୖ ࡛ᴟࡵ࡚㔜せ࡞㈨ᩱ࡛࠶ࡿ㸬ࡇࢀࡽࡣὗⴠᮏࢥ࣮ ࣃࢫ㸪ே᝟ᮏࢥ࣮ࣃࢫ࡜ࡋ࡚㸪ࡑࢀࡒࢀࡍ࡛࡟ヨ ⾜∧ࡀබ㛤ࡉࢀ࡚࠸ࡿ㸬ࡋ࠿ࡋ࡞ࡀࡽ㸪ࡇࢀࡽࡢ ヨ⾜∧ࢆᵓ⠏ࡍࡿ㝿࡟㸪㏆ୡ≉᭷ࡢ⾲グࡢከᵝᛶ. ࡀ㸪ᙧែㄽ᝟ሗ௜୚ࡢపࢥࢫࢺ໬ࡢࡓࡵ࡟ᑟධࡉ ࢀࡓ⮬ືᙧែ⣲ゎᯒࢆጉࡆࡿ࡜ࡋ࡚㸪௨๓ࡼࡾၥ 㢟どࡉࢀ࡚ࡁࡓ[1]㸬≉࡟㢧ⴭ࡞஦㇟࡜ࡋ࡚㸪௨ ୗࡢࡼ࠺࡞⌧௦࡛ࡣ୍⯡ⓗ࡛ࡣ࡞࠸₎Ꮠ⾲グ࡜ ᣺ࡾ௬ྡ࡜ࡢ⤌ࡳྜࢃࡏࡀ࠶ࡿ㸦⏝౛ࡣὗⴠᮏ ࠗ⋢⳥඲ఏⰼ⾤㚷࠘ࡼࡾᘬ⏝㸬ᣓᘼෆࡣཎᮏ࡟௜ ࡉࢀࡓ᣺ࡾ௬ྡ㸧㸬 (1) ᜊ࡟៳᝔㸦ࡸࡘࡿ࢏㸧㠃ᙳࢆ (2) ኤ࡭࠾ࡸࡋࡁࡢ⾗࡟㸬ㄏᘬ㸦ࡉࡑࡣࢀ㸧࡚ ⮬ືᙧែ⣲ゎᯒ࡟࠾࠸࡚ࡣ㸪ᮏ⾜ࡢ₎Ꮠ⾲グࢆゎ ᯒᑐ㇟࡜ࡍࡿࡓࡵ㸪ᑓ⏝ࡢゎᯒ⏝㎡᭩࡟Ⓩ㘓῭࡛ ࠶ࡿࠕ៳᝔㸦ࢩࣙ࢘ࢫ࢖㸧ࠖࠕㄏᘬ㸦ࣘ࢘࢖ࣥ㸧ࠖ. ⓒ 2016 Information Processing Society of Japan. ─ 69 ─.

(2) The Computers and the Humanities Symposium, Dec. 2016. ࡜࠸࠺ㄒ࡟ゎᯒࡉࢀࡿࡇ࡜࡟࡞ࡿࡀ㸪ࡇࢀࡣཎᮏ ࡟௜ࡉࢀࡓ᣺ࡾ௬ྡ࡜ࡣᑐᛂࡏࡎ㸪ㄗゎᯒ࡜࡞ࡿ㸬 ࡉࡽ࡟(2)ࡢሙྜ㸪ᮏᩥࡢㄞࡳ࡜ࡋ࡚ࡣືモࠕㄏ࠺ࠖ 㸩ຓືモࠕࢀࡿࠖࡢ஧༢఩࡟ゎᯒࡉࢀࡿࡢࡀᮃࡲ ࡋ࠸ࡢ࡟ᑐࡋ㸪ゎᯒ⤖ᯝࡣ୍༢఩┦ᙜ࡟࡞ࡿ࡜࠸ ࠺㸪༢఩ᩘࡢ㱈㱒ࡶ⏕ࡌࡿ㸬 ᮏⓎ⾲࡛ࡣ㏆ୡᚋᮇཱྀㄒ㈨ᩱ࡟≉ᚩⓗ㸪࠿ࡘㄗ ゎᯒࡢཎᅉ࡜࡞ࡾࡸࡍ࠸≉Ṧ࡞₎Ꮠ⾲グ㸪࠸ࢃࡺ ࡿᙜ࡚Ꮠࡢᐇែࢆ᫂ࡽ࠿࡟ࡋ㸪⮬ືᙧែ⣲ゎᯒ࡟ 㝿ࡋ࡚⏕ࡌࡿㄢ㢟࡜㸪ࡑࡢゎỴᡭἲࢆᥦ᱌ࡍࡿ㸬. 2㸬CHJ Ụᡞ᫬௦⦅ࡢᴫせ 2.1 CHJ Ụᡞ᫬௦⦅ࡢ⌧≧  CHJ ࡢ࠺ࡕ㸪Ụᡞ᫬௦⦅࡛ࡣ㸪㏆ୡᚋᮇࡢ᭷⏝ ࡞ཱྀㄒ㈨ᩱ࡛࠶ࡿὗⴠᮏ࡜ே᝟ᮏࡢࢥ࣮ࣃࢫ໬ ࢆ㐍ࡵ࡚࠸ࡿ㸬ࡇࡢ࠺ࡕὗⴠᮏࢥ࣮ࣃࢫࡣ㸪ࠗὗ ⴠᮏ኱ᡂ࠘ࢆᗏᮏ࡜ࡋࡓ [2]㸬ᩥ᭩ᵓ㐀ࡸヰ⪅᝟ ሗࢆ XML ᙧᘧ࡛௜୚ࡋ㸪ࡉࡽ࡟ࢸ࢟ࢫࢺࢹ࣮ࢱ ࢆᅜ❧ᅜㄒ◊✲ᡤࡢつᐃࡋࡓゝㄒ༢఩࡛࠶ࡿ▷ ༢఩࡟ศ๭ࡋ㸪ྛ▷༢఩࡟ᙧែㄽ᝟ሗ㸦ရモ㸪ά ⏝ᙧ㸪ㄞࡳ࡞࡝㸧௜୚ࢆ⾜ࡗ࡚࠸ࡿ㸦▷༢఩ࡢヲ ⣽ࡣᚋ㏙ࡍࡿ㸧㸬⌧ᅾ㸪ࠗ⪷㐟㒌࠘,ࠗἙᮾ᪉ゝ ⟽ࡲࡃࡽ࠘,ࠗ⋢⳥඲ఏⰼ⾤㚷࠘ࡢ୕సရ࡟ࡘ࠸ ࡚㸪ࠕࡦࡲࢃࡾ∧ࠗὗⴠᮏࢥ࣮ࣃࢫ࠘Ver.0.5ࠖ[3] ࢆヨసබ㛤୰࡛࠶ࡿ㸬ே᝟ᮏࢥ࣮ࣃࢫ࡟ࡘ࠸࡚ࡣ㸪 Ụᡞᮇࡢ∧ᮏࢆᗏᮏ࡜ࡋ࡚᪂ࡓ࡟⩻้ࢆ⾜࠸㸦ᅗ 1㸧㸪᣺ࡾ௬ྡ᝟ሗ௜ࡁࡢ XML ࢹ࣮ࢱࢆᵓ⠏ࡋ ࡚࠸ࡿ㸦ᅗ 2㸧㸬ࡇࡕࡽࡶࠕࡦࡲࢃࡾ∧ࠕே᝟ᮏ. ࢥ࣮ࣃࢫࠖVer.0.1ࠖ[4]࡜ࡋ࡚㸪ࠗẚ⩼㐃⌮ⰼᘕᚿ ‶ྎ࠘ࡢヨస∧ࢆබ㛤୰࡛࠶ࡿ㸬⌧᫬Ⅼ࡛ࡣ㸪ᮏ ᩥ࠾ࡼࡧ᣺ࡾ௬ྡࢆᑐ㇟࡜ࡋࡓᩥᏐิ᳨⣴ࡢࡳ ࡀྍ⬟࡛࠶ࡾ㸪௒ᚋ㸪ᙧែㄽ᝟ሗࢆ௜୚ࡋࡓࢥ࣮ ࣃࢫࢆබ㛤ࡍࡿணᐃ࡛࠶ࡿ㸬࠸ࡎࢀࡢࢥ࣮ࣃࢫࡶ ඲ᩥ᳨⣴ࢩࢫࢸ࣒ࠕࡦࡲࢃࡾࠖ[5]ୖ࡛฼⏝ྍ⬟࡛ ࠶ࡿ㸦ᅗ 3㸧ࡀ㸪ᑗ᮶ⓗ࡟ࡣ㸪ࡑࡢ௚ࡢ CHJ సရ ࡜ྠᵝ㸪Web ୖࡢࢥ࣮ࣃࢫ᳨⣴࢔ࣉࣜࢣ࣮ࢩࣙࣥ ୰⣡ゝ࡟࠾ࡅࡿ฼⏝ࢆ᝿ᐃࡋ࡚࠸ࡿ㸬୰⣡ゝୖ ࡛ࡢබ㛤᫬࡟ࡣὗⴠᮏ࣭ே᝟ᮏ࡜ࡶ࡟ᑐ㇟సရࢆ ᣑ඘ࡋ㸪ࡲࡓே᝟ᮏࡣᙧែㄽ᝟ሗ௜ࡁࡢࢥ࣮ࣃࢫ ࡜ࡋ࡚බ㛤ࡍ࡭ࡃ㸪⌧ᅾࡶ㛤Ⓨࢆ㐍ࡵ࡚࠸ࡿ㸬 1. https://chunagon.ninjal.ac.jp/. ⓒ 2016 Information Processing Society of Japan. ─ 70 ─.

(3) 「人文科学とコンピュータシンポジウム」 2016 年 12 月. 2.2 ᙧែㄽ᝟ሗࡢ௜୚ ⏕ࢸ࢟ࢫࢺ࡟ᑐࡋ୍࠿ࡽேᡭ࡛ᙧែㄽ᝟ሗࢆ ௜୚ࡍࡿࡇ࡜ࡣேဨ࡜᫬㛫ࡢ㠀ᖖ࡟࠿࠿ࡿసᴗ ࡛࠶ࡿ㸬ࡑࡢࡓࡵ CHJ ࡛ࡣ㸪ࡲࡎ⮬ືᙧែ⣲ゎ ᯒჾ MeCab[6]࠾ࡼࡧ⮬ືᙧែ⣲ゎᯒჾ⏝ࡢ㎡᭩ UniDic ࢆ⏝࠸ࡓ⮬ືゎᯒࢆ⾜࠸㸪ࡑࡢゎᯒ⤖ᯝࢆ ேᡭಟṇࡋ࡚࠸ࡃ࡜࠸࠺సᴗ᪉㔪ࢆ᥇⏝ࡋ࡚࠸ ࡿ[7]㸬Ụᡞ᫬௦⦅࡛ࡶ㸪㎡᭩࡟㏆ୡཱྀㄒ UniDic ࢆ౑⏝ࡋ㸪ேᡭ࡟ࡼࡿಟṇసᴗࢆᐇ᪋ࡋ࡚࠸ࡿ㸬 UniDic ࡣࡶ࡜ࡶ࡜⌧௦ㄒࡢࢥ࣮ࣃࢫ࡬ࡢᙧែㄽ ᝟ሗ௜୚ࡢࡓࡵ࡟㛤Ⓨࡉࢀࡓ㎡᭩࡛࠶ࡿ[8]㸬ࡍ࡭ ࡚ࡢぢฟࡋㄒ࡟ࡘ࠸࡚▷༢఩࡜࠸࠺ᩧ୍࡞ゝㄒ ༢఩ࡀタᐃࡉࢀ࡚࠾ࡾ㸪ྛぢฟࡋㄒࡣ㸪ㄒᙡ⣲㸪 ㄒᙧ㸪᭩Ꮠᙧ㸪Ⓨ㡢ᙧ࡜࠸࠺㝵ᒙᵓ㐀㸦ᅗ 4㸧ࢆ ᣢࡗ࡚࠸ࡿ㸬ࡇࡢ㝵ᒙᵓ㐀࡟ࡼࡾ㸪ࢥ࣮ࣃࢫ᳨⣴ ࡢ㝿㸪⾲グࡢࡺࢀࡸㄒᙧࡢኚ␗࡟࠿࠿ࢃࡾ࡞ࡃ㸪 ⥙⨶ⓗ࡟⏝౛ࢆ཰㞟ࡍࡿࡇ࡜ࡀྍ⬟࡜࡞ࡗ࡚࠸ ࡿ㸬⌧ᅾ㸪ྂᩥゎᯒ⏝ࡢ㎡᭩࡟࠾࠸࡚ࡶ㸪ࡇࢀࡽ ࡢ≉ᛶࡣࡑࡢࡲࡲ࡟㸪ᚲせ࡞ぢฟࡋㄒࢆ⿵඘㸪ㄒ ࡢ༢఩ࢆಟṇࡍࡿ࡞࡝ࡋ࡚㸪ྛ᫬௦ࡢࢸ࢟ࢫࢺ࡟ ᑐᛂࡉࡏ࡚࠸ࡿ㸬. ௕   8QL'LF भమಽଡୗ‫ق‬৅ఠ஄म੄റ‫ك‬. 3. ㏆ୡᚋᮇཱྀㄒ㈨ᩱࡢ≉ᚩ࡜ၥ㢟 3.1 ⮬ືᙧែ⣲ゎᯒ᫬ࡢၥ㢟࡜ࡑࡢせᅉ CHJ ࡢ࠺ࡕ᪤࡟බ㛤ࡉࢀ࡚࠸ࡿᖹᏳ᫬௦⦅㸪ᐊ ⏫᫬௦⦅Ϩࡢ≬ゝ࡛ࡣ㸪ࢥ࣮ࣃࢫࡢᣑ඘࡟ẚ౛ࡋ ࡚ྛ᫬௦⏝ UniDic ࡢゎᯒ⢭ᗘࡶྥୖࡋ࡚࠸ࡗࡓ㸬 ὗⴠᮏ࡛ࡶ㸪2015 ᖺ 3 ᭶᫬Ⅼ࡛ࡢゎᯒ⢭ᗘ㸦F-1 ್㸸Ⓨ㡢ᙧࡲ࡛᥎ᐃ㸧ࡣ⣙ 86 ࡛࠶ࡗࡓࡀ㸪ᑓ⏝ ࡢゎᯒ㎡᭩ࡢసᡂ㸪࠾ࡼࡧᩥయ࡟ྜࢃࡏࡓ㎡᭩ࡢ ౑࠸ศࡅ࡟ࡼࡾ㸪⌧ᅾ࠾ࡼࡑ 90 ࡢゎᯒ⢭ᗘࢆᐇ ⌧ࡋ࡚࠸ࡿ[9]㸬ࡋ࠿ࡋබ㛤῭ࡢ௚ࡢ᫬௦ࡢࢥ࣮ࣃ ࢫ㸦ゎᯒ⢭ᗘ⣙ 96㸧[10]࡟ẚ࡭ࡿ࡜㸪ࡇࡢ್ࡣప ࠸㸬ே᝟ᮏࡶྠᵝ࡛࠶ࡾ㸪ヨ㦂ⓗ࡞⮬ືᙧែ⣲ゎ ᯒࡢ⤖ᯝ㸪⢭ᗘࡣ⣙ 87 ࡛࠶ࡗࡓ[11]㸬ࡑࡇ࡛࢚ࣛ ࣮ศᯒࢆᐇ᪋ࡋࡓ࡜ࡇࢁ㸪ㄗゎᯒࡢせᅉ࡜ࡋ࡚≉ ࡟┠❧ࡗࡓࡢࡀ㸪(1)(2)ࡢࡼ࠺࡞ᮏ⾜ࡢ₎Ꮠ⾲グ ࡟௜ࡉࢀࡓ᣺ࡾ௬ྡࡀ㸪ᅜㄒ㎡᭩࡟Ⓩ㘓ࡉࢀ࡚࠸ ࡿࡼ࠺࡞୍⯡ⓗ࡞ㄞࡳ࡜୍⮴ࡋ࡞࠸౛࡛࠶ࡗࡓ㸬. 3.2 ᣺ࡾ௬ྡ࡜₎Ꮠ⾲グ ㏆ୡᚋᮇཱྀㄒ㈨ᩱࡢ⾲グୖࡢ≉ᚩ࡜ࡋ࡚㸪᣺ࡾ ௬ྡ௜ࡁࡢ₎Ꮠࡀከ⏝ࡉࢀࡿࡇ࡜ࡀᣲࡆࡽࢀࡿ [12]㸬୰࡛ࡶே᝟ᮏࡢ᣺ࡾ௬ྡ௜୚⋡ࡣ✺ฟࡋ࡚ ࠾ࡾ㸪ㄪᰝᑐ㇟୰ࡢ₎Ꮠࡢ 86%࡟᣺ࡾ௬ྡࡀ௜ࡉ ࢀ࡚࠸ࡓ࡜࠸࠺ሗ࿌ࡶ࠶ࡿ[13]㸬ὗⴠᮏࡸே᝟ᮏ ࡞࡝ࡢ㏻಑ᑠㄝ㢮ࡣ኱⾗ྥࡅࡢฟ∧≀࡛࠶ࡾ㸪㞴 ࡋ࠸₎ㄒࡀㄞࡵ࡞࠸ㄞ⪅࡛ࡶ㸪᣺ࡾ௬ྡ࡜ᮏ⾜ࡢ ᖹ௬ྡࡢ㒊ศࡔࡅࢆ┠࡛㏣ࡗ࡚࠸ࡅࡤ㸪ෆᐜࢆ⌮ ゎ࡛ࡁࡿࡼ࠺࡟ᕤኵࡉࢀ࡚࠸ࡓ㸬ࡓࡔࡋ㸪⌧௦࡟ ࠾࠸࡚₎Ꮠ⾲グ࡜ࡑࡢㄞࡳ᪉ࡀᅛᐃⓗ࡛࠶ࡿࡢ ࡟ᑐࡋ㸪㏆ୡ࡟࠾ࡅࡿ₎Ꮠ⾲グ࡜᣺ࡾ௬ྡࡢ⤌ࡳ ྜࢃࡏࡣ㸪ࡼࡾ⮬⏤ᗘࡢ㧗࠸ࡶࡢ࡛࠶ࡗࡓ㸬ᅗ 1. ⓒ 2016 Information Processing Society of Japan. ─ 71 ─.

(4) The Computers and the Humanities Symposium, Dec. 2016. ࡟ᣲࡆࡓ∧㠃ࢆぢ࡚ࡶ㸪ࠕᐙ୺㸦࠾ࡩࡸ㸧ࠖࠕ㑇 ᭩㸦࠿ࡁ࠾ࡁ㸧ࠖࠕ᝿ീ㸦࠾ࡶࡦࡸࡾ㸧࡚ࠖ࡜࠸ ࡗࡓ⌧௦ㄒ࡛ࡣ୍⯡ⓗ࡛࡞࠸⤌ࡳྜࢃࡏࡀᏑᅾ ࡍࡿ㸦⩻้ࢸ࢟ࢫࢺෆ㸪ᯟ࡛ᅖࢇࡔࡶࡢ㸧㸬ࠕỤ ᡞࡢᡙసࡢሙྜ㸪ㄗㄞ㞴ㄞࡢᜍࢀࡢ࠶ࡿㄒ࡟᣺ࡾ ௬ྡࢆ௜ࡅࡿࡔࡅ࡛࡞ࡃ㸪స⪅ࡢ⾲⌧ᡭἲ࡜ࡋ࡚ ₎Ꮠ⾲グ࡜୪❧ࡍࡿࡼ࠺࡞᣺ࡾ௬ྡࡀ௜ࡅࡽࢀ ࡿࠖ[14]ࡓࡵ㸪㏆ୡࡢ∧ᮏ࡟࠾ࡅࡿ᣺ࡾ௬ྡࡣ㸪 ᮏ⾜ᩥᏐิ࡟ᑐࡋ࡚ᚑᒓⓗ࠶ࡿ࠸ࡣ⿵㊊ⓗ࡞ࡶ ࡢ࡛ࡣ࡞ࡃ㸪ᮏ⾜ᩥᏐิ࡜ేࡏ࡚ᮏᩥࡢ୍㒊㸪ࡶ ࡋࡃࡣᮏᩥࡑࡢࡶࡢ࡜ࡋ࡚ᤊ࠼ࡿᚲせࡀ࠶ࡿ㸬 ࡋ࠿ࡋ࡞ࡀࡽᙧែㄽ᝟ሗࢆ௜୚ࡍࡿ㝿ࡣ㸪ᮏ⾜ ᩥᏐิࢆ XML ࡢᮏᩥᩥᏐิ࡜ࡋ࡚౑⏝ࡋ࡚࠸ࡿ㸬 ࡑࡋ࡚⮬ືᙧែ⣲ゎᯒࡢ㝿࡟ࡣᮏᩥᩥᏐิࡢࡳ ࡀゎᯒᑐ㇟࡜࡞ࡾ㸪᣺ࡾ௬ྡࡢ᝟ሗࡣ୍ษཧ↷ࡋ ࡞࠸㸬ࡑࡢ⤖ᯝ㸪⾲グࡀከᵝ࠿ࡘ㸪₎Ꮠ࡜ࡑࡢㄞ ࡳࡀ୍⯡ⓗ࡞ᑐᛂࢆࡋ࡞࠸㸪࠸ࢃࡺࡿᙜ࡚Ꮠࡢሙ ྜ㸪ࡑࡢ⾲グࡀ UniDic ࡟ᮍⓏ㘓࡛࠶ࡿࡓࡵ࡟ㄗ ゎᯒࡀ⏕ࡌࡸࡍࡃ࡞ࡿ㸬. ⾲   ୁ౐ਜ਼भਊथஊभ୻. 3.3 ᙜ࡚Ꮠࡢᐇែ ㏆ୡ∧ᮏ࡟࠾࠸࡚ࡣ㸪ุㄞࡢࡓࡵ࡟₎ㄒ࡟ࢃ࠿ ࡾࡸࡍ࠸᣺ࡾ௬ྡࢆ௜ࡋࡓ࡜⪃࠼ࡽࢀࡿカࡶ࠶ ࢀࡤ㸪ࡑࡢሙ㠃࡟࠾࠸࡚᭱ࡶ┦ᛂࡋ࠸₎Ꮠࢆᙜ࡚ ࡓ࡜ᛮࢃࢀࡿ᣺ࡾ௬ྡࡶ࠶ࡿ㸬ࡑࢀࡀ㏆ୡ࡟࠾࠸ ࡚㏻⏝࡛࠶ࡗࡓ࠿㸪࢖ࣞࢠ࣮ࣗࣛ࡞ࡶࡢ࡛࠶ࡗࡓ ࠿ࢆ᫂ࡽ࠿࡟ࡍࡿ࡟ࡣ㸪㏆ୡ᪥ᮏㄒ࡟ᑐࡍࡿ⥙⨶ ⓗ࡞ㄪᰝࡀᚲせ࡟࡞ࡿ㸬ࡋ࠿ࡋゎᯒ⢭ᗘࢆྥୖࡉ ࡏࡼ࠺࡜ࡍࡿ௨ୖ㸪࢖ࣞࢠ࣮࡛ࣗࣛ࠶ࡿ࡜⪃࠼ࡽ ࢀࡿࡶࡢࢆ୍᪦つᐃࡍࡿᚲせࡀ࠶ࡿ㸬ࡑࡇ࡛ᮏ◊ ✲࡟࠾࠸࡚ࡣ㸪௨ୗࡢ≉ᚩࢆ᭷ࡍࡿ₎Ꮠ⾲グࢆᙜ ࡚Ꮠ࡜ࡋ࡚ྲྀࡾୖࡆࡿ㸬 A) B) C). ᣺ࡾ௬ྡ࡜₎ᏐࡀᩥᏐ༢఩࡛ᑐᛂࡋ࡞࠸ UniDic ࡟᭩ᏐᙧࡀᮍⓏ㘓࡛࠶ࡿ ᅜㄒ㎡᭩➼࡛ࡶ㏻⏝࡜ㄆࡵࡽࢀ࡚࠸࡞࠸. ခஊ਀੶. ஷॉෘ੡. ୨ඌ. औध. ঳௕. ऱधघग. ঳એ. सञ. ૯์. खभ‫ح‬ी. ആ೎. धअड. ྒྷ஖. मऩृऊ. ྒྷ௡. अीष. ੇ঵. ऩॉॎः. ඿ே. ऒ‫ح‬ौेऎ. ဝর. ऴधऒौ. ෟሴ. खृो. ෹ೆ. अञएॉ. ጲੱ. अऌऩ. ਘ౿. णोऩऎ. ໻ઑ. अञऋऱ. ੱຐ. लणऎॉ. ສଠ. अैैऊ. ૏஡. ऴञॉ. ੭ੱ. ऌ‫ح‬ःो. ௯௬. अमऔ. ౥ተ. पणऒॉ. ૕৛. ञ॒ध. ੓ు. िघी. අ಼. ञणखृ. ஜཽ. पैा.      ਀   ಢ౐ਜ਼॑ऒइॊਊथஊभ୻. ࡞࠾㸪㏻ᖖࡢ㡢カ࠿ࡽࡣࡎࢀࡿࡶࡢ࡜ࡋ࡚ࡣ㸪 ࠕఱ ฎ㸦࡝ࡇ㸧ࠖࠕ᫂᪥㸦࠶ࡋࡓ㸧ࠖ࡜࠸ࡗࡓ⇍Ꮠカࡶ Ꮡࡍࡿࡀ㸪ࡇࡢࡼ࠺࡟ᗈࡃ୍⯡࡟㏻⏝ࡋ࡚࠾ࡾ㸪 ࡍ࡛࡟ゎᯒ㎡᭩࡟Ⓩ㘓ࡀぢࡽࢀࡿࡶࡢࡣᙜ࡚Ꮠ ࡢ౛࠿ࡽࡣ㝖እࡍࡿ㸬ᙜ࡚Ꮠ࡟ࡣ㸪ḟ࡟ᣲࡆࡿࡼ ࠺࡟㸪ㄒ༢఩ࡢࡶࡢ㸦⾲ 1㸧㸪▷༢఩ࢆࡇ࠼ࡿࡶ ࡢ㸦⾲ 2㸧࡞࡝㸪ᵝࠎ࡞ࣃࢱ࣮ࣥࡀぢࡽࢀࡿ㸬 㸦ࠗ⋢ ⳥඲ఏⰼ⾤㚷࠘ࠗẚ⩼㐃⌮ⰼᘕᚿ‶ྎ࠘࠿ࡽ౛ࢆ ᣲࡆࡓ㸧 ⾲ 1 ࡢࡼ࠺࡞౛࡟㛵ࡋ࡚ࡣ㸪UniDic ࡟᭩Ꮠᙧࢆ ㏣ຍࡍࡿࡇ࡜࡛ฎ⌮ࡣྍ⬟࡟࡞ࡿ㸬ࡋ࠿ࡋ㸪సရ ࡟ࡼࡗ࡚㸪స⪅ࡸ᫬௦⫼ᬒ㸪⯙ྎ࡜࡞ࡿᆅᇦࡀ␗ ࡞ࡿὗⴠᮏ࣭ே᝟ᮏࡣࢸ࢟ࢫࢺࡢᆒ㉁ᛶࡀపࡃ㸪 ⾲グࡢࣂ࢚࣮ࣜࢩࣙࣥࡣⳘ኱࡞ࡶࡢ࡜࡞ࡿࡇ࡜ ࡀண᝿ࡉࢀࡿ㸬ࡑࡢࡼ࠺࡟≉Ṧ࠿ࡘ౑⏝㢖ᗘࡢᑡ. ခஊ਀੶. ஷॉෘ੡. ଖ. ध‫ػ‬म. ৸඿थ. ेऎ‫ػ‬ऩण‫ػ‬थ. 㵄৕. यऊऑ‫ػ‬ञ. ่઻. ाॊ‫ػ‬ऱध. ഡৢ. ॎॊः‫ػ‬ऒध. ාਬ. औजम‫ػ‬ो.      ࡞࠸ㄒࢆ᪂ࡓ࡟Ⓩ㘓ࡋ⥆ࡅࡿࡇ࡜ࡣ UniDic ࡢᵓ 㐀ୖࡶ㈇ᢸࡀ኱ࡁࡃ㸪ࡑࡶࡑࡶ㏻ᖖࡢ㡢カ࣭⾲グ ࡜ྠࣞ࣋ࣝࡢࡶࡢ࡜ࡋ࡚ᢅ࠺࡭ࡁ࠿࡝࠺࠿㸪᳨ウ ࡍࡿᚲせࡀ࠶ࢁ࠺㸬ࡲࡓே᝟ᮏࢥ࣮ࣃࢫ࡛ࡣ㸪୍ సရࡢࢸ࢟ࢫࢺ㔞ࡀ⭾኱࡛࠶ࡿࡇ࡜࠿ࡽ㸪኱㒊ศ ࢆ㠀ࢥ࢔ࢹ࣮ࢱ㸦ேᡭಟṇࢆ⾜ࢃࡎ㸪⮬ືゎᯒࡢ ࡳࢆ⾜ࡗࡓࢹ࣮ࢱ㸧࡜ࡋ࡚බ㛤ࡍࡿணᐃ࡛࠶ࡾ㸪 ⮬ືゎᯒࡢ⢭ᗘࡢపࡉࡣ኱ࡁ࡞ၥ㢟࡜࡞ࡿ㸬⾲ 2. ⓒ 2016 Information Processing Society of Japan. ─ 72 ─.

(5) 「人文科学とコンピュータシンポジウム」 2016 年 12 月. ࡢࡼ࠺࡟▷༢఩ࢆࡇ࠼ࡿࣃࢱ࣮ࣥࡶ㸪⌧≧ࡢ₎Ꮠ ⾲グࡢࡲࡲ࡛ࡣ㐺ษ࡞ᙧែㄽ᝟ሗࡢ௜୚ࡀ࡛ࡁ ࡞࠸㸬ࡇࡢࡼ࠺࡟ᙜ࡚Ꮠࡢ㢖ฟࡍࡿ㏆ୡᚋᮇཱྀㄒ ㈨ᩱࢆ⏝࠸ࡓࢥ࣮ࣃࢫࡢᵓ⠏࡟࠶ࡓࡗ࡚ࡣ㸪᣺ࡾ ௬ྡࡢ᝟ሗࢆ฼⏝ࡋࡓ᪂ࡓ࡞ゎᯒ᪉ἲࡢ㛤Ⓨࡀ ồࡵࡽࢀࡿ㸬.  ਀  ঽ৿஄ଙಞੰෲಖ২‫)ق‬க‫ك‬. 4. XML ᵓ㐀໬ࢱࢢࢆ฼⏝ࡋࡓᙧែ⣲ゎ ᯒ ᙜ࡚Ꮠࢆ⮬ືᙧែ⣲ゎᯒࡍࡿࡓࡵ㸪ᮏ◊✲࡛ࡣ ᙜ࡚Ꮠࡢᮏ⾜ᩥᏐิࢆ XML ࡢࣝࣅࢱࢢࡢᒓᛶ࡜ ࡋ࡚᱁⣡ࡉࢀ࡚࠸ࡿ᣺ࡾ௬ྡ࡟⨨᥮ࡋ࡚㸦௨ୗ㸪 ࡇࡢ᧯సࢆࠕࣝࣅࢆ㛤ࡃࠖ࡜⾲⌧㸧ゎᯒࢆᐇ᪋ࡍ ࡿ㸬⨨᥮ᑐ㇟࡜࡞ࡿࣝࣅࢱࢢࡣ࠶ࡽ࠿ࡌࡵ๓㏙ࡢ ᇶ‽࡛㑅ᐃࡋ㸪type ᒓᛶ࡟ࠕᙜ࡚Ꮠࠖ࡜࠸࠺್ࢆ タᐃࡋࡓ㸬ࡇࡢసᴗࡣ௒ᅇࡍ࡭࡚ேᡭ࡛ᐇ᪋ࡋࡓ ࡀ㸪[15]࡛ᥦ᱌ࡉࢀ࡚࠸ࡿᙜ࡚Ꮠࡢ⮬ື᳨ฟࢆ⏝ ࠸ࡓ⮬ື໬ࢆ᳨ウ୰࡛࠶ࡿ㸬 [9]࡛ࡣ㸪ὗⴠᮏࡢᙧែ⣲ゎᯒࢆᩥయู࡟ศࡅ㸪 ࡑࢀࡒࢀᑓ⏝ࡢ㎡᭩ࢆ౑ࡗࡓゎᯒࢆᐇ᪋ࡋ࡚࠸ ࡓ㸬ᮏ◊✲࡛ࡶྠᵝ࡟ཱྀㄒ㸪ᩥㄒࡑࢀࡒࢀᑓ⏝ࡢ ㎡᭩ࢆ⏝ពࡍࡿ㸬ࡓࡔࡋࣝࣅࢆ㛤࠸ࡓᖹ௬ྡ⾲グ ࡟ࡶᑐᛂࡍࡿࡓࡵ㸪㎡᭩ࡢ௬ྡᙧฟ⌧ᙧࡢࣇ࢕࣮ ࣝࢻ࠿ࡽ௬ྡ⾲グ㸦࢝ࢱ࢝ࢼ㸧ࢆྲྀࡾฟࡋ㸪ᖹ௬ ྡ࡟⨨᥮ࡋࡓᚋ㸪㎡᭩ࡢ࣮࢟࡜࡞ࡿ⾲ᒙᙧ㸪࠾ࡼ ࡧ᭩Ꮠᙧฟ⌧ᙧࡢࣇ࢕࣮ࣝࢻ࡜⨨᥮ࡋ㸪㎡᭩ࡢ᪂ ࡓ࡞࣮࢟࡜ࡋ࡚㏣ຍࢆ⾜ࡗࡓ㸬ࡇࢀ࡟ࡼࡾ㎡᭩ࡢ Ⓩ㘓࣮࢟ᩘࡣ࠾ࡼࡑ㸰ಸࡢࢧ࢖ࢬ࡜࡞ࡗࡓ㸦⣙ 300 ୓㸧㸬ࡲࡓྛ▷༢఩࡟ᑐࡋ㸪᪂ࡓ࡟㸯ಶࣇ࢕ ࣮ࣝࢻ㸦ิ㸧ࢆ㏣ຍࡋ㸪ࡑࡇ࡟⨨᥮๓ࡢ₎Ꮠ⾲グ ࡛ࡢ᭩Ꮠᙧࢆṧࡋࡓ㸬⨨ࡁ᥮࠼ࢆ⾜ࡗ࡚࠸࡞࠸ሙ ྜࡣ㸪࣮࢟࡜ྠࡌᩥᏐิࡀࡇࡇ࡟᱁⣡ࡉࢀ࡚࠸ࡿ㸬 ୖグࡢ㎡᭩ࢆࡑࢀࡒࢀୗグࡢ▷༢఩᝟ሗ࢔ࣀ ࢸ࣮ࢩࣙࣥ῭ࡳࢥ࣮ࣃࢫୖ࡛ᩥㄒཱྀ࣭ㄒࡢᩥయู ࡟Ꮫ⩦ࢆ⾜ࡗࡓ㸬 ⏥㥐᪂ヰ㸪㜿㜑㝀㙾㸪໭⳹㏻᝟㸪⯆ᩯ᭶㸪᪂᭶ⰼ వ᝟㸪㝧ྎ㑇⦅࣭㲞㛶⛎ゝ㸪㢼ὶ〄ேᙧ㸪␗ᮏ㒌 ୰ወ㆓㸪ி㒔_⟽ࡲࡃࡽ㸪⢋ࡢ᭖㸪ⰼ⾤㚷㸪ⰼ⾤ ᑑࠎዪ㸪㊠፬ேఏ㸪㐟Ꮚ᪉ゝ㸪ഴᇛ㈙ᅄ༑ඵᡭ㸪 ⦾༓ヰ㸪ഴᇛ㈙஧➽㐨㸪㒌୰ወ㆓㸪౮⪅᪉ゝ㸪⪷ 㐟ᗷ㸪᭶ⰼవ᝟ ࡲࡓࢥ࣮ࣃࢫࡣ㏻ᖖࡢᮏ⾜⾲グࡔࡅ࡛࡞ࡃ㸪ᮏ ⾜⾲グࢆᖹ௬ྡ໬ࡋࡓ௬ྡᙧฟ⌧ᙧ࡟⨨ࡁ᥮࠼ ࡓࡶࡢࡶే⏝ࡋࡓ㸬౑⏝ࡋࡓࢥ࣮ࣃࢫࢆカ⦎ 9㸸 ホ౯㸯࡟ศ๭ࡋ࡚⢭ᗘࢆホ౯ࡋࡓ.ศ๭ࡢ⤖ᯝ㸪 ᩥㄒࡢカ⦎⏝ࢥ࣮ࣃࢫࡣ 12,447 ᩥ㸪107321 ▷༢ ఩㸪ホ౯⏝ࢥ࣮ࣃࢫࡣ 1383 ᩥ㸪12,351 ▷༢఩࡜ ࡞ࡗࡓ㸬ࡲࡓཱྀㄒࡢカ⦎⏝ࢥ࣮ࣃࢫࡣ 13176 ᩥ㸪 165534 ▷༢఩㸪ホ౯⏝ࢥ࣮ࣃࢫࡣ 1464 ᩥ㸪17274 ▷༢఩࡜࡞ࡗࡓ㸬ホ౯⤖ᯝࢆ⾲ 3 ࡟♧ࡍ2㸬 2. ホ౯᫬࡟ࣝࣅࢆ㛤ࡃฎ⌮ࡣࡋ࡚࠸࡞࠸ࡇ࡜࡟ὀព. ୖグ࡛సᡂࡋࡓ㎡᭩ࢆ౑࠸㸪ᙜ࡚Ꮠᒓᛶࡢ௜୚ࡉ ࢀࡓࣝࣅࢱࢢࢆ㛤࠸ࡓ XML ࡢᮏ⾜ᩥᏐิࢆゎᯒ ࡋࡓ㸬ࡑࡢ⤖ᯝ㸪㎡᭩ᮍⓏ㘓ࡢࠕཱྀㄿ㸦ࡇ࠺ࡖࡸ ࠺㸧ࠖࡢࡼ࠺࡞≉Ṧ࡞₎Ꮠ⾲グࢆࠕཱྀୖ㸪ྡモᬑ㏻ྡモ-୍⯡ࠖࡢࡼ࠺࡟ゎᯒ࡛ࡁࡿࡼ࠺࡟࡞ࡗ ࡓࡔࡅ࡛࡞ࡃ㸪ࠕㄏᘬ㸦ࡉࡑࡣ|ࢀ㸧ࠖࡢࡼ࠺࡞ 」ᩘㄒ࡟ࡲࡓࡀࡿ₎Ꮠ⾲グࡶṇࡋࡃゎᯒ࡛ࡁࡿ ࡼ࠺࡟࡞ࡗࡓ㸬. 5㸬࠾ࢃࡾ࡟  ᮏ◊✲࡛ࡣ㸪ㄗゎᯒࡢཎᅉ࡜࡞ࡾࡸࡍ࠸≉Ṧ࡞. ₎Ꮠ⾲グ㸦ᙜ࡚Ꮠ㸧ࡀከᩘฟ⌧ࡍࡿࢸ࢟ࢫࢺ࡟ᑐ ࡋ࡚㸪(1) XML ᵓ㐀໬ࢱࢢࢆ฼⏝ࡋ࡚㸪ゎᯒᑐ㇟ ࢆ₎Ꮠ⾲グ࠿ࡽ᣺ࡾ௬ྡ࡟⨨ࡁ᥮࠼㸪(2) ᙧែ⣲ ゎᯒ⏝㎡᭩࡟ᖹ௬ྡࡢ᭩Ꮠᙧࢆ㏣ຍࡍࡿࡇ࡜࡛㸪 ゎᯒ⢭ᗘࡢྥୖࡀྍ⬟࡛࠶ࡿࡇ࡜ࢆ᫂ࡽ࠿࡟ࡋ ࡓ㸬≉࡟㸪ᩧ୍࡞ゝㄒ༢఩ࢆタᐃࡋ࡚࠸ࡿṔྐࢥ ࣮ࣃࢫ࡟࠾࠸࡚㸪ࠕㄏᘬ㸦ࡉࡑࡣ|ࢀ㸧ࠖࡢࡼ࠺ ࡟㸪ᮏ⾜ᩥᏐิ࡜ࡋ࡚ࡣ୍▷༢఩㸦ㄏᘬ㸧㸪௜ࡉ ࢀࡓㄞࡳ᪉࡜ࡋ࡚ࡣ஧▷༢఩㸦ㄏࡣ㹺ࢀ㸧࡜࡞ࡿ ሙྜࡢฎ⌮᪉ἲࡀႚ⥭ࡢㄢ㢟࡜࡞ࡗ࡚࠸ࡓࡀ㸪ᮏ ᡭἲࡢᑟධ࡟ࡼࡾゎỴࡉࢀࡿࡇ࡜ࡀศ࠿ࡗࡓ㸬 ὗⴠᮏ࣭ே᝟ᮏࡢࢥ࣮ࣃࢫࡣ㸪⌧ᅾヨ⾜∧ࢆබ 㛤୰࡛࠶ࡾ㸪௒ᚋ㸪බ㛤సရࡢᣑ඘ࢆィ⏬ࡋ࡚࠸ ࡿ㸬ᙜ࡚Ꮠࡀ㢖ฟࡍࡿࢸ࢟ࢫࢺ࡜ࡋ࡚ࡣ㸪ྠࡌࡃ ㏆ୡᚋᮇࡢ⁥✍ᮏࡸ㸪᫂἞ᮇࡢᑠㄝ㢮ࡶᣲࡆࡽࢀ㸪 ௒ᚋࡑࢀࡽࡢ㈨ᩱࢆࢥ࣮ࣃࢫ໬ࡍࡿ㝿࡟ࡶ㸪ࡇࡢ ᡭἲࡀᛂ⏝ྍ⬟࡛࠶ࡿ㸬 ࡞࠾௒ᅇ㸪◊✲ᑐ㇟࡜ࡋࡓᙜ࡚Ꮠࡣ඲࡚ேᡭ࡛ ᢳฟ㸪ࢱࢢ௜ࡅࢆ⾜ࡗࡓࡀ㸪[15]࡛ᥦ᱌ࡉࢀࡿࡼ ࠺࡞ᡭἲࢆ⏝࠸ࢀࡤ㸪ᑗ᮶ⓗ࡟ࡣࡼࡾ⢭☜࡞⮬ື ุูࡀྍ⬟࡟࡞ࡿ㸬ᘬࡁ⥆ࡁ㸪ゎᯒᡭἲࡢ᳨ウࡸ ゎᯒ⏝㎡᭩ࡢ㛤Ⓨࢆ㐍ࡵࡿᚲせࡀ࠶ࡿ㸬. ௜グ ᮏ◊✲ࡣ㸪ᅜ❧ᅜㄒ◊✲ᡤඹྠ◊✲ࠕ㏻᫬ࢥ࣮ ࣃࢫࡢᵓ⠏࡜᪥ᮏㄒྐ◊✲ࡢ᪂ᒎ㛤ࠖ㸦࣮ࣜࢲ ࣮㸸ᑠᮌ᭮ᬛಙ㸧࡞ࡽࡧ࡟㸪ே㛫ᩥ໬◊✲ᶵᵓᗈ 㡿ᇦ㐃ᦠᇶᖿ◊✲ࣉࣟࢪ࢙ࢡࢺࠕ␗ศ㔝⼥ྜ࡟ࡼ ࡿ⥲ྜ᭩≀Ꮫࡢᵓ⠏ࠖࡢࣘࢽࢵࢺࠕ⾲グ᝟ሗ࡜᭩ ㄅᙧែ᝟ሗࢆຍ࠼ࡓ᪥ᮏㄒṔྐࢥ࣮ࣃࢫࡢ⢭⦓ ⓒ 2016 Information Processing Society of Japan. ─ 73 ─.

(6) The Computers and the Humanities Symposium, Dec. 2016. ໬ࠖ㸦࣮ࣜࢲ࣮㸸㧗⏣ᬛ࿴㸧ࡢ◊✲ᡂᯝࢆሗ࿌ࡋ ࡓࡶࡢ࡛࠶ࡿ㸬.  ཧ⪃ᩥ⊩. 15㸧ᒸ↷᫭㸸ᩥᏐ༢఩ࡢከᑐከ⮬ື࢔ࣛ࢖࣓ࣥ ࢺࢆ⏝࠸ࡓ᪥ᮏㄒṔྐࢥ࣮ࣃࢫࡢࣝࣅ࢔ࣀࢸ࣮ ࢩࣙࣥࡢ⮬ືಟṇ㸪ࡌࢇࡶࢇࡇࢇ 2016㸦2016 ᮍ බห㸧㸬. 1) ᕷᮧኴ㑻㸸㏆ୡཱྀㄒ㈨ᩱࡢࢥ࣮ࣃࢫ໬̿≬ ゝ࣭ὗⴠᮏࡢࢥ࣮ࣃࢫ໬ࡢ㐣⛬࡜ㄢ㢟̿㸪᪥ᮏㄒ Ꮫ 11 ᭶⮫᫬ቑหྕ ᪥ᮏㄒྐ◊✲࡜Ṕྐࢥ࣮ࣃ ࢫ㸪Vol. 33㸪No. 14㸪pp.96-109㸪᫂἞᭩㝔㸦2014㸧㸬 2) ὗⴠᮏ኱ᡂ⦅㞟ጤဨ఍⦅㸸ὗⴠᮏ኱ᡂ㸪୰ ኸබㄽ♫㸦1978-88㸧㸬 3) ᅜ❧ᅜㄒ◊✲ᡤࢥ࣮ࣃࢫ㛤Ⓨࢭࣥࢱ࣮㸦ᕷ ᮧኴ㑻࡯࠿㸧⦅㸸ࠗࡦࡲࢃࡾ∧ࠕὗⴠᮏࢥ࣮ࣃࢫࠖ Ver.0.5࠘㸪 ࠑ http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#sh areࠒ㸦ཧ↷ 2016-11-01㸧㸬 4) ᅜ❧ᅜㄒ◊✲ᡤࢥ࣮ࣃࢫ㛤Ⓨࢭࣥࢱ࣮㸦⸨ ᮏⅉ࣭㧗⏣ᬛ࿴࡯࠿㸧⦅㸸ࠗࡦࡲࢃࡾ∧ࠕே᝟ᮏ ࢥ࣮ࣃࢫࠖVer.0.5࠘㸪 ࠑ http://pj.ninjal.ac.jp/corpus_center/chj/edo.html#ni njouࠒ㸦ཧ↷ 2016-11-01㸧 㸬 5) ᒣཱྀᫀஓ㸸ᵓ㐀໬ࢸ࢟ࢫࢺ࡟ᑐᛂࡋࡓ඲ᩥ ᳨⣴ࢩࢫࢸ࣒ࠗࡦࡲࢃࡾ࠘㸪ᅜ❧ᅜㄒ◊✲ᡤሗ࿌ 122㸪pp.49-82㸪༤ᩥ㤋᪂♫㸦2002㸧 㸬 6) Taku Kudo, Kaoru Yamamoto(Titech), Yuji Matsumoto㸸Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP-2004)㸪pp.230-237㸦2004㸧 㸬 7) ఏᗣᬕ㸪ᑠᮌ᭮ᬛಙ㸪ᑠ᳚⚽ᶞ࡯࠿㸸ࢥ࣮ ࣃࢫ᪥ᮏㄒᏛࡢࡓࡵࡢゝㄒ㈨※̿ᙧែ⣲ゎᯒ⏝ 㟁Ꮚ໬㎡᭩ࡢ㛤Ⓨ࡜ࡑࡢᛂ⏝㸪᪥ᮏㄒ⛉Ꮫ㸪Vol. 22㸪ᅜ❧ᅜㄒ◊✲ᡤ㸦2007㸧㸬 8) ๓ᕝ႐ஂ㞝㸸KOTONOHAࠗ⌧௦᪥ᮏㄒ᭩ࡁ ゝⴥᆒ⾮ࢥ࣮ࣃࢫ࠘ࡢ㛤Ⓨ㸪᪥ᮏㄒࡢ◊✲㸪Vol. 4㸪 No. 1㸪pp.82-95㸪᪥ᮏㄒᏛ఍㸦2008㸧 㸬 9㸧ᕷᮧኴ㑻㸪ᑠᮌ᭮ᬛಙ㸸ᩥ᭩ᵓ㐀ࢆ฼⏝ࡋ ࡓ㏆ୡᮇὗⴠᮏࡢᙧែ⣲ゎᯒ㸪ゝㄒฎ⌮Ꮫ఍➨ 2 2 ᅇᖺḟ኱఍Ⓨ⾲ㄽᩥ㞟㸪pp.4-5㸪ゝㄒฎ⌮Ꮫ఍ 㸦2016㸧 㸬 10) ᑠᮌ᭮ᬛಙ࣭୰ᮧኊ⠊㸸ࠗ⌧௦᪥ᮏㄒ᭩ࡁ ゝⴥᆒ⾮ࢥ࣮ࣃࢫ࠘ᙧែㄽ᝟ሗ࢔ࣀࢸ࣮ࢩࣙࣥᨭ ᥼ࢩࢫࢸ࣒ࡢタィ࣭ᐇ⿦࣭㐠⏝㸦ࢥ࣮ࣃࢫ࢔ࣀࢸ ࣮ࢩࣙࣥ:᪂ࡋ࠸ྍ⬟ᛶ࡜ඹ᭷໬࡟ࡴࡅ࡚ࡢヨ ࡳ㸧 㸪⮬↛ゝㄒฎ⌮ 21-2㸪pp.301-332㸦2014㸧 㸬 11㸧⸨ᮏⅉ㸪໭㷂ຬᕹ㸪ᕷᮧኴ㑻࡯࠿㸸ࠕே᝟ ᮏࢥ࣮ࣃࢫࠖࡢタィ࡜ᵓ⠏㸪ᅜ❧ᅜㄒ◊✲ᡤㄽ㞟㸪 Vol. 12㸦2017 ᮍබห㸧 㸬 12) ᑠᯇᑑ㞝㸸Ụᡞ᫬௦ࡢᅜㄒ㸪ᮾிᇽฟ∧ 㸦1985㸧 㸬 13) ▮㔝‽㸸ே᝟ᮏࡢ₎Ꮠ㸪₎Ꮠㅮᗙ 7 ㏆ୡࡢ ₎Ꮠ࡜ࡇ࡜ࡤ㸪pp.199-218㸪᫂἞᭩㝔㸦1987㸧㸬 14) ᅵᒇಙ୍㸸ᘧீ୕㤿ࡢ₎Ꮠ౑⏝̿ࠗᾋୡ㢼 ࿅࠘ࢆ㈨ᩱ࡜ࡋ࡚̿㸪᪥ᮏㄒᏛ㸪 Vol.5, No.5, pp34-40㸦1986㸧㸬. ⓒ 2016 Information Processing Society of Japan. ─ 74 ─.

(7)

参照

関連したドキュメント

In particular, we consider a reverse Lee decomposition for the deformation gra- dient and we choose an appropriate state space in which one of the variables, characterizing the

In order to be able to apply the Cartan–K¨ ahler theorem to prove existence of solutions in the real-analytic category, one needs a stronger result than Proposition 2.3; one needs

This paper presents an investigation into the mechanics of this specific problem and develops an analytical approach that accounts for the effects of geometrical and material data on

While conducting an experiment regarding fetal move- ments as a result of Pulsed Wave Doppler (PWD) ultrasound, [8] we encountered the severe artifacts in the acquired image2.

The explicit treatment of the metaplectic representa- tion requires various methods from analysis and geometry, in addition to the algebraic methods; and it is our aim in a series

We have avoided most of the references to the theory of semisimple Lie groups and representation theory, and instead given direct constructions of the key objects, such as for

Amount of Remuneration, etc. The Company does not pay to Directors who concurrently serve as Executive Officer the remuneration paid to Directors. Therefore, “Number of Persons”

(今後の展望 1) 苦情解決の仕組みの活用.