文書分類の手法と一般化線形モデルを用いた英語ライティングにおける文法的誤りの影響

(1)

ᩥ᭩ศ㢮ࡢᡭἲ࡜୍⯡໬⥺ᙧࣔࢹࣝࢆ⏝࠸ࡓ

ⱥㄒࣛ࢖ࢸ࢕ࣥࢢ࡟࠾ࡅࡿᩥἲⓗㄗࡾࡢᙳ㡪

▼஭ 㞝㝯㸦᪩✄⏣኱Ꮫ ኱Ꮫ⥲ྜ◊✲ࢭࣥࢱ࣮㸧㏆⸨ ᝆ௓㸦᪩✄⏣኱Ꮫ ࢢ࣮ࣟࣂ࢚ࣝࢹࣗࢣ࣮ࢩࣙࣥࢭࣥࢱ࣮

㸧

ᮏ◊✲࡛ࡣ㸪ᩥ᭩ศ㢮ࡢᡭἲ࡜୍⯡໬⥺ᙧࣔࢹࣝࢆ⏝࠸࡚ⱥㄒࣛ࢖ࢸ࢕ࣥࢢ࡟࠾ࡅࡿᩥἲⓗㄗࡾࡢ ᙳ㡪࡟ࡘ࠸࡚ㄪᰝࡋࡓ㸬᪥ᮏேⱥㄒᏛ⩦⪅ࡢࢫࣆ࣮࢟ࣥࢢࡸࣛ࢖ࢸ࢕ࣥࢢࡢࣃࣇ࢛࣮࣐ࣥࢫࢆ⮬ື᥇ Ⅼࡍࡿヨࡳࡀ㏆ᖺὀ┠ࢆ㞟ࡵ࡚࠸ࡿࡀ㸪࡝ࡢ≉ᚩ㔞ࡀ࡝ࢀ࡯࡝ホ౯࡟ᐤ୚ࡋ࡚࠸ࡿ࠿࡟ࡘ࠸࡚ࡣ㸪ࡲ ࡔ༑ศ࡟᫂ࡽ࠿࡟ࡉࢀ࡚࠸࡞࠸㸬ᮏ◊✲࡛ࡣ㸪ࣃࣇ࢛࣮࣐ࣥࢫࢆ ࡿ୍ࡘࡢᣦᶆ࡛࠶ࡿᩥἲⓗṇ☜ࡉ ࡟↔Ⅼࢆᙜ࡚࡚㸪ࡑࡢᙳ㡪ࢆㄪᰝࡋࡓ㸬ࡑࡢ⤖ᯝ㸪࢚ࢵࢭ࢖ホ౯࡟ᙳ㡪ࢆ୚࠼࡚࠸ࡿᩥἲⓗㄗࡾࡣ㸪 ືモࡢㄒᙡ࡟㛵ࡍࡿ࢚࣮ࣛ࡜ㄒ㡰࡟㛵ࡍࡿ࢚࣮ࣛࡢ஧✀㢮ࡢᩥἲⓗㄗࡾ࡛࠶ࡗࡓ㸬ⱥㄒᏛ⩦⪅ࡢࣛ࢖ ࢸ࢕ࣥࢢホ౯࡟࠾࠸࡚ࡣ㸪ືモ࡟㛵ࡍࡿㄗࡾࡣ㸪௚ࡢᩥἲⓗㄗࡾ࡟ẚ࡭࡚㸪࢚ࢵࢭ࢖ࡢ඲యⓗホ౯࡟ ኱ࡁ࡞ᙳ㡪ࢆ୚࠼࡚࠸ࡿࡇ࡜ࢆ♧၀ࡍࡿ⤖ᯝ࡜࡞ࡗࡓ㸬

Investigating effects of accuracy in English writing based on the

technique of essay classification and generalized linear model

Yutaka Ishii (Center for Higher Education Studies, Waseda University) Yusuke Kondo (Global Education Center, Waseda University)

This study investigates effects of grammatical accuracy in English writing based on the technique of essay classification and generalized linear model. In recent years, there has been an increasing interest in automated scoring of learners' production skills such as speaking and writing. However, one major issue in the research concerned what linguistic features contribute to essay grading. This paper assesses the significance of grammatical accuracy in essay scoring. The results showed that verb related errors (errors in word selection) and errors in word order affect learners’ essay evaluation.

㸯㸬ࡣࡌࡵ࡟

ᮏ◊✲࡛ࡣ㸪ᩥ᭩ศ㢮ࡢᡭἲ࡜୍⯡໬⥺ᙧࣔࢹ ࣝࢆ⏝࠸࡚ⱥㄒࣛ࢖ࢸ࢕ࣥࢢ࡟࠾ࡅࡿᩥἲⓗㄗ ࡾࡢᙳ㡪࡟ࡘ࠸࡚ㄪᰝࡋࡓ㸬᪥ᮏேⱥㄒᏛ⩦⪅ࡢ ࢫࣆ࣮࢟ࣥࢢࡸࣛ࢖ࢸ࢕ࣥࢢࡢࣃࣇ࢛࣮࣐ࣥࢫ ࢆ⮬ື᥇Ⅼࡍࡿヨࡳࡀ㏆ᖺὀ┠ࢆ㞟ࡵ࡚࠸ࡿࡀ㸪 ࡝ࡢ≉ᚩ㔞ࡀ࡝ࢀ࡯࡝ホ౯࡟ᐤ୚ࡋ࡚࠸ࡿ࠿࡟ ࡘ࠸࡚ࡣ㸪ࡲࡔ༑ศ࡟᫂ࡽ࠿࡟ࡉࢀ࡚࠸࡞࠸㸬ᮏ ◊✲࡛ࡣ㸪ࣃࣇ࢛࣮࣐ࣥࢫࢆ ࡿ୍ࡘࡢᣦᶆ࡛࠶ ࡿᩥἲⓗṇ☜ࡉ࡟↔Ⅼࢆᙜ࡚࡚㸪ࡑࡢᙳ㡪ࢆㄪᰝ ࡍࡿ㸬

㸰㸬ඛ⾜◊✲

ࡇࢀࡲ࡛ࡢࣛ࢖ࢸ࢕ࣥࢢ◊✲ࡢከࡃࡣ㸪➨஧ゝㄒ࡜ࡋ࡚ࡢⱥㄒࣛ࢖ࢸ࢕ࣥࢢࢆᑐ㇟࡜ࡋ࡚⾜ࢃ ࢀ࡚ࡁ࡚࠾ࡾ㸪᪥ᮏேⱥㄒᏛ⩦⪅ࢆᑐ㇟࡜ࡋࡓ◊ ✲ࡣ࠶ࡲࡾ⾜ࢃࢀ࡚ࡇ࡞࠿ࡗࡓ㸬ࡲࡓ᪥ᮏ࡟࠾ࡅ ࡿⱥㄒࣛ࢖ࢸ࢕ࣥࢢ◊✲࡟ࡣ኱ࡁࡃศࡅ࡚஧ࡘ ࡢၥ㢟ࡀᏑᅾࡍࡿ࡜ゝࢃࢀ࡚ࡁࡓ㸬୍ࡘࡣ㸪ࣛ࢖ ࢸ࢕ࣥࢢࡢホ౯ࡣ㸪ᩍᖌ࡟࡜ࡗ࡚㸪኱ࡁ࡞㈇ᢸ࡛ ࠶ࡿ࡜࠸࠺ࡇ࡜࡜㸪ホᐃ⪅㛫ࡢホ౯ࡢ୙୍⮴ࡢၥ 㢟࡛࠶ࡿ㸬 ୖグࡢၥ㢟ࢆゎỴࡍࡿࡓࡵ࡟㸪㏆ᖺὀ┠ࢆ㞟ࡵ ࡚࠸ࡿࡢࡀࣛ࢖ࢸ࢕ࣥࢢࡢ⮬ືホ౯࡛࠶ࡿ㸬ࣛ࢖ ࢸ࢕ࣥࢢࡢ⮬ືホ౯࡟࠾࠸࡚ࡣ㸪ࡉࡲࡊࡲ࡞ゝㄒ ⓗᣦᶆࡀ⏝࠸ࡽࢀࡿ㸬௦⾲౛࡜ࡋ࡚Educational Testing Service㸦ETS㸧ࡢ e-rater ࡣ㸪ୗグ 12 ಶࡢኚᩘࢆ⏝࠸࡚࠸ࡿ㸬 1. ⥲ㄒᩘ࡟ᑐࡍࡿᩥἲ࢚࣮ࣛࡢ๭ྜ 2. ⥲ㄒᩘ࡟ᑐࡍࡿㄒࡢ౑⏝ἲ࡟ࡘ࠸࡚ࡢ࢚ࣛ ࣮ࡢ๭ྜ 3. ⥲ㄒᩘ࡟ᑐࡍࡿᡭ㡰ࡢ࢚࣮ࣛࡢ๭ྜ 4. ⥲ㄒᩘ࡟ᑐࡍࡿࢫࢱ࢖ࣝ࡟ࡘ࠸࡚ࡢ࢚࣮ࣛ ࡢ๭ྜ 5. ᚲせ࡜ࡉࢀࡿㄯヰせ⣲ࡢᩘ 6. ㄯヰせ⣲࡟࠾ࡅࡿᖹᆒㄒᩘ 7. సᩥࢆ 6 Ⅼἲ࡛᥇Ⅼࡍࡿ㝿࡟ㄒᙡࡢ㢮ఝ ᗘࡀ୍␒㏆࠸Ⅼᩘ 8. ᭱㧗Ⅼࢆྲྀࡗࡓసᩥ࡜ࡢㄒᙡࡢ㢮ఝᗘ 9. Type-Token Ratio 10. ㄒᙡࡢᅔ㞴ᗘ

(2)

11. ᖹᆒ༢ㄒ㛗 12. ⥲ㄒᩘ

㸦Burstein, Chodorow & Leacock, 2004㸧 ୖグࡢኚᩘࡢ୰࡛඲యࡢ3 ศࡢ 1 ࢆ༨ࡵࡿࡢࡀᩥ ἲⓗㄗࡾ࡛࠶ࡿ㸬ୖグࢆ㚷ࡳࡿ࡜㸪ࣛ࢖ࢸ࢕ࣥࢢ ࡢホ౯ࡣᩥἲⓗㄗࡾ࡟኱ࡁࡃᐤ୚ࡍࡿ࡜⪃࠼ࡽ ࢀࡿ㸬୍᪉࡛㸪Jones㸦2006㸧ࡣ㸪ࣛ࢖ࢸ࢕ࣥࢢ ࡟࠾ࡅࡿ46 ⟠ᡤࡢᩥἲⓗㄗࡾࢆゞṇࡋ㸪ࡑࢀࢆ ࠶ࡿ⮬ື᥇Ⅼࢩࢫࢸ࣒࡟ホ౯ࡉࡏࡓ࡜ࡇࢁ㸪ゞṇ ๓࡜ゞṇᚋࡢࢫࢥ࢔ࡀ඲ࡃྠࡌ࡛࠶ࡗࡓ࡜࠸࠺ ࡇ࡜ࢆሗ࿌ࡋ࡚࠸ࡿ㸬ࡇࡢ◊✲ࡣ㸪Ꮫ⩦⪅ࡢᩥἲ ⓗㄗࡾࡀࣛ࢖ࢸ࢕ࣥࢢࡢホ౯࡟࠾࠸࡚ᐤ୚ࡋ࡞ ࠸ྍ⬟ᛶࢆ♧၀ࡍࡿ◊✲࡛࠶ࡿ㸬ࡑࡇ࡛ᮏ◊✲࡛ ࡣ㸪ᩥἲⓗㄗࡾ࡜࠸࠺ኚᩘࡢࡳ࡛࡝ࢀ࡯࡝ࣛ࢖ࢸ ࢕ࣥࢢࡢホ౯ࢆண ࡍࡿࡇ࡜ࡀ࡛ࡁࡿ࠿ࢆ᳨ウ ࡍࡿ㸬 ᮏ◊✲ࡀ㸪ᩥἲⓗㄗࡾ࡟↔Ⅼࢆᙜ࡚ࡿ⌮⏤࡜ࡋ ࡚ࡣ㸪஧Ⅼࡀᣲࡆࡽࢀࡿ㸬୍Ⅼ┠ࡣ㸪⮬↛ゝㄒฎ ⌮ࡢᩍ⫱ᛂ⏝࡛㏆ᖺ┒ࢇ࡟ᩥἲⓗㄗࡾࡢ⮬ື᳨ ฟ࡟㛵ࡍࡿඹ㏻ࢱࢫࢡ࡞࡝ࡀ⾜ࢃࢀ࡚࠸ࡿ࡜࠸ ࠺Ⅼ࡛࠶ࡿ㸬౛࠼ࡤ㸪ࡇࢀࡲ࡛࡟ୗグ⾲1 ࡢࡼ࠺ ࡞ඹ㏻ࢱࢫࢡࡀ⾜ࢃࢀ࡚࠸ࡿ㸬⾲-1 ⮬↛ゝㄒฎ⌮ࡢᩍ⫱ᛂ⏝ࡢඹ㏻ࢱࢫࢡ ఍㆟ྡ ࢱࢫࢡ

Helping Our Own 㸦HOO㸧2011

ㄽᩥࡢᩥἲⓗㄗࡾゞṇ Helping Our Own

㸦HOO㸧2012 ๓⨨モ࡜㝈ᐃモࡢᩥἲ ⓗㄗࡾゞṇ Native Language Identification Shared Task ⱥసᩥ࠿ࡽⱥㄒᏛ⩦⪅ ࡢẕㄒ᥎ᐃ CoNLL 2013 㝈ᐃモ㸪๓⨨モ㸪ᩘ㸪 ືモࡢᙧ㸪୍⮴㸪ࢫ࣌ ࣝ㸪ྃㄞⅬࡢᩥἲⓗㄗ ࡾゞṇ ㄗࡾ᳨ฟ࣭ゞṇ࣮࣡ࢡ ࢩࣙࢵࣉ 2012 ๓⨨モㄗࡾ࣭ືモ㸦୺ ㄒ-ືモࡢ୍⮴㸧ㄗࡾࡢ 2 ࡘࡢࢺࣛࢵࢡ࡟ຍ࠼㸪ㄗࡾࡢ✀㢮ࢆ㝈ᐃࡋ࡞ ࠸ࣃ࢖ࣟࢵࢺࢺࣛࢵࢡ ୖグࡢࡼ࠺࡞◊✲࡟࠾ࡅࡿ▱ぢࡣ㸪እᅜㄒᩍ⫱◊ ✲ࡢⓎᒎ࡟኱ࡁࡃᐤ୚ࡍࡿࡶࡢ࡛࠶ࡿ࡜⪃࠼ࡽ ࢀࡿ㸬ࡋ࠿ࡋ࡞ࡀࡽ㸪እᅜㄒᩍ⫱◊✲⪅࡟࡜ࡗ࡚ ࡶ᭱ࡶ㛵ᚰࡀ࠶ࡿࡢࡣ㸪࡝࠺࠸ࡗࡓᩥἲⓗㄗࡾࡀ ࣛ࢖ࢸ࢕ࣥࢢࡢホ౯࡟ᐤ୚ࡍࡿ࠿࡜࠸࠺Ⅼ࡛࠶ ࡿ㸬ࡑࡢ◊✲ᡂᯝࡣ㸪ᩍᐊ࡟࠾ࡅࡿࣛ࢖ࢸ࢕ࣥࢢ ᣦᑟ࡞࡝࡟ࡶ⏕࠿ࡍࡇ࡜ࡀྍ⬟࡛࠶ࡿ㸬 ஧Ⅼ┠ࡣ㸪እᅜㄒᩍ⫱࡬ࡢά⏝ࡀ㐍ࡵࡽࢀ࡚࠸ ࡿ࣮ࣚࣟࢵࣃゝㄒඹ㏻ཧ↷ᯟ࡟࠾ࡅࡿᩥἲⓗṇ ☜ࡉࡢグ㏙ࡢ⢭⦓໬࡛࠶ࡿ㸬ୗグ⾲2 ࡣ㸪ᩥἲⓗ ㄗࡾ࡟ᑐࡍࡿ⬟ຊグ㏙ᩥ࡛࠶ࡿࡀ㸪ࡑࡢグ㏙ࢆぢ ࡚ࡳࡿ࡜㸪ẁ㝵ࡈ࡜ࡢㄗࡾࡢ⛬ᗘ࡟ࡘ࠸࡚₍↛࡜ ࡋࡓグ㏙ࡀ࡞ࡉࢀ࡚࠸ࡿࡢࡳ࡛࠶ࡿ㸬Ꮫ⩦⪅ࡀ⩦ ⇍ᗘࡈ࡜࡟࠾࠿ࡋࡸࡍ࠸ᩥἲⓗㄗࡾࢆ≉ᐃࡍࡿ ࡇ࡜ࡣ㸪ࡇ࠺ࡋࡓ⬟ຊグ㏙ᩥࡢ⢭⦓໬࡟ࡶᛂ⏝ࡍ ࡿࡇ࡜ࡀྍ⬟࡛࠶ࡿ㸬⾲-2 ࣮ࣚࣟࢵࣃゝㄒඹ㏻ཧ↷ᯟࡢᩥἲⓗṇ☜ࡉࡢ ⬟ຊグ㏙ᩥ ࣞ࣋ࣝ ᩥἲⓗṇ☜ࡉ C2 㸦౛࠼ࡤ㸪ࡇࢀ࠿ࡽゝ࠺ࡇ࡜ࢆ⪃࠼࡚࠸ ࡿ᫬ࡸ㸪௚ேࡢ཯ᛂࢆࣔࢽࢱ࣮ࡋ࡚࠸ࡿ ࡼ࠺࡞᫬࡜࠸ࡗࡓ㸧௚ࡢࡇ࡜࡟ὀពࢆᡶ ࡗ࡚࠸ࡿ᫬࡛ࡶ㸪」㞧࡞ゝⴥ࡟ࡘ࠸࡚ᖖ ࡟㧗࠸ᩥἲ㥑౑ຊࢆ⥔ᣢࡋ࡚࠸ࡿ㸬 C1 ᖖ࡟㧗࠸ᩥἲⓗṇ☜ࡉࢆ⥔ᣢࡍࡿ㸬ㄗࡾ_{ࡣᑡ࡞ࡃ㸪ぢࡘࡅࡿࡇ࡜ࡣ㞴ࡋ࠸㸬} B2 㧗࠸ᩥἲ㥑౑ຊࡀ࠶ࡿ㸬᫬࡟ࡣࠕゝ࠸㛫㐪࠸ࠖࡸ㸪ᩥᵓ㐀࡛ࡢഅ↛㉳ࡇࡋࡓㄗࡾ ࡸல⣽࡞୙ഛࡀぢࡽࢀࡿሙྜࡀ࠶ࡿࡀ㸪 ࡑࡢᩘࡣᑡ࡞ࡃ㸪ᚋ࡛ぢ┤ࡏࡤゞṇ࡛ࡁ ࡿࡶࡢࡀከ࠸㸬 ẚ㍑ⓗ㧗࠸ᩥἲ㥑౑ຊࡀぢࡽࢀࡿ㸬ㄗゎ ࡟ࡘ࡞ࡀࡿࡼ࠺࡞㛫㐪࠸ࡣ≢ࡉ࡞࠸㸬 B1 㥆ᰁࡳࡢ࠶ࡿ≧ἣ࡛ࡣ㸪๭ྜṇ☜࡟ࢥ࣑ ࣗࢽࢣ࣮ࢩࣙࣥࢆ⾜࠺ࡇ࡜ࡀ࡛ࡁࡿ㸬ከ ࡃࡢሙྜ㧗࠸࡛ࣞ࣋ࣝࡢ㥑౑⬟ຊࡀ࠶ ࡿࡀ㸪ẕㄒࡢᙳ㡪ࡀ᫂ࡽ࠿࡛࠶ࡿ㸬ㄗࡾ ࡶぢࡽࢀࡿࡀ㸪ᮏேࡀ㏙࡭ࡼ࠺࡜ࡋ࡚࠸ ࡿࡇ࡜ࡣ᫂ࡽ࠿࡟ศ࠿ࡿ㸬 ẚ㍑ⓗண ྍ⬟࡞≧ἣ࡛㸪㢖⦾࡟౑ࢃࢀ ࡿࠕ⧞ࡾ㏉ࡋࠖࡸࣃࢱ࣮ࣥࡢࣞࣃ࣮ࢺࣜ ࣮ࢆ㸪๭ྜṇ☜࡟౑࠺ࡇ࡜ࡀ࡛ࡁࡿ㸬 A2 ࠸ࡃࡘ࠿ࡢ༢⣧࡞ᩥἲᵓ㐀ࢆṇࡋࡃ౑ ࠺ࡇ࡜ࡀ࡛ࡁࡿࡀ㸪౫↛࡜ࡋ࡚Ỵࡲࡗ࡚ ≢ࡍᇶᮏⓗ࡞㛫㐪࠸ࡀ࠶ࡿ Ɇ ౛࠼ࡤ㸪 ᫬ไࢆΰྠࡋࡓࡾ㸪ᛶ࣭ᩘ࣭᱁࡞࡝ࡢ୍ ⮴ࢆᛀࢀࡓࡾࡍࡿഴྥࡀ࠶ࡿ㸬ࡋ࠿ࡋ㸪 ᮏேࡀఱࢆゝ࠾࠺࡜ࡋ࡚࠸ࡿࡢ࠿ࡣࡓ ࠸࡚࠸ࡢሙྜ᫂ࡽ࠿࡛࠶ࡿ㸬 A1 Ꮫ⩦῭ࡳࡢࣞࣃ࣮ࢺ࣮ࣜ࡟ࡘ࠸࡚㸪࠸ࡃ ࡘ࠿ࡢ㝈ࡽࢀࡓ༢⣧࡞ᩥἲᵓ㐀ࡸᵓᩥ ࢆ౑࠺ࡇ࡜ࡣ࡛ࡁࡿ㸬 ᪥ᮏேⱥㄒᏛ⩦⪅ࡢࣛ࢖ࢸ࢕ࣥࢢホ౯࡜ᩥἲ ⓗㄗࡾࡢ㛵ಀᛶࢆ᳨ウࡋࡓࡶࡢ࡜ࡋ࡚Kitamura 㸦2011㸧ࡀᏑᅾࡍࡿ㸬Kitamura㸦2011㸧࡛ࡣ㸪 Ỵᐃᮌศᯒࢆ⏝࠸࡚㸪ᩥἲⓗㄗࡾࡢ࢚ࢵࢭ࢖ホ౯ ࡬ࡢᙳ㡪ࢆㄪᰝࡋࡓ㸬ࡑࡢ⤖ᯝ㸪ࢺࣆࢵࢡ࡟ࡼࡗ ࡚ࡣᩥἲⓗㄗࡾࡀ࢚ࢵࢭ࢖ホ౯࡟ᐤ୚ࡍࡿࡇ࡜ ࡀ♧ࡉࢀࡓ㸬ࡋ࠿ࡋ࡞ࡀࡽ㸪ᑐ㇟࡜ࡉࢀࡓᩥἲⓗ

(3)

ㄗࡾࡣ୺ㄒ࡜ືモࡢ୍⮴㸦He have been living there since June.㸧㸪ືモࡢᙧ㸦I can’t skiing well, but… 㸧㸪 ୙ ᏶ ඲ ࡞ ᩥ 㸦 Because people’s interesting thing is not the same.㸧ࡢ୕✀㢮࡛ ࠶ࡗࡓࡓࡵ㸪ᮏ◊✲࡛ࡣ㸪20 ಶࡢᩥἲⓗㄗࡾࢆ ᑐ㇟࡜ࡋ㸪᳨ウࢆヨࡳࡿ㸬

Tang and Liu㸦2005㸧࡛ࡣ㸪ᩥ᭩ศ㢮ࢱࢫࢡ ࡛Ⰻࡃ⏝࠸ࡽࢀࡿ⣲ᛶ㑅ᢥ࡜ࡋ࡚㸪Information Gain㸪Chi-squared test㸪Odds ratio㸪Bi-Nominal Separation ࢆ ᣲ ࡆ ࡚ ࠸ ࡿ 㸬 ᮏ ◊ ✲ ࡛ ࡣ 㸪 Bi-Nominal Separation ࢆ⏝࠸ࡿ㸬

㸱㸬ࢹ࣮ࢱ

ᮏ ◊ ✲ ࡛ ⏝ ࠸ ࡓ ࢹ ࣮ ࢱ ࡣ 㸪Konan-JIEM learner corpus ࡜ ࿧ ࡤ ࢀ ࡿ ゝㄒ㈨ ※ ࡛ ࠶ ࡿ 㸦Nagata, Whittaker, & Sheinman, 2011㸧㸬ࡇ ࡢࢥ࣮ࣃࢫࡣ㸪⏥༡኱Ꮫ࡜ᩍ⫱ ᐃ◊✲ᡤ㸦JIEM㸧 ࡀඹྠ࡛཰㞟ࡋ㸪࢔ࣀࢸ࣮ࢩࣙࣥࢆ⾜ࡗࡓࢥ࣮ࣃ ࢫ࡛࠶ࡿ㸬᪥ᮏேⱥㄒᏛ⩦⪅ࡢ170 ࢚ࢵࢭ࢖࠿ࡽ ᡂࡾ㸪ࡲࡓ㸪ᩥἲㄗࡾ᝟ሗ࡜ရモ㸭ྃ᝟ሗࡀேᡭ ࡛௜୚ࡉࢀ࡚࠸ࡿ㸬◊✲┠ⓗࡢࡓࡵ࡛࠶ࢀࡤゝㄒ㈨※༠఍ࢆ㏻ࡌ࡚㸪㉎ධࡀྍ⬟࡛࠶ࡿ㸬ᴫせࢆ⾲ 3 ࡟㸪࢚࣮ࣛࢱࢢࡢ✀㢮ࢆ⾲ 4 ࡟♧ࡍ㸬⾲-3 ࢹ࣮ࢱࡢᴫせ ࢚ࢵࢭ࢖ࡢᩘ 170 ࢚ࢵࢭ࢖ࢆ᭩࠸ࡓᏛ⏕ࡢᩘ 10 ⥲ᩥᩘ 2409 ⥲༢ㄒᩘ 19285 ␗࡞ࡾㄒᩘ 2054 ᡂ⏣㸦2013㸧࡛ࡣ㸪Konan-JIEM learner corpus ࡢಶࠎࡢ࢚ࢵࢭ࢖࡟᪥ᮏࡢ኱Ꮫ࡛ⱥㄒᩍ ⫱࡟ᚑ஦ࡋ࡚࠸ࡿᩍဨࡀホ౯ࢆ୚࠼࡚࠸ࡿ㸬ᡂ⏣ 㸦2013㸧࡛ࡣࡍ࡭࡚ࡢ࢚ࢵࢭ࢖ࢆ 3 ேࡢᩍဨࡀ ホ౯ࡋ࡚ඃࢀࡓࡶࡢ࡜ࡑ࠺࡛࡞࠸ࡶࡢ࡟ศ㢮ࡋ ࡚࠸ࡿ㸬 ᮏ◊✲ࡣ㸪ᡂ⏣㸦2013㸧ࡢホ౯ࢆ᥇⏝ࡋ㸪 ඲య࡛ 170 ࠶ࡿ࢚ࢵࢭ࢖࠿ࡽࣛࣥࢲ࣒࡟ඃࢀࡓ ࡶࡢ50 ࡜ࡑ࠺࡛࡞࠸ࡶࡢ 50 ࢆ㑅ࡧ 100 ࡢ࢚ࢵ ࢭ࢖ࢆศᯒࡢᑐ㇟࡜ࡋࡓ㸬⾲-4 ࢚࣮ࣛࢱࢢࡢ✀㢮 ࢱࢢ ෆᐜ n_num ྡモ-༢」࢚࣮ࣛ ຍ⟬㸪୙ྍ⟬࢚࣮ࣛ n_lxc ྡモ-ㄒᙡ㑅ᢥ࢚࣮ࣛ n_o ྡモ-ࡑࡢ௚ࡢ࢚࣮ࣛ pn ௦ྡモ࡟㛵ࡍࡿ࢚࣮ࣛ v_agr ືモ-ே⛠࣭ᩘࡢ୙୍⮴ v_tns ືモ-᫬ไ࢚࣮ࣛ v_lxc ືモ-ㄒᙡ㑅ᢥ࢚࣮ࣛ v_o ືモ-ࡑࡢ௚ࡢ࢚࣮ࣛ mo ຓືモ࡟㛵ࡍࡿ࢚࣮ࣛ aj ᙧᐜモ࡟㛵ࡍࡿ࢚࣮ࣛ av ๪モ㸦ྃ㸧࡟㛵ࡍࡿ࢚࣮ࣛ prp ๓⨨モ࡟㛵ࡍࡿ࢚࣮ࣛ at ෙモ࡟㛵ࡍࡿ࢚࣮ࣛ con ᥋⥆モ࡟㛵ࡍࡿ࢚࣮ࣛ rel 㛵ಀモ࡟㛵ࡍࡿ࢚࣮ࣛ itr ␲ၥモ࡟㛵ࡍࡿ࢚࣮ࣛ o_lxc ஧ㄒ௨ୖ࠿ࡽᡂࡿᡂ࡛ྃࡢ ㄒᙡ㑅ᢥ࣑ࢫ ord ㄒ㡰࢚࣮ࣛ uk ≉ᐃ୙⬟࡞࢚࣮ࣛ ᵓᡂୖࡢ⮴࿨ⓗ࡞࣑ࢫ f ࣇࣛࢢ࣓ࣥࢺ㸦᩿∦㸪ᮍ᏶ࡢᩥ➼㸧

㸲㸬ศᯒ

ࡲ ࡎ 㸪 ࢚ ࢵ ࢭ ࢖ ୰ ࡢ ㄗ ࡾ ࡢ Bi-Normal Separation㸦BNS: Forman, 2003㸧ࢆ⟬ฟࡋ ࡓ㸦⾲ 5㸧㸬BNS ࡣ㸪࠶ࡿ஦㇟ࡀ࠶ࡿ࢝ࢸࢦ ࣜ࡟ฟ⌧ࡋࡓ࠿࡝࠺࠿ࡔࡅ࡛࡞ࡃ㸪ࡑࡢ஦㇟ࡀ ఱᅇฟ⌧ࡋࡓ࠿࡜࠸࠺㢖ᗘࢆ⾲ࡋࡓᣦᶆ࡛࠶ ࡿ㸬ᩥἲⓗㄗࡾࡣࡑࡢ✀㢮ࡔࡅ࡛࡞ࡃ㸪ࡑࡢฟ ⌧㢖ᗘࡀ࢚ࢵࢭ࢖ࡢホ౯࡟ᙳ㡪ࢆ୚࠼ࡿ࡜⪃ ࠼ࡽࢀࡿࡓࡵ㸪ᮏ◊✲࡛ᣦᶆ࡜ࡋ࡚᥇⏝ࡋࡓ㸬 ᮏ◊✲࡟࠾࠸࡚BNS ࡣ㸪ඃࢀࡓ࢚ࢵࢭ࢖࡜ࡑ ࠺࡛࡞࠸࢚ࢵࢭ࢖࡟࠾࠸࡚ࡑࢀࡒࢀࡢㄗࡾࡀ ฟ⌧ࡋ࡚࠸ࡿ㔞ࡢᕪࢆ♧ࡍࡇ࡜࡟࡞ࡿ㸬 ฟ⌧㢖ᗘࡀᴟ➃࡟ప࠸㸦10 ௨ୗ㸧ࡢᣦᶆࢆ㝖 ࡃ࡜㸪ඃࢀࡓ࢚ࢵࢭ࢖࡜ࡑ࠺࡛࡞࠸ࡶࡢ࡟࠾࠸ ࡚኱ࡁ࡞್ࢆ♧ࡍࡶࡢࡣv_lxc㸦ືモࡢㄒᙡ࡟ 㛵ࡍࡿ࢚࣮ࣛ㸧࡜ ord㸦ㄒ㡰࡟㛵ࡍࡿ࢚࣮ࣛ㸧 ࡢ2 ✀㢮ࡢᩥἲⓗㄗࡾ࡛࠶ࡗࡓ㸬⾲-5 ࢚࣮ࣛࡢ㢖ᗘ࡜ Bi-Normal Separation ್ ࢱࢢ ฟ⌧㢖ᗘ BNS Good Poor n_num 52 60 0.18 n_lxc 15 8 0.78 n_o 41 63 0.54 pn 33 38 0.18 v_agr 67 68 0.02 v_tns 40 69 0.68 v_lxc 15 49 1.45 v_o 9 7 0.31 mo 22 40 0.74 aj 28 27 0.05 av 79 125 0.57 prp 122 169 0.41 at 17 18 0.07 con 5 5 0.00

(4)

rel 2 0 1.00 itr 6 2 1.35 o_lxc 9 8 0.15 ord 20 60 1.35 uk 2 4 0.86 f 0 4 1.00 ḟ࡟㸪ᩥἲⓗㄗࡾࡢ㢖ᗘࢆண ኚᩘ㸪࢚ࢵࢭ࢖ࡢ ホ౯ࢆᇶ‽ኚᩘ࡜ࡋ࡚ࣟࢪࢫࢸ࢕ࢵࢡᅇᖐࢆ⾜ ࠸㸪ࡍ࡭࡚ࡢᩥἲⓗㄗࡾࢆண ኚᩘ࡜ࡋ࡚ጞࡵ㸪 ࡦ࡜ࡘࡎࡘኚᩘࢆῶࡽࡋ㸪AIC ࡀᨵၿࡋ࡞ࡃ࡞ࡿ ࡲ࡛⧞ࡾ㏉ࡋࡓ㸬᭱⤊ⓗ࡟AIC ࡣ 125.6 ࡜࡞ࡾ㸪 ṧࡗࡓኚᩘࡣ㸪v_lxc㸦ືモࡢㄒᙡ࡟㛵ࡍࡿ࢚ࣛ ࣮㸧࡜ord㸦ㄒ㡰࡟㛵ࡍࡿ࢚࣮ࣛ㸧ࡢ 2 ✀㢮ࡢᩥ ἲⓗㄗࡾ࡛࠶ࡗࡓ㸬௨ୗ࡟ v_lxc㸦ືモࡢㄒᙡ࡟ 㛵ࡍࡿ࢚࣮ࣛ㸧࡜ord㸦ㄒ㡰࡟㛵ࡍࡿ࢚࣮ࣛ㸧ࡢ 㢖ᗘࢆホ౯ู࡟♧ࡍ㸬ᅗ1 ࠾ࡼࡧ 2 ࡀ♧ࡍࡼ࠺࡟㸪ホ౯ࡀGood ࡢ࢚ࢵࢭ࢖ࡢ࡯࡜ࢇ࡝࡟ࡇࢀࡽࡢᩥ ἲⓗㄗࡾࡣྵࡲࢀ࡚࠸࡞࠸୍᪉࡛㸪ホ౯Poor ࡢ ࢚ࢵࢭ࢖ࡢ⣙༙ᩘ࡟ࡇࢀࡽࡢㄗࡾࡀྵࡲࢀ࡚࠸ ࡿ㸬ࡲࡓ㸪ホ౯Good ࡢ࢚ࢵࢭ࢖࡟ࡇࢀࡽ 2 ࡘࡢ ࢚࣮ࣛࡀ3 ࡘ௨ୖྵࡲࢀ࡚࠸ࡿࡇ࡜ࡣ࡞࠿ࡗࡓ㸬 ౛እࡣ࠶ࡿࡀ㸪ࡇࡢഴྥࡣ௚ࡢᩥἲⓗㄗࡾ࡟࠾࠸ ࡚ࡶぢࡽࢀࡓ㸬 ᅗ-1 ホ౯ Good ࡢ v_lxc㸦ᕥ㸧࡜ ord㸦ྑ㸧ࡢ㢖ᗘ ᅗ-2 ホ౯ Poor ࡢ v_lxc㸦ᕥ㸧࡜ ord㸦ྑ㸧ࡢ㢖ᗘ ࡇࡢࡼ࠺࡞ࢹ࣮ࢱࡢഴྥ࠿ࡽ㸪ᩥἲⓗㄗࡾࡢᅇᩘ ࡟ホ౯ࡢቃ⏺⥺ࡀᘬࡅࡿ࡜⪃࠼㸪Ỵᐃᮌศᯒࢆ⏝ ࠸࡚ホ౯ࡢண ࢆヨࡳࡓ㸬Ỵᐃᮌศᯒ࡟ࡼࡿண ⤖ᯝࢆ⾲6 ࡟♧ࡍ㸦㐺ྜ⋡: 0.62㸪෌⌧⋡: 0.81㸪 ≉␗ᗘ: 0.69㸪ṇ☜ᗘ: 0.74㸪F ್: 0.70㸧㸬ᅗ 3 ࡟ Ỵᐃᮌศᯒࡢ⤖ᯝࢆ♧ࡍ㸬ࣟࢪࢫࢸ࢕ࢵࢡᅇᖐศ ᯒࡢ⤖ᯝ࡜ࡣ␗࡞ࡿࡀ㸪ࡇࡢ⤖ᯝࡶ࢚ࢵࢭ࢖ホ౯ ࡟࠾࠸࡚ືモࡢㄒᙡ࡟㛵ࡍࡿ࢚࣮ࣛࡀ㔜せ࡞ᩥ ἲⓗㄗࡾ࡛࠶ࡿࡇ࡜ࢆ♧ࡋ࡚࠸ࡿ㸬⾲-6 Ỵᐃᮌศᯒࡢศ㢮⢭ᗘ ┿ࡢ⤖ᯝ ண ⤖ᯝ Good Poor Good 31 19 Poor 7 43 ᅗ-3 Ỵᐃᮌศᯒࡢ⤖ᯝ ᐇ㝿࡟Ꮫ⩦⪅ࡢ⏘ฟࡋࡓⱥᩥࡢ౛ࢆୗグ࡟♧ ࡍ㸬

Last, I want to earn much money and I want to <v_lxc crr="travel">trip</v_lxc> <ord crr="everywhere in Japan">in Japan everywhere</ord>. ୖグࡢࡼ࠺࡟㸪ືモ travel ࢆ౑ࢃ࡞ࡅࢀࡤ࡞ ࡽ࡞࠸⟠ᡤ࡟ྡモ trip ࡀ᭩࠿ࢀ࡚࠸ࡿ㸬ࡇ࠺ ࠸ࡗࡓືモࢆ᭩࠿࡞ࡅࢀࡤ࡞ࡽ࡞࠸⟠ᡤ࡟ྡ モࢆグࡋ࡚ࡋࡲ࠺ࢱ࢖ࣉࡢㄗࡾࡢ⏘ฟࡣ v_lxc 㸦ືモࡢㄒᙡ࡟㛵ࡍࡿ࢚࣮ࣛ㸧࡟ከࡃ㸪ࡇ࠺ࡋ ࡓㄗࡾࡣసᩥࡢホ౯࡟ᐤ୚ࡋࡸࡍ࠸࡜࠸࠺ࡇ ࡜ࢆᩍᖌࡣᩍᐊ࡛ᩍ࠼ࡿᚲせࡀ࠶ࡿ࡜⪃࠼ࡽ ࢀࡿ㸬

(5)

㸳㸬⪃ᐹ

ᮏ◊✲࡛ࡣ㸪BNS㸪ࣟࢪࢫࢸ࢕ࢵࢡᅇᖐศᯒ㸪 Ỵᐃᮌศᯒࢆ⏝࠸࡚㸪ⱥㄒᏛ⩦⪅ࡢ࢚ࢵࢭ࢖࡟ᑐ ࡍࡿホ౯࡟୚࠼ࡿᩥἲⓗㄗࡾࡢᙳ㡪ࢆㄪᰝࡋࡓ㸬 ࡑࢀࡒࢀࡢศᯒ࡟࠾࠸࡚␗࡞ࡿ⤖ᯝࡀᚓࡽࢀࡓ㸬 ඹ㏻ࡍࡿ⤖ᯝ࡜ࡋ࡚ࡣ v_lxc㸦ືモࡢㄒᙡ࡟㛵ࡍ ࡿ࢚࣮ࣛ㸧ࡀඃࢀࡓ࢚ࢵࢭ࢖࡜ࡑ࠺࡛࡞࠸ࡶࡢࢆ ศࡅࡿୖ࡛㔜せ࡞ᩥἲⓗㄗࡾ࡛࠶ࡿࡇ࡜ࡀ♧ࡉ ࢀࡓ㸬 Ỵᐃᮌศᯒ࡛ࡣ㸪カ⦎ࢹ࣮ࢱࡢࡳࡢ⤖ᯝ࡛࠶ࡿ ࡀ㸪4 ✀㢮ࡢᩥἲⓗㄗࡾࡢࡳࢆ⏝࠸࡚࠿࡞ࡾ㧗࠸ ⢭ᗘ࡛ホ౯ࢆண ࡛ࡁࡿࡇ࡜ࡀ♧ࡉࢀࡓ㸬ᡂ⏣ 㸦2013㸧࡟ࡼࡿホ౯ࡣᩥἲⓗㄗࡾ࡟↔Ⅼࢆᙜ࡚ ࡓホ౯࡛ࡣ࡞ࡃ㸪࢚ࢵࢭ࢖඲యࡢホ౯࡛࠶ࡿ㸬ࡇ ࡢホ౯ࢆᩥἲⓗㄗࡾࡢࡳࢆ⏝࠸࡚ண ࡀ࡛ࡁࡿ ࡜࠸࠺ࡇ࡜ࡣ㸪࢚ࢵࢭ࢖ホ౯࡟࠾ࡅࡿᩥἲⓗㄗࡾ ࡢ㔜せᗘࡀ㧗࠸࡜ゎ㔘࡛ࡁࡿ㸬 ᮏ◊✲⤖ᯝࡣ㸪ⱥㄒᏛ⩦⪅ࡢࣛ࢖ࢸ࢕ࣥࢢᣦᑟ ࡸ⮬ືホ౯࡟࠾࠸࡚㔜せ࡞♧၀ࢆྵࡴ㸬ࣛ࢖ࢸ࢕ ࣥࢢᣦᑟ࡟࠾࠸࡚ࡣ㸪ືモ࡟㛵ࡍࡿ▱㆑ࡀࣛ࢖ࢸ ࢕ࣥࢢࡢホ౯࡟኱ࡁࡃᐤ୚ࡍࡿࡇ࡜࠿ࡽ㸪ᩍᖌࡣ㸪 ࡑ࠺࠸ࡗࡓ▱㆑࡟ࡘ࠸࡚ࡣ㸪㔜Ⅼⓗ࡟ᣦᑟࢆࡍࡿ ࡇ࡜ࡀồࡵࡽࢀࡿ㸬ࡲࡓ⮬ື᥇Ⅼ࡟㛵ࡍࡿ◊✲㛤 Ⓨ࡟࠾࠸࡚ࡣ㸪ྃᵓ㐀࡞࡝ࡢ⤫ㄒゎᯒࡸn-gram ࢆ⏝࠸࡚ㄒ㡰࡟㛵ࡍࡿ࢚࣮ࣛࢆ᳨ฟࡋ㸪௚ࡢㄗࡾ ࡟ẚ࡭࡚㔜ࡳ࡙ࡅࡍࡿᚲせࡀ࠶ࡿࡇ࡜ࡀ♧၀ࡉ ࢀࡓ㸬

ཧ⪃ᩥ⊩

1) Burstein, J., Chodorow, M., & Leacock, C. Automated essay evaluation: The Criterion online writing service. AI Magazine, 25(3), pp.27–36. (2004)

2) Forman, G. An extensive empirical study of feature selection metrics for text classification. The Journal of machine learning research, 3, pp.1289-1305. (2003)

3) Jones, E. ACCUPLACER's essay-scoring technology: When reliability does not equal validity. In P. F. Ericsson & R. Haswell (Eds.), Machine scoring of student essays. (pp. 93-113). Logan, UT: Utah State University Press. (2006)

4) Kitamura, M. Influence of Japanese EFL Learner Errors on Essay Evaluation. Annual Review of English Language Education in Japan, Vol.22, pp.169-184. (2011)

5) Nagata, R. Whittaker, E., & Sheinman, V. Creating a manually error-tagged and shallow-parsed performance learner corpus. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp.1210-1219. (2011)

6) ᡂ⏣┿⃈. Konan-JIEM Ꮫ⩦⪅ࢥ࣮ࣃࢫ࡟࠾ ࡅࡿྡモᚋ⨨ಟ㣭せ⣲ࡢศᯒ. ᮾிᅜ㝿኱Ꮫㄽ ྀɆゝㄒࢥ࣑ࣗࢽࢣ࣮ࢩࣙࣥᏛ㒊⦅.Vol.9,pp. 1-12. (2013)

7) Tang, L. & Liu, H. Bias analysis in text classification for highly skewed data. ICDM ’05: Proceedings of the Fifth IEEE International Conference on Data Mining, pp. 781–784. (2005)

(6)

㈨ᩱ-1 Ꮫ⩦⪅ࡢ࢚ࢵࢭ࢖ࡢ౛㸦University Life㸧

My university life is very interesting. Because I <v_lxc crr="do">act</v_lxc> many things <prp crr="">since</prp> now. First I <uk crr="am a member of">join</uk> <at crr="a"></at> cercle. I feel <ord crr="very good about this"><prp crr="about"></prp> this very good</ord>. <uk crr="I kill time by">My killing time is</uk> writing <n_num crr="novels">novel</n_num> or drawing <n_num crr="pictures">picture</n_num>. <uk>This has many people like me</uk>.So I concentrate <prp crr="on"></prp> this. Second is summer vacation. I <v_tns crr="did"><v_lxc crr="do">act</v_lxc></v_tns> many <n_num crr="things">thing</n_num> in <pn crr="my"></pn> summer vacation. My best memory is <at crr="the"></at> seminar on the sea. I went to Ho-chi-min and Singapore. I got many friends <prp crr="from">around</prp> Hyogo university. And I <av crr="sometimes">sometime</av> meet <pn crr="them">friends</pn>. Last I have many friends <prp crr="from">in</prp> high school, junior high school and <aj crr="other">etc</aj> <n_num crr="groups">group</n_num>. We always talk about each <n_o crr="other's">other</n_o> <n_num crr="lives">life</n_num> <prp crr="by">in</prp> e-mail or internet. And We play <prp crr="">in</prp> inside or outside home. We play funny <n_num crr="games">game</n_num>. For example, <n_lxc crr="one of us">a friend</n_lxc> <v_agr crr="calls">call</v_agr> <prp crr="">in</prp> Macdonald <con crr="and"></con> <v_lxc crr="says"></v_lxc> "Please give me <at crr="a"></at> hundred <n_num crr="hamburgers">hunbergar</n_num>.“ And others look <prp crr="at"></prp> him and laugh. I have many friends, so my university life is very interesting.

㈨ᩱ-2 ࢚࣮ࣛࢱࢢࡢ౛

ࢱࢢ ౛ᩥ

n_num This is the only one <n_num crr="thing"> things</n_num> you have to do. n_lxc She listened to his <n_lxc crr="speech">speak</n_lxc>.

n_o I went to <n_o crr="Nihonbashi in Osaka">Osaka Nihonbashi</n_o>. pn I took Martin and a frien of<pn crr=”his”>him</pn> to the park.

v_agr The number of students who work part-time after school <v_agr crr="has been increasing"> have been increasing</v_agr>.

v_tns I'll make researvations for the ferry as soon as I <v_tns crr="find">will find</v_tns> out the schedule.

v_lxc He wanted to <v_lxc crr="conceal">cancel</v_lxc> his guilt.

v_o If it <v_o crr="is forgotten"><v_agr crr="forgets">forget</v_agr></v_o>, plants are going to die.

mo “The phone is ringing.” “I <mo crr=”will”>’m going to </mo>answer it.” aj It was a <aj crr=”genuine”>genius</aj>diamond.

av He worked <av crr=”hard”>hardly</av>today. prp He took full advantage<prp crr=”of”>with</prp>his position.

at She is active in <at crr=”the”>a</at> development of low cost water pumps. con Clint hit a home run, <con crr=”but”>and </con> I didn’t.

rel I phoned all his friends, none of <rel crr=”whom”>who</rel> could tell me where he was. itr <itr crr=”Which”>What</itr> would you like to eat, Japanese or Chinese food? o_lxc He <o_lxc crr=”made an attempt”>had an attempt</o_lxc> at the conquest of the peak.

ord When did you buy that <ord crr=”large old brown wooden”>old brown large wooden</ord>table?

uk <uk crr=”X”>…</uk>

In case of the UK tag, correction 㸦crr=”X”㸧 is not annotated in the tag. f <f>The last day,</f>