The 28th Annual Conference of the Japanese Society for Artificial Intelligence, 2014
1H3-NFC-02b-5
ར༻ऀͷߘ׆ಈʹجͮ͘ఆྔԽख๏ͷ
ࡂใࢧԉγεςϜʹ͚ͯͷల
Current status and prospects of Twitter users’ quantification method based on their posting
activity for constructing disaster information support system
∗1
দຊ ৻ฏ
Shimpei Matsumoto∗2
ޱ େو
Hiroki Kawaguchi∗3
ௗւ ෆೋ
Fujio Toriumi∗1
ౡۀେֶใֶ෦
Faculty of Applied Information Science, Hiroshima Institute of Technology
∗2
ౡۀେֶେֶӃֶܥݚڀՊ
Graduate School of Science and Technology, Hiroshima Institute of Technology
∗3
౦ژେֶେֶӃֶܥݚڀՊ
Graduate School of Engineering, The University of TokyoAt the time of the Great East Japan Earthquake, many Tweets of the disaster had posted and Twitter had been effectively-utilized as an infrastructure for sharing disaster information and confirming safety. From now the authors have been addressed the researches to utilize Twitter for disaster under the importance of user classification. Concretely by focusing on Twitter user’s tweeting, replying, and retweeting activities which is assumed to be the source of Twitter’s real time feature, and by numerically-expressing each Twitter user’s activities with a quantification method based on entropy, the Twitter users’ tendency under the disaster and the possibility for user filtering have been examined. This paper shows the summary of our research results previously reported, and expresses the prospects of the quantification method for constructing disaster information support system.
1.
͡Ίʹ
౦ຊେࡂൃੜ࣌ɼTwitterوॏͳใަखஈͱͯ͠
ੵۃతʹ׆༻͞ΕॏཁͳׂΛՌͨ͠ɼ·ͨར༻ऀͷใఏ
ڙऩूʹߩݙͨ͠[ௗւ14]ɽࡂɼଟ͘ͷਓʑ͕҆൱
֬ೝใऩूͳͲͷతͰ༷ʑͳ௨৴खஈͷར༻ΛࢼΈ͍ͯ ͨɽ௨৴ճઢͷඃఀిɼτϥϑΟοΫ૿௨৴ن੍ͷͨΊ
ʹిϝʔϧ͕ܨ͕Γʹ͍͘ঢ়گͱͳͬͨҰํͰɼWebӾ
ཡͷύέοτ௨৴ൺֱతར༻Ͱ͖Δঢ়گͰ͋ͬͨͨΊɼଟ͘ ͷਓ͕TwitterΛར༻͠ࡂʹؔ͢ΔେྔͳใΛଈ࣌ऩू ͢Δ͜ͱ͕Ͱ͖ͨɽ·ͨɼճઢʹେ͖ͳෛ୲Λ͔͚Δ͜ͱͳ͘ ใ֦ࢄ͕ՄೳͰ͋ͬͨɽଟ͘Ոਓͷ҆൱ͷ֬ೝɼඃ ࡂҬͷҩࢣʹΑΔҩྍ૬ஊɼߦػؔʹΑΔใൃ৴ͳͲͰ ͋ΓɼϚεϝσΟΞʹཔΒͳ͍ใ֦ࢄΛతͱͯ͠ɼඃࡂ
֎ͷํ࣏ࣗମݸਓʹΑͬͯ׆༻͞Ε͍ͯͨ[ࠤʑ11]ɽ
ஶऀΒ͜Ε·Ͱɼࡂ࣌ʹ͓͚ΔTwitterͷ༗ޮͳ׆༻
๏ͷߏஙΛࢦͯ͠ݚڀΛਐΊ͍ͯΔɽࡂ࣌ʹ͓͚Δར༻ऀ
ྨͷॏཁੑΛഎܠʹɼTwitterͷϦΞϧλΠϜੑͷݯઘͰ͋
Δͱߟ͑ΒΕΔར༻ऀͷߘɾฦ৴ɾҾ༻ͷ׆ಈʹணͯ͠ɼ ࡂ࣌ͷTwitterͷΘΕํͷੳར༻ऀͷଐੑʹԠͨ͡ಛ ͷௐࠪΛਐΊ͍ͯΔɽ۩ମతʹɼ·ͣɼใྔͷߟ͑ํʹ
ج͍ͮͯաڈͷߘ׆ಈΛఆྔԽ͢Δ͜ͱʹΑΓ[Ghosh 11]
֤ར༻ऀΛଟมྔఆྔԽ͠ɼTwitterར༻ऀͷ׆ಈΛఆྔతʹ
ղऍ͢ΔͨΊͷख๏ͷߏஙΛߦͬͨɽ࣍ʹɼ౦ຊେࡂલޙ ʹTwitter্ʹ࣮ࡍʹྲྀ௨ͨ͠ߘΛରʹ࣮ݧΛߦ͍ɼར ༻ऀଐੑʹԠͨ͡ಛΛੳ͢Δͱڞʹɼར༻ऀͷࣗಈผʹ ͚ͯͷՄೳੑΛݕূͨ͠ɽຊߘͰɼ͜Ε·ͰஶऀΒ͕ਐΊ ͖ͯͨऔͷ֓ཁΛड़ΔͱڞʹɼࡂใγεςϜߏஙʹ ͚ͯͷࠓޙͷలΛड़Δɽ
࿈བྷઌ:দຊ৻ฏɼౡۀେֶใֶ෦తใγεςϜֶ
Պɼ˟731-5193ౡࢢࠤഢ۠ࡾ2-1-1, E-Mail: [email protected]
2.
ιʔγϟϧϝσΟΞͷࡂ࣌׆༻ʹ͚ͯ
ͷपลಈ
౦ຊେࡂ࣌TwitterͰඃࡂͷঢ়گΛΔਓؒͷ
ߘͳͲϚεϝσΟΞ͕ใ͡ͳ͍وॏͳใ͕ଟ͘ྲྀ௨͍ͯ͠ ͕ͨɼࡂʹؔ͢ΔใΛޮతʹ֫ಘ͢Δٕज़͕ඞཁͰ͋
Δͱߟ͑ΒΕΔɽେنࡂ࣌ʹ͓͚ΔSNSʹΑΔۓٸ௨ใ
ͷՄೳੑʹؔ͢Δݕ౼ձͰɼࡂ࣌ʹ͓͍ͯԻ௨ใ్͕
ઈ͑ͨ߹ʹSNSͳͲͷใΛ׆༻ͨ͠ۓٸ௨ใͷՄೳੑʹ
͍ͭͯใࠂ͠ɼٕज़త͋Δ͍ӡ༻্ͷ՝ɼSNSͷ׆༻ํ
๏ͷࡏΓํΛݕ౼͠·ͱΊ͍ͯΔ[૯লফி13]ɽTwitter
্ʹ༷ʑͳछྨͷେͳྔͷߘ͕ৗʹྲྀ௨͍ͯ͠ΔͨΊɼ
ࡂ࣌ʹ͓͍ͯTwitterΛԁʹར༻Ͱ͖ΔΑ͏ʹ͢ΔͨΊ
ʹɼతʹԠͯ͡ߘΛత֬ʹࣗಈผ͢ΔΈ͕ඞཁͰ ͋Δͱड़͍ͯΔɽ·ͨɼൃ৴ऀͷҐஔͱࡂͷద߹ੑɼใ ͷ৴པੑ্Λ՝ͱ͍ͯ͋͛ͯ͠Δɽಉ༷ʹɼௗւΒͷऔΓ
ΈͰ[ௗւ14]ɼใॲཧʹؔ͢Δ՝ͱͯ͠ɼใͷ࣌
ؒతɾۭؒతͳ֬ྼԽͷରԠɼใͷ৴པੑอূɼใͷ ෆɾܽམͷରԠɼେنॲཧΛ͍͋͛ͯΔɽ͜ͷதͰ৴པ ੑΛ֬อ͢ΔͨΊͷٕज़՝ͱͯ͠ɼϊΠζআڈɼใ౷߹ɼ σϚͷൃݟͱࢭɼγϛϡϨʔγϣϯʹΑΔ֬ͷݕূ͕ࣔ͞ Ε͍ͯΔ͕ɼࡂ࣌ʹ͓͚ΔιʔγϟϧϝσΟΞΛରͱͨ͠ ଟ͘ͷݚڀɼҎ্Ͱࣔ͞ΕͨϑϨʔϜϫʔΫͷͱͰߦ͞ Ε͍ͯΔɽ
3.
ஶऀΒͷऔΓΈͷҐஔ͚
ࡂ࣌ʹ͓͍ͯॏཁͱͳΔใɼࡂʹڧؔ͘͢Δར ༻ऀඃࡂऀࣗʹΑͬͯൃ৴͞ΕΔ߹͕ଟ͍ɽ͜ΕΒར༻ ऀͷૌ͑Λత֬ʹநग़͢ΔͨΊʹɼਓ͕ൃ৴ͨ͠ใͷࣝผ ར༻ऀͷଐੑʹԠͨ͡બผɼ͋Δ͍ࣗಈߘϓϩάϥϜ
(bot)ʹΑΓൃ৴͞ΕΔߘͷݕ͕ॏཁͳ՝Ͱ͋Δͱߟ͑
ΒΕΔɽ·ͨɼޡใͷ֦ࢄɼීஈͷTwitterʹݟΒΕ
The 28th Annual Conference of the Japanese Society for Artificial Intelligence, 2014
ͳ͍ಛผͳ͍ํΛͨ͠ར༻ऀɼଞͷϦιʔεΛࣗಈҾ༻
͢Δbot͕ݪҼͷͻͱͭͰ͋ͬͨͱߟ͑ΒΕ͍ͯΔɽҎ্Α
Γɼbotͷݕग़ͷΈͳΒͣར༻ऀͷ׆ಈܗଶͷѲɼޡใ
ରԠ͢ΔͨΊͷॏཁͳ՝Ͱ͋Δͱߟ͑ΒΕΔɽࡂ࣌ͷ
Twitterར༻ऀͷಛΛܭࢉՄೳͳܗͰ༰қʹѲͰ͖Εɼ
botͷݕɾআ֎ɼ·ͨɼࣄ࣮Λ͍ͬͯΔඃࡂऀͷͷऔಘ
ʹ׆༻ՄೳͰ͋Δͱߟ͑ΒΕΔɽͦͯͦ͠ͷ݁Ռͱͯ͠ɼޡ ใͷగਖ਼ɾޡใ֦ࢄͷ੍ʹ͚ͯͷ׆༻ɼඃࡂͨ͠Ҭ ͷਓୡͷੜͷͷऔಘ͕ظͰ͖Δɽར༻ऀͷબผɼͱΓΘ͚
botݕग़ʹண͢Εɼͦͷٕज़ͱͯ͠ɼߘ༰ʹج͍ͮͨ
ػցతϑΟϧλϦϯά͕ߟ͑ΒΕΔɽ͔͠͠ͳ͕Βɼߘ༰ ʹج͍ͮͨใબผैདྷ੩తͳใΛରʹ͓ͯ͠Γɼ͞Β ʹɼ͜ΕΒݱࡏٕज़తʹख़ͷҬʹ͋ΔɽιʔγϟϧϝσΟ Ξͷ࣮࣌ؒੑʹద߹ͨ͠ใબผख๏Λߏங͢ΔͨΊʹɼ࣭ తख๏Ҏ֎͔Βͷ؍ʹج͍ͮͨܭࢉखॱ͕ඞཁͰ͋Δͱߟ͑ ΒΕ͍ͯΔɽҎ্എܠͷͱͰɼஶऀΒར༻ऀͷߘ׆ಈʹ ج͍ͮͨఆྔԽख๏ʹண؟͠ɼଟมྔʹΑͬͯදݱ͞Εͨࡂ ࣌ʹ͓͚Δར༻ऀͷఆྔղऍͱͦͷੳɼར༻ऀྨʹ͚ͯ
ͷՄೳੑݕূʹऔΓΜͰ͖ͨ[ޱ13a]ɽ
4.
͜Ε·ͰͷՌ
Twitterར༻ऀͷఆྔԽख๏ɼॲཧΛىͱͯ͠લ
Λղੳରظؒͱ͠ɼظؒͷར༻ऀͷ௨ৗߘɼRetweetɼ
Replyͷ׆ಈύλʔϯ͔ΒΤϯτϩϐʔΛࢉग़͢Δɽ·ͣɼຊ
ݚڀͷख๏ͷجૅͱͳͬͨGhoshΒͷख๏ͱͷൺֱΛߦͬͨɽ
GhoshΒͷख๏ͰɼपғͷRetweet׆ಈΛੳରͱͯ͠ ͍ΔɽҰํɼຊݚڀͰࡂ࣌ͷ׆༻͕తͰ͋Γɼࣄྫʹର ͢Δଈ࣌ੑΛอͭ͜ͱͷॏཁੑΛ౿·͑ɼΤϯτϩϐʔͷࢉग़ ʹ͔͔ΔܭࢉྔΛߟྀͯ͠ར༻ऀ͕ߘͨ͠શߘͷΈΛର ͱͨ͠ɽ࣮ݧͷ݁Ռɼରσʔλͷ࣭͕େ͖͘ҟͳΔʹؔ ΘΒͣɼ྆ख๏ͱͷࠩҧ͋Δͷͷར༻ऀͷಛ͚ ͕֓ͶՄೳͰ͋Δ͜ͱ͕֬ೝ͞Εͨɽ·ͨɼࡂ࣌ʹ͓͚Δࠃ ͷTwitterใʹ͓͍ͯGhoshΒ͕ࣔͨ͠ಛͱಉ༷ͷ ͕͋ΓɼఆྔԽʹΑΔྨ͕ՄೳͰ͋Δ͜ͱ͔ΒΤϯτϩ ϐʔʹΑΔఆྔԽख๏ීวੑΛ࣋ͭ͜ͱ͕ࣔࠦ͞Εͨɽ
࣍ʹɼղੳରظؒͷ͕͞Τϯτϩϐʔ࣌ܥྻʹ༩͑ΔӨ ڹΛੳͨ͠ɽͦͷ݁ՌɼղੳରظؒΛ࠷͘ઃఆͨ͠ ߹ʹಥൃతࣄͷӨڹ͕࠷ݦஶʹදΕ͍ͯΔ͜ͱɼҰํͰɼ ੳظ͕͍ؒ΄ͲΤϯτϩϐʔͷͷมಈͷԠϘϥςΟ ϦςΟ͕খ͘͞ͳΔ͜ͱΛ໌Β͔ʹͨ͠ɽಥൃతࣄͷݕग़ͱ ར༻ऀଐੑͷྨ૬ͷؔʹ͋Δ͜ͱΛ౿·͑ͯɼಥൃత ࣄͷݕग़ͱར༻ऀଐੑͷྨΛಉ࣌ʹߦ͏ͨΊͷख๏ΛఏҊ ͨ͠ɽ͜͜ͰଞʹɼΤϯτϩϐʔͷִ࣌ؒؒΛ༷ʑͳ؍͔ Β༩͑Δ͜ͱʹΑΓɼैདྷ๏ͷಛΛࣦ͏͜ͱͳ࣍͘ݩΛ֦ு ͢Δ͜ͱʹޭͨ͠ɽ
Ҏ্ͷऔΓΈͷதͰఏҊख๏ͷࣗಈผͷԠ༻Մೳ ੑΛݕূ͓ͯ͠Γɼ࣮ݧͷ݁Ռ͔ΒɼଟมྔఆྔԽʹΑΓར ༻ऀଐੑʹԠͨࠩ͡ҧΛදݱͰ͖͔ͨΓͰͳ͘ɼࣗಈผͷ Ԡ༻ՄೳੑΛࣔ͢͜ͱ͕Ͱ͖ͨɽ
5.
·ͱΊͱࠓޙͷల
ຊߘͰɼ͜Ε·ͰஶऀΒ͕ਐΊ͖ͯͨऔͷ֓ཁΛड़ ͨɽࡂใγεςϜߏஙʹ͚ͯͷࠓޙͷలͱͯ͠ɼଟม ྔͰදݱ͞Εͨར༻ऀͷ͔Βར༻ऀಉ࢜ͷڑΛࢉग़͠ɼ͜
ͷΛ׆༻ͨ͠WebγεςϜͷ։ൃͱԠ༻Λߟ͍͑ͯΔɽ۩
ମతʹɼߘ׆ಈͱߘཤྺͷؔੑΛಉ࣌ʹѲ͢Δͨ ΊɼΤϯτϩϐʔͷ؍͔Βߘ׆ಈͷ͍ۙෳར༻ऀͷߘ
ਤ1: λάΫϥυγεςϜͷҰྫ
༰ΛλάΫϥυͰදݱ͢ΔγεςϜͷߏஙΛݕ౼͍ͯ͠Δ
(ਤ1ࢀর)ɽ·ͣɼଟ༷ͳߘ༰ͷར༻ऀΛผ͢ΔͨΊɼ
༧Ί༻ҙ͞ΕͨࡂΩʔϫʔυͷॏෳΛJaccardΛ༻
͍ͯར༻ऀ͝ͱʹಛ͚Δɽࣅͨߘ׆ಈͷར༻ऀಉ࢜͋ ΔఔͷڞىੑΛ͕࣋ͭ໌֬ʹߘ༰͕ॏෳ͍ͯ͠ͳ͍ Λ౿·͑ɼपғͷڑͷ͍ར༻ऀ͔Βऩूͨ͠ςΩετσʔ λΛ༻͍ͯλάΫϥυΛੜ͢Δɽλάपลͷར༻ऀͷ ߘཤྺ͔Βࢺใ໊͕ࢺͰ͋ΔͷΛରͱ͢ΔɽҎ্ͷৄ ࡉൃදͰࣔ͢ɽ
ࢀߟจݙ
[Ghosh 11] Ghosh et al.: Entropy-based Classification of ‘Retweeting’ Activity on Twitter, Proc. of KDD workshop on Social Network Analysis (SNA-KDD), http://arxiv.org/pdf/1106.0346.pdf (2011).
[ޱ13a] ޱ ଞ: Twitterར༻ऀͷఆྔԽख๏ͷࡂ࣌׆
༻ʹ͚ͯͷ༗ޮੑධՁ,ୈ15ճIEEEౡࢧ෦ֶੜγ
ϯϙδϜจू, pp.474-477 (2013).
[ࠤʑ11] ࠤʑ: ֦େΛଓ͚ΔTwitterͷࡂʹ͓͚Δ׆༂ ͱࠓޙͷ՝ɼAD STUDIES, Vol.36, pp.20-24 (2011).
[૯লফி13] ૯লফி: େنࡂ࣌ʹ͓͚Δ
ι ʔ γϟϧɾωοτ ϫ ʔ Ω ϯ άɾα ʔ Ϗ ε ʹ Α Δ ۓ ٸ ௨ ใ ͷ ׆ ༻ Մ ೳ ੑ ʹ ؔ ͢ Δ ݕ ౼ ձ ใ ࠂ ॻ (2013)ɼ
http://www.fdma.go.jp/neuter/topics/houdou/h25/ 2503/250327 1houdou/02 houkokusho.pdf,
2013/12/10ࢀর
[ௗւ14] ௗւ ଞɼҟछڠௐܕࡂใࢧԉγεςϜ࣮ݱʹ
͚ͨج൫ٕज़ͷߏஙɼਓೳֶձจࢽɼVol.29ɼNo.1ɼ
pp.113-119 (2014)ɽ