統計的言語モデルにおける確率的潜在意味解析の学習初期化手法の一検討
全文
(2) Vol.2013-SLP-97 No.6 2013/7/26. ใॲཧֶձڀݚใࠂ IPSJ SIG Technical Report. ͚Λ༻͍ͨదԠ [10] ͳͲͷൃల͕ڀݚଘࡏ͢Δɽ͠ ͔͠ͳ͕Βɼͦͷੑೳ͕౷ֶܭशͷࡍʹ༩͑Δॳظʹڧ ͘ґଘ͢Δ͜ͱͳͲɼ͍͔ͭ͘ͷ͕ଘࡏ͍ͯ͠Δɽ. 2. ୯ޠͷසʹͮ͘جϞσϧ จ຺ͳͲͷޠݴతͳใΛ༻͍ͯൃ͞ΕΔԻ ͷจܕޠኮΛߜΓࠐΈɼೝࣝੑೳΛ্͛Δ͜ͱ͕ɼԻ ೝࣝʹ͓͚ΔޠݴϞσϧͷॏཁͳׂͰ͋ΔɽԿΒ͔ͷ Λઃఆ͠ɼఆ͞ΕΔޠኮΛ੍͢ݶΔख๏ʹ͓͍ͯɼ ͷछྨɼͦΕʹରԠ͢Δޠኮू߹ΛਓखͰઃఆ͢Δͷ ࠔͰ͋Δɽ͜ͷΛղܾ͢ΔͨΊʹɼͦΕΒͷཁ ૉΛࣗಈతʹܾఆ͢ΔΑ͏ͳख๏͕·ΕΔɽ ຊઅͰɼͦͷΑ͏ͳख๏ͷҰͭͰ͋Δ֬తજࡏҙຯ ղੳͷ౷ֶܭशʹ༻͍Δ EM ΞϧΰϦζϜʹ͓͍ͯɼॳظ ͷ༩͑ํɼಛʹֶशͷࡍʹ༩͑Δॳظʹ·ؚΕΔ 0 ͷ ༗ແʹΑΔ౷ܭతޠݴϞσϧͱͯ͠ͷ֬తજࡏҙຯղੳ ͷৼΔ͍Λ͠؍ɼੑೳॳظґଘੑͷมԽΛଊ͑Δɽ. ਤ 1 PLSA ͷ֓೦ਤ. l(θ; N ) =. w∈W d∈D. . P (w|z)P (z|d). (2). z∈Z. ͜͜Ͱ n(d, w) จॻ d ʹ͓͚Δ୯ ޠw ͷग़ݱճΛ ද͢ɽ. 2.1 ֬తજࡏҙຯղੳͷΈ ֬తજࡏҙຯղੳ ( Probabilistic Latent Semantic. Analysis ɼҎԼ PLSA) [6] ͱɼֶशσʔλ͔ΒಘΒΕ Δ୯ޠͷग़ݱසΛʹجɼΛϞσϧԽ͢Δख๏Ͱ͋ ΔɽPLSA ͕ k-means ๏ [11], [12] ͳͲͷΛϞσϧԽ ͢Δख๏ͱҧ͍ͬͯΔͷɼෳͷ͕ೖΓࠞͬͨ͡ Α͏ͳɼෳࡶͳʹରͯ͠ޮՌΛൃ͢شΔͱ͜ΖͰ ͋ΔɽPLSA Ͱɼ෦ʹ͝ͱʹͦͷಛΛөͨ͠. unigram Ϟσϧʢ୯ޠग़֬ݱϕΫτϧʣΛ࣋ͪɼͦΕΒ. ౷ֶܭशʹ Tempered EM ΞϧΰϦζϜ (ҎԼ T-EM) ͱ͍͏෮ֶश๏Λ༻͍ΔɽT-EM ʹ༻͍ΔࣜҎԼͷࣜ. (3) ∼ (6) ͱͳΔɽ E-Step: P (k) (z|d, w) = . ʹର͠࠷దԽ͞Εͨ unigram ΛಘΔ͜ͱ͕Ͱ͖Δɽ(ਤ 1ʣ. PLSA ʹ͓͚Δ h Λөͨ͠୯ ޠw ͷग़ݱස. (1). z∈Z. ͜͜ͰɼP (w|z) unigram z ͕୯ ޠw ʹରͯ͠༩. {P (k) (z)P (k) (d|z)P (k) (w|z)}β (3) (k) (z)P (k) (d|z)P (k) (w|z)}β z∈Z {P. M-Step: P. (k+1). (w|z) = . ͷ unigram Λదʹࠞ߹͢Δ͜ͱʹΑΓɼతͷ. P (w|h) ҎԼͷࣜ (1) Ͱ༩͑ΒΕΔɽ P (w|h) = P (w|z)P (z|h). n(d, w) log. . n(d, w)P (k) (z|w, d) (4) (k) (z|w, d)} d∈D n(d, w)P. d∈D. w∈W {. . n(d, w)P (k) (z|w, d) (5) (k) (z|w, d)} d∈D { w∈W n(d, w)P n(d, w)P (k) (z|d, w) w∈W (k+1) d∈D P (z|d, w) = (6) w∈W d∈D n(d, w). P. (k+1). (d|z) = . w∈W. E-Step ͱ M-Step Λަ܁ʹޓΓฦ͢͜ͱͰࣜ (2) Λ࠷େ Խ͢ΔϞσϧΛੜ͢Δ͜ͱ͕Ͱ͖Δɽ. ͑Δ֬ͱͳΔɽଞํɼP (w|h) తͷ h ʹରͯ͠ ࠷దͳ unigram ͷࠞ߹ൺͰ͋Δɽ. 2.3 PLSA Λ༻͍ͨಉఆ. 2.2 PLSA ʹΑΔू߹͓Αͼ unigram ͷֶश. ͷࠞ߹Λߦ͏ɽ͜͜Ͱతจॻͱɼྫ͑ɼલʹൃ. PLSA ͰʹॏΈ͚Λߦ͍ɼతจॻʹଈͨ͠ ܗ. ͞ΕͨԻͷೝࣝ݁Ռɼ͋Δ͍ɼࠓ͔Βೝࣝ͠Α͏ͱ. PLSA ෦ύϥϝʔλͱͯ͠ z ʹ͓͚Δ୯ ޠw ͷ. ͍ͯ͠ΔԻͷೝࣝީิͰ͋Δɽ͜ΕΒͷจॻ͕ PLSA ͷ. ग़ݱසΛද͢ P (w|z) ͱɼจॻ d ʹ͓͚Δ unigram. แ͢ΔͲͷʹͲΕ͚ͩଐ͢Δ͔Λɼ unigram ͷ. ͷࠞ߹ൺΛද͍ͯ͠Δ P (z|d) Λ࣋ͭɽP (z|d) P (z) ٴ. ࠞ߹ൺͱͯ͠ਪఆ͢Δɽ͜Ε unigram ͷࠞ߹ൺΛ. ͼ P (d|z) ͔ΒϕΠζͷఆཧʹΑͬͯಋ͖ग़͢͜ͱ͕Ͱ͖. తͷจॻʹର͠࠷ਪఆʹΑΓ࠷దԽ͢Δ͜ͱͰߦ͏ɽ. Δɽ͜ΕΒͷΛ༻͍ͯɼֶशσʔλʹ·ؚΕΔ 1 จॻ͝. ͦͷࡍʹֶश࣌ͱಉ͘͡ T-EM Λ༻͍ΔɽT-EM ʹ༻. ͱͷ୯ޠͷग़ݱճΛֶशσʔλͱ͠ɼEM ΞϧΰϦζϜ. ͍ΔࣜҎԼͷࣜ (7),(8) ͱͳΔɽ. ʹΑΓ෮ֶशΛ܁Γฦ͢͜ͱͰɼҎԼͷࣜ (2) Λ࠷େ Խ͢Δ͜ͱͰɼΛ࠷େԽ͢ΔΑ͏ͳҙͷͷ. unigram Λֶश͢Δ͜ͱͰੜ͞ΕΔɽ ⓒ 2013 Information Processing Society of Japan. E-step: {P (z)P (k) (h|z)P (w|z)}β (k) (h|z)P (w|z)}β z∈Z {P (z)P. P (k) (z|h, w) = . (7). 2.
(3) Vol.2013-SLP-97 No.6 2013/7/26. ใॲཧֶձڀݚใࠂ IPSJ SIG Technical Report ද 1. M-step: . P. (k+1). PLSA ͷֶश݅. ֶशσʔλ. n(h, w)P (k) (z|h, w) (8) (h|z) = (k) (z|h, w)} z∈Z { w∈W n(h, w)P w∈W. CSJ. จॻ. 972. ޠኮ. 10000. જࡏϞσϧ. 50. EM ෮ճ. 100. ΞχʔϦϯάεέδϡʔϧ. inverse annealing. Λ͙తͰɼΞχʔϦϯάͱݺΕΔૢ࡞͕ߦΘΕΔɽ. β ॳظ. 1.0. ௨ৗͷ EM ΞϧΰϦζϜͱͷҧ͍ͱͯ͠ɼࣜʢ3ʣͷΑ͏. β ऴ. 0.8. β ߋ৽ճ. 4. 2.4 ΞχʔϦϯά T-EM ʹ͓͍ͯɼֶशͱ࠷ॴہదղͷམͪࠐΈ. ʹ E-Step ͷࡍʹӈลશମΛ β ( β ≥ 0 ) ͢Δɽβ = 1.0 ͷ߹ʹ௨ৗͷ EM ΞϧΰϦζϜͱ͘͠ͳΔɽβ ͕ 1.0 ΑΓখ͚͞Ε͕ؔฏԽ͞Εɼ࠷ॴہదղͷऩ ଋΛ͙ͷޮՌ͕͋Δɽ. T-EM Ͱ͜ͷ β Λ෮ֶश͕ਐߦ͢Δʹ࿈ΕͯมԽ ͍ͤͯ͘͞ɽ͜ͷ β ΛมԽͤ͞Δखଓ͖ΛΞχʔϦϯά εέδϡʔϧͱ͍͏ɽΞχʔϦϯάεέδϡʔϧେ͖͘ ͚ͯೋछྨ͋ΔɽҰͭ β ͷॳظΛ 1.0 ΑΓখ͞ ͍͔Β࢝Ίɼঃʑʹ૿͍ͯ͘͜͠ͱͰ࠷ऴతʹ 1.0 ʹ ͢ΔͷͰɼDAEM ( Deterministic Annealing EM ) ͱݺ ΕΔɽ͜Εʹɼֶशॳظஈ֊Ͱͷ࠷ॴہదղͷऩଋ Λ͙ޮՌ͕͋Δɽଞํ β ͷॳظΛ 1.0 ͱ͠ɼֶश͕ ਐΉʹͭΕͯ β ΛݮΒ͍ͯ͘͠ͷͰɼinverse annealing ͱݺΕΔɽͪ͜ΒʹɼֶशΛՃͤ͞ɼ·ͨɼաֶश Λ͙ޮՌ͕͋ΔɽຊߘͰֶशͷΛૣΊΔత͔Β. inverse annealing ʹΑΓֶशΛߦ͏͜ͱͱ͢Δɽ. 3. PLSA ֶशʹ͓͚Δॳظͷݕ౼ PLSA ͷֶशʹࡍͯ͠ɼ͡Ίʹ L Λ༩͑Δɽແ. ͷ͕ߟ͑ΒΕΔɽҰํͰɼॳظґଘੑେ෯ʹվ ળ͞ΕΔͱߟ͑ΒΕΔɽ ͦ͜Ͱॳظʹ 0 ΛؚΉ߹ͱ͍ͳ·ؚ߹ʹ͓͚Δ. PLSA ͷ౷ܭతޠݴϞσϧͱͯ͠ͷੑೳͱॳظґଘੑͷ ൺֱΛߦͬͨɽ. 3.2 ࣮ݧ݅ (1) ࣜʹ͓͚Δ P (w|z) ΛɼҎԼͷ 3 ௨Γͷํ๏Ͱॳظ Խ͢Δɽ. 1.. ཚʹΑΓແ࡞ҝʹ L ݸͷจॻΛબͼɼͦΕΒʹ·ؚ ΕΔ୯ޠසͷʹ ͍ͯͮجL ݸͷ unigram ΛॳظԽ͢ΔɽཚΛม͑ͯ 5 ௨Γߦ͏ɽ. 2a. 1. ͰಘΒΕͨ unigram ͷ֤ཁૉΛҰ༷ʹϑϩΞ Ϧϯά͢Δɽ۩ମతʹɼ1. ͰಘΒΕͨ P1 (w|z) ͔ ΒҎԼͷࣜʹΑͬͯٻΊΔ. P2a (w|z) = p · P1 (w|z) + (1 − p) ·. 1 n(W ). (9). ࡞ҝʹֶशσʔλΑΓநग़ͨ͠ L ݸͷจॻͷ୯ ޠunigram. ͜ͷ࣌ɼn(W ) ޠኮΛද͢ɽ·ͨɼࠓճ p = 10−6. Λ֤ unigram ͷॳظͱͯ͠༻͍Δɽͦͷࡍɼॳظ. ͱͨ͠ɽ1. Ͱ࡞ͨ͠ཚΛม͑ͨ 5 ௨ΓͷॳظΛ. ͱͯ͠༩͑Δʹ 0 Λଟؚ͘Ήεύʔεͳ unigram Λ. ϕʔεʹ֤ʑϑϩΞϦϯάΛߦ͏ɽ. ༻͍Δ߹ͱɼͯ͢ͷཁૉʹԿΒ͔ͷΛ༩͑Δ߹ʹ. 2b. 1. ͰಘΒΕͨΛશֶशσʔλΛ༻͍ͯಘͨ unigram. ͍ͭͯɼܗ͞Εͨ PLSA ʹΑΔɼิਖ਼ Perplexity ͷҧ. ʹΑΓฏԽ͢Δɽ۩ମతʹɼ1. ͰಘΒΕͨ P1 (w|z). ͍Λൺֱ͢Δɽ. ͔ΒҎԼͷࣜʹΑͬͯٻΊΔ. P2b (w|z) = p · P1 (w|z) + (1 − p) · uniall (w) (10). 3.1 ॳظͱͯ͠ͷ 0 PLSA ͷֶशͷࡍʹ༻͍Δࣜ (3) ( ͼٴ4) ΑΓɼॳظԽ ͷࡍʹಛఆͷ z ʹରͯ͠ P. (0). ͜ͷ࣌ɼuniall (w) ֶशσʔλͯ͢Λͬͯߏங. (w|z) = 0 ͱͳΔ w ͕ଘࡏ. ͨ͠ unigram ʹ͓͚Δ୯ ޠw ͷग़֬ݱΛද͢ɽ·. ͢Δͱ͖ɼk ͷʹ͔͔ΘΒͣɼP (k) (w|z) = 0 ͱͳΔɽ͜. ͨɼࠓճ p = 10−6 ͱͨ͠ɽ1. Ͱ࡞ͨ͠ཚΛม. ͷ͜ͱ͔ΒɼॳظԽʹ͓͍ͯ P (w|z) ʹ 0 Λ༩͑Δ߹ɼ. ͑ͨ 5 ௨ΓͷॳظΛϕʔεʹ֤ʑϑϩΞϦϯάΛߦ. PLSA ଞͷΛ༩͑Δ߹ͱൺͯಛผͳৼΔ͍Λ͢. ͏ɽ. Δͱߟ͑ΒΕΔɽ ԾʹॳظԽͷࡍʹ 0 ͕ଟ͘·ؚΕͨ߹ɼ0 Λ࣋ͭ෦. ͜ͷͱ͖ɼ1. ͷख๏ʹΑͬͯಘΒΕͨॳظʹ 0 ͕ଟ. ʹؔͯ͠ࢉܭͷඞཁੑ͕ͳ͘ͳΔͨΊɼֶशͷࡍʹඞཁ. ͘·ؚΕΔͷʹରͯ͠ɼ2. ͷํ๏ͰಘΒΕͨॳظʹ 0. ͳࢉܭίετͷ͕ݮՄೳͱͳΔɽ͔͠͠ͳ͕Βɼநग़͞. ͕·ؚΕΔ͜ͱͳ͍ɽ·ͨɼ2a. ͱ 2b. Λൺֱ͢Δ͜ͱ. Εͨจॻͷಛ͕͘ڧө͞ΕΔͨΊɼॳظґଘੑ͕ߴ. ͰฏԽͷख๏ʹΑΔ࣮݁ݧՌͷӨڹΛ؍ଌ͢Δ͜ͱ͕. ·Δ͜ͱ͕ݒ೦͞ΕΔɽ·ͨɼະޠͷ্ঢߟ͑ΒΕ. Ͱ͖Δͱߟ͑ΒΕΔɽ. ΔɽଞํɼॳظԽͷࡍʹ 0 Λ͍ͳ·ؚ߹ʹɼֶशʹ͓. PLSA ͷֶश݅Λද 1 ʹࣔ͢ɽֶशσʔλʹຊޠ. ͚Δऩଋ͕͘ͳΔɼ͋Δ͍࠷దղʹ୧Γ͚ͭͳ͍ͳͲ. ͠ݴ༿ίʔύε (ҎԼ CSJ) ʹ·ؚΕΔ࣮ߨԋσʔλ 987. ⓒ 2013 Information Processing Society of Japan. 3.
(4) Vol.2013-SLP-97 No.6 2013/7/26. ใॲཧֶձڀݚใࠂ IPSJ SIG Technical Report ද 2. 3.3 ࣮݁ݧՌ. PLSA ͷదԠ݅. EM ෮ճ. 60. ΞχʔϦϯάεέδϡʔϧ. inverse annealing. β ॳظ. 1.0. β ऴ. 0.9. β ߋ৽ճ. 2. ॳظԽ๏ʹΑΔ PLSA ͷੑೳͷࢄਤΛਤ 2 ʹࣔ͢ɽά ϥϑͷԣ࣠༻͍ͨςετηοτΛɼॎ࣠ิਖ਼ Perplexity ͷΛද͢ɽ֤ςετηοτʹର͠ɼ1. ʹΑΔ 5 छͷ݁Ռ ʢؙʣ ɼ2a. ʹΑΔ 5 छͷ݁Ռʢˎʣ ɼ2b. ʹΑΔ 5 छͷ ݁Ռʢ੨ʴʣ͕ϓϩοτ͞Ε͍ͯΔɽͨͩ͠ɼͱ੨ʹͭ ͍ͯ΄ͱΜͲॖୀͯ͠Ұͭͷʹ͑ݟΔɽ ͜ͷਤΑΓ 1. ͷख๏͕ͬͱิਖ਼ Perplexity Λ͘ ͑Δ͜ͱ͕Ͱ͖͍ͯΔͷ͕Θ͔ΔɽͦͷҰํͰ 1. ʹ͓͍ ͯཚͷγʔυʹΑΓ࠷େͰ 15% ఔͷิਖ਼ Perprexity ͷେ͖ͳมಈ͕ݟΒΕͨɽ͜ͷ͜ͱ͔Βɼॳظʹ 0 Λଟ ؚ͘Ή 1. ͷख๏Ͱॳظґଘੑ͕ݱ͘ڧΕΔ͜ͱ͕Θ ͔ΔɽҰํɼਤதͷ 2a. ͱ 2b. ͰཚͷγʔυʹΑΔ ͕ࠩ΄΅ݟΒΕͳ͍ɽ͜ͷݪҼͱͯ͠ 2 ͭͷՄೳੑ͕ߟ ͑ΒΕΔɽҰͭॳظԽͷࡍʹ 0 ΛظॳͰͱ͍͜ͳͤ·ؚ ґଘੑ͕ऑ͘ͳΔՄೳੑɽͦͯ͠͏ͻͱͭɼॳظԽ ͷࡍʹ 0 ΛֶͰͱ͍͜ͳͤ·ؚशʹඞཁͳ EM ΞϧΰϦ ζϜͷ෮ճ͕૿͑Δɼ·ͨɼ࠷దղʹ୧Γ͚ͭͳ͘. ਤ 2. ͳ͍ͬͯΔՄೳੑͰ͋Δɽ·ͨɼ͜ͷ࣮͍͓ͯʹݧֶश. ॳظԽ๏ʹΑΔ PLSA ͷੑೳͷࢄਤ. ͷࡍͷ EM ΞϧΰϦζϜͷ෮ճΛఆͱ͍ͯͨ͜͠ͱ ߨԋͷ͏ͪɼධՁ༻σʔλ 15 ߨԋΛআ͍ͨ 972 ߨԋ. ͔Βɼ͠ॳظʹΑͬͯ EM ΞϧΰϦζϜʹΑΔֶशͷ. ͷσʔλΛ༻͍ͨɽޠኮֶशσʔλʹ·ؚΕΔ୯ޠͷ͏. ਐߦʹมԽ͕͋Δ߹ɼͦͷ͕ࠩੑೳࠩͱͯ͠ݱΕͯ. ͪɼ10 ճҎ্ग़ͨ͠ݱ 1 ສ୯ͨ͠ͱޠɽ·ͨɼજࡏϞσ. ͠·ͬͨՄೳੑ͕͋Δɽ. ϧ 50ɼֶश࣌ͷ EM ΞϧΰϦζϜͷ෮ճ 100 ճ ͱͨ͠ɽT-EM ͷ β ߋ৽ͷͨΊͷΞχʔϦϯάεέδϡʔ ϧʹ inverse annealing Λ༻͍ɼ࠷ऴతͳ β ͷ͕ 0.8 ʹͳΔΑ͏ʹ 5 ஈ֊ʹߋ৽Λߦͬͨɽ. 3.4 ॳظͷੑ࣭ͷҧ͍ʹΑΔֶश࣌ͷৼΔ͍ ॳظͷੑ࣭ͷҧ͍ʹΑΔֶश࣌ͷৼΔ͍Λ؍ଌ͢Δ ͨΊɼ3.3 ͱಉ݅Ͱ EM ෮ճͷΈΛ 200 ʹ͠ɼ10 ճ. ධՁͷࡍͷ PLSA ͷదԠ݅Λද 2 ʹࣔ͢ɽςετηο τʹ CSJ ʹ·ؚΕΔ࣮ߨԋσʔλ 987 ߨԋ͔Βແ࡞ҝ. ຖͷิਖ਼ Perprexity ͷมԽ͔Βɼֶश࣌ͷৼΔ͍ʹͭ ͍ͯ؍Λߦͬͨɽ. ʹநग़ͨ͠ 15 ߨԋΛ༻͍ͨɽධՁͷࡍͷ EM ΞϧΰϦζ. ͦͷ݁ՌΛਤ 3 ͓Αͼਤ 4 ʹࣔ͢ɽਤͷॎ࣠ิਖ਼ Per-. Ϝͷ෮ճ 60 ͱ͠ɼ·ͨɼదԠ࣌ͷΞχʔϦϯάε. prexity Λɼԣ࣠ EM ΞϧΰϦζϜͷֶशʹ͓͚Δ෮. έδϡʔϧաֶशΛࢭ͢ΔͨΊ inverse annealing Λ. ճͱͳ͍ͬͯΔɽͦΕͧΕͷਤʹ͓͍ͯɼԼଆͷ͕ 1.. ༻͍ɼ2 ஈ֊ʹߋ৽ΛߦͬͨɽධՁईʹςετηοτ. ͷख๏ʹΑΓॳظԽͨ͠߹Λɼ্ଆͷ͕ 2b. ͷख๏ʹ. ʹର͢Δิਖ਼ Perplexity Λ༻͍ͨɽิਖ਼ Perplexity ͷܭ. ΑΓॳظԽΛߦͬͨ߹Λද͢ɽ. ࢉࣜ࣍ࣜʢ11ʣʹͳΔɽ. AP P = {P (w1 , w2 ...wn ) · m−o }. ςετηοτͷ 15 จॻʹ͓͚Δֶश࣌ͷৼΔ͍Λ؍ 1 −n. (11). ͨ݁͠Ռɼେ͖͘͜ΕΒͷ 2 छʹ͚ΒΕΔ͜ͱ͕Θ ͔ͬͨɽਤ 3 ͷΑ͏ͳৼΔ͍Λ͢Δͷɼֶश͕ਐΉ. ͜ͷͱ͖ɼP (w1 , w2 ...wn ) ୯ ྻޠw1 , w2 ...wn ͕ੜ. ʹͭΕͯΏΔ͔ʹݮগ͍͍ͯͬͯ͠Δͷ͕ͯݟऔΕΔɽ. ͞ΕΔ֬ΛɼO ະޠͷΛɼm ະޠͷछྨΛ. ͜Εॳظʹ 0 ΛͰͱ͍͜ͳ·ؚɼֶशʹඞཁͳ෮ճ. ͦΕͧΕද͢ɽ. ͕૿Ճ͍ͯ͠Δͱଊ͑Δ͜ͱ͕Ͱ͖Δɽଞํɼਤ 4 ͷΑ. P (d|z) ͱ P (z) ʹ͍ͭͯࣜʢ12ʣࣜͼٴʢ13ʣʹΑΓ ॳظԽΛߦͬͨɽ. P (d|z) = P (z) =. 1 n(D). 1 L. ্͢Δ͜ͱͳ͘ɼૣ͍ஈ֊Ͱॴہղʹऩଋͯ͠͠·ͬͯ. (12). ⓒ 2013 Information Processing Society of Japan. ͍Δ͜ͱ͕͑Δɽ. 4. ͓ΘΓʹ (13). ͜ͷ࣌ n(D) ֶशσʔλʹ·ؚΕΔจॻΛɼL જ ࡏϞσϧΛͦΕͧΕද͍ͯ͠Δɽ. ͏ͳৼΔ͍Λ͢ΔͷɼֶशΛ܁Γฦͯ͠ੑೳ͕. ຊߘͰɼ֬తજࡏҙຯղੳʹ͓͍ͯɼॳظͷ༩͑ ํɼಛʹֶशͷࡍʹ༩͑Δॳظʹ 0 ͕·ؚΕΔ͔Ͳ͏͔ ʹΑΔʹΑΔ౷ܭతޠݴϞσϧͱͯ͠ͷ֬తજࡏҙຯղ. 4.
(5) Vol.2013-SLP-97 No.6 2013/7/26. ใॲཧֶձڀݚใࠂ IPSJ SIG Technical Report. [5]. [6] [7]. [8]. [9] [10]. ਤ 3. ॳظԽ๏ʹΑΔ PLSA ͷֶशਐߦͷҧ͍ 1. ਤ 4. ॳظԽ๏ʹΑΔ PLSA ͷֶशਐߦͷҧ͍ 2. [11] [12]. ୍ߒ, ࡔ๕య, ٱ, ՏݪୡɿMAP ਪఆ Λ༻͍ͨ N-gram ޠݴϞσϧͷλεΫదԠɼ৴ֶٕใ, SP96-103(1997). Thomas HofmannɿProbabilistic Latent Semantic AnalysisɼUncertainity in Artificial Intelligence (1999). Daniel Gildea, Thomas HofmannɿTOPIC-BASED LANGUAGE MODELS USING EMɼEuroSpeechʟ99, pp.2167-2170(1999). ࢁ܀ਓ, ླج೭, ҏ౻জଇ, ਖ਼ࡾɿPLSA ޠݴϞ σϧͷֶश࠷దԽͱޠኮׂʹؔ͢Δݕ౼ɼॲݚใ, 2006-SLP-60(2006). ळా༞࠸, Տݪୡɿͱऀʹؔ͢Δ PLSA ʹͮ͘ج ޠݴϞσϧదԠɼॲݚใ, 2003-SLP-49(2003). ࡚ٶকོɿWWW ͔ΒಘΒΕΔ Term Frequency ใ ʹ ͮ͘جPLSA ޠݴϞσϧɼॲݚใ, 2011-SLP-85, No.14(2011). ٶຊఆ໌ɿΫϥελʔੳೖɼग़൛ʢ1999ʣ. ా؛໌ɿจॻΫϥελϦϯάͷٕ๏ɼLibrary and Information Science, Vol. 49, pp. 33?75, 2003.. ੳͷৼΔ͍ͷมԽΛͨ͠؍ɽ ݁Ռͱͯ͠ɼॳظʹ 0 Λଟؚ͘Ή͜ͱΛೝΊͨ߹ɼ ֶश͕ૣ͘ऴΘΓɼ·ͨɼิਖ਼ Perprexity ʹ͓͍ͯྑ͍ ੑೳ͕ͰΔ݁Ռͱͳͬͨɽ͔͠͠ͳ͕Βɼཚͷγʔυʹ ΑΔੑೳͷϒϨ͕େ͖͍ͱ͍͕ͬͨܽ͋Δɽ·ͨɼ͜ͷ ॳظԽ๏ʹ͓͍ͯɼະͳ͘ߴ͕ޠΔ͜ͱ͕֬ೝ͞Ε ͓ͯΓɼ࣮ࡍʹԻೝࣝʹ࣋ͪࠐΜͩ߹ͷੑೳʹෆ҆ ͕Δɽଞํɼॳظʹ 0 Λ͍ͳ·ؚ߹ʹɼֶशͷ ͕͘ͳΔ͜ͱ͕Θ͔ͬͨɽ·ͨɼ߹ʹΑͬͯॴہ ࠷దղͷऩଋ͕ݟΒΕͨɽͪ͜Βͷख๏Ͱɼະޠ 0 ΛؚΉ߹ͷҎԼʹ͑ΒΕ͍ͯͨɽ ࢀߟจݙ [1] [2]. [3]. [4]. ݚೋɿ֬తޠݴϞσϧɼ౦ژେֶग़൛ձ (1999). ֿӜହஐɼླج೭ɼҏ౻জଇɼਖ਼ࡾ: WWW Λར ༻ͨ͠ޠݴϞσϧ͠ͳࢣڭλεΫదԠʹ͓͚Δ༗ޮࡧݕ ΫΤϦܾఆ๏, ॲݚใ, 2006-SLP-64(2006). ૿ଜ྄ɼҏ౻ਔɼҏ౻জଇɼਖ਼ࡾ: WWW Λར༻͠ ͨޠݴϞσϧదԠͷͨΊͷࡧݕΫΤϦߏͷݕ౼, ॲݚ ใɼVol.2009-SLP-76 Noɽ10(2009). ༤, ླج೭, ਖ਼ࡾɿHMM Λ༻͍ͨෳ n-gram ϞσϧʹΑΔޠݴϞσϧͷߏஙɼॲ, Vol.J43 , No.7, pp.2075-2081(2002).. ⓒ 2013 Information Processing Society of Japan. 5.
(6)
関連したドキュメント
ベクトル計算と解析幾何 移動,移動の加法 移動と実数との乗法 ベクトル空間の概念 平面における基底と座標系
2813 論文の潜在意味解析とトピック分析により、 8 つの異なったトピックスが得られ
(Construction of the strand of in- variants through enlargements (modifications ) of an idealistic filtration, and without using restriction to a hypersurface of maximal contact.) At
Theorem 4.8 shows that the addition of the nonlocal term to local diffusion pro- duces similar early pattern results when compared to the pure local case considered in [33].. Lemma
This paper presents an investigation into the mechanics of this specific problem and develops an analytical approach that accounts for the effects of geometrical and material data on
While conducting an experiment regarding fetal move- ments as a result of Pulsed Wave Doppler (PWD) ultrasound, [8] we encountered the severe artifacts in the acquired image2.
[r]
einer rechtliche Wirkung gerichtete