非択一式データベースにおける実験

2 共起成分の含意関係と ZDD

5.2 非択一式データベースにおける実験

択一式ではないデータベースに本手法を適用した場合では，どのような関係が抽出できるのか調べるために，

インターネット通販サイトの閲覧履歴のデータベースであるBMS-Web-View-1(FIMIベンチマーク[7]より)へ適用した．このデータベースには，497個のアイテムからなる，42629件のレコードが登録されている．一つ一つのアイテムはの通販サイト上のページ一つ一つに対応しており，一つのレコードは，一人の顧客が訪問した際，

閲覧したページの情報で構成されている．なお，例外を許さない共起成分の含意関係は25組抽出されている．

本手法を適用したところ，αが空集合ではないペアは

91139組存在した．この，データベースもインフルエン

ザウイルスのデータベースと同様に，αがβ，γよりも小さいことから，例外を許した共起成分の含意関係として，新たに抽出された関係はなかった．また，αが空集合でないペアの数もかなり大きな数字であることから，

少なくともこのデータベースに関しては，本手法を用いることによって，有意な関係を抽出することは難しい可能性が高い．

6 _結論

本稿では，例外を許容した共起成分の含意関係を検出するする手法を提案し，この手法を実際のデータベースへ適用する方法について述べた．択一式のデータ構造を持つデータベースに適用したところ，そのデータ構造を大まかではありながらも，抽出することができた．しかしながら，例外を許さない共起成分の含意関係からの拡張という観点からみると，観察者の視点というアナログ

的要素を持つmushroomにおいては，有意な関係を抽出することに成功したが，インフルエンザウイルスのアミノ酸配列データでは，新しい関係を抽出することはできなかった．むしろ，例外を許さない共起成分の含意関係が成り立つ状況こそが特異な状況であると言えるかもしれない．また，非択一式構造のデータベースであるネット通販サイトのページ閲覧履歴データでは，本手法を用いて，有用な関係性を抽出することはできなかった．

なお，今後の展開としては，以下のようなものが挙げられる．

• 択一式データベースで，本手法により，抽出した結果をグラフ化した際，カテゴリーごとのクラスタが形成されるが，そのクラスタ内での関係性から，有用な情報が抽出できるか検討し，抽出できるのであれば，既存の手法と比較する．

• 本稿では，例外を許すことが有用であるか否かに着目したために，全てのアイテムのペア（アイテム数の２乗個）についての検査を単純に繰り返し実行しているが，アルゴリズムを工夫することにより計算回数を削減し高速化できる可能性があると思われる．

• 本稿では，個々のアイテムのペアに関する共起成分の含意関係について考察したが，アイテムの組合せのペアに関する共起成分まで議論を拡張し，

その有用性や計算時間について検討する．

7 _謝辞

ご指導をいただいた北海道大学アルゴリズム研究室

Thomas Zeugman教授に感謝いたします．またご討論

いただいた岩崎玄弥さんを始めとする同研究室の方々に感謝いたします．

参考文献

[1] 湊: “「共起成分の含意関係」を満たすアイテム集合のデータマイニング,”人工知能学会第６５回人工知能基本問題研究会資料, SIG-FPAI-A603-10, pp.

53–58（Mar. 2005）

[2] R. Agrawal, H. Mannila, R. Strikant, H. Toivonen and A. I. Verkamo, “Fast Discovery of Association Rules,” In Advances in Knowledge Discovery and Dat Mining, MIT Press, 307–328（1996）

[3] R. E. Bryant : “Graph-based algorithms for Boolean function manipulation,” IEEE

Transac-tions on Competers, Vol. C-35, No. 8, pp. 677–691

（1986）

[4] S. Minato: “Zero-Suppressed BDDs for Set Manip-ulation in Combinatorial Problems,” In Proc. of 30th ACM/IEEE Design Automation Conference (DAC’93), pp. 272–277（Jun. 1993）

[5] S.Minato and K.Ito: “Symmetric Item Set Mining Method Using Zero-suppressed BDDs and Aplica-tion to Biological Data,” Trans. of the Japanese Society of Artiﬁcial Intelligence, Vol. 22, No. 2, pp.

156–164（Feb. 2007）

[6] 湊: “VSOP：ゼロサプレス型BDDに基づく「重み付き積和集合」計算プログラム,”電子情報通信学会コンピュテーション研究会, 信学技報 Vol. 105, No.

72, COMP2005-10, pp. 31–38（May. 2005）

[7] B. Goethals, M. Javeed Zaki (Eds.), quent Itemset Mining Dataset Repository, Fre-quent Itemset Mining Implementations (FIMI’03), http://ﬁmi.cs.helsinki.ﬁ/data/ (2003)

[8] 湊,有村: “ゼロサプレス型二分決定グラフを用いたトランザクションデータベースの効率的解析手法,”

電子情報通信学会論文誌, Vol. J89-D, No. 2, pp.

172–182（Feb. 2006）

[9] 湊: “データベースの頻出アイテム集合を表すゼロサプレス型BDDの変数順序付けの理論的考察,”電子情報通信学会コンピュテーション研究会, 信学技報 Vol. 106, No. 566, COMP2006-55 pp. 37–42（Mar.

2007）

[10] 湊: “二分決定グラフ（BDD）を活用したデータマイニング・知識発見技術の最近の話題,”電子情報通信学会人工知能と知識処理研究会, 信学技報, Vol.

107, No. 78, AI2007-6, pp. 27–32（May. 2007）

[11] 湊: “「共起成分の含意関係」に基づくデータベースからの知識発見,”人工知能学会第4回データマイニングと統計数理研究会(SIG-DMSM)講演報告集, SIG-DMSM（July. 2007）

情報論的学習理論テクニカルレポート2009

Technical Report on Information-Based Induc-tion Sciences 2009 (IBIS2009)

局所 MFCC 特徴量の最適線形結合によるテキスト独立型話者照合

Text-Independent Speaker Verification using

Optimized Linear Combination of Local MFCC Features

坂井俊亮

^∗

Shunsuke Sakai

亀山啓輔

^†

Keisuke Kameyama

Abstract: In recent years, studies of speaker veriﬁcation have been conducted as a means for biometric person authentication. However, because of the overall veriﬁcation perfor-mance, only few actual implementations exist. This paper focuses on the text-independent speaker veriﬁcation system. We propose an eﬀective method for speaker veriﬁcation by adaptive weighting of local Mel Frequency Cepstrum Coeﬃcient (MFCC) features. For a given set of registered persons, an optimal linear weightings of multiple speech frames are searched based on the likelihood ratio error, generalizing the scheme of the conventional use of Δ parameters [1]. It was observed that using the proposed adaptive parameters, superior veriﬁcation performance was achieved compared with the cases using conventional features.

Keywords: text-independent speaker veriﬁcation, biometric authentication, adaptive fea-ture weighting, inter-frame feafea-ture.

1 Introduction

In recent years, as a technology to verify individuals, studies for authentication using human biometrics have been conducted actively [2]. In password authentica-tion convenauthentica-tionally often used, there are problems that the users forget the phrase and impostor can be easily veriﬁed due to leakage or theft. Therefore, biometrics technology verifying the individuals using physical in-formation such as ﬁngerprint, vein pattern, iris, face, and speech is in the spotlight [3]. Speaker veriﬁca-tion technology to verify the speaker by speech features can be useful as it does not require special veriﬁcation

∗筑波大学システム情報工学研究科, 305-8573 茨城県つくば市天王台 1-1-1, tel. 029-853-6200 ext. 8480, e-mail [email protected],

Graduate School of Systems and Information Engineering, Uni-versity of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8573, Japan

†筑波大学システム情報工学研究科, 305-8573 茨城県つくば市天王台 1-1-1, tel. 029-853-6200 ext. 8480, e-mail [email protected],

Graduate School of Systems and Information Engineering, Uni-versity of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8573, Japan

hardware, is less stressful for the users, and can be used from remote places across the telephone network.

However, actual use of speaker veriﬁcation technology is still less common. The main reason is due to the fact that the veriﬁcation performance is still low when compared with the use of other modalities.

This study proposes a framework of speaker model-ing to use the inter-frame dynamics in addition to per-frame Mel Frequency Cepstrum Coeﬃcient (MFCC) speech feature aiming to improve the veriﬁcation pre-cision. While ΔMFCC feature [1] is known to take a similar approach, it uses ﬁxed weights of local MFCC features. In contrast, the proposed feature employs adaptive weights optimized for improving the veriﬁca-tion performance. Also, some methods that present mel-cepstral analysis method and its adaptive algo-rithm [4], and method which optimizes the weights for each likelihood such that the overall expected loss can be minimized [5], have been reported. In this work, we examine the improvement of speaker discrimina-tion ability by using an optimized linear combinadiscrimina-tion

of relatively long-term features.

2 Text-independent speaker

ドキュメント内 COE SITAIE- ICE IEICE IEICE IEICE IEICE (PRMU) () IEEE Committee Members of IT Society Japan ChapterIEEE Computational Intelligence Society Japan Chap (ページ 116-119)