• 検索結果がありません。

JAIST Repository: 社会情報可視化システムと適応クラスタリング

N/A
N/A
Protected

Academic year: 2021

シェア "JAIST Repository: 社会情報可視化システムと適応クラスタリング"

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)JAIST Repository https://dspace.jaist.ac.jp/. Title. 社会情報可視化システムと適応クラスタリング. Author(s). 岩本, 雅道. Citation Issue Date. 2001-03. Type. Thesis or Dissertation. Text version. author. URL. http://hdl.handle.net/10119/724. Rights Description. Supervisor:石崎 雅人, 知識科学研究科, 修士. Japan Advanced Institute of Science and Technology.

(2) A Visualization System for Social Information and Adaptive Clustering Masamichi Iwamoto School of Knowledge Science, Japan Advanced Institute of Science and Technology March 2001 Keywords: Document clustering, Visualization, Adaptive clustering, User feedback The rapid diffusion of the Internet in recent years greatly changes our information environment. The quantity of information obtainable on the Internet becomes huge especially by World Wide Webs (WWW). In this information flooding environment, assisting systems of searching information are very important, which are now realized as search engines. The search engines collect and rank the information or documents based on the user s request, usually expressed by keywords. The underlying assumption of these search engines is that there exists one document which facilitates the user s request. This is true for simple quiz type problems such as the population of Japan and the height of Mt. Fuji. However, difficult problems concerning social issues (environment, nuclear power plant, ...) cannot be solved in one document. Rather, various kinds of information should be collected from the user s viewpoint and used for deciding on one s position toward the problems. The purpose of this system is to construct a system which assists the users to cope with such difficult problems. The functions of the systems are as follows: 1) clustering of the documents and displaying them using the terms, original texts, and the spatial relations of the terms on time axis, which helps the user with understanding how the problems have been discussed, and 2) re-clustering the document based on the user s feedback (we call this adaptive clustering ). The feedback consists of the user s selection of related documents and.

(3) the clustering mode: precision-based and recall-based. The former collects as few unrelated documents as possible; the latter collects as many related documents as possible. The system can learn the user s preference and display it as a cluster, which enables the user to realize her/his implicit thought. The methods of adaptive clustering can be classified into two kinds: one is to change the term weight and the other is to increase the number of the terms. In this thesis, the term selection function, the rate of the term weight change, the number of the terms were used as parameters for evaluating the adaptive clustering methods. Term frequency-inverse document frequency (TFIDF), χ 2 and the degree of context dependence were used as the term selection function. The rate of the term weight is changed from 1.5 to 10. The number of the term is changed from 10 to 100%. For the number of the terms, the inclusion of the terms which do not co-occur in the user s selected documents were also evaluated (we call this forced co-occurrence ). The evaluation results showed that 1) TFIDF with 56.7% of the terms and the term weight change by 5 achieved the highest f-value, and 2) the forced co-occurrence with 41.7% of the terms and no term weight achieved the highest f-value, not degrading precision..

(4)

参照

関連したドキュメント

Characte r is t ic b ipo lar waveforms were frequen t ly observed by the e lec tr ic waveform rece iver onboard the lunar orb i ter named

[r]

*2 Kanazawa University, Institute of Science and Engineering, Faculty of Geosciences and civil Engineering, Associate Professor. *3 Kanazawa University, Graduate School of

* Ishikawa Prefectural Institute of Public Health and Environmental Science 1-11 Taiyougaoka, Kanazawa, Ishikawa 920-1154 [Received April 23, 2001] Summary The cell...

Regional Clustering and Visualization of Industrial Structure based on Principal Component Analysis for Input-output Table Data.. Division of Human and Socio-Environmental

* Department of Mathematical Science, School of Fundamental Science and Engineering, Waseda University, 3‐4‐1 Okubo, Shinjuku, Tokyo 169‐8555, Japan... \mathrm{e}

This research was supported by Natural Science Foundation of the Higher Education Institutions of Jiangsu Province (10KJB110003) and Jiangsu Uni- versity of Science and

Arnold This paper deals with recent applications of fractional calculus to dynamical sys- tems in control theory, electrical circuits with fractance, generalized voltage di-