• 検索結果がありません。

Japan Advanced Institute of Science and Technology

N/A
N/A
Protected

Academic year: 2021

シェア "Japan Advanced Institute of Science and Technology"

Copied!
3
0
0

読み込み中.... (全文を見る)

全文

(1)

Japan Advanced Institute of Science and Technology

JAIST Repository

https://dspace.jaist.ac.jp/

Title

繰り返し連続化囚人のジレンマゲームによるマルチエ

ージェント系の解析

Author(s)

千葉, 一博

Citation

Issue Date

1999‑09

Type

Thesis or Dissertation

Text version

author

URL

http://hdl.handle.net/10119/894

Rights

Description

Supervisor:平石 邦彦, 情報科学研究科, 博士

(2)

Iterated Continuous Prisoner's Dilemma Game

Kazuhiro Chiba

School of Information Science,

Japan Advanced Institute of Science and Technology

July 9, 1999

Abstract

The purp ose of this pap er is to prop ose a new model of interactions between agents

and to show its usefulness for analyzing multi-agent systems. This model is called the

iterated continuous prisoner's dilemma game. It is an extended version of the usual it-

erated prisoner's dilemma game, and can deal with intermediate decisions. Using the

iterated continuous prisoner's dilemma game, we analyze a dynamic b ehavior of multi-

agent systems, esp ecially for the process of invasion between two groups of agents. The

analysis is done on a typical go od strategy on the usual game, and the protability of

intermediate decisions is claried. In a multi-agent system, each agent pursues his own

interests through interactions with other agents. However, there is no sup ervisor which

controlsthewholesystem toresolveeachconictofinterestsamongagents. Forsuchsys-

tems, thereare several results onthe mechanismof computationfor cooperationand the

structure of a system in order to achieve it eciently. For instance, T. Ishida proposed

coordinator agents on a at network that support people autonomously to make con-

structive agreements. On the other hand, there were researches oninvestigating natures

of decisions in the interaction which is necessary to achieve robustness or stability, not

restrictedtoco operation,ofthesystem. Forinstance, R.Axelro dinvestigatedcollectively

stable strategies onthe iteratedprisoner's dilemma game. This research also approaches

analytically to multi-agent systems just like that by Axelrod. In many researches, the

prisoner's dilemma game was used as a model for analyzing interactions between agents

in multi-agent systems. This game is a two-p erson non-zero-sum game. In this game,

each player chooses an action b etween two alternatives called `Cooperate' and `Defect'.

As a result, each player gains a payo by a certain matrix. This game accurately rep-

resents the situation of a dilemma that a rational action taken by each agent does not

result inthe Paretooptimum. Bysucha characteristic,it isconsidered that this gameis

suitable for study on a recipro cal decision in interactions between agents in a dilemma.

In recent years, this game has been also acknowledged as one of standard problems in

the eld of study ondistributed articial intelligence. Especially, the iterated prisoner's

c

(3)

of the prisoner's dilemma game. In the iterated prisoner's dilemma game, Tit for Tat

strategy (TFT) is well-known as atypical go od strategy. When considering multi-agent

systems such as the general public or the cyberworld, it is also considered that there

mightbea case that a decision between two alternatives leavessomething to be desired.

Becausepeopleoftentakeinexact standsagainstunknownopponents. Theresultsof this

research are concerning the protability of intermediate decisions. They can contribute

to designing the elastic decisionmechanismof each agentin amulti-agent system.

Key Words: distributed articial intelligence, multi-agent,

interaction between agents, game theory, prisoner's dilemma

参照

関連したドキュメント

[r]

By those facts, E-nose technology which employs array of MOS gas sensors driven by the advanced temperature modulation technique was used to measure the gases and

According to multi- variate analysis, expression of CD42b, a platelet marker, in our biopsy specimens from advanced gastric cancer with preoperative DCS therapy was

Design of a radiopharmaceutical for the palliation of painful bone metastases: rhenium-186-labeled bisphosphonate

Nov, this definition includ.ing the fact that new stages on fundamental configuration begin at the rows 23 imply, no matter what the starting configuration is, the new stages

In the complete model, there are locally stable steady states, coexisting regular or irregular motions either above or below Y 1 100, and complex dynamics fluctuating across bull

In 2003, Agiza and Elsadany 7 studied the duopoly game model based on heterogeneous expectations, that is, one player applied naive expectation rule and the other used

In particular, building on results of Kifer 8 and Kallsen and K ¨uhn 6, we showed that the study of an arbitrage price of a defaultable game option can be reduced to the study of