• 検索結果がありません。

社会ジレンマをモデル化する

N/A
N/A
Protected

Academic year: 2021

シェア "社会ジレンマをモデル化する"

Copied!
35
0
0

読み込み中.... (全文を見る)

全文

(1)

エネルギー環境論

担当教官:谷本 潤 教授

第6回講義

社会ジレンマをモデル化する

-統計物理学,進化ゲーム理論と社会ジレンマ-

(2)

都市境界層

湿気

空気

106 104

103

104 105

101

100 10-1

10-2 -∞

-∞

長さスケール[m]

Global scale Urban scale

Room

Human Urban

Human Architecture

Mutually-interpenetrative view over wide spatial-scales

Two physical systems having neighboring special scales are mutually connected through boundary conditions.

Small scale

Interaction

Large scale

Building Building-block

To elaborate the Human -

Environment -Social System, it’s important a concept of

“Simultaneous” or “Bridging to various scales”.

(3)

× ×

Unless bridging,

appropriate boundary

conditions MUST be given.

Revised AUSSSM

都市 建築 土壌

Between physical systems

Environ ment

Human Social

How “bridges” are defined?

“Environmental problems”

mean social dilemmas conflicting those three systems.

Decision making

Social

Environment

Science for complex system

Evolutionary game theory, Multi-agent simulation, Artificial intelligence (GA, NNw etc)

Human Social System

Mutually-interpenetrative view over mutually different systems

Environment

(4)

Game theory is a study of strategic decision making. More formally, it is "the study of mathematical models of conflict and cooperation between intelligent rational decision- makers.“

John von Neumann & Oskar Morgenstern; Theory of games and economic behavior, 1944.

What is the Game Theory ?

Zero-sum (Constant-sum) games

(Japanese) Chess, Go. Minimax theorem (von Neumann); For every two- person, zero-sum game with finitely many strategies, there exists a value V and a mixed strategy for each player, such that (a) Given player 2's strategy, the best payoff possible for player 1 is V, and (b) Given player 1's strategy, the best payoff possible for player 2 is −V.

Non zero-sum (Non constant-sum) games

Many applications happening in real world. Social dilemma, Prisoner’s

Dilemma, Chicken games etc. Cuba Crisis -->Chicken game?

Game theory has been widely recognized as an important tool in many fields; economics, political science, psychology, as well as

biology, information science and even statistical physics. Eight game- theorists, including John Nash have won the Nobel Memorial Prize in Economic Sciences, and John Maynard Smith was awarded the

Crafoord Prize for his application of game theory to biology.

(5)

2 by 2 game

Cooperation

( C )

Defection

( D ) Cooperation

( C ) RR ST Defection

( D ) TS PP Agent1

Agent2

RRewardTTemptationSSuckerPPunishment

Agent1 Agent2

(6)

Application; Analytical approach concerning

equilibrium (steady-state) for Nonlinear systems

2-player 2-strategy game (2 by 2 game)

Class Dilemma? GID RAD

Prisoner’s Dilemma; PD Yes Yes Yes Chicken (Snow Drift; Hawk-Dove) Yes Yes No

Stag Hunt; SH Yes No Yes

Trivial No No No

Basic Assumption

- Infinite population.

- One-shot game; well-mixed situation (with neither social viscosity nor assortment

among agents).

(7)

Cooperation

( C )

Defection

( D ) Cooperation

( C ) RR ST Defection

( D ) TS PP Agent1

Agent2

RRewardTTemptationSSuckerPPunishment

Prisoner’s Dilemma

Agent1 Agent2

(8)

Cooperation

( C )

Defection

( D ) Cooperation

( C ) 5, 5 1, 7

Defection

( D ) 7, 1 3, 3

Agent1

Agent2

RRewardTTemptation SSuckerPPunishment

C D

C R, R S, T D T, S P, P

2R ( =8 ) >T+S ( =6 ) >2P ( =4 )

Gamble-Intending Dilemma (GID); D

g

=T-R=7-5>0

Risk-Averting Dilemma (RAD); D

r

=P-S=3-1>0

Equal Pareto Optimum

Nash Equilibrium

Prisoner’s Dilemma

Agent1 Agent2

(9)

Cooperation

( C )

Defection

( D ) Cooperation

( C ) 5 1

Defection

( D ) 7 3

Agent1

Agent2

RRewardTTemptation SSuckerPPunishment

C D

C R S

D T P

2R ( =8 ) >T+S ( =6 ) >2P ( =4 )

Prisoner’s Dilemma

Agent1 Agent2

Gamble-Intending Dilemma (GID); D

g

=T-R=7-5>0

Risk-Averting Dilemma (RAD); D

r

=P-S=3-1>0

Equal Pareto Optimum

Nash Equilibrium

(10)

Player1 Player2

P R

S T

Prisoner’s Dilemma

Pareto Optimum

Most preferable for Player 1

Worst preferable for Player 1

Pareto Inverse- Optimum

Equal Pareto Optimum

Equal

Pareto

Inverse-Optimum

S < P < R < T D

r

> 0

D

g

> 0

(11)

Chicken

Hawk–Dove Game (Maynard Smith (1982))

Snowdrift Game

Player1 Player2

S

P R T

P < S < R < T D

r

< 0

D

g

> 0

Pareto Optimum

Most preferable for Player 1

Equal Pareto Optimum

Worst

(12)

Cooperation

( C )

Defection

( D ) Cooperation

( C ) 5 3

Defection

( D ) 7 1

Agent1

Agent2

RRewardTTemptation SSuckerPPunishment

C D

C R S

D T P

2R ( =8 ) >T+S ( =6 ) >2P ( =4 )

Chicken

Agent1 Agent2

Gamble-Intending Dilemma (GID); D

g

=T-R=7-5>0

Risk-Averting Dilemma (RAD); D

r

=P-S=3-1<0

Equal Pareto Optimum

Nash Equilibrium

Nash Equilibrium Worst

(13)

Stag Hunt

Inspired by Jean-Jacques Rousseau; “Discours sur l'origine et les fondements de l'inégalité parmi les hommes” (Chapter 2)

Player1 Player2

S P T R

S < P < T < R

D

g

< 0

D

r

> 0

Best

Worst preferable for Player 1

Pareto Inverse- Optimum

Equal

Pareto

Inverse-Optimum

(14)

Cooperation

( C )

Defection

( D ) Cooperation

( C ) 7 1

Defection

( D ) 5 3

Agent1

Agent2

RRewardTTemptation SSuckerPPunishment

C D

C R S

D T P

Stag Hunt

Agent1 Agent2

Gamble-Intending Dilemma (GID); D

g

=T-R=5-7<0

Risk-Averting Dilemma (RAD); D

r

=P-S=3-1>0

Best=Equal Pareto Optimum Nash Equilibrium

Nash Equilibrium

(15)

Trivial Dilemma Free game

Player1 Player2

P S T R

P < S < T < R

D

g

< 0 D

r

< 0

Best

Worst

(16)

Cooperation

( C )

Defection

( D ) Cooperation

( C ) 7 3

Defection

( D ) 5 1

Agent1

Agent2

RRewardTTemptation SSuckerPPunishment

C D

C R S

D T P

Trivial

Agent1 Agent2

Gamble-Intending Dilemma (GID); D

g

=T-R=5-7<0

Risk-Averting Dilemma (RAD); D

r

=P-S=1-3<0

Best=Equal Pareto Optimum

Nash Equilibrium

(17)

Evolutionary game

C D

C 1 -Dr D 1+Dg 0

Dg

GID

Dr

RAD

Cooperation

A focal player plays a game with a randomly selected opponent.

Strategy (whether C or D)

adaptation based on obtained payoff is considered.

1 . 2 .

In case if PD

D

g

>0, D

r

>0 )

Time step

Cooperation fraction

2 by 2 game considered time evolution

You never see emerging cooperation, unless some additional mechanism for social viscosity is implemented.

-D

r

1+D

g

1

0 1

0

-D

r

-D

r

1+D

g

1+D

g

1

-D

r

0

0

Defection

Battle field

(18)

・ Kin selection

・ Direct reciprocity

・ Indirect Reciprocity

・ Network Reciprocity

・ Group selection

What is Social Viscosity? A restricted relation among agents

Lessing Anonymity

Emerging cooperation

Well-mixed situation A Game on a network

(19)

Let us back to the Basic Assumption again;

- Infinite population.

- One-shot game; well-mixed situation (with neither social viscosity nor assortment among agents).

0 1

2

T e

1 0

1

T e

Let us describe Cooperation and defection strategies by;

; C

; D

M

 

 

P T

S R

Also, let us define game structure, i.e. payoff matrix as below;

s 1 s 2

T s

Further, let us define strategy frequency among agents at a certain time step as below;

Fraction of C D

(20)

Let us think a simple example. When a focal player who offers D, how much of payoff expectation she can get in case of paying with another D player as her game opponent?

By simplex constraint; . s 2  1  s 1

  P

P T

S

P  

 

 

 

 

1 1 0

0

By analogy, payoff expectations of both a C and D players respectively paying with average players at this time step are;

s M e 1

T

s M e 2

T

(21)

Let us consider the following system dynamics, called

Replicator Dynamics , which is thought to be a good model for describing the reproduction process of population dynamics for animal species.

s M s

s M

e i   

T T

i i

s s

Changing rate of strategy i; C when i=1

& D when i=2

Payoff expectations of a strategy i player paying

with an average player at this time step

Payoff expectations of an average player paying with an average

player at this time step

Implying benefit brought to a player who

adopts strategy i.

(22)

s M s

s M

e i   

T T

i i

s s

Replicator Dynamics: has three equilibriums.

Two obvious equilibriums are;

(1,0) ; A state absorbed by

C

where all players offer C (C Dominate phase) .

(0,1) ; A state absorbed by

D

where all players offer D (D Dominate phase) .

The third one is;

 

 

R S

T P

T R

R S

T P

S

P (Polymorphic phase).

A question is what dynamics would be if analytic approach is applied to the Replicator Dynamics, which is a (nonlinear) cubic equation for s1 or s2.

(23)

s M s

s M

e i   

T T

i i

s s

Let us describe Replicator Dynamics explicitly by substituting i=1 and 2.

   

 

   

 

 

 

2 1

2 1

2

2 1

2 1

1

s s

s S

P s

T R

s

s s

s S

P s

T R

s

1 2

1

1

f s , s

s   s

2

f

2

s

1

, s

2

1

2

1 s

s  

When defining and as well as

reminding Simplex constraint; , we know;

2

1 f

f  

   

   

   

* x x

* x x

*

x x

x x

x x x f

f

n n n

n

x f x

f

x f x

f

1

1 1

1

Again, Our current target is to evaluate Eigen values of Jacobi Matrix at respective three equilibrium; s*.

R S T P

T R R

S T P

S

(1,0) (0,1) P

(24)

 

R S T Ps S P

s P T

S s R

f s

f

 

 

 

1

2 1 1

2 1

1

2 2

2

3

 

R S T Ps S P

-

s P T

S s R

f s

f

 

 

 

1

2 1 2

2 2

1

2 2

2

3

 

 

2 1 1

1

2 1 1

1

2 2 1

2

2 1 1

1

s f s

f

s f s

f

s f s

f

s f s

f J

We know two Eaigen values of are;

0

and (its eiven vector is (1,-1)) .

2

1 1

1

s f s

f

 

(25)

Thus, what we should currently do is evaluate sings of

at respective three equilibrium; s*.

2

1 1

1

s f s

f

 

 

 

R S T PsS P

s P T

S s R

f s

f

 

 

 

2 2

2 4

6

1

2 1 2

1 1

1

  1 , 0

* 

s    2 R  2 T

(1) At ; .

Thus, for , it must be .   0 TRD

g

 0

(2) At ; .

Thus, for , it must be .

0

 

  0 , 1

* 

s   2 S  2 P

0

 

(3) At ; . Thus, for , it must be;

.

 

 

 

R S T P

T R R

S T P

S

s* P   

P T

S R

S P T R

 2 

0

S D

r

P

0

0    

S R T P S D

r

T R D

g

P

(26)

Source or sink at Equilibrium; s*

Game class

Trait Nash Equilibrium Sing of GID;

Dg

Sing of RSD;

Dr

(1,0) (0,1)

r g

g r

g r

D D

D D

D D

PD D-dominate (0,1) + + Source sink Saddle Chicken Polymorphic

r g

g r

g r

D D

D D

D

D + - Source Source Sink

Stag Hunt Bi-stable (0,1) or (1,0) - + Sink Sink Source

Trivial C-Dominate (1,0) - - Sink Source Saddle

Summing up all so far, we obtain;

Where  

 

 

 

 

 

g r

g r

g r

D D

D D

D D R

S T

P

T R R

S T

P

S

s* P

(27)

Phase diagram of 2 by 2 games D

g

D

r

Chicken PD

Trivial Stag Hunt

(28)

Prisoner’s Dilemma, PD

D

g

D

r

Chicken PD

Trivial Stag Hunt

s

0 1

Source

Sink

All agents are

absorbed by D.

(29)

Chicken

D

g

D

r

Chicken PD

Trivial Stag Hunt

s

0 1

Source

Sink

All agents are

absorbed by Internal

Equilibrium.

D-dominate

Source

(30)

Stag Hunt

D

g

D

r

Chicken PD

Trivial Stag Hunt

s

0 1

Sink

Depending on initial distribution, some agents are absorbed by D and other are

absorbed by C.

D-dominate Source

Sink

(31)

Trivial, dilemma free game

D

g

D

r

Chicken PD

Trivial Stag Hunt

s

0 1

Source

Sink

All agents are

absorbed by C.

Polymorphic

Bi-stable

(32)

Phase diagram of 2 by 2 games D

g

D

r

Chicken PD

Trivial Stag Hunt Polymorphic D-dominate

C-dominate Bi-stable

(33)

Backgrounds & Purpose

Most previous studies

Entirely

cooperation

Entirely

defection

Agents can offer either

cooperation

(C) or defection (D)

The real world

Actual options might be

continuous rather than discrete Entirely

cooperation

Entirely

defection

Discrete strategy Continuous or mixed strategy

One crucial question is whether there is a considerable

difference in game equilibria between the continuous or

mixed strategies and those of discrete strategies?

(34)

Continuous strategy Mixed strategy

1.0

1.0 0 0

C D

C

1, 1 -Dr, 1+Dg

D

1+Dg, -Dr 0, 0

Agent i

Agent j

1. Strategy value:

2. Payoff function

(0.8) (0.5) (0.2)

S(=-Dr

)

T(=1+Dg

)

R(=1) P(=0)

(0.8) (0.5) (0.2)

Setting for continuous, and mixed strategy games

] 1 , 0 [

i

s

si=1 complete cooperation si=0 complete defection

1. Strategy value:

si=1 complete cooperation si=0 complete defection

j i

j

i

s D s D s

s , ) ( 1 )

(  

r

 

g

j i

s s D

D )

( 

g

r

2. Payoff function

Agents can only offer either

C

or D according to this strategy

C

when Rnd[] < s

i

, otherwise D

Rnd[ ]: a random number

] 1 , 0 [

i

s

0.3

0.7

(35)

Results

C D

C 1 -Dr D 1+Dg 0 Dg

GID

Dr

RAD Averaged cooperation fraction

C D

0.2 0 0.4 0.6 0.8 1

0.2

0 0.4 0.6 0.8 1 0.2

0 0.4 0.6 0.8 1

0.2

0 0.4 0.6 0.8 1 0.2

0 0.4 0.6 0.8 1

0.2

0 0.4 0.6 0.8 1

Discrete strategy Continuous strategy Mixed strategy

Dg

Dr Dr Dr

Games are played on lattices (k = 8)

0 1

参照

関連したドキュメント

Theorem 1.2 For each connected graph = (f, α) defined in Construction 6.1, with automorphism group A = Aut () given in Proposition 8.1, G is an index two subgroup of A, is

Key words: Evolution family of bounded linear operators, evolution operator semigroup, Rolewicz’s theorem.. 2001 Southwest Texas

Following Speyer, we give a non-recursive formula for the bounded octahedron recurrence using perfect matchings.. Namely, we prove that the solution of the recur- rence at some

Motivated by complex periodic boundary conditions which arise in certain problems such as those of modelling the stator of a turbogenerator (see next section for detail), we give

A sequence α in an additively written abelian group G is called a minimal zero-sum sequence if its sum is the zero element of G and none of its proper subsequences has sum zero..

In this paper, based on the concept of rough variable proposed by Liu 14, we discuss a simplest game, namely, the game in which the number of players is two and rough payoffs which

However, for σ-weakly closed bimodules over a Cartan MASA in a hyperfinite von Neumann algebra, the Spectral Theorem for Bimodules follows from a more general result of Fulman, see

Related to this, we examine the modular theory for positive projections from a von Neumann algebra onto a Jordan image of another von Neumann alge- bra, and use such projections