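Author's postprint, full text. Academic Information Repository of The Graduate University for Advanced Studies (SOKENDAI).

Figure 1 (graphic not reproduced). Recoverable labels: Transmission types, Strategies, Mutant; plotted quantities $z_{i,\tau}(T)$ and $z_{\rho(i),\tau-1}(T)$.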

Figure 2 (graphic not reproduced). Panels: (a) $x^*$, (b) $v^*$, (c) $z^*(T)$ (vertical axis ticks from $10^{-2}$ to $10^{8}$). Horizontal axis ticks run from 0 to 40. Curves: COS; $q = 1 - 10^{-6}$ ($N = 10^6$); $q = 1 - 10^{-4}$ ($N = 10^4$); $q = 1 - 10^{-2}$ ($N = 10^2$).

Figure 3 (graphic not reproduced). Panels (a) and (b). Plotted quantities: $x$, $v$, and $z(T)$. Horizontal axis: Generation ($\times 10^5$), 0 to 20.

Figure 4 (graphic not reproduced). Panels: (a) $x$, (b) $v$, (c) $z(T)$. Horizontal axis ticks run from 0 to 40. Curves: $q = 1.0$, $q = 0.99$, $q = 0.9$, and Theory ($q = 1.0$).

Figure 5 (graphic not reproduced). Curves: Mutant (COS) fitness, Resident (ESS) fitness, and Probability of mutant survival. Axes: Generation (horizontal, 0 to 20), Fitness, and Probability.

Symbol    Meaning
$N$    Population size
$q$    Vertical transmission rate
$T$    Lifetime
$\beta$    Efficiency of social learning
$\alpha$    Efficiency of individual learning
$v_{i,\tau}$    The fraction of the lifetime invested in learning by individual $(i, \tau)$
$x_{i,\tau}$    The fraction of the learning time invested in individual learning by individual $(i, \tau)$
$z_{i,\tau}(t)$    The z-value of individual $(i, \tau)$ at within-generation time $t$
$\tilde{z}(T)$    The equilibrium mature z-value in a genetically monomorphic population
$w_{i,\tau}$    The fitness of individual $(i, \tau)$
$\tilde{w}$    The equilibrium fitness in a genetically monomorphic population
$v^\circ$, $x^\circ$, $\tilde{z}^\circ(T)$    The COS values of $v_{i,\tau}$, $x_{i,\tau}$, and $\tilde{z}(T)$, respectively
$v^*$, $x^*$, $\tilde{z}^*(T)$    The ESS values of $v_{i,\tau}$, $x_{i,\tau}$, and $\tilde{z}(T)$, respectively
$\bar{v}$, $\bar{x}$, $\bar{z}(T)$    The population averages of $v_{i,\tau}$, $x_{i,\tau}$, and $z_{i,\tau}(T)$, respectively

Table 1: Notation

Supporting information

A Paradox of Cumulative Culture

Yutaka Kobayashi^{a,b}, Joe Y. Wakano^{c}, Hisashi Ohtsuki^{d}

March 12, 2015

^a Research Center for Social Design Engineering, Kochi University of Technology, Kochi 782-8502, Japan
^b Department of Management, Kochi University of Technology, Kochi 782-8502, Japan
^c Meiji Institute for Advanced Study of Mathematical Sciences, Nakano, Tokyo 164-8525, Japan
^d The Graduate University for Advanced Studies, Shonan Village, Hayama, Kanagawa 240-0193, Japan

Appendix A: Derivation of the COS

To derive the COS, let us assume that the population is monomorphic for a learning strategy $(x, v)$. Solving eq. (1) in the main text with respect to $z_{i,\tau}(t)$ under the assumption that $z_{i,\tau}(0) = 0$ and $(x_{i,\tau}, v_{i,\tau}) = (x, v)$, we have

$$z_{i,\tau}(t) = z_{\rho_\tau(i),\tau-1}(T)\left(1 - e^{-\beta t}\right). \tag{11}$$

It follows that the z-value at the end of the social learning stage ($t = v(1-x)$) is given by

$$z_{i,\tau}(v(1-x)) = z_{\rho_\tau(i),\tau-1}(T)\left(1 - e^{-\beta v(1-x)}\right). \tag{12}$$

Further, from eq. (2) in the main text, the value of $z_{i,\tau}(t)$ at the end of the individual-learning stage ($t = v$) is given by

$$\begin{aligned} z_{i,\tau}(v) &= z_{i,\tau}(v(1-x)) + vx \\ &= z_{\rho_\tau(i),\tau-1}(T)\left(1 - e^{-\beta v(1-x)}\right) + vx. \end{aligned} \tag{13}$$

Noting that $z_{i,\tau}(v) = z_{i,\tau}(T)$, we have

$$z_{i,\tau}(T) = z_{\rho_\tau(i),\tau-1}(T)\left(1 - e^{-\beta v(1-x)}\right) + vx. \tag{14}$$

This equation gives the between-generation dynamics of $z_{i,\tau}(T)$. From eq. (14), the equilibrium value of $z_{i,\tau}(T)$, denoted by $\tilde{z}(T)$, is given by

$$\tilde{z}(T) = \lim_{\tau\to\infty} z_{i,\tau}(T) = vx\,e^{\beta v(1-x)}. \tag{15}$$

The equilibrium fitness function, denoted by $\tilde{w}$, is therefore given by

$$\begin{aligned} \tilde{w} &= \lim_{\tau\to\infty} w_{i,\tau} = \lim_{\tau\to\infty} z_{i,\tau}(T)\cdot(1-v) \\ &= v(1-v)x\,e^{\beta v(1-x)}. \end{aligned} \tag{16}$$

The COS is the strategy $(x, v)$ that maximizes eq. (16). It is easily shown that the strategy $(x^\circ, v^\circ)$ given by eq. (6) in the main text maximizes eq. (16) and hence gives the COS.
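As a numerical sanity check on eqs. (14)-(16), the following Python sketch iterates eq. (14) until the mature z-value settles and compares the result with eq. (15), then locates the maximum of eq. (16) by brute-force grid search. The function names, the value $\beta = 5$, and the grid resolution are illustrative assumptions and do not come from the paper. The reference point $(x, v) = (1/(\beta-1),\ 1-1/\beta)$ is obtained here by setting the partial derivatives of the logarithm of eq. (16) to zero (valid for $\beta > 2$); it is not quoted from eq. (6) of the main text, which this appendix does not reproduce.

```python
import math


def next_z(z_parent, x, v, beta):
    """Eq. (14): mature z-value of a learner whose role model has z_parent."""
    return z_parent * (1.0 - math.exp(-beta * v * (1.0 - x))) + v * x


def z_equilibrium(x, v, beta):
    """Eq. (15): equilibrium mature z-value in a monomorphic population."""
    return v * x * math.exp(beta * v * (1.0 - x))


def w_equilibrium(x, v, beta):
    """Eq. (16): equilibrium fitness in a monomorphic population."""
    return (1.0 - v) * z_equilibrium(x, v, beta)


def iterate_z(x, v, beta, generations=2000, z0=0.0):
    """Apply eq. (14) repeatedly, starting from an arbitrary z0 (assumed 0)."""
    z = z0
    for _ in range(generations):
        z = next_z(z, x, v, beta)
    return z


def grid_argmax(beta, steps=200):
    """Brute-force maximizer of eq. (16) over an interior grid of (x, v)."""
    best_w, best_x, best_v = -1.0, None, None
    for i in range(1, steps):
        x = i / steps
        for j in range(1, steps):
            v = j / steps
            w = w_equilibrium(x, v, beta)
            if w > best_w:
                best_w, best_x, best_v = w, x, v
    return best_w, best_x, best_v


if __name__ == "__main__":
    beta = 5.0  # illustrative value, not taken from the paper
    print("iterated z:", iterate_z(0.25, 0.8, beta))
    print("eq. (15)  :", z_equilibrium(0.25, 0.8, beta))
    print("grid maximum (w, x, v):", grid_argmax(beta))
    print("stationary point (x, v):", 1.0 / (beta - 1.0), 1.0 - 1.0 / beta)
```

A grid search is used instead of a gradient method so that the check does not depend on the same calculus it is meant to confirm.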

Appendix B: Derivation of the ESS in an infinite population

We define an evolutionarily stable learning strategy in an infinite population

as a learning strategy that is resistant against invasion by rare mutants with

any slightly deviated strategy. We will derive eq. (7) in the main text, which

an ESS must satisfy.

Let $(x, v)$ and $(x', v')$ denote the resident and mutant strategies, respectively. We assume that the resident population is at cultural equilibrium, so that all residents have the z-value given by eq. (15) at the end of the learning stage. In order to derive the ESS, we classify individuals as follows. Residents are class 0. The mutants who socially learned from residents are class 1. The mutants who socially learned from class-1 individuals are class 2. Class-$j$ individuals are defined recursively. Note that offspring of class-$j$ mutants fall back to class 1 when their cultural role models are residents (oblique social learning). In this case, the cultural accumulation achieved by mutants over $j$ generations is reset.

From eq. (14), the mature z-value of an individual $(i, \tau)$ in class $j \ge 1$ satisfies

$$z_{i,\tau}(T) = z_{\rho_\tau(i),\tau-1}(T)\left(1 - e^{-\beta v'(1-x')}\right) + v'x'. \tag{17}$$

Note that the above equation applies recursively, so that $z_{\rho_\tau(i),\tau-1}(T)$ is given as a function of $z_{\rho_{\tau-1}(\rho_\tau(i)),\tau-2}(T)$, which is in turn given as a function of $z_{\rho_{\tau-2}(\rho_{\tau-1}(\rho_\tau(i))),\tau-3}(T)$, and so on. Given that individual $(i, \tau)$ belongs to class $j$, individual $(\rho_{\tau-(j-1)}(\rho_{\tau-(j-2)}(\dots(\rho_{\tau-1}(\rho_\tau(i)))\dots)),\ \tau - j)$ belongs to class 0 and is hence a resident. Noting this and eq. (15), eq. (17) can be solved to yield

$$z_{i,\tau}(T) = v'x'e^{\beta v'(1-x')} + r^{C_\tau(i)}\left(vx\,e^{\beta v(1-x)} - v'x'e^{\beta v'(1-x')}\right), \tag{18}$$

where $C_\tau(i)$ denotes the class of individual $(i, \tau)$ and

$$r = 1 - e^{-\beta v'(1-x')}. \tag{19}$$

Note that eq. (18) does not depend on $i$ and $\tau$ but only on the class $C_\tau(i)$ of individual $(i, \tau)$. This implies that the fitness of an individual also depends only on its class. Therefore, we let $w'_j$ denote the fitness of class-$j$ mutants:

$$w'_{C_\tau(i)} := z_{i,\tau}(T)(1 - v'), \qquad (C_\tau(i) \ge 1). \tag{20}$$

It is easily confirmed that mutants have the same fitness as residents irrespective of class (i.e. $w'_j = \tilde{w} = v(1-v)x\,e^{\beta v(1-x)}$ for arbitrary $j \ge 1$) if they adopt the same strategy as residents ($(x', v') = (x, v)$).

Let $p_{j,\tau}$ denote the frequency of class-$j$ mutants ($j \ge 1$) in the population in generation $\tau$. Since mutants are rare, we may assume that a mutant's role model is a mutant only when vertical transmission occurs. The offspring of a class-$j$ mutant hence belong to class $j+1$ and class 1 with probabilities $q$ and $1-q$, respectively. Further, because mutants are rare, the average fitness of the population is approximated by the residents' fitness $\tilde{w}$ given by eq. (16). From these arguments, it holds that

$$p_{1,\tau+1} = \sum_{j=1}^{\infty} (1-q)\frac{w'_j}{\tilde{w}}\,p_{j,\tau}, \tag{21}$$

$$p_{j+1,\tau+1} = q\,\frac{w'_j}{\tilde{w}}\,p_{j,\tau}, \tag{22}$$

where $j \ge 1$.

Note that the above equations are formally equivalent to the standard model of age structure. Substituting $p_{j,\tau+1} = \lambda p_{j,\tau}$ into eqs. (21) and (22) and rearranging the resulting equations, it is easily shown that the leading eigenvalue $\lambda$, i.e. the asymptotic growth rate of mutants, should satisfy the following (Euler-Lotka) characteristic equation:

$$1 = \sum_{i=0}^{\infty} (1-q)\,q^{i}\lambda^{-i-1}\prod_{j=1}^{i+1}\frac{w'_j}{\tilde{w}}. \tag{23}$$

Note that, when mutants have the same fitness as residents (i.e. $w'_j = \tilde{w}$ for all $j$), $\lambda = 1$ is the only solution of eq. (23). This implies that the frequency of mutants remains constant when they adopt the same strategy as residents.

Differentiating eq. (23) with respect to a mutant strategic variable $y'$ ($y' \in \{x', v'\}$) yields

$$0 = \sum_{i=0}^{\infty} (1-q)\,q^{i}(-i-1)\lambda^{-i-2}\frac{\partial\lambda}{\partial y'}\prod_{j=1}^{i+1}\frac{w'_j}{\tilde{w}} + \sum_{i=0}^{\infty} (1-q)\,q^{i}\lambda^{-i-1}\sum_{k=1}^{i+1}{w'_k}^{-1}\frac{\partial w'_k}{\partial y'}\prod_{j=1}^{i+1}\frac{w'_j}{\tilde{w}}. \tag{24}$$

Substituting $x' = x$, $v' = v$, $w'_j = \tilde{w}$, and $\lambda = 1$ into eq. (24) and rearranging the resulting equation yields

$$\tilde{w}\left.\frac{\partial\lambda}{\partial y'}\right|_{x'=x,\,v'=v} = \left.\frac{\partial\bar{w}'}{\partial y'}\right|_{x'=x,\,v'=v}, \tag{25}$$

where

$$\bar{w}' = \sum_{i=1}^{\infty} (1-q)\,q^{i-1}w'_i. \tag{26}$$

If the asymptotic growth rate of mutants is larger than one, mutants can invade. Therefore, for the resident strategy $(x, v)$ to be evolutionarily stable, $\lambda$ must be maximized at $(x', v') = (x, v)$ as a function of the mutant strategy $(x', v')$. This and eq. (25) together imply that $\bar{w}'$ is also maximized at $(x', v') = (x, v)$. Thus, for our ESS analysis we may treat $\bar{w}'$ like the mutant invasion fitness.
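Equations (23) and (25) can be checked numerically. The Python sketch below solves a truncated version of eq. (23) for $\lambda$ by bisection and compares a central finite difference of $\lambda$ with that of $\bar{w}'$ from eq. (26). All function names, the truncation length, the bisection bracket, and the parameter values in the usage block are assumptions made for illustration; the class fitnesses come from eqs. (18)-(20).

```python
import math


def equilibrium_z(x, v, beta):
    """Eq. (15): equilibrium mature z-value under strategy (x, v)."""
    return v * x * math.exp(beta * v * (1.0 - x))


def class_fitness(j, xm, vm, x, v, beta):
    """Eqs. (18) and (20): fitness of a class-j mutant (j >= 1)."""
    r = 1.0 - math.exp(-beta * vm * (1.0 - xm))  # eq. (19)
    z_mut = equilibrium_z(xm, vm, beta)
    z_res = equilibrium_z(x, v, beta)
    return (1.0 - vm) * (z_mut + r**j * (z_res - z_mut))


def euler_lotka_rhs(lam, xm, vm, x, v, beta, q, terms=400):
    """Right-hand side of eq. (23), truncated after `terms` classes."""
    w_res = (1.0 - v) * equilibrium_z(x, v, beta)  # eq. (16)
    total = 0.0
    weight = (1.0 - q) / lam
    for i in range(terms):
        weight *= class_fitness(i + 1, xm, vm, x, v, beta) / w_res
        total += weight          # term i of the sum in eq. (23)
        weight *= q / lam        # prepare the factor q^(i+1) * lam^-(i+2)
    return total


def growth_rate(xm, vm, x, v, beta, q, lo=0.9, hi=1.1):
    """Solve eq. (23) for lambda by bisection (the RHS decreases in lambda).

    The bracket [lo, hi] is an assumption suited to near-neutral mutants.
    """
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if euler_lotka_rhs(mid, xm, vm, x, v, beta, q) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def mean_mutant_fitness(xm, vm, x, v, beta, q, terms=400):
    """Eq. (26), truncated after `terms` classes."""
    return sum((1.0 - q) * q**(i - 1) * class_fitness(i, xm, vm, x, v, beta)
               for i in range(1, terms + 1))


if __name__ == "__main__":
    x, v, beta, q, h = 0.5, 0.6, 5.0, 0.5, 1e-4  # illustrative values
    w_res = (1.0 - v) * equilibrium_z(x, v, beta)
    d_lam = (growth_rate(x + h, v, x, v, beta, q)
             - growth_rate(x - h, v, x, v, beta, q)) / (2 * h)
    d_w = (mean_mutant_fitness(x + h, v, x, v, beta, q)
           - mean_mutant_fitness(x - h, v, x, v, beta, q)) / (2 * h)
    print("neutral lambda:", growth_rate(x, v, x, v, beta, q))
    print("LHS of eq. (25):", w_res * d_lam)
    print("RHS of eq. (25):", d_w)
```

The resident strategy in the usage block is deliberately not an ESS: eq. (25) holds at neutrality for any resident at cultural equilibrium, so both sides should agree even when the derivative is nonzero.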

In fact, $\bar{w}'$ can be interpreted as the asymptotic average of the mutant invasion fitness, as follows. Note that the leading eigenvector of the system (21)-(22) is given by $(1, q, q^2, \dots, q^{i-1}, \dots)$. This means that the fraction of class $i$ among mutants asymptotically approaches $(1-q)q^{i-1}$ when selection is absent ($(x', v') = (x, v)$). Thus, when selection is sufficiently weak, the average fitness of mutants is asymptotically given by $\sum_{i=1}^{\infty}(1-q)q^{i-1}w'_i = \bar{w}'$.

Using eqs. (18), (20), and (26), we find that

$$\bar{w}' = (1-v')\,v'x'e^{\beta v'(1-x')} + \frac{(1-v')\,r(1-q)}{1-rq}\left(vx\,e^{\beta v(1-x)} - v'x'e^{\beta v'(1-x')}\right). \tag{27}$$

For $(x, v)$ to be the ESS, $\bar{w}'$ as a function of $(x', v')$ must be maximized at $(x', v') = (x, v)$. Thus, the ESS $(x^*, v^*)$ satisfies

$$\left.\frac{\partial\bar{w}'}{\partial x'}\right|_{x'=x=x^*,\,v'=v=v^*} = 0, \tag{28}$$

$$\left.\frac{\partial\bar{w}'}{\partial v'}\right|_{x'=x=x^*,\,v'=v=v^*} = 0. \tag{29}$$

It is easily shown that these equations reduce to eqs. (7a) and (7b) in the main text. Finally, substituting eq. (7a) in the main text into eq. (15) yields eq. (7c).
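The closed forms in eqs. (18) and (27) can be checked against the recursions they summarize. The sketch below, in Python, applies eq. (17) $j$ times starting from a resident role model and compares the result with eq. (18), then compares a truncated version of the series in eq. (26) with eq. (27). Function names (including the helper `retention` for $r$), the truncation length, and the parameter values are hypothetical choices for illustration.

```python
import math


def equilibrium_z(x, v, beta):
    """Eq. (15): equilibrium mature z-value under strategy (x, v)."""
    return v * x * math.exp(beta * v * (1.0 - x))


def retention(xm, vm, beta):
    """Eq. (19): the factor r for the mutant strategy (xm, vm)."""
    return 1.0 - math.exp(-beta * vm * (1.0 - xm))


def mutant_z_by_recursion(j, xm, vm, x, v, beta):
    """Apply eq. (17) j times, starting from a class-0 (resident) role model."""
    r = retention(xm, vm, beta)
    z = equilibrium_z(x, v, beta)
    for _ in range(j):
        z = z * r + vm * xm
    return z


def mutant_z_closed_form(j, xm, vm, x, v, beta):
    """Eq. (18): mature z-value of a class-j mutant."""
    r = retention(xm, vm, beta)
    z_mut = equilibrium_z(xm, vm, beta)
    return z_mut + r**j * (equilibrium_z(x, v, beta) - z_mut)


def mean_fitness_series(xm, vm, x, v, beta, q, terms=5000):
    """Eq. (26), truncated, with class fitnesses from eq. (20)."""
    return sum((1.0 - q) * q**(i - 1) * (1.0 - vm)
               * mutant_z_closed_form(i, xm, vm, x, v, beta)
               for i in range(1, terms + 1))


def mean_fitness_closed_form(xm, vm, x, v, beta, q):
    """Eq. (27)."""
    r = retention(xm, vm, beta)
    z_mut = equilibrium_z(xm, vm, beta)
    z_res = equilibrium_z(x, v, beta)
    return (1.0 - vm) * (z_mut + r * (1.0 - q) / (1.0 - r * q) * (z_res - z_mut))


if __name__ == "__main__":
    # Mutant (0.3, 0.7) in a resident (0.25, 0.8) population; illustrative values.
    args = (0.3, 0.7, 0.25, 0.8, 5.0)
    print("class 3, recursion  :", mutant_z_by_recursion(3, *args))
    print("class 3, eq. (18)   :", mutant_z_closed_form(3, *args))
    print("series, eq. (26)    :", mean_fitness_series(*args, 0.9))
    print("closed form, eq. (27):", mean_fitness_closed_form(*args, 0.9))
```

Setting `q = 1.0` in `mean_fitness_closed_form` removes the second term of eq. (27), leaving eq. (16) evaluated at the mutant strategy, which is why pure vertical transmission in an infinite population makes the maximizer of $\bar{w}'$ coincide with the maximizer of eq. (16).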

Appendix C: Derivation of the ESS in a finite population

Here we derive the ESS in a finite population assuming pure vertical transmission ($q = 1$) (eq. (9) in the main text). More specifically, we show that the ESS for a finite population of size $N$ under $q = 1$ is identical to the ESS for an infinite population under $q = 1 - 1/N$. Thus, in terms of the ESS, decreasing the population size from $\infty$ to $N$ under $q = 1$ has exactly the same effect as decreasing $q$ by $1/N$ in an infinite population.

To compute the ESS under $q = 1$, we need the fixation probability of a mutant strategy that is initially expressed by a single individual. For this purpose, we apply the method introduced by Rousset (2004) below.

Imagine that a mutant strategy $(x', v')$ is expressed by a single individual in a population of the resident strategy $(x, v)$. For convenience, let us reuse the classification of individuals introduced in Appendix B. Then, the initial single mutant is of class 1 because there is no mutant in the previous generation. Since $q = 1$ (pure vertical transmission), any mutant in any generation $\tau$ inherits culture from its own parent, which is a mutant in generation $\tau - 1$. This implies that all mutants in generation $\tau$ belong to class $\tau$ ($C_\tau(i) = \tau$ for any mutant $(i, \tau)$), given that the mutant was introduced in generation 1. Therefore, all mutants in generation $\tau$ have equal fitness $w'_\tau$, given by eq. (20). Importantly, the mutant fitness is not a stochastic variable but is determined by the number of generations elapsed since the introduction of the initial mutant. By virtue of this property, we can treat this process as a Wright-Fisher process in which the selection coefficient depends deterministically on time (see below).

Let $P_\tau$ denote the frequency of mutants in generation $\tau$. Since all mutants in generation $\tau$ belong to class $\tau$, it holds that $P_\tau = \sum_j p_{j,\tau} = p_{\tau,\tau}$ in the notation of Appendix B. Note that we assume a Wright-Fisher-type update for the genetic state of the population and that culture is transmitted between adjacent generations; thus, $P_\tau$ obeys a time-inhomogeneous Markov process with the initial state $P_1 = 1/N$. This stochastic process has only two absorbing states: $P_\tau = 1$ (fixation) and $P_\tau = 0$ (extinction). Let $\pi$ denote the fixation probability of the mutant strategy. Then, the expected frequency of mutants in the infinitely distant future should be given by

$$\lim_{\tau\to\infty} E[P_\tau] = 1\cdot\pi + 0\cdot(1-\pi) = \pi, \tag{30}$$

where $E[\cdot]$ denotes expectation. Below we use this relationship to compute $\pi$.

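Note that we can write

$$P_\tau = P_1 + \Delta P_1 + \Delta P_2 + \dots + \Delta P_{\tau-1}, \tag{31}$$

where $\Delta P_\tau = P_{\tau+1} - P_\tau$ denotes the frequency change between generations $\tau$ and $\tau + 1$ and is a stochastic variable itself. Substituting eq. (31) into eq. (30) yields

$$\begin{aligned} \pi &= E\!\left[P_1 + \sum_{\tau=1}^{\infty}\Delta P_\tau\right] \\ &= \frac{1}{N} + \sum_{\tau=1}^{\infty} E[\Delta P_\tau], \end{aligned} \tag{32}$$

where we used $E[P_1] = P_1 = 1/N$. From the standard theory of population genetics, the frequency change $\Delta P_\tau$ (in expectation over Wright-Fisher sampling, conditional on $P_\tau$) is given by

$$\Delta P_\tau = \frac{w'_\tau - \tilde{w}}{\tilde{w} + P_\tau(w'_\tau - \tilde{w})}\,P_\tau(1 - P_\tau), \tag{33}$$

where $\tilde{w}$ is the equilibrium fitness of residents given by eq. (16). Let us define the selection coefficient $s_\tau$ as

$$s_\tau = \frac{w'_\tau - \tilde{w}}{\tilde{w}}. \tag{34}$$

Substituting eq. (34) into eq. (33) yields

$$\Delta P_\tau = \frac{s_\tau}{1 + P_\tau s_\tau}\,P_\tau(1 - P_\tau) \approx s_\tau P_\tau(1 - P_\tau), \tag{35}$$

where the approximation holds for small $s_\tau$.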

Substituting eq. (35) into eq. (32) yields

$$\pi \approx \frac{1}{N} + \sum_{\tau=1}^{\infty} s_\tau E[P_\tau(1 - P_\tau)]. \tag{36}$$

Note that the expectation $E[P_\tau(1 - P_\tau)]$ in the above equation is itself affected by the selection coefficients of up to generation $\tau - 1$ (i.e., $s_1, s_2, s_3, \dots, s_{\tau-1}$). However, Rousset (2004) has shown that the expectation $E[\cdot]$ can be approximately replaced by the expectation under neutrality (i.e. $s_1 = s_2 = \dots = s_\tau = \dots = 0$) provided selection is sufficiently weak. We denote the expectation under neutrality by $E^\circ[\cdot]$ following Rousset (2004). Thus, it holds that

$$\pi \approx \frac{1}{N} + \sum_{\tau=1}^{\infty} s_\tau E^\circ[P_\tau(1 - P_\tau)]. \tag{37}$$

Note that $E^\circ[2P_\tau(1 - P_\tau)]$ can be interpreted as the probability that two individuals drawn at random with replacement from generation $\tau$ have different genotypes under selective neutrality. Two such individuals can have different genotypes only if their ancestral lineages trace back to generation 1 without coalescing and, in addition, only one of them hits the initial mutant. From standard coalescent theory, this probability is given by

$$\begin{aligned} E^\circ[2P_\tau(1 - P_\tau)] &= \left(1 - \frac{1}{N}\right)^{\tau-1}\cdot 2P_1(1 - P_1) \\ &= 2\,\frac{1}{N}\left(1 - \frac{1}{N}\right)^{\tau}, \end{aligned} \tag{38}$$

where we used $P_1 = 1/N$.
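Equation (38) can be confirmed without simulation by propagating the exact distribution of the mutant count through the neutral Wright-Fisher transition probabilities. The Python sketch below does this for a small population; the function names and the choice $N = 10$ are illustrative assumptions, and the binomial transition matrix is the standard haploid Wright-Fisher model rather than code from the paper.

```python
from math import comb


def neutral_wf_heterozygosity(N, generations):
    """Return E°[2 P_tau (1 - P_tau)] for tau = 1..generations.

    The population starts with one mutant copy (P_1 = 1/N) and evolves
    under neutral Wright-Fisher sampling; the distribution of the mutant
    count is propagated exactly, so the result is not a stochastic estimate.
    """
    # trans[k][m]: probability of m mutants next generation given k now.
    trans = [[comb(N, m) * (k / N) ** m * (1 - k / N) ** (N - m)
              for m in range(N + 1)]
             for k in range(N + 1)]
    dist = [0.0] * (N + 1)
    dist[1] = 1.0  # generation 1: exactly one mutant
    values = []
    for _ in range(generations):
        values.append(sum(p * 2 * (k / N) * (1 - k / N)
                          for k, p in enumerate(dist)))
        dist = [sum(dist[k] * trans[k][m] for k in range(N + 1))
                for m in range(N + 1)]
    return values


def eq38(N, tau):
    """Right-hand side of eq. (38)."""
    return 2 * (1 / N) * (1 - 1 / N) ** tau


if __name__ == "__main__":
    N = 10  # illustrative population size
    for tau, exact in enumerate(neutral_wf_heterozygosity(N, 5), start=1):
        print(tau, exact, eq38(N, tau))
```

The agreement reflects the fact that binomial resampling of $N$ individuals multiplies the expected value of $P(1-P)$ by $1 - 1/N$ each generation.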

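Substituting eqs. (34) and (38) into eq. (37) yields

$$\begin{aligned} \pi &\approx \frac{1}{N} + \frac{1}{N}\sum_{\tau=1}^{\infty}\left(\frac{w'_\tau}{\tilde{w}} - 1\right)\left(1 - \frac{1}{N}\right)^{\tau} \\ &= \frac{1}{N} + \left(1 - \frac{1}{N}\right)\left(\frac{\bar{w}'}{\tilde{w}} - 1\right), \end{aligned} \tag{39}$$

where

$$\bar{w}' = \sum_{\tau=1}^{\infty} w'_\tau\,\frac{1}{N}\left(1 - \frac{1}{N}\right)^{\tau-1}. \tag{40}$$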

Remember that for a finite population we define an ESS as the strategy that never allows a mutant strategy expressed by a single individual to have a fixation probability higher than $1/N$ (i.e. the fixation probability of the ESS itself). This implies that for our ESS analysis we can treat $\bar{w}'$ like the mutant invasion fitness in the standard ESS analysis of an infinite-population model. Note that eq. (40) is formally identical to eq. (26) except that $q$ is replaced by $1 - 1/N$. This implies that the ESS for a finite population under pure vertical transmission ($q = 1$) is equivalent to the ESS for an infinite population with $q = 1 - 1/N$.
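The correspondence between eqs. (40) and (26) can be illustrated numerically. The Python sketch below evaluates the first line of eq. (39) and the series in eq. (40) by truncation, and compares eq. (40) with the closed form of eq. (27) evaluated at $q = 1 - 1/N$. Function names, the truncation horizon of $50N$ generations, and the parameter values are assumptions for illustration; the fixation probability is the weak-selection approximation of eq. (39), not an exact value.

```python
import math


def equilibrium_z(x, v, beta):
    """Eq. (15): equilibrium mature z-value under strategy (x, v)."""
    return v * x * math.exp(beta * v * (1.0 - x))


def class_fitness(j, xm, vm, x, v, beta):
    """Eqs. (18) and (20): fitness of a class-j mutant (j >= 1)."""
    r = 1.0 - math.exp(-beta * vm * (1.0 - xm))
    z_mut = equilibrium_z(xm, vm, beta)
    z_res = equilibrium_z(x, v, beta)
    return (1.0 - vm) * (z_mut + r**j * (z_res - z_mut))


def fixation_probability(xm, vm, x, v, beta, N, horizon=None):
    """First line of eq. (39), truncated after `horizon` generations."""
    horizon = 50 * N if horizon is None else horizon
    w_res = (1.0 - v) * equilibrium_z(x, v, beta)
    drift = 1.0 - 1.0 / N
    total = sum((class_fitness(t, xm, vm, x, v, beta) / w_res - 1.0) * drift**t
                for t in range(1, horizon + 1))
    return 1.0 / N + total / N


def mean_fitness_finite(xm, vm, x, v, beta, N, horizon=None):
    """Eq. (40), truncated after `horizon` generations."""
    horizon = 50 * N if horizon is None else horizon
    drift = 1.0 - 1.0 / N
    return sum(class_fitness(t, xm, vm, x, v, beta) * drift**(t - 1) / N
               for t in range(1, horizon + 1))


def mean_fitness_infinite(xm, vm, x, v, beta, q):
    """Eq. (27): closed form for an infinite population."""
    r = 1.0 - math.exp(-beta * vm * (1.0 - xm))
    z_mut = equilibrium_z(xm, vm, beta)
    z_res = equilibrium_z(x, v, beta)
    return (1.0 - vm) * (z_mut + r * (1.0 - q) / (1.0 - r * q) * (z_res - z_mut))


if __name__ == "__main__":
    N, beta = 100, 5.0               # illustrative values
    resident = (0.25, 0.8)
    mutant = (0.26, 0.79)
    print("eq. (40):", mean_fitness_finite(*mutant, *resident, beta, N))
    print("eq. (27) at q = 1 - 1/N:",
          mean_fitness_infinite(*mutant, *resident, beta, 1.0 - 1.0 / N))
    print("pi from eq. (39):", fixation_probability(*mutant, *resident, beta, N))
    print("neutral benchmark 1/N:", 1.0 / N)
```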

Appendix D: Probabilistic engagement in social and individual learning

In the main text, we assumed that social and individual learning occur in separate stages of life. In this Appendix, we instead assume that each individual engages in individual and social learning with probabilities $x$ and $1 - x$, respectively, at any moment in the learning stage, and derive eq. (14) under some additional assumptions. Thus, the results of the present paper all apply to this modified model.

Suppose that $z_{i,\tau}(t)$ represents the amount of knowledge that the individual $(i, \tau)$ acquires by time $t$ through individual and social learning. Let $z_{i,\tau,IL}(t)$ and $z_{i,\tau,SL}(t)$ denote the amounts of knowledge acquired through individual and social learning, respectively, by time $t$. In addition, assume that the knowledge acquired through individual learning never overlaps with that acquired through social learning. This implies that any piece of knowledge produced by an individual through individual learning is always new to the role model of the focal individual as well as to the focal individual itself. Then, the total amount of knowledge individual $(i, \tau)$ bears is given by $z_{i,\tau}(t) = z_{i,\tau,SL}(t) + z_{i,\tau,IL}(t)$.

Note that each individual engages in social learning with probability $1 - x$ at any moment in the learning stage. This implies that $z_{i,\tau,SL}(t)$ grows in the learning stage as follows:

$$\frac{d}{dt}z_{i,\tau,SL}(t) = \beta(1-x)\left(z_{\rho_\tau(i),\tau-1}(T) - z_{i,\tau,SL}(t)\right), \qquad (0 \le t \le v). \tag{41}$$

Likewise, $z_{i,\tau,IL}(t)$ follows

$$\frac{d}{dt}z_{i,\tau,IL}(t) = \alpha x = x, \qquad (0 \le t \le v). \tag{42}$$

Integrating both equations yields

$$z_{i,\tau,SL}(v) = z_{\rho_\tau(i),\tau-1}(T)\left(1 - e^{-\beta v(1-x)}\right). \tag{43}$$
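The integration step can be checked numerically. The Python sketch below integrates eqs. (41) and (42) from $t = 0$ to $t = v$ with the classical fourth-order Runge-Kutta scheme and compares $z_{i,\tau,SL}(v)$ with eq. (43), and the sum of both components with the right-hand side of eq. (14). It assumes zero knowledge at $t = 0$ for both components (consistent with $z_{i,\tau}(0) = 0$ in Appendix A) and $\alpha = 1$ as in eq. (42); the function names, step count, and parameter values are illustrative.

```python
import math


def integrate_learning(z_parent, x, v, beta, alpha=1.0, steps=1000):
    """Integrate eqs. (41) and (42) over 0 <= t <= v with classical RK4.

    Assumes z_SL(0) = z_IL(0) = 0. Returns (z_SL(v), z_IL(v)).
    """
    h = v / steps
    rate = beta * (1.0 - x)

    def social_rhs(z):
        # Right-hand side of eq. (41); it does not depend on t explicitly.
        return rate * (z_parent - z)

    z_sl, z_il = 0.0, 0.0
    for _ in range(steps):
        k1 = social_rhs(z_sl)
        k2 = social_rhs(z_sl + 0.5 * h * k1)
        k3 = social_rhs(z_sl + 0.5 * h * k2)
        k4 = social_rhs(z_sl + h * k3)
        z_sl += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        z_il += h * alpha * x  # eq. (42) has a constant right-hand side
    return z_sl, z_il


def eq43(z_parent, x, v, beta):
    """Closed form for z_SL(v) from eq. (43)."""
    return z_parent * (1.0 - math.exp(-beta * v * (1.0 - x)))


if __name__ == "__main__":
    z_parent, x, v, beta = 2.0, 0.25, 0.8, 5.0  # illustrative values
    z_sl, z_il = integrate_learning(z_parent, x, v, beta)
    print("z_SL(v), numerical:", z_sl)
    print("z_SL(v), eq. (43) :", eq43(z_parent, x, v, beta))
    print("z_SL(v) + z_IL(v) :", z_sl + z_il)
    print("eq. (14) RHS      :", eq43(z_parent, x, v, beta) + v * x)
```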
