060310391 0560565
9
2016/12/5 13:00-14:45
@1 - 4
1
Fig. 4. Convergent evolu0on of feeding morphology and color among East African cichlid fishes. Species from Lake Tanganyika are in the leA column; those from Lake Malawi are to the right. Each of the illustrated fishes from Lake Malawi are more closely related to one another than to any species in Lake Tanganyika.
h/p://www.pnas.org/ Albertson et al. (2003)
1996 2
•
•
•
•
•
•
•
•
•
3
• 2 DNA RNA
• DNA RNA
4
A-TGTAAACGCTA
AGTGTAAT-GCTA
ATGTAAACGCTA
AGTGTAATGCTA
ATGTAAACGCTA
AGTGTAATGCTA
…
5
…
-
3 …
①
②
match 1 (mismatch) −1 -2
① 7×1+5×(-1)=2 ② 10×1+1×(-1)+2×(-2)=5
5
•
–
Needleman-Wunsch
•
–
Smith-Waterman
6
Needleman-Wunsch algorithm
Needleman and Wunsch (1970) J Mol Biol 48:443
- A T G T C
- 0 -2 -4 -6 -8 -10
A -2
G -4
T -6
A -8
C -10
match = 1 mismatch = -1 gap = -2
7
Needleman-Wunsch algorithm
Needleman and Wunsch (1970) J Mol Biol 48:443
A T G T C
0 -2 -4 -6 -8 -10
A -2 1 -1 -3 -5 -7
G -4
T -6
A -8
C -10
match = 1 mismatch = -1 gap = -2
0 -2
-2 -4
0 -2
-2 -4
0 -2
-2 1
3
(a)
(b)
(c)
(a) (c) gap (b) match mismatch 8
Needleman-Wunsch algorithm
Needleman and Wunsch (1970) J Mol Biol 48:443
- A T G T C
- 0 -2 -4 -6 -8 -10
A -2 1 -1 -3 -5 -7
G -4 -1 0 0 -2 -4
T -6 -3 0 -1 1 -1
A -8 -5 -2 -1 -1 0
C -10 -7 -4 -3 -2 0
match = 1 mismatch = -1 gap = -2
9
Needleman-Wunsch algorithm
Needleman and Wunsch (1970) J Mol Biol 48:443
- A T G T C
- 0 -2 -4 -6 -8 -10
A -2 1 -1 -3 -5 -7
G -4 -1 0 0 -2 -4
T -6 -3 0 -1 1 -1
A -8 -5 -2 -1 -1 0
C -10 -7 -4 -1 -2 0
match = 1 mismatch = -1 gap = -2
ATGT-C
A-GTAC
1
2 10
ATGT-C
A-GTAC
4 × 1 + 0 × (-1) + 2 × (-2) = 0
ATGTC
AGTAC
2 × 1 + 3 × (-1) + 0 × (-2) = -1
ATGT-C
AG-TAC
3 × 1 + 1 × (-1) + 2 × (-2) = -2
11
Smith-Waterman algorithm
Smith and Waterman (1981) J Mol Biol 147:195
- A T G T C
- 0 0 0 0 0 0
A 0 1 0 0 0 0
G 0 0 0 1 0 0
T 0 0 0 0 2 1
C 0 0 0 0 1 3
A 0 0 0 0 0 2
match = 1 mismatch = -1 gap = -1
GTC
GTC
1. 0
2. 0
3. 0
4. 0 12
• 2 3
• Needleman-Wunsch
m O(n
m)
• Clustal
13
Clustal
Thompson et al. (1994) Nuc Acid Res 22: 4673
2
NJ
root
root
NJ 1. Hbb_Human vs. Hbb_Horse 2. Hba_Human vs. Hba_Horse 3. (1) vs. (2)
4. (3) vs. Myg_Phyca :
2
14
Thompson et al. (1994) Nuc Acid Res 22: 4673 15
(phylogene_cs)
• DNA
RNA
•
2005
16
•
– UPGMA
•
–
•
–
•
– MCMC
17
• 2
•
A T G A C G A T A
C G G C C A C
A T G T C G A C C
G T
A C
C C
G
A
18
Jukes-Cantor model
q
tq
t+1q
: [ ]/[ ]A T G C
A T G C 1 − λ
λ/3λ/3 λ/3
h/p://www.veritastk.co.jp/kamon/pdf/kamon27/popu27.pdf
qt +1≅ (1 −λ)2qt+ 2(1 −λ)(λ/ 3)(1 − qt)
≅ (1 − 2λ)qt+2λ 3 (1 − qt) qt +1− qt=2λ
3 − 8λ
3qt
dq/dt
dq dt=
2λ 3 −
8λ 3q
q =1 −3
4
(
1 − exp(−8λt 3))
d
dxy + P(x)y = Q(x)
⇒ y = exp(− Pdx∫ )(∫Qexp(∫Pdx) + C)
2λt = −3 4ln 1−
4 3(1− q)
$
% & '
( ) d =−3 4ln 1−
4 3p
#
$ % & ' (
d=2λt (
p = 1−q 19
C
t = 0 q=1
Kimura’s 2-parameters model
A T
G C
Jukes-Cantor model
A T
G C
Kimura’s 2-parameters model
→ α
β
P, Q
h/p://www.veritastk.co.jp/kamon/pdf/kamon27/popu27.pdf P=1
4(1 − 2exp(−4(α + β)t) + exp(−8βt)) Q=1
2(1 − exp(−8βt))
d≡ 2λt = 2αt + 4βt
= −1
2ln(1− 2P − Q) −1 4ln(1− 2Q)
20
p = P + Q
UPGMA
(Unweighted Pair Group Method with Arithme_c mean)
•
• A B AB
→
•
21
Neighbor Joining method
Saitou and Nei (1987) Mol Biol Evol 4: 406
•
•
•
•
2005
Saitou and Nei (1987) 22
Saitou and Nei (1987)
internal branch
SO= LiX= 1 N−1
∑
i< jDiji=1 N
∑
OUT N-1 OTU OTU N-1 N-1Dij: OTU i j
Lab: ab
LXY= 1
2(N− 2) (D1k+ D2k)− (N − 2)(L1X+ L2X)− 2 LiY i= 3
N
∑
k= 3 N
$
∑
% &
' ( ) Fig.2a
XY Fig.2b
1 OTU1, 2 OTU
2 3 XY
S12= LXY+ (L1X+ L2X) + LiY
i= 3 N
∑
= 1
2(N− 2) (D1k+ D2k) + 1
2(L1X+ L2X) + N− 3 N− 2i= 3LiY
N
∑
k = 3 N
∑
=2(N1− 2) (D1k+ D2k) +1 2D12+
1 N− 23≤i< jDij
∑
k = 3 N
∑
2OTU 23
1. 2OTU
2. 2OTU 1 OTU
OTU1 2 OTU(1-2)
OTU :
D
(1− 2) j= (D
1 j+ D
2 j) /2
3. 1,2 OTU 3
4.
L
1X= (D
12+ D
1Z− D
2Z) /2
L
2X= (D
12+ D
2Z− D
1Z) /2
D1Z = D1i
i= 3
∑
N /(N− 2)D2Z = i= 3D2i
∑
N /(N− 2) 24A
C
B D
(unrooted tree)
A
A B C D
(rooted tree)
?
25
•
26
A A C C C
T → A
1
2
1
2
T → A
T → A
most parsimonious tree 27
1 2 3 4 5
Out A C T A C
A T C T A T
B T A T G C
C T C G G C
1 2 3 4 5
Out 0 0 0 0 0
A 1 0 0 0 1
B 1 1 0 1 0
C 1 0 1 1 0
Out A B C
h/p://www.gwu.edu/~clade/faculty/lipscomb/Cladis_cs.pdf
1
5 (0,1,1,0)
28
•
29
Sa Sb
Sd Sc S0
S1 t1
t2 ta tb
td
L= πS 0PS 0S1(t1)PS1Sa(ta)PS1Sb(tb)PS 0S 2(t2)PS 2Sc(tc)PS 2Sd(td)
S 2
∑
S1
∑
S 0
∑
1
S0, S1, S2 {A, T, C, G}
43
Sa, Sb, Sc {A, T, C, G} 1
S2
πX X 1/4
PXY(t) t X Y
n
L = πS0jPS0jS1j(t1)PS1jSaj(ta)PS1jSbj(tb)PS0jS2j(t2)PS2jScj(tc)PS2jSdj(td) S2j
∑
S1j
∑
S0j
∑
j n
∏
30
h/p://www.stat.wisc.edu/~larget/phylogeny/Holmes-Sta_s_calScience-2003.pdf 49% CONS-CPZ,O4,N2,B34
monophyle_c group
31
0123456789 AATAATCACA GGCAATTATG GGTGGCCTCG AACGATCACG A
B C D
3229168977 ATTAACCAAA ACCGGTTGAA GTTGGCCGTT GCCGACCGAA
1
9831664065 ACAACCAACT GTAGTTAGTT GCGGCCGGCC GCGACCAACT
2
:
:
32
h/p://www.stat.wisc.edu/~larget/phylogeny/Holmes-Sta_s_calScience-2003.pdf
Phylip
33
9
1. ACTG CTAG Needleman & Wunsch
match = 1, mismatch = -1, gap = -2
2. Jukes-Cantor 4
d
3. A, B
1 2 3 4
AATAATCA GGTAATCT GGTAGTCT AGCAATCA
1 2 3 4
A
1 2 3 4
B
34
PDF MS Word
e
2016 12 12
35