• 検索結果がありません。

名古屋大集中講義 iwatawiki lec06 s

N/A
N/A
Protected

Academic year: 2018

シェア "名古屋大集中講義 iwatawiki lec06 s"

Copied!
12
0
0

読み込み中.... (全文を見る)

全文

(1)

6

2016/11/30 13:00-14:30

•  2050

1.7

• 

2050 90

Tester M, Langridge P(2010) Science 327: 818

SSA

• 

1975

/ 292

1982

1984

(2)

Meuwissen et al. (2001) GeneGcs 157:1819

DNA 5

DNA

•  DNA

hKp://www.illuminakk.co.jp/product/genoTYPING/bovinesnp50_beadchip.shtml

Illumina BovineSNP50 BeadChip

24 54,609 DNA

Garcia-Ruiz et al.

Proc Natl Acad Sci 113(28): E3995-4004

“The most drama8c response to

genomic selec8on was observed

for the lowly heritable traits

DPR, PL, and SCS. Gene8c trends

changed from close to zero to

large and favorable, resul8ng in

rapid gene8c improvement in

fer8lity, lifespan, and health in a

breed where these traits eroded

over 8me.”

(3)

(

, ,

)

Watanabe et al. (2005)

Ann Bot. 95:1131

DNA

DNA

DNA

LD

(4)

GS

DNA

y

i

: i

x

ij

: i j

y

x 3

x 4

x 1

x 2

x K y = f (x 1 , x 2 ,, x K )

(y) DNA (x i )

f (x)

f (.)

y DNA x 1 ,...,x K

vs.

Watanabe et al. (2005)

Ann Bot. 95:1131

DNA

DNA

y i = µ + β j x ij + e i

j= 0

N

...

DNA

Watanabe et al. (2005)

Ann Bot. 95:1131

DNA

1

hKp://www.soran.net/index_html/A0084070.htm

1 3

F1

6

GS GS

GS GS 3

(5)

x

y = f (x) \\\

x

y

273 142

304

423

173

373 234 138

203 133

223

y = f (x)

“large p small n” problem

p >> n

x, y 1

60K …

y i = µ + β j x ij + e i

j=0

J

PRESS

S e

GS

0 2 4 6 8 10

0 2 4 6 8 1 0 1 2

0 2 4 6 8 10

0 2 4 6 8 1 0 1 2

x

y

0 2 4 6 8 10

0 2 4 6 8 1 0 1 2

x

b

b

y = 0 + 1

=

+

=

7

1

0

k

k

k x

b

b

y

     

(6)

(n-fold cross-validation)

1. n

2.  i

3.   i 2

4.  2, 3 n

5. 

2

PRESS

n leave-one-out

y ˆ i

y i

y ˆ i

y ˆ i

y i

, LASSO

E (k ) = ( y i − u − β j x ij

j

J

) 2

i

+ λ β j k

j

J

y i = u + β j x ij

j

J

+ e i

k = 1 LASSO

k = 2

-2 -1 0 1 2

0 .0 0 .5 1 .0 1 .5 2 .0

x

y

β j

•  Zhao 2011; Nature CommunicaGons 2:467

•  Rice Diversity hKp://www.ricediversity.org/

data/

•  395 × 1311 SNPs

•  (brown.rice.seed.length)

ridge regression, lasso

ridge lasso

●●

● ●●

● ●

●●

● ●

● ●

●●

●●

● ●

●●

● ●

●●

● ●

● ●

●●

●●

● ●

● ●

● ●

● ●

● ●● ●

● ●

●●

●●

● ●

● ●

●●

●●

● ●

● ●

4 5 6 7 8

4 5 6 7 8

Observed seed length

Pre d ict e d se e d l e n g th

r = 0.72

mse = 0.28

● ●

● ●

●●

●●

●●

● ●

● ●

●●

● ●

● ●

●●

●●

●●

● ●

● ●

● ●

● ●

●●

● ●

4 5 6 7 8

4 5 6 7 8

Observed seed length

Pre d ict e d se e d l e n g th

r = 0.77

mse = 0.24

(7)

Ridge, LASSO

Ridge LASSO

0 shrink

GS3

GS3

0e+00 1e+08 2e+08 3e+08

− 0 .0 1 0 .0 0 0 .0 1 0 .0 2

Ridge

Position (bp)

C o e ff ici e n ts

0e+00 1e+08 2e+08 3e+08

− 0 .1 0 0 .0 0 0 .1 0 0 .2 0

LASSO

Position (bp)

C o e ff ici e n ts

ElasGc net

argmin

b

(y i − x i

T b) 2

i

+ λ( 1−α

2

b 2 +α b

1 )

#

$ % &

' (

y i = x ij b j

j

J

+ e i = x i T b + e i

b

1 = b j

j

J

b 2 = b

j

2

j

J

α = 1: lasso

α = 0: ridge

ElasGc Net

Ridge LASSO

●●●

● ●

● ●

● ●

● ●

●●

● ●

●●

●●

● ●

● ●

●●

●●

●●

●●

● ●

● ●

● ●

●● ●

● ●

●●

● ●

● ●

●●

● ●

●●

●● ●

● ●

●●

● ●

4 5 6 7 8

45678

Observed seed length

Predicted seed length

r = 0.72 mse = 0.28

● ●

● ●

●●

● ●

●●

●●

● ●

●●

●●

● ●

●●

● ●

● ●

●●

●●

●●

● ●

● ●

● ●

●●

● ●

●●

● ●

●●

● ●

● ●

4 5 6 7 8

45678

Observed seed length

Predicted seed length

r = 0.77 mse = 0.24

● ●

● ●

● ●

●●

●●

●●

●●

● ●

●●

●●

●●

● ●

●●

● ●

●●

●●

● ●

● ●

●●

● ●

●●

● ●

● ●

●●

● ●

● ●

4 5 6 7 8

45678

Observed seed length

Predicted seed length

r = 0.77 mse = 0.24

ElasGc Net

X'Xb + λ b = (X'X + λ I)b = X'y

⇔ b = (X'X + λ I p )

−1 X'y

argmin

b

(y i − x i T b ) 2

i

+ λ b 2

#

$ % &

' (

b = λ −1 X'(y − Xb) = X'a

⇔ a = (XX'+ λ I n )

−1 y

(λ > 0

E b

b

p × p

n × n

ˆy i = x i T b = x T i X'a = x i T a j x j

j

n = a j x i

T x

j j

n

i j

a j

(8)

b = λ −1 X'(y − Xb) = X'a

a = λ −1 ( y − Xb) = λ −1 (y − XX'a)

⇔ λ a = y − XX'a

⇔ (XX' − λ I n ) a = y

⇔ a = (XX'− λ I n )

−1 y

RR-BLUP

Ridge Kernel Ridge

●● ●

● ●

●●

● ●

● ●

●●

● ●

●●

● ●

●●

●●

● ●

● ●

●●

●●

● ●

● ●

● ●

● ● ●

● ●● ●

● ●

●●

● ●

● ●

●●

● ●

● ●

● ●

● ●

4 5 6 7 8

4 5 6 7 8

Observed seed length

Pre d ict e d se e d l e n g th

r = 0.72

mse = 0.28

●●

●●

● ●

●●

●●

●●●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

4 5 6 7 8

4 5 6 7 8

Observed seed length

Pre d ict e d se e d l e n g th

r = 0.72

mse = 0.28

x y

•  x φ(x) basis function .

•  φ(x) (x, z) = φ x) T φ(z)

.

• 

x φ x

x y

y y

a = (XX'+ λ I) −1 y

XX' =

x 1 T x 1 x 1 T x 2 x 1 T x N

x 2 T x 1 x 2 T x 2 x 2 T x N

  

x N

T x 1 x 2 T x N x N

T x N

!

"

# #

# #

#

$

%

&

&

&

&

&

y i = x i T w + e i

(9)

a = (G + λ I) −1 y

G =

φ(x 1 ) T φ(x 1 ) φ(x 1 ) T φ(x 2 ) φ(x 1 ) T φ(x N )

φ(x 2 ) T φ(x 1 ) φ(x 2 ) T φ(x 2 ) φ(x 2 ) T φ(x N )

  

φ(x N )

T φ(x 1 ) φ(x 2 ) T φ(x N )  φ(x N )

T φ(x N )

!

"

# #

# #

#

$

%

&

&

&

&

&

y i = φ(x i ) T w + e i

y i = φ (x i ) T w + e i

= α j κ (x i , x j )

j=1

n

+ e i

a = (G + λI) −1 y

κ (x i , x j ) = x i T x j

G ij = κ (x i , x j ) = φ (x i ) T φ (x j )

a = (G + λ I) −1 y

G =

κ (x 1 , x 1 ) κ (x 1 , x 2 ) κ (x 1 , x N )

κ (x 2 , x 1 ) κ (x 2 , x 1 ) κ (x 2 , x N )

  

κ (x N , x 1 ) κ (x N , x 2 )  κ (x N , x N )

!

"

# #

# #

#

$

%

&

&

&

&

&

y i = φ(x i ) T w + e i

κ (x, x n ) = φ (x)' φ (x n )

= exp −h(x − x n )

( 2 )

36

h = 2 / d 2 median

(10)

Kernel ridge Gaussian kernel

●●

● ●

●●

● ●

● ●

● ●

●●●

● ●

● ●

● ●

● ●

● ●

●●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

● ●

4 5 6 7 8

4 5 6 7 8

Observed seed length

Pre d ict e d se e d l e n g th

r = 0.72

mse = 0.28

●●

●●

● ●

●●

● ●

●●

● ●

● ●

●●

● ●

● ●

● ●

● ●

●●

● ●

● ●

● ●

4 5 6 7 8

4 5 6 7 8

Observed seed length

Pre d ict e d se e d l e n g th

r = 0.73

mse = 0.27

GS R

•  ridge regression, LASSO, elasGc net

–  glmnet etc.

•  BLUP

–  rrBLUP

•  SVM, RVM

–  kernlab etc.

•  random forest

–  randomForest etc

•  Bayesian linear regression (Bayesian ridge, Bayesian LASSO)

–  BLR etc

•  RKHS regression

–  RKHSw

Crossa et al. (2010) GeneGcs 186: 713 R

GS

Zhong et al. (2009) GeneGcs 182: 355

Crossa et al. (2010) GeneGcs 186: 713

Iwata and Jannink (2011) Crop Sci 51: 1915

Heffner et al. (2011) Crop Sci 51: 2597

GS

1 GS1 GS2 2 GS3 GS4 3 GS5 GS6

GS1

1

GS1

(

)

GS1

n = 48

GS2

1

GS2

×

2

n = 192

× ×

×

n = 12

n = 12

4

+

y=f(x)

y=f(x)

14,598 DNA

Hiseq

2000

Illumina HP

14,598 DNA

Hiseq

2000

Illumina HP

(11)

150

100 cm

500 g/L

12

4

25

30 g

× 0.728

× 0.550

× 0.306

× 0.053

× 0.015

× 0.011

× -0.001

= 109.2

= 55.0

= 153.0

= 0.636

= 0.060

= 0.275

= -0.030

318.1

11

kg/10a

12

y=f(x)

14,598 DNA

Hiseq

2000

Illumina HP

   

1.00

1.11 1.11

1.38

1.32

1.36

1.44

1.0

1.1

1.2

1.3

1.4

1.5

  GS1 GS2 GS3 GS4 GS5 GS6

   



1 2 3

1.44

3

1

2.0 1.5

1

1.37

1

1.27

  











0

5

10

15

0-20- 40- 60- 80- 100-120-140-160-180-200-220-



GS6

0

5

10

15

0

5

10

15

0

5

10

15

0

5

10

15

0

5

10

15

0

3

6

9

12

60- 70- 80- 90- 100-110-120-130-140-150-160-



GS6

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

5

10

15

20

350- 400- 450- 500- 550- 600- 650- 700-



GS6

0

5

10

15

20

0

5

10

15

20

0

10

20

0

5

10

15

20

0

5

10

15

20

0

5

10

15

0-100- 200- 300- 400- 500- 600- 700- 800- 900-



GS6

0

5

10

15

0

5

10

15

0

5

10

15

0

5

10

15

0

5

10

15

0.728

63.7 %

44.3 %

0.550

28.6 %

0.306

8.0 %











0

3

6

9

12

8 9 10 11 12 13 14 15 16 17



GS6

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

6

12

18

2 3 4 5 6 7 8



GS6

0

6

12

18

0

6

12

18

0

6

12

18

0

6

12

18

0

6

12

18

0

4

8

12

16

0

4

8

12

16

0

4

8

12

16

0

4

8

12

16

0

4

8

12

16

0

4

8

12

16

17 18 19 20 21 22 23 24 25 26 27 28



GS6

0

3

6

9

12

16-18-20-22-24-26-28-30-32-34-36-38-



GS6

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0

3

6

9

12

0.053

23.4 %

0.015 0.011

8.3 %

-0.001

(12)

• 

• 

• 

参照

関連したドキュメント

大学教員養成プログラム(PFFP)に関する動向として、名古屋大学では、高等教育研究センターの

運営、環境、経済、財務評価などの面から、途上国の

講義の目標.

Nakajima, Crystal bases and two-sided cells of quantum affine algebras, Duke Math., 123 (2004), no.. Beck, Braid group action and quantum affine

The Admissions Office for International Programs is a unit of the Admissions Division of Nagoya University that builds and develops a successful international student recruitment

[r]

[r]

区分 授業科目の名称 講義等の内容 備考.. 文 化