• 検索結果がありません。

Microarray基礎分析Idocx

N/A
N/A
Protected

Academic year: 2018

シェア "Microarray基礎分析Idocx"

Copied!
11
0
0

読み込み中.... (全文を見る)

全文

(1)

Visit the web site for related materials:

http://lucas.genome-analyst.org/activities-articles/course/nrpb2011q4

Microarra

y 基礎分

第一次分析就上手

董建億

(2)

What is Microarray Technique?

Principle of microarray

Technique platform

 Spotting array

 Pre-synthetic probe (glass slide)

 In situ synthesis

 Ink-jet technology (e.g. Agilent system)

 Photolithographic technology (e.g. Affymetrix system)

Quality Check before Microarray experiments

Sample quality check

Experiment Design

Where is experiment variation?

 Signal: Fat or not.

 Noise: Batch, gender, age…..

System errors

 Technique repeats

 Biological repeats

 How many repeats?

1

Good Fat Normal

Batch Age Batch Age

1 1 6

2 2 9

3 3 18

4 1 15

5 2 8

6 3 10

(3)

General Analysis workflow

Data preprocessing

Normalization

Signal adjustment (flag, flooring…)

Probe summarization

Signal transform (fold change, log transform)

Data QC/QA

QC index (Hybridization control)

Visualization (PCA, MDS, Box plot..)

Find differentially expressed genes

Statistical analysis

p-Value correction

Biological interpretation

Gene annotation

Gene set analysis

Gene network analysis

2

(4)

Data preprocessing

Normalization

Dual color system

Single color system

Dual color system

– Probe flag

– R/G Normalization

– Signal flooring

– Convert to signal ratio.

– Log Transform

Single color system

– Background correction

– Summarization

– Normalization

– Log Transform

– Signal flooring

3

(5)

Signal adjustment (flag, flooring…)

Probe summarization

Signal transform (fold change, log transform)

4

(6)

Microarray data quality control and assessment

Is hybridization reaction OK?

Is signal comparable?

Is expected variations observed?

Distance measurement

 Euclidean

i=16

(

XiA

XiB

)

2

 Manhattan

i=1 6

|

XiA

XiB

|

 Pearson Correlation

5

(7)

i=1 6

(

Xi

−X )(Y

i

−Y )

(n−1)S

XSY

6

(8)

Distance Matrics

Hierarchical Clustering

Dimension reduction

Principle Component Analysis or Matric MultiDimensional Scaling

7

(9)

Find differentially expressed genes

Statistics

Two samples (ratio)

One class sample

Two class sample

 t-test(Welch’s t test)

 limma

Multiple Classes sample

 One way ANOVA

 Limma

Pair comparison

 pair t-test

 2 way ANOVA

Multi-factor analysis (n-way ANOVA)

Time course

1 2 3 4 5

8

(10)

Comparison of Statistics

Test Variance modeling Reference Package R

Welch’s T-test - Fixed variance

- Heterodasticity Welch CIT (internal)

ANOVA -Fixed variance

-homoscedasticity Fisher CIT (internal)

Wilcoxon Non parametric Wilcoxon stats

SAM Non parametric Tusher et al

2001 samr

RVM

Inverse gamma distribution on the variance (estimated from all the data set)

Wright & Simon

2003 CIT (internal)

Limma Moderate t-test. Usual variance replaced by a conditional variance. Bayesian approach

Smyth

2004 limma

VARMIXT Gamma mixture model on the variance Delmar et al 2005 varmixt

SMVAR Mixed model (fixed condition effect and random gene effect)

Jaffrézic et al

2007 SMVar

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2933223/?tool=pubmed

Type I Error Power Stablility Ease of Calucation

Sample size small large small large   use  time 

t-test + +++ + +++ +++ +++ +++

anova +++ +++ + +++ +++ +++ +++

wilcoxon + + + ++ ++ +++ ++

SAM +++ +++ + ++ ++ ++ ++

RVM + + +++ +++ ++ + +

Limma +++ +++ +++ +++ +++ ++ +++

VarMixt +++ +++ +++ +++ + + +

SMVar + + ++ +++ + ++ +++

9

(11)

p-Value Correction

Multiple test correction (MTC)

αe

Experiment-wise significant level

αc

comparison-wise significance level

Bonferroni correction

K independent test, so all α is equal. so overall number of type I errors = K x α

α

e =

α

c/K =>

α

c =

α

e x K

Correct p = p x K

Dunn-Sidak

α

e = 1-(1-

α

c)k

Correct p = 1-(1-p)1/k

Bonferroni Step-down (Holm)

1st. corrected p-value = p-value x n(gene number) 2nd. corrected p-value = p-value x n-1

....

nth. corrected p-value = p-value x 1

False positive discovery (FDR)

Bootstraping

10

参照

関連したドキュメント

In [12], as a generalization of highest weight vectors, the notion of extremal weight vectors is introduced, and it is shown that the uni- versal module generated by an extremal

[3] Chari, Vyjayanthi, On the fermionic formula and the Kirillov-Reshetikhin conjecture, Int. and Yamada, Y., Remarks on fermionic formula, Contemp. and Tsuboi, Z., Paths, crystals

Abstract We show that the transition matrices between the standard and the canon- ical bases of infinitely many weight subspaces of the higher-level q -deformed Fock spaces are

System Organ Class 器官別大分類 High Level Group Term 高位グループ語 High Level Term 高位語. Preferred

Moreover, by (4.9) one of the last two inequalities must be proper.. We briefly say k-set for a set of cardinality k. Its number of vertices |V | is called the order of H. We say that

発行日 2005.10.1 改訂番号 - 大成基礎設計株式会社

• Soft−Skip Mode: This proprietary feature of the NCP1271 minimizes the standby low−frequency acoustic noise by ramping the peak current envelope whenever skip is activated..

During standby or light load operation the duty ratio on the controller becomes very small. At this point, a significant portion of the power dissipation is related to the power