-->

GoogleSearch



Scientist. Husband. Daddy. --- TOLLE. LEGE
외부자료의 인용에 있어 대한민국 저작권법(28조)과 U.S. Copyright Act (17 USC. §107)에 정의된 "저작권물의 공정한 이용원칙 | the U.S. fair use doctrine" 을 따릅니다. 저작권(© 최광민)이 명시된 모든 글과 번역문들에 대해 (1) 복제-배포, (2) 임의수정 및 자의적 본문 발췌, (3) 무단배포를 위한 화면캡처를 금하며, (4) 인용 시 URL 주소 만을 사용할 수 있습니다. [후원 | 운영] [대문으로] [방명록] [옛 방명록] [티스토리 (백업)]

이 블로그 검색

FDR and FNR dillemma

라벨:


Source: https://stat.ethz.ch/pipermail/bioconductor/2008-December/025556.html
 
The ball model does not apply to microarray studies.  (And the 
probability of drawing the red ball in 20 draws is not 1).

But FDR does apply to microarray studies, and so does a less 
discussed concept, the false nondiscovery rate or FNR.

Suppose I take 20 independent samples of mouse liver tissue - same 
strain, gender ... and hybridize independently to 20 microarrays - 
any platform.
Then arbitrarily divide into 2 groups of size 10.  If there are 
10,000 genes on the array, you should see 1 gene with p-value .0001or 
less, 10 genes with p-value .001 or less, 100 genes with p-value .01 
or less etc.  Now suppose you take the 100 genes with the highest 
degree of differential expression and do a PCR study with independent 
samples.  You should still have 1 gene which is significant with 
p=.01 and 5 genes which are significant at p=.05.

The problem is - there is no systematic difference between the 
samples.  You have detected noise - i.e. chance variation.  If you 
use the same samples to do your PCR, you may get closer to 100% 
"significance" for the selected genes, because the variation that 
caused the false detection will still be in the sample unless it was 
due only to the hybridization.

FDR is an estimate of the excess of significant findings, compared to 
what is expected by chance.  You can reduce FDR greatly by doing 
independent follow-up studies (on another microarray or on another 
platform such as PCR).  You cannot reduce FDR much by reusing the 
same samples on a different platform, although you will reduce 
affects due to technical variation.

However, FDR reduces your power to detect differential 
expression.  This means that you will have higher FNR if you use 
multiple comparisons adjustments.  Again, if you do independent 
follow-up studies, you can reduce FNR.

The purpose of the FDR computation is to reduce effort wasted on 
large gene lists which are mostly reporting noise.  But if your 
genelist is smaller than you think is reasonable, you may certainly 
follow up a larger set of genes and sorting by p-value will give you 
the most reasonable set of genes to follow up.  Again,
the only valid follow-up uses independent samples and independent platforms.  \
--Naomi





라벨:





Scientist. Husband. Daddy. --- TOLLE. LEGE
외부자료의 인용에 있어 대한민국 저작권법(28조)과 U.S. Copyright Act (17 USC. §107)에 정의된 "저작권물의 공정한 이용원칙 | the U.S. fair use doctrine" 을 따릅니다. 저작권(© 최광민)이 명시된 모든 글과 번역문들에 대해 (1) 복제-배포, (2) 임의수정 및 자의적 본문 발췌, (3) 무단배포를 위한 화면캡처를 금하며, (4) 인용 시 URL 주소 만을 사용할 수 있습니다. [후원 | 운영] [대문으로] [방명록] [옛 방명록] [티스토리 (백업)] [신시내티]

-