Comprehensive analysis of correlation coefficients estimated from pooling heterogeneous microarray data

dc.contributor.author Almeida-de-Macedo, Márcia
dc.contributor.author Ransom, Nick
dc.contributor.author Feng, Yaping
dc.contributor.author Hurst, Jonathan
dc.contributor.author Wurtele, Eve
dc.contributor.department Department of Genetics, Development, and Cell Biology (LAS)
dc.date 2018-02-18T04:07:57.000
dc.date.accessioned 2020-06-30T04:02:39Z
dc.date.available 2020-06-30T04:02:39Z
dc.date.copyright Tue Jan 01 00:00:00 UTC 2013
dc.date.issued 2013-01-01
dc.description.abstract <p><h3>Background</h3></p> <p>The synthesis of information across microarray studies has been performed by combining statistical results of individual studies (as in a mosaic), or by combining data from multiple studies into a large pool to be analyzed as a single data set (as in a melting pot of data). Specific issues relating to data heterogeneity across microarray studies, such as differences within and between labs or differences among experimental conditions, could lead to equivocal results in a melting pot approach. <h3>Results</h3></p> <p>We applied statistical theory to determine the specific effect of different means and heteroskedasticity across 19 groups of microarray data on the sign and magnitude of gene-to-gene Pearson correlation coefficients obtained from the pool of 19 groups. We quantified the biases of the pooled coefficients and compared them to the biases of correlations estimated by an effect-size model. Mean differences across the 19 groups were the main factor determining the magnitude and sign of the pooled coefficients, which showed largest values of bias as they approached ±1. Only heteroskedasticity across the pool of 19 groups resulted in less efficient estimations of correlations than did a classical meta-analysis approach of combining correlation coefficients. These results were corroborated by simulation studies involving either mean differences or heteroskedasticity across a pool of <em>N</em> > 2 groups. <h3>Conclusions</h3></p> <p>The combination of statistical results is best suited for synthesizing the correlation between expression profiles of a gene pair across several microarray studies.</p>
dc.description.comments <p>This article is from <em>BMC Bioinformatics </em>14 (2013): 214, doi: <a href="http://dx.doi.org/10.1186/1471-2105-14-214" target="_blank">10.1186/1471-2105-14-214</a>. Posted with permission.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/gdcb_las_pubs/62/
dc.identifier.articleid 1065
dc.identifier.contextkey 9640633
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath gdcb_las_pubs/62
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/37973
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/gdcb_las_pubs/62/2013_Wurtele_ComprehensiveAnalysis.pdf|||Sat Jan 15 01:18:15 UTC 2022
dc.source.uri 10.1186/1471-2105-14-214
dc.subject.disciplines Computational Biology
dc.subject.disciplines Genetics
dc.subject.disciplines Statistical Theory
dc.title Comprehensive analysis of correlation coefficients estimated from pooling heterogeneous microarray data
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication a7de6326-d86c-4395-b9e6-51187c7f1782
relation.isOrgUnitOfPublication 9e603b30-6443-4b8e-aff5-57de4a7e4cb2
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2013_Wurtele_ComprehensiveAnalysis.pdf
Size:
1.34 MB
Format:
Adobe Portable Document Format
Description:
Collections