Correction of scaling mismatches in oligonucleotide microarray data

作者: Martino Barenco , Jaroslav Stark , Daniel Brewer , Daniela Tomescu , Robin Callard

DOI: 10.1186/1471-2105-7-251

关键词: Set (abstract data type)GeneSystems biologyExperimental dataDNA microarrayContext (language use)Microarray analysis techniquesConstant (mathematics)AlgorithmGene expression profilingComputer scienceData miningMicroarrayScaling

摘要: Gene expression microarray data is notoriously subject to high signal variability. Moreover, unavoidable variation in the concentration of transcripts applied microarrays may result poor scaling summarized which can hamper analytical interpretations. This especially relevant a systems biology context, where systematic biases signals particular genes have severe effects on subsequent analyses. Conventionally it would be necessary replace mismatched arrays, but individual time points cannot rerun and inserted because experimental It therefore repeat whole series experiment, both impractical expensive. We explain how mismatches occur by popular MAS5 (GCOS; Affymetrix) algorithm, propose simple recursive algorithm correct them. Its principle identify set constant use this rescale signals. study properties using artificially generated apply data. show that generates used from other experiments, provided underlying system similar original. also demonstrate, example, method successfully existing imbalancesin The obtained for given experiment studied are sufficiently similar. type rescaling applications

参考文章(16)
Thomas Lengauer, Alexander Zien, Ralf Zimmer, T. Aigner, Centralization: A biologically sensible method for the normalization of gene expression data Bioinformatics. ,vol. 17, ,(2001)
Thomas B Kepler, Lynn Crosby, Kevin T Morgan, Normalization and analysis of DNA microarray data by self-consistency and local regression Genome Biology. ,vol. 3, pp. 1- 12 ,(2002) , 10.1186/GB-2002-3-7-RESEARCH0037
Alexander Ploner, Lance D Miller, Per Hall, Jonas Bergh, Yudi Pawitan, Correlation test to assess low-level processing of high-density oligonucleotide microarray data BMC Bioinformatics. ,vol. 6, pp. 80- 80 ,(2005) , 10.1186/1471-2105-6-80
A. Zien, T. Aigner, R. Zimmer, T. Lengauer, Centralization: a new method for the normalization of gene expression data. Bioinformatics. ,vol. 17, pp. 323- 331 ,(2001) , 10.1093/BIOINFORMATICS/17.SUPPL_1.S323
Jaroslav Stark, Robin Callard, Michael Hubank, From the top down: towards a predictive biology of signalling networks Trends in Biotechnology. ,vol. 21, pp. 290- 293 ,(2003) , 10.1016/S0167-7799(03)00140-9
C. Li, W. H. Wong, Model-based analysis of oligonucleotide arrays: Expression index computation and outlier detection Proceedings of the National Academy of Sciences of the United States of America. ,vol. 98, pp. 31- 36 ,(2001) , 10.1073/PNAS.98.1.31
George C Tseng, Min-Kyu Oh, Lars Rohlin, James C Liao, Wing Hung Wong, Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effects Nucleic Acids Research. ,vol. 29, pp. 2549- 2557 ,(2001) , 10.1093/NAR/29.12.2549
Sung E Choe, Michael Boutros, Alan M Michelson, George M Church, Marc S Halfon, Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset Genome Biology. ,vol. 6, pp. 1- 16 ,(2005) , 10.1186/GB-2005-6-2-R16
Rafael A Irizarry, Benjamin M Bolstad, Francois Collin, Leslie M Cope, Bridget Hobbs, Terence P Speed, Summaries of Affymetrix GeneChip probe level data Nucleic Acids Research. ,vol. 31, ,(2003) , 10.1093/NAR/GNG015