Microarray Expression Data Analysis References
Newton MA, Yandell BS, Shavlik J, Craven M (2001)
The dimension and complexity of raw gene expression data obtained by
oligonucleotide chips, spotted arrays, or whatever technology is used,
create challenging data analysis and data management problems.
In a limited way these challenges can be met
by existing software systems and analysis methods in the hands of
end users. However, we are convinced that a much more active scientific
endeavor is called for. We anticipate that, broadly defined, bioinformatics
will encompass statistical and biometrical questions of experimental
design, data analysis, graphics and modeling, and computational questions
concerning efficient algorithms for various learning tasks such as
classification and clustering.
Microarray data can be analysed using several approaches
(Claverie, 1999). Clustering methods
(i.e. unsupervised learning) are used widely and have the ability to
uncover coordinated expression patterns
from a collection of microarrays (e.g., Eisen et al. 1998;
Getz et al. 2000; Tibshirani et al. 2000; Dudoit, Fridlyand et al. 2000;
Kerr and Churchill 2000a).
The use of standard clustering methods is most appropriate when the
microarrays arise from some common source cell type, for example from
a common tissue type from animals in some controlled cross. Refinements
may be necessary when other sources of variation affect the microarrays
(van der Laan and Bryan 2000). Classification methods (i.e. supervised
learning) have proven very useful to identify patterns of gene expression
that can be correlated with qualitative disease phenotypes (e.g. Golub
et al. 1999) and for classifying
genes according to their functional role (Brown et al. 2000). Related
methods of multivariate statistical analysis, such as those using the
singular value decomposition (Alter et al. 2000; West et al. 2000)
or multidimensional scaling can be effective at reducing the dimension of
the objects under study.
Statistical methods are emerging to account for multiple sources of
variation when trying to pool information from many microarrays and
to identify genes exhibiting significant differential expression between
cell types. One approach is to decompose the appropriately transformed
expression measurement as a linear combination of effects from different
sources of variation (Kerr et al. 2000). This is basically ANOVA for microarrays. In the context of a two-group comparison with replication Dudoit, Yang et
al. (2000) have proposed the use of permutation-testing and p-value
adjustment to account for the multiple-testing problem.
Lin et al. (2001) describe a nonparametric method suited to uncovering
differential expression for low-abundance transcripts.
Alternatively, the mixture-model approach can be used to directly assay the
probability that a given gene is truly expressed (Lee et al. 2000)
or the probability that a gene is truly
differentially expressed between two conditions (Newton et al. 2001;
Efron et al. 2001).
The functional patterns of expression identified by such statistical
calculations will be backed up by laboratory
examination to verify findings (cf. Nadler et al. 2000).
Although analysis methods have been a central concern in most
bioinformatics research to date, the issue of experimental design
is critical. The use of replication, for example, in controlled
experiments can significantly improve power to uncover differentially
expressed genes (Kerr and Churchill 2000b, Lee et al. 2000).
Our internal review of requests for microarray support will include
careful examination of experimental design considerations.
Microarray analysis typically uses background-adjusted expression
intensities, (PM-MM for Affymetrix chips). However, this can create
problems with negative adjusted values, since the log-transform is often
applied to these adjusted values. This has prompted ad hoc procedures (cf.
Roberts et al. 2000). However, arbitrary handling of low expression genes
is unsatisfactory since these may be the most interesting, e.g.
transcription factors and receptors. Instead Lin et al. (2001) advocated an
approximate normal scores transformation of background-adjusted expression
which allows the use of all data (see also Efron et al. 2001). These normal
scores appear to have better properties for clustering, and are well
behaved for inference on differential expression.
Patterns of gene expression evinced by data analysis is only the beginning.
In many cases, greater biological understanding can be attained by
using expression data in conjunction with sequence
data (Craven et al. 2000), pathway data (Zien et al. 2000), and
biomedical text sources (Shatkay et al. 2000). It may in addition involve constructing
predictive models from diverse data sources (Craven et al. 2000),
and developing automated methods for exploiting text and Web data
(Craven and Kumlien, 1999; Shavlik et al. 1999).
Return to Statistical Genomics References.
Lockhart et al. (1996);
Ermolaeva et al. (1998);
Bassett, Eisen, Boguski (1999);
Brown, Botstein (1999);
Claverie (1999);
Duggan, Bittner, Chen, Meltzer, Trent (1999);
Lander ES (1999);
Lipshutz, Fodor, Gingeras, Lockhart (1999);
Zhang MQ (1999);
Lockhart, Winzeler (2000)
Spellman et al. (1998);
Lee, Klopp, Weindruch, Prolla (1999);
Richmond et al. (1999) ;
Nadler et al. (2000);
Soukas, Cohen, Socci, Friedman (2000)'
Craven, Kumlien (1999);
Shavlik et al. (1999);
Craven et al. (2000);
Shatkay et al. (2000);
Zien et al. (2000);
Eisen, Chiang, Brown (2001);
Liu (2001)
Schadt et al. (1999);
Yang et al. (2000);
Yang et al. (2000);
Brown, Goodwin, Sorger (2001);
Li, Wong (2001);
Schadt et al. (2001);
Stuart, Bush, Nigam (2001);
Irizarray et al. (2003);
Zhang, Miles, Aldape (2003)
Baldi, Long (2001);
de Risi et al. (1996);
Chen, Dougherty, Bittner (1997);
Hughes et al. (2000);
Cui, Churchill (2003);
Long et al. (2001) ;
Newton et al. (2001);
Roberts et al. (2000);
Theilhaber et al. (1999);
Woolf, Wang (2000);
Dudoit et al. (2000);
Kerr, Churchill (2000b);
Kim et al. (2000);
Lee et al. (2000);
Kerr, Churchill (2001);
Kerr, Martin, Churchill (2001);
Efron et al. (2001);
Lin et al. (2001);
Mills, Gordon (2001);
Pan, Lin, Le (2001)
Pan, Lin, Le (2001)
Thomas et al. (2001);
Wolfinger et al. (2001);
Zhao, Prentice, Breeden (2001);
Lonnstedt, Speed (2002);
Newton et al. (2003)
Eisen et al. (1998);
Golub et al. (1999);
Tamayo et al. (1999);
Tibshirani et al. (1999);
Brown et al. (2000);
Dudoit, Fridlyand, Speed (2000)
Getz, Levine, Domany (2000);
Hastie et al. (2000);
Hastie et al. (2000);
Lazzeroni, Owen (2000);
Spang et al. (2000);
van der Laan, Bryan (2000)
Barash, Friedman (2001);
Ben-Dor, Friedman, Yakhini (2001);
Kerr, Churchill (2001);
Pan, Lin, Le (2001);
Smith et al. (2001);
West et al. (2001);
Yeung et al. (2001);
Pollard, van der Laan (2002);
Jörnsten, Yu (2003);
Kluger et al. (2003)
Hilsenbeck et al. (1999);
Wittes, Friedman (1999);
Alter, Brown, Botstein (2000);
West (2000);
West et al. (2000);
Fellenberg et al. (2001);
Holter et al. (2001);
Wall (2001);
Kim, Tidor (2003)
Paper
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
R
S
T
U
V
W
X
Y
Z
- Alizadeh AA, Eisen MB, Davis RE, Ma C, Lossos IS, Rosenwald A,
Boldrick JC, Sabet H, Tran T, Yu X, Powell JI, Yang L, Marti
GE, Moore T, Hudson J Jr, Lu L, Lewis DB, Tibshirani R,
Sherlock G, Chan WC, Greiner TC, Weisenburger DD, Armitage JO,
Warnke R, Levy R, Wilson W, Grever MR, Byrd JC, Botstein D,
Brown PO, Staudt LM (2000)
Distinct types of diffuse large B-cell lymphoma identified by
gene expression profiling.
Nature 403: 503-511.
[doi:10.1038/35000501]
- Alter O, Brown P, Botstein D (2000) Singular value
decomposition for genome-wide expression data processing and
modeling. PNAS 97: 10101-10106.
[Paper]
[Supplement]
[Data]
- Baldi P, Long AD (2001) A Bayesian framework for the analysis of
microarray expression data: regularized t-test and statistical
inferences of gene changes. Bioinformatics 17: 509-519.
[doi:10.1093/bioinformatics/17.6.509]
- Barash Y, Friedman N (2001) Context-specific Bayesian clustering
for gene expression data. In Proc. Fifth Annual Inter. Conf. on
Computational Molecular Biology (RECOMB).
[Paper]
- Bassett DE, Eisen MB, Boguski MS (1999) Gene expression
informatics--it's all in your mine. Nat Genet 21 (Suppl): 51-5.
[doi:10.1038/4478]
- Ben-Dor A, Friedman N, Yakhini Z (2001) Class discovery in gene
expression data. In Proc. Fifth Annual Inter. Conf. on
Computational Molecular Biology (RECOMB)
[Paper]
- Brown CS, Goodwin PC, Sorger PK (2001)
Image metrics in the statistical analysis of DNA microarray data
PNAS 98: 8944-8949.
[Paper]
- Brown PO, Botstein D (1999) Exploring the new world of the genome
with DNA microarrays. Nat Genet 21 (Suppl): 33-7.
[doi:10.1038/4462]
- Brown MPS, Grundy WN, Lin D, Cristianini N, Sugnet CW, Furey TS,
Ares M Jr, Haussler D (2000) Knowledge-based analysis of
microarray gene expression data by using support vector
machines. PNAS 97: 262-267.
[Paper]
- Camp RL, Chung GG, Rimm DL (2002)
Automated subcellular localization and quantification of protein
expression in tissue microarrays.
Nat Med 8: 1323-1328.
[doi:10.1038/nm791]
- Chen Y, Dougherty ER, Bittner ML (1997) Ratio-based decisions and
the quantitative analysis of cDNA microarray images. J Biomed
Opt 2: 364-374.
[doi:10.1117/1.429838]
- Claverie, J. (1999) Computational methods for the
identification of differential and coordinated gene
expression. Hum Mol Genet 8:
1821-1832.
[Paper]
- Craven M, Kumlien J (1999) Constructing biological knowledge
bases by extracting information from text sources. In Proceedings of
the 7th International Conference on Intelligent Systems for Molecular
Biology. AAAI Press.
[Paper]
- Craven M, Page D, Shavlik J, Bockhorst J, Glasner J (2000)
A Probabilistic Approach to Whole-Genome Operon Prediction.
In Proceedings of the 8th International Conference on Intelligent
Systems for Molecular Biology. AAAI Press.
[Paper]
- Cui X, Churchill GA (2003)
Statistical tests for differential expression in cDNA microarray
experiments.
Genome Biology 4: 210.
[doi:10.1186/gb-2003-4-4-210]
- de Risi J, Penland L, Brown PO, Bittner ML, Meltzer PS, Ray M,
Chen Y, Su YA, Trent JM (1996) Use of a cDNA microarray to analyse
gene expression patterns in human cancer. Nat Genet 14:
457-60.
[doi:10.1038/ng1296-457]
- Dressman HK, Muramoto GG, Chao NJ, Meadows S, Marshall D,
Ginsburg GS, Nevins JR, Chute JP (2007)
Gene expression signatures that predict radiation exposure in
mice and humans.
PLoS Med 4: e106.
[doi:10.1371/journal.pmed.0040106]
- Dudoit S, Fridlyand J, Speed TP (2002) Comparison of
discrimination methods for the classification of tumors using gene
expression data.
J Amer Statist Assoc 97: 77-87.
[doi:10.1198/016214502753479248]
- Dudoit S, Yang YH, Callow MJ, Speed TP (2002)
Statistical methods for identifying differentially
expressed genes in replicated cDNA microarray experiments.
Statist Sinica 12: 111-139.
[Paper]
- Duggan DJ, Bittner M, Chen Y, Meltzer P, Trent JM (1999)
Expression profiling using cDNA
microarrays. Nat Genet 21
(Suppl): 10-4.
[doi:10.1038/4434]
- Dunning MJ, Barbosa-Morais NL, Lynch AG, Tavaré S, Ritchie ME (2008)
Statistical issues in the analysis of Illumina data.
BMC Bioinfo 9: 85.
[doi:10.1186/1471-2105-9-85]
- Efron B, Tibshirani R (2007)
On testing the significance of sets of genes.
Ann Appl Statist 1: 107-129.
[doi:10.1214/07-AOAS101]
- Efron B, Tibshirani R, Goss V, Chu G (2001) Microarrays and
their use in a comparative experiment.
J Amer Statist Assoc 96: 1151-1160.
[doi:10.1198/016214501753382129]
- Eisen MB, Chiang DY, Brown PO (2001) Discovery of regulatory elements from microarray data using
genome mean expression profiles.
9th International Conf on Intelligent Systems for Molecular Biology,
Copenhagen.
[Conference]
- Eisen MB, Spellman PT, Brown PO, Botstein D (1998) Cluster
analysis and display of genome-wide expression patterns.
PNAS 95: 14863-14868.
[Paper]
- Ermolaeva O, Rastogi M, Pruitt KD, Schuler GD, Bittner ML, Chen
Y, Simon R, Meltzer P, Trent JM, Boguski MS (1998) Data management and
analysis for gene expression arrays. Nat Genet 20:
19-23.
[doi:10.1038/1670]
- Fellenberg K, Hauser NC, Brors B, Neutzner A, Hoheisel JD,
Vingron M (2001) Correspondence analysis applied to microarray
data. PNAS 98: 10781-10786.
[Paper]
- Getz G, Levine E, Domany E (2000) Coupled two-way
clustering analysis of gene microarray data.
PNAS 97: 12079-12084.
[Paper]
- Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP,
Coller H, Loh ML, Downing JR, Caligiuri MA, Bloomfield CD,
Lander ES (1999) Molecular classification of cancer: Class
discovery and class prediction by gene expression
monitoring. Science 286: 531-537.
[doi:10.1126/science.286.5439.531]
- Guo L, Lobenhofer EK, Wang C, Shippy R, Harris SC, Zhang L, Mei N, Chen T, Herman D, Goodsaid FM, Hurban P, Phillips KL, Xu J, Deng X, Sun YA, Tong W, Dragan YP, Shi L (2006)
Rat toxicogenomic study reveals analytical consistency across microarray platforms.
Nat Biotech 24: 1162-1169.
[doi:10.1038/nbt1238]
- Hastie T, Tibshirani R, Botstein D, Brown P (2001)
Supervised harvesting of expression trees.
Genome Biol 2: research0003.
[doi:10.1186/gb-2001-2-1-research0003]
- Hastie T, Tibshirani R, Eisen MB, Alizadeh A, Levy R,
Staudt L, Chan WC, Botstein D, Brown P (2000) 'Gene shaving' as a
method for identifying distinct sets of genes with similar expression
patterns. Genome Biology 1: research0003.
[doi:10.1186/gb-2000-1-2-research0003]
- Hilsenbeck SG, Friedrichs WE, Schiff R, O'Connell P,
Hansen RK, Osborne CK, Fuqua SAW (1999)
Statistical analysis of array expression data as applied to
the problem of tamoxifen resistance. J Nat Cancer Inst
91: 453-459.
[doi:10.1093/jnci/91.5.453]
- Holter NS, Maritan A, Cieplak M, Fedoroff NV, JR Banavar (2001)
Dynamic modeling of gene expression data.
PNAS 98: 1693-1698.
[Paper]
- Hu J, Zou F, Wright FA (2005)
Practical FDR-based sample size calculations in microarray experiments.
Bioinformatics 21: 3264-3272.
[doi:10.1093/bioinformatics/bti519]
- Huber W, Toedling J, Steinmetz LM (2006)
Transcript mapping with high-density oligonucleotide tiling arrays.
Bioinformatics 22:1963-70.
[doi:10.1093/bioinformatics/btl289]
- Hughes TR, Marton MJ, Jones AR, Roberts CJ, Stoughton R, Armour
CD, Bennett HA, Coffey E, Dai HY, He YDD, Kidd MJ, King AM,
Meyer MR, Slade D, Lum PY, Stepaniants SB, Shoemaker DD,
Gachotte D, Chakraburtty K, Simon J, Bard M, Friend SH (2000)
Functional discovery via a compendium of expression
profiles. Cell 102: 109-126.
[Paper]
[Supplement]
[Data
(80Mb)]
- Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ,
Scherf U, Speed TP (2003)
Exploration, normalization, and summaries of high density
oligonucleotide array probe level data.
Biostatistics 4: 249-264.
[doi:10.1093/biostatistics/4.2.249
| Software
]
- Jansen RC (2003)
Studying complex biological systems using multifactorial perturbation.
Nat Rev Gen 4: 145-151.
[doi:10.1038/nrg996]
- Jörnsten R, Yu B (2003)
Simultaneous Gene Clustering and Subset Selection for
Classification via MDL.
Bioinformatics 19: 1100-1109.
[Paper]
- Kattan M (2002)
Statistical prediction models, artificial neural networks, and
the sophism "I am a patient, not a statistic".
J Clin Oncol 20: 885-887.
[Paper]
- Kendziorski CM, Newton MA, Lan H, Gould MN (2003)
On parametric empirical Bayes methods for comparing multiple
groups using replicated gene expression profiles.
Statistics in Medicine 22: 3899-3914.
[doi:10.1002/sim.1548]
- Kendziorski C, Wang P (2006)
On statistical methods for expression quantitative trait loci mapping.
Mammal Genome 17: 509-517.
[doi:10.1007/s00335-005-0189-6]
- Kerr MK, Churchill GA (2001) Experimental design for
gene expression microarrays. Biostatistics 2: 183-201.
[doi:10.1093/biostatistics/2.2.183]
- Kerr MK, Churchill GA (2001) Bootstrapping cluster analysis:
Assessing the reliability of conclusions from microarray experiments.
PNAS 97: 8961-8965.
[Paper]
- Kerr MK, Churchill GA (2001) Statistical design and the analysis
of gene expression microarrays. Genet Res 77: 123-128.
[doi:10.1017/S0016672301005055]
- Kerr MK, Martin M, Churchill GA (2001) Analysis of
variance for gene expression microarray data.
J Comp Biol 7: 819-837.
[doi:10.1089/10665270050514954]
- Kim PM, Tidor B (2003)
Subsystem identification through dimensionality reduction of
large-scale gene expression data.
Genome Research 13: 1706-1718.
[doi:10.1101/gr.903503]
- Kim S, Dougherty ER, Bittner ML, Chen Y, Sivakumar K, Meltzer P,
Trent JM (2000) General nonlinear framework for the analysis of gene
interaction via multivariate expression arrays. J Biomed Opt
5: 411-424.
[doi:10.1117/1.1289142]
- Kluger Y, Basri R, Chang JT, Gerstein M (2003)
Spectral biclustering of microarray data: coclustering genes and
conditions.
Genome Res 13: 703-716.
[doi:10.1101/gr.648603]
- Lander ES (1999) Array of hope. Nat Genet 21 (Suppl): 3-4.
[doi:10.1038/4427]
- Larkin JE, Frank BC, Gavras H, Sultana R, Quackenbush J (2005)
Independence and reproducibility across microarray platforms.
Nat Methods 2: 337-344.
[doi:10.1038/nmeth757]
- Lazzeroni L, Owen AB (2000) Plaid models for gene
expression data.
TR 211, Department of Statistics, Stanford U, August 2000.
[Paper]
- Lee CK, Klopp RG, Weindruch R, Prolla TA (1999) Gene expression profile of aging and its retardation by caloric restriction.
Science 285: 1390-1393.
[doi:10.1126/science.285.5432.1390]
- Lee MLT, Kuo FC, Whitmore GA, Sklar J (2000)
Importance of replication in microarray gene expression
studies: statistical methods and evidence from repetitive
cDNA hybridizations. PNAS 97: 9834-9839.
[Paper]
- Li C, Wong WH (2001) Model-based analysis of oligonucleotide
arrays: Expression index computation and outlier
detection. PNAS 98: 31-36.
[Paper]
[Software]
- Lin Y, Nadler ST, Lan H,Attie AD, Yandell BS (2002)
Adaptive gene picking with microarray data: detecting important
low abundance signals. in
The Analysis of Gene
Expression Data: Methods and Software, ed by
G Parmigiani, ES Garrett, RA Irizarry, SL Zeger.
Springer-Verlag, ch. 13.
[Paper]
- Lipshutz RJ, Fodor SP, Gingeras TR, Lockhart DJ (1999)
High density synthetic oligonucleotide arrays.
Nat Genet 21(Suppl): 20-24.
[doi:10.1038/4447]
- Liu J (2002)
Bayesian modeling and computation in bioinformatics research.
In Topics in Computational Biology , ed. T. Jiang, Y. Xu, and
M. Zhang. MIT Press: Cambridge, MA.
[Paper]
- Lockhart DJ, Dong H, Byrne MC, Follettie MT, Gallo MV, Chee MS,
Mittmann M, Wang C, Kobayashi M, Horton H, Brown EL (1996)
Expression monitoring by hybridization to high-density oligonucleotide arrays.
Nat Biotech 14: 1675-1680.
[doi:10.1038/nbt1296-1675]
- Lockhart DJ, Winzeler EA (2000)
Genomics, gene expression and DNA arrays.
Nature 405: 827-836.
[doi:10.1038/35015701]
- Long AD, Mangalam HJ, Chan BYP, Tolleri L, Hatfield GW, Baldi P (2001)
Gene expression profiling in Escherichia coli
K12: Improved statistical inference from DNA
microarray data using analysis of variance
and a Bayesian statistical framework.
J Biol Chem 276: 19937-19944.
[doi:10.1074/jbc.M010192200]
- Lonnstedt I, Speed TP (2002)
Replicated microarray data.
Statistical Sinica 12: 31-46.
[Paper]
- Mills JC, Gordon JI (2001) A new approach for filtering noise
from high-density oligonucleotide microarray datasets.
Nucleic Acids Res 29: e72.
[Paper]
- Nadler ST, Stoehr JP, Schueler KL, Tanimoto G, Yandell BS, Attie AD
(2000) The expression of adipogenic genes is decreased in obesity and
Diabetes mellitus. PNAS 97: 11371-11376.
[Paper]
- Newton MA, Kendziorski CM, Richmond CS,
Blattner FR, Tsui KW (2001) On differential variability of
expression ratios: Improving statistical inference about gene
expression changes from microarray data. J Comp Biol 8:
37-52.
[doi:10.1089/106652701300099074 | Software | Data]
- Newton MA, Noueiry A, Sarkar D, Ahlquist P (2004)
Detecting differential gene expression with a semiparametric hierarchical mixture method.
Biostatistics 5: 155-176.
[doi:10.1093/biostatistics/5.2.155]
- Newton MA, Quintana FA, den Boon JA, Sengupta S, Ahlquist P (2007)
Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis.
Ann Appl Statist 1: 85-106
[doi:10.1214/07-AOAS104]
- Newton MA, Wang P, Kendziorski C (2006)
Hierarchical mixture models for expression profiles.
In Bayesian Inference for Gene Expression and
Proteomics,
Ahn Do K, Mueller PM, Vannucci M, eds. Cambridge U Press.
[Paper]
- Page GP, Edwards JW, Yelisetti P, Wang J, Trivedi P, Allison DB (2006)
The PowerAtlas: a power and sample size atlas for microarray experimental design and research.
BMC Bioinfo 7: 84.
[doi:10.1186/1471-2105-7-84 |
Software]
- Pan W, Lin J, Le C (2003)
A mixture model approach to detecting
differentially expressed genes with microarray data.
Functional Integrative Genomics 3: 117-124.
[doi:10.1007/s10142-003-0085-7]
- Pan W, Lin J, Le C (2002) How many replicates of arrays are required to
detect gene expression changes in microarray experiments? A mixture model
approach.
Genome Biol 3: research0022.
[doi:10.1186/gb-2002-3-5-research0022]
- Pan W, Lin J, Le C (2002)
Model-based cluster analysis of microarray gene
expression data.
Genome Biol 3: research0009.
[doi:10.1186/gb-2002-3-2-research0009]
- Pollard KS, van der Laan MJ (2002)
Statistical inference for simultaneous clustering of gene
expression data.
Math Biosci 176: 99-121.
[doi:10.1016/S0025-5564(01)00116-X | Paper]
- Richmond CS, Glasner JD, Mau R, Jin H, Blattner FR (1999)
Genome-wide expression profiling in Escherichia coli K-12.
Nucleic Acids Res. 27: 3821-3835.
[Paper]
- Roberts CJ, Nelson B, Marton MJ, Stoughton R, Meyer MR, Bennett
HA, He YDD, Dai HY, Walker WL, Hughes TR, Tyers M, Boone C,
Friend SH (2000) Signaling and circuitry of multiple MAPK
pathways revealed by a matrix of global gene expression
profiles. Science 287: 873-880.
[doi:10.1126/science.287.5454.873]
[Supplement]
[Data (7.5Mb)]
- Schadt EE, Li C, Ellis B, Wong WH (1999) Feature extraction and
normalization algorithms for high-density oligonucleotide gene
expression array data. Technical Report 303, Department of Statistics,
UCLA.
[Paper]
- Schadt EE, Li C, Su C, Wong WH (2001) Analyzing high-density
oligonucleotide gene expression array data.
J Cell Biochem 80: 192-202.
[doi:10.1002/1097-4644(20010201)80:2<192::AID-JCB50>3.0.CO;2-W]
- Shatkay H, Edwards S, Wilbur WJ, Boguski M (2000) Genes
themes and microarrays: using information retrieval for large-scale
gene analysis. In Proceedings of the 8th International Conference on
Intelligent Systems for Molecular Biology. AAAI Press.
- Shavlik J, Calcari S, Eliassi-Rad T, Solock J (1999) An
instructable, adaptive interface for discovering and monitoring
information on the world-wide web. In Proceedings of the International
Conference on User Interfaces. ACM Press.
[Paper]
- Shi L, et al. Slikker W Jr (2006)
The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements.
Nat Biotech 24: 1151-1161.
[http://dx.doi.org/10.1038/nbt1239]
- Shi J, Levinson DF, Whittemore AS (2007)
Significance levels for studies with correlated test
statistics.
Biostatistics
[doi:10.1093/biostatistics/kxm047]
- Smith A, Satagopan JM, Gonen M, Begg CB (2001) Class prediction
for gene expression data.
Nat Genet 0: 000-000.
- Soukas A, Cohen P, Socci ND, Friedman JM (2000)
Leptin-specific patterns of gene expression in white adipose tissue
Genes Dev. 14: 963-980.
[Paper]
[Supplement]
- Spang R, Zuzan H, West M, Nevins JR, Marks JR, Blanchette C
(2000) Prediction and uncertainty in the analysis of
gene expression profiles. Technical Report 00-31, Institute of
Statistics and Decision Sciences, Duke University.
[Paper]
- Spellman PT, Sherlock G, Zhang MQ, Iyer VR,
Anders K, Eisen MB, Brown PO, Botstein D Futcher B (1998)
Comprehensive identification of cell
cycle-regulated genes of the yeast Saccharomyces cerevisiae by
microarray hybridization. Molecular Biology of the Cell 9:
3273-3297.
[Paper]
- Storey JD, Tibshirani R (2003)
Statistical significance for genomewide studies.
PNAS 100: 9440-9445.
[Paper]
- Stuart JM, Segal E, Koller D, Kim SK (2003)
A gene-coexpression network for global discovery of conserved
genetic modules.
Science 302: 249-255.
- Stuart RO, Bush KT, Nigam SK (2001)
Changes in global gene expression patterns
during development and maturation of the
rat kidney. PNAS 98: 5649-5654.
[Paper]
- Sun N, Carroll RJ, Zhao H (2006)
Bayesian error analysis model for reconstructing
transcriptional regulatory networks.
PNAS 103: 7988-7993.
[doi:10.1073/pnas.0600164103]
- Tamayo P, Slonim D, Mesirov J, Zhu Q, Kitareewan S, Dmitrovsky
E, Lander ES, Golub TR (1999) Interpreting patterns of gene
expression with self-organizing maps: Methods and application
to hematopoietic differentiation. PNAS 96: 2907-2912.
[Paper]
- Theilhaber J, Bushnell S, Jackson A, Fuchs R (1999) Bayesian
estimation of fold-changes in the analysis of gene expression: the
PFOLD algorithm. Nat Genet: The Microarray Meeting:
Technology, Application and Analysis (submitted).
[Abstract]
- Thomas JG, Olson JM, Tapscott SJ, Zhao LP (2001)
An efficient and robust statistical modeling approach to discover
differentially expressed genes using genomic expression profiles
Genome Res 11: 1227-1236.
[Paper]
- Tibshirani R, Hastie T, Eisen M, Ross D, Botstein D, Brown P
(1999) Clustering methods for the analysis of DNA microarray data.
Technical Report, Department of Statistics, Stanford U.
[Paper]
- Trivedi P, Edwards JW, Wang J, Gadbury GL, Srinivasasainagendra V, Zakharkin SO, Kim K, Mehta T, Brand JPL, Patki A, Page GP, Allison DB (2005)
HDBStat!: A platform-independent software suite for
statistical analysis of high dimensional biology data.
BMC Bioinfo 6: 86.
[doi:10.1186/1471-2105-6-86
| Software]
- van der Laan MJ, Bryan JF 2000. Gene expression analysis with the
parametric bootstrap. Technical Report, Biostatistics Program,
UC Berkeley.
[Paper]
- Wall M (2001)
Singular value decomposition analysis of microarray data.
Los Alamos National Labs.
[Abstract]
- West M (2000) Bayesian regression analysis in the "Large p, Small
n" paradigm. Technical Report 00-22, Institute of Statistics and
Decision Sciences, Duke University.
[Paper]
- West M, Blanchette C, Dressman H, Huang E, Ishida S, Spang R,
Zuzan H, Marks JR, Nevins JR (2001)
Predicting the clinical status of human breast cancer using
gene expression profiles. Technical Report 01-01, Institute of
Statistics and Decision Sciences, Duke University.
[Paper]
- West M, Nevins JR,
Marks JR, Spang R, Zuzan H (2000). DNA microarray data
analysis and regression modeling for genetic expression
profiling. Technical Report 00-15, Institute of Statistics and
Decision Sciences, Duke University.
[Paper]
- Wittes J, Friedman HP (1999) Searching for evidence
of altered gene expression: a comment on statistical analysis
of microarray data. J Nat Cancer Inst 91: 400-401.
[Paper]
- Wolfinger RD, Gibson G, Wolfinger ED, Bennett L, Hamadeh H,
Bushel P, Afshari C, Paules RS (2001) Assessing gene
significance from cDNA microarray expression data via mixed
models. {\em J. Comp. Biol 8}: 625-637.
[Software]
- Woolf PJ, Wang Y (2000) A fuzzy logic approach to analyzing gene
expression data. Physiological Genomics 3 9-15.
[Paper]
- Yang YH, Buckley MJ, Dudoit S, Speed TP (2000) Comparison of
methods for image analysis on cDNA microarray data.
Technical Report 584, Department of Statistics, UC-Berkeley.
[Paper]
- Yang YH, Dudoit S, Luu P, Speed TP (2000) Normalization for cDNA
microarray data.
Technical Report 589, Department of Statistics, UC-Berkeley.
[Paper]
- Yeung KY, Fraley C, Murua A, Raftery AE, Ruzzo WL (2001)
Model-based clustering and data transformations for gene expression data.
TR 396, Statistics Department, U WA Seattle.
[Paper]
- Zhang K, Zhao H (2000)
Assessing reliability of gene clusters from gene expression
data.
Funct Integr Genomics 1: 156-173.
[doi:10.1007/s101420000019]
- Zhang L, Miles MF, Aldape KD (2003)
A model of molecular interactions on short oligonucleotide microarrays.
Nat Biotechnol 21: 818-21.
[Paper
|
Software]
- Zhang MQ (1999) Large-scale gene expression data analysis: a new
challenge to computational biologists.
Genome Res. 1999 9: 681-688.
[Paper]
- Zhao LP, Prentice R, Breeden L (2001)
Statistical modeling of large microarray data sets
to identify stimulus-response profiles
Proc. Natl. Acad. Sci. USA 98: 5631-5636.
[Paper]
- Zien A, Kuffner R, Zimmer R, Lenguaer T (2000) Analysis of
gene expression data with pathway scores. In Proceedings of the 8th
International Conference on Intelligent Systems for Molecular Biology.
AAAI Press.
Return to Statistical Genomics References.
Brian Yandell
(yandell@stat.wisc.edu)