Data driven derivation of cutoffs from a pool of 3,030 Affymetrix arrays to stratify distinct clinical types of breast cancer

Abstract

Pooling of microarray datasets seems to be a reasonable approach to increase sample size when a heterogeneous disease like breast cancer is concerned. Different methods for the adaption of datasets have been used in the literature. We have analyzed influences of these strategies using a pool of 3,030 Affymetrix U133A microarrays from breast cancer samples. We present data on the resulting concordance with biochemical assays of well known parameters and highlight critical pitfalls. We further propose a method for the inference of cutoff values directly from the data without prior knowledge of the true result. The cutoffs derived by this method displayed high specificity and sensitivity. Markers with a bimodal distribution like ER, PgR, and HER2 discriminate different biological subtypes of disease with distinct clinical courses. In contrast, markers displaying a continuous distribution like proliferation markers as Ki67 rather describe the composition of the mixture of cells in the tumor.

DOI: 10.1007/s10549-009-0416-z

10 Figures and Tables

0200400200920102011201220132014201520162017
Citations per Year

944 Citations

Semantic Scholar estimates that this publication has 944 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Karn2009DataDD, title={Data driven derivation of cutoffs from a pool of 3,030 Affymetrix arrays to stratify distinct clinical types of breast cancer}, author={Thomas Karn and Dirk Metzler and Eugen Ruckhaeberle and Lars Christian Hanker and Regine Gaetje and Christine Solbach and Andr{\'e} Ahr and Marcus Schmidt and Uwe Holtrich and Manfred Kaufmann and Achim Rody}, journal={Breast Cancer Research and Treatment}, year={2009}, volume={120}, pages={567-579} }