#### Filter Results:

- Full text PDF available (82)

#### Publication Year

2003

2018

- This year (8)
- Last 5 years (47)
- Last 10 years (80)

#### Publication Type

#### Co-author

#### Journals and Conferences

#### Method

Learn More

- Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis, Torsten Hothorn
- BMC Bioinformatics
- 2006

Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and related scientific fields, for instance to select a subset of genetic markers relevant for the prediction of a certain disease. We show that random forest variable importance measures areâ€¦ (More)

- Carolin Strobl, Anne-Laure Boulesteix, Thomas Kneib, Thomas Augustin, Achim Zeileis
- BMC Bioinformatics
- 2008

Random forests are becoming increasingly popular in many scientific fields because they can cope with "small n large p" problems, complex interactions and even highly correlated predictor variables. Their variable importance measures have recently been suggested as screening tools for, e.g., gene expression studies. However, these variable importanceâ€¦ (More)

- Anne-Laure Boulesteix, Korbinian Strimmer
- Briefings in Bioinformatics
- 2007

Partial least squares (PLS) is an efficient statistical regression technique that is highly suited for the analysis of genomic and proteomic data. In this article, we review both the theory underlying PLS as well as a host of bioinformatics applications of PLS. In particular, we provide a systematic comparison of the PLS approaches currently employed, andâ€¦ (More)

- Anne-Laure Boulesteix, Korbinian Strimmer
- Theoretical Biology and Medical Modelling
- 2005

The study of the network between transcription factors and their targets is important for understanding the complex regulatory mechanisms in a cell. Unfortunately, with standard microarray experiments it is not possible to measure the transcription factor activities (TFAs) directly, as their own transcription levels are subject to post-translationalâ€¦ (More)

- Anne-Laure Boulesteix
- Biometrical journal. Biometrische Zeitschrift
- 2006

The association between a binary variable Y and a variable X having an at least ordinal measurement scale might be examined by selecting a cutpoint in the range of X and then performing an association test for the obtained 2 x 2 contingency table using the chi-square statistic. The distribution of the maximally selected chi-square statistic (i.e. theâ€¦ (More)

- Anne-Laure Boulesteix
- Statistical applications in genetics andâ€¦
- 2004

Partial Least Squares (PLS) dimension reduction is known to give good prediction accuracy in the context of classification with high-dimensional microarray data. In this paper, the classification procedure consisting of PLS dimension reduction and linear discriminant analysis on the new components is compared with some of the best state-of-the-artâ€¦ (More)

- Anne-Laure Boulesteix, Martin Slawski
- Briefings in Bioinformatics
- 2009

Ranked gene lists are highly instable in the sense that similar measures of differential gene expression may yield very different rankings, and that a small change of the data set usually affects the obtained gene list considerably. Stability issues have long been under-considered in the literature, but they have grown to a hot topic in the last few years,â€¦ (More)

- Nicole KrÃ¤mer, Juliane SchÃ¤fer, Anne-Laure Boulesteix
- BMC Bioinformatics
- 2009

Graphical Gaussian models are popular tools for the estimation of (undirected) gene association networks from microarray data. A key issue when the number of variables greatly exceeds the number of samples is the estimation of the matrix of partial correlations. Since the (Moore-Penrose) inverse of the sample covariance matrix leads to poor estimates inâ€¦ (More)

- Anne-Laure Boulesteix, Christine Porzelius, Martin Daumer
- Bioinformatics
- 2008

MOTIVATION
In the context of clinical bioinformatics methods are needed for assessing the additional predictive value of microarray data compared to simple clinical parameters alone. Such methods should also provide an optimal prediction rule making use of all potentialities of both types of data: they should ideally be able to catch subtypes which are notâ€¦ (More)

- Wessel N. van Wieringen, David Kun, Regina Hampel, Anne-Laure Boulesteix
- Computational Statistics & Data Analysis
- 2009

Preprint submitted to Elsevier 25 May 2008 Knowledge of the transcription of the human genome might greatly enhance our understanding of cancer. In particular, gene expression may be used to predict the survival of cancer patients. Microarray data are characterized by their highdimensionality: the number of covariates (p âˆ¼ 1000) greatly exceeds the numberâ€¦ (More)