Sayan Mukherjee

Learn More
Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets,(More)
The problem of automatically tuning multiple parameters for pattern recognition Support Vector Machines (SVMs) is considered. This is done by minimizing some estimates of the generalization error of SVMs using a gradient descent algorithm over the set of parameters. Usual methods for choosing parameters, based on exhaustive search become intractable as soon(More)
We introduce a method of feature selection for Support Vector Machines. The method is based upon finding those features which minimize bounds on the leave-one-out error. This search can be efficiently performed via gradient descent. The resulting algorithms are shown to be superior to some standard feature selection algorithms on both toy data and real-life(More)
A novel method for regression has been recently proposed by V Vapnik et al The technique called Support Vector Machine SVM is very well founded from the mathematical point of view and seems to provide a new insight in function approximation We implemented the SVM and tested it on the same data base of chaotic time series that was used in to compare the(More)
Two key models for examining activity-dependent development of primary visual cortex (V1) involve either reduction of activity in both eyes via dark-rearing (DR) or imbalance of activity between the two eyes via monocular deprivation (MD). Combining DNA microarray analysis with computational approaches, RT-PCR, immunohistochemistry and physiological(More)
Using gene expression data to classify tumor types is a very promising tool in cancer diagnosis. Previous works show several pairs of tumor types can be successfully distinguished by their gene expression patterns (Golub et al. 1999, Ben-Dor et al. 2000, Alizadeh et al. 2000). However, the simultaneous classification across a heterogeneous set of tumor(More)
Developing theoretical foundations for learning is a key step towards understanding intelligence. 'Learning from examples' is a paradigm in which systems (natural or artificial) learn a functional relationship from a training set of examples. Within this paradigm, a learning algorithm is a map from the space of training sets to the hypothesis space of(More)
A significant number of human X-linked genes escape X chromosome inactivation and are thus expressed from both the active and inactive X chromosomes. The basis for escape from inactivation and the potential role of the X chromosome primary DNA sequence in determining a gene's X inactivation status is unclear. Using a combination of the X chromosome sequence(More)
Using advanced gene targeting methods, generating mouse models of cancer that accurately reproduce the genetic alterations present in human tumors is now relatively straightforward. The challenge is to determine to what extent such models faithfully mimic human disease with respect to the underlying molecular mechanisms that accompany tumor progression.(More)