M. Maruf Hossain

Gene expression information from microarray experiments is a primary form of data for biological analysis and can offer insights into disease processes and cellular behaviour. Building classifiers for such datasets is particularly challenging because of their very high dimensionality and small sample size. Decision trees are a seemingly attractive ...
This paper presents an ensemble of feature selection and classification techniques for distinguishing two types of breast lesion, benign and malignant. Features are selected based on their area under the ROC curve (AUC) and are then classified using a hybrid hidden Markov model (HMM)-fuzzy approach. HMM-generated log-likelihood values are used to generate ...
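As a rough illustration of the AUC-based selection step mentioned above, the following minimal sketch ranks features by their individual AUC. It is not the paper's implementation: the function names `auc` and `rank_features_by_auc`, the pairwise AUC computation, and the choice to treat a feature with AUC below 0.5 as equally informative as its mirror image are all assumptions made for this example. The HMM-fuzzy classification stage is not shown.

```python
def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive sample has the higher score; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def rank_features_by_auc(X, y, top_k):
    """Return the indices of the top_k features, ranked by single-feature AUC.

    Assumption: a feature is scored by max(AUC, 1 - AUC), so a feature that
    separates the classes in the opposite direction still ranks highly.
    """
    n_features = len(X[0])
    scored = []
    for j in range(n_features):
        column = [row[j] for row in X]
        a = auc(column, y)
        scored.append((max(a, 1.0 - a), j))
    # Highest score first; ties are broken by the lower feature index.
    scored.sort(key=lambda t: (-t[0], t[1]))
    return [j for _, j in scored[:top_k]]


# Toy data (made up for illustration): 4 samples, 3 features, label 1 = malignant.
X = [[1, 5, 0.2], [2, 3, 0.9], [3, 4, 0.1], [4, 1, 0.8]]
y = [0, 0, 1, 1]
selected = rank_features_by_auc(X, y, top_k=2)
```

The pairwise loop is quadratic in the number of samples, which is acceptable for small datasets; a rank-based formulation would be preferable for larger ones.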
Support Vector Machines (SVMs) are a leading tool in classification and pattern recognition, and the kernel function is one of their most important components. This function is used to map the input space into a high-dimensional feature space. However, it can perform rather poorly when there are too many dimensions (e.g. for gene expression data) or when there ...
This paper introduces a new cost function for evaluating multi-class classifiers. The new cost function provides both a way to visualize the performance (expected cost) of a multi-class classifier and a summary of the misclassification costs. This function overcomes the limitations of ROC in not being able to represent the classifier performance ...
The k-nearest neighbour (k-NN) technique, due to its interpretable nature, is a simple and intuitively appealing method for addressing classification problems. However, choosing an appropriate distance function for k-NN can be challenging, and an inferior choice can make the classifier highly vulnerable to noise in the data. In this paper, we propose a new ...
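To make the role of the distance function concrete, here is a minimal k-NN sketch in which the distance is a pluggable parameter. It uses the standard Euclidean and Manhattan distances only; the distance function proposed in the paper is not reproduced here, and the names `knn_predict`, `euclidean`, and `manhattan` are chosen for this example.

```python
from collections import Counter
import math


def euclidean(a, b):
    """Standard Euclidean (L2) distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Standard Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train_X, train_y, query, k=3, distance=euclidean):
    """Predict the label of `query` by majority vote among its k nearest
    training samples under the supplied distance function."""
    neighbours = sorted(
        zip(train_X, train_y),
        key=lambda pair: distance(pair[0], query),
    )[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Toy data (made up for illustration): two well-separated clusters.
train_X = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
train_y = ["a", "a", "a", "b", "b", "b"]
near_origin = knn_predict(train_X, train_y, [0.5, 0.5], k=3)
near_far = knn_predict(train_X, train_y, [5.2, 5.1], k=3, distance=manhattan)
```

Because the distance is passed in as an argument, a different metric can be swapped in without changing the voting logic, which is where a noise-robust distance would plug in.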