Mahmoud Ghandi

Learn More
Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a(More)
Recent studies have revealed that ARID1A, encoding AT-rich interactive domain 1A (SWI-like), is frequently mutated across a variety of human cancers and also has bona fide tumor suppressor properties. Consequently, identification of vulnerabilities conferred by ARID1A mutation would have major relevance for human cancer. Here, using a broad screening(More)
PIK3CA (which encodes the PI3K alpha isoform) is the most frequently mutated oncogene in breast cancer. Small-molecule PI3K inhibitors have shown promise in clinical trials; however, intrinsic and acquired resistance limits their utility. We used a systematic gain-of-function approach to identify genes whose upregulation confers resistance to the PI3K(More)
The use of targeted therapeutics directed against BRAF(V600)-mutant metastatic melanoma improves progression-free survival in many patients; however, acquired drug resistance remains a major medical challenge. By far, the most common clinical resistance mechanism involves reactivation of the MAPK (RAF/MEK/ERK) pathway by a variety of mechanisms. Thus,(More)
Pediatric-type nodal follicular lymphoma (PTNFL) is a variant of follicular lymphoma (FL) characterized by limited-stage presentation and invariably benign behavior despite often high-grade histological appearance. It is important to distinguish PTNFL from typical FL in order to avoid unnecessary treatment; however, this distinction relies solely on(More)
Oligomers of fixed length, k, commonly known as k-mers, are often used as fundamental elements in the description of DNA sequence features of diverse biological function, or as intermediate elements in the constuction of more complex descriptors of sequence features such as position weight matrices. k-mers are very useful as general sequence features(More)
UNLABELLED We present a new R package for training gapped-kmer SVM classifiers for DNA and protein sequences. We describe an improved algorithm for kernel matrix calculation that speeds run time by about 2 to 5-fold over our original gkmSVM algorithm. This package supports several sequence kernels, including: gkmSVM, kmer-SVM, mismatch kernel and wildcard(More)
The use of proteasome inhibitors to target cancer's dependence on altered protein homeostasis has been greatly limited by intrinsic and acquired resistance. Analyzing data from thousands of cancer lines and tumors, we find that those with suppressed expression of one or more 19S proteasome subunits show intrinsic proteasome inhibitor resistance. Moreover,(More)
Data normalization is a crucial preliminary step in analyzing genomic datasets. The goal of normalization is to remove global variation to make readings across different experiments comparable. In addition, most genomic loci have non-uniform sensitivity to any given assay because of variation in local sequence properties. In microarray experiments, this(More)
  • 1