Learn More
We propose a mathematical formulation for the notion of optimal projective cluster, starting from natural requirements on the density of points in subspaces. This allows us to develop a Monte Carlo algorithm for iteratively computing projective clusters. We prove that the computed clusters are good with high probability. We implemented a modified version of(More)
We propose a representation for gene expression data called conserved gene expression motifs or XMOTIFs. A gene's expression level is conserved across a set of samples if the gene is expressed with the same abundance in all the samples. A conserved gene expression motif is a subset of genes that is simultaneously conserved across a subset of samples. We(More)
RankGene is a program for analyzing gene expression data and computing diagnostic genes based on their predictive power in distinguishing between different types of samples. The program integrates into one system a variety of popular ranking criteria, ranging from the traditional t-statistic to one-dimensional support vector machines. This flexibility makes(More)
Infectious diseases result in millions of deaths each year. Mechanisms of infection have been studied in detail for many pathogens. However, many questions are relatively unexplored. What are the properties of human proteins that interact with pathogens? Do pathogens interact with certain functional classes of human proteins? Which infection mechanisms and(More)
Free Air [CO(2)] Enrichment (FACE) allows for plant growth under fully open-air conditions of elevated [CO(2)] at concentrations expected to be reached by mid-century. We used Arabidopsis thaliana ecotypes Col-0, Cvi-0, and WS to analyze changes in gene expression and metabolite profiles of plants grown in "SoyFACE" (http://www.soyface.uiuc.edu/), a system(More)
MOTIVATION Infectious diseases such as malaria result in millions of deaths each year. An important aspect of any host-pathogen system is the mechanism by which a pathogen can infect its host. One method of infection is via protein-protein interactions (PPIs) where pathogen proteins target host proteins. Developing computational methods that identify which(More)
We suggest that state policy makers begin by eliminating those activities that are clearly best left to federal agencies—drug approval and drug safety, for example, or situations in which national health and security are at stake, such as pandemic preparedness. Next, it's important to recognize that federal/ state partnerships make considerable sense in a(More)
BACKGROUND Biclustering has emerged as a powerful algorithmic tool for analyzing measurements of gene expression. A number of different methods have emerged for computing biclusters in gene expression data. Many of these algorithms may output a very large number of biclusters with varying degrees of overlap. There are no systematic methods that create a(More)
BACKGROUND Bacillus anthracis, Francisella tularensis, and Yersinia pestis are bacterial pathogens that can cause anthrax, lethal acute pneumonic disease, and bubonic plague, respectively, and are listed as NIAID Category A priority pathogens for possible use as biological weapons. However, the interactions between human proteins and proteins in these(More)
We introduce two new related metrics, the geodesic width and the link width, for measuring the " distance " between two non-intersecting polylines in the plane. If the two polylines have n vertices in total, we present algorithms to compute the geodesic width of the two polylines in O(n 2 log n) time using O(n 2) space and the link width in O(n 3 log n)(More)