Learn More
Randomized Aggregatable Privacy-Preserving Ordinal Response, or RAPPOR, is a technology for crowdsourcing statistics from end-user client software, anonymously, with strong privacy guarantees. In short, RAPPORs allow the forest of client data to be studied, without permitting the possibility of looking at individual trees. By applying randomized response in(More)
Blood pressure is a heritable trait influenced by several biological pathways and responsive to environmental stimuli. Over one billion people worldwide have hypertension (≥140 mm Hg systolic blood pressure or  ≥90 mm Hg diastolic blood pressure). Even small increments in blood pressure are associated with an increased risk of cardiovascular events. This(More)
The R package clValid contains functions for validating the results of a clustering analysis. There are three main types of cluster validation measures available, " internal " , " stability " , and " biological ". The user can choose from nine clustering algorithms in existing R packages , including hierarchical, K-means, self-organizing maps (SOM), and(More)
The prevalence of hypertension in African Americans (AAs) is higher than in other US groups; yet, few have performed genome-wide association studies (GWASs) in AA. Among people of European descent, GWASs have identified genetic variants at 13 loci that are associated with blood pressure. It is unknown if these variants confer susceptibility in people of(More)
Discovering genes involved in multiple types of cancers is of significant therapeutic importance. We show that collective evidence for such genes can be obtained via a form of meta-analysis that aggregates the results (rankings and p values) from various cancer-specific microarray experiments. This method is illustrated by a combined analysis of 20(More)
Numerous genetic loci have been associated with systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Europeans. We now report genome-wide association studies of pulse pressure (PP) and mean arterial pressure (MAP). In discovery (N = 74,064) and follow-up studies (N = 48,607), we identified at genome-wide significance (P = 2.7 × 10(-8) to P =(More)
BACKGROUND Researchers in the field of bioinformatics often face a challenge of combining several ordered lists in a proper and efficient manner. Rank aggregation techniques offer a general and flexible framework that allows one to objectively perform the necessary aggregation. With the rapid growth of high-throughput genomic and proteomic studies, the(More)
MOTIVATION Biologists often employ clustering techniques in the explorative phase of microarray data analysis to discover relevant biological groupings. Given the availability of numerous clustering algorithms in the machine-learning literature, an user might want to select one that performs the best for his/her data set or application. While various(More)
Autism is a multifactorial neurodevelopmental disorder affecting more males than females; consequently, under a multifactorial genetic hypothesis, females are affected only when they cross a higher biological threshold. We hypothesize that deleterious variants at conserved residues are enriched in severely affected patients arising from female-enriched(More)
MOTIVATION Gene association/interaction networks provide vast amounts of information about essential processes inside the cell. A complete picture of gene-gene associations/interactions would open new horizons for biologists, ranging from pure appreciation to successful manipulation of biological pathways for therapeutic purposes. Therefore, identification(More)