• Publications
  • Influence
Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce
TLDR
Hadoop-GIS - a scalable and high performance spatial data warehousing system for running large scale spatial queries on Hadoop and integrated into Hive to support declarative spatial queries with an integrated architecture is presented. Expand
The Immune Landscape of Cancer
TLDR
An extensive immunogenomic analysis of more than 10,000 tumors comprising 33 diverse cancer types by utilizing data compiled by TCGA identifies six immune subtypes that encompass multiple cancer types and are hypothesized to define immune response patterns impacting prognosis. Expand
Analysis of the Clustering Properties of the Hilbert Space-Filling Curve
TLDR
This work analyzes the clustering property of the Hilbert space-filling curve by deriving closed-form formulas for the number of clusters in a given query region of an arbitrary shape and shows that the Hilbert curve achieves better clustering than the z curve. Expand
Patch-Based Convolutional Neural Network for Whole Slide Tissue Image Classification
TLDR
A novel Expectation-Maximization (EM) based method is formulated that automatically locates discriminative patches robustly by utilizing the spatial relationships of patches and applies it to the classification of glioma and non-small-cell lung carcinoma cases into subtypes. Expand
Sumatra: A Language for Resource-Aware Mobile Programs
TLDR
In this chapter, the design and implementation of Sumatra, an extension of Java that supports resource-aware mobile programs, is described and a distributed resource monitor that provides the information required by Sumatra progams is described. Expand
Active disks: programming model, algorithms and evaluation
TLDR
This paper evaluates Active Disk architectures which integrate significant processing power and memory into a disk drive and allow application-specific code to be downloaded and executed on the data that is being read from (written to) disk. Expand
Communication Optimizations for Irregular Scientific Computations on Distributed Memory Architectures
TLDR
A detailed performance and scalability analysis of the communication primitives is presented, carried out using a workload generator, kernels from real applications, and a large unstructured adaptive application. Expand
Titan: a high-performance remote-sensing database
TLDR
The design, implementation and evaluation of Titan, a parallel shared nothing database designed for handling remote sensing data, are described and the experimental results show that Titan provides good performance for global queries and interactive response times for local queries. Expand
Caveats for the use of operational electronic health record data in comparative effectiveness research.
TLDR
A list of caveats is developed to inform would-be users of such data as well as provide an informatics roadmap that aims to insure this opportunity to augment comparative effectiveness research can be best leveraged. Expand
Histopathological Image Analysis Using Model-Based Intermediate Representations and Color Texture: Follicular Lymphoma Grading
TLDR
A model-based intermediate representation of cytological components that enables higher level semantic description of tissue characteristics and a novel color-texture analysis approach that combines the MBIR with low level texture features, which capture tissue characteristics at pixel level are introduced. Expand
...
1
2
3
4
5
...