• Publications
  • Influence
OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases
TLDR
OLTP-Bench is presented, an extensible "batteries included" DBMS benchmarking testbed with its ease of use and extensibility, support for tight control of transaction mixtures, request rates, and access distributions over time, as well as the ability to support all major DBMSs and DBaaS platforms. Expand
ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking
TLDR
A probabilistic framework to make sensible decisions about candidate links and to identify unreliable human workers is developed and developed to improve the quality of the links while limiting the amount of work performed by the crowd. Expand
P-Grid: a self-organizing structured P2P system
TLDR
Self-organizing Structured P2P systems are described, which have generated substantial interest because of emergent globalscale phenomena and the most prominent class of approaches are distributed hash tables (DHT) and Chord. Expand
HYRISE - A Main Memory Hybrid Storage Engine
TLDR
This paper describes a main memory hybrid database system called HYRISE, which automatically partitions tables into vertical partitions of varying widths depending on how the columns of the table are accessed, and shows that it is both more scalable and produces better designs than previous vertical partitioning approaches for main memory systems. Expand
GridVine: Building Internet-Scale Semantic Overlay Networks
TLDR
This paper addresses the problem of building scalable semantic overlay networks by separating a logical layer, the semantic overlay for managing and mapping data and metadata schemas, from a physical layer consisting of a structured peer-to-peer overlay network for efficient routing of messages. Expand
A Demonstration of SciDB: A Science-Oriented DBMS
TLDR
An overview of Sci DB's key features is presented and a demonstration of the first version of SciDB on data and operations from one of the authors' lighthouse users, the Large Synoptic Survey Telescope (LSST). Expand
Revisiting User Mobility and Social Relationships in LBSNs: A Hypergraph Embedding Approach
TLDR
The asymmetric impact of mobility and social relationships on predicting each other is discovered, which can serve as guidelines for future research on friendship and location prediction in LBSNs. Expand
TrajStore: An adaptive storage system for very large trajectory data sets
TLDR
TrajStore is a dynamic storage system optimized for efficiently retrieving all data in a particular spatiotemporal region that maintains an optimal index on the data and dynamically co-locates and compresses spatially and temporally adjacent segments on disk. Expand
The chatty web: emergent semantics through gossiping
TLDR
This paper describes a novel approach for obtaining semantic interoperability among data sources in a bottom-up, semi-automatic manner without relying on pre-existing, global semantic models and develops a formal framework that takes into account both syntactic and semantic criteria. Expand
Are Meta-Paths Necessary?: Revisiting Heterogeneous Graph Embeddings
TLDR
Just, a heterogeneous graph embedding technique using random walks with JUmp and STay strategies to overcome the aforementioned bias in an more efficient manner is proposed, which can not only gracefully balance between homogeneous and heterogeneous edges, it can also balance the node distribution over different domains. Expand
...
1
2
3
4
5
...