Dynamo: Amazon's highly available key-value store

@inproceedings{deCandia2007DynamoAH,
  title={Dynamo: {A}mazon's highly available key-value store},
  author={Giuseppe DeCandia and Deniz Hastorun and M. Jampani and Gunavardhan Kakulapati and A. Lakshman and A. Pilchin and S. Sivasubramanian and Peter Vosshall and W. Vogels},
  booktitle={SOSP},
  year={2007}
}
Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components located in many datacenters around the world. At this scale, small and large components…
Millions of Tiny Databases
TLDR
Physalia is a transactional key-value store, optimized for use in large-scale cloud control planes, which takes advantage of knowledge of transaction patterns and infrastructure design to offer both high availability and strong consistency to millions of clients.
Exploring the design space of highly-available distributed transactions
TLDR
A three-way trade-off between read isolation, delay (latency), and data freshness is found and demonstrated, and two isolation properties are proposed: TCC- and PSI-.
Scalable and elastic transactional data stores for cloud computing platforms
TLDR
This dissertation shows that with careful choice of design and features, it is possible to architect scalable DBMSs that efficiently support transactional semantics to ease application design and elastically adapt to fluctuating operational demands to optimize the operating cost.
Performance Sensitive Replication in Geo-distributed Cloud Datastores
TLDR
This paper presents models that optimize percentiles of response time under normal operation and under a data-center (DC) failure in quorum-based cloud storage systems, and evaluates these models using real-world traces of Twitter, Wikipedia, and Gowalla on a Cassandra cluster deployed in Amazon EC2.
ElasTraS: An elastic, scalable, and self-managing transactional database for the cloud
TLDR
ElasTraS leverages Albatross, a low overhead on-demand live database migration technique, for elastic load balancing by adding more servers during high load and consolidating to fewer servers during usage troughs, which minimizes the operating cost and ensures good performance even in the presence of unpredictable changes to the workload.
ElasTraS: An Elastic Transactional Data Store in the Cloud
TLDR
This paper proposes ElasTraS, which addresses the scalability and elasticity of the data store in a cloud computing environment to leverage the elastic nature of the underlying infrastructure, while providing scalable transactional data access.
Efficient and low-cost fault tolerance for web-scale systems
TLDR
This thesis proposes a novel algorithm, called Scrooge, which reduces the replication costs of fast BFT replication in the presence of unresponsive replicas, and shows the existence of an inherent tradeoff between optimal redundancy and minimal latency in the presence of faulty replicas.
Materialized views in Cassandra
TLDR
This paper presents an efficient implementation of materialized views in key-value stores that enables complex query processing and is tailored for efficient maintenance.
Associate Adaptable Transactional Information Store in the Cloud Using Distributed Storage and Meta Data Manager
TLDR
This paper aims at providing the planning of a system current; highlight the major style selections, analysing the various guarantees provided by the system, and distinguishing several vital challenges for the analysis community striving for computing within the cloud.
Adaptive Query Scheduling in Key-Value Data Stores
TLDR
This work proposes the AFIT scheduling strategy, which allows for selective data refreshing and integrates the benefits of SJF-based scheduling with an EDF-like policy; it not only strikes a fine trade-off between QoS and QoD but also automatically adapts to workload settings.

References

SHOWING 1-10 OF 30 REFERENCES
Cluster-based scalable network services
TLDR
A general, layered architecture for building cluster-based scalable network services that encapsulates the above requirements for reuse, and a service-programming model based on composable workers that perform transformation, aggregation, caching, and customization (TACC) of Internet content is proposed.
FAB: building distributed enterprise disk arrays from commodity components
TLDR
It is argued that voting is practical and necessary for reliable, high-throughput storage systems such as FAB, a distributed disk array that provides the reliability of traditional enterprise arrays with lower cost and better scalability.
Beehive: O(1) Lookup Performance for Power-Law Query Distributions in Peer-to-Peer Overlays
TLDR
A proactive replication framework that can provide constant lookup performance for common Zipf-like query distributions and can realistically achieve good latencies, outperform passive caching, and adapt efficiently to sudden changes in object popularity, also known as flash crowds.
SEDA: an architecture for well-conditioned, scalable internet services
TLDR
This work presents the SEDA design and an implementation of an Internet services platform based on this architecture, and describes several control mechanisms for automatic tuning and load conditioning, including thread pool sizing, event batching, and adaptive load shedding.
OceanStore: an architecture for global-scale persistent storage
TLDR
OceanStore monitoring of usage patterns allows adaptation to regional outages and denial of service attacks; monitoring also enhances performance through pro-active movement of data.
Chord: A scalable peer-to-peer lookup service for internet applications
TLDR
Results from theoretical analysis, simulations, and experiments show that Chord is scalable, with communication cost and the state maintained by each node scaling logarithmically with the number of Chord nodes.
Bigtable: A Distributed Storage System for Structured Data
TLDR
The simple data model provided by Bigtable is described, which gives clients dynamic control over data layout and format, and the design and implementation of Bigtable are described.
Antiquity: exploiting a secure log for wide-area distributed storage
TLDR
Antiquity uses a secure log to maintain data integrity, replicates each log on multiple servers for durability, and uses dynamic Byzantine fault-tolerant quorum protocols to ensure consistency among replicas.
Farsite: federated, available, and reliable storage for an incompletely trusted environment
TLDR
The design of Farsite is reported on, along with the lessons learned by implementing much of that design, including locally caching file data, lazily propagating file updates, and varying the duration and granularity of content leases.
Storage management and caching in PAST, a large-scale, persistent peer-to-peer storage utility
TLDR
The storage management and caching in PAST, a large-scale peer-to-peer persistent storage utility based on a self-organizing, Internet-based overlay network of storage nodes that cooperatively route file queries, store multiple replicas of files, and cache additional copies of popular files, is evaluated.