Efficient Batched Synchronization in Dropbox-Like Cloud Storage Services

@inproceedings{Li2013EfficientBS,
  title={Efficient Batched Synchronization in Dropbox-Like Cloud Storage Services},
  author={Zhenhua Li and Christo Wilson and Zhefu Jiang and Yao Liu and Ben Y. Zhao and Cheng Jin and Zhi-Li Zhang and Yafei Dai},
  booktitle={Middleware},
  year={2013}
}
As tools for personal storage, file synchronization and data sharing, cloud storage services such as Dropbox have quickly gained popularity. These services provide users with ubiquitous, reliable data storage that can be automatically synced across multiple devices, and also shared among a group of users. To minimize the network overhead, cloud storage services employ binary diff, data compression, and other mechanisms when transferring updates among users. However, despite these optimizations… 
Efficient Batched Synchronization for Cloud Storage Services
TLDR
This work proposes the update-batched delayed synchronization (UDS) mechanism, which acts as a middleware between the user’s file storage system and a cloud storage application to significantly reduce the overhead caused by session maintenance traffic, while preserving the rapid file synchronization that users expect from cloud storage services.
Understanding and Surpassing Dropbox: Efficient Incremental Synchronization in Cloud Storage Services
TLDR
This paper designs and implements an efficient incremental synchronization system named Minbox, and employs an efficient locality-sensitive hash for better similar chunk matching and significantly reduces network traffic and resolves file conflict with little overhead.
T-CloudDisk: a tunable cloud storage service for flexible batched synchronization
TLDR
This paper introduces the characteristics, technical approach, and preliminary timeline of T-CloudDisk, a tunable cloud storage service for flexible batched synchronization for flexiblebatched synchronization in the cloud.
Towards Network-level Efficiency for Cloud Storage Services
TLDR
A novel metric named TUE is defined to quantify the Traffic Usage Efficiency} of data synchronization of cloud storage services and demonstrates that a considerable portion of the data sync traffic is in a sense wasteful, and can be effectively avoided or significantly reduced via carefully designed data sync mechanisms.
QuickSync: Improving Synchronization Efficiency for Mobile Cloud Storage Services
TLDR
QuickSync is proposed, a system with three novel techniques to improve the sync efficiency for mobile cloud storage services, and is able to reduce up to 73.1 percent sync time in the authors' experiment settings.
A Quantitative and Comparative Study of Network-Level Efficiency for Cloud Storage Services
TLDR
A novel metric TUE is defined to quantify the Traffic Usage Efficiency of data synchronization and reveals that a considerable portion of the data sync traffic is, in a sense, wasteful and can be effectively avoided or significantly reduced via carefully designed data sync mechanisms.
PandaSync: Network and Workload Aware Hybrid Cloud Sync Optimization
TLDR
A hybrid cloud sync scheme that combines full sync and delta sync dynamically based on file size and network conditions is proposed, and an optimization, Full2Sync, is proposed that merges the sync request with the file-sending request to reduce the number of network round-trips between the client and the cloud servers.
A Performance Evaluation of Dropbox in the light of Personal Cloud Storage Systems
TLDR
The proposed study gives an idea about the architecture of personal cloud storage services in general and how can they be improved in terms of performance, security, and usability.
HyCloud: Tweaking Hybrid Cloud Storage Services for Cost-Efficient Filesystem Hosting
TLDR
An open-source system called HyCloud is designed and implemented, which automatically invokes the data APIs of S3 and EFS on behalf of users, and handles the data transfer among S3, EFS and the clients.
HyCloud: Tweaking Hybrid Cloud Storage Services for Cost-Efficient Filesystem Hosting
TLDR
This article designs and implements an open-source system called HyCloud, which automatically invokes the data APIs of S3 and EFS on behalf of users, and intelligently schedules the data transfer among S3, EFS and the clients in a distributed manner.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 32 REFERENCES
On the impact of virtualization on Dropbox-like cloud file storage/synchronization services
TLDR
This paper re-model the resource provisioning problem in the Dropbox-like systems and presents an interference-aware solution that smartly allocates the Dropbox tasks to different cloud instances and remarkably reduces the synchronization delay for this new generation of file hosting service.
Inside dropbox: understanding personal cloud storage services
TLDR
This paper is the first to study Dropbox, which is shown to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days.
BlueSky: a cloud-backed file system for the enterprise
TLDR
BlueSky stores data persistently in a cloud storage provider such as Amazon S3 or Windows Azure, allowing users to take advantage of the reliability and large storage capacity of cloud providers and avoid the need for dedicated server hardware.
Dark Clouds on the Horizon: Using Cloud Storage as Attack Vector and Online Slack Space
TLDR
It is shown that Dropbox is used to store copyright-protected files from a popular filesharing network and Dropbox can be exploited to hide files in the cloud with unlimited storage capacity, defined as online slack space.
Windows Azure Storage: a highly available cloud storage service with strong consistency
TLDR
The WAS architecture, global namespace, and data model is described, as well as its resource provisioning, load balancing, and replication systems.
Side Channels in Cloud Services: Deduplication in Cloud Storage
As the volume of data increases, so does the demand for online storage services, from simple backup services to cloud storage infrastructures. Although deduplication is most effective when applied
DepSky: Dependable and Secure Storage in a Cloud-of-Clouds
TLDR
DepSky is presented, a system that improves the availability, integrity, and confidentiality of information stored in the cloud through the encryption, encoding, and replication of the data on diverse clouds that form a cloud-of-clouds.
Proofs of ownership in remote storage systems
TLDR
This work identifies attacks that exploit client-side deduplication, allowing an attacker to gain access to arbitrary-size files of other users based on a very small hash signatures of these files, and introduces the notion of proofs-of-ownership (PoWs), which lets a client efficiently prove to a server that that the client holds a file, rather than just some short information about it.
Benchmarking personal cloud storage
TLDR
The methodology is applied to compare 5 popular offers of cloud storage services, revealing different system architectures and capabilities, and the implications on performance of different designs are assessed executing a series of benchmarks.
The good, the bad and the ugly of consumer cloud storage
TLDR
Four popular consumer cloud storage offerings - Mozy, Carbonite, Dropbox, and CrashPlan are evaluated to determine if they live up to the benefits users expect and derive a set of lessons and recommendations that if followed more uniformly, could substantially improve the cloud storage experience for many consumers.
...
1
2
3
4
...