Dynamic Selection of Virtual Machines for Application Servers in Cloud Environments

@inproceedings{Grozev2017DynamicSO,
  title={Dynamic Selection of Virtual Machines for Application Servers in Cloud Environments},
  author={Nikolay Grozev and Rajkumar Buyya},
  booktitle={Research Advances in Cloud Computing},
  year={2017}
}
  • N. Grozev, R. Buyya
  • Published in
    Research Advances in Cloud…
    7 February 2016
  • Computer Science
Autoscaling is a hallmark of cloud computing as it allows flexible just-in-time allocation and release of computational resources in response to dynamic and often unpredictable workloads. This is especially important for web applications, whose workload is time-dependent and prone to flash crowds. Most of them follow the 3-tier architectural pattern, and are divided into presentation, application/domain and data layers. In this work, we focus on the application layer. Reactive autoscaling…
Auto-Scaling Web Applications in Clouds
TLDR
A taxonomy of auto-scalers according to the identified challenges and key properties is presented and new future directions that can be explored in this area are proposed.
Auto-scaling Web Applications in Clouds: A Taxonomy and Survey
TLDR
A taxonomy of auto-scalers according to the identified challenges and key properties is presented and new future directions that can be explored in this area are proposed.
Auto-scaling and deployment of web applications in distributed computing clouds
Cloud computing, which allows users to acquire/release resources based on real-time demand from large data centers in a pay-as-you-go model, has attracted considerable attention from the ICT industry.
Performance Prediction for Enabling Intelligent Resource Management on Big Data Processing Workflows
Mobile cloud computing offers an augmented infrastructure that allows resource-constrained devices to use remote computational resources as an enabler for highly intensive computation, thus improvi

References

Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment
TLDR
A novel architecture for threshold-based dynamic scaling of web applications in a virtualized Cloud Computing environment is described, and a dynamic scaling algorithm that automatically provisions virtual machine resources based on a threshold number of active sessions is introduced.
Adaptive, Model-driven Autoscaling for Cloud Applications
TLDR
This paper proposes a new cloud service, Dependable Compute Cloud (DC2), that automatically scales the infrastructure to meet the user-specified performance requirements, and employs Kalman filtering to automatically learn the (possibly changing) system parameters for each application.
EC2 Performance Analysis for Resource Provisioning of Service-Oriented Applications
TLDR
It is demonstrated that the performance of virtual instances in Amazon EC2 is relatively stable over time with fluctuations of mean response time within at most 8% of the long-term average.
Modeling virtual machine performance: challenges and approaches
TLDR
This paper examines the challenges of modeling virtual machine (VM) performance on a datacenter server, describes a VM performance modeling approach, and presents a detailed case study based on the vConsolidate benchmark.
Scalability and Performance of Web Applications in a Compute Cloud
TLDR
A novel dynamic scaling architecture with a front-end load-balancer for routing user requests to web applications deployed on virtual machine instances with the goal of maximizing resource utilization in instances while minimizing total number of instances is described.
Performance Modelling and Simulation of Three-Tier Applications in Cloud and Multi-Cloud Environments
TLDR
An analytical performance model of 3-tier applications in Cloud and Multi-Cloud environments that takes into account the performance of the persistent storage and the heterogeneity of cloud data centres in terms of Virtual Machine (VM) performance is proposed.
Autoscaling Web Applications in Heterogeneous Cloud Infrastructures
TLDR
This paper presents an autoscaling system that overcomes limitations by exploiting heterogeneous types of resources, and by defining multiple levels of QoS requirements, and shows significant reductions in QoS-level violations when faced with highly variable workloads.
Autonomic mix-aware provisioning for non-stationary data center workloads
TLDR
It is argued that the non-stationarity in Internet application workloads, which causes the request mix to change over time, can have a significant impact on the overall processing demands imposed on data center servers.
A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments
TLDR
This work proposes a classification of techniques for automating application scaling in the cloud into five main categories: static threshold-based rules, control theory, reinforcement learning, queuing theory and time series analysis, and uses this classification to carry out a literature review of proposals.