Model Construction and Data Management of Running Log in Supporting SaaS Software Performance Analysis

  title={Model Construction and Data Management of Running Log in Supporting SaaS Software Performance Analysis},
  author={Rui Wang and Shi Ying and Chengai Sun and Hongyan Wan and Huo-lin Zhang and Xiangyang Jia},
Changes in operating environment may result in the performance degradation to a SaaS software. Analyzing running log is an efficient method to locate this problem. However, as a long-running software, SaaS may generate huge log data which is difficult to analyze, and it lacks a systematic approach to implement management of the running log. These all threaten the timeliness of SaaS performance analysis. In this paper, we define a log format to standardize multi-source heterogeneous log data and… 

Figures and Tables from this paper

Log Data Modeling and Acquisition in Supporting SaaS Software Performance Issue Diagnosis
Data logging is helpful for the operation and maintenance manager of SaaS-based solutions to diagnose performance issues but long-running SAAS software may generate huge amounts of log data that need to be accounted for.
SaaS software performance issue identification using HMRF‐MAP framework
The evaluation results show that the proposed automatic performance issue identification method has small system overhead, is accurate in identifying the time frame during which a performance issue occurs, is indeed useful and assists an operation and maintenance manager in recovering the service capability of SaaS software.
HSACMA: a hierarchical scalable adaptive cloud monitoring architecture
HSACMA, a hierarchical scalable adaptive monitoring architecture that achieves the scalability of the monitoring based on microservices and adaptively adjusts the monitoring interval and data transmission strategy according to the running state of the cloud computing system, is proposed.
SaaS software performance issue diagnosis using independent component analysis and restricted Boltzmann machine
This article proposed a deep performance issue classification method based on Independent Component Analysis (ICA) and Restricted Boltzmann Machine (RBM) that is superior to the classical shallow classification algorithm, and it also meet the efficiency requirement.


A framework for analyzing software system log files
This dissertation investigates, identifies and develops components needed for an adaptable end-to-end framework for the analysis of logs, and proposes an empirical approach, in addition to the heuristic approach, to maintain the speed but improve the accuracy.
UiLog: Improving Log-Based Fault Diagnosis by Log Analysis
A management and analysis system of log information which can assist system administrators to understand the real-time status of the entire system, classify logs into different fault types, and determine the root cause of the faults is implemented.
Model-Based Software Performance Analysis
This book provides the cross-knowledge that allows developers to tackle software performance issues from the very early phases of software development and explains the basic concepts of performance analysis and describes the most representative methodologies to annotate and transform software models into performance models.
Fault detection for cloud computing systems with correlation analysis
This paper proposes an online incremental clustering method to recognize access behavior patterns, and uses CCA to model the correlation between workloads and the metrics of application performance/resource utilization in a specific access behavior pattern.
ASDF: An Automated, Online Framework for Diagnosing Performance Problems
This work automates problem localization using ASDF, an online diagnostic framework that transparently monitors and analyzes different time-varying data sources and narrows down performance problems to a specific node or a set of nodes in Hadoop.
Detecting large-scale system problems by mining console logs
This work first parse console logs by combining source code analysis with information retrieval to create composite features, and then analyzes these features using machine learning to detect operational problems to automatically detect system runtime problems.
LogMaster: Mining Event Correlations in Logs of Large-Scale Cluster Systems
The experimental results show that the proposed method can predict failures with a high precision and an acceptable recall rates, and an ECGs-based algorithm for fast predicting events is presented.
Cloud monitoring: A survey
X-ray: Automating Root-Cause Diagnosis of Performance Anomalies in Production Software
X-ray is a tool that implements performance summarization, a technique for automatically diagnosing the root causes of performance problems and shows that X-ray accurately diagnoses 17 performance issues in Apache, lighttpd, Postfix, and PostgreSQL, while adding 2.3% average runtime overhead.
A Scalable, Non-Parametric Method for Detecting Performance Anomaly in Large Scale Computing
  • Li Yu, Z. Lan
  • Computer Science
    IEEE Transactions on Parallel and Distributed Systems
  • 2016
The design is generic for anomaly detection in a variety of parallel and distributed systems exhibiting peer-comparable property and adopts a divide-and-conquer approach to address the scalability challenge and explores the use of non-parametric clustering and two-phase majority voting to improve detection flexibility and accuracy.