We present a method for automatically extracting from a running system an indexable signature that distills the essential characteristic from a system state and that can be subjected to automated clustering and similarity-based retrieval to identify when an observed system state is similar to a previously observed state. This allows operators to ...
Violations of service-level objectives (SLOs) in Internet services are urgent conditions requiring immediate attention. Previously we explored (I. Cohen et al., 2004) an approach for identifying which low-level system properties were correlated with high-level SLO violations (the metric attribution problem). The approach is based on automatically inducing ...
This paper demonstrates that the dependability of generic, evolving J2EE applications can be enhanced through a combination of a few recovery-oriented techniques. Our goal is to reduce downtime by automatically and efficiently recovering from a broad class of transient software failures without having to modify applications. We describe here the integration ...
Most modern systems generate abundant and diverse log data. With dwindling storage costs, there are fewer reasons to summarize or discard data. However, the lack of tools to efficiently store and cross-correlate heterogeneous datasets makes it tedious to mine the data for analytic insights. In this paper, we present Splunk, a semistructured time series ...
Recent research activity [2, 12, 27, 10, 1] has shown encouraging results for performance debugging and failure diagnosis and detection in systems by using approaches based on automatically inducing models and deriving correlations from observed data. This paper explores research questions and preliminary results regarding the next steps in advancing this ...
The Java virtual machine (JVM) provides a virtualized execution environment for Java applications. To maintain the safety and efficiency of the execution environment, the JVM employs several major runtime subsystems such as garbage collection, just-in-time compilation, and lock management. Recent work found that the runtime overheads increase significantly when workload ...
Josee Laganiere (1,*), Adrian P. Kells (2,*), Jeffrey T. Lai (1), Dmitry Guschin (1), David E. Paschon (1), Xiangdong Meng (1), Lauren K. Fong (1), Qi Yu (1), Edward J. Rebar (1), Philip D. Gregory (1), Krystof S. Bankiewicz (2), John Forsayeth (2), and H. Steve Zhang (1). (1) Sangamo BioSciences, Point Richmond Tech Center, Richmond, California 94804; (2) Department of Neurological Surgery, ...