• Corpus ID: 14680997

A Toolkit for Scalable Spreadsheet Visualization

  title={A Toolkit for Scalable Spreadsheet Visualization},
  author={Markus Clermont},
This paper presents a toolkit for spreadsheet visualization based on logical areas, semantic classes and data modules. Logical areas, semantic classes and data modules are abstract representations of spreadsheet programs that are meant to reduce the auditing and comprehension effort, especially for large and regular spreadsheets. The toolkit is integrated as a plug-in in the Gnumeric spreadsheet system for Linux. It can process large, industry scale spreadsheet programs in reasonable time and… 

Figures from this paper

Automated model-based spreadsheet debugging
Several new algorithmic enhancements of the general MBD approach are combined in this thesis to allow spreadsheet users to debug their spreadsheets and to efficiently find the reason of the observed unexpected output values.
A methodology for constructing the calculation model of scientific spreadsheets
A methodology for semi-automatically deriving the calculation workflow underlying a set of spreadsheets, and constructed calculation models approximate the ground truth calculation workflows, both in terms of content and size, but are not a perfect match.
Semi-structured data extraction and modelling: the WIA Project
The WIA-algorithm shows how to provide a description of spreadsheet contents in terms of higher level of abstractions or conceptualisations to provide spreadsheets with abstract representations useful for further model refinements and optimizations through evolutionary algorithms computations.
CUSTODES: Automatic Spreadsheet Cell Clustering and Smell Detection Using Strong and Weak Features
CUSTODES is proposed to effectively cluster spreadsheet cells and detect smells in these clusters using strong and weak features and successfully detected harmful smells that can induce computation anomalies in spreadsheets with an F-measure of 0.72, outperforming state-of-the-art techniques.
Excelsior: Bringing the Benefits of Modularisation to Excel
Excelsior is presented, a system for bringing modular design features to Excel that would save time, avoid unneeded programming, make mistakes less likely, make code-control easier, help organisations adopt a uniform house style, and open business opportunities in buying and selling spreadsheet modules.
A Review of Spreadsheet Error Reduction Techniques
This paper presents a meta-analyses of the EMMARM, a large-scale probabilistic study of the determinants of infectious disease in eight operation rooms of the immune system and its consequences.
Requirements for the domain model of environmental computational spreadsheets
Computer Science Network Institute VU University Amsterdam The Netherlands martine.de.vos@vu.nl Jan Wielemaker Computer Science Network Institute VU University Amsterdam The Netherlands
Intelligent Voice Navigation of Spreadsheets: An Empirical Evaluation
This paper outlines the iVoice system and details an experiment that was conducted to determine the efficiency of iVoice when compared to a leading voice recognition technology.
Intelligent Voice Navigation of Spreadsheets
This paper outlines the iVoice system and details an experiment that was conducted to determine the efficiency of iVoice when compared to a leading voice recognition technology.


Modeling Spreadsheet Audit: A Rigorous Approach to Automatic Visualization
A theoretical model of spreadsheets and a technique to describe spreadsheet auditing tools are presented and it is hoped that visualizations that point out parts ofspreadsheets that contain anomalies will be found.
Analyzing large spreadsheet programs
  • Markus Clermont
  • Computer Science
    10th Working Conference on Reverse Engineering, 2003. WCRE 2003. Proceedings.
  • 2003
An approach is introduced that enables the decomposing of large spreadsheets into self contained parts, that are similar to modules in conventional software, and which can be considered orthogonal to semanticclasses, that have been introduced in prior work.
Slicing spreadsheets: an integrated methodology for spreadsheet testing and debugging
This paper presents an integrated testing and debugging methodology for spreadsheets that is tightly integrated into the spreadsheet environment and provides an interface to the methodology that does not require an understanding of testing and debug theory, and that takes advantage of the immediate visual feedback that is characteristic of the spreadsheet paradigm.
Toward a logical/physical theory of spreadsheet modeling
This work identifies four principal components that characterize any spread sheet model: schema, data, editorial, and binding and presents a factoring algorithm for identifying and extracting these components from conventional spreadsheets with minimal user intervention, and a synthesis algorithm that assists users in the construction of executable spreadsheets from reusable model components.
Using Layout Information for Spreadsheet Visualization
This paper extends a spreadsheet visualization technique by using layout information. The original approach identifies logically or semantically related cells by relying exclusively on the content of
A Spreadsheet Auditing Tool Evaluated in an Industrial Context
The evaluated auditing tool offers the auditor a new view on the formula structure of the spreadsheet by grouping similar formulas into equivalence classes, and the auditing approach helps to find irregularities in the geometrical pattern of similar formulas.
Detecting Errors in Spreadsheets
The paper presents two complementary strategies for identifying errors in spreadsheet programs, grounded on the assumption that spreadsheets are software, albeit of a different nature than conventional procedural software.
What we know about spreadsheet errors
Although spreadsheet programs are used for small "scratchpad" applications, they are also used to develop many large applications, and error rates are in line with those in programming and other human cognitive domains.
Quality control in spreadsheets: a software engineering-based approach to spreadsheet development
The principles and techniques of tree-based formula representation, unique definition of spreadsheet elements, and separation of data and operations are among the main features explored.
An open graph visualization system and its applications to software engineering
A package of practical tools and libraries for manipulating graphs and their drawings that includes stream and event interfaces for graph operations, high‐quality static and dynamic layout algorithms, and the ability to handle sizeable graphs is described.