TLabel: A New OLAP Aggregation Operator in Text Cubes

  title={TLabel: A New OLAP Aggregation Operator in Text Cubes},
  author={Lamia Oukid and Omar Boussa{\"i}d and Nadjia Benblidia and Fadila Bentayeb},
  journal={Int. J. Data Warehous. Min.},
Data Warehousing technologies and On-Line Analytical Processing OLAP feature a wide range of techniques for the analysis of structured data. However, these techniques are inadequate when it comes to analyzing textual data. Indeed, classical aggregation operators have earned their spurs in the online analysis of numerical data, but are unsuitable for the analysis of textual data. To alleviate this shortcoming, on-line analytical processing in text cubes requires new analysis operators adapted to… 

The percentage cube

Eris: Measuring discord among multidimensional data sources

A set of algebraic operators are defined to describe the alignments, together with two alternative relational implementations that reduce the problem to linear or quadratic programming, and experimental results show that discordancy measurement can be performed efficiently in realistic situations.

Measuring Discord Among Multidimensional Data Sources

The discord measurement problem is defined, in which given a set of uncertain raw observations or aggregate results and information on the alignment of different data, whether the different sources are concordant, or if not, how discordant they are is evaluated.

Modélisation de la dynamique des territoires : Méta-données et lacs de données dédiés à l'information spatiale

La solution conceptuelle proposée s’adosse à la norme ISO 19115 pour décrire des méta-données spatiales qui est étendue dans le cadre des lacs de donnée.

30 Years Business Intelligence: FromData Analytics to Big Data

  • Isabelle Linden
  • Computer Science
    Integrated Series in Information Systems
  • 2021



Contextualized Text OLAP Based on Information Retrieval

This paper proposes a query expansion method based on a decision-maker profile which evaluates the proposed aggregation operator in different cases using several data analysis queries and shows that the precision of the system is significantly better than that of a Text OLAP system based on classical IR.

Text Cube: Computing IR Measures for Multidimensional Text Database Analysis

This paper proposes a text-cube model on multidimensional text database and conducts systematic studies on efficient text-Cube implementation, OLAP execution and query processing and shows the high promise of the methods.

CXT-cube: contextual text cube model and aggregation operator for text OLAP

A contextual text cube model denoted CXT-Cube is proposed which considers several contextual factors during the OLAP analysis in order to better consider the contextual information associated with textual data.

Topic modeling for OLAP on multidimensional text databases: topic cube and its applications

A new data model called topic cube is studied to combine OLAP with probabilistic topic modeling and enable OLAP on the dimension of text data in a multidimensional text database and proposes two heuristic aggregations to speed up the iterative Expectation‐Maximization (EM) algorithm for estimating topic models.

Toward total business intelligence incorporating structured and unstructured data

The proposed architecture, which integrates information retrieval, text mining, and information extraction technologies all together as well as relational OLAP technologies, are expected to make an effective platform toward total business intelligence.

iNextCube: Information Network-Enhanced Text Cube

The power of iNextCube is shown in the search and analysis of two multidimensional text databases: (i) a DBLP-based CS bibliographic database, and (ii) an online news database.

MiTexCube: MicroTextCluster Cube for online analysis of text cells and its applications

Experimental results on real multidimensional text databases show that Mi TexCube can be materialized efficiently with reasonable overhead in space, and applications based on the proposed materialized MiTexCube are more efficient than the baseline method of direct analysis based on document units in each cell.

A Conceptual Model for Multidimensional Analysis of Documents

This paper introduces an OLAP multidimensional conceptual model without facts, based on the unique concept of dimensions and adapted for multiddimensional document analysis, and provides a set of manipulation operations.