A Multi-dimensional Analysis and Data Cube for Unstructured Text and Social Media

  • Suan Lee, Namsoo Kim, Jinho Kim
  • Published 3 December 2014
  • Computer Science
  • 2014 IEEE Fourth International Conference on Big Data and Cloud Computing
Recently, unstructured data such as texts, documents, or SNS messages has been increasingly used in many applications, rather than structured data consisting of simple numbers or characters. Thus it has become more important to analyze unstructured text data to extract valuable information for users' decision making. As with OLAP (On-Line Analytical Processing) analysis over structured data, multi-dimensional analysis of such unstructured data is increasingly in demand. To facilitate these… 
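To make the OLAP analogy concrete, the following is a minimal sketch of a data cube over text. It assumes each message carries two hypothetical dimensions (region and month) and that the measure is a simple term-frequency count. The names `MESSAGES`, `build_text_cube`, and the `ALL` marker are illustrative assumptions; this is not the authors' method, whose details are not given in this excerpt.

```python
from collections import Counter
from itertools import product

ALL = "*"  # hypothetical marker for a rolled-up (aggregated) dimension

# Hypothetical sample data: (region, month, text)
MESSAGES = [
    ("Seoul", "2014-11", "cloud data cube"),
    ("Seoul", "2014-12", "text cube analysis"),
    ("Busan", "2014-12", "cloud text mining"),
]


def build_text_cube(messages):
    """Return {(region, month): Counter of terms} for every cube cell."""
    cube = {}
    for region, month, text in messages:
        terms = Counter(text.lower().split())
        # Each message contributes to 2^2 cells: every combination of
        # keeping a dimension value or rolling it up to ALL.
        for cell in product((region, ALL), (month, ALL)):
            cube.setdefault(cell, Counter()).update(terms)
    return cube


cube = build_text_cube(MESSAGES)
print(cube[("Seoul", ALL)]["cube"])    # term count rolled up over month
print(cube[(ALL, "2014-12")]["text"])  # term count rolled up over region
print(cube[(ALL, ALL)]["cloud"])       # apex cell: all messages
```

Materializing every cell, as done here, is only practical for toy inputs; with d dimensions each message touches 2^d cells, which is why efficient cube computation is a research topic in its own right.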
Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications.
A protocol for a cloud-based environment supporting the end-to-end phrase-mining and analyses platform CaseOLAP, which successfully quantifies user-defined phrase-category relationships through the analysis of textual data.
The framework needed to analyze this large amount of data must support statistical analysis and data mining, so that big data and traditional data can be combined and the results of analyzing new data together with old data can be merged.
Application of Big Data Analysis and Cloud Computing in Network Platform Building
  • Jichao Sun, Tao Liu, Yong Wang, Ping Zhang, Dan Yang
  • Physics
    Journal of Physics: Conference Series
  • 2021
With the advent of the cloud era, big data has attracted more and more attention. At present, due to the expansion of the scale of chain educational training institutions and the complexity of
Retrieval of Ontological Knowledge from Unstructured Text
The process of automatically forming an ontology from unstructured text data is examined with the aim of improving the domain ontology; an appropriate machine learning classifier still needs to be investigated for feature classification.
Text mining in manufacturing process using unsupervised techniques of Machine learning
The focus is on different unsupervised machine learning techniques for text mining, used with historical data to improve the consistency of manufacturing processes, predict equipment failures, design manufacturing equipment, and discover new technologies.
National Seminar On Smart Materials: Energy and Environment for Smart Cities
Wireless sensor networks (WSNs) are a combination of distributed independent devices that use sensors to cooperatively monitor environmental or physical conditions such as pressure, temperature, voice,
OLAP Textual com Múltiplas Hierarquias de Tópicos e Rankings Segmentados (Textual OLAP with Multiple Topic Hierarchies and Segmented Rankings)
This article presents an approach to textual OLAP, called DTCubing, that builds multiple topic hierarchies for each cube cell and aims to contribute to the presentation of multidimensional query results.


Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases
A new data model called topic cube is proposed that combines OLAP with probabilistic topic modeling and enables OLAP on the text dimension of a multidimensional text database; a heuristic method is also proposed to speed up the iterative EM algorithm used to estimate topic models.
iNextCube: Information Network-Enhanced Text Cube
The power of iNextCube is shown in the search and analysis of two multidimensional text databases: (i) a DBLP-based CS bibliographic database, and (ii) an online news database.
Text Cube: Computing IR Measures for Multidimensional Text Database Analysis
This paper proposes a text-cube model for multidimensional text databases, conducts systematic studies of efficient text-cube implementation, OLAP execution, and query processing, and shows the high promise of the methods.
TopCells: Keyword-based search of top-k aggregated documents in text cube
This paper aims to support keyword search in a data cube with text-rich dimension(s) (so-called text cube) by proposing a relevance scoring model and efficient ranking algorithms.
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals
This paper explains the cube and roll-up operators, shows how they fit in SQL, explains how users can define new aggregate functions for cubes, and discusses efficient techniques to compute the cube.
Introduction to information retrieval
This groundbreaking new textbook teaches web-era information retrieval, including web search and the related areas of text classification and text clustering, starting from basic concepts and taking a computer science perspective; it is written by three leading experts in the field.
Information diffusion in online social networks: a survey
A survey of representative methods dealing with information diffusion in social networks is presented, along with a taxonomy that summarizes the state of the art, intended to help researchers quickly understand existing work and possible improvements.
Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive
Three runs were submitted: medium (title and description), short (title only) and a run which was a combination of a long run (title, description and narrative) with the medium and short runs, which led to the discovery that due to a mistake in the indexing procedures part of the LA Times documents had been indexed.
Okapi at TREC-7: automatic ad hoc, filtering, VLC and interactive track
Three runs were submitted: medium (title and description), short (title only) and a run which was a combination of a long run (title, description and narrative) with the medium and short runs. The
Red Opal: product-feature scoring from reviews
A new search system called Red Opal is presented that enables users to locate products rapidly based on features and that determines which products to show when a user specifies a desired product feature.