Managing Lifecycle of Big Data Applications

  title={Managing Lifecycle of Big Data Applications},
  author={Ivan Ermilov and Axel-Cyrille Ngonga Ngomo and Aad Versteden and Hajira Jabeen and Gezim Sejdiu and Giorgos Argyriou and Luigi Selmi and J{\"u}rgen Jakobitsch and Jens Lehmann},
The growing digitization and networking process within our society has a large influence on all aspects of everyday life. Large amounts of data are being produced continuously, and when these are analyzed and interlinked they have the potential to create new knowledge and intelligent solutions for economy and society. To process this data, we developed the Big Data Integrator (BDI) Platform with various Big Data components available out-of-the-box. The integration of the components inside the… 
Examining the Interplay Between Big Data and Microservices - A Bibliometric Review
This bibliometric review provides an analysis of the literature, using the scientific database Scopus and its provided tools for search and analytics, and discusses avenues for future research.
Efficient Distributed In-Memory Processing of RDF Datasets
A novel approach for statistical calculations of large RDF datasets, which scales out to clusters of machines and the first distributed in-memory approach for computing 32 different statistical criteria for RDF dataset using Apache Spark is described.
Implementation of an Intelligent Model based on Big Data and Decision Making using Fuzzy Logic Type-2 for the Car Assembly Industry in an Industrial Estate in Northern Mexico
The objective of the project is the implementation of a QMS that supports the correct management of the information and knowledge that the organization obtains and generates in order to value and improve the services received by the citizen, reducing both wage costs and staff training.
GeoSensor: On-line Scalable Change and Event Detection over Big Data
Through the presented demonstration, both the effectiveness and the efficiency of GeoSensor's functionalities are highlighted.


The BigDataEurope Platform - Supporting the Variety Dimension of Big Data
The BDE platform was designed based upon the requirements gathered from seven of the societal challenges put forward by the European Commission in the Horizon 2020 programme and targeted by the BigDataEurope pilots, which allows to perform a variety of Big Data flow tasks like message passing, storage, analysis or publishing.
An extended IoT framework with semantics, big data, and analytics
This paper proposes a combined framework that brings Big Data, IoT, and semantic web together to build an augmented framework for this new era and provides a realistic use case that demonstrates how the model can implement the desired functionality and achieve the goals of such a model.
Model-driven deployment and management of workflows on analytics frameworks
The paper explores where the state-of-the-art falls short in meeting the operational challenges behind deploying and managing workflows on top of analytics platforms, proposes an architecture to solve the open challenges, and implements and evaluates this architecture.
KDD meets Big Data
  • N. Grady
  • Computer Science
    2016 IEEE International Conference on Big Data (Big Data)
  • 2016
Data Science Edge (DSE) is an enhanced process model to accommodate big data technologies and data science activities and promotes the use of a new term, Knowledge Discovery in Data Science (KDDS) as a call for the community to develop a new industry standard data science process model.
Semantic Web Technologies and Big Data Infrastructures: SPARQL Federated Querying of Heterogeneous Big Data Stores
An extension of the Semagrow federated SPARQL query processor that is able to seamlessly federatedSPARQL endpoints, Cassandra databases, and Solr databases is presented, and future directions of this line of work are discussed.
On-demand data analytics in HPC environments at leadership computing facilities: Challenges and experiences
This paper proposes an on-demand Spark service that mitigates difficulties, allowing facility users to flexibly create Spark instances quickly and easily, and defines a systematic approach for creating these Spark instances and validate that optimal performance benefits are maintained.
An architecture for the deployment of statistical models for the big data era
This paper presents the Model Deployment and Execution Framework (MDEF), to tackle challenges in response to the volume, velocity, and variety of big data.
State-of-the-art Web Applications using Microservices and Linked Data
This paper describes the current state of, a platform for building state-of-the-art web applications fuelled by Linked Data aware microservices. The platform assumes a mashup-like
A novel big-data processing framwork for healthcare applications: Big-data-healthcare-in-a-box
This work presents a novel approach of building a big-data framework that can be adapted to various healthcare applications with relative use, making this a one-stop “Big-Data-Healthcare-in-a-Box”.