A Review of Hadoop Ecosystem for BigData

@article{Nagdive2018ARO,
  title={A Review of Hadoop Ecosystem for BigData},
  author={Ashlesha S. Nagdive and R. M. Tugnayat},
  journal={International Journal of Computer Applications},
  year={2018},
  volume={180},
  pages={35-40}
}
  • A. NagdiveR. Tugnayat
  • Published 17 January 2018
  • Computer Science
  • International Journal of Computer Applications
This paper, describes Concept of Big Data which is collection of large data set that cannot be proceed by traditional computational techniques. Therefore Hadoop technology designed to process Big Data. Hadoop is the platform in businesses for Big Data processing. Hadoop is an open source, Java-based programming framework which supports the processing and storage of extremely large data sets in a distributed computing environment. It helps Big Data analytics by overcoming the difficulties that… 

Figures from this paper

BIG DATA AND PREDICTIVE ANALYTICS IN MANUFACTURING ENTERPRISES FOR ENHANCED DECISION MAKING

An overview of the benefits of big data analytics in manufacturing and a brief explanation of the Hadoop ecosystem is presented which helps to understand a suite of services available on the cloud to solve big data problems.

A Proposed Architecture for IoT Big Data Analysis in Smart Supply Chain Fields

This article presents an architecture for the analysis of data generated by IoT that allows the extraction of knowledge, focusing on the case of specific use of the “Smart Supply Chain fields”.

A Hadoop Framework Realization of Online Physical Education Practice System based on Large-Scale Data Analysis

  • M. Zhong
  • Computer Science
    2022 3rd International Conference on Electronics and Sustainable Communication Systems (ICESC)
  • 2022
A Hadoop framework realization of the physical education practice research based on large-scale data analysis and a high-performance MapReduce state fusion and task scheduling model is established by using open source coding and program loading methods.

A REAL-TIME DATA STREAM PROCESSING MODEL FOR A SMART TRAFFIC APPLICATION, LEVERAGING INTELLIGENT INTERNET OF THINGS (IOT) CONCEPTS

A real-time data stream processing model for a smart traffic application was proposed and data streaming trends were used to monitor traffic which enables people to know if the roads in an IoT environment are congested.

Hadoop Performance Tuning based on Parameter Optimization

The benchmark test method is used to verify the optimization of parameters and to optimize the performance of Hadoop, and the experimental results show that after the Hadoops parameters are adjusted and optimized, theHadoop performance tuning is achieved under certain conditions.

Fraud Prediction in Smart Supply Chains Using Machine Learning Techniques

In the domain of Big Data, the company’s supply chain has a very high-risk exposure and this must be observed from a preventive perspective, that is, act before such situations occur. As a company

Parallel training models of deep belief network using MapReduce for the classifications of emotions

  • Gaurav AgarwalH. Om
  • Computer Science
    International Journal of System Assurance Engineering and Management
  • 2021
Two parallel models for the training of the deep belief networks (DBNs) based on the map-reduce framework are presented and they significantly accelerates the training speed of DBNs and given significant results in class classification of the emotions.

References

SHOWING 1-5 OF 5 REFERENCES

Optimization of the search graph using Hadoop and Linux Operating System

A MapReduce based search algorithm was designed and processed on a selected Hadoop cluster and optimized by processing the BigData of different sizes and varying the swappiness which behave as a virtual memory of the system.

Development of multiple big data analysis platforms for business intelligence

Although the optimized platform selection can reduce the execution time for the data retrieval and analysis significantly, furthermore scheduling optimization definitely increases the system efficiency a lot and improves the system throughput significantly.

Applying big data analysis technique to students' learning behavior and learning resource recommendation in a MOOCs course

A big data analysis technique on learning portfolio is proposed to explore how the interaction, unknown correlations, and hidden patterns among learners on MOOCs and social media platforms starts, and recognize the using big data analytics transforming on students' learning behavior.

Performance evaluation of cluster algorithms for Big Data analysis on cloud

The experimental results show that with a cloud of 9 compute nodes, about 3.5 times of acceleration are attainable; hence Hadoop cloud with R can be applied to solving insufficient memory issues in big data applications.

Chiu-Nan Su5,"Applying Big Data Analysis Technique to Students’ Learning Behavior and Learning

  • Proceedings of the 2017 IEEE International Conference on Applied System Innovation IEEE-ICASI 2017
  • 2017