Opinion Mining of Twitter Data using Hadoop and Apache Pig

Anjali Barskar, Ajay Kumar Phulre
International Journal of Computer Applications
Twitter, one of the largest and best-known social media sites, receives millions of tweets every day on a variety of important topics. Once organized and processed, this large amount of raw data can be used for industrial, social, economic, government policy, or business purposes. Hadoop is one of the best tools for Twitter data analysis, as it works with distributed big data, streaming data, time-stamped data, text data, etc. This paper discusses how to use FLUME for…

Sentiment Analysis of Real Time Twitter Data Using Big Data Approach
This paper discusses how effectively sentiment analysis can be performed on tweets gathered from Twitter using a big data approach, with Apache Flume used to stream ongoing Twitter data into the Hadoop Distributed File System (HDFS).
Twitter data analysis using hadoop ecosystems and apache zeppelin
The location from which tweets are posted and the language in which they are written can be effectively analysed using Hadoop, a tool for analysing distributed big data, streaming data, time-stamped data and text data.
Use of Hadoop for Sentiment Analysis on Twitter’s Big Data
This research uses Hadoop to analyze the nature of a particular person on the basis of their behavior on social sites, and the results show sentiment analysis with good accuracy.
Twitter Data Analysis for Live Streaming by Using Flume Technology
This work analyzes tweets to find the trending word among a given set of words using big data, and makes use of Flume to extract real-time data from Twitter and store it in the Hadoop Distributed File System.
Twitter Based Capital Market Analysis Using Cloud Statistics
People in the modern-day world are attracted towards smartness and situational awareness, and want to be able to find out what is happening as soon as possible.
Real-time Twitter data analysis using Hadoop ecosystem
A method for finding recent trends in tweets and performing sentiment analysis on real-time tweets is proposed, and the authors conclude that Pig is more efficient than Hive because Pig takes less time to execute.
A new big data approach for topic classification and sentiment analysis of Twitter data
The proposed HL-NBC method for sentiment analysis improves sentiment classification, giving an accuracy of 82%, which is comparatively better than other methods, and achieving a 93% improvement in processing time for larger datasets.


Tweet Analysis: Twitter Data processing Using Apache Hadoop
A way of analyzing big data such as Twitter data using Apache Hadoop is provided, in which tweets are processed and analyzed on a Hadoop cluster and the results are visualized as pictorial representations of Twitter users and their tweets.
Sentiment Analysis of Twitter Data Using Hadoop
This project proposes to analyse the sentiments of Twitter users through their tweets in order to extract what they think, using Hadoop to process the huge amount of data faster on a Hadoop cluster.
Big-SoSA:Social Sentiment Analysis and Data Visualization on Big Data
A method of sentiment analysis on Twitter is proposed using Hadoop and its ecosystem, in which the large volume of data is processed on a Hadoop cluster and a MapReduce function performs the sentiment analysis.
Big Data Sentiment Analysis using Hadoop
The main focus of the research was to find a technique that can efficiently perform sentiment analysis on big data sets using Hadoop; the performance of the technique was measured in terms of speed and accuracy.
A Survey on Twitter Data Analysis Techniques to Extract Public Opinion
Various Twitter data analysis techniques, both dictionary-based and machine-learning-based, are discussed.
TSentiment: On gamifying Twitter sentiment analysis
This paper considers a gamification approach to classifying the sentiment of tweets and proposes TSentiment, a game with a purpose that uses human players to classify the polarity of tweets and their sentiment. The results obtained showed that the game approach was well accepted.
Addressing big data problem using Hadoop and Map Reduce
The size of the databases used in today's enterprises has been growing at exponential rates day by day. Simultaneously, the need to process and analyze the large volumes of data for business decision
Sentiment Analysis and Opinion Mining: A Survey
A survey is presented that covers opinion mining, sentiment analysis, techniques, tools and classification, as well as the polarity of extracted public opinions.
Big data and advanced analytics tools
  • R. Chawda, G. Thakur
  • Computer Science
    2016 Symposium on Colossal Data Analysis and Networking (CDAN)
  • 2016
The study deals with the 5 V's definition of big data, analysis requirements, tools, frameworks, the different types of cloud-based big data analytics tools provided by different companies, and the functioning of the Hadoop or MapReduce process.
Nadagoud, Mr. Kotresh Naik.D, “Market Sentiment Analysis for Popularity of Flipkart
  • Journal of Advanced Research in Computer Engineering & Technology (IJARCET),
  • 2015