Data Mining and Data Pre-processing for Big Data

Ashish R. Jagdale, Kavita Sonawane, and Shamsuddin Sultan Khan
Big Data is a term used to describe the massive amounts of data generated from digital sources or the internet, usually characterized by the 3 V's: Volume, Velocity, and Variety. Over the past few years, data has grown exponentially due to the use of connected devices such as smartphones, tablets, laptops, and desktop computers. Moreover, e-commerce (also known as the online market), internet services, and social networking sites are generating tremendous amounts of user data in the form of documents…
