Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 219,937,495 papers from all fields of science
Search
Sign In
Create Free Account
Apache Spark
Known as:
Resilient Distributed Datasets
, Resilient Distributed Dataset
, Spark (cluster computing framework)
Expand
Apache Spark is an open source cluster computing framework. Originally developed at the University of California, Berkeley's AMPLab, the Spark…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
50 relations
Apache Avro
Apache Flume
Apache Giraph
Apache Hadoop
Expand
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2018
2018
Visible-near infrared spectrum-based classification of apple chilling injury on cloud computing platform
Ji’an Xia
,
Yuwang Yang
,
H. Cao
,
Chen Han
,
D. Ge
,
Wenyu Zhang
Computers and Electronics in Agriculture
2018
Corpus ID: 3273293
2018
2018
Enhancing copper infiltration into alumina using spark plasma sintering to achieve high performance Al2O3/Cu composites
Yingge Shi
,
Wenge Chen
,
Longlong Dong
,
Hanyan Li
,
Y. Fu
2018
Corpus ID: 56394627
2017
2017
A Big Data Analysis Framework Using Apache Spark and Deep Learning
Anand Gupta
,
H. Thakur
,
Ritvik Shrivastava
,
Pulkit Kumar
,
Sreyashi Nag
IEEE International Conference on Data Mining…
2017
Corpus ID: 447043
With the spreading prevalence of Big Data, many advances have recently been made in this field. Frameworks such as Apache Hadoop…
Expand
Highly Cited
2016
Highly Cited
2016
Fuzzy Based Scalable Clustering Algorithms for Handling Big Data Using Apache Spark
Neha Bharill
,
Aruna Tiwari
,
A. Malviya
IEEE Transactions on Big Data
2016
Corpus ID: 11315673
A huge amount of digital data containing useful information, called Big Data, is generated everyday. To mine such useful…
Expand
Highly Cited
2016
Highly Cited
2016
Large-scale detection of non-technical losses in imbalanced data sets
P. Glauner
,
Andre Boechat
,
+4 authors
Diogo Duarte
IEEE PES Innovative Smart Grid Technologies…
2016
Corpus ID: 8917127
Non-technical losses (NTL) such as electricity theft cause significant harm to our economies, as in some countries they may range…
Expand
2016
2016
Approximate Parallel High Utility Itemset Mining
Yan Chen
,
Aijun An
Big Data Research
2016
Corpus ID: 29956517
Review
2015
Review
2015
Large Scale Distributed Data Science using Apache Spark
J. Shanahan
,
Liang Dai
Knowledge Discovery and Data Mining
2015
Corpus ID: 36839247
Apache Spark is an open-source cluster computing framework for big data processing. It has emerged as the next generation big…
Expand
Highly Cited
2015
Highly Cited
2015
Scaling Machine Learning for Target Prediction in Drug Discovery using Apache Spark
Dries Harnie
,
A. Vapirev
,
+4 authors
W. Meuter
15th IEEE/ACM International Symposium on Cluster…
2015
Corpus ID: 10620941
In the context of drug discovery, a key problem is the identification of candidate molecules that affect proteins associated with…
Expand
2011
2011
Thermoelectric Properties of Fine-Grained PbTe Bulk Materials Fabricated by Cryomilling and Spark Plasma Sintering
C. Kuo
,
H. S. Chien
,
C. Hwang
,
Y. Chou
,
M. Jeng
,
M. Yoshimura
2011
Corpus ID: 73654801
Dense fine-grained PbTe bulk materials without oxide phases are fabricated using a process that combines cryomilling (mechanical…
Expand
Highly Cited
2007
Highly Cited
2007
Application of spark plasma sintering to the fabrication of binary and ternary skutterudites
C. Recknagel
,
N. Reinfried
,
+4 authors
A. Leithe-Jasper
2007
Corpus ID: 55318508
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE