Apache Hadoop
Known as: HDFS, Hadoop YARN, Hadoop Distributed Filesystem
Apache Hadoop (pronunciation: /həˈduːp/) is an open-source software framework for distributed storage and distributed processing of very large data…
Wikipedia
Related topics
50 relations
Amazon Elastic Compute Cloud (EC2)
Apache Flume
Apache Giraph
Apache Gora
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Highly Cited
2016
An efficient divide-and-conquer approach for big data analytics in machine-to-machine communication
Awais Ahmad, Anand Paul, M. M. Rathore
Neurocomputing
2016
Corpus ID: 7429790
Review
2015
Data Algorithms: Recipes for Scaling Up with Hadoop and Spark
2015
Corpus ID: 64185647
If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step…
Highly Cited
2014
COSHH: A classification and optimization based scheduler for heterogeneous Hadoop systems
Aysan Rasooli Oskooei, D. Down
Future Generation Computer Systems
2014
Corpus ID: 2614264
Highly Cited
2012
A Parallel Genetic Algorithm Based on Hadoop MapReduce for the Automatic Generation of JUnit Test Suites
Linda Di Geronimo, F. Ferrucci, Alfonso Murolo, Federica Sarro
IEEE Fifth International Conference on Software…
2012
Corpus ID: 9314009
Software testing represents one of the most explored fields of application of Search-Based techniques and a range of testing…
Highly Cited
2012
Shared disk big data analytics with Apache Hadoop
Anirban Mukherjee, J. Datta, Raghavendra Jorapur, Ravi Singhvi, S. Haloi, W. Akram
International Conference on High Performance…
2012
Corpus ID: 18511020
Big Data is a term applied to data sets whose size is beyond the ability of traditional software technologies to capture, store…
Highly Cited
2012
Extending Map-Reduce for Efficient Predicate-Based Sampling
Raman Grover, M. Carey
IEEE International Conference on Data Engineering
2012
Corpus ID: 6736304
In this paper we address the problem of using MapReduce to sample a massive data set in order to produce a fixed-size sample…
Highly Cited
2011
Play It Again, SimMR!
Abhishek Verma, L. Cherkasova, R. Campbell
IEEE International Conference on Cluster…
2011
Corpus ID: 13893770
A typical MapReduce cluster is shared among different users and multiple applications. A challenging problem in such shared…
Highly Cited
2011
Comparing High Level MapReduce Query Languages
Robert J. Stewart, P. Trinder, Hans-Wolfgang Loidl
Advanced Parallel Programming Technologies
2011
Corpus ID: 829167
The MapReduce parallel computational model is of increasing importance. A number of High Level Query Languages (HLQLs) have been…
Highly Cited
2009
CloudWF: A Computational Workflow System for Clouds Based on Hadoop
Chen Zhang, H. Sterck
International Conference on Cloud Computing
2009
Corpus ID: 1466437
This paper describes CloudWF, a scalable and lightweight computational workflow system for clouds on top of Hadoop. CloudWF can…
Highly Cited
2001
Color and Number Counts
P. Saracco, E. Giallongo, +5 authors, E. Vanzella
2001
Corpus ID: 16018505
We present near-IR (J and Ks) number counts and colors of galaxies detected in deep VLT-ISAAC images centered on the Chandra Deep…