KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework
TLDR
The aim of this paper is to present three new aspects of KEEL: KEEL-dataset, a data set repository which includes the data set partitions in the KEEL format and shows some results of algorithms in these data sets; some guidelines for including new algorithms in KEEL, helping the researchers to make their methods easily accessible to other authors and to compare the results of many approaches already included within the KEEL software.
  • 1,600
  • 100
Managing by Values: A Corporate Guide to Living, Being Alive, and Making a Living in the 21st Century
Preface; Introduction; PART I: MANAGEMENT BY VALUES: LOGIC AND CONTENT; Managing by Values (MBV): Its Foundation and Evolution; Values: But, what Actually are they?; Renew or Die: The Importance of ...
  • 73
  • 6
Big data preprocessing: methods and prospects
TLDR
The massive growth in the scale of data observed in recent years is a key factor of the Big Data scenario.
  • 138
  • 4
Big Data: Tutorial and guidelines on information and process fusion for analytics algorithms with MapReduce
TLDR
We enumerate and analyze two alternative methodologies that may be found both in the specialized literature and in standard Machine Learning libraries for Big Data.
  • 88
  • 4
Multivariate Discretization Based on Evolutionary Cut Points Selection for Classification
TLDR
In this paper, we propose the use of evolutionary algorithms to select a subset of cut points that defines the best possible discretization scheme of a data set using a wrapper fitness function.
  • 35
  • 3
A survey of fingerprint classification Part II: Experimental analysis and ensemble proposal
TLDR
We reviewed the fingerprint classification literature from two different perspectives: feature extraction and classifier learning.
  • 44
  • 3
Diagnose Effective Evolutionary Prototype Selection Using an Overlapping Measure
TLDR
In this paper, we analyze the behavior of the evolutionary prototype selection strategy, considering a complexity measure for classification problems based on overlapping.
  • 22
  • 3
A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark
TLDR
This paper proposes a distributed discretization algorithm for Big Data analytics based on evolutionary optimization.
  • 20
  • 2
Data discretization: taxonomy and big data challenge
TLDR
Discretization of numerical data is one of the most influential data preprocessing tasks in knowledge discovery and data mining.
  • 82
  • 1
A comparison on scalability for batch big data processing on Apache Spark and Apache Flink
TLDR
We perform a comparative study of the scalability of batch data processing on two popular frameworks for processing and storing Big Data, Apache Spark and Apache Flink.
  • 45
  • 1