Big data: the end of the scientific method?

  title={Big data: the end of the scientific method?},
  author={Sauro Succi and Peter V. Coveney},
  journal={Philosophical transactions. Series A, Mathematical, physical, and engineering sciences},
  • S. Succi, P. Coveney
  • Published 25 July 2018
  • Philosophy
  • Philosophical transactions. Series A, Mathematical, physical, and engineering sciences
For it is not the abundance of knowledge, but the interior feeling and taste of things, which is accustomed to satisfy the desire of the soul. (Saint Ignatius of Loyola). We argue that the boldest claims of big data (BD) are in need of revision and toning-down, in view of a few basic lessons learned from the science of complex systems. We point out that, once the most extravagant claims of BD are properly discarded, a synergistic merging of BD with big theory offers considerable potential to… 

Figures from this paper

Perspective: Dimensions of the scientific method
  • E. Voit
  • Education
    PLoS Comput. Biol.
  • 2019
It is proffered here that hypothesis-driven, data-mining–inspired, and “allochthonous” knowledge acquisition, based on mathematical and computational models, are vectors spanning a 3D space of an expanded scientific method.
Zero Sales Resistance: The Dark Side of Big Data and Artificial Intelligence
The role of BD and AI in steering the world population toward the state of Zero Sales Resistance (ZSR): the inability to exert critical judgment over the most seductive aspects of the aforementioned data deluge is analyzed.
  • Computer Science
  • 2019
This editorial clarifies the logic of big data from a research perspective, arguing that management researchers may enrich the perspective in some important ways, and outlines research opportunities that can leverage the strengths ofbig data and management scholarship to mutual advantage.
Is Big Digital Data Different? Towards a New Archaeological Paradigm
The nature of digital data and its use within a Big Data environment is sought as a prerequisite to rational and appropriate digital data analysis in archaeology, and a means towards developing a more reflexive, contextual approach to Big Data.
Big Data in Biodiversity Science: A Framework for Engagement
Despite best efforts, the loss of biodiversity has continued at a pace that constitutes a major threat to the efficient functioning of ecosystems. Curbing the loss of biodiversity and assessing its
Will Big Data and personalized medicine do the gender dimension justice?
The struggle to represent exhaustively the uniqueness of the patient through Big Data is highlighted prior to a deeper investigation of the digital representation of gender in personalized medicine.
From Digital Archaeology to Data-Centric Archaeological Research
Since the end of the 20th century the widespread use of digital applications in archaeology has legitimized their inclusion in the archaeological toolbox. Together with archaeological sciences,
Multiscale modelling, simulation and computing: from the desktop to the exascale
This theme issue provides a state-of-the-art account of generic multiscale computing, as well as exciting examples of applications of such concepts in domains ranging from astrophysics, via material science and fusion, to biomedical sciences.
The Smart Forest Conundrum: Contextualizing Pitfalls of Sensors and AI in Conservation Science for Tropical Forests
The term ‘smart forest’ is not yet common, but the proliferation of sensors, algorithms, and technocentric thinking in conservation, as in most other aspects of our lives, suggests we are at the
Application of Systems Engineering Principles and Techniques in Biological Big Data Analytics: A Review
This review focuses on the following three key areas in biological big data analytics where systems engineering principles and techniques have been playing important roles: the principle of parsimony in addressing overfitting, the dynamic analysis of biological data, and the role of domain knowledge in biological data analytics.


Big data need big theory too
It is argued that it is vital to use theory as a guide to experimental design for maximal efficiency of data collection and to produce reliable predictive models and conceptual knowledge in multiscale modelling.
The Deluge of Spurious Correlations in Big Data
It is proved that very large databases have to contain arbitrary correlations, and can be found in “randomly” generated, large enough databases, whichimplies that most correlations are spurious.
A Trio of Inference Problems That Could Win You a Nobel Prize in Statistics (If You Help Fund It)
All of these challenges call for an expanded paradigm with greater emphases on qualitative consistency and relative optimality than do the current inference paradigms.
The signal and the noise : why so many predictions fail but some don't
Nate Silver emphasizes the need of precise wording of predictions, like “this person has a 85% probability of winning the election, with an error range of ±2%”.
The Black Swan: The Impact of the Highly Improbable
Nassim Nicholas Taleb's phenomenal international bestseller The Black Swan: The Impact of the Highly Improbable shows us how to stop trying to predict everything - and take advantage of uncertainty.
Social Physics: How Good Ideas Spread - the Lessons from a New Science
SOCIAL PHYSICS: HOW GOOD IDEAS SPREAD - THE LESSONS FROM A NEW SCIENCE Alex Pentland New York, Penguin Press, 2014,300 pp. ISBN: 978-1-59420-565-1This book is about "mathematizing" the social
Fooled by Randomness: The Hidden Role of Chance in Life and in the Markets
If the prescriptions for getting rich that are outlined in books such as The Millionaire Next Door and Rich Dad Poor Dad are successful enough to make the books bestsellers, then one must ask, Why
Life 3.0: Being Human in the Age of Artificial Intelligence
New York Times Best Seller How will Artificial Intelligence affect crime, war, justice, jobs, society and our very sense of being human? The rise of AI has the potential to transform our future more
Frontiers of Complexity: The Search for Order in a Chaotic World, Peter Coveney and Roger Highfield. 1995. Random House, Inc., New York, NY. 480 pages. ISBN: 0-449-90832-1. $27.50
This comprehensive work investigates: the role of computers in the understanding of complexity; spontaneous emergence of order in the universe; how nature can solve problems that traditionally defeat
The Role of Multiscale Protein Dynamics in Antigen Presentation and T Lymphocyte Recognition
A mechanistic description of antigen presentation and TCR activation is advocated which is explanatory, predictive, and quantitative, drawing on modeling approaches that collectively span several length and time scales, being capable of furnishing reliable biological descriptions that are difficult for experimentalists to provide.