Data Science

@article{Cao2017DataS,
  title={Data Science},
  author={Longbing Cao},
  journal={ACM Computing Surveys (CSUR)},
  year={2017},
  volume={50},
  pages={1 - 42}
}
  • Longbing Cao
  • Published 29 June 2017
  • Computer Science
  • ACM Computing Surveys (CSUR)
The 21st century has ushered in the age of big data and data economy, in which data DNA, which carries important knowledge, insights, and potential, has become an intrinsic constituent of all data-based organisms. An appropriate understanding of data DNA and its organisms relies on the new field of data science and its keystone, analytics. Although it is widely debated whether big data is only hype and buzz, and data science is still in a very early phase, significant challenges and… 

Figures and Tables from this paper

Data Science: Profession and Education

  • Longbing Cao
  • Education, Computer Science
    IEEE Intelligent Systems
  • 2019
TLDR
This article intends to address the deviated status of existing professional and educational markets, the restricted benchmarldng and accreditation of responsibilities and capabilities of data scientists, and the urgent need of standardizing and upgrading competencies and maturity of data science qualifications and education.

Data Science is Here

TLDR
The chapter considers a new digital divide between the society and this small group of people that make sense out of the vast data and help the organization in informed decision making.

Introduction to Data Science

  • M. Govindarajan
  • Computer Science
    Advances in Data Mining and Database Management
  • 2021
TLDR
The purpose of this chapter is to provide emphasis on integration and synthesis of concepts, techniques, applications, and tools to deal with various facets of data science practice, including data collection and integration, exploratory data analysis, predictive modeling, descriptive modeling, data product creation, evaluation, and effective communication.

Data science in organizations: Conceptualizing its breakthroughs and blind spots

TLDR
It is concluded that data-scientific advances across the four domains, in aggregate, could constitute requisite variety for particular types of business problems and draws on Ashby’s Law of Requisite Variety as theoretical principle.

The Development of Data Science: Implications for Education, Employment, Research, and the Data Revolution for Sustainable Development

TLDR
The rapidly growing post-graduate university course provisioning for Data Science; a preliminary study of employability requirements; and how Data Science builds collaboratively on other domains, potentially with innovative methodologies and practice are covered.

Data science strategies leading to the development of data scientists' skills in organizations

TLDR
The paper offers an in-depth comparative analysis between companies with or without a data science strategy, and the key limitation is regarding the literature review as a consequence of the novelty of the theme.

Data objects for knowing

TLDR
This conceptual paper argues that technology-driven science created a need for more technology- driven science, culminating in data science, and adapts Hacking and Galison’s views on physics to argue that data science is also an experimental science, which uses data objects in experiments.

Data science education programmes in Middle Eastern institutions: A survey study

In response to the current trends in dealing with data in academia, various research institutions and commercial entities around the world are building new programmes to fill the gaps in workforce

Supporting open science in big data frameworks and data science education

TLDR
This work proposes and demonstrates methods to facilitate openness and reproducibility in data science investigations using a big data framework, and outlines some of the Functional Data support in ScalaTion.
...

References

SHOWING 1-10 OF 334 REFERENCES

Data science and analytics: a new era

  • Longbing Cao
  • Computer Science
    International Journal of Data Science and Analytics
  • 2016
TLDR
The field encompasses the larger areas of statistics, data analytics, machine learning, big data management, and other disciplines, including complex systems, communications, social science, decision science, and management science.

Intrinsic Relations between Data Science, Big Data, Business Analytics and Datafication

TLDR
The definition of Data Science is provided, based on the review of available literature and a schematic representation of the concepts are proposed, to bring a paradigm change to a number of areas.

Data Science: Profession and Education

  • Longbing Cao
  • Education, Computer Science
    IEEE Intelligent Systems
  • 2019
TLDR
This article intends to address the deviated status of existing professional and educational markets, the restricted benchmarldng and accreditation of responsibilities and capabilities of data scientists, and the urgent need of standardizing and upgrading competencies and maturity of data science qualifications and education.

Data Science and its Relationship to Big Data and Data-Driven Decision Making

TLDR
It is argued that there are good reasons why it has been hard to pin down exactly what is data science, and that to serve business effectively, it is important to understand its relationships to other important related concepts, and to begin to identify the fundamental principles underlying data science.

Data Science Professional Uncovered: How the EDISON Project will Contribute to a Widely Accepted Profile for Data Scientists

TLDR
The strategy proposed is based on both the analysis of the demand side (industries, research centers and research infrastructure organisations) and the supply side (Universities and training centers) bridging between the providers and employers by cooperating on the establishment of a Competence Framework and a Body of Knowledge for the Data Scientist Professional.

Data science, big data and granular mining

A Big Data Guide to Understanding Climate Change: The Case for Theory-Guided Data Science

TLDR
A theory-guided data science paradigm is proposed that uses scientific theory to constrain both the big data techniques as well as the results-interpretation process to extract accurate insight from large climate data.

What Is Data Science

TLDR
The minor program in data science is intended to equip students with computational and analytical comprehension and tools that will allow them to work on a variety of data-driven problems in any discipline.

Analyzing the Analyzers: An Introspective Survey of Data Scientists and Their Work

TLDR
It is found that data scientists today can be clustered into four subgroups, each with a different mix of skillsets, and their purpose is to identify a new, more precise vocabulary for data science roles, teams, and career paths.

Statistics: a data science for the 21st century

TLDR
It is argued that, although there is a threat to the long-term status of statistics, there is also a much greater opportunity to re-emphasize the universal relevance of statistical method to the interpretation of data.
...