Big Data and discrimination: perils, promises and solutions. A systematic review

  title={Big Data and discrimination: perils, promises and solutions. A systematic review},
  author={Maddalena Favaretto and Eva M. De Clercq and Bernice Simone Elger},
  journal={Journal of Big Data},
BackgroundBig Data analytics such as credit scoring and predictive analytics offer numerous opportunities but also raise considerable concerns, among which the most pressing is the risk of discrimination. Although this issue has been examined before, a comprehensive study on this topic is still lacking. This literature review aims to identify studies on Big Data in relation to discrimination in order to (1) understand the causes and consequences of discrimination in data mining, (2) identify… 
Exploring big data traits and data quality dimensions for big data analytics application using partial least squares structural equation modelling
The findings revealed that big data traits can significantly affect all constructs for data quality dimensions and that the ease of operation construct has a significant effect on BDA application.
The Big Data World: Benefits, Threats and Ethical Challenges
  • Marina Da Bormida
  • Computer Science
    Ethical Issues in Covert, Security and Surveillance Research
  • 2021
The debate should further investigate ‘data altruism’, deep-ening how the increasing amounts of data in the authors' society can be concretely used for public good and the best implementation modalities, and further investigate aspects like the ‘ownership’ of data and other hurdles.
The Right Not to Be Subject to Automated Individual Decision-Making/Profiling Concerning Big Health Data. Developing an Algorithmic Culture
This paper identifies and systematises the legal provisions concerned, offering interpretative solutions that tackle dangers concerning data subjects’ rights while embracing the opportunities that Big Health Data have to offer.
The analysis of influence mechanism for internet financial fraud identification and user behavior based on machine learning approaches
The Internet financial fraud identification model based on BPNN demonstrates satisfying performance and is worth of promotion, and the authentication technology, non-repudiation of transactions, privacy protection, data integrity, and users' sense of trust of the Internet finance have a significant impact on users' behavior.
Algorithmic bias: review, synthesis, and future research directions
ABSTRACT As firms are moving towards data-driven decision making, they are facing an emerging problem, namely, algorithmic bias. Accordingly, algorithmic systems can yield socially-biased outcomes,
Causal effect of racial bias in data and machine learning algorithms on user persuasiveness & discriminatory decision making: An Empirical Study
The paper bridges the gap across the harm caused in establishing poor customer trustworthiness due to an inequitable system design and provides strong support for researchers, policymakers, and data scientists to build responsible AI frameworks within organizations.
An Empirical Study on the Employment Monitoring and Early Warning Mechanism of Medical Graduates in Universities with Big Data and Complex Computing System
  • Haixia Wu, S. Tsai
  • Economics, Business
    Discrete Dynamics in Nature and Society
  • 2021
Based on the management of big data, the analysis and forecast of the employment demand cycle business situation studied in this article is based on the employment cycle theory and a complete set of
Discrimination in Modern Labor Markets: A Field Experiment
The preliminary study to determine whether ATS used in the hiring process racially discriminate does not find evidence of racial discrimination, and the initial resume evaluation process may be less significant than in the past.
Big Data and Digitalization in Dentistry: A Systematic Review of the Ethical Issues
The study results highlight how most of the issues discussed by the retrieved literature are in line with the ethical challenges that digital technologies are introducing in healthcare such as privacy, anonymity, security, and informed consent.


A Methodology for Direct and Indirect Discrimination Prevention in Data Mining
This paper discusses how to clean training data sets and outsourced data sets in such a way that direct and/or indirect discriminatory decision rules are converted to legitimate (nondiscriminatory) classification rules and proposes new techniques applicable for direct or indirect discrimination prevention individually or both at the same time.
Toward Accountable Discrimination-Aware Data Mining: The Importance of Keeping the Human in the Loop - and Under the Looking Glass
"Big Data" and data-mined inferences are affecting more and more of our lives, and concerns about their possible discriminatory effects are growing. Methods for discrimination-aware data mining and
The Ethics of Big Data: Current and Foreseeable Issues in Biomedical Contexts
This article systematically and comprehensively analyses academic literature concerning the ethical implications of Big Data, providing a watershed for future ethical investigations and regulations and identifies eleven themes that provide a thorough critical framework to guide ethical assessment and governance of emerging Big Data practices.
Measuring discrimination in algorithmic decision making
Various discrimination measures that have been used, analytically and computationally analyze their performance, and highlight implications of using one or another measure are reviewed to produce a unifying view of performance criteria when developing new algorithms for non-discriminatory predictive modeling.
Integrating induction and deduction for finding evidence of discrimination
An implementation, called LP2DD, of the overall reference model that integrates induction, through data mining classification rule extraction, and deduction, through a computational logic implementation of the analytical tools is presented.
Big Data's Disparate Impact
Advocates of algorithmic techniques like data mining argue that these techniques eliminate human biases from the decision-making process. But an algorithm is only as good as the data it works with.
The era of Big Data has begun. Computer scientists, physicists, economists, mathematicians, political scientists, bio-informaticists, sociologists, and other scholars are clamoring for access to the
[89WashLRev0001] The Scored Society: Due Process for Automated Predictions
P Procedural regularity is essential for those stigmatized by “artificially intelligent” scoring systems, and regulators should be able to test scoring systems to ensure their fairness and accuracy.
The new profiling: Algorithms, black boxes, and the failure of anti-discriminatory safeguards in the European Union
With pattern-based categorizations in data-driven profiling, safeguards such as the Charter of Fundamental Rights of the European Union or the EU data-protection framework essentially lose their applicability, leading to a diminishing role of the tools of the anti-discrimination framework.
Known or knowing publics? Social media data mining and the question of public agency
New methods to analyse social media data provide a powerful way to know publics and capture what they say and do. At the same time, access to these methods is uneven, with corporations and