Data Leverage: A Framework for Empowering the Public in its Relationship with Technology Companies

@article{Vincent2021DataLA,
  title={Data Leverage: A Framework for Empowering the Public in its Relationship with Technology Companies},
  author={Nicholas Vincent and Hanlin Li and Nicole Tilly and Stevie Chancellor and Brent J. Hecht},
  journal={Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency},
  year={2021}
}
  • Nicholas Vincent, Hanlin Li, Brent J. Hecht
  • Published 18 December 2020
  • Computer Science, Political Science
  • Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency
Many powerful computing technologies rely on implicit and explicit data contributions from the public. This dependency suggests a potential source of leverage for the public in its relationship with technology companies: by reducing, stopping, redirecting, or otherwise manipulating data contributions, the public can reduce the effectiveness of many lucrative technologies. In this paper, we synthesize emerging research that seeks to better understand and help people action this data leverage… 

Tables from this paper

Can "Conscious Data Contribution" Help Users to Exert "Data Leverage" Against Technology Companies?
TLDR
Using simulations, it is found thatconscious data contribution (CDC) could be highly effective at reducing the gap in intelligent technologies performance between an incumbent and their competitors.
A Deeper Investigation of the Importance of Wikipedia Links to Search Engine Results
TLDR
An investigation into the incidence of Wikipedia links in search engine results pages (SERPs) finds that Wikipedia links are extremely common in important search contexts, appearing in 67-84% of desktop SERPs for common and trending queries, but less often for medical queries.
From Data Leverage to Data Co-Ops: An Institutional Model for User Control over Information Access
TLDR
This work presents one possible instantiation of the data co-op, including the Platform for Untrusted Resource Evaluation (PURE), a system for assigning labels provided by untrusted and semi-trusted parties to Internet resources.
Conceptualising fairness: three pillars for medical algorithms and health equity
TLDR
Any claim to fairness must reflect critical assessment and ongoing social and political deliberation around these three pillars with a range of stakeholders, including patients, according to the ‘Three Pillars for Fairness’.
From Treatment to Healing:Envisioning a Decolonial Digital Mental Health
The field of digital mental health is making strides in the application of technology to broaden access to care. We critically examine how these technology-mediated forms of care might amplify
Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR
TLDR
The lens of language policy is used to analyse how current practices in training and testing ASR systems in industry lead to the data bias giving rise to systematic error differences, and proposes a re-framing of language resources as (public) infrastructure which should not solely be designed for markets, but for, and with meaningful cooperation of, speech communities.
Towards Conviviality in NavigatingHealth Information on Social Media
HCI is increasingly concerned with health information quality and spread of misinformation on social media. Despite many major platforms having been adopted across the world, the situated evaluation
“A Reasonable Thing to Ask For”: Towards a Unified Voice in Privacy Collective Action
People feel concerned, angry, and powerless when subjected to surveillance, data breaches and other privacy-violating experiences with institutions (PVEIs). Collective action may empower groups of
Addressing Privacy Threats from Machine Learning
TLDR
A brief overview of strategies for resisting these surveillance technologies is presented and calls for greater collaboration between machine learning and human-computer interaction researchers to address the threats that these technologies pose.
Adversarial for Good? How the Adversarial ML Community's Values Impede Socially Beneficial Uses of Attacks
TLDR
It is found that most adversarial ML researchers at NeurIPS hold two fundamental assumptions that will make it difficult for them to consider socially beneficial uses of attacks: it is desirable to make systems robust, independent of context, and attackers of systems are normatively bad and defenders of systems is normatively good.
...
1
2
...

References

SHOWING 1-10 OF 147 REFERENCES
Can "Conscious Data Contribution" Help Users to Exert "Data Leverage" Against Technology Companies?
TLDR
Using simulations, it is found thatconscious data contribution (CDC) could be highly effective at reducing the gap in intelligent technologies performance between an incumbent and their competitors.
Too Much Data: Prices and Inefficiencies in Data Markets
TLDR
It is demonstrated that the data externalities depress the price of data because once a user’s information is leaked by others, she has less reason to protect her data and privacy.
Data Donation: How to Resist the iLeviathan
  • B. Prainsack
  • Political Science
    Philosophical Studies Series
  • 2019
Large corporations are attracting criticism for their quasi-monopolist role in the digital data domain. It has been argued that they are no longer regular market participants but have become de facto
Should We Treat Data as Labor ? Moving Beyond “ Free ”
In the previous paper in this session and in a forthcoming book (Posner and Weyl, 2018), one of us argues that by creating or strengthening absent markets, we can simultaneously address the
How Do People Change Their Technology Use in Protest?
TLDR
It is found that protest use and protest non-use are relatively common, with 30% of respondents in 2019 reporting they were protesting at least one major tech company and the most common tactics were stopping use and leveraging ad blockers.
Mapping the Potential and Pitfalls of "Data Dividends" as a Means of Sharing the Profits of Artificial Intelligence
TLDR
This paper begins the process of developing a concrete design space for data dividends, and simulates the effects of a variety of important design decisions using well-known datasets and algorithms to highlight the need for dividend implementers to make design decisions cautiously.
Nonrivalry and the Economics of Data
Data is nonrival: a person’s location history, medical records, and driving data can be used by any number of firms simultaneously. Nonrivalry leads to increasing returns and implies an important
Impact of Wikipedia on Market Information Environment: Evidence on Management Disclosure and Investor Reaction
TLDR
Findings support the view that Wikipedia improves the information environment in the financial market and underscore the value of information aggregation through the use of information technology.
Building Successful Online Communities: Evidence-Based Social Design
Online communities are among the most popular destinations on the Internet, but not all online communities are equally successful. For every flourishing Facebook, there is a moribund Friendster--not
Big Data from the bottom up
This short article argues that an adequate response to the implications for governance raised by ‘Big Data’ requires much more attention to agency and reflexivity than theories of ‘algorithmic power’
...
1
2
3
4
5
...