Using APIs for Data Collection on Social Media

@article{Lomborg2014UsingAF,
  title={Using APIs for Data Collection on Social Media},
  author={Stine Lomborg and Anja Bechmann},
  journal={The Information Society},
  year={2014},
  volume={30},
  pages={256 - 265}
}
This article discusses how social media research may benefit from social media companies making data available to researchers through their application programming interfaces (APIs). An API is a back-end interface through which third-party developers may connect new add-ons to an existing service. The API is also an interface for researchers to collect data off a given social media service for empirical analysis. Presenting a critical methodological discussion of the opportunities and… 
Looking Through the Twitter Glass: Bridging the Data - Researcher Gap
TLDR
A method and prototypical web-based system that utilizes Twitter’s latest API offering to enable researchers to collect high quality data themselves at minimal cost and alleviate the major issues faced by non-technical researchers collecting social media data.
Social media comparison and analysis: The best data source for research?
  • Neli BlagusSlavko Žitnik
  • Computer Science
    2018 12th International Conference on Research Challenges in Information Science (RCIS)
  • 2018
TLDR
It is uncovered that popularity of selected topics in social media is proportional to the number of hits on Google, celebrities and politicians are the most talked topics and that behaviour of users across social mediaIs different, and basic usage statistics, reach and engagement differences, language, sentiment and gender identification are shown.
Research in Social Media: Data Sources and Methodologies
TLDR
The paper provides an overview of the characteristics of four commonly used social network sites: Facebook, Twitter, LinkedIn, and StockTwits, and provides illustrative examples of codes that a researcher might employ to extract information from the SNSs.
Big Data, Collection of (Social Media, Harvesting)
TLDR
Social media harvesting is a way to collect data from social media platforms unobtrusively and automatically by employing simple programing tools, and can handle large-scale data sources more easily than do traditional methods.
Applying user analytics to uses and effects of social media in China
TLDR
This paper presents an application study of Chinese bloggers to demonstrate how to employ user analytics methods to study the production, consumption, and effects of user-generated content.
Towards a General Architecture for Social Media Data Capture from a Multi-Domain Perspective
TLDR
A general architecture for data capture from Social Media sources has been devised, and the interfaces of the composing modules have been defined and the resulting API can be exploited for an orderly re-engineering of crawling tools in the reference domains, thus implementing specific versions of the generic architecture.
“Meaning” in Social Media
Researchers of social media struggle to stay up to speed: empirical findings are most often very context- and time-specific and quickly become outdated because the object of study changes. By
Twitter as research data
TLDR
This article compares the distributions of moral discussions from two commonly used tools for accessing Twitter data to the ground truth, the Twitter full archive, and highlights the importance of assessing the comparability of data sources to improve confidence in findings based on Twitter data.
THE EVOLUTION OF FACEBOOK'S GRAPH API
TLDR
An in-depth technical and empirical perspective on the evolution of Facebook’s application programming interfaces (APIs) is contributes to the public debate about data privacy and protection.
...
...

References

SHOWING 1-10 OF 62 REFERENCES
Researching Communicative Practice: Web Archiving in Qualitative Social Media Research
TLDR
The article discusses the merits and limitations of using social media archives in empirical social media research, and a theoretical examination of web archiving as a new method enabled by the web itself.
Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose
TLDR
Data collected using Twitter's sampled API service is compared with data collected using the full, albeit costly, Firehose stream that includes every single published tweet to help researchers and practitioners understand the implications of using the Streaming API.
The Value of Big Data in Digital Media Research
TLDR
Qualitative aspects of Big Data analyses with regard to their applicability and usefulness in digital media research are discussed, and it is argued that researchers need to consider whether the analysis of huge quantities of data is theoretically justified.
Social Science Research Methods in Internet Time
TLDR
It is proposed that scholars embrace the values of transparency and kludginess in order to answer important research questions in a rapidly-changing communications environment.
Is it really about me?: message content in social awareness streams
TLDR
A content-based categorization of the type of messages posted by Twitter users is developed, based on which the analysis shows two common types of user behavior in terms of the content of the posted messages, and exposes differences between users in respect to these activities.
Mapping actor roles in social media: Different perspectives on value creation in theories of user participation
TLDR
It is demonstrated that social media are either addressed in economic and socio-political value creation, that is, power, exploitation and business revenues, or in terms of value creation as sense-making, that has consequences for the conceptualization of the media user as a participatory agent.
Analyzing (social media) networks with NodeXL
TLDR
A sequence of NodeXL operations from data import to computation of network statistics and refinement of network visualization through sorting, filtering, and clustering functions is described, revealing sociologically relevant differences in the patterns of interconnection among employee participants in the social media space.
AGILE ETHICS FOR MASSIFIED RESEARCH AND VISUALIZATION
TLDR
The authors conclude by making the counterintuitive suggestion that researchers make themselves as vulnerable to potential data mining as the subjects who comprise their data sets: a parity of practice.
Social Networks that Matter: Twitter Under the Microscope
TLDR
A study of social interactions within Twitter reveals that the driver of usage is a sparse and hidden network of connections underlying the “declared” set of friends and followers.
Internet and Surveillance: The Challenges of Web 2.0 and Social Media
TLDR
This book explores topics such as commercial data collection by advertising, consumer sites and interactive media; self-disclosure in the social web; surveillance of file-sharers; privacy in the age of the internet; civil watch-surveillance on social networking sites; and networked interactive surveillance in transnational space.
...
...