Tyler Schnoebelen

Learn More
We present a study of the relationship between gender, linguistic style, and social networks, using a novel corpus of 14,000 Twitter users. Prior quantitative work on gender often treats this social variable as a female/male binary; we argue for a more nuanced approach. By clustering Twitter users, we find a natural decomposition of the dataset into various(More)
We present a study of the relationship between gender, linguistic style, and social networks, using a novel corpus of 14,000 users of Twitter. Prior quantitative work on gender often treats this social variable as a binary; we argue for a more nuanced approach. By clustering Twitter feeds, we find a range of styles and interests that reflects the(More)
that allowed us to collect this data. Participants heard 75 sentences from one of 16 artificially constructed languages, and were asked to identify individual words. Results from MTurk were Brainwaves are known to dip when semantic anomalies psycholinguistic data Screening for Confirming corpus trends statistically indistinguishable from controlled(More)
This paper demonstrates how to measure the compositionality of phrasal verbs using corpus frequencies from the BNC. This allows us to distinguish semantically transparent phrasal verbs (they lifted up their hats) from opaque ones (they summed up their feelings). Working by analogy to paradigmatic approaches to morphology (Moscoso del Prado Martín et al(More)
We present an after-action report for a large-scale damage assessment project that followed Hurricane Sandy's landfall on the Eastern seaboard of the USA in 2012. The Civil Air Patrol (CAP) took over 35,000 GPS-tagged images of damage-affected areas, as part of their mandate to provide aerial photographs for disaster assessment and response agencies,(More)
We present a compendium of recent and current projects that utilize crowdsourcing technologies for language studies, finding that the quality is comparable to controlled laboratory experiments, and in some cases superior. While crowdsourcing has primarily been used for annotation in recent language studies, the results here demonstrate that far richer data(More)
  • 1