Share This Author
Understanding and explaining Delta measures for authorship attribution
It is shown that feature vector normalization, that is, the transformation of the feature vectors to a uniform length of 1 (implicit in the cosine measure), is the decisive factor for the improvement of Delta proposed recently.
TaDiRAH: a Case Study in Pragmatic Classification
Topic Modeling Genre: An Exploration of French Classical and Enlightenment Drama
- Christof Schöch
- LinguisticsDigit. Humanit. Q.
- 1 June 2015
The concept of literary genre is a highly complex one: not only are different genres frequently defined on several, but not necessarily the same levels of description, but consideration of genres as…
Revisiting Style, a Key Concept in Literary Studies
Abstract Language and literary studies have studied style for centuries, and even since the advent of ›stylistics‹ as a discipline at the beginning of the twentieth century, definitions of ›style‹…
Big? Smart? Clean? Messy? Data in the Humanities
- Christof Schöch
- 1 August 2013
This paper is about data in the humanities, and how the digital humanities aim to raise to the challenge and realize the potential of this data for humanistic inquiry.
Straight Talk! Automatic Recognition of Direct Speech in Nineteenth-Century French Novels
- Christof Schöch, Daniel Schlör, Stefanie Popp, Ann Brunner, Ulrike Henny, J. C. Tello
- Art, LinguisticsDH
The work presented here addresses both the question of how to identify direct speech in French prose fiction and that of how prevalent direct speech is in different subgenres of the nineteenth-century French novel.
In search of comity: TEI for distant reading
- L. Burnard, Christof Schöch, C. Odebrecht
- Computer ScienceJournal of the Text Encoding Initiative
- 17 March 2021
The focus of the ELTeC encoding scheme is not to represent texts in all their original complexity, nor to duplicate the work of scholarly editors, but to facilitate a richer and better-informed distant reading than a transcription of lexical content alone would permit.
Explaining Delta, or: How do distance measures for authorship attribution work?
- S. Evert, Fotis Jannidis, Thomas Proisl, Steffen Pielström, Christof Schöch, Thorsten Vitt
- 5 June 2015
This paper presents findings from a larger investigation of authorship attribution methods, which pertains to the effects of normalization methods and distance measures in different languages, describing the aims, data and methods.
Towards a better understanding of Burrows’s Delta in literary authorship attribution
- S. Evert, Thomas Proisl, Thorsten Vitt, Christof Schöch, Fotis Jannidis, Steffen Pielström
- 1 May 2015
The effects of standardization and vector normalization on the statistical distributions of features and the resulting text clustering quality are evaluated and supervised selection of discriminant words are explored as a procedure for further improving authorship attribution.