• Corpus ID: 59750364

Some Frequency based Differences between Spoken and Written Swedish

@inproceedings{Allwood1998SomeFB,
  title={Some Frequency based Differences between Spoken and Written Swedish},
  author={Jens Allwood},
  year={1998}
}
This is a report on the differences in word frequency found between two Swedish corpora: a transcribed spoken language corpus of 276,391 words and a written language corpus of 271,216 words. The spoken language corpus contains material from 14 activity types, while the written language corpus contains material from novels (40%) and newspapers (60%). The report expands and continues earlier work on differences between spoken and written language, e.g. Jörgensen (1976) or Biber (1988). The word…

Comparing Syllable Frequencies in Corpora of Written and Spoken Language

The results indicate that syllable frequencies in written corpora can be taken as a rough estimate for their frequency in spoken language.

Transliteration between spoken language corpora

Comparison of languages and linguistic data is essential if progress in our understanding of the nature of spoken languages is to be made. We understand phenomena better through comparison and

Radically Data-driven Methods of Speech Analysis

We present two simple statistical methods for analyzing spoken language as represented in transcription corpora. The methods are strictly data-driven, in the sense that no preset grammatical category

The Spoken Language Corpus at the Department of Linguistics, Göteborg University

The standard of transcription (MSO) used in creating the transcriptions is discussed, as well as some types of quantitative and qualitative analysis that have been done at the Department of Linguistics, Göteborg University.

Siblings and cousins — statistical methods for spoken language analysis

In this paper we discuss two simple statistical methods for analyzing spoken language as represented in transcription corpora. The methods are strictly data-driven, in the sense that no

Oral frequency norms for 67,979 Spanish words

Validity analyses showed significant correlations of oral frequency with other frequency measures and suggest that oral frequency can predict some types of lexical processing with the same or higher levels of precision, when contrasted with text- or subtitle-based frequencies.

What kind of corpus is a web corpus?

This paper discusses an investigation into the Norwegian NoWaC corpus, which has compared this web corpus with one corpus of spoken language and one of written language, showing that this is a possible and simple way of comparing corpora.

Lexical Coverage in Taiwan Mandarin Conversation

  • S. Tseng
  • Linguistics
    Int. J. Comput. Linguistics Chin. Lang. Process.
  • 2013
Lexical coverage in Taiwan Mandarin conversation is revealed and is compared with a balanced corpus of texts in terms of words, syllables, and word categories.

Annotations and Tools for an Activity Based Spoken Language Corpus

The paper contains a description of the Spoken Language Corpus of Swedish at the Department of Linguistics, Göteborg University (GSLC), and a summary of the various types of analysis and tools that

Work on Spoken (Multimodal) Language Corpora in South Africa

This paper describes past, ongoing and planned work on the collection and transcription of spoken language samples for all the South African official languages and as part of this the training of

References

Speech Management—on the Non-written Life of Speech

This paper introduces the concept of speech management (SM), which refers to processes whereby a speaker manages his or her linguistic contributions to a communicative interaction, and which involves

Variation across Speech and Writing

The model applied in this study addressed textual dimensions and relations in speech and writing, as well as situations and functions, and its application to linguistic research on speech and writing.

On the Semantics and Pragmatics of Linguistic Feedback

This paper is an exploration in the semantics and pragmatics of linguistic feedback, i.e., linguistic mechanisms which enable the participants in spoken interaction to exchange information about

Talspråksfrekvenser. Gothenburg Papers in Theoretical Linguistics

  • University of Göteborg, Dept of Linguistics
  • 1996

Meningsbyggnader i talad svenska

  • 1976

On the Semantics and Pragmatics of Linguistic Feedback: Gothenburg Papers in Theoretical Linguistics 64

  • 1991

Om det svenska systemet för språklig återkoppling

  • Svenskans Beskrivning
  • 1988