• Publications
  • Influence
Automatic Text Categorization in Terms of Genre and Author
TLDR
We propose a set of style markers including analysis-level measures that represent the way in which the input text has been analyzed and capture useful stylistic information without additional cost. Expand
  • 438
  • 25
  • PDF
Speech enhancement based on audible noise suppression
TLDR
A novel speech enhancement technique is presented based on the definition of the psychoacoustically derived quantity of audible noise spectrum and its subsequent suppression using optimal nonlinear filtering of the short-time spectral amplitude (STSA) envelope. Expand
  • 211
  • 21
  • PDF
Computer-Based Authorship Attribution Without Lexical Measures
TLDR
The most important approaches to computer-assistedauthorship attribution are exclusively based onlexical measures that either represent the vocabularyrichness of the author or simply comprise frequenciesof occurrence of common words. Expand
  • 266
  • 18
  • PDF
Comparative Evaluation of Various MFCC Implementations on the Speaker Verification Task
TLDR
We perform a comparative evaluation of the most popular MFCC (Mel Frequency Cepstral Coefficients) implementations on the task of speaker verification. Expand
  • 301
  • 17
Text Genre Detection Using Common Word Frequencies
TLDR
We present a method for detecting the text genre quickly and easily following an approach originally proposed in authorship attribution studies which uses as style markers the frequencies of occurrence of the most frequent words in a training corpus (Burrows, 1992). Expand
  • 206
  • 13
  • PDF
EUROM - a spoken language resource for the EU - the SAM projects
  • 135
  • 10
Automatic Stochastic Tagging of Natural Language Texts
TLDR
Five language and tagset independent stochastic taggers, handling morphological and contextual information, are presented and tested in corpora of seven European languages using two sets of grammatical tags. Expand
  • 121
  • 10
  • PDF
Computer aided diagnosis of breast cancer in digitized mammograms.
A computer aided neural network classification of regions of suspicion (ROS) on digitized mammograms is presented in this paper which employs features extracted by a new technique based onExpand
  • 140
  • 6
Automatic Authorship Attribution
TLDR
In this paper we present an approach to automatic authorship attribution dealing with real-world (or unrestricted) text. Expand
  • 100
  • 6
  • PDF
Slant estimation algorithm for OCR systems
TLDR
A slant removal algorithm is presented based on the use of the vertical projection profile of word images and the Wigner-Ville distribution. Expand
  • 65
  • 4
  • PDF