Towards Link Characterization From Content: Recovering Distributions From Classifier Output

@article{Grothendieck2008TowardsLC,
  title={Towards Link Characterization From Content: Recovering Distributions From Classifier Output},
  author={John Grothendieck and A. Gorin},
  journal={IEEE Transactions on Audio, Speech, and Language Processing},
  year={2008},
  volume={16},
  pages={847-858}
}
In processing large volumes of speech and language data, we are often interested in the distribution of languages, speakers, topics, etc. For large data sets, these distributions are typically estimated at a given point in time using pattern classification technology. It is well known that such estimates can be highly biased, especially for rare classes. While these biases have been addressed in some applications, they have thus far been ignored in the speech and language literature. This… Expand
Estimating and Exploiting Language Distributions of Unlabeled Data
Random attributed graphs for statistical inference from content and context
Social correlates of turn-taking style

References

SHOWING 1-10 OF 65 REFERENCES
Tracking changes in language
  • John Grothendieck
  • Computer Science
  • IEEE Transactions on Speech and Audio Processing
  • 2005
Beyond ASR 1-best: Using word confusion networks in spoken language understanding
Spoken Language Recognition -a Step towards Multilinguality in Speech Processing
Spoken language recognition-a step toward multilinguality in speech processing
Application-independent evaluation of speaker detection
An overview of automatic speaker diarization systems
Estimating and evaluating confidence for forensic speaker recognition
Sparseness vs Estimating Conditional Probabilities: Some Asymptotic Results
Stochastic language adaptation over time and state in natural spoken dialog systems
...
1
2
3
4
5
...