Information Retrieval Test Collection for Searching Spontaneous Czech Speech

Pavel Ircing, Pavel Pecina, Douglas W. Oard, Jianqiang Wang, Ryen W. White, Jan Hoidekr
This paper describes the design of the first large-scale IR test collection built for the Czech language. Creating the collection was challenging because it is based on a continuous text stream produced by automatic transcription of spontaneous speech and therefore lacks clearly defined document boundaries. All aspects of building the collection are presented, together with some general findings from initial experiments.