Alena Böhmová

Learn More
The availability of annotated data (with as rich and “deep” annotation as possible) is desirable in any new developments. Textual data are being used for so-called training phase of various empirical methods solving various problems in the field of computational linguistics. While there are many methods that use texts in their plain (or raw) form (in most(More)
A semi-automatic syntactic annotation of a part of the Czech National Corpus in the Prague Dependency Treebank (PDT) has among its aims the possibility to check the theoretical approach chosen (Functional Generative Description, see [2]). While the first phases of the annotation of PDT, i.e. the morphemic representations and the dependency trees on an(More)
$ FRPSOH[ SURFHGXUH RI V\QWDFWLF DQQRWDWLRQ RI D ODUJH WH[W FRUSXV PD\ EH KHOSIXO LQ FKHFNLQJ D ULFK GHVFULSWLYH IUDPHZRUN WKH 3UDJXLDQ )XQFWLRQDO *HQHUDWLYH 'HVFULSWLRQ WKDW PDNHV LW SRVVLEOH WR GLVWLQJXLVK EHWZHHQ WKH FRUH RI QDWXUDO ODQJXDJH VWUXFWXUHG LQ D UHODWLYHO\ VLPSOH ZD\ DQG LWV ODUJH SHULSKHU\ ZLWK LQGLVWLQFW ERUGHUOLQHV 6XFK D SURFHGXUH(More)
The syntactic tagging of the Prague Dependency Treebank (PDT) is divide into two steps, the rst resulting in analytic tree structures (ATS) and the second in tectogrammatical tree structures (TGTS). The present paper describes the transition procedures, automatic and manual, from ATS to TGTS and illustrates these procedures on two Czech sentences. Syntactic(More)
  • 1