Learn More
This paper describes the University of Houston team's efforts toward the problem of identifying reference spans in a reference document given sentences from other documents that cite the reference document. We investigated the following approaches: cosine similarity with multiple incremental modifications and SVMs with a tree kernel. Although the best(More)
Since 1990 extensive funds have been spent on research in climate change. Although Earth Sciences, including climatology and hydrology, have benefited significantly, progress has proved incommensurate with the effort and funds, perhaps because these disciplines were perceived as " tools " subservient to the needs of the climate change enterprise rather than(More)
Papers published in Hydrology and Earth System Sciences Discussions are under open-access review for the journal Hydrology and Earth System Sciences Abstract Since 1990 extensive funds have been spent on research in climate change. Although Earth Sciences, including climatology and hydrology, have benefited significantly , progress has proved incommensurate(More)
Since 1990 extensive funds have been spent on research in climate change. Although Earth Sciences, including climatology and hydrology, have benefited significantly , progress has proved incommensurate with the effort and funds, perhaps because these disciplines were perceived as " tools " subservient to the needs of the climate change enterprise rather(More)
Collocation and idiom extraction are well-known challenges with many potential applications in Natural Language Processing (NLP). Our experimental, open-source software system, called ICE, is a python package for flexibly extracting colloca-tions and idioms, currently in English. It also has a competitive POS tagger that can be used alone or as part of(More)
We focus on email-based attacks, a rich field with well-publicized consequences. We show how current Natural Language Generation (NLG) technology allows an attacker to generate masquerade attacks on scale, and study their effectiveness with a within-subjects study. We also gather insights on what parts of an email do users focus on and how users identify(More)
The CL-SciSumm 2016 shared task introduced an interesting problem: given a document D and a piece of text that cites D, how do we identify the text spans of D being referenced by the piece of text? The shared task provided the first annotated dataset for studying this problem. We present an analysis of our continued work in improving our system’s(More)
  • 1