Learn More
Automatic knowledge base population from text is an important technology for a broad range of approaches to learning by reading. Effective automated knowledge base population depends critically upon coreference resolution of entities across sources. Use of a wide range of features, both those that capture evidence for entity merging and those that argue(More)
Grammatical structures for word-level sentiment detection. Title = {Grammatical structures for word-level sentiment detection}, } Links: • Data [ Abstract Existing work in fine-grained sentiment analysis focuses on sentences and phrases but ignores the contribution of individual words and their grammatical connections. This is because of a lack of both (1)(More)
Texture analysis of positron emission tomography (PET) images of the brain is a very difficult task, due to the poor signal to noise ratio. As a consequence, very few techniques can be implemented successfully. We use a new global analysis technique known as the Trace transform triple features. This technique can be applied directly to the raw sinograms to(More)
We present an end-to-end pipeline including a user interface for the production of word-level annotations for an opinion-mining task in the information technology (IT) domain. Our pre-annotation pipeline selects candidate sentences for annotation using results from a small amount of trained annotation to bias the random selection over a large corpus. Our(More)
How can the relationships among information technology innovations be described and analyzed in a representative, dynamic, and scalable way? THEORETICAL FRAMEWORK Innovation concepts are interrelated in an idea network, where using an ecological perspective they can be likened to species in a competitive and symbiotic resource space. METHODS We apply(More)
We present results of a novel experiment to investigate speech production in conversational data that links speech rate to information density. We provide the first evidence for an association between syntactic surprisal and word duration in recorded speech. Using the AMI corpus which contains transcriptions of focus group meetings with precise word(More)
Most recent unsupervised methods in vector space semantics for assessing thematic fit (e.g. create prototypical role-fillers without performing word sense disam-biguation. This leads to a kind of sparsity problem: candidate role-fillers for different senses of the verb end up being measured by the same " yardstick " , the single prototypical role-filler. In(More)
Due to increased competition in the IT Services business, improving quality, reducing costs and shortening schedules has become extremely important. A key strategy being adopted for achieving these goals is the use of an asset-based approach to service delivery, where standard reusable components developed by domain experts are minimally modified for each(More)
We investigate the effect of linguistic complexity on cogni-tive load in a dual-task scenario, namely simultaneous driving and language use. To this end, we designed an experiment where participants use a driving simulator while listening to spoken stimuli and answering comprehension questions. On-line physiological measures of cognitive load, including the(More)
This thesis contributes to a larger social science research program of analyzing the diffusion of IT innovations. We show how to automatically discriminate portions of text dealing with opinions about innovations by finding {source, target, opinion} triples in text. In this context, we can discern a list of innovations as targets from the domain itself. We(More)