Learn More
Large volumes of content (bookmarks, reviews, videos, etc.) are currently being created on the " Social Web " , i.e. on Web 2.0 community sites, and this content is being annotated and commented upon. The ability to view an individual's entire contribution to the Social Web would be an interesting and valuable service, particularly important as social(More)
We investigate the application of classification techniques to the problem of information extraction (IE). In particular we use support vector machines and several different feature-sets to build a set of classifiers for IE. We show that this approach is competitive with current state-of-the-art IE algorithms based on specialized learning algorithms. We(More)
The need for labeled documents is a key bottleneck in adaptive information extraction. One way to solve this problem is through active learning algorithms that require users to label only the most informative documents. We investigate several document selection strategies that are particularly relevant to information extraction. We show that some strategies(More)