Learn More
Classification and categorization are common tasks in data mining and knowledge discovery. Visualizations of classification models can create understanding and trust in data mining models. However, existing visualizations are often complex or restricted to specific classifiers and attributes. In this work, we propose an intuitive visualiza-tion system to(More)
The information obtained from the Web is increasingly important for decision making and for our everyday tasks. Due to the growth of uncertified sources, blogosphere, comments in the social media and automatically generated texts, the need to measure the quality of text information found on the Internet is becoming of crucial importance. It has been(More)
In this paper, we outline our experiments carried out at the TREC Microblog Track 2011. Our system is based on a plain text index extracted from Tweets crawled from twitter.com. This index has been used to retrieve candidate Tweets for the given topics. The resulting Tweets were post-processed and then analyzed using three different approaches: (i) a burst(More)
People use weblogs to express thoughts, present ideas and share knowledge. However, weblogs can also be misused to influence and manipulate the readers. Therefore the credibility of a blog has to be validated before the available information is used for analysis. The credibility of a blogentry is derived from the content, the credibility of the author or(More)
In this work we present APA Labs, a generic framework for visualizing the news article domain. APA Labs is a web-based platform enabling retrieval and analysis of news repositories provided by the Austrian Press Agency. APA Labs is designed as a rich internet application combined with a modular system of interactive visualizations. News articles are(More)
The study explores the citedness of research data, its distribution over time and how it is related to the availability of a DOI (Digital Object Identifier) in Thomson Reuters' DCI (Data Citation Index). We investigate if cited research data " impact " the (social) web, reflected by altmetrics scores, and if there is any relationship between the number of(More)
Introduction We are currently witnessing a change in scholarly communication. Next to the paper, complementary materials, such as research data, source code, and images are regarded as important outcomes that should be shared and built upon (Kraker et al., 2011). In this new ecosystem, many archives have been established that cater to the needs of a digital(More)
In this work, we study social and academic network activities of researchers from Computer Science. Using a recently proposed framework, we map the researchers to their Twitter accounts and link them to their publications. This enables us to create two types of networks: first, networks that reflect social activities on Twitter, namely the researchers'(More)
Breaking news and events are often posted in the blogo-sphere before they are published by any media agency. Therefore , the blogosphere is a valuable resource for news-related blog analysis. However, it is crucial to first sort out news-unrelated content like personal diaries or advertising blogs. Besides, there are different levels of emotionality or(More)