Frederik Vaassen

We introduce a new emotion classification task based on Leary's Rose, a framework for interpersonal communication. We present a small dataset of 740 Dutch sentences, outline the annotation process and evaluate annotator agreement. We then evaluate the performance of several automatic classification systems when classifying individual sentences according to…
In this paper we present a new approach for detecting online pedophiles in chat rooms that combines the results of predictions on the level of the individual post, the level of the user and the level of the entire conversation, and describe the results of this three-stage system in the PAN 2012 competition. Also, we describe a resampling and a filtering…
We describe the natural language processing component of a new serious gaming project, deLearyous, which aims at developing an environment in which users can improve their communication skills by interacting with a virtual character in (Dutch) written natural language. The virtual characters' possible dialogue paths are defined by Leary's Rose, a framework…
We describe project deLearyous, in which the goal is to develop a proof-of-concept of a serious game that will assist in the training of communication skills following the Interpersonal Circumplex (also known as Leary's Rose), a framework for interpersonal communication. Users will interact with the application using unconstrained written natural language…
Although in recent years numerous forms of Internet communication – such as e-mail, blogs, chat rooms and social network environments – have emerged, balanced corpora of Internet speech with trustworthy meta-information (e.g. age and gender) or linguistic annotations are still limited. In this paper we present a large corpus of Flemish Dutch chat posts that…
We present a system to automatically identify emotion-carrying sentences in suicide notes and to detect the specific fine-grained emotion conveyed. With this system, we competed in Track 2 of the 2011 Medical NLP Challenge, where the task was to distinguish between fifteen emotion labels, from guilt, sorrow, and hopelessness to hopefulness and…
Despite the existence of many effective methods to solve topic classification tasks for such widely used languages as English, there is no clear answer whether these methods are suitable for languages that are substantially different. We attempt to solve a topic classification task for Lithuanian, a relatively resource-scarce language that is highly…
The hardest part of writing a PhD is not the research. It's not setting up the experiments, it's not analyzing the results. It's not even writing it all down. The hardest part is to keep convincing yourself, year after year, that you can pull it off, and that what you found out is worth writing about. The support of the people around me has been…