To solve task 2 of the KDD Cup 2002, we exploited several available information sources. In particular, relational information describing the interactions among genes, together with information automatically extracted from scientific abstracts, improved the accuracy of our predictions.
We focus on the problem of predicting yeast gene regulation experiments. To construct a good solution, we study combinations of methods that have not yet been brought together in any single data mining application. We describe our approach to propositionalizing the given relational data, which describe the interactions among proteins. We study how we…