Akiko Aizawa

Learn More
Record linkage refers to techniques for identifying records associated with the same real-world entities. Record linkage is not only crucial in integrating multi-source databases that have been generated independently, but is also considered to be one of the key issues in integrating heterogeneous Web resources. However, when targeting large-scale data, the(More)
In this paper, we develop new methods for adjusting connguration parameters of genetic algorithms operating in a noisy environment. Such methods are related to the scheduling of resources for tests performed in genetic algorithms. Assuming that the population size is given, we address two problems related to the design of eecient scheduling algorithms(More)
•Mathematics plays a fundamental role in Science, Technology, and Engineering (learn from Math, apply for STEM) •Mathematical knowledge is rich in content, sophisticated in structure, and technical in presentation! •There is a lot of documents with maths – 120.000 journal articles per year in pure/applied math, 3.5 Million overall – 50 million science(More)
This paper gives an overview of the Informational Retrieval Task 2 that was conducted from 2003 to 2004 as a subtask of the WEB Task at the Fourth NTCIR Workshop (‘NTCIR-4 WEB’). In the Informational Retrieval Task, we attempted to assess the retrieval effectiveness of Web search engine systems from a viewpoint of topical relevance, and to build a re-usable(More)
This paper describes an overview of the Navigational Retrieval Subtask 2 that was conducted from 2004 to 2005 as a subtask of the WEB Task at the Fifth NTCIR Workshop. In the Subtask, we attempted to assess the retrieval effectiveness of web search systems from a viewpoint of “Known Item Search” using a common data set, and built a re-usable test(More)
The feature quantity, a quantitative representation of specificity introduced in this paper, is based on an information theoretic perspective of co-occurrence events between terms and documents. Mathematically, the feature quantity is defined as a product of probability and information, and maintains a good correspondence with the tfidflike measures(More)