Dinusha Vatsalan

Learn More
Integrating data from diverse sources with the aim to identify similar records that refer to the same real-world entities without compromising privacy of these entities is an emerging research problem in various domains. This problem is known as privacy-preserving record linkage (PPRL). Scalability of PPRL is a main challenge due to growing data size in(More)
The task of linking multiple databases with the aim to identify records that refer to the same entity is occurring increasingly in many application areas. If unique identifiers for the entities are not available in all the databases to be linked, techniques that calculate approximate similarities between records must be used for the identification of(More)
The high penetration of mobile devices and networks globally implies that mobile technologies can be used very effectively in the field of Healthcare in order to compensate the scarcity of resources problem, particularly in developing countries. With the proliferation of mobile technologies, mobile health (mHealth) will play a vital role in the rapidly(More)
Record linkage is the process of identifying which records in different databases refer to the same realworld entities. When personal details of individuals, such as names and addresses, are used to link databases across different organisations, then privacy becomes a major concern. Often it is not permissible to exchange identifying data among(More)
Privacy-preserving record linkage (PPRL) is the process of identifying records that correspond to the same real-world entities across several databases without revealing any sensitive information about these entities. Various techniques have been developed to tackle the problem of PPRL, with the majority of them only considering linking two databases.(More)
Linking data from multiple sources enables more sophisticated analysis and data mining by improving the quality of data through the identification and resolution of conflicting data values, the enrichment of data, and the imputation of missing values [30]. The analysis of integrated data can, for example, facilitate the detection of adverse drug reactions(More)
Record linkage is an emerging research area which is required by various real-world applications to identify which records in different data sources refer to the same real-world entities. Often privacy concerns and restrictions prevent the use of traditional record linkage applications across different organizations. Linking records in situations where no(More)
Entity resolution (ER) has wide-spread applications in many areas, including e-commerce, health-care, the social sciences, and crime and fraud detection. A crucial step in ER is the accurate classification of pairs of records into matches (assumed to refer to the same entity) and non-matches (assumed to refer to different entities). In most practical ER(More)
We demonstrate GeCo, an online personal data GEnerator and COrruptor that facilitates the creation of realistic personal data ranging from names, addresses, and dates, to social security and credit card numbers, as well as numerical values such as salary or blood pressure. Using an intuitive Web interface, a user can create records containing such data(More)