Chia-Wei Chu

Learn More
Consensus clustering involves combining multiple clusterings of the same set of objects to achieve a single clustering that will, hopefully, provide a better picture of the groupings that are present in a dataset. This Letter reports the use of consensus clustering methods on sets of chemical compounds represented by 2D fingerprints. Experiments with DUD,(More)
Standardization is used to ensure that the variables in a similarity calculation make an equal contribution to the computed similarity value. This paper compares the use of seven different methods that have been suggested previously for the standardization of integer-valued or real-valued data, comparing the results with unstandardized data. Sets of(More)
The large sized data sets are replicated in more than one site for the better availability to the nodes in a grid. Downloading the dataset from these replicated locations have practical difficulties, due to network traffic, congestion, frequent change-in performance of the servers, etc. In order to speed up the download, complex server selection techniques,(More)
In data grid environments, datasets are usually replicated to many servers when taking into consideration its efficiency. Since these files are usually huge in size, how to efficiently transmit and access between servers and grid users is an important issue. In this paper, we present an economy-based parallel file transfer technique using P2P co-allocation(More)
  • 1