Yingping Huang

Design and implementation of a research support system for web data mining has become a challenge for researchers wishing to utilize useful information on the web. This paper proposes a framework for web data mining support systems. These systems are designed for identifying, extracting, filtering and analyzing data from web resources. They combine web…
In this paper, we describe the design and implementation of a self-manageable multi-tiered infrastructure to support web-based scientific simulations. This infrastructure demonstrates not only the successful integration of Web servers, simulation servers, database servers, reports servers, data warehousing and mining, but also the ability to achieve self…
Obstacle detection and classification in a complex urban area are highly demanding, but desirable for pedestrian protection, stop & go, and enhanced parking aids. The most difficult task for the system is to segment objects from a varied and complicated background. In this paper, a novel position-based object segmentation method is proposed to solve…
A scientific collaboratory for supporting research in the field of environmental science is presented in this paper. The purpose of building this Web-based research support system is to promote collaboration among a geographically separated group of NSF-sponsored scientists from different research areas and allow them to share their data and information…
Understanding the Open Source Software (OSS) movement came into focus for many researchers due to the recent fast expansion of OSS communities. SourceForge, which is the data source of this research, is one of the biggest OSS communities. While most of the existing research about OSS communities is focused on the community itself, our research is focused on…
Web data integration is an important preprocessing step for web mining. It is highly likely that several records on the web whose textual representations differ may represent the same real-world entity. These records are called approximate duplicates. Data integration seeks to identify such approximate duplicates and merge them into integrated records. Many…