Sentiment classification aims to mine people's reviews of an event's topic or a product by automatically classifying the reviews as positive or negative opinions. With the rapid development of World Wide Web applications, sentiment classification has a huge opportunity to help people automatically analyze customers' opinions from the web ...
In the deep web, a significant amount of information can be accessed only through the query interface of a back-end database. However, general search engines cannot interact with query interfaces, so a myriad of hidden, invisible information remains inaccessible. Therefore, this paper proposes a novel method of filling in deep web entry forms by ...
Automatic text classification is one of the most important tools in information retrieval. Because traditional text classification methods cannot find the best feature set, a genetic algorithm (GA) is applied to feature selection, since it can find a globally optimal solution. This paper presents a novel text classifier from positive and unlabeled documents based on ...
For integrating web databases, the very first challenge is to understand what a query interface says or what query capabilities a source supports. From a user's point of view, the internal structure of a web page is of no concern. In most cases, a semantic block is identified through visual elements. Therefore, in this paper, a novel algorithm of ...
Topical Web crawling is an established technique for domain-specific information retrieval. However, almost all conventional topical Web crawlers focus on building crawlers with different classifiers, which requires a large amount of labeled training data that is very difficult to label manually. This paper presents a novel approach called clustering-based topical ...
A query interface is used to formulate queries that retrieve the needed data from web databases in the deep web. To access domain-specific databases, the most important step is to construct an integrated interface that allows uniform access to disparate relevant sources. Therefore, this paper proposes a novel method of integrating query interfaces based on ...