Learn More
In this paper we describe a method of acquiring word order fl'om corpora. Word order is defined as the order of modifiers, or the order of phrasal milts called 'bunsetsu' which depend on the stone modifiee. The method uses a model which automatically discovers what the tendency of the word order in Japanese is by using various kinds of information in and(More)
We constructed a system for answering non-factoid Japanese questions. We used passage retrieval methods for the system. We extracted paragraphs based on terms from an input question and output them as the desired answer. We classified the non-factoid questions into six categories. We used a particular method for each category. For example, we increased the(More)
We submitted four systems to the Japanese dictionary-based lexical-sample task of SENSEVAL-2. They were i) the support vector machine method ii) the simple Bayes method, iii) a method combining the two, and iv) a method combining two kinds of each. The combined methods obtained the best precision among the submitted systems. After the contest, we tuned the(More)
This paper presents a part-of-speech tagging method based on a min-max modular neural-network model. The method has three main steps. First, a large-scale tagging problem is decomposed into a number of relatively smaller and simpler subproblems according to the class relations among a given training corpus. Secondly, all of the subproblems are learned by(More)
A hybrid system R)r tagging part of speech is descril)ed that consists of a neuro tagger and a rule-based correcter. The neuro tagger is an initia.1-state a.nnotator tha.t uses difl'ertnt h_,,ngths of contexts based on longe, st context l)ri-ority. Its inputs a.re weighted 1)y information gains tha.t are obtained by information ma.xi-mization. The(More)
Robertson's 2-poisson information retrieve model does not use location and category information. We constructed a framework using location and category information in a 2-poisson model. We submitted two systems based on this framework to the IREX contest, Japanese language information retrieval contest held in Japan in 1999. For precision in the A-judgement(More)
Our information retrieval system which achieves its goals by taking advantage of numerous characteristics of the information and applying numerous sophisticated techniques is described. Robertson's 2-Poisson model and Rocchio's formula, both of which are known to be effective, have been applied in the system. Characteristics of newspapers such as(More)
The elastic-input neuro-tagger and hybrid tagger, combined with a neural network and Brill's error-driven learning, have already been proposed to construct a practical tagger using as little training data as possible. When a small Thai corpus is used for training, these taggers have tagging accuracies of, respectively, 94.4% and 95.5% (accounting only for(More)