Yoshiki Mikami

Learn More
The first part of the paper provides a brief description of the Language Observatory Project (LOP) and highlights the major technical difficulties to be challenged. The latter part gives how we responded to these difficulties by adopting UbiCrawler as a data collecting engine for the project. An interactive collaboration between the two groups is producing(More)
Papillary squamous cell carcinoma (PSCC) of the uterine cervix is difficult to diagnose due to its rarity and limited data regarding its clinical behavior. We attempted to assess the degree of stromal invasion using magnetic resonance imaging (MRI) and evaluate possible treatments for this lesion in view of its clinical behavior. We analyzed 28 cases of(More)
An N-gram-based language, script, and encoding scheme-detection method is introduced in this article. The method detects language, script, and encoding schemes using a target text document encoded by computer by checking how many byte sequences of the target match the byte sequences that can appear in the texts belonging to a language, script, and encoding(More)
We report a rare case of olfactory ensheathing cell tumor. A female presented a large soft mass extending medially to the olfactory cleft and laterally to the middle meatus in the left nasal cavity. Imaging studies confirmed a cystic mass extending superiorly into the frontal lobe, indicating that the tumor arouse from the olfactory mucosa. A subtotal(More)
1. Good morning ladies and gentleman. Today, on behalf of Language Observatory project and on behalf of the MAAYA, a global multi-stakeholder network for linguistic diversity, I would like to share with you my experience of measuring linguistic diversity on cyberspace. 2. My presentation today will follow this line. First, I start with a brief description(More)
The ccTLD (country code Top Level Domain) in a URL does not necessarily point to the geographic location of the server concerned. The authors have surveyed sample servers belonging to 60 ccTLDs in Africa, with regard to the number of hops required to reach the target site from Japan, the response time, and the NIC registration information of each domain.(More)
Language identification of written text in the domain of Latin-script based languages is a well-studied research field. However, new challenges arise when it is applied to non-Latin-script based languages, especially for Asian languages' web pages. The objective of this paper is to propose and evaluate the effectiveness of adapting Universal Declaration of(More)
While entertainment web forums provide a dynamic medium for interaction, not many researchers feel the need to go deeply into the contents. One of the reasons behind this attitude lies on a widely perceived assumption that web forums do not deal with knowledge matter and have the inclination to take place only as small talk. In this paper we will mainly(More)
This study aims to develop a phonetic similarity measurement method across Asian languages. The method, cross-language similarity algorithm aggregates the transcription of language-specific Romanization, the International Phonetic Alphabet, the Soundex algorithm, and Levenshtein distance. To evaluate the proposed algorithm, this study involves an experiment(More)