Learn More
Web provides a large-scale corpus for researchers to study the language usages in real world. Developing a web-scale corpus needs not only a lot of computation resources, but also great efforts to handle the large variations in the web texts, such as character encoding in processing Chinese web texts. In this paper, we aim to develop a web-scale Chinese(More)
The conversations between posters and repliers in microblogs form a valuable writer-reader emotion corpus. In a microblog conversation, the writer of the initial post and the reader who replies to the initial post can both express their emotions. The process of changing from writer emotion to reader emotion is called a writer-reader emotion transition in(More)
While microblogging has gained popularity on the Internet, analyzing and processing short messages has become a challenging task in natural language processing. This paper analyzes the differences between Internet short messages (or " microtext ") and general articles by comparing the Plurk Corpus and the Sinica Balanced Corpus. Likelihood ratio and the(More)
Respiratory syncytial virus (RSV) and human metapneumovirus (HMPV) are two common viral pathogens in acute lower respiratory tract infections (ALRTI). However, the association of viral load with clinical characteristics is not well-defined in ALRTI. To explore the correlation between viral load and clinical characteristics of RSV and HMPV in children(More)
  • 1