Xueji Sun

This paper reports the approaches that the PRIS lab of BUPT applied to the Entity Track task in TREC 2010. We used a Document-Centered Model (DCM) and an Entity-Centered Model (ECM) for the task. In addition to the Indri retrieval model, the BM25 method was introduced into ECM. Another improvement targeted entity extraction. Special web page, NER tool and entity list generated by...
This paper presents the system adopted by the PRIS team for the Faceted Blog Distillation task. The PRIS system was submitted by the Pattern Recognition and Intelligent System Lab at Beijing University of Posts and Telecommunications. A two-stage strategy is used for this task. First, an adaptable Voting Model is applied for blog distillation. Then,...
Because blog services provide a wealth of information resources, blog search and classification have considerable research value. This paper focuses on blog classification along the personal vs. official facet. Our system adopts a two-stage strategy: in the training model, lexicons are built automatically; in the classification model, scoring and ranking are...