Learn More
Many methods, including supervised and unsupervised algorithms, have been developed for extractive document summarization. Most supervised methods consider the summarization task as a twoclass classification problem and classify each sentence individually without leveraging the relationship among sentences. The unsupervised methods use heuristic rules to(More)
Most traditional text clustering methods are based on "bag of words" (<i>BOW</i>) representation based on frequency statistics in a set of documents. <i>BOW</i>, however, ignores the important information on the semantic relationships between <i>key</i> terms. To overcome this problem, several methods have been proposed to enrich text representation with(More)
In this paper, we propose an innovative approach to the segmentation of tubular structures. This approach combines all of the benefits of minimal path techniques such as global minimizers, fast computation, and powerful incorporation of user input, while also having the capability to represent and detect vessel surfaces directly which so far has been a(More)
Demographic information plays an important role in personalized web applications. However, it is usually not easy to obtain this kind of personal data such as age and gender. In this paper, we made a first approach to predict users' gender and age from their Web browsing behaviors, in which the Webpage view information is treated as a hidden variable to(More)
Foxtail millet (Setaria italica), a member of the Poaceae grass family, is an important food and fodder crop in arid regions and has potential for use as a C4 biofuel. It is a model system for other biofuel grasses, including switchgrass and pearl millet. We produced a draft genome (∼423 Mb) anchored onto nine chromosomes and annotated 38,801 genes. Key(More)
In this paper, we propose a novel ranking scheme named Affinity Ranking (AR) to re-rank search results by optimizing two metrics: (1) diversity -- which indicates the variance of topics in a group of documents; (2) information richness -- which measures the coverage of a single document to its topic. Both of the two metrics are calculated from a directed(More)
In this paper, we propose an innovative approach to the segmentation of tubular or vessel-like structures which combines all the benefits of minimal path techniques (global minimizers, fast computation, powerful incorporation of user input) with some of the benefits of active surface techniques (representation of a full 3D tubular surface rather than a just(More)
Genomic DNA copy number aberrations are frequent in solid tumors, although the underlying causes of chromosomal instability in tumors remain obscure. Genes likely to have genomic instability phenotypes when mutated (e.g. those involved in mitosis, replication, repair, and telomeres) are rarely mutated in chromosomally unstable sporadic tumors, even though(More)
microRNAs (miRNA) are small noncoding RNAs that participate in diverse biological processes by suppressing target gene expression. Altered expression of miR-21 has been reported in cancer. To gain insights into its potential role in tumorigenesis, we generated miR-21 knockout colon cancer cells through gene targeting. Unbiased microarray analysis combined(More)