Learn More
We have produced a draft sequence of the rice genome for the most widely cultivated subspecies in China, Oryza sativa L. ssp. indica, by whole-genome shotgun sequencing. The genome was 466 megabases in size, with an estimated 46,022 to 55,615 genes. Functional coverage in the assembled sequences was 92.0%. About 42.2% of the genome was in exact(More)
The geographic and temporal origins of the domestic dog remain controversial, as genetic data suggest a domestication process in East Asia beginning 15,000 years ago, whereas the oldest doglike fossils are found in Europe and Siberia and date to >30,000 years ago. We analyzed the mitochondrial genomes of 18 prehistoric canids from Eurasia and the New World,(More)
Exponential growth of information generated by online social networks demands effective recommender systems to give useful results. Traditional techniques become unqualified because they ignore social relation data; existing social recommendation approaches consider social network structure, but social context has not been fully considered. It is(More)
We analyzed 314,254 soybean expressed sequence tags (ESTs), including 29,540 from our laboratory and 284,714 from GenBank. These ESTs were assembled into 56,147 unigenes. About 76.92% of the unigenes were homologous to genes from Arabidopsis thaliana (Arabidopsis). The putative products of these unigenes were annotated according to their homology with the(More)
Network embedding is an important method to learn low-dimensional representations of vertexes in networks, aiming to capture and preserve the network structure. Almost all the existing network embedding methods adopt shallow models. However, since the underlying network structure is complex, shallow models cannot capture the highly non-linear network(More)
Social networks enable users to create different types of personal items. In dealing with serious information overload, the major problems of social recommendation are sparsity and cold start. In existing approaches, relational and heterogeneous domains can not be effectively utilized for social recommendation, which brings a challenge to model users and(More)
To compare the two RNA-sequencing protocols, ribo-minus RNA-sequencing (rmRNA-seq) and polyA-selected RNA-sequencing (mRNA-seq), we acquired transcriptomic data-52 and 32 million alignable reads of 35 bases in length-from the mouse cerebrum, respectively. We found that a higher proportion, 44% and 25%, of the uniquely alignable rmRNA-seq reads, is in(More)
Cascades are ubiquitous in various network environments such as epidemic networks, traffic networks, water distribution networks and social networks. The outbreaks of cascades will often bring bad or even devastating effects. How to accurately predict the cascading outbreaks in early stage is of paramount importance for people to avoid these bad effects.(More)
Lung cancer is the leading cause of cancer-related death, with non-small cell lung cancer (NSCLC) being the predominant form of the disease. Most lung cancer is caused by the accumulation of genomic alterations due to tobacco exposure. To uncover its mutational landscape, we performed whole-exome sequencing in 31 NSCLCs and their matched normal tissue(More)
Protein arginine methylation, one of the most abundant and important posttranslational modifications, is involved in a multitude of biological processes in eukaryotes, such as transcriptional regulation and RNA processing. Symmetric arginine dimethylation is required for snRNP biogenesis and is assumed to be essential for pre-mRNA splicing; however, except(More)