Sheng Guo

Given a movie comment, does it contain a spoiler? A spoiler is a comment that, when disclosed, would ruin a surprise or reveal an important plot detail. We study automatic methods to detect comments and reviews that contain spoilers and apply them to reviews from the IMDB (Internet Movie Database) website. We develop topic models, based on Latent Dirichlet…
Modeling the movement of information within social media outlets, like Twitter, is key to understanding how ideas spread, but quantifying such movement runs into several difficulties. Two specific areas that elude a clear characterization are (i) the intrinsic random nature of individuals to potentially adopt and subsequently broadcast a Twitter topic,…
A total of 752 odorant receptor (Or) genes, including pseudogenes, were identified in 11 Drosophila species and named after their orthologs in Drosophila melanogaster. The 813 Or genes, including 61 from D. melanogaster, were classified into 59 orthologous groups that are well supported by gene phylogeny. By reconciling with the gene family phylogeny, we…
Evolutionary and systems biology increasingly rely on the construction of large phylogenetic trees, which represent the relationships between species of interest. As the number and size of such trees increase, so does the need for efficient data storage and query capabilities. Although much attention has been focused on XML as a tree data model,…
Given a drug under development, what are other drugs or biochemical compounds that it might interact with? Early answers to this question, by mining the literature, are valuable for pharmaceutical companies, both monetarily and in avoiding public relations nightmares. Inferring drug-drug interactions is also important in designing combination therapies for…
We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the…
The HuPrime® human gastric neuroendocrine carcinoma derived xenograft model GA0087 was established in this study. The GA0087 PDX model showed high gene expression of vascular endothelial growth factors (VEGF)-A and B, and a high potential for lung metastasis. Circulating tumor cells (CTCs) of either large or small size, circulating tumor microemboli (CTM) and…
Quetiapine (Que), a commonly used atypical antipsychotic drug (APD), can prevent myelin breakdown in the absence of immune attack. Multiple sclerosis (MS), an autoimmune inflammatory demyelinating disease, is triggered by activated myelin-specific T lymphocytes (T cells). In this study, we investigated the potential efficacy of Que as an…