A total of 752 odorant receptor (Or) genes, including pseudogenes, were identified in 11 Drosophila species and named after their orthologs in Drosophila melanogaster. The 813 Or genes, including 61 from D. melanogaster, were classified into 59 orthologous groups that are well supported by gene phylogeny. By reconciling with the gene family phylogeny, we…
Modeling the movement of information within social media outlets, like Twitter, is key to understanding how ideas spread, but quantifying such movement runs into several difficulties. Two specific areas that elude a clear characterization are (i) the intrinsically random nature of individuals' decisions to adopt and subsequently broadcast a Twitter topic,…
Given a movie comment, does it contain a spoiler? A spoiler is a comment that, when disclosed, would ruin a surprise or reveal an important plot detail. We study automatic methods to detect comments and reviews that contain spoilers and apply them to reviews from the IMDB (Internet Movie Database) website. We develop topic models, based on Latent Dirichlet…
VGGNets have turned out to be effective for object recognition in still images. However, directly adapting VGGNet models trained on the ImageNet dataset to scene recognition does not yield good performance. This report describes our implementation of training the VGGNets on the large-scale Places205 dataset. Specifically, we train three…
We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the…
Given a drug under development, what are other drugs or biochemical compounds that it might interact with? Early answers to this question, by mining the literature, are valuable for pharmaceutical companies, both monetarily and in avoiding public relations nightmares. Inferring drug-drug interactions is also important in designing combination therapies for…
Evolutionary and systems biology increasingly rely on the construction of large phylogenetic trees that represent the relationships between species of interest. As the number and size of such trees increase, so does the need for efficient data storage and query capabilities. Although much attention has been focused on XML as a tree data model,…
High-entropy alloys (HEAs) can have either high strength or high ductility, but achieving both simultaneously remains a tough challenge. The inferior castability and compositional segregation of HEAs are also obstacles to their technological application. To tackle these problems, here we propose a novel strategy to design HEAs using the…
User reviews, like those found on Yelp and Amazon, have become an important reference for decision making in daily life, for example, in dining, shopping, and entertainment. However, the large number of available reviews makes the reading process tedious. Existing word cloud visualizations attempt to provide an overview. However, their randomized layouts do not…