Kyle Williams

SPOR domains are approximately 70 amino acids long and occur in >1,500 proteins identified by sequencing of bacterial genomes. The SPOR domains in the FtsN cell division proteins from Escherichia coli and Caulobacter crescentus have been shown to bind peptidoglycan. Besides FtsN, E. coli has three additional SPOR domain proteins: DamX, DedD, and RlpA. We ...
We introduce a big data platform that provides various services for harvesting scholarly information and enabling efficient scholarly applications. The core architecture of the platform is built on a secured private cloud, crawls data using a scholarly focused crawler that leverages a dynamic scheduler, processes by utilizing a map reduce based ...
The detection and potential removal of duplicates is desirable for a number of reasons, such as to reduce the need for unnecessary storage and computation, and to provide users with uncluttered search results. This paper describes an investigation into the application of scalable, state-of-the-art simhash and shingle duplicate detection algorithms for ...
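As a rough illustration of the two techniques named in the abstract above, the following Python sketch shows word shingling with Jaccard similarity and a basic simhash fingerprint compared by Hamming distance. It is not the paper's implementation: the function names, the 3-word shingle size, and the MD5-derived 64-bit feature hash are all assumptions made for this example.

```python
import hashlib
import re


def shingles(text, k=3):
    """Return the set of k-word shingles (contiguous word windows) in text.

    k=3 is an illustrative choice, not a value taken from the paper.
    """
    words = re.findall(r"\w+", text.lower())
    if len(words) < k:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two shingle sets (1.0 means identical sets)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def simhash(features, bits=64):
    """Compute a simhash fingerprint from an iterable of string features.

    Each feature is hashed (here with MD5, an assumption for the sketch);
    every bit position collects +1 or -1 votes, and the sign of the total
    decides the corresponding fingerprint bit.
    """
    counts = [0] * bits
    for feature in features:
        digest = hashlib.md5(feature.encode("utf-8")).digest()
        h = int.from_bytes(digest[:bits // 8], "big")
        for i in range(bits):
            counts[i] += 1 if (h >> i) & 1 else -1
    fingerprint = 0
    for i, count in enumerate(counts):
        if count > 0:
            fingerprint |= 1 << i
    return fingerprint


def hamming(a, b):
    """Number of bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")


if __name__ == "__main__":
    doc_a = "the quick brown fox jumps over the lazy dog"
    doc_b = "the quick brown fox jumped over the lazy dog"
    sa, sb = shingles(doc_a), shingles(doc_b)
    print("Jaccard similarity:", jaccard(sa, sb))
    print("Hamming distance:", hamming(simhash(sa), simhash(sb)))
```

In a sketch like this, two documents would be flagged as near-duplicates when their Jaccard similarity exceeds a chosen threshold or when their fingerprints differ in only a few bits; suitable thresholds depend on the collection and are not specified here.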
During growth in the environment, bacteria encounter stresses which can delay or inhibit their growth. To defend against these stresses, bacteria induce both resistance and repair mechanisms. Many bacteria regulate these resistance mechanisms using a group of alternative σ factors called extracytoplasmic function (ECF) σ factors. ECF σ factors represent the ...
Web search queries for which there are no clicks are referred to as abandoned queries and are usually considered as leading to user dissatisfaction. However, there are many cases where a user may not click on anything on the search engine result page (SERP) but still be satisfied. This scenario is referred to as good abandonment and presents a challenge for most approaches ...
Clostridium difficile is a clinically important pathogen and the most common cause of hospital-acquired infectious diarrhea. Expression of the C. difficile gene csfV, which encodes σ(V), an extracytoplasmic function σ factor, is induced by lysozyme, which damages the peptidoglycan of bacteria. Here we show that σ(V) is required for lysozyme resistance in C. ...
Deficits in M1 muscarinic receptor system signaling in Alzheimer's disease (AD) prompted an analysis of components of these systems, namely, the G(q/11) protein and the regulator of G-protein signaling (RGS) 4 protein. In AD parietal cortex, total levels of G(q/11) and RGS4 proteins were significantly lower than age-matched control cases by 40% and 53%, ...
We introduce PDFMEF, a multi-entity knowledge extraction framework for scholarly documents in the PDF format. It is implemented with a framework that encapsulates open-source extraction tools. Currently, it leverages PDFBox and TET for full text extraction, the scholarly document filter described in [5] for document classification, GROBID for header ...
CiteSeer is a digital library that contains approximately 3.5 million scholarly documents and receives between 2 and 4 million requests per day. In addition to making documents available via a public website, the data are also used to facilitate research in areas like citation analysis, co-author network analysis, scalability evaluation and information ...