Learn More
Singleton-Merten syndrome (SMS) is an autosomal-dominant multi-system disorder characterized by dental dysplasia, aortic calcification, skeletal abnormalities, glaucoma, psoriasis, and other conditions. Despite an apparent autosomal-dominant pattern of inheritance, the genetic background of SMS and information about its phenotypic heterogeneity remain(More)
Both multiple sequence alignment and phylogenetic analysis are problematic in the "twilight zone" of sequence similarity (≤ 25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT,(More)
Hepatitis C virus (HCV) nonstructural protein 4B (NS4B) is an integral membrane protein, which plays an important role in the organization and function of the HCV replication complex (RC). Although much is understood about its amphipathic N-terminal and C-terminal domains, we know very little about the role of the transmembrane domains (TMDs) in NS4B(More)
In maintaining Digital Libraries, having bibliographic data up-to-date is critical, yet often minor irregularities may cause information isolation. Unlike documents for which various kinds of unique ID systems exist (e.g., DOI, ISBN), other bibliographic entities such as author and publication venue do not have unique IDs. Therefore, in current Digital(More)
Inferring evolutionary relationships among highly divergent protein sequences is a daunting task. In particular, when pairwise sequence alignments between protein sequences fall <25% identity, the phylogenetic relationships among sequences cannot be estimated with statistical certainty. Here, we show that phylogenetic profiles generated with the Gestalt(More)
As the usage of <i>Web Services</i> proliferates dramatically, new tools to help quickly generate web services are needed. In this paper, we propose a methodology that helps to <i>automatically</i> generate Web Services from the FORM-based query interfaces of a web site. Since the majority of web data are rather "hidden" behind such a FORM interface, we(More)
The recA/RAD51 gene family encodes a diverse set of recombinase proteins that affect homologous recombination, DNA-repair, and genome stability. The recA gene family is expressed across all three domains of life - Eubacteria, Archaea, and Eukaryotes - and even in some viruses. To date, efforts to resolve the deep evolutionary origins of this ancient protein(More)
One of the major challenges in the genomic era is annotating structure/function to the vast quantities of sequence information now available. Indeed, most of the protein sequence database lacks comprehensive annotation, even when experimental evidence exists. Further, within structurally resolved and functionally annotated protein domains, additional(More)
The sequence of amino acids in a protein is believed to determine its native state structure, which in turn is related to the functionality of the protein. In addition, information pertaining to evolutionary relationships is contained in homologous sequences. One powerful method for inferring these sequence attributes is through comparison of a query(More)