Learn More
Singleton-Merten syndrome (SMS) is an autosomal-dominant multi-system disorder characterized by dental dysplasia, aortic calcification, skeletal abnormalities, glaucoma, psoriasis, and other conditions. Despite an apparent autosomal-dominant pattern of inheritance, the genetic background of SMS and information about its phenotypic heterogeneity remain(More)
Both multiple sequence alignment and phylogenetic analysis are problematic in the "twilight zone" of sequence similarity (≤ 25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT,(More)
For the robust practice of genomic medicine, sequencing results must be compatible, regardless of the sequencing technologies and algorithms used. Presently, genome sequencing is still an imprecise science and is complicated by differences in the chemistry, coverage, alignment, and variant-calling algorithms. We identified ~3.33 million single nucleotide(More)
Hepatitis C virus (HCV) nonstructural protein 4B (NS4B) is an integral membrane protein, which plays an important role in the organization and function of the HCV replication complex (RC). Although much is understood about its amphipathic N-terminal and C-terminal domains, we know very little about the role of the transmembrane domains (TMDs) in NS4B(More)
In maintaining Digital Libraries, having bibliographic data up-to-date is critical, yet often minor irregularities may cause information isolation. Unlike documents for which various kinds of unique ID systems exist (e.g., DOI, ISBN), other bibliographic entities such as author and publication venue do not have unique IDs. Therefore, in current Digital(More)
The recA/RAD51 gene family encodes a diverse set of recombinase proteins that affect homologous recombination, DNA-repair, and genome stability. The recA gene family is expressed across all three domains of life - Eubacteria, Archaea, and Eukaryotes - and even in some viruses. To date, efforts to resolve the deep evolutionary origins of this ancient protein(More)
Inferring evolutionary relationships among highly divergent protein sequences is a daunting task. In particular, when pairwise sequence alignments between protein sequences fall <25% identity, the phylogenetic relationships among sequences cannot be estimated with statistical certainty. Here, we show that phylogenetic profiles generated with the Gestalt(More)
As the usage of <i>Web Services</i> proliferates dramatically, new tools to help quickly generate web services are needed. In this paper, we propose a methodology that helps to <i>automatically</i> generate Web Services from the FORM-based query interfaces of a web site. Since the majority of web data are rather "hidden" behind such a FORM interface, we(More)
The sequence of amino acids in a protein is believed to determine its native state structure, which in turn is related to the functionality of the protein. In addition, information pertaining to evolutionary relationships is contained in homologous sequences. One powerful method for inferring these sequence attributes is through comparison of a query(More)
Since modern database applications increasingly need to deal with dirty data due to a variety of reasons (e.g., data entry errors, heterogeneous formats, and ambiguous terms), considerable recent efforts have focused on the (record) linkage problem to determine if two entities represented as relational records are approximately the same or not. In this(More)