The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned ...
Almost all protein database search methods use amino acid substitution matrices for scoring, optimizing, and assessing the statistical significance of sequence alignments. Much care and effort has therefore gone into constructing substitution matrices, and the quality of search results can depend strongly upon the choice of the proper matrix. A ...
MOTIVATION: A tool that simultaneously aligns multiple protein sequences, automatically utilizes information about protein domains, and has a good compromise between speed and accuracy will have practical advantages over current tools. RESULTS: We describe COBALT, a constraint-based alignment tool that implements a general framework for multiple alignment ...
MOTIVATION: The BLAST software package for sequence comparison speeds up homology search by preprocessing a query sequence into a lookup table. Numerous research studies have suggested that preprocessing the database instead would give better performance. However, production usage of sequence comparison methods that preprocess the database has been limited ...
The nemaline myopathies are characterized by weakness and eosinophilic, rodlike (nemaline) inclusions in muscle fibers. Amish nemaline myopathy is a form of nemaline myopathy common among the Old Order Amish. In the first months of life, affected infants have tremors with hypotonia and mild contractures of the shoulders and hips. Progressive worsening of ...
Hirschsprung disease (HSCR) is a multigenic neurocristopathy clinically recognized by aganglionosis of the distal gastrointestinal tract. Patients presenting with aganglionosis in association with hypopigmentation are classified as Waardenburg syndrome type 4 (Waardenburg-Shah, WS4). Variability in the disease phenotype of WS4 patients with equivalent ...
BLAST is a commonly used software package for comparing a query sequence to a database of known sequences; in this study, we focus on protein sequences. Position-specific-iterated BLAST (PSI-BLAST) iteratively searches a protein sequence database, using the matches in round i to construct a position-specific score matrix (PSSM) for searching the database in ...
We describe the construction of a high-resolution radiation hybrid (RH) map of the domestic cat genome, which includes 2662 markers, translating to an estimated average intermarker distance of 939 kilobases (kb). Targeted marker selection utilized the recent feline 1.9x genome assembly, concentrating on regions of low marker density on feline autosomes and ...
We report construction of second-generation integrated genetic linkage and radiation hybrid (RH) maps in the domestic cat (Felis catus) that exhibit a high level of marker concordance and provide near-full genome coverage. A total of 864 markers, including 585 coding loci (type I markers) and 279 polymorphic microsatellite loci (type II markers), are now ...
We consider the problem of fitting an $n \times n$ distance matrix $D$ by a tree metric $T$. Let $\varepsilon$ be the distance to the closest tree metric under the $L_\infty$ norm, that is, $\varepsilon = \min_T \{ \| T - D \|_\infty \}$. First we present an $O(n^2)$ algorithm for finding a tree metric $T$ such that $\| T - D \|_\infty \le 3\varepsilon$. Second we show that it is NP-hard to find a tree metric $T$ such that $\| T - D \|_\infty < \frac{9}{8}\varepsilon$. ...