Learn More
In recent years improvements to existing programs and the introduction of new iterative algorithms have changed the state-of-the-art in protein sequence alignment. This paper presents the first systematic study of the most commonly used alignment programs using BAliBASE benchmark alignments as test cases. Even below the 'twilight zone' at 10-20% residue(More)
Multiple sequence alignment is one of the cornerstones of modern molecular biology. It is used to identify conserved motifs, to determine protein domains, in 2D/3D structure prediction by homology and in evolutionary studies. Recently, high-throughput technologies such as genome sequencing and structural proteomics have lead to an explosion in the amount of(More)
Four consensus sequences are conserved with the same linear arrangement in RNA-dependent DNA polymerases encoded by retroid elements and in RNA-dependent RNA polymerases encoded by plus-, minus- and double-strand RNA viruses. One of these motifs corresponds to the YGDD span previously described by Kamer and Argos (1984). These consensus sequences altogether(More)
BAliBASE is specifically designed to serve as an evaluation resource to address all the problems encountered when aligning complete sequences. The database contains high quality, manually constructed multiple sequence alignments together with detailed annotations. The alignments are all based on three-dimensional structural superpositions, with the(More)
The large (L) protein subunit of unsegmented negative-strand RNA virus polymerases is thought to be responsible for the majority of enzymic activities involved in viral transcription and replication. In order to gain insight into this multifunctional role we compared the deduced amino acid sequences of five L proteins of rhabdoviruses (vesicular stomatitis(More)
The aminoacyl-transfer RNA synthetases (aaRS) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology. Out of the 18 known aaRS, only 9 referred to as class I synthetases (GlnRS, TyrRS, MetRS, GluRS,(More)
The sequence of Rift Valley fever virus L segment that we published in a previous paper was erroneous in the 3'-terminal region of the antigenomic RNA molecule. Here, we have shown that the L segment is in fact 6404 nucleotides long and encodes a polypeptide of 237.7K in the viral complementary sense. Sequence comparisons performed between the RNA-dependent(More)
Age-related macular degeneration (AMD) is a common cause of blindness in older individuals. To accelerate the understanding of AMD biology and help design new therapies, we executed a collaborative genome-wide association study, including >17,100 advanced AMD cases and >60,000 controls of European and Asian ancestry. We identified 19 loci associated at P <(More)
A comprehensive investigation of ribosomal genes in complete genomes from 66 different species allows us to address the distribution of r-proteins between and within the three primary domains. Thirty-four r-protein families are represented in all domains but 33 families are specific to Archaea and Eucarya, providing evidence for specialisation at an early(More)