T-Coffee: A novel method for fast and accurate multiple sequence alignment.

@article{Notredame2000TCoffeeAN,
  title={T-Coffee: A novel method for fast and accurate multiple sequence alignment.},
  author={C. Notredame and Desmond G. Higgins and Jaap Heringa},
  journal={Journal of molecular biology},
  year={2000},
  volume={302 1},
  pages={
          205-17
        }
}
We describe a new method (T-Coffee) for multiple sequence alignment that provides a dramatic improvement in accuracy with a modest sacrifice in speed as compared to the most commonly used alternatives. [] Key Result The resulting alignments are significantly more reliable, as determined by comparison with a set of 141 test cases, than any of the popular alternatives that we tried. The improvement, especially clear with the more difficult test cases, is always visible, regardless of the phylogenetic spread of…

Figures and Tables from this paper

ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach

The experimental results suggest that the proposed meta-aligner approach may often lead to statistically significant more accurate alignments, and it is shown that ReformAlign results in more substantial improvement in cases where the starting alignment is of relatively inferior quality or when the input sequences are harder to align.

Integrated multiple sequence alignment

A technique is established that can accurately align sequences containing eventually repeated motifs that can compare tandem repeat sequences by aligning them with respect to their possible repeat histories.

T-Coffee: Tree-based consistency objective function for alignment evaluation.

This chapter presents how the T-Coffee package can be used in its command line mode to carry out the most common tasks and multiply align proteins, DNA, and RNA sequences.

Grammar-based distance in progressive multiple sequence alignment

The proposed multiple sequence alignment algorithm has successfully built multiple alignments comparable to other programs with significant improvements in running time and is especially striking for large datasets.

M-Coffee: combining multiple sequence alignment methods with T-Coffee

M-Coffee is a meta-method for assembling multiple sequence alignments (MSA) by combining the output of several individual methods into one single MSA that is robust to variations in the choice of constituent methods and reasonably tolerant to duplicate MSAs.

DIALIGN-T: An improved algorithm for segment-based multiple sequence alignment

A complete re-implementation of the segment-based approach to multiple protein alignment that contains a number of improvements compared to the previous version 2.2 of DIALIGN and is comparable to the standard global aligner CLUSTAL W, though it is outperformed by some newly developed programs that focus on global alignment.

R-Coffee: a method for multiple alignment of non-coding RNA

R-Coffee is used to compute multiple sequence alignments combining the pairwise output of sequence aligners and structural aligners, and it is suggested that the best protocol for aligning short sequences (less than 200 nt) is the combination of R-Cffee with the RNA pairwise structural aligner Consan.

A min-cut algorithm for the consistency problem in multiple sequence alignment

This work proposes a graph-theoretical approach to find local multiple sequence similarities that consistently outperforms the standard version of DIALIGN where local pairwise alignments are greedily incorporated into a multiple alignment.

IMAP : design and implementation of an interactive and integrative programming environment for progressive multiple sequence alignment and phylogeny reconstruction

A ne~ method (!MAP) for popular tree-based Progressive Multiple Sequence Alignment algorithms that improves the quality of the alignments particularly in Twilight Zone when the sequence similarity is very low and disintegrated the hard coded dependencies between progressive alignment and tree construction.

Align-m-a new algorithm for multiple alignment of highly divergent sequences

Align-m is a new program that uses a non-progressive local approach to guide a global alignment of highly divergent sequences and has comparable or slightly higher accuracy in terms of correctly aligned residues, especially for distantly related sequences.
...

References

SHOWING 1-10 OF 37 REFERENCES

COFFEE: an objective function for multiple sequence alignments

It is shown that multiple sequence alignments can be optimized for their COFFEE score with the genetic algorithm package SAGA and given a library of structure-based pairwise alignments extracted from FSSP, SAG a can produce high-quality multiple sequencealignments.

Significant improvement in accuracy of multiple protein sequence alignments by iterative refinement as assessed by reference to structural alignments.

  • O. Gotoh
  • Biology
    Journal of molecular biology
  • 1996
The improvement in accuracy of alignments obtained by these iterative methods over pairwise or progressive method tends to increase with decreasing average sequence identity, implying that iterative refinement is more effective for the generally difficult alignment of remotely related sequences.

Combining many multiple alignments in one improved alignment

A method that extracts qualitatively good sub-alignments from a set of multiple alignments and combines these into a new, often improved alignment, implemented as a variant of the traditional dynamic programming technique.

DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment

A modification of the weight function used in the original version of the alignment program DIALIGN has two important advantages: it can be applied to both globally and locally related sequence sets, and the running time of the program is considerably improved.

A tool for multiple sequence alignment.

The design and application of a tool for multiple alignment of amino acid sequences that implements a new algorithm that greatly reduces the computational demands of dynamic programming is described.

CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.

The multiple sequence alignment problem in biology

It is proved here that knowledge of the measure of an arbitrarily chosen alignment can be used in combination with information from the pairwise alignments to considerably restrict the size of the region of the lattice in consideration.

A branch-and-cut algorithm for multiple sequence alignment

This work considers a branch-and-cut approach for solving the multiple sequence alignment problem and shows that this method outperforms the best tools developed so far, in that it produces alignments that are better from a biological point of view.