MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment

@article{Kumar2004MEGA3IS,
  title={MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment},
  author={Sudhir Kumar and Koichiro Tamura and Masatoshi Nei},
  journal={Briefings in Bioinformatics},
  year={2004},
  volume={5},
  number={2},
  pages={150--163}
}
With its theoretical basis firmly established in molecular evolutionary and population genetics, the comparative DNA and protein sequence analysis plays a central role in reconstructing the evolutionary histories of species and multigene families, estimating rates of molecular evolution, and inferring the nature and extent of selective forces shaping the evolution of genes and genomes. The scope of these investigations has now expanded greatly owing to the development of high-throughput… 

MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.
TLDR
Version 4 of MEGA software expands on the existing facilities for editing DNA sequence data from autosequencers, mining Web-databases, performing automatic and manual sequence alignment, analyzing sequence alignments to estimate evolutionary distances, inferring phylogenetic trees, and testing evolutionary hypotheses.
MEGA: A biologist-centric software for evolutionary analysis of DNA and protein sequences
TLDR
The motivation, design principles and priorities that have shaped the development of MEGA are discussed, along with how MEGA might evolve in the future to assist researchers in their growing need to analyze large data sets using new computational methods.
MEGA11: Molecular Evolutionary Genetics Analysis Version 11
TLDR
New additions are described that make MEGA a more comprehensive tool for building timetrees of species, pathogens, and gene families using rapid relaxed-clock methods and a Bayesian method for estimating neutral evolutionary probabilities of alleles in a species using multispecies sequence alignments.
New methods for inferring population dynamics from microbial sequences.
PAML 4: phylogenetic analysis by maximum likelihood.
TLDR
PAML, currently in version 4, is a package of programs for phylogenetic analyses of DNA and protein sequences using maximum likelihood (ML), which can be used to estimate parameters in models of sequence evolution and to test interesting biological hypotheses.
Comparative and evolutionary genomics of globin genes in fish.
Bioinformatics software for biologists in the genomics era
TLDR
It is argued that design principles need to be sensitive to the reality that comparatively small teams of biologists have historically developed some of the most popular software packages in molecular evolutionary analysis, and that biological intuitiveness and investigator empowerment need to take precedence over the current supposition that biologists should re-tool and become programmers when analyzing genome-scale datasets.
Tracing evolutionary pressure
TLDR
A novel method, TEA-O (Two-entropies analysis-Objective), to trace the evolutionary pressure from the root to the branches of the phylogenetic tree is presented and allows an unbiased, user-independent, analysis of residue relevance in a protein family.
MultiSeq: unifying sequence and structure data for evolutionary analysis
TLDR
MultiSeq is a major extension of the Multiple Alignment tool provided as part of VMD, a structural visualization program for analyzing molecular dynamics simulations, and contains a new algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of a homologous group of distantly related proteins.

References

MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers
TLDR
In this program, various methods for estimating evolutionary distances from nucleotide and amino acid sequence data, three different methods of phylogenetic inference (UPGMA, neighbor-joining and maximum parsimony) and two statistical tests of topological differences are included.
MEGA2: molecular evolutionary genetics analysis software
TLDR
MEGA2 vastly extends the capabilities of MEGA version 1 by facilitating analyses of large datasets, enabling creation and analyses of groups of sequences, and expanding the repertoire of statistical methods for molecular evolutionary studies.
PAML: a program package for phylogenetic analysis by maximum likelihood
TLDR
The strength of PAML, in comparison with other phylogenetic packages currently available, is its implementation of a variety of evolutionary models, which include several models of variable evolutionary rates among sites, models for combined analyses of multiple gene sequence data and models for amino acid sequences.
Reconstructing evolutionary trees from DNA and protein sequences: paralinear distances.
J. Lake. Biology. Proceedings of the National Academy of Sciences of the United States of America, 1994.
TLDR
The "origin of the eukaryotes" has been investigated by the analysis of elongation factor Tu sequences with a variety of sequence alignments, and it has been found that the order in which sequences are pairwise aligned strongly determines the topology which is reconstructed by paralinear distances.
Disparity index: a simple statistic to measure and test the homogeneity of substitution patterns between molecular sequences.
TLDR
A disparity index, ID, is proposed, which measures the observed difference in evolutionary patterns for a pair of sequences, and a Monte Carlo procedure is developed to test the homogeneity of the observed patterns.
Evolutionary distance estimation under heterogeneous substitution pattern among lineages.
TLDR
This work presents a simple modification for existing distance estimation methods to relax the assumption of the substitution pattern homogeneity among lineages when analyzing DNA and protein sequences and shows that the modified method performs much better than the LogDet methods, which do not require the homogeneity assumption in estimating the number of substitutions per site.
Positive Darwinian selection after gene duplication in primate ribonuclease genes.
TLDR
It was found that the number of arginine residues increased substantially in a short period of evolutionary time after gene duplication, and these amino acid changes probably produced the novel anti-pathogen function of ECP.
Genome-scale approaches to resolving incongruence in molecular phylogenies
TLDR
The results suggest that data sets consisting of single or a small number of concatenated genes have a significant probability of supporting conflicting topologies, and have important implications for resolving branches of the tree of life.
Recovering evolutionary trees under a more realistic model of sequence evolution.
TLDR
The overall conclusion from this study is that irregular A, C, G, T compositions are an important and possibly general cause of patterns that can mislead tree-reconstruction methods, even when high bootstrap values are obtained.
An Applications-focused Review of Comparative Genomics Tools: Capabilities, Limitations and Future Challenges
TLDR
A team at the Lawrence Livermore National Laboratory was given the task of using computational tools to speed up the development of DNA diagnostics for pathogen detection, and it was necessary to understand the merits and limitations of the various available comparative genomics tools.