New Tools in Orthology Analysis: A Brief Review of Promising Perspectives

Bruno T. L. Nichio, Jeroniza N. Marchaukoski, Roberto Tadeu Raittz
Frontiers in Genetics
Nowadays, defining homology relationships among sequences is essential for biological research. Within homology, the analysis of orthologous sequences is of great importance for computational biology, genome annotation, and phylogenetic inference. Since 2007, with the increase in the number of new sequences being deposited in large biological databases, researchers have begun to analyse computerized methodologies and tools aimed at selecting the most promising ones in the prediction of…

Inferring Ancient Relationships with Genomic Data: A Commentary on Current Practices.
  • C. Laumer
  • Biology
    Integrative and comparative biology
  • 2018
A subset of bioinformatic tools well-suited to the task of inferring ancient relationships with coding sequence data in particular are discussed, encompassing data generation, orthology assignment, alignment and gene tree inference, supermatrix construction, and analysis under the best-fitting models applicable to large-scale datasets.
OrthoVenn2: a web server for whole-genome comparison and annotation of orthologous clusters across multiple species
An updated version of OrthoVenn2 is presented, which provides new features that facilitate the comparative analysis of orthologous clusters among up to 12 species, and offers improvements to data visualization and interpretation.
RAFTS3G – An efficient and versatile clustering software to analyses in large protein datasets
RAFTS3G, a new approach for data mining and homology analysis in large datasets of protein sequences, is presented; it proved to be more than 10 times faster than CD-HIT, and its strategy improves the detection of protein families at low similarity.
GeM-Pro: a tool for genome functional mining and microbial profiling
GeM-Pro and q-GeM perform a synteny analysis to evaluate the integrity and genomic arrangement of specific pathways of interest in order to infer their presence, and were applied to more than 2 million homologous pairs encoded by Bacillus strains, generating statistically supported predictions of trait contents.
Comparative analysis of the insect mobile genetic element repertoire and its influence on genome size dynamics
A standardized characterization and order-level comparison of arthropod TE repertoires, encompassing 62 insect and 11 outgroup species, finds that the TE repertoire correlates with genome size in both abundance and diversity.
Flexible comparative genomics of prokaryotic transcriptional regulatory networks
CGB’s ability to automatically integrate experimental information from multiple sources and use complete and draft genomic data, coupled with its non-reliance on precomputed databases and its easily interpretable display of gene-centered posterior probabilities of regulation, provides users with an unprecedented level of flexibility in launching comparative genomics analyses of prokaryotic transcriptional regulatory networks.
OrthoFinder2: fast and accurate phylogenomic orthology analysis from gene sequences
Ortholog inference has fundamental importance across the biological sciences, underpinning phylogenetics, comparative genomics and prediction of gene function, and OrthoFinder achieves higher ortholog recall than all current methods as assessed by community-standard benchmarks.
OrthoFinder: phylogenetic orthology inference for comparative genomics
This extends OrthoFinder’s high accuracy orthogroup inference to provide phylogenetic inference of orthologs, rooted gene trees, gene duplication events, the rooted species tree, and comparative genomics statistics.
RAFTS3G: an efficient and versatile clustering software to analyses in large protein datasets
Compared to other “gold-standard” methods for clustering large biological datasets, RAFTS3G maintains the balance between reducing the redundancy of biological information and creating consistent groups.


Proteinortho: Detection of (Co-)orthologs in large-scale analysis
Proteinortho significantly reduces the required amount of memory for orthology analysis compared to existing tools, allowing such computations to be performed on off-the-shelf hardware.
Phylogenetic and Functional Assessment of Orthologs Inference Projects and Methods
Accurate genome-wide identification of orthologs is a central problem in comparative genomics, a fact reflected by the numerous orthology identification projects developed in recent years. However,
Inferring Orthologs: Open Questions and Perspectives
Appropriate taxon sampling together with combination of methods based on similarity, phylogeny, synteny, and evolutionary knowledge that may help detecting speciation events appears to be the most accurate strategy.
SPOCS: software for predicting and visualizing orthology/paralogy relationships among genomes
SPOCS implements a graph-based ortholog prediction method to generate a simple tab-delimited table of orthologs and in addition, html files that provide a visualization of the predicted ortholog/paralog relationships to which gene/protein expression metadata may be overlaid.
Computational methods for Gene Orthology inference
Comparisons of tree-based, sequence similarity-based, and synteny-based approaches, which can be combined into flexible hybrid methods, show that, despite conceptual differences, they produce similar sets of orthologs, especially at short evolutionary distances.
A Brief Review of Software Tools for Pangenomics
OrthoInspector: comprehensive orthology analysis and visual exploration
OrthoInspector is a new software system incorporating an original algorithm for the rapid detection of orthology and inparalogy relations between different species and improves detection sensitivity, with a minimal loss of specificity.
morFeus: a web-based program to detect remotely conserved orthologs using symmetrical best hits and orthology network scoring
Background: Searching the orthologs of a given protein or DNA sequence is one of the most important and most commonly used Bioinformatics methods in Biology. Programs like BLAST or the orthology search
PorthoMCL: Parallel orthology prediction using MCL for the realm of massive genome availability
PorthoMCL is a fast and easy to run tool for identifying orthology among any number of genomes with minimal requirements and will facilitate comparative genomics analysis with increasing number of available genomes thanks to the rapidly evolving sequencing technologies.
OrthoMCL: identification of ortholog groups for eukaryotic genomes.
OrthoMCL provides a scalable method for constructing orthologous groups across multiple eukaryotic taxa, using a Markov Cluster algorithm to group (putative) orthologs and paralogs.
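
The OrthoMCL summary above mentions a Markov Cluster algorithm for grouping putative orthologs and paralogs. As a rough illustration of the general idea only (not OrthoMCL's own code or parameters), the sketch below runs the two generic MCL steps, expansion and inflation, on a small similarity graph. The function names, the inflation value of 2.0, the self-loop weight, and the toy graph are all assumptions made for this example.

```python
def normalize_columns(m):
    """Scale each column so it sums to 1 (column-stochastic matrix)."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] if sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def expand(m):
    """Expansion step: square the matrix (two-step random walks)."""
    n = len(m)
    return [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def inflate(m, r):
    """Inflation step: raise entries to the power r, then renormalize columns."""
    return normalize_columns([[x ** r for x in row] for row in m])


def markov_cluster(weights, inflation=2.0, iterations=50, tol=1e-9):
    """Cluster nodes of a symmetric, non-negative weight matrix (illustrative)."""
    n = len(weights)
    m = [[float(weights[i][j]) for j in range(n)] for i in range(n)]
    for i in range(n):
        m[i][i] = max(m[i][i], 1.0)  # assumed self-loop weight of 1.0
    m = normalize_columns(m)
    for _ in range(iterations):
        nxt = inflate(expand(m), inflation)
        delta = max(abs(nxt[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = nxt
        if delta < tol:
            break
    # Each row that still carries flow lists the members of one cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


def toy_graph(bridge):
    """Hypothetical graph: two triangles (0,1,2) and (3,4,5) joined by edge 2-3."""
    w = [[0.0] * 6 for _ in range(6)]
    for a, b in [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5)]:
        w[a][b] = w[b][a] = 1.0
    w[2][3] = w[3][2] = bridge
    return w


print(markov_cluster(toy_graph(bridge=0.1)))
```

In an orthology setting the weights would typically come from pairwise sequence similarity scores between proteins; here they are invented values. Higher inflation values generally produce finer-grained clusters.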