PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood

Tim Hulsen, Peter M. A. Groenen, Jacob de Vlieg, Wynand Alkema
Nucleic Acids Research, pages D731-D737
Phylogenetic patterns show the presence or absence of certain genes in a set of full genomes derived from different species. They can also be used to determine sets of genes that occur only in certain evolutionary branches. Previously, we presented a database named PhyloPat which allows the complete Ensembl gene database to be queried using phylogenetic patterns. Here, we describe an updated version of PhyloPat which can be queried by an improved web server. We used a single linkage clustering… 
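As a rough illustration of what querying by phylogenetic pattern means, the sketch below encodes each gene cluster as a string of presence (`1`) and absence (`0`) flags, one per species, and matches it against a query pattern in which `*` means "either". The species list, cluster names, profile encoding, and the helpers `matches` and `query` are all hypothetical and chosen only for illustration; this is not PhyloPat's actual implementation or query syntax.

```python
# Hypothetical species order; each profile string has one character per species.
SPECIES = ["yeast", "fly", "mouse", "human"]

# Hypothetical presence ('1') / absence ('0') profiles for four gene clusters.
PROFILES = {
    "cluster_A": "1111",  # present in all four species
    "cluster_B": "0011",  # present only in mouse and human
    "cluster_C": "0001",  # present only in human
    "cluster_D": "1100",  # present only in yeast and fly
}


def matches(pattern, profile):
    """Return True if a profile satisfies a pattern ('1', '0', or '*' for either)."""
    if len(pattern) != len(profile):
        raise ValueError("pattern and profile must cover the same species")
    return all(p == "*" or p == c for p, c in zip(pattern, profile))


def query(profiles, pattern):
    """Return the sorted names of all clusters whose profile matches the pattern."""
    return sorted(name for name, prof in profiles.items() if matches(pattern, prof))


# Clusters restricted to one branch (here: mouse and human only).
mammal_only = query(PROFILES, "0011")

# Clusters present in human, regardless of the other species.
present_in_human = query(PROFILES, "***1")

print(mammal_only)
print(present_in_human)
```

A fixed pattern such as `0011` selects clusters confined to one evolutionary branch, while wildcards relax the constraint for species that are not of interest.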

ProtPhylo: identification of protein–phenotype and protein–protein functional associations via phylogenetic profiling

ProtPhylo infers functional associations by comparing protein phylogenetic profiles for more than 9.7 million non-redundant protein sequences from all three domains of life, ranking the phylogenetic neighbors of query proteins or phenotypic properties by Hamming distance.
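Ranking profile neighbors by Hamming distance can be sketched as follows. This is a minimal example under stated assumptions: profiles are binary presence/absence strings of equal length, and the protein names, profiles, and helper functions `hamming` and `rank_neighbors` are hypothetical. It is not ProtPhylo's code.

```python
def hamming(a, b):
    """Count the positions at which two equal-length profiles differ."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    return sum(x != y for x, y in zip(a, b))


def rank_neighbors(query_profile, profiles):
    """Return (name, distance) pairs sorted by distance, then by name."""
    scored = [(name, hamming(query_profile, prof)) for name, prof in profiles.items()]
    return sorted(scored, key=lambda item: (item[1], item[0]))


# Hypothetical presence/absence profiles across six genomes.
PROFILES = {
    "protA": "110110",
    "protB": "110100",
    "protC": "001001",
    "protD": "110111",
}

ranking = rank_neighbors("110110", PROFILES)
for name, distance in ranking:
    print(name, distance)
```

Proteins with a small distance to the query share nearly the same presence/absence pattern across genomes, which is the signal that phylogenetic profiling treats as evidence of a functional association.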

ProteinHistorian: Tools for the Comparative Analysis of Eukaryote Protein Origin

ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships, and demonstrates that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression.

PhyloPro: a web-based tool for the generation and visualization of phylogenetic profiles across Eukarya

A new web tool, PhyloPro, is introduced, which uses the 120 available eukaryotic genome sequences to visualize the evolutionary trajectories of user-defined subsets of model organism genes and provides a valuable resource for evolutionary and comparative studies of biological systems.

What does the Allen Gene Expression Atlas tell us about mouse brain evolution

It is argued that the conclusions one can draw on the evolution of twelve major brain regions from such a molecular-level analysis supplement existing knowledge of mouse brain evolution and introduce new quantitative tools, especially for comparative studies, when AGEA-like data sets for other species become available.

Similarly Strong Purifying Selection Acts on Human Disease Genes of All Evolutionary Ages

The age of emergence and propensity for gene loss (PGL) of all human protein-coding genes are determined, and disease genes are compared with non-disease genes in terms of their evolutionary rate, strength of purifying selection, mRNA expression, and genetic redundancy.

Properties of human disease genes and the role of genes linked to Mendelian disorders in complex disease aetiology

It is shown that, relative to non-disease genes, human disease genes have specific evolutionary profiles and protein network properties, and that adaptive selection could also contribute to shaping their genetic architecture.

A multi-objective evolutionary approach to predict Protein-Protein Interaction network

The proposed technique outperforms existing methods, including gene-ontology based Relative Specific Similarity, Fuzzy SVM, phylogenetic profile and evolutionary/swarm algorithm based approaches, with respect to sensitivity, specificity and F1 score.

Widespread establishment and regulatory impact of Alu exons in human genes

The Alu element has been a major source of new exons during primate evolution. Thousands of human genes contain spliced exons derived from Alu elements. However, identifying Alu exons that have

An endogenous protein inhibitor, YjhX (TopAI), for topoisomerase I from Escherichia coli

YjhX (TopAI), part of a system in Escherichia coli named yjhX-yjhQ, is the first endogenous protein inhibitor specific for topoisomerase I; the yjhX gene is renamed the gene for the TopA inhibitor (the topAI gene).



A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database

The PhIGs website contains tools that allow the study of genes within their phylogenetic framework through keyword searches on annotations, and sequence similarity searches by BLAST and HMM, and the website also allows users to view the relative physical positions of homologous genes in specified sets of genomes.

Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases

Presents an algorithm to infer speciation and duplication events by comparing gene and species trees (tree reconciliation), and a general method to search databases for gene families whose tree topology matches a particular tree pattern.

OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups

The OrthoMCL-DB provides a centralized warehouse for orthology prediction among multiple species, and will be updated and expanded as additional genome sequence data become available.

SHOT: a web server for the construction of genome phylogenies.

Using the COG Database to Improve Gene Recognition in Complete Genomes

The use of phylogenetic patterns is presented as a means to perform targeted searches for undetected protein-coding genes in complete genomes.

Correlation between sequence conservation and the genomic context after gene duplication

This work analyzes orthologs between pairs of genomes where, in one genome, the orthologous gene has duplicated after the speciation of the two genomes (i.e. inparalogs), to predict the most probable functionally equivalent ortholog in the presence of inparalogs.

EPPS: mining the COG database by an extended phylogenetic patterns search

EPPS has the advantage of detecting COGs even if organisms defined to be included are absent from, or organisms defined to be excluded are present in, the output COGs.

Entrez Gene: gene-centered information at NCBI

Entrez Gene is a step forward from NCBI's LocusLink, with both a major increase in taxonomic scope and improved access through the many tools associated with NCBI Entrez.

Ensembl 2008

Major additions and improvements to Ensembl since the previous report include extensive support for functional genomics data in the form of a specialized functional genomics database, genome-wide maps of protein–DNA interactions and the Ensembl regulatory build; support for customization of the Ensembl web interface through the addition of user accounts and user groups; and increased support for genome resequencing.

TreeFam: 2008 Update

Release 4.0 of TreeFam contains curated trees for 1314 families and automatically generated trees for another 14 351 families, and introduces more accurate approaches for automatically grouping genes into families, for building phylogenetic trees, and for inferring orthologues and paralogues.