PhyloPat: an updated version of the phylogenetic pattern database contains gene neighborhood

Authors: Tim Hulsen, Peter M. A. Groenen, Jacob de Vlieg, Wynand Alkema
Journal: Nucleic Acids Research
Pages: D731-D737
Phylogenetic patterns show the presence or absence of certain genes in a set of full genomes derived from different species. They can also be used to determine sets of genes that occur only in certain evolutionary branches. Previously, we presented a database named PhyloPat which allows the complete Ensembl gene database to be queried using phylogenetic patterns. Here, we describe an updated version of PhyloPat which can be queried by an improved web server. We used a single linkage clustering…
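To make the idea of a phylogenetic pattern query concrete, here is a minimal sketch in Python. It assumes a simple string encoding ("1" = present, "0" = absent, "*" = either) and uses invented species and gene-family names; it is an illustration of the concept, not the PhyloPat implementation or its actual query syntax.

```python
# Hypothetical toy data: the species order and the presence/absence
# strings below are invented for illustration only.
SPECIES = ["human", "mouse", "chicken", "zebrafish", "fruit_fly"]

# One character per species, in SPECIES order: "1" = present, "0" = absent.
PROFILES = {
    "family_A": "11111",
    "family_B": "11000",
    "family_C": "11100",
    "family_D": "00001",
}


def matches(profile: str, pattern: str) -> bool:
    """Return True if a presence/absence profile satisfies a pattern.

    The pattern uses "1" (must be present), "0" (must be absent) and
    "*" (either); this encoding is an assumption made for the sketch.
    """
    if len(profile) != len(pattern):
        raise ValueError("profile and pattern must cover the same species")
    return all(q == "*" or q == p for p, q in zip(profile, pattern))


def query(profiles: dict[str, str], pattern: str) -> list[str]:
    """Return the sorted names of all gene families matching the pattern."""
    return sorted(name for name, prof in profiles.items() if matches(prof, pattern))


# Families found in human and mouse but in none of the other species.
print(query(PROFILES, "11000"))
# Families found in human and mouse, absent from fruit fly, other species free.
print(query(PROFILES, "11**0"))
```

A pattern such as "11000" selects genes restricted to one evolutionary branch (here, the two mammals in the toy species list), which is the kind of branch-specific query the abstract describes.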
ProtPhylo: identification of protein–phenotype and protein–protein functional associations via phylogenetic profiling
ProtPhylo infers functional associations by comparing protein phylogenetic profiles for more than 9.7 million non-redundant protein sequences from all three domains of life by ranking phylogenetic neighbors of query proteins or phenotypic properties using the Hamming distance.
ProteinHistorian: Tools for the Comparative Analysis of Eukaryote Protein Origin
ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships, and demonstrates that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression.
PhyloPro: a web-based tool for the generation and visualization of phylogenetic profiles across Eukarya
A new web tool, PhyloPro, is introduced, which uses the 120 available eukaryotic genome sequences to visualize the evolutionary trajectories of user-defined subsets of model organism genes and provides a valuable resource for the evolutionary and comparative studies of biological systems.
What does the Allen Gene Expression Atlas tell us about mouse brain evolution
It is argued that the conclusions one can draw on the evolution of twelve major brain regions from such a molecular-level analysis supplement existing knowledge of mouse brain evolution and introduce new quantitative tools, especially for comparative studies, when AGEA-like data sets for other species become available.
Similarly Strong Purifying Selection Acts on Human Disease Genes of All Evolutionary Ages
The age of emergence and propensity for gene loss (PGL) of all human protein-coding genes are determined, and disease genes are compared with non-disease genes in terms of their evolutionary rate, strength of purifying selection, mRNA expression, and genetic redundancy.
Properties of human disease genes and the role of genes linked to Mendelian disorders in complex disease aetiology
It is shown that, relative to non‐disease genes, human disease genes have specific evolutionary profiles and protein network properties and that adaptive selection could also contribute to shape their genetic architecture.
A multi-objective evolutionary approach to predict Protein-Protein Interaction network
The proposed technique outperforms existing methods, including gene-ontology based Relative Specific Similarity, Fuzzy SVM, phylogenetic profile and evolutionary/swarm algorithm based approaches, with respect to sensitivity, specificity and F1 score.
Widespread establishment and regulatory impact of Alu exons in human genes
The Alu element has been a major source of new exons during primate evolution. Thousands of human genes contain spliced exons derived from Alu elements. However, identifying Alu exons that have…
An endogenous protein inhibitor, YjhX (TopAI), for topoisomerase I from Escherichia coli
TopAI is the first endogenous protein inhibitor specific for topoisomerase I, found in Escherichia coli named yjhX-yjhQ, and renamed as the gene for the TopA inhibitor (the topAI gene).
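
The Hamming-distance ranking of phylogenetic neighbors mentioned in the ProtPhylo entry above can be sketched as follows. This is a minimal illustration with invented protein names and binary profiles; it is not ProtPhylo's code, and the tie-breaking by name is an assumption of the sketch.

```python
def hamming(a: str, b: str) -> int:
    """Count the positions at which two equal-length profiles differ."""
    if len(a) != len(b):
        raise ValueError("profiles must have the same length")
    return sum(x != y for x, y in zip(a, b))


def rank_neighbors(query_name: str, profiles: dict[str, str]) -> list[str]:
    """Rank all other proteins by Hamming distance to the query profile.

    Ties are broken alphabetically by name (an arbitrary choice here).
    """
    query_profile = profiles[query_name]
    scored = [
        (hamming(query_profile, profile), name)
        for name, profile in profiles.items()
        if name != query_name
    ]
    return [name for _, name in sorted(scored)]


# Hypothetical presence/absence profiles across six genomes.
TOY_PROFILES = {
    "P1": "110110",
    "P2": "110100",
    "P3": "001001",
    "P4": "100101",
}

# Closest phylogenetic neighbors of P1 first.
print(rank_neighbors("P1", TOY_PROFILES))
```

Proteins with a small Hamming distance share nearly the same presence/absence pattern across genomes, which is the signal phylogenetic profiling uses to suggest a functional association.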


PhyloPat: phylogenetic pattern analysis of eukaryotic genes
PhyloPat, the first tool to combine complete genome information with phylogenetic pattern querying, is presented; it allows the complete Ensembl gene database to be queried using phylogenetic patterns.
A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database
The PhIGs website contains tools that allow the study of genes within their phylogenetic framework through keyword searches on annotations, and sequence similarity searches by BLAST and HMM, and the website also allows users to view the relative physical positions of homologous genes in specified sets of genomes.
Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases
An algorithm to infer speciation and duplication events by comparison of gene and species trees (tree reconciliation) and a general method to search in databases the gene families for which the tree topology matches a peculiar tree pattern.
OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups
The OrthoMCL-DB provides a centralized warehouse for orthology prediction among multiple species, and will be updated and expanded as additional genome sequence data become available.
SHOT: a web server for the construction of genome phylogenies.
SHOT is a web server for the classification of genomes on the basis of shared gene content or the conservation of gene order that reflects the dominant, phylogenetic signal in these genomic properties.
Using the COG Database to Improve Gene Recognition in Complete Genomes
The use of phylogenetic patterns is presented as a means to perform targeted searches for undetected protein-coding genes in complete genomes.
Correlation between sequence conservation and the genomic context after gene duplication
This work analyzes orthologs between pairs of genomes where in one genome the orthologous gene has duplicated after the speciation of the two genomes (i.e. inparalogs) to predict the most probable functional equivalent ortholog in the presence of inparalogs.
EPPS: mining the COG database by an extended phylogenetic patterns search
EPPS has the advantage of detecting COGs even if organisms defined to be included are absent from, or organisms defined to be excluded are present in, the output COGs.
Entrez Gene: gene-centered information at NCBI
Entrez Gene is a step forward from NCBI's LocusLink, with both a major increase in taxonomic scope and improved access through the many tools associated with NCBI Entrez.
Ensembl 2008
Major additions and improvements to Ensembl since the previous report include extensive support for functional genomics data in the form of a specialized functional genomics database, genome-wide maps of protein–DNA interactions and the Ensembl regulatory build; support for customization of the Ensembl web interface through the addition of user accounts and user groups; and increased support for genome resequencing.