Minimotif miner 2nd release: a database and web system for motif search

@article{Rajasekaran2009MinimotifM2,
  title={Minimotif miner 2nd release: a database and web system for motif search},
  author={Sanguthevar Rajasekaran and Sudha Balla and Patrick Gradie and Michael Robert Gryk and Krishna Kadaveru and Vamsi Kundeti and Mark W. Maciejewski and Tian Mi and Nicholas Rubino and Jay Vyas and Martin R. Schiller},
  journal={Nucleic Acids Research},
  year={2009},
  volume={37},
  pages={D185--D190}
}
Minimotif Miner (MnM) consists of a minimotif database and a web-based application that enables prediction of motif-based functions in user-supplied protein queries. We have revised MnM by expanding the database more than 10-fold to approximately 5000 motifs and standardized the motif function definitions. The web-application user interface has been redeveloped with new features including improved navigation, screencast-driven help, support for alias names and expanded SNP analysis. A sample… 

Minimotif Miner 3.0: database expansion and significantly improved reduction of false-positive predictions from consensus sequences
TLDR
This paper reports the third release of the MnM database, which has grown 60-fold to approximately 300 000 minimotifs, and summarizes a new set of false-positive filters and linear regression scoring that vastly enhance minimotif prediction accuracy on a test data set.
Minimotif Miner 4: a million peptide minimotifs and counting
TLDR
An update to MnM is presented, growing the database from ∼300 000 to >1 000 000 minimotif consensus sequences and instances and revealing a more detailed mechanistic hypothesis concerning the role of ERCC2 in cancer.
A Structure Based Algorithm for Improving Motifs Prediction
TLDR
An algorithm is presented that exploits structural information to reduce false positives in motif prediction; the experimental results show that the parameters learnt by the algorithm can significantly reduce false positives.
SLiMSearch: A Webserver for Finding Novel Occurrences of Short Linear Motifs in Proteins, Incorporating Sequence Context
TLDR
The SLiMSearch webserver is a flexible tool that enables researchers to identify novel occurrences of predefined SLiMs in sets of proteins, and provides user-friendly output and visualizations of motif context to gain insight into the validity of a putatively functional motif occurrence.
HIVToolbox, an Integrated Web Application for Investigating HIV
TLDR
HIVToolbox is built, which integrates much of the knowledge about HIV proteins and allows virologists and structural biologists to access sequence, structure, and functional relationships in an intuitive web application; it shows how data integration facilitates identification of new questions and hypotheses much more rapidly and conveniently than current approaches using isolated repositories.
MimoSA: a system for minimotif annotation
TLDR
MimoSA is an extensible application that facilitates minimotif annotation and integrates with the Minimotif Miner database, combining dynamic abstract scoring with a high-performance relational model of minimotif syntax.
Partitioning of Minimotifs Based on Function with Improved Prediction Accuracy
TLDR
Testing these functional filters on known and random minimotifs has revealed that they are capable of separating true motifs from false positives, and it is strongly suggested that the new filters differentiate true motifs from random background with good confidence.
SLiMSearch 2.0: biological context for short linear motifs in proteins
TLDR
The SLiMSearch 2.0 (Short, Linear Motif Search) web server allows researchers to identify occurrences of a user-defined SLiM in a proteome, using conservation and protein disorder context statistics to rank occurrences.
SLiMFinder: a web server to find novel, significantly over-represented, short protein motifs
TLDR
The Short, Linear Motif Finder web server is a de novo motif discovery tool that identifies statistically over-represented motifs in a set of protein sequences, accounting for the evolutionary relationships between them.
A proposed syntax for Minimotif Semantics, version 1
Background: One of the most important developments in bioinformatics over the past few decades has been the observation that short linear peptide sequences (minimotifs) mediate many classes of cellular

References

Minimotif Miner: A Computational Tool to Investigate Protein Function, Disease, and Genetic Diversity
  • M. Schiller
  • Biology, Computer Science
    Current protocols in protein science
  • 2007
TLDR
Scoring based on evolutionary conservation, protein surface prediction, and motif frequency can be used in conjunction with other motif programs and the known biology of the query to reduce false‐positive predictions and select short motifs for experimental pursuit.
ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins
TLDR
A new and improved version of the web-based tool for detecting PROSITE signature matches in protein sequences that makes use of ProRules—context-dependent annotation templates—to detect functional and structural intra-domain residues.
The PRINTS Database of Protein Fingerprints: A Novel Information Resource for Computational Molecular Biology
TLDR
Progress is described with the PRINTS database, its compilation and interrogation software, and its Web interface, which covers a range of globular and membrane proteins, modular polypeptides, and so on.
The InterPro database, an integrated documentation resource for protein families, domains and functional sites
TLDR
InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects.
The PROSITE database, its status in 2002
TLDR
The PROSITE database consists of biologically significant patterns and profiles designed in such a way that with appropriate computational tools it can rapidly and reliably help to determine to which known family of proteins (if any) a new sequence belongs, or which known domain(s) it contains.
Recent improvements to the SMART domain-based sequence annotation resource
TLDR
The SMART database now contains information on intrinsic sequence features such as transmembrane regions, coiled-coils, signal peptides and internal repeats and new advanced queries provide direct access to the SMART relational database using SQL.
CDD: a curated Entrez database of conserved domain alignments
TLDR
The Conserved Domain Database (CDD), which mirrors the publicly available domain alignment collections SMART and PFAM, and now also contains alignment models curated at NCBI, is now indexed as a separate database within the Entrez system and linked to other Entrez databases such as MEDLINE(R).
Increased coverage of protein families with the Blocks Database servers
TLDR
Blocks+ nearly doubles the number of protein families included in the database by adding families from the Pfam-A, ProDom and Domo databases to those from PROSITE and PRINTS.
Minimotif Miner: a tool for investigating protein function
TLDR
A motif database comprising 312 unique motifs and a web-based tool for identifying motifs in proteins are constructed, and functional motifs predicted by MnM are validated by analyzing thousands of confirmed examples and by confirming prediction of previously unidentified 14-3-3 motifs in EFF-1.
Dictionary-driven protein annotation.
TLDR
This paper presents one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families.