InterMine: extensive web services for modern biology
@article{Kalderimis2014InterMineEW, title={InterMine: extensive web services for modern biology}, author={Alexis Kalderimis and Rachel Lyne and Daniela Butano and Sergio Contrino and Mike Lyne and Joshua Heimbach and Fengyuan Hu and Richard N. Smith and Radek Stepan and Julie M. Sullivan and Gos Micklem}, journal={Nucleic Acids Research}, year={2014}, volume={42}, pages={W468 - W472} }
InterMine (www.intermine.org) is a biological data warehousing system providing extensive automatically generated and configurable RESTful web services that underpin the web interface and can be re-used in many other applications: to find and filter data; export it in a flexible and structured way; to upload, use, manipulate and analyze lists; to provide services for flexible retrieval of sequence segments, and for other statistical and analysis tools. Here we describe these features and…
105 Citations
Cross‐organism analysis using InterMine
- Computer Science, BiologyGenesis
- 2015
How InterMine databases have been developed for the major model organisms, budding yeast, nematode worm, fruit fly, zebrafish, mouse, and rat together with a newly developed human database is described to facilitate interoperation and development of cross‐organism analysis tools and reports.
Making Linked Data SPARQL with the InterMine Biological Data Warehouse
- Computer ScienceSWAT4LS
- 2016
This work uses Docker to bring together SPARQL-aware applications to search, browse, explore, and query the InterMine-based data, and supports new query functionality across InterMine installations and the network of open Linked Data.
Shared resources, shared costs—leveraging biocuration resources
- Computer ScienceDatabase J. Biol. Databases Curation
- 2015
The value of this approach in comparison with other apparently less costly options, such as automated annotation or text-mining, is argued, and ways in which databases can make cost savings by sharing infrastructure and tool development are discussed.
NetR and AttR, Two New Bioinformatic Tools to Integrate Diverse Datasets into Cytoscape Network and Attribute Files
- Biology, Computer ScienceGenes
- 2019
NetR and AttR are developed, which allow experimental biologists with little to no programming background to integrate publicly available datasets into files that can be later visualized with Cytoscape to display hypothetical networks that result from combining individual datasets, as well as a series of published attributes related to the genes or proteins in the network.
HumanMine: advanced data searching, analysis and cross-species comparison
- Computer Science, BiologybioRxiv
- 2022
HumanMine (www.humanmine.org) is an integrated database of human genomics and proteomics data that provides a powerful interface to support sophisticated exploration and analysis of data compiled…
BioGraph: a web application and a graph database for querying and analyzing bioinformatics resources
- Computer ScienceBMC Systems Biology
- 2018
BioGraph implements state-of-the-art technologies and provides pre-compiled bioinformatics scenarios, as well as the possibility to perform custom queries and obtaining an interactive and dynamic visualization of results.
WormBase: a modern Model Organism Information Resource
- Computer ScienceNucleic Acids Res.
- 2020
This update discusses the status of literature curation and recently added data, detail new features of the web interface and options for users wishing to conduct data mining workflows, and discusses the efforts to build a robust and scalable architecture by leveraging commercial cloud offerings.
EasyMirror and EasyImport: Simplifying the setup of a custom Ensembl database and webserver for any species
- Biology
- 2016
The EasyMirror and EasyImport pipelines are introduced to facilitate the setup and hosting of custom Ensembl genome browsers.
Metascape provides a biologist-oriented resource for the analysis of systems-level datasets
- Computer Science, BiologyNature Communications
- 2019
A biologist-oriented portal that provides a gene list annotation, enrichment and interactome resource and enables integrated analysis of multi-OMICs datasets, Metascape is an effective and efficient tool for experimental biologists to comprehensively analyze and interpret OMICs-based studies in the big data era.
Machado: open source genomics data integration framework
- Computer SciencebioRxiv
- 2020
Machado aims to be a modern object-relational framework that uses the latests Python libraries to produce an effective open source resource for genomics research.
References
SHOWING 1-10 OF 12 REFERENCES
InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data
- Computer ScienceBioinform.
- 2012
Using InterMine, large biological databases can be created from a range of heterogeneous data sources, and the extensible data model allows for easy integration of new data types.
YeastMine—an integrated data warehouse for Saccharomyces cerevisiae data as a multipurpose tool-kit
- Biology, Computer ScienceDatabase J. Biol. Databases Curation
- 2012
YeastMine is a multifaceted search and retrieval environment that provides access to diverse data types and offers multiple scenarios in which it can be used such as a powerful search interface, a discovery tool, a curation aid and also a complex database presentation format.
modMine: flexible access to modENCODE data
- Biology, Computer ScienceNucleic Acids Res.
- 2012
The modMine database (http://intermine.modencode.org) described here has been built by the modENCODE Data Coordination Center to allow the broader research community to search for and download data sets of interest among the thousands generated by modENCode.
Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists
- BiologyNucleic acids research
- 2009
The survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.
MitoMiner: a data warehouse for mitochondrial proteomics data
- BiologyNucleic Acids Res.
- 2012
MitoMiner can be used to characterize the variability of the mitochondrial proteome between tissues and investigate how changes in the proteome may contribute to mitochondrial dysfunction and mitochondrial-associated diseases such as cancer, neurodegenerative diseases, obesity, diabetes, heart failure and the ageing process.
FlyTF: improved annotation and enhanced functionality of the Drosophila transcription factor database
- BiologyNucleic Acids Res.
- 2010
The manual classification of TFs in the initial version of FlyTF has now been extended to a more fine-grained annotation of both DNA binding and regulatory properties in the new release.
TargetMine, an Integrated Data Warehouse for Candidate Gene Prioritisation and Target Discovery
- BiologyPloS one
- 2011
An objective protocol for target prioritisation using TargetMine is proposed and the results show that the protocol can identify known disease-associated genes with high precision and coverage.
MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database
- MedicineDatabase J. Biol. Databases Curation
- 2012
The construction, implementation, maintenance and use of MEDIC is described to raise awareness of this resource and to offer it as a putative scaffold in the formal construction of an official disease ontology.
metabolicMine: an integrated genomics, genetics and proteomics data warehouse for common metabolic disease research
- Biology, Computer ScienceDatabase J. Biol. Databases Curation
- 2013
This work presents metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases, developed in collaboration with leading UK metabolic disease groups and freely available online.
Real-time single-molecule observation of rolling-circle DNA replication
- BiologyNucleic acids research
- 2009
By attaching a rolling-circle substrate to a TIRF microscope-mounted flow chamber, this method allows for rapid and precise characterization of the kinetics of DNA synthesis and the effects of replication inhibitors.