PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs

@article{Lischer2012PGDSpiderAA,
  title={PGDSpider: an automated data conversion tool for connecting population genetics and genomics programs},
  author={Heidi E. L. Lischer and Laurent Excoffier},
  journal={Bioinformatics},
  year={2012},
  volume={28},
  number={2},
  pages={298--299}
}
The analysis of genetic data often requires a combination of several approaches using different and sometimes incompatible programs. In order to facilitate data exchange and file conversions between population genetics programs, we introduce PGDSpider, a Java program that can read 27 different file formats and export data into 29, partially overlapping, other file formats. The PGDSpider package includes both an intuitive graphical user interface and a command-line version allowing…
widgetcon: A website and program for quick conversion among common population genetic data formats
TLDR
widgetcon, a website and a program developed to quickly and easily convert among various molecular data formats commonly used in phylogenetic analysis, population genetics, and other fields, is reported.
Extending GelJ for interoperability: Filling the gap in the bioinformatics resources for population genetics analysis with dominant markers
TLDR
GelJ has become the first tool that fills the gap between gel image processing software and population genetics analysis with dominant markers, phylogenetic reconstruction, and tree editing software by automating the process of generating the input for the latter software from gel banding patterns processed by GelJ.
genepopedit: a simple and flexible tool for manipulating multilocus molecular data in R
Advances in genetic sequencing technologies and techniques have made large, genome‐wide data sets comprised of hundreds or even thousands of individuals and loci the norm rather than the exception…
Linking genomics and population genetics with R
TLDR
How R, as a computing language and development environment, proposes some solutions to meet the challenges faced by data analysts and software developers when handling very big genetic data sets collected on many individuals is exposed.
Selecting RAD-Seq Data Analysis Parameters for Population Genetics: The More the Better?
TLDR
It is found that recovery of higher numbers of polymorphic loci is not necessarily associated with higher genetic differentiation, and that defining a systematic universal protocol for RAD-seq data analysis may lead to missing relevant information about population differentiation.
hybriddetective: A workflow and package to facilitate the detection of hybridization using genomic data in r
TLDR
A streamlined hybrid analysis platform providing improvements in speed, ease of use and repeatability over current ad hoc approaches, and the quantification and visualization of the efficiency and accuracy with which hybrids can be detected.
SNPConvert: SNP Array Standardization and Integration in Livestock Species
TLDR
The difficulties related to handling SNP array data are described, focusing on file formats, SNP allele coding, and mapping, and the SNPConvert suite is presented, a multi-platform, open-source, and user-friendly set of tools to overcome these issues.
SNP-sites: rapid efficient extraction of SNPs from multi-FASTA alignments
TLDR
SNPs can be extracted from an 8.3 GB alignment file using 59 MB of RAM and 1 CPU core, making it feasible to run on modest computers, and results are output in multiple formats for downstream analysis.
SNP-sites: rapid efficient extraction of SNPs from multi-FASTA alignments
TLDR
SNPs can be extracted from an 8.3 GB alignment file using 59 MB of RAM and 1 CPU core, making it feasible to run on modest computers and easy to install through the Debian and Homebrew package managers.
Software solutions for the livestock genomics SNP array revolution.
TLDR
This work describes a software toolkit for SNP array data management, imputation, genome-wide association studies, population genetics and genomic selection, but this toolkit does not solve the critical need for standardization of the genotypic data and software input files.

References

SHOWING 1-10 OF 101 REFERENCES
FORMATOMATIC: a program for converting diploid allelic data between common formats for population genetic analysis.
  • N. Manoukis
  • Biology, Medicine
  • Molecular ecology notes
  • 2007
There has been a great increase in both the number of population genetic analysis programs and the size of data sets being studied with them. Since the file formats required by the most popular and…
convert: A user‐friendly program to reformat diploid genotypic data for commonly used population genetic software packages
convert is a user-friendly, 32-bit Windows program that facilitates ready transfer of codominant, diploid genotypic data amongst commonly used population genetic software packages. convert reads…
Computer programs for population genetics data analysis: a survival guide
TLDR
The functionalities, special features and assumptions of more than 20 statistical packages, which offer a panel of standard and more sophisticated analyses, are described and discussed.
Analysing georeferenced population genetics data with Geneland: a new algorithm to deal with null alleles and a friendly graphical user interface
TLDR
A new algorithm to account for the presence of null alleles in inferences of population clusters from individual multilocus genetic data is introduced, and it is shown that this algorithm significantly improves their accuracy.
create: a software to create input files from diploid genotypic data for 52 genetic software programs
create is a Windows program for the creation of new and conversion of existing data input files for 52 genetic data analysis software programs. Programs are grouped into areas of sibship…
NEXUS: an extensible file format for systematic information.
TLDR
The goals of the format are to allow future expansion, to include diverse kinds of information, to be independent of particular computer operating systems, and to be easily processed by a program.
MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.
TLDR
Version 4 of MEGA software expands on the existing facilities for editing DNA sequence data from autosequencers, mining Web-databases, performing automatic and manual sequence alignment, analyzing sequence alignments to estimate evolutionary distances, inferring phylogenetic trees, and testing evolutionary hypotheses.
The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
TLDR
The FASTQ format is defined, covering the original Sanger standard, the Solexa/Illumina variants and conversion between them, based on publicly available information such as the MAQ documentation and conventions recently agreed by the Open Bioinformatics Foundation projects Biopython, BioPerl, BioRuby, BioJava and EMBOSS.
Using AFLP markers and the Geneland program for the inference of population genetic structure
TLDR
How the computer program Geneland, designed to infer population structure, has been adapted to deal with dominant markers is described, and Geneland is used for numerical comparison of dominant and codominant markers to perform clustering.
Identifying Currents in the Gene Pool for Bacterial Populations Using an Integrative Approach
TLDR
To meet the needs of large-scale analyses of population structure for bacteria, several statistical tools for the detection and representation of recombination between populations are introduced and a model-based description of the shape of a population in sequence space, in terms of its molecular variability and affinity towards other populations is introduced.