Decoding the oak genome: public release of sequence data, assembly, annotation and publication strategies

@article{Plomion2016DecodingTO,
  title={Decoding the oak genome: public release of sequence data, assembly, annotation and publication strategies},
  author={Christophe Plomion and Jean‐Marc Aury and Joelle Amselem and Tina Alaeitabar and Val{\'e}rie Barbe and Caroline Belser and H{\'e}l{\`e}ne Berg{\`e}s and Cath{\'e}rine Bod{\'e}n{\`e}s and Nathalie Boudet and Christophe Boury and Aur{\'e}lie Canaguier and Arnaud Couloux and Corinne da Silva and S{\'e}bastien Duplessis and François Ehrenmann and Barbara Estrada-Mairey and St{\'e}phanie Fouteau and Nicolas Francillonne and Christine Gaspin and C{\'e}cile Guichard and Christophe Klopp and Karine Labadie and C{\'e}line Lalanne and Isabelle Le Clainche and Jean-Charles Lepl{\'e} and Gr{\'e}goire Le Provost and Thibault Leroy and Isabelle Lesur and Francis M. Martin and Jonathan Mercier and C{\'e}lia Michotey and Florent Murat and Franck Salin and Delphine Steinbach and Patricia Faivre-Rampant and Patrick Wincker and J{\'e}r{\^o}me Salse and Hadi Quesneville and Antoine Kremer},
  journal={Molecular Ecology Resources},
  year={2016},
  volume={16}
}
The 1.5 Gbp/2C genome of pedunculate oak (Quercus robur) has been sequenced. A strategy was established for dealing with the challenges imposed by the sequencing of such a large, complex and highly heterozygous genome by a whole‐genome shotgun (WGS) approach, without the use of costly and time‐consuming methods, such as fosmid or BAC clone‐based hierarchical sequencing methods. The sequencing strategy combined short and long reads. Over 49 million reads provided by Roche 454 GS‐FLX technology… 
The Draft Genome of the MD-2 Pineapple
TLDR
This MD-2 pineapple genome provides a high-quality draft for gene prediction and further downstream applications in pineapple, and the most optimal accuracy and contiguity were achieved in the de novo assembly of error-corrected long reads using Celera.
The draft genome sequence of cork oak
TLDR
The first draft version of the cork oak genome is produced, following a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size.
First Draft Assembly and Annotation of the Genome of a California Endemic Oak Quercus lobata Née (Fagaceae)
TLDR
A draft genome assembly of valley oak (Quercus lobata) is introduced using Illumina sequencing of adult leaf tissue of a tree found in an accessible, well-studied, natural southern California population, along with annotation of encoded genes.
Chromosome‐scale genome assembly of Castanopsis tibetana provides a powerful comparative framework to study the evolution and adaptation of Fagaceae trees
TLDR
The chromosome‐scale genome assembly of C. tibetana will expand Fagaceae genome resources across the family and provide a powerful comparative framework to study the adaptation and evolution of Fagales trees.
An improved method for chloroplast genome sequencing in non-model forest tree species
TLDR
A modified method for the isolation of intact chloroplasts is reported, which needs less than 0.5 g leaf material and can be applied to the reconstruction of chloroplast genomes for plant evolutionary studies, especially in non-model tree species.
De novo genome assembly and annotation of Australia's largest freshwater fish, the Murray cod (Maccullochella peelii), from Illumina and Nanopore sequencing read
TLDR
The whole genome of the Murray cod is presented and it is anticipated this will be a catalyst for a range of genetic, genomic, and phylogenetic studies of theMurray cod and more generally other fish species of the Percichthydae family.
A reference genome of the European beech (Fagus sylvatica L.)
TLDR
The assembled genome will be a valuable resource and reference for future population genomics studies on the evolution and past climate change adaptation of beech and will be helpful for identifying genes, e.g., involved in drought tolerance, in order to select and breed individuals to adapt forestry to climate change in Europe.
Development of Target Sequence Capture and Estimation of Genomic Relatedness in a Mixed Oak Stand
TLDR
A targeted capture-based, next-generation sequencing assay based on the highly heterozygous pedunculate oak reference genome, for the sequencing of 3 Mb of genic and intergenic regions is designed and validated by evaluating its reproducibility, and comparing the genomic relatedness of trees with their known pedigree relationship.
High-quality genetic mapping with ddRADseq in the non-model tree Quercus rubra
TLDR
The ddRADseq approach is useful for the construction of high-quality genetic maps in organisms lacking a reference genome if the parents and progeny are sequenced at sufficient depth and technical improvements in reduced representation sequencing (RRS) approaches are needed.
Whole Genome Mapping with Feature Sets from High-Throughput Sequencing Data
TLDR
A novel method, Feature sets-based Genome Mapping (FGM), with FGM, physical map and draft whole genome sequences can be generated, anchored and integrated using the same data set of NGS sequences, independent of restriction digestion.
...
...

References

SHOWING 1-10 OF 117 REFERENCES
Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data
TLDR
A whole-genome shotgun sequencing strategy using two Illumina sequencing platforms and an assembly approach using the ABySS software is described, demonstrating how recent improvements in the sequencing technology, especially increasing read lengths and paired end reads from longer fragments have a major impact on the assembly contiguity.
Decoding the massive genome of loblolly pine using haploid DNA and novel assembly strategies
TLDR
A novel strategy to sequence the genome of loblolly pine that combines unique aspects of pine reproductive biology and genome assembly methodology is developed, demonstrating a novel approach to sequencing the large and complex genomes of this important group of plants.
A sample view of the pedunculate oak (Quercus robur) genome from the sequencing of hypomethylated and random genomic libraries
TLDR
St striking similarities are found between Rd sequences and previously analyzed BAC end sequences of pedunculate oak, with a similar percentage of known repeat elements, an almost identical simple sequence repeat density, and twice as many SSR motifs in the Rd library as in the Hp library.
SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information
TLDR
The current work describes the SSPACE-LongRead software which is designed to upgrade incomplete draft genomes using single molecule sequences, and concludes that the recent advances of the PacBio sequencing technology and chemistry, in combination with the limited computational resources required to run the program, allow to scaffold genomes in a fast and reliable manner.
Pig genome sequence - analysis and publication strategy
TLDR
This marker paper sets out the plans for analysis of the pig genome sequence, for the application and publication of the results, in a timely manner and in advance of publication.
The sequence and de novo assembly of the giant panda genome
TLDR
Using next-generation sequencing technology alone, a draft sequence of the giant panda genome is generated and assembled, indicating that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition.
Analysis of BAC end sequences in oak, a keystone forest tree species, providing insight into the composition of its genome
TLDR
This BAC library for Quercus robur provides a new resource for genomic studies, including SSR marker development, physical mapping, comparative genomics and genome sequencing.
High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies
TLDR
The 454 GSFLX and Solexa/Illumina show great complementarity in producing large contigs and supercontigs with a low error rate and the potential to enhance continuity by scaffolding was tested.
Efficient de novo assembly of highly heterozygous genomes from whole-genome shotgun short reads.
TLDR
Platanus provides a novel and efficient approach for the assembly of gigabase-sized highly heterozygous genomes and is an attractive alternative to the existing assemblers designed for genomes of lower heterozygosity.
Comparison of next generation sequencing technologies for transcriptome characterization
TLDR
NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals and suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost.
...
...