DNA Fountain enables a robust and efficient storage architecture

@article{Erlich2017DNAFE,
  title={DNA Fountain enables a robust and efficient storage architecture},
  author={Yaniv Erlich and Dina Zielinski},
  journal={Science},
  year={2017},
  volume={355},
  pages={950 - 954}
}
DNA is an attractive medium to store digital information. [] Key Method We also tested a process that can allow 2.18 × 1015 retrievals using the original DNA sample and were able to perfectly decode the data. Finally, we explored the limit of our architecture in terms of bytes per molecule and obtained a perfect retrieval from a density of 215 petabytes per gram of DNA, orders of magnitude higher than previous reports.
Data Storage Based on Combinatorial Synthesis of DNA Shortmers
TLDR
A novel method to encode information into DNA molecules using combinatorial encoding and shortmer DNA synthesis, in compatibility with current chemical DNA synthesis technologies is proposed, based on a set of easily distinguishable DNA shortmers serving as building blocks and allowing for near-zero error rates.
Dynamic and scalable DNA-based information storage
TLDR
It is shown that a simple architecture comprised of a T7 promoter and a single-stranded overhang domain (ss-dsDNA), can unlock dynamic DNA-based information storage with powerful capabilities and advantages.
High capacity DNA data storage with variable-length Oligonucleotides using repeat accumulate code and hybrid mapping
TLDR
This work devised and implemented a DNA data storage scheme with variable-length oligonucleotides (oligos), where a hybrid DNA mapping scheme that converts digital data to DNA records is introduced and a oligo-level repeat-accumulate coding scheme is employed for addressing data loss and corruption in the biochemical processes.
A robust and efficient DNA storage architecture based on modulation encoding and decoding
TLDR
A modulation-based DNA storage architecture that is highly efficient and extremely robust for the detection of insertions and deletions, and will kickstart the actualization of a viable, large-scale system for DNA data storage.
High-scale random access on DNA storage systems
TLDR
This work proposes a general-purpose technique for addressing and directly accessing thousands to potentially millions of different data objects within the same DNA pool, and utilizes a fountain code, sophisticated probe design, and microarray technologies.
A high storage density strategy for digital information based on synthetic DNA
TLDR
The proposed strategy introduces quaternary Huffman coding to compress the binary stream of an original file before it is converted into a DNA sequence and can gain a very high compression ratio for files with a non-uniform probability distribution of the source.
Enzymatic DNA synthesis for digital information storage
TLDR
A de novo enzymatic DNA synthesis strategy designed from the bottom-up for information storage is established, and a template-independent DNA polymerase is harnessed for controlled synthesis of sequences with user-defined information content.
DNA-based data storage via combinatorial assembly
TLDR
The ability to replicate and selectively access the information while it is in DNA, opening up the possibility of more sophisticated DNA computation.
Improved DNA based storage capacity and fidelity using composite DNA letters
TLDR
This work codes 6.4 Megabyte data into composite DNA, achieving ∼25% increase in capacity as compared to literature, and demonstrates, on smaller scales, how flexible synthesis leads to 2.7 fold increased capacity per synthesized position.
An alternative approach to nucleic acid memory
TLDR
Digital Nucleic Acid Memory is reported for applications that require a limited amount of data to have high information density, redundancy, and copy number and offers an additional path to explore the advantages and disadvantages of DNA as an emerging memory material.
...
...

References

SHOWING 1-10 OF 65 REFERENCES
Towards practical, high-capacity, low-maintenance information storage in synthesized DNA
TLDR
Theoretical analysis indicates that the DNA-based storage scheme could be scaled far beyond current global information volumes and offers a realistic technology for large-scale, long-term and infrequently accessed digital archiving.
A DNA-Based Archival Storage System
TLDR
An architecture for a DNA-based archival storage system is presented, structured as a key-value store, and leverages common biochemical techniques to provide random access, and a new encoding scheme is proposed that offers controllable redundancy, trading off reliability for density.
A Rewritable, Random-Access DNA-Based Storage System
TLDR
The first DNA-based storage architecture that enables random access to data blocks and rewriting of information stored at arbitrary locations within the blocks is described, which suggests that DNA is a versatile media suitable for both ultrahigh density archival and rewritable storage applications.
Robust chemical preservation of digital information on DNA in silica with error-correcting codes.
TLDR
The original information could be recovered error free, even after treating the DNA in silica at 70 °C for one week, which is thermally equivalent to storing information on DNA in central Europe for 2000 years.
DNA-Based Storage: Trends and Methods
TLDR
The analytic contribution of the work is the construction and design of sequences over discrete alphabets that avoid pre-specified address patterns, have balanced base content, and exhibit other relevant substring constraints.
Scalable gene synthesis by selective amplification of DNA pools from high-fidelity microchips
TLDR
This work uses high-fidelity DNA microchips, selective oligonucleotide pool amplification, optimized gene assembly protocols and enzymatic error correction to develop a method for highly parallel gene synthesis.
Gene Assembly from Chip‐Synthesized Oligonucleotides
TLDR
This unit has recently demonstrated that some commercial DNA chip manufacturers have improved error rates, and that the issues of chemical complexity and low yields can be solved by using barcoded primers to accurately and efficiently amplify subpools of oligonucleotides.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems
TLDR
This year's ASPLOS conference saw a record, 232 submissions with a total of 986 paper authors from 240 institutions spread across at least 21 countries and spanning 5 continents, a clear indication that the community is growing, and that ASPLOS is the premier venue of choice for disseminating high quality interdisciplinary work.
MOLECULAR CYBERNETICS: THE NEXT STEP?
TLDR
The self‐replicating nature of the double helix, its remarkable stability and its infinitessimal size offer considerable scope for the use of this molecule as a generalized means of storing information over and above its biological function in evolution.
...
...