Automated design of thousands of nonrepetitive parts for engineering stable genetic systems

@article{Hossain2020AutomatedDO,
  title={Automated design of thousands of nonrepetitive parts for engineering stable genetic systems},
  author={Ayaan Hossain and Eriberto Lopez and Sean M. Halper and Daniel P. Cetnar and Alexander C. Reis and Devin Strickland and Eric Klavins and Howard M. Salis},
  journal={Nature Biotechnology},
  year={2020},
  pages={1-10}
}
Engineered genetic systems are prone to failure when their genetic parts contain repetitive sequences. Designing many nonrepetitive genetic parts with desired functionalities remains a difficult challenge with high computational complexity. To overcome this challenge, we developed the Nonrepetitive Parts Calculator to rapidly generate thousands of highly nonrepetitive genetic parts from specified design constraints, including promoters, ribosome-binding sites and terminators. As a demonstration… 
Designing for durability: new tools to build stable, non-repetitive DNA
TLDR
The Non-Repetitive Parts Calculator (NRPC), a set of new algorithms described in a recent publication, is noteworthy for tackling a pervasive problem in synthetic biology, one seemingly at odds with the principles of the field.
Generating complex patterns of gene expression without regulatory circuits
TLDR
This work demonstrates that varying only the placement and strengths of promoters, terminators, and RNase cleavage sites in a computational model of a bacteriophage genome is sufficient to achieve solutions to a variety of basic expression patterns.
Automated Model-Predictive Design of Synthetic Promoters to Control Transcriptional Profiles in Bacteria
TLDR
A 346-parameter model predicted DNA’s interactions with RNA polymerase initiation complex, enabling accurate transcription rate predictions and automated promoter design in bacterial genetic systems.
Supervised generative design of regulatory DNA for gene expression control
TLDR
A deep neural network strategy that models whole gene regulatory structures and generates de novo functional regulatory DNA with prespecified expression levels and is experimentally demonstrate that this generative strategy is more efficient than a mutational one when using purely natural genomic data.
Evolutionary Stability Optimizer (ESO): A Novel Approach to Identify and Avoid Mutational Hotspots in DNA Sequences While Maintaining High Expression Levels
TLDR
The Evolutionary Stability Optimizer (ESO) is presented, a software tool that enables the large-scale automatic design of evolutionarily stable constructs with respect to both mutational and epigenetic hotspots and allows users to define custom hotspots to avoid.
Predicting bacterial promoter function and evolution from random sequences
TLDR
Using random mutant libraries, a biophysical model is developed that accounts for multiple features of σ70 binding bacterial promoters to predict constitutive gene expression levels from any sequence and identified that promoters evolve more rapidly than previously thought.
The Synthesis Success Calculator: Predicting the Rapid Synthesis of DNA Fragments with Machine Learning
TLDR
A machine learning model is developed, called the Synthesis Success Calculator, to predict whether a long DNA fragment can be readily synthesized with a short turnaround time and identifies the sequence determinants associated with the synthesis outcome.
PccGEO: prior constraints conditioned genetic elements optimization
TLDR
A knowledge-constraint deep learning model named PccGEO is proposed to automatically design functional genetic elements with high success rate and efficiency and supported that the implicit patterns are important in the design of functional gene elements.
An Integrative Toolbox for Synthetic Biology in Rhodococcus.
TLDR
A suite of integrative genetic tools to facilitate the engineering of Rhodococcus, a genus of bacteria with considerable biocatalytic potential, are developed and implemented, including the sustainable production of chemicals from lignin-derived aromatics.
Genetic circuit design automation with Cello 2.0.
TLDR
This protocol describes Cello 2.0, a freely available cross-platform software written in Java that enables flexible descriptions of the logic gates' structure and their mathematical models representing dynamic behavior, new formal rules for describing the placement of gates in a genome, a new graphical user interface, support for Verilog 2005 syntax and a connection to the SynBioHub parts repository software environment.
...
1
2
3
...

References

SHOWING 1-10 OF 54 REFERENCES
Simultaneous repression of multiple bacterial genes using nonrepetitive extra-long sgRNA arrays
TLDR
Using ELSAs, three highly selective phenotypes in Escherichia coli are created, including redirecting metabolism to increase succinic acid production by 150-fold, knocking down amino acid biosynthesis to create a multi-auxotrophic strain and repressing stress responses to reduce persister cell formation by 21-fold.
Predicting the Genetic Stability of Engineered DNA Sequences with the EFM Calculator.
TLDR
The Evolutionary Failure Mode (EFM) Calculator is created to computationally detect common sources of genetic instability in an input DNA sequence and predicts two types of mutational hotspots: deletions mediated by homologous recombination and indels caused by replication slippage on simple sequence repeats.
Measurement and modeling of intrinsic transcription terminators
TLDR
A genetic architecture is developed and validated that enables reliable measurement of termination efficiencies and it is found that structures extending beyond the core terminator stem are likely to increase terminator activity.
Composability of regulatory sequences controlling transcription and translation in Escherichia coli
TLDR
This work synthesized 12,563 combinations of common promoters and ribosome binding sites and simultaneously measured DNA, RNA, and protein levels from the entire library, finding that RNA and protein expression were within twofold of expected levels 80% and 64% of the time, respectively.
Deciphering eukaryotic gene-regulatory logic with 100 million random promoters
TLDR
Gene expression levels in yeast are predicted using a massive dataset on promoters with random sequences, showing that massive-throughput assays of fully random DNA can provide the big data necessary to develop complex, predictive models of gene regulation.
Design of synthetic yeast promoters via tuning of nucleosome architecture
TLDR
It is demonstrated that nucleosome architecture can play a role in defining yeast promoter activity and a computationally-guided approach is utilized that can enable both the redesign of endogenous promoter sequences and the de novo design of synthetic promoters.
Systematic Dissection of Sequence Elements Controlling σ70 Promoters Using a Genomically Encoded Multiplexed Reporter Assay in Escherichia coli.
TLDR
A novel multiplexed assay to study promoter function in E. coli is developed by building a site-specific genomic recombination-mediated cassette exchange system that allows for the facile construction and testing of large libraries of genetic designs integrated into precise genomic locations.
Metagenomic mining of regulatory elements enables programmable species-selective gene expression
TLDR
This work reports the large-scale metagenomic mining of thousands of natural 5′ regulatory sequences from diverse bacteria, and their multiplexed gene expression characterization in industrially relevant microbes, and identifies sequences with broad and host-specific expression properties that are robust in various growth conditions.
Characterization of 582 natural and synthetic terminators and quantification of their design constraints
TLDR
The availability of many terminators of varying strength, as well as an understanding of the sequence dependence of their properties, will extend their usability in the forward design of synthetic cistrons.
The development and characterization of synthetic minimal yeast promoters
TLDR
This work creates a set of non-homologous, purely synthetic, minimal promoters for yeast that can achieve high levels of both inducible and constitutive expression with up to an 80% reduction in size.
...
1
2
3
4
5
...