Synthetic genomes
Synthetic genome is a synthetically built genome whose formation involves either genetic modification on pre-existing life forms or artificial gene synthesis to create new DNA or entire lifeforms.[1][2][3] The field that studies synthetic genomes is called synthetic genomics.
Recombinant DNA technology
[edit]Soon after the discovery of restriction endonucleases and ligases, the field of genetics began using these molecular tools to assemble artificial sequences from smaller fragments of synthetic or naturally occurring DNA. The advantage in using the recombinatory approach as opposed to continual DNA synthesis stems from the inverse relationship that exists between synthetic DNA length and percent purity of that synthetic length. In other words, as you synthesize longer sequences, the number of error-containing clones increases due to the inherent error rates of current technologies.[4] Although recombinant DNA technology is more commonly used in the construction of fusion proteins and plasmids, several techniques with larger capacities have emerged, allowing for the construction of entire genomes.[5]
Polymerase cycling assembly
[edit]Polymerase cycling assembly (PCA) uses a series of oligonucleotides (or oligos), approximately 40 to 60 nucleotides long, that altogether constitute both strands of the DNA being synthesized. These oligos are designed such that a single oligo from one strand contains a length of approximately 20 nucleotides at each end that is complementary to sequences of two different oligos on the opposite strand, thereby creating regions of overlap. The entire set is processed through cycles of: (a) hybridization at 60 °C; (b) elongation via Taq polymerase and a standard ligase; and (c) denaturation at 95 °C, forming progressively longer contiguous strands and ultimately resulting in the final genome.[6] PCA was used to generate the first synthetic genome in history, that of the Phi X 174 virus.[7]
Gibson assembly method
[edit]The gibson assembly method, designed by Daniel Gibson during his time at the J. Craig Venter Institute, requires a set of double-stranded DNA cassettes that constitute the entire genome being synthesized. Note that cassettes differ from contigs by definition, in that these sequences contain regions of homology to other cassettes for the purposes of recombination. In contrast to Polymerase Cycling Assembly, Gibson Assembly is a single-step, isothermal reaction with larger sequence-length capacity; ergo, it is used in place of Polymerase Cycling Assembly for genomes larger than 6 kb.
A T5 exonuclease performs a chew-back reaction at the terminal segments, working in the 5' to 3' direction, thereby producing complementary overhangs. The overhangs hybridize to each other, a Phusion DNA polymerase fills in any missing nucleotides and the nicks are sealed with a ligase. However, the genomes capable of being synthesized using this method alone is limited because as DNA cassettes increase in length, they require propagation in vitro in order to continue hybridizing; accordingly, Gibson assembly is often used in conjunction with Transformation-Associated Recombination (see below) to synthesize genomes several hundred kilobases in size.[8]
Transformation-associated recombination
[edit]The goal of transformation-associated recombination (TAR) technology in synthetic genomics is to combine DNA contigs by means of homologous recombination performed by the Yeast Artificial Chromosome (YAC). Of importance is the CEN element within the YAC vector, which corresponds to the yeast centromere. This sequence gives the vector the ability to behave in a chromosomal manner, thereby allowing it to perform homologous recombination.[9]
First, gap repair cloning is performed to generate regions of homology flanking the DNA contigs. Gap Repair Cloning is a particular form of the Polymerase Chain Reaction in which specialized primers with extensions beyond the sequence of the DNA target are utilized.[10] Then, the DNA cassettes are exposed to the YAC vector, which drives the process of homologous recombination, thereby connecting the DNA cassettes. Polymerase Cycling Assembly and TAR technology were used together to construct the 600 kb Mycoplasma genitalium genome in 2008, the first synthetic organism ever created.[11] Similar steps were taken in synthesizing the larger Mycoplasma mycoides genome a few years later.[12]
General creation of synthetic genomes
[edit]It is difficult to directly synthesize oligonucleotides larger than ~200 base pairs and maintain high fidelity.[13] Therefore, smaller oligonucleotides (around 5-20 base pairs) are combined to create genome-size oligonucleotides. Previous methods of stitching the smaller strands involved using T4 polynucleotide ligase. Modern techniques, like PCA/PCR based-methods have improved on this method, increasing speed and fidelity. To further increase fidelity, PCA-based methods can include an error-reversal step in which nucleases recognize and cut mismatched base pairs.[14] Recognition is possible because errors usually cause structural budges and abnormalities in the DNA.[15] Currently, a 4-Mb E. coli genome created in May 2019 holds the record for the largest synthetic genome size.[16]
See also
[edit]References
[edit]- ^ Yong, Ed. "The Mysterious Thing About a Marvelous New Synthetic Cell". The Atlantic. Retrieved 2017-09-12.
- ^ "Here's what we could really learn from a synthetic human genome". STAT. 2016-06-02. Retrieved 2017-09-12.
- ^ "The synthetic human genome could be around the corner - ExtremeTech". ExtremeTech. 2016-05-19. Retrieved 2017-09-12.
- ^ Montague, Michael G; Lartigue, Carole; Vashee, Sanjay (2012). "Synthetic genomics: potential and limitations". Current Opinion in Biotechnology. 23 (5): 659–665. doi:10.1016/j.copbio.2012.01.014. PMID 22342755.
- ^ Gibson, Daniel (2011). Synthetic Biology, Part B: Computer Aided Design and DNA Assembly; Chapter Fifteen - Enzymatic Assembly of Overlapping DNA Fragments. Academic Press. pp. 349–361. ISBN 978-0-12-385120-8.
- ^ Stemmer, Willem P. C.; Crameri, Andreas; Ha, Kim D.; Brennan, Thomas M.; Heyneker, Herbert L. (1995-10-16). "Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides". Gene. 164 (1): 49–53. doi:10.1016/0378-1119(95)00511-4. PMID 7590320.
- ^ Smith, Hamilton O.; Hutchison, Clyde A.; Pfannkoch, Cynthia; Venter, J. Craig (2003-12-23). "Generating a synthetic genome by whole genome assembly: φX174 bacteriophage from synthetic oligonucleotides". Proceedings of the National Academy of Sciences. 100 (26): 15440–15445. Bibcode:2003PNAS..10015440S. doi:10.1073/pnas.2237126100. ISSN 0027-8424. PMC 307586. PMID 14657399.
- ^ Gibson, Daniel G; Young, Lei; Chuang, Ray-Yuan; Venter, J Craig; Hutchison, Clyde A; Smith, Hamilton O (2009-04-12). "Enzymatic assembly of DNA molecules up to several hundred kilobases". Nature Methods. 6 (5): 343–345. doi:10.1038/nmeth.1318. PMID 19363495. S2CID 1351008.
- ^ Kouprina, Natalay; Larionov, Vladimir (2003-12-01). "Exploiting the yeast Saccharomyces cerevisiae for the study of the organization and evolution of complex genomes". FEMS Microbiology Reviews. 27 (5): 629–649. doi:10.1016/S0168-6445(03)00070-6. ISSN 1574-6976. PMID 14638416.
- ^ Marsischky, Gerald; LaBaer, Joshua (2004-10-15). "Many Paths to Many Clones: A Comparative Look at High-Throughput Cloning Methods". Genome Research. 14 (10b): 2020–2028. doi:10.1101/gr.2528804. ISSN 1088-9051. PMID 15489321.
- ^ Gibson, Daniel G.; Benders, Gwynedd A.; Andrews-Pfannkoch, Cynthia; Denisova, Evgeniya A.; Baden-Tillson, Holly; Zaveri, Jayshree; Stockwell, Timothy B.; Brownley, Anushka; Thomas, David W. (2008-02-29). "Complete Chemical Synthesis, Assembly, and Cloning of a Mycoplasma genitalium Genome". Science. 319 (5867): 1215–1220. Bibcode:2008Sci...319.1215G. doi:10.1126/science.1151721. ISSN 0036-8075. PMID 18218864. S2CID 8190996.
- ^ Gibson, Daniel G.; Glass, John I.; Lartigue, Carole; Noskov, Vladimir N.; Chuang, Ray-Yuan; Algire, Mikkel A.; Benders, Gwynedd A.; Montague, Michael G.; Ma, Li (2010-07-02). "Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome". Science. 329 (5987): 52–56. Bibcode:2010Sci...329...52G. doi:10.1126/science.1190719. ISSN 0036-8075. PMID 20488990.
- ^ Matsudaira, Paul (1989), Wittmann-Liebold, Brigitte (ed.), "Initial and Repetitive Yields from Proteins Blotted on PVDF Membranes", Methods in Protein Sequence Analysis: Proceedings of the 7th International Conference, Berlin, July 3–8, 1988, Berlin, Heidelberg: Springer, pp. 234–239, doi:10.1007/978-3-642-73834-0_30, ISBN 978-3-642-73834-0, retrieved 2021-04-21
- ^ Zhang, Weimin; Mitchell, Leslie A.; Bader, Joel S.; Boeke, Jef D. (2020-06-20). "Synthetic Genomes". Annual Review of Biochemistry. 89 (1): 77–101. doi:10.1146/annurev-biochem-013118-110704. ISSN 0066-4154. PMID 32569517. S2CID 219986041.
- ^ Davis, Leonard G.; Dibner, Mark D.; Battey, James F. (1986-01-01). "Restriction Endonucleases (REs) and Their Use". Basic Methods in Molecular Biology. Elsevier. pp. 51–57. doi:10.1016/B978-0-444-01082-7.50021-7. ISBN 978-0-444-01082-7.
- ^ Fredens, Julius; Wang, Kaihang; de la Torre, Daniel; Funke, Louise F. H.; Robertson, Wesley E.; Christova, Yonka; Chia, Tiongsun; Schmied, Wolfgang H.; Dunkelmann, Daniel L.; Beránek, Václav; Uttamapinant, Chayasith (May 2019). "Total synthesis of Escherichia coli with a recoded genome". Nature. 569 (7757): 514–518. Bibcode:2019Natur.569..514F. doi:10.1038/s41586-019-1192-5. ISSN 1476-4687. PMC 7039709. PMID 31092918.