Patent application title: MULTIPLEXED DETERMINISTIC ASSEMBLY OF DNA LIBRARIES
Inventors:
Erik Jedediah Dean (Lafayette, CA, US)
Kedar Patel (Fremont, CA, US)
Aaron Miller (Berkeley, CA, US)
Kunal Mehta (Oakland, CA, US)
Philip Weyman (Alameda, CA, US)
IPC8 Class: AC12N1510FI
USPC Class:
1 1
Class name:
Publication date: 2021-10-21
Patent application number: 20210324378
Abstract:
The present disclosure relates to methods of joining three or more
double-stranded (ds) or single-stranded (ss) DNA molecules of interest in
vitro or in vivo. The method allows the joining of a large number of DNA
fragments, in a deterministic fashion. It can be used to rapidly generate
nucleic acid libraries that can be subsequently used in a variety of
applications that include, for example, genome editing and pathway
assembly. Kits for performing the method are also disclosed.Claims:
1.-28. (canceled)
29. A method for generating libraries of polynucleotides, the method comprising: (a) amplifying via polymerase chain reaction (PCR), a first pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair contains a first polynucleotide and a second polynucleotide, and wherein each first polynucleotide and each second polynucleotide in a pair comprises a 5' end and a 3' end, wherein the amplifying introduces a common overlap sequence comprising one or more recognition sequences for one or more site-specific nucleases onto the 5' end of a first polynucleotide and the 3'end of a second polynucleotide in a pair from the first pool; (b) assembling each pair of first polynucleotides and second polynucleotides from the first pool into a single nucleic acid fragment by utilizing the common overlap sequence, wherein the single nucleic fragment for each pair comprises a first polynucleotide and second polynucleotide separated by the common overlap sequence from the 5' end of the first polynucleotide and the 3' end of the second polynucleotide, and wherein the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic fragment for each pair are located on opposing terminal ends of the single nucleic acid fragment, distal to the one or more site-specific nuclease recognition sequence(s); (c) combining the single nucleic acid fragments for each pair with a second pool containing insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to the 3' end of the first polynucleotide present within the single nucleic acid fragment and a second assembly overlap sequence at its opposing 3'end that is complementary to the 5' end of the second polynucleotide present within the single nucleic acid fragment; (d) assembling the first pool and the second pool into a third pool of circularized products, wherein the assembling is performed via in vitro or in vivo overlap assembly methods, and wherein each circularized product in the third pool comprises an insert sequence from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool; (e) linearizing each circularized product in the third pool via digestion by one or more site-specific nuclease(s) that recognize(s) the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence in each of the circularized products in the third pool; and (f) assembling the linearized products into cloning vectors by in vivo or in vivo cloning methods.
30. The method of claim 29, wherein the first pool is generated by selecting pairs of polynucleotide sequences from a larger set of such sequences such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector.
31. The method of claim 30, wherein the specified threshold is between 5 and 15 contiguous nucleotides.
32. The method of claim 29, wherein the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence is a homing nuclease recognition sequence and the one or more site-specific nuclease(s) is a homing endonuclease.
33. The method of claim 29, wherein the common overlap sequence comprises an assembly overlap sequence of at least 1 nucleotide and the assembly in step (b) is performed by an overlap-based DNA assembly method.
34. The method of claim 33, wherein the overlap-based DNA assembly method is selected from splicing and overlap-extension PCR (SOI-PCR) or an in vitro overlap-assembly method.
35. The method of claim 34, wherein the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 5' end of the first polynucleotide is complementary to the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 3' end of the second polynucleotide in each pair, and wherein the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails performing SOE-PCR.
36. The method of claim 29, wherein the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails digesting the one or more site-specific nuclease recognition sequences present in the common overlap sequence on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair with one or more site specific nucleases for the one or more site-specific nuclease recognition sequences to generate single-stranded overhangs on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair that comprise complementary sequence; and ligating the complementary sequence present on the single-stranded overhang on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair.
37. The method of claim 29, wherein the assembling of step (d) is performed using an overlap-based DNA assembly method.
38. The method of claim 29, wherein the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair comprise an additional set of one or more site-specific nuclease recognition sequences and the first assembly overlap sequence and the second assembly overlap sequence in each insert polynucleotide in the second pool comprise one or more site-specific nuclease recognition sequences.
39. The method of claim 38, wherein the assembling in step (d) entails digesting the additional one or more site-specific nuclease recognition sequences present on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool with one or more site specific nucleases for the additional one or more site-specific nuclease recognition sequences on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool to generate a single-stranded overhang on the 3' end of the first polynucleotide that comprises sequence complementary to sequence present on a single-stranded overhang on the 5'end of the first assembly sequence of an insert polynucleotide from the second pool and a single stranded overhang on the 5' end of the second polynucleotide that comprises sequence complementary to a sequence present on a single-stranded overhang on the 3'end of the second assembly sequence of the same insert polynucleotide from the second pool; and ligating the complementary sequence present on the single-stranded overhangs.
40. The method of claim 29, wherein the cloning vectors of step (f) comprise one or more site-specific nuclease recognition sequences.
41. The method of claim 40, wherein the assembling in step (1) entails digesting the one or more site-specific nuclease recognition sequences in the cloning vectors with the one or more site-specific nucleases for the one or more site-specific nuclease recognition sequences recognition sequences present in the cloning vectors, wherein the digesting generates single-stranded overhangs on opposing ends of the cloning vectors, wherein the single-stranded overhang on one of the opposing ends of the cloning vector comprises sequence complementary to an end of the linearized product generated in step (e) and the single-stranded overhang on the other of the opposing ends of the cloning vectors comprises sequence complementary to an opposing end of the linearized product generated in step (e); and ligating the complementary sequences present on the single-stranded overhangs of the cloning vectors and the linearized products from step (e).
42. The method of claim 29, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the opposing terminal ends of the single nucleic acid fragment.
43. The method of claim 29, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a target genomic locus in a host cell.
44. The method of claim 29, wherein each insert polynucleotide in the second pool comprises one or more payload sequences located between the first assembly overlap sequence and the second assembly overlap sequence.
45. The method of claim 44, wherein the one or more payload sequences are selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags; terminators, unique identifier sequence or portions thereof.
46. The method of claim 43, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a different target genomic locus in a host cell as compared to each other pair in the first pool.
47. The method of claim 44, wherein each payload sequence in the insert polynucleotides in the second pool is different from the payload sequence in each other insert polynucleotide in the second pool.
48. The method of claim 44, wherein each insert polynucleotide in the second pool is generated by: (i) performing PCR on a mixture comprising the payload sequence, a forward primer and a reverse primer, wherein the forward primer comprises from 5' to 3', a short stretch of one or more nucleotides complementary to the payload sequence, the first assembly overlap sequence, one or more recognition sequences for one or more site-specific nucleases, the second assembly overlap sequence and a second stretch of one or more nucleotides complementary to the payload sequence and wherein the reverse primer comprises sequence complementary to the payload sequence or to other sequence downstream of the payload sequence, wherein the PCR generates a PCR product comprising from 5' to 3', the short stretch of nucleic acid complementary to the payload sequence, the first assembly overlap sequence, the one or more site-specific nuclease recognition sequence(s), the second assembly overlap sequence and the payload sequence; (ii) circularizing the PCR product via an assembly method selected from the group consisting of SOE-PCR, restriction-ligation, blunt-end ligation, overlap based assembly method and recombination-based method, or any other enzymatic or chemical method of joining two DNA molecules; and (iii) linearizing the circularized PCR product with one or more site-specific nuclease(s) that recognize the one or more site-specific nuclease recognition sequence(s), thereby generating the second pool of polynucleotides.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of priority to U.S. Provisional Application Ser. No. 62/753,254, filed Oct. 31, 2018, which is herein incorporated by reference in its entirety for all purposes.
FIELD
[0002] The present disclosure is directed to compositions and methods for joining single-stranded and/or double-stranded nucleic acid molecules permitting in vitro or in vivo assembly of multiple nucleic acid molecules with overlapping terminal sequences in a single reaction. The disclosed methods and compositions can be useful for deterministic assembly of fragments of nucleic acid sequences and can be used for editing any DNA sequence such as, for example, plasmids, cosmid or specific genes in the genome of desired host cells or organisms.
STATEMENT REGARDING SEQUENCE LISTING
[0003] The Sequence Listing associated with this application is provided in text format in lieu of a paper copy, and is hereby incorporated by reference into the specification. The name of the text file containing the Sequence Listing is ZYMR_029_01US_SeqList_ST25.txt. The text file is about 262 KB, and was created on Oct. 31, 2019, and is being submitted electronically via EFS-Web.
BACKGROUND
[0004] Traditionally, nucleic acid assemblies such as plasmid or linear DNA are generated one at a time in a deterministic fashion and, thus, can be slow, expensive and labor-intensive. In contrast, current pooled approaches for generating libraries of complex nucleic acid assemblies can enable the generation of many assemblies at once, but often result in libraries representing all possible combinations between the sets of parts in the assembly. Such approaches are a non-deterministic and combinatorial approach to assembly and can also be time-consuming, labor intensive and expensive, especially in circumstances where a subset of sequences are the desired product of the assembly reaction.
[0005] Thus, there is a need in the art for new methods for generating complex nucleic acid assemblies, which do not suffer from the aforementioned drawbacks inherent with traditional methods for generating nucleic acid assemblies.
SUMMARY
[0006] In one aspect, provided herein is a composition comprising a mixture of polynucleotides, the mixture comprising: a first pool containing pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide; and a second pool of insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3'end that is complementary to a 5' end of a second polynucleotide in a pair of polynucleotides from the first pool. In some cases, the composition further comprises a cloning vector, wherein, for each pair in the first pool, a 5' end of the first polynucleotide and a 3' end of the second polynucleotide comprises sequence complementary to the cloning vector. In some cases, each polynucleotide from the first pool is selected such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector. In some cases, the specified threshold is between 5 and 15 contiguous nucleotides. In some cases, the composition further comprises a polymerase. In some cases, the polymerase is strand-displacing or non-strand displacing. In some cases, the polymerase is non-strand displacing and the composition further comprises a crowding agent. In some cases, the crowding agent is polyethylene glycol (PEG). In some cases, the PEG is used at a concentration of from about 3 to about 7% (weight/volume). In some cases, the PEG is selected from PEG-200, PEG-4000, PEG-6000, PEG-8000 or PEG-20,000. In some cases, the polymerase is strand displacing and the composition further comprises a single-stranded binding protein. In some cases, the single strand DNA binding protein is an extreme thermostable single-stranded DNA binding protein (ET SSB), E. coli recA, T7 gene 2.5 product, phage lambda RedB or Rac prophage RecT. In some cases, the composition further comprises a 5'-3' exonuclease. In some cases, the composition further comprises a ligase. In some cases, each pair in the first pool is double-stranded DNA (dsDNA) or single-stranded DNA (ssDNA). In some cases, each insert polynucleotide in the second pool is dsDNA or ssDNA. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a target genomic locus in a host cell. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a gene that is part of a metabolic pathway. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a functional domain or one or more proteins. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide are linked together in a single construct, wherein the single construct comprises one or more recognition sequences for one or more site-specific nuclease(s) between the first polynucleotide and the second polynucleotide. In some cases, the one or more recognition sequences for one or more site-specific nuclease(s) comprises a homing endonuclease recognition sequence. In some cases, the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool. In some cases, the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises about 25 nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool. In some cases, each insert polynucleotide in the second pool comprises one or more payload sequences located between the first assembly overlap sequence and the second assembly overlap sequence. In some cases, the one or more payload sequences are selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags, terminators, unique identifier sequence or portions thereof. In some cases, each pair of first and second polynucleotides in the first pool comprises sequence corresponding to a different target genomic locus in a host cell as compared to each other pair in the first pool. In some cases, each pair of first and second polynucleotides in the first pool comprises sequence corresponding to the same target genomic locus in a host cell. In some cases, each payload sequence in the insert polynucleotides in the second pool is different from the payload sequence in each other insert polynucleotide in the second pool. In some cases, each payload sequence in the insert polynucleotides in the second pool is the same as the payload sequence in each other insert polynucleotide in the second pool. In some cases, the site-specific nuclease(s) is one or more of restriction endonuclease(s), Type IIs endonuclease(s), homing endonuclease(s), RNA-guided nuclease(s), DNA-guided nuclease(s), zinc-finger nuclease(s), Transcription activator-like effector nuclease(s) (TALEN(s)) or nicking enzyme(s).
[0007] In another aspect, provided herein is a method for generating libraries of polynucleotides, the method comprising: a.) combining a first pool of polynucleotides and a second pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, wherein the second pool contains insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3'end that is complementary to a 5' end of a second polynucleotide in a pair of polynucleotides from the first pool; b.) assembling the first pool and the second pool into a library of polynucleotides, wherein each polynucleotide in the library comprises an insert polynucleotide from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool, wherein the assembling is performed via in vitro cloning methods or in vivo cloning methods. In some cases, the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool. In some cases, the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises about 25 nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide are linked together in a single construct, wherein the single construct comprises one or more recognition sequences for one or more site-specific nuclease(s) between the first polynucleotide and the second polynucleotide. In some cases, the one or more recognition sequences for one or more site-specific nuclease(s) comprises a homing endonuclease recognition sequence. In some cases, the linked single construct is produced by joining individual first and second polynucleotides via splicing and overlap-extension PCR (SOE-PCR), restriction-ligation, blunt-end ligation, overlap-based assembly method, recombination-based method, or any other enzymatic or chemical method of joining the first and second polynucleotides, or by synthesizing the single construct directly. In some cases, the method further comprises combining a cloning vector with the first pool and the second pool during step (a), wherein opposing ends of the cloning vector comprise sequence complementary to a 5'end of the first polynucleotide and a 3' end of the second polynucleotide for each pair in the first pool. In some cases, the method further comprises combining a cloning vector with the first pool prior to step (a), wherein opposing ends of the cloning vector comprise sequence complementary to a 5'end of the first polynucleotide and a 3' end of the second polynucleotide for each pair in the first pool. In some cases, the cloning vector and the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair from the first pool comprise one or more recognition sequences for one or more site-specific nucleases. In some cases, the method further comprises generating single-stranded complementary overhangs between the opposing ends of the cloning vector and the 5'end of the first polynucleotide and the 3' end of the second polynucleotide in each pair from the first pool by adding the one or more site-specific nucleases for the one or more recognition sequences. In some cases, the method further comprises ligating the single-stranded complementary overhangs between the opposing ends of the cloning vector and the 5'end of the first polynucleotide and the 3' end of the second polynucleotide in each pair from the first pool. The ligating can be performed using a DNA ligase. In some cases, step (b) results in a circular product comprising an insert polynucleotide from the second pool, a first and second polynucleotide from a pair from the first pool and the cloning vector. In some cases, the first pool is generated by selecting pairs of polynucleotide sequences from a larger set of such sequences such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector. In some cases, the specified threshold is between 5 and 15 contiguous nucleotides. In some cases, the assembly is an in vitro cloning method, wherein the mixture of the first pool and the second pool is heated to partially or fully denature polynucleotides present in the first and the second pools, then cooled to room temperature before assembly. In some cases, prior to step (a), the first pool of polynucleotides is generated by combining a mixture containing each first polynucleotide from the pairs of polynucleotides with a mixture containing each second polynucleotide from the pairs of polynucleotides. In some cases, each pair in the first pool is double-stranded DNA (dsDNA) or single-stranded DNA (ssDNA). In some cases, each insert polynucleotide in the second pool is dsDNA or ssDNA. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a target genomic locus in a host cell. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a gene that is part of a metabolic pathway. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a functional domain or one or more proteins. In some cases, each insert polynucleotide in the second pool comprises one or more payload sequences located between the first assembly overlap sequence and the second assembly overlap sequence. In some cases, the one or more payload sequences are selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags, terminators, unique identifier sequence or portions thereof. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a different target genomic locus in a host cell as compared to each other pair in the first pool. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to the same target genomic locus in a host cell. In some cases, each payload sequence in the insert polynucleotides in the second pool is different from the payload sequence in each other insert polynucleotide in the second pool. In some cases, each payload sequence in the insert polynucleotides in the second pool is the same as the payload sequence in each other insert polynucleotide in the second pool. In some cases, each insert polynucleotide in the second pool is generated by: (i) performing a polymerase chain reaction (PCR) on a mixture comprising the payload sequence, a forward primer and a reverse primer, wherein the forward primer comprises from 5' to 3', a short stretch of one or more nucleotides complementary to the payload sequence, the first assembly overlap sequence, one or more recognition sequences for one or more site-specific nuclease(s), the second assembly overlap sequence and a second stretch of one or more nucleotides complementary to the payload sequence and wherein the reverse primer comprises sequence complementary to the payload sequence, wherein the PCR generates a PCR product comprising from 5' to 3', the short stretch of nucleic acid complementary to the payload sequence, the first assembly overlap sequence, the one or more site-specific nuclease recognition sequence(s), the second assembly overlap sequence and the payload sequence; (ii) circularizing the PCR product via an assembly method selected from the group consisting of splicing and overlap-extension PCR (SOE-PCR), restriction-ligation, blunt-end ligation, overlap-based assembly method, and recombination-based method, or any other enzymatic or chemical method for joining two DNA molecules; and (iii) linearizing the circularized PCR product with one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequence(s), thereby generating the second pool of polynucleotides. In some cases, the site-specific nuclease(s) is one or more of restriction endonuclease(s), Type IIs endonuclease(s), homing endonuclease(s), RNA-guided nuclease(s), DNA-guided nuclease(s), zinc-finger nuclease(s), TALEN(s) or nicking enzyme(s).
[0008] In yet another aspect, provided herein is a method for generating libraries of polynucleotides, the method comprising: (a) amplifying via polymerase chain reaction (PCR) a first pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, and wherein each first polynucleotide and each second polynucleotide in a pair comprises a 5' end and a 3' end, wherein the amplifying introduces a common overlap sequence comprising one or more recognition sequences for one or more site-specific nucleases onto the 5' end of a first polynucleotide and the 3' end of a second polynucleotide in a pair from the first pool; (b) assembling each pair of first polynucleotides and second polynucleotides from the first pool into a single nucleic acid fragment by utilizing common overlap sequence, wherein the single nucleic fragment for each pair comprises a first polynucleotide and second polynucleotide separated by the common overlap sequence from the 5' end of the first polynucleotide and the 3' end of the second polynucleotide, and wherein the 3'end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic fragment for each pair are located on opposing terminal ends of the single nucleic acid fragment, distal to the one or more site-specific nuclease recognition sequence(s); (c) combining the single nucleic acid fragments for each pair with a second pool containing insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to the 3' end of the first polynucleotide present within the single nucleic acid fragment and a second assembly overlap sequence at its opposing 3' end that is complementary to the 5' end of the second polynucleotide present within the single nucleic acid fragment; (d) assembling the first pool and the second pool into a third pool of circularized products, wherein the assembling is performed via in vitro or in vivo overlap assembly methods, and wherein each circularized product in the third pool comprises an insert sequence from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool; (e) linearizing each circularized product in the third pool via digestion by one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence in each of the circularized products in the third pool; and (f) assembling the linearized products into cloning vectors by in vitro or in vivo cloning methods. In some cases, the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence is a homing nuclease recognition sequence. In some cases, the one or more site-specific nuclease(s) for the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence is a homing endonuclease. In some cases, the common overlap sequence comprises an assembly overlap sequence of at least 1 nucleotide and the assembly in step (b) is performed by an overlap-based DNA assembly method. In some cases, the common overlap sequence comprises an assembly overlap sequence of from 10-25 nucleotides and the assembly in step (b) is performed by an overlap-based DNA assembly method. In some cases, the overlap-based DNA assembly method is selected from SOE-PCR or an in vitro overlap-assembly method (e.g., HiFi assembly). In some cases, the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 5' end of the first polynucleotide is complementary to the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 3' end of the second polynucleotide in each pair, and wherein the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails performing SOE-PCR. In some cases, the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails digesting the one or more site-specific nuclease recognition sequences present in the common overlap sequence on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair with one or more site specific nucleases for the one or more site-specific nuclease recognition sequences to generate single-stranded overhangs on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair that comprise complementary sequence; and ligating the complementary sequence present on the single-stranded overhang on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair. In some cases, the assembling of step (d) is performed using an overlap-based DNA assembly method. In some cases, the overlap-based DNA assembly is selected from SOE-PCR and an in vitro overlap-assembly method (e.g., HiFi assembly). In some cases, the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair comprise an additional set of one or more site-specific nuclease recognition sequences and the first assembly overlap sequence and the second assembly overlap sequence in each insert polynucleotide in the second pool comprise one or more site-specific nuclease recognition sequences. In some cases, the assembling in step (d) entails digesting the additional one or more site-specific nuclease recognition sequences present on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool with one or more site specific nucleases for the additional one or more site-specific nuclease recognition sequences on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool to generate a single-stranded overhang on the 3' end of the first polynucleotide that comprises sequence complementary to sequence present on a single-stranded overhang on the 5'end of the first assembly sequence of an insert polynucleotide from the second pool and a single stranded overhang on the 5' end of the second polynucleotide that comprises sequence complementary to a sequence present on a single-stranded overhang on the 3' end of the second assembly sequence of the same insert polynucleotide from the second pool; and ligating the complementary sequence present on the single-stranded overhangs. In some cases, the cloning vectors of step (f) comprise one or more site-specific nuclease recognition sequences. In some cases, the assembling in step (f) entails digesting the one or more site-specific nuclease recognition sequences in the cloning vectors with the one or more site-specific nucleases for the one or more site-specific nuclease recognition sequences recognition sequences present in the cloning vectors, wherein the digesting generates single-stranded overhangs on opposing ends of the cloning vectors, wherein the single-stranded overhang on one of the opposing ends of the cloning vector comprises sequence complementary to an end of the linearized product generated in step (e) and the single-stranded overhang on the other of the opposing ends of the cloning vectors comprises sequence complementary to an opposing end of the linearized product generated in step (e); and ligating the complementary sequences present on the single-stranded overhangs of the cloning vectors and the linearized products from step (e). In some cases, the first pool is generated by selecting pairs of polynucleotide sequences from a larger set of such sequences such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector. In some cases, the specified threshold is between 5 and 15 contiguous nucleotides. In some cases, the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the opposing terminal ends of the single nucleic acid fragment. In some cases, the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises about 25 nucleotides that are complementary to the opposing terminal ends of the single nucleic acid fragment. In some cases, prior to step (a), the first pool of polynucleotides is generated by combining a mixture containing each first polynucleotide from the pairs of polynucleotides with a mixture containing each second polynucleotide from the pairs of polynucleotides. In some cases, each pair in the first pool is double-stranded DNA (dsDNA) or single-stranded DNA (ssDNA). In some cases, each insert polynucleotide in the second pool is dsDNA or ssDNA. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a target genomic locus in a host cell. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a gene that is part of a metabolic pathway. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a functional domain or one or more proteins. In some cases, each insert polynucleotide in the second pool comprises one or more payload sequences located between the first assembly overlap sequence and the second assembly overlap sequence. In some cases, the one or more payload sequences are selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags, terminators, unique identifier sequence or portions thereof. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a different target genomic locus in a host cell as compared to each other pair in the first pool. In some cases, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to the same target genomic locus in a host cell. In some cases, each payload sequence in the insert polynucleotides in the second pool is different from the payload sequence in each other insert polynucleotide in the second pool. In some cases, each payload sequence in the insert polynucleotides in the second pool is the same as the payload sequence in each other insert polynucleotide in the second pool. In some cases, each insert polynucleotide in the second pool is generated by: (i) performing a polymerase chain reaction (PCR) on a mixture comprising the payload sequence, a forward primer and a reverse primer, wherein the forward primer comprises from 5' to 3', a short stretch of one or more nucleotides complementary to the payload sequence, the first assembly overlap sequence, one or more recognition sequences for one or more site-specific nuclease(s), the second assembly overlap sequence and a second stretch of one or more nucleotides complementary to the payload sequence and wherein the reverse primer comprises sequence complementary to the payload sequence or to other sequence downstream of the payload sequence, wherein the PCR generates a PCR product comprising from 5' to 3', the short stretch of nucleic acid complementary to the payload sequence, the first assembly overlap sequence, the one or more site-specific nuclease recognition sequence(s), the second assembly overlap sequence and the payload sequence; (ii) circularizing the PCR product via an assembly method selected from the group consisting of splicing and overlap-extension PCR (SOE-PCR), restriction-ligation, blunt-end ligation, overlap-based assembly method, and recombination-based method, or any other enzymatic or chemical method for joining two DNA molecules; and (iii) linearizing the circularized PCR product with one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequence(s), thereby generating the second pool of polynucleotides. In some cases, the site-specific nuclease(s) is one or more of restriction endonuclease(s), Type IIs endonuclease(s), homing endonuclease(s), RNA-guided nuclease(s), DNA-guided nuclease(s), zinc-finger nuclease(s), TALEN(s) or nicking enzyme(s).
BRIEF DESCRIPTION OF THE FIGURES
[0009] FIG. 1 depicts a method for multiplexed, deterministic assembly of DNA libraries showing an initial composition of insert polynucleotide(s) and first polynucleotides comprising a vector overlap assembly sequence and a second polynucleotide comprising a vector overlap assembly sequence, and optional cloning vector. The insert polynucleotide can comprise a payload sequence that has a length of zero nucleotides if a deletion, or nonzero if an insertion or replacement,
[0010] FIG. 2 illustrates an inside-out assembly method to pre-associate first and second polynucleotides for use in the method of FIG. 1 to allow for assembling insert polynucleotides (e.g., promoters) longer than a maximum synthetic oligonucleotide length.
[0011] FIG. 3 illustrates adaptation of the method of FIG. 1 to allow for assembling insert polynucleotides (e.g., promoters) longer than the maximum synthetic oligonucleotide length.
[0012] FIG. 4 illustrates assembly of deterministic libraries using the method of FIG. 1. The number of unique loci, payloads, and total possible constructs in each library is given in the table.
[0013] FIG. 5 illustrates the results of the successful in vitro assembly of deterministic libraries employing a pool comprising precisely designed DNA parts including a circular-permuted payload (insert). The long bars at the top indicate the structure of the plasmids to be assembled in the pool, and the shorter bars below represent Sanger sequences aligned to the corresponding reference sequence for three separate samples from the pool of assemblies. Faint vertical lines at the inner ends of the reads represent expected sequencing artifacts at the tail ends of Sanger reads.
[0014] FIG. 6 illustrates total success rates for pooled assemblies for which payload containing parts were created via PCR using primers that append the assembly overlaps from templates derived from a host genome.
DETAILED DESCRIPTION
Definitions
[0015] While the following terms are believed to be well understood by one of ordinary skill in the art, the following definitions are set forth to facilitate explanation of the presently disclosed subject matter.
[0016] As used herein, the term "a" or "an" can refer to one or more of that entity, i.e. can refer to a plural referents. As such, the terms "a" or "an", "one or more" and "at least one" can be used interchangeably herein. In addition, reference to "an element" by the indefinite article "a" or "an" does not exclude the possibility that more than one of the elements is present, unless the context clearly requires that there is one and only one of the elements.
[0017] Unless the context requires otherwise, throughout the present specification and claims, the word "comprise" and variations thereof, such as, "comprises" and "comprising" are to be construed in an open, inclusive sense that is as "including, but not limited to".
[0018] Reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment may be included in at least one embodiment of the present disclosure. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification may not necessarily all referring to the same embodiment. It is appreciated that certain features of the disclosure, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the disclosure, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination.
[0019] As used herein, the terms "cellular organism" "microorganism" or "microbe" should be taken broadly. These terms are used interchangeably and include, but are not limited to, the two prokaryotic domains, Bacteria and Archaea, as well as certain eukaryotic fungi and protists. In some embodiments, the disclosure refers to the "microorganisms" or "cellular organisms" or "microbes" of lists/tables and figures present in the disclosure. This characterization can refer to not only the identified taxonomic genera of the tables and figures, but also the identified taxonomic species, as well as the various novel and newly identified or designed strains of any organism in said tables or figures. The same characterization holds true for the recitation of these terms in other parts of the Specification, such as in the Examples.
[0020] As used herein, the term "prokaryotes" is art recognized and refers to cells that contain no nucleus or other cell organelles. The prokaryotes are generally classified in one of two domains, the Bacteria and the Archaea. The definitive difference between organisms of the Archaea and Bacteria domains is based on fundamental differences in the nucleotide base sequence in the 16S ribosomal RNA.
[0021] As used herein, the term "Archaea" refers to a categorization of organisms of the division Mendosicutes, typically found in unusual environments and distinguished from the rest of the prokaryotes by several criteria, including the number of ribosomal proteins and the lack of muramic acid in cell walls. On the basis of ssrRNA analysis, the Archaea consist of two phylogenetically-distinct groups: Crenarchaeota and Euryarchaeota. On the basis of their physiology, the Archaea can be organized into three types: methanogens (prokaryotes that produce methane); extreme halophiles (prokaryotes that live at very high concentrations of salt (NaCl); and extreme (hyper) thermophilus (prokaryotes that live at very high temperatures). Besides the unifying archaeal features that distinguish them from Bacteria (i.e., no murein in cell wall, ester-linked membrane lipids, etc.), these prokaryotes exhibit unique structural or biochemical attributes which adapt them to their particular habitats. The Crenarchaeota consists mainly of hyperthermophilic sulfur-dependent prokaryotes and the Euryarchaeota contains the methanogens and extreme halophiles.
[0022] As used herein, "bacteria" or "eubacteria" can refer to a domain of prokaryotic organisms. Bacteria include at least 11 distinct groups as follows: (1) Gram-positive (gram+) bacteria, of which there are two major subdivisions: (1) high G+C group (Actinomycetes, Mycobacteria, Micrococcus, others) (2) low G+C group (Bacillus, Clostridia, Lactobacillus, Staphylococci, Streptococci, Mycoplasmas); (2) Proteobacteria, e.g., Purple photosynthetic+non-photosynthetic Gram-negative bacteria (includes most "common" Gram-negative bacteria); (3) Cyanobacteria, e.g., oxygenic phototrophs; (4) Spirochetes and related species; (5) Planctomyces; (6) Bacteroides, Flavobacteria; (7) Chlamydia; (8) Green sulfur bacteria; (9) Green non-sulfur bacteria (also anaerobic phototrophs); (10) Radioresistant micrococci and relatives; (11) Thermotoga and Thermosipho thermophiles.
[0023] As used herein, a "eukaryote" is any organism whose cells contain a nucleus and other organelles enclosed within membranes. Eukaryotes belong to the taxon Eukarya or Eukaryota. The defining feature that sets eukaryotic cells apart from prokaryotic cells (the aforementioned Bacteria and Archaea) is that they have membrane-bound organelles, especially the nucleus, which contains the genetic material, and is enclosed by the nuclear envelope.
[0024] As used herein, the terms "genetically modified host cell," "recombinant host cell," and "recombinant strain" are used interchangeably herein and can refer to host cells that have been genetically modified by the cloning and transformation methods of the present disclosure. Thus, the terms include a host cell (e.g., bacteria, yeast cell, fungal cell, CHO, human cell, etc.) that has been genetically altered, modified, or engineered, such that it exhibits an altered, modified, or different genotype and/or phenotype (e.g., when the genetic modification affects coding nucleic acid sequences of the microorganism), as compared to the naturally-occurring organism from which it was derived. It is understood that in some embodiments, the terms refer not only to the particular recombinant host cell in question, but also to the progeny or potential progeny of such a host cell
[0025] As used herein, the term "wild-type microorganism" or "wild-type host cell" can describe a cell that occurs in nature, i.e. a cell that has not been genetically modified.
[0026] As used herein, the term "genetically engineered" may refer to any manipulation of a host cell's genome (e.g. by insertion, deletion, mutation, or replacement of nucleic acids).
[0027] As used herein, the term "control" or "control host cell" can refer to an appropriate comparator host cell for determining the effect of a genetic modification or experimental treatment. In some embodiments, the control host cell is a wild type cell. In other embodiments, a control host cell is genetically identical to the genetically modified host cell, save for the genetic modification(s) differentiating the treatment host cell. In some embodiments, the present disclosure teaches the use of parent strains as control host cells (e.g., the Si strain that was used as the basis for the strain improvement program). In other embodiments, a host cell may be a genetically identical cell that lacks a specific promoter or SNP being tested in the treatment host cell.
[0028] As used herein, the term "allele(s)" can mean any of one or more alternative forms of a gene, all of which alleles relate to at least one trait or characteristic. In a diploid cell, the two alleles of a given gene occupy corresponding loci on a pair of homologous chromosomes.
[0029] As used herein, the term "locus" (loci plural) can mean any site at which an edit to the native genomic sequence is desired. In one embodiment, said term can mean a specific place or places or a site on a chromosome where for example a gene or genetic marker is found.
[0030] As used herein, the term "genetically linked" can refer to two or more traits that are co-inherited at a high rate during breeding such that they are difficult to separate through crossing.
[0031] A "recombination" or "recombination event" as used herein can refer to a chromosomal crossing over or independent assortment.
[0032] As used herein, the term "phenotype" can refer to the observable characteristics of an individual cell, cell culture, organism, or group of organisms, which results from the interaction between that individual's genetic makeup (i.e., genotype) and the environment.
[0033] As used herein, the term "chimeric" or "recombinant" when describing a nucleic acid sequence or a protein sequence can refer to a nucleic acid, or a protein sequence, that links at least two heterologous polynucleotides, or two heterologous polypeptides, into a single macromolecule, or that rearranges one or more elements of at least one natural nucleic acid or protein sequence. For example, the term "recombinant" can refer to an artificial combination of two otherwise separated segments of sequence, e.g., by chemical synthesis or by the manipulation of isolated segments of nucleic acids by genetic engineering techniques.
[0034] As used herein, a "synthetic nucleotide sequence" or "synthetic polynucleotide sequence" is a nucleotide sequence that is not known to occur in nature or that is not naturally occurring. Generally, such a synthetic nucleotide sequence can comprise at least one nucleotide difference when compared to any other naturally occurring nucleotide sequence.
[0035] As used herein, the term "nucleic acid" can refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides, or analogs thereof. This term can refer to the primary structure of the molecule, and thus includes double- and single-stranded DNA, as well as double- and single-stranded RNA. It also includes modified nucleic acids such as methylated and/or capped nucleic acids, nucleic acids containing modified bases, backbone modifications, and the like. The terms "nucleic acid" and "nucleotide sequence" are used interchangeably.
[0036] As used herein, the term "gene" can refer to any segment of DNA associated with a biological function. Thus, genes can include, but are not limited to, coding sequences and/or the regulatory sequences required for their expression. Genes can also include non-expressed DNA segments that, for example, form recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence information, and may include sequences designed to have desired parameters.
[0037] As used herein, the term "homologous" or "homologue" or "ortholog" or "orthologue" is known in the art and can refer to related sequences that share a common ancestor or family member and are determined based on the degree of sequence identity.
[0038] The terms "homology," "homologous," "substantially similar" and "corresponding substantially" can be used interchangeably herein. Said terms can refer to nucleic acid fragments wherein changes in one or more nucleotide bases do not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype. These terms can also refer to modifications of the nucleic acid fragments of the instant disclosure such as deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the initial, unmodified fragment. It is therefore understood, as those skilled in the art will appreciate, that the disclosure encompasses more than the specific exemplary sequences. These terms describe the relationship between a gene found in one species, subspecies, variety, cultivar or strain and the corresponding or equivalent gene in another species, subspecies, variety, cultivar or strain. For purposes of this disclosure homologous sequences are compared.
[0039] "Homologous sequences" or "homologues" or "orthologs" are thought, believed, or known to be functionally related. A functional relationship may be indicated in any one of a number of ways, including, but not limited to: (a) degree of sequence identity and/or (b) the same or similar biological function. Preferably, both (a) and (b) are indicated. Sequence homology between amino acid or nucleic acid sequences can be defined in terms of shared ancestry. Two segments of nucleic acid can have shared ancestry because of either a speciation event (orthologs) or a duplication event (paralogs). Homology among amino acid or nucleic acid sequences can be inferred from their sequence similarity such that amino acid or nucleic acid sequences are said to be homologous is said amino acid or nucleic acid sequences share significant similarity. Significant similarity can be strong evidence that two sequences are related by divergent evolution from a common ancestor. Alignments of multiple sequences can be used to discover the homologous regions. Homology can be determined using software programs readily available in the art, such as those discussed in Current Protocols in Molecular Biology (F. M. Ausubel et al., eds., 1987) Supplement 30, section 7.718, Table 7.71. Some alignment programs are BLAST (NCBI), MacVector (Oxford Molecular Ltd, Oxford, U.K.), ALIGN Plus (Scientific and Educational Software, Pennsylvania) and AlignX (Vector NTI, Invitrogen, Carlsbad, Calif.). Another alignment program is Sequencher (Gene Codes, Ann Arbor, Mich.), using default parameters.
[0040] As used herein, the term "endogenous" or "endogenous gene," can refer to the naturally occurring gene, in the location in which it is naturally found within the host cell genome. In the context of the present disclosure, operably linking a heterologous promoter to an endogenous gene means genetically inserting a heterologous promoter sequence in front of an existing gene, in the location where that gene is naturally present. An endogenous gene as described herein can include alleles of naturally occurring genes that have been mutated according to any of the methods of the present disclosure.
[0041] As used herein, the term "exogenous" is used interchangeably with the term "heterologous," and refers to a substance coming from some source other than its native source. For example, the terms "exogenous protein," or "exogenous gene" refer to a protein or gene from a non-native source or location, and that have been artificially supplied to a biological system.
[0042] As used herein, the term "nucleotide change" refers to, e.g., nucleotide substitution, deletion, and/or insertion, as is well understood in the art. For example, mutations can contain alterations that produce silent substitutions, additions, or deletions, but do not alter the properties or activities of the encoded protein or how the proteins are made. Alternatively, mutations can be nonsynonymous substitutions or changes that can alter the amino acid sequence of the encoded protein and can result in an alteration in properties or activities of the protein.
[0043] As used herein, the term "protein modification" can refer to, e.g., amino acid substitution, amino acid modification, deletion, and/or insertion, as is well understood in the art.
[0044] As used herein, the term "at least a portion" or "fragment" of a nucleic acid or polypeptide can mean a portion having the minimal size characteristics of such sequences, or any larger fragment of the full-length molecule, up to and including the full-length molecule. A fragment of a polynucleotide of the disclosure may encode a biologically active portion of a genetic regulatory element. A biologically active portion of a genetic regulatory element can be prepared by isolating a portion of one of the polynucleotides of the disclosure that comprises the genetic regulatory element and assessing activity as described herein. Similarly, a portion of a polypeptide may be 4 amino acids, 5 amino acids, 6 amino acids, 7 amino acids, and so on, going up to the full length polypeptide. The length of the portion to be used will depend on the particular application. A portion of a nucleic acid useful as a hybridization probe may be as short as 12 nucleotides; in some embodiments, it is 20 nucleotides. A portion of a polypeptide useful as an epitope may be as short as 4 amino acids. A portion of a polypeptide that performs the function of the full-length polypeptide would generally be longer than 4 amino acids.
[0045] Variant polynucleotides can also encompass sequences derived from a mutagenic and recombinogenic procedure such as DNA shuffling. Strategies for such DNA shuffling are known in the art. See, for example, Stemmer (1994) PNAS 91:10747-10751; Stemmer (1994) Nature 370:389-391; Crameri et al. (1997) Nature Biotech. 15:436-438; Moore et al. (1997) J. Mol. Biol. 272:336-347; Zhang et al. (1997) PNAS 94:4504-4509; Crameri et al. (1998) Nature 391:288-291; and U.S. Pat. Nos. 5,605,793 and 5,837,458.
[0046] For PCR amplifications disclosed herein, oligonucleotide primers can be designed for use in PCR reactions to amplify corresponding DNA sequences from cDNA or genomic DNA extracted from any organism of interest. Methods for designing PCR primers and PCR cloning are generally known in the art and are disclosed in Sambrook et a/. (2001) Molecular Cloning: A Laboratory Manual (3.sup.rd ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.). See also Innis et al., eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press, New York); Innis and Gelfand, eds. (1995) PCR Strategies (Academic Press, New York); and Innis and Gelfand, eds. (1999) PCR Methods Manual (Academic Press, New York). Known methods of PCR include, but are not limited to, methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific primers, vector-specific primers, partially-mismatched primers, and the like.
[0047] The term "primer" as used herein can refer to an oligonucleotide which is capable of annealing to the amplification target allowing a DNA polymerase to attach, thereby serving as a point of initiation of DNA synthesis when placed under conditions in which synthesis of primer extension product is induced, i.e., in the presence of nucleotides and an agent for polymerization such as DNA polymerase and at a suitable temperature and pH. The (amplification) primer can be single stranded for maximum efficiency in amplification. The primer can be an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the agent for polymerization. The exact lengths of the primers will depend on many factors, including temperature and composition (A/T vs. G/C content) of primer. A pair of bi-directional primers consists of one forward and one reverse primer as commonly used in the art of DNA amplification such as in PCR amplification.
[0048] As used herein, "promoter" can refer to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA. In some embodiments, the promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. Accordingly, an "enhancer" can be a DNA sequence that can stimulate promoter activity, and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue specificity of a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of some variation may have identical promoter activity.
[0049] As used herein, the phrases "recombinant construct", "expression construct", "chimeric construct", "construct", and "recombinant DNA construct" are used interchangeably herein. A recombinant construct can comprise an artificial combination of nucleic acid fragments, e.g., regulatory and coding sequences that are not found together in nature. For example, a chimeric construct may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. Such construct may be used by itself or may be used in conjunction with a vector. If a vector is used then the choice of vector is dependent upon the method that will be used to transform host cells as is well known to those skilled in the art. For example, a plasmid vector can be used. The skilled artisan is well aware of the genetic elements that must be present on the vector in order to successfully transform, select and propagate host cells comprising any of the isolated nucleic acid fragments of the disclosure. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones et al., (1985) EMBO J. 4:2411-2418; De Almeida et al., (1989) Mol. Gen. Genetics 218:78-86), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by direct sequencing, Southern analysis of DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein expression, or phenotypic analysis, among others. Vectors can be plasmids, viruses, bacteriophages, pro-viruses, phagemids, transposons, artificial chromosomes, and the like, that replicate autonomously or can integrate into a chromosome of a host cell. A vector can also be a naked RNA polynucleotide, a naked DNA polynucleotide, a polynucleotide composed of both DNA and RNA within the same strand, a poly-lysine-conjugated DNA or RNA, a peptide-conjugated DNA or RNA, a liposome-conjugated DNA, or the like, that is not autonomously replicating. As used herein, the term "expression" refers to the production of a functional end-product e.g., an mRNA or a protein (precursor or mature).
[0050] "Operably linked" or "functionally linked" can mean the sequential arrangement of any functional payload according to the disclosure (e.g., promoter, terminator, degron, solubility tag, etc.) with a further oligo- or polynucleotide. In some cases, the sequential arrangement can result in transcription of said further polynucleotide. In some cases, the sequential arrangement can result in translation of said further polynucleotide. The functional payloads can be present upstream or downstream of the further oligo or polynucleotide. In one example, "operably linked" or "functionally linked" can mean a promoter controls the transcription of the gene adjacent or downstream or 3' to said promoter. In another example, "operably linked" or "functionally linked" can mean a terminator controls termination of transcription of the gene adjacent or upstream or 5' to said terminator.
[0051] The term "product of interest" or "biomolecule" as used herein can refer to any product produced by microbes from feedstock. In some cases, the product of interest may be a small molecule, enzyme, peptide, amino acid, organic acid, synthetic compound, fuel, alcohol, etc. For example, the product of interest or biomolecule may be any primary or secondary extracellular metabolite. The primary metabolite may be, inter alia, ethanol, citric acid, lactic acid, glutamic acid, glutamate, lysine, threonine, tryptophan and other amino acids, vitamins, polysaccharides, etc. The secondary metabolite may be, inter alia, an antibiotic compound like penicillin, or an immunosuppressant like cyclosporin A, a plant hormone like gibberellin, a statin drug like lovastatin, a fungicide like griseofulvin, etc. The product of interest or biomolecule may also be any intracellular component produced by a microbe, such as: a microbial enzyme, including: catalase, amylase, protease, pectinase, glucose isomerase, cellulase, hemicellulase, lipase, lactase, streptokinase, and many others. The intracellular component may also include recombinant proteins, such as insulin, hepatitis B vaccine, interferon, granulocyte colony-stimulating factor, streptokinase and others.
[0052] As used herein, the term "HTP genetic design library" or "library" refers to collections of genetic perturbations according to the present disclosure. In some embodiments, the libraries of the present disclosure may manifest as i) a collection of sequence information in a database or other computer file, ii) a collection of genetic constructs encoding for the aforementioned series of genetic elements, or iii) host cell strains comprising said genetic elements. In some embodiments, the libraries of the present disclosure may refer to collections of individual elements (e.g., collections of promoters for PRO swap libraries, collections of terminators for STOP swap libraries, collections of protein solubility tags for SOLUBILITY TAG swap libraries, or collections of protein degradation tags for DEGRADATION TAG swap libraries). In other embodiments, the libraries of the present disclosure may also refer to combinations of genetic elements, such as combinations of promoter:genes, gene:terminator, or even promoter:gene:terminators. In some embodiments, the libraries of the present disclosure may also refer to combinations of promoters, terminators, protein solubility tags and/or protein degradation tags. In some embodiments, the libraries of the present disclosure further comprise meta data associated with the effects of applying each member of the library in host organisms. For example, a library as used herein can include a collection of promoter::gene sequence combinations, together with the resulting effect of those combinations on one or more phenotypes in a particular species, thus improving the future predictive value of using said combination in future promoter swaps.
[0053] As used herein, the term "SNP" refers to Small Nuclear Polymorphism(s). In some embodiments, SNPs of the present disclosure should be construed broadly, and include single nucleotide polymorphisms, sequence insertions, deletions, inversions, and other sequence replacements. As used herein, the term "non-synonymous" or non-synonymous SNPs" refers to mutations that lead to coding changes in host cell proteins
[0054] A "high-throughput (HTP)" method of genomic engineering may involve the utilization of at least one piece of automated equipment (e.g. a liquid handler or plate handler machine) to carry out at least one-step of said method.
[0055] The term "polynucleotide" as used herein encompasses oligonucleotides and refers to a nucleic acid of any length. Polynucleotides may be DNA or RNA. Polynucleotides may be single-stranded (ss) or double-stranded (ds) unless otherwise specified. Polynucleotides may be synthetic, for example, synthesized in a DNA synthesizer, or naturally occurring, for example, extracted from a natural source, or derived from cloned or amplified material. Polynucleotides referred to herein can contain modified bases or nucleotides.
[0056] The term "pool", as used herein, can refer to a collection of at least 2 polynucleotides. In some embodiments, a set of polynucleotides may comprise at least 5, at least 10, at least 12 or at least 15 or more polynucleotides.
[0057] The term "overlapping sequence", or "overlapping assembly sequence" or "assembly overlap sequence" as used herein can refer to a sequence that is complementary in two polynucleotides and where the overlapping sequence is ss, on one polynucleotide such that it can be hybridized to another overlapping complementary ss region on another polynucleotide. An overlapping sequence may be at or close to (e.g., within about 5, 10, 20 nucleotides of) the terminal ends of two distinct polynucleotides. For example, if the two distinct polynucleotides are single-stranded, then the assembly overlap sequence would be present on the 3' terminal ends of each of the single-stranded polynucleotides. Alternatively, if the two distinct polynucleotides are double-stranded, then the assembly overlap sequence of one of the polynucleotides can be present on the 3' terminal end of said polynucleotide (i.e., 3' end in reference to the top strand of the ds polynucleotide), while the complementary assembly overlap sequence on the other polynucleotide can be present at the 5' end of said polynucleotide (i.e., 5' end in reference to the top strand of the ds polynucleotide) As necessary, the assembly overlap sequence on any ds polynucleotide may be made available by removing any non-overlapping sequence. The removal can be enzymatic such as through the use of a 3'-5' exonuclease activity of a polymerase.
[0058] As used herein, the term "assembling", can refer to a reaction in which two or more, four or more, six or more, eight or more, ten or more, 12 or more, 15 or more polynucleotides, e.g., four or more polynucleotides are joined to another to make a longer polynucleotide.
[0059] As used herein, the term "incubating under suitable reaction conditions", can refer to maintaining a reaction a suitable temperature and time to achieve the desired results, i.e., polynucleotide assembly. Reaction conditions suitable for the enzymes and reagents used in the present method are known (e.g. as described in the Examples herein) and, as such, suitable reaction conditions for the present method can be readily determined. These reactions conditions may change depending on the enzymes used (e.g., depending on their optimum temperatures, etc.).
[0060] As used herein, the term "joining", can refer to the production of covalent linkage between two sequences.
[0061] As used herein, the term "composition" can refer to a combination of reagents that may contain other reagents, e.g., glycerol, salt, dNTPs, etc., in addition to those listed. A composition may be in any form, e.g., aqueous or lyophilized, and may be at any state (e.g., frozen or in liquid form).
[0062] As used herein a "vector" is a suitable DNA into which a fragment or DNA assembly may be integrated such that the engineered vector can be replicated in a host cell. A linearized vector may be created restriction endonuclease digestion of a circular vector or by PCR. The concentration of fragments and/or linearized vectors can be determined by gel electrophoresis or other means.
Overview
[0063] Provided herein are methods and compositions that facilitate multiple assemblies to be produced in a single reaction in a deterministic rather than combinatorial manner. The methods and compositions provided herein impart the time, cost, and throughput benefits of multiplexed assembly while still enabling the creation of a library where all output assemblies are determined in advance. The methods and compositions provided herein allow for the creation of many plasmids or constructs in a single assembly reaction, reducing the number of total reactions required to create libraries of thousands of plasmids or constructs. The methods and compositions provided herein also allows for assembling a defined subset of desired plasmids or constructs out of a larger set of numerous possible combinations. In some cases, the methods and compositions provided herein minimizes the number of unique parts (`homology arms`) that need to be amplified from genomes (or synthesized) by not including any payload or insert sequence-specific assembly overlaps. This can eliminate the need to amplify multiple copies of the same homology-arm pair designed to be combined with multiple payloads or insert sequences. Further, diversity arising from combinations of payload/insert sequence and homology arm pairs is specified by sequences on the payload/insert sequence itself. The multitude of resulting payload sequences can be produced synthetically and inexpensively. Libraries generated using the methods and compositions provided herein can be suitable for any number of applications such as, for example, any genome editing methods or any pooled pathway assembly. The genome editing methods known in the art can be those that do not require tailored sites for RCME (recombinase cassette mediated exchange) for editing the genome of a cell at multiple arbitrary locations such as, for example, scarless genomic editing.
[0064] Provided herein is a composition comprising a mixture of polynucleotides for assembly in a deterministic fashion of a library of nucleic acid constructs. The mixture can comprise n pools of polynucleotide parts (e.g., first and second polynucleotides). Then pools can be at most, at least, or exactly 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 pools of polynucleotide. The n pools can each comprise an equal number of polynucleotide parts or they can comprise differing numbers of polynucleotide parts (e.g., first and second polynucleotides). In one embodiment, the mixture comprises 2 pools such that one of the two pools comprises first polynucleotides and the other of the two pools comprises second polynucleotides. Each pool of first polynucleotides can comprise a paired second polynucleotide in a separate pool of second polynucleotides. Further to any of the above embodiments, the mixture can further comprise n-1 pools of insert or bridging polynucleotides. Each insert or bridging polynucleotide can comprise sequence complementary to an element of one of the n pools of polynucleotide parts (e.g., first polynucleotide) at its 5' end and to an element of one of the other pools of polynucleotide parts (e.g., second polynucleotide) at its 3' end. The insert sequences can be designed such that the assembly results in a library of polynucleotides where each polynucleotide comprises a specific element from each of the n pools of polynucleotide parts, interspersed with a specific element from each of the n-1 pools of insert polynucleotides.
[0065] The mixture of polynucleotides can comprise: a first pool containing pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide; and a second pool of insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3' end that is complementary to a 5' end of a second polynucleotide in a pair of polynucleotides from the first pool. In one embodiment, the composition can further comprise a cloning vector, wherein, for each pair in the first pool, a 5' end of the first polynucleotide and a 3' end of the second polynucleotide comprises sequence complementary to the cloning vector. The cloning vector can be any cloning vector known in the art that is suitable for propagation in a host cell such as, for example, E. coli or S. cerevisiae. In another embodiment, the composition also comprises a polymerase, an exonuclease, a ligase or any combination thereof. The polymerase can be strand displacing or non-strand displacing. The exonuclease can be a 5'-3' exonuclease. The pairs of polynucleotides in the first pool can be double-stranded, single-stranded or a combination thereof. The insert polynucleotides in the second pool can be double-stranded, single-stranded, or a combination thereof. In one embodiment, the polymerase is non-strand displacing and the composition further comprises a crowding agent. The crowding agent can be selected from polyethylene glycol (PEG), ficoll or dextran. In one embodiment, the crowding agent is PEG. The PEG can be used at a concentration of from about 3 to about 7% (weight/volume). The PEG can be selected from PEG-200, PEG-4000, PEG-6000, PEG-8000 or PEG-20,000. In another embodiment, the polymerase is strand displacing and the composition further comprises a single-stranded binding protein. The single strand DNA binding protein can be an extreme thermostable single-stranded DNA binding protein (ET SSB), E. coli recA, T7 gene 2.5 product, phage lambda RedB or Rac prophage RecT.
[0066] In one embodiment, a composition provided herein is a mixture of the following polynucleotides: (1) one or more first polynucleotides, (2) one or more insert polynucleotides, wherein the insert polynucleotide comprises a first assembly overlap sequence at its 5' end and a second assembly overlap sequence at its opposing 3'end, and (3) one or more second polynucleotides. In another embodiment, the composition is a mixture of the following polynucleotides: (1) one or more first polynucleotides, (2) one or more insert polynucleotides, wherein the insert polynucleotide comprises a first assembly overlap sequence at its 5' end and a second assembly overlap sequence at its opposing 3'end, (3) one or more second polynucleotides and (4) a cloning vector. Each of the one or more first polynucleotides can comprise sequence at its 3' or distal end that is complementary to the first assembly overlap sequence present at the 5' or proximal end of an insert polynucleotide from the one or more insert polynucleotides. Each of the one or more second polynucleotides can comprise sequence at its 5' or proximal end that is complementary to the second assembly overlap sequence present at the 3' or distal end of an insert polynucleotide from the one or more insert polynucleotides. Each of the one or more first polynucleotides can be paired with at least one of the one or more second polynucleotides, thereby forming one or more pairs of first and second polynucleotides. Each pair of first and second polynucleotides can comprise sequence at the distal end of the first polynucleotide that is complementary to the first assembly overlap sequence on the proximal end of an insert polynucleotide from the one or more insert polynucleotides as well as sequence at the proximal end of the second polynucleotide that is complementary to the distal end of an insert polynucleotide from the one or more insert polynucleotides.
[0067] Provided herein is a method for generating libraries of polynucleotides, the method comprising: a.) combining n pools of polynucleotide parts (e.g., first and second polynucleotides) and n-1 pools of insert or bridging polynucleotides; and b.) assembling the n pools of polynucleotide parts and n-1 pools of insert polynucleotides into a library of polynucleotides, wherein each polynucleotide in the library comprises a defined combination of an individual element from each of then pools of polynucleotide parts and bridging polynucleotides. Each insert or bridging polynucleotide in the n-1 pools of insert or bridging polynucleotides comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3'end that is complementary to a 5' end of a second polynucleotide in the n pools of first and second polynucleotides. The assembling can be performed via in vitro or in vivo overlap assembly methods. In some cases, the assembling is performed via an in vitro cloning method, wherein the mixture of the n pools of polynucleotide parts and n-1 pools of insert or bridging polynucleotides is heated to partially or fully denature any double-stranded polynucleotide parts present, then cooled at a slow rate to room temperature before being subjected to the in vitro cloning method.
[0068] Also provided herein is a method for generating libraries of polynucleotides, the method comprising: (a) combining a first pool of polynucleotides and a second pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, wherein the second pool contains insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3' end that is complementary to a 5' end of a second polynucleotide in a pair of polynucleotides from the first pool; (b) assembling the first pool and the second pool into a library of polynucleotides, wherein each polynucleotide in the library comprises an insert polynucleotide from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool. The assembling can be performed via in vitro or in vivo overlap assembly methods. In some cases, the assembling is performed via an in vitro cloning method, wherein the mixture of the first pool and the second pool is heated to partially or fully denature polynucleotides present in the first and the second pools, then cooled at a slow rate to room temperature before being subjected to the in vitro cloning method. In some cases, the method further comprises combining a cloning vector with the first pool and the second pool during step (a), wherein opposing ends of the cloning vector comprise sequence complementary to a 5'end of the first polynucleotide and a 3' end of the second polynucleotide for each pair in the first pool. In some cases, the method further comprises combining a cloning vector with the first pool prior to step (a), wherein opposing ends of the cloning vector comprise sequence complementary to a 5' end of the first polynucleotide and a 3' end of the second polynucleotide for each pair in the first pool. In some cases, the cloning vector and the 5'end of the first polynucleotide and the 3'end of the second polynucleotide in each pair from the first pool comprise one or more recognition sequences for one or more site-specific nucleases. In some cases, the method further comprises generating single-stranded complementary overhangs between the opposing ends of the cloning vector and the 5'end of the first polynucleotide and the 3'end of the second polynucleotide in each pair from the first pool by adding the one or more site-specific nucleases for the one or more recognition sequences. In some cases, the method further comprises ligating the single-stranded complementary overhangs between the opposing ends of the cloning vector and the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair from the first pool. The ligating can be performed using a DNA ligase. In some cases, step (b) results in a circular product comprising an insert polynucleotide from the second pool, a first and second polynucleotide from a pair from the first pool and the cloning vector.
[0069] In one aspect, provided herein is a method for generating libraries of polynucleotides, the method comprising: (a) amplifying via polymerase chain reaction (PCR) a first pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, and wherein each first polynucleotide and each second polynucleotide in a pair comprises a 5' end and a 3' end, wherein the amplifying introduces a common overlap sequence comprising one or more recognition sequences for one or more site-specific nucleases onto the 5' end of a first polynucleotide and the 3' end of a second polynucleotide in a pair from the first pool; (b) assembling each pair of first polynucleotides and second polynucleotides from the first pool into a single nucleic acid fragment by utilizing common overlap sequence, wherein the single nucleic fragment for each pair comprises a first polynucleotide and second polynucleotide separated by the common overlap sequence from the 5' end of the first polynucleotide and the 3' end of the second polynucleotide, and wherein the 3'end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic fragment for each pair are located on opposing terminal ends of the single nucleic acid fragment, distal to the one or more site-specific nuclease recognition sequence(s); (c) combining the single nucleic acid fragments for each pair with a second pool containing insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to the 3' end of the first polynucleotide present within the single nucleic acid fragment and a second assembly overlap sequence at its opposing 3' end that is complementary to the 5' end of the second polynucleotide present within the single nucleic acid fragment; (d) assembling the first pool and the second pool into a third pool of circularized products, wherein the assembling is performed via in vitro or in vivo overlap assembly methods, and wherein each circularized product in the third pool comprises an insert sequence from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool; (e) linearizing each circularized product in the third pool via digestion by one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence in each of the circularized products in the third pool; and (f) assembling the linearized products into cloning vectors by in vitro or in vivo cloning methods. In some cases, the common overlap sequence comprises an assembly overlap sequence of at least 1 nucleotide and the assembly in step (b) is performed by an overlap-based DNA assembly method. In some cases, the common overlap sequence comprises an assembly overlap sequence of from 10-25 nucleotides and the assembly in step (b) is performed by an overlap-based DNA assembly method. In some cases, the overlap-based DNA assembly method is selected from SOE-PCR or an in vitro overlap-assembly method (e.g., HiFi assembly using NEB.RTM. HiFi builder). In some cases, the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 5' end of the first polynucleotide is complementary to the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 3' end of the second polynucleotide in each pair, and wherein the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails performing SOE-PCR. In some cases, the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails digesting the one or more site-specific nuclease recognition sequences present in the common overlap sequence on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair with one or more site specific nucleases for the one or more site-specific nuclease recognition sequences to generate single-stranded overhangs on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair that comprise complementary sequence; and ligating the complementary sequence present on the single-stranded overhang on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair. The assembling in step (d) can be performed via in vitro or in vivo overlap assembly methods. The assembling of step (d) can be performed using an overlap-based DNA assembly method. The overlap-based DNA assembly can be selected from SOE-PCR and an in vitro overlap-assembly method (e.g., HiFi assembly using NEB.RTM. HiFi builder). In some cases, the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair comprise an additional set of one or more site-specific nuclease recognition sequences and the first assembly overlap sequence and the second assembly overlap sequence in each insert polynucleotide in the second pool comprise one or more site-specific nuclease recognition sequences. In some cases, the assembling in step (d) entails digesting the additional one or more site-specific nuclease recognition sequences present on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool with one or more site specific nucleases for the additional one or more site-specific nuclease recognition sequences on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool to generate a single-stranded overhang on the 3' end of the first polynucleotide that comprises sequence complementary to sequence present on a single-stranded overhang on the 5' end of the first assembly sequence of an insert polynucleotide from the second pool and a single stranded overhang on the 5' end of the second polynucleotide that comprises sequence complementary to a sequence present on a single-stranded overhang on the 3'end of the second assembly sequence of the same insert polynucleotide from the second pool; and ligating the complementary sequence present on the single-stranded overhangs. In some cases, the assembling of step (d) is performed via an in vitro cloning method, wherein the mixture of the first pool and the second pool is heated to partially or fully denature polynucleotides present in the first and the second pools, then cooled at a slow rate to room temperature before being subjected to the in vitro cloning method. The assembling in step (f) can be performed via in vitro cloning methods or in vivo cloning methods. In some cases, the cloning vectors of step (f) comprise one or more site-specific nuclease recognition sequences. In some cases, the assembling in step (f) entails digesting the one or more site-specific nuclease recognition sequences in the cloning vectors with the one or more site-specific nucleases for the one or more site-specific nuclease recognition sequences recognition sequences present in the cloning vectors, wherein the digesting generates single-stranded overhangs on opposing ends of the cloning vectors, wherein the single-stranded overhang on one of the opposing ends of the cloning vector comprises sequence complementary to an end of the linearized product generated in step (e) and the single-stranded overhang on the other of the opposing ends of the cloning vectors comprises sequence complementary to an opposing end of the linearized product generated in step (e); and ligating the complementary sequences present on the single-stranded overhangs of the cloning vectors and the linearized products from step (e). A site-specific nuclease for use in any method or composition provided herein can be selected from a restriction endonuclease, Type IIs endonuclease(s), a homing endonuclease, an RNA-guided nuclease, a DNA-guided nuclease, a zinc-finger nuclease, a TALEN and a nicking enzyme or any combination thereof. The one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence can be one or more homing nuclease recognition sequence(s). The one or more site-specific nuclease(s) for the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence can be a homing endonuclease.
[0070] In another aspect, provided herein is a method for generating libraries of polynucleotides, the method comprising: (a) amplifying via polymerase chain reaction (PCR) a first pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, and wherein each first polynucleotide and each second polynucleotide in a pair comprises a first terminal 5' end and an opposing a second terminal 3' end, wherein the amplifying introduces one or more recognition sequences for one or more site-specific nuclease(s) onto the first terminal 5' end of a first polynucleotide and the 3' end of a second polynucleotide in a pair from the first pool, wherein the one or more recognition sequences for the one or more site-specific nuclease(s) on the first terminal 5' end of the first polynucleotide is complementary to the one or more recognition sequences for the one or more site-specific nuclease(s) on the first terminal 3' end of the second polynucleotide in the pair; (b) assembling each pair of first polynucleotides and second polynucleotides from the first pool into a single nucleic acid fragment by performing a splicing and overlap extension polymerase chain reaction (SOE-PCR) utilizing the one or more complementary site-specific nuclease recognition sequence(s) on the first terminal 5' ends of the first polynucleotides and the 3' ends of the second polynucleotides within each pair, wherein the single nucleic fragment for each pair comprises a first polynucleotide and second polynucleotide separated by the one or more site-specific nuclease recognition sequence(s) from the first terminal 5' ends of the first polynucleotide and the 3' end of the second polynucleotides, and wherein the opposite second terminal 3' ends of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic fragment for each pair are located on opposing terminal ends of the single nucleic acid fragment, distal to the one or more site-specific nuclease recognition sequence(s); (c) combining the single nucleic acid fragments for each pair with a second pool containing insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to the opposing terminal end one of the opposing 3' terminal end of the first polynucleotides present within the single nucleic acid fragment and a second assembly overlap sequence at its opposing 3' end that is complementary to the other of the opposing terminal 5' end of the second polynucleotides present within the single nucleic acid fragment; (d) assembling the first pool and the second pool into a third pool of circularized products, wherein the assembling is performed via in vitro or in vivo overlap assembly methods, and wherein each circularized product in the third pool comprises an insert sequence from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool; (e) linearizing each circularized product in the third pool via addition of one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and second polynucleotide sequence in each of the circularized products in the third pool; and (f) assembling the linearized products into cloning vectors by in vitro or in vivo cloning methods. The assembling in step (d) can be performed via in vitro or in vivo overlap assembly methods. In some cases, the assembling of step (d) is performed using an overlap-based DNA assembly method. The overlap-based DNA assembly can be selected from SOE-PCR and an in vitro overlap-assembly method (e.g., HiFi assembly using NEB.RTM. HiFi builder). In some cases, the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair comprise an additional set of one or more site-specific nuclease recognition sequences and the first assembly overlap sequence and the second assembly overlap sequence in each insert polynucleotide in the second pool comprise one or more site-specific nuclease recognition sequences. In some cases, the assembling in step (d) entails digesting the additional one or more site-specific nuclease recognition sequences present on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool with one or more site specific nucleases for the additional one or more site-specific nuclease recognition sequences on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool to generate a single-stranded overhang on the 3' end of the first polynucleotide that comprises sequence complementary to sequence present on a single-stranded overhang on the 5' end of the first assembly sequence of an insert polynucleotide from the second pool and a single stranded overhang on the 5' end of the second polynucleotide that comprises sequence complementary to a sequence present on a single-stranded overhang on the 3' end of the second assembly sequence of the same insert polynucleotide from the second pool; and ligating the complementary sequence present on the single-stranded overhangs. In some cases, the assembling of step (d) is performed via an in vitro cloning method, wherein the mixture of the first pool and the second pool is heated to partially or fully denature polynucleotides present in the first and the second pools, then cooled at a slow rate to room temperature before being subjected to the in vitro cloning method. The assembling in step (f) can be performed via in vitro cloning methods or in vivo cloning methods. In some cases, the cloning vectors of step (f) comprise one or more site-specific nuclease recognition sequences. In some cases, the assembling in step (f) entails digesting the one or more site-specific nuclease recognition sequences in the cloning vectors with the one or more site-specific nucleases for the one or more site-specific nuclease recognition sequences recognition sequences present in the cloning vectors, wherein the digesting generates single-stranded overhangs on opposing ends of the cloning vectors, wherein the single-stranded overhang on one of the opposing ends of the cloning vector comprises sequence complementary to an end of the linearized product generated in step (e) and the single-stranded overhang on the other of the opposing ends of the cloning vectors comprises sequence complementary to an opposing end of the linearized product generated in step (e); and ligating the complementary sequences present on the single-stranded overhangs of the cloning vectors and the linearized products from step (e). A site-specific nuclease for use in any method or composition provided herein can be selected from a restriction endonuclease, Type IIs endonuclease(s), a homing endonuclease, an RNA-guided nuclease, a DNA-guided nuclease, a zinc-finger nuclease, a TALEN and a nicking enzyme or any combination thereof.
[0071] In one embodiment, the first polynucleotides and the second polynucleotides in the methods and compositions provided herein comprise sequence complementary or corresponding to a target genomic locus in a host cell. The sequence complementary or corresponding to the target genomic locus present in the first and second polynucleotides can be located on the terminus of said first and second polynucleotide that opposes the terminus of said first and second polynucleotides that comprise sequence complementary to assembly overlap sequences present on an insert polynucleotide. When comprising sequence complementary or corresponding to a target genomic locus in a host cell, the first and second polynucleotides can be referred to as homology arms. In particular, each first polynucleotide can be referred to as a left homology arm, while each second polynucleotide can be referred to as a right homology arm. When comprising sequence complementary or corresponding to a target genomic locus in a host cell, generation of libraries of nucleic acid constructs through assembly of pairs of first and second polynucleotides and insert polynucleotides using the compositions and methods provided herein can be subsequently used in genome editing techniques for modifying the genome of a host cell. The host cell can be a prokaryotic cell or a eukaryotic host cell.
Polynucleotide Pairs
[0072] As described herein, the compositions and methods provided herein can comprise or utilize first polynucleotides and second polynucleotides such that each first polynucleotide is paired with a second polynucleotide. The first and second polynucleotides can be chemically synthesized (e.g., array synthesized or column synthesized) using any of the methods known in the art for synthesizing nucleic acids. The first and second polynucleotides can be amplified via an extension reaction (e.g., PCR) from existing DNA such as, for example, genomic DNA.
[0073] Each of the first and second polynucleotides can comprise functional and nonfunctional sequence or a combination thereof. The functional sequence can refer to sequence that represents a gene or a portion or domain thereof or a regulatory element or a portion thereof. As described further herein, the gene or the portion thereof can encode a protein that is part of a metabolic or biochemical pathway. Also as described further herein, the regulatory element can be a promoter, terminator, solubility tag, degradation tag or degron. The non-functional sequence can refer to sequence the does not represent a gene or portion thereof or a regulatory element or a portion thereof. The non-functional sequence can be sequence that aids in or is utilized for the assembly of said first and second polynucleotides with an insert polynucleotide as provided herein. In one embodiment, each of the first and second polynucleotides comprises a mixture of functional and non-functional sequence. In another embodiment, each of the first and second polynucleotides comprises of one or the other of functional or non-functional sequence. In embodiments where the first and second polynucleotides comprise only functional sequence, the functional sequence or a portion of the functional sequence can be utilized for the assembly of said first and second polynucleotides with an insert polynucleotide as provided herein.
[0074] The first and/or second polynucleotides can each vary in length and, in some cases, can be at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800, 900, 950 or 1000 nucleotide bases in length and/or may be more than 1 kb or 2 kb in length. Alternatively, the first and/or second polynucleotides can be 2 kb or more, or 1 kb or more or more than 900 bases, 800 bases, 700 bases, 600 bases, 500 bases, 400 bases, 300 bases, 200 bases or 100 bases in length. The first and/or second polynucleotides length can be in the range of 100 nucleotides-2 kb for example up to 100, up to 150, up to 200, up to 250, up to 300, up to 350, up to 400, up to 450, up to 500, up to 550, up to 600, up to 650, up to 700, up to 750, or up to 800, up to 850, up to 900, up to 950, up to 1000, up to 1500, or up to 2000 nucleotides. The minimum length of the first and/or second polynucleotides may be defined by a preferable Tm that is determined empirically.
[0075] As described herein, each of the first and second polynucleotide sequences can comprise sequence that aids in the assembly of said first and second polynucleotides with an insert polynucleotide. In order to aid in said assembly, said sequence can be complementary to the assembly overlap sequences present on insert polynucleotides. The sequence complementary to the assembly overlap sequences present on insert polynucleotides can also be referred to as assembly overlap sequences. In one embodiment, the assembly overlap sequences represent the entire first and/or second polynucleotide. In another embodiment, the assembly overlap sequences represent only a portion of the first and/or second polynucleotides and the first and/or second polynucleotides further comprise additional sequence beyond the assembly overlap sequences. In one embodiment, a first polynucleotide in a pair of first and second polynucleotides as provided herein comprises an assembly overlap sequence at its distal or 3'end that is complementary to a first assembly overlap sequence present at a 5' or proximal end of an insert polynucleotide, while a second polynucleotide in said pair comprises an overlap assembly overlap sequence at its proximal or 5'end that is complementary to a second assembly overlap sequence present at a 3' or distal end of said insert polynucleotide. Further to this embodiment, the first and second polynucleotide can each comprise additional sequence beyond the assembly overlap sequences. The additional sequence of the first and/or second polynucleotides can tailor said first and/or second polynucleotides to a specific application. The specific application can be any applications that utilize nucleic acid libraries known in the art, especially those that would benefit from a pooled deterministic assembly. Exemplary uses can include, but not be limited to, genome editing and pathway assembly.
[0076] The assembly overlap sequences present on a first and/or second polynucleotide can vary in length and, in some cases, can be at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 nucleotides in length and/or may be up 100 nucleotides in length (e.g., up to 50, up to 30, up to 25, up to 20 or up to 15 nucleotides in length). The assembly overlap sequences length can be in the range of 15 nucleotides-100 nucleotides for example up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 65, up to 70, up to 75, up to 80 nucleotides, up to 85 nucleotides, up to 90 nucleotides, up to 95 nucleotides or up to 100 nucleotides. The assembly overlap sequences can be the same length as an assembly overlap sequence present on an insert polynucleotide. The minimum length of the assembly overlap sequence may be defined by a preferable Tm that is determined empirically. In one embodiment, the assembly overlap sequence on a first and/or second polynucleotide comprises 1 or more nucleotides that are complementary to an end of an insert polynucleotide. In another embodiment, the assembly overlap sequence on a first and/or second polynucleotide comprises about 25 nucleotides that are complementary to an end of an insert polynucleotide.
[0077] As shown in FIG. 1, each of the pairs of first and second polynucleotides can further comprise vector overlap sequences with a cloning vector such that the first polynucleotides (i.e., the first DNA fragment in FIG. 1) may comprise a vector overlap sequence to the cloning vector at its 5' end, while the second polynucleotides (i.e., the second DNA fragment in FIG. 1) may comprise vector overlap sequences to the cloning vector at its 3' end. In embodiments, where each of the first polynucleotide and the second polynucleotide in a pair further comprise the first and second DNA fragments as provided herein, said first and second DNA fragments can be located downstream and adjacent to the vector overlap sequence to the cloning vector in the first polynucleotide and upstream and adjacent to the vector overlap sequence to the cloning vector in the second polynucleotide.
[0078] The vector overlap sequences can vary in length and, in some cases, can be at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 nucleotides in length and/or may be up 100 nucleotides in length (e.g., up to 50, up to 30, up to 25, up to 20 or up to 15 nucleotides in length). Alternatively, the vector overlap sequences can be 2 kb or less, or 1 kb or less or less than 900 bases, 800 bases, 700 bases, 600 bases, 500 bases, 400 bases, 300 bases, 200 bases or 100 bases. The vector overlap sequences length can be in the range of 15 nucleotides-80 nucleotides for example up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 65, up to 70, up to 75, or up to 80 nucleotides. The minimum length of the vector overlap sequence may be defined by a preferable Tm that is determined empirically.
[0079] In one embodiment, a pool containing pairs of first and second polynucleotides is generated by selecting pairs of first and second polynucleotide sequences from a larger set of such sequences such that no polynucleotide from said pool shares common sequence with any other polynucleotide from said pool beyond a specified threshold, excluding designed overlap assembly sequences between the pairs of polynucleotides of said pool and insert polynucleotides or pools thereof as provided herein, or said pool and a cloning vector. The specified threshold is at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 contiguous nucleotides. The specified threshold is at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 contiguous nucleotides. The specified threshold between 0 and 2, between 1 and 3, between 2 and 4, between 3-5, between 4 and 6, between 5 and 7, between 6 and 8, between 7 and 9, between 8 and 10, between 9 and 11, between 10 and 12, between 11 and 13, between 12 and 14, between 13 and 15, between 14 and 16, between 15 and 17, between 16 and 18, between 17 and 19, between 18 and 20 or between 19 and 21 contiguous nucleotides. The specified threshold between 0 and 5, between 0 and 10, between 0 and 15, between 0 and 20, between 5 and 10, between 5 and 15, between 5 and 20, between 10 and 15 or between 10 and 20 contiguous nucleotides. In one embodiment, the specified threshold is 12 contiguous nucleotides. Determination of a shared common sequence beyond the specified threshold can be done using a computer program that uses either BLAST analysis or simple substring searching to determine whether components share sequence with other components. If shared sequence is found beyond the specified threshold the components would not be placed into a pool together.
[0080] In one embodiment, pairing of an insert polynucleotide as described herein with a desired pair of first and second polynucleotides can be facilitated by preassembling the desired pair of first and second polynucleotides using an "inside-out assembly" method as shown in FIG. 2. In this method, the first and second polynucleotides can be amplified by PCR such that the vector proximal ends of the first polynucleotides each contain one or more unique site-specific nuclease site(s) or recognition sequence(s). The site-specific nuclease recognition sequences can be for site-specific nucleases selected from a restriction endonuclease, Type IIs endonuclease(s), a homing endonuclease, an RNA-guided nuclease, a DNA-guided nuclease, a zinc-finger nuclease, a TALEN and a nicking enzyme and any combinations thereof. In one embodiment, the vector proximal ends of the first polynucleotides each contain a single unique nuclease site or recognition sequence. In one embodiment, the unique nuclease recognition sequence is a unique restriction endonuclease site such that said restriction endonuclease site is not present in any of the polynucleotides present in a composition provided herein. In one embodiment, the unique nuclease site is a homing endonuclease sequence such as, for example, a homing endonuclease sequence specific for I-SceI or I-CeuI. A single pair of first and second polynucleotides are combined and a splicing and overlap extension polymerase chain reaction (SOE-PCR) is performed to assemble the two polynucleotides at the added unique nuclease sites (e.g. at the vector-proximal ends) leaving the ends that attach to an insert polynucleotide free. Alternatively, the entire sequence comprising the linked first and second polynucleotides can be synthesized directly using any of a variety of DNA synthesis methods known in the art. The attached first and second polynucleotides are assembled with the insert polynucleotide using an in vitro or in vivo overlap assembly method known in the art and/or provided herein, such as, for example, yeast (e.g., S cerevisiae) or E. coli homologous recombination based assembly, Gibson assembly or NEB.RTM. HiFi builder. The circularized product of the first and second polynucleotides with the insert polynucleotide can be linearized with the addition of the nuclease specific for the unique nuclease sequence (e.g., the homing endonuclease for the specific homing endonuclease sequence) resulting in the insert polynucleotide being flanked by the first and second polynucleotides which can then be assembled into the vector using Gibson assembly or other similar method.
Insert Polynucleotides/Payload Sequences
[0081] In one embodiment, an insert polynucleotide for use in a composition, kit or method provided herein comprises: (1) a first assembly overlap sequence on a 5' or proximal end of said insert polynucleotide, and (2) a second assembly overlap sequence on an opposing 3' or distal end of said insert polynucleotide. Further to this embodiment, the first assembly overlap sequence can comprise sequence complementary to sequence (e.g., an assembly overlap sequence) at a 3' or distal end of a first polynucleotide from a pair of first and second polynucleotides, while the second assembly overlap sequence can comprise sequence complementary to sequence (e.g., an assembly overlap sequence) at a 5' or proximal end of a second polynucleotide from the pair of first and second polynucleotides.
[0082] The first assembly overlap sequence and the second assembly overlap sequence on an insert polynucleotide provided herein can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50 or more nucleotides in length and/or may be up 100 nucleotides in length (e.g., up to 50, up to 30, up to 25, up to 20 or up to 15 nucleotides in length) that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides as provided herein. The assembly overlap sequences length can be in the range of 15 nucleotides-100 nucleotides for example up to 20, up to 25, up to 30, up to 35, up to 40, up to 45, up to 50, up to 55, up to 60, up to 65, up to 70, up to 75, up to 80 nucleotides, up to 85 nucleotides, up to 90 nucleotides, up to 95 nucleotides or up to 100 nucleotides. In one embodiment, the first assembly overlap sequence and the second assembly overlap sequence on an insert polynucleotide provided herein comprises 1 or more nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides provided herein. In another embodiment, the first assembly overlap sequence and the second assembly overlap sequence on an insert polynucleotide provided herein comprises about 25 nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides provided herein.
[0083] In another embodiment, the insert polynucleotide further comprises one or more payload sequences such that said one or more payload sequences are located between the first and second assembly overlap sequences. A payload sequence can be a random sequence. A payload sequence can be a marker sequence. The marker sequence can be any marker sequence known in the art. A payload sequence can be a gene or a portion thereof. The gene or portion thereof can be part of a metabolic or biochemical pathway. The gene or portion thereof can encode a protein or a domain thereof. A payload sequence can be selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags, nucleic acid sequence encoding degradation tags, terminators, barcodes, regulatory sequences or portions thereof. In some cases, the three components of the insert polynucleotide (i.e., the first assembly overlap sequence, the second assembly overlap sequence and the payload sequence) are synthesized or otherwise combined into contiguous pieces of DNA before use in an assembly method provided herein. In one embodiment, the first and second assembly overlaps are not random but designed to match specific pairs of first and second polynucleotides.
[0084] In embodiments where the pair of first and second polynucleotides comprise targeting sequences as described herein, a payload sequence present within an insert polynucleotide can result in an insertion relative to the original locus targeted by the targeting sequences on the pair of first and second polynucleotides, a deletion of sequence relative to the original locus targeted by the targeting sequences on the pair of first and second polynucleotides, or a replacement of one sequence with another. In the case of an insertion or modification, the `payload` can be the intended final sequence. In the case of a deletion, the `payload` can be a marker sequence or no sequence.
[0085] In one embodiment, the insert polynucleotides are used in a pooled fashion. Further to this embodiment, each insert polynucleotide in a pool of insert polynucleotides can comprise a first assembly overlap sequence that comprises sequence complementary to sequence (e.g., an assembly overlap sequence) at a 3' or distal end of a first polynucleotide from a pair of first and second polynucleotides and a second assembly overlap sequence that comprises sequence complementary to sequence (e.g., an assembly overlap sequence) at a 5' or proximal end of a second polynucleotide from the pair of first and second polynucleotides.
[0086] The pool of insert polynucleotides can contain any number of unique insert polynucleotide sequences. The number of insert polynucleotides can be at least, at most, or about 1, 5, 10, 25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000, 10,000, 20,000, 30,000, 40,000, 50,000, 75,000, 100,000, 150,000, 200,000 or 250,000 unique insert polynucleotides with or without a payload sequence.
[0087] A payload sequence can be at most or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900, 1000, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10,000 nucleotides in length. In some cases, the payload sequence can be 0 nucleotides in length. A payload sequence can be at a length such that when incorporated into an insert polynucleotide, the entire insert polynucleotide can be chemically synthesized. The synthesis can be an array-based or column based synthesis method as known in the art. In one embodiment, a payload sequence is of a length such that it can be directly included or synthesized in an insert oligonucleotide along with the first and second assembly overlaps. An insert polynucleotide that can be synthesized can be up to about 1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 250, 300, 350, 400 or more nucleotides in length.
[0088] In another embodiment, the insert polynucleotide can be generated in a single pool using the methods described in FIG. 3. As shown in FIG. 3, the payload sequence (e.g., the promoter sequence in FIG. 3) can be generated via PCR from three components: a pooled forward primer, a common reverse primer, and a payload template sequence (e.g., the promoter in FIG. 3). The payload sequence template can be a synthetic DNA fragment, a PCR product, or other single- or double-stranded DNA fragment. The pool of forward primers can be synthesized using array-based or column-based synthetic methods known in the art. Each forward primer in the pool can comprise (from 5' to 3'): 1) a sequence complementary to the distal or 3' end of the payload template sequence, 2) a second assembly overlap sequence comprising sequence complementary to a second polynucleotide from a pair of first and second polynucleotides, 3) one or more recognition sequences for one or more site-specific nuclease (e.g., a homing endonuclease site or recognition sequence), 4) a first assembly overlap sequence comprising sequence complementary to a first polynucleotide from a pair of first and second polynucleotides, and 5) a priming sequence that binds to the proximal end or 5' end of the payload template sequence. The common reverse primer can bind to the distal end or 3' end of the payload template sequence or to other sequence downstream of the payload sequence. PCR can be performed on the payload template sequence (e.g., the promoter in FIG. 3) using the pooled forward primers and the common reverse primer. After amplification, the PCR product can be circularized to generate a circular-permuted payload (insert) using an overlap assembly method known in the art, such as, for example, Gibson assembly, NEB.RTM. HIFI assembly, or similar methods, and then linearized using one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequences (e.g., the homing endonuclease, I-SceI in FIG. 3). Nuclease digestion can result in a fragment suitable for use as insert polynucleotide (e.g., the "payload" part described in FIG. 1), with the large payloads flanked by first and second assembly overlap sequences (e.g., the homology arms or regions flanking the promoter sequence in FIG. 3). As shown in FIG. 3, at the ends of the payload sequences can be small partial nuclease recognition sequences (e.g., I-SceI in FIG. 3) that can be excised by the overlap assembly method utilized (e.g. 3' and 5' exonuclease activities of Gibson assembly reagents, NEB.RTM. HIFI assembly reagents, or equivalent mixtures). The product can be optionally amplified (e.g., RCA) after circularization and before linearization.
[0089] In one embodiment, each insert polynucleotide comprises a payload sequence such that each insert polynucleotide in a pool of insert polynucleotides comprises a different payload sequence from the payload sequence in each other insert polynucleotide in said pool.
[0090] In another embodiment, each insert polynucleotide comprises a payload sequence such that each insert polynucleotide in a pool of insert polynucleotides comprises the same payload sequence as the payload sequence in each other insert polynucleotide in said pool.
Cloning Methods
[0091] As described herein, a composition comprising pairs of first and second polynucleotides as well as insert polynucleotides can be assembled into a library of nucleic acids comprising first and second polynucleotides with an insert polynucleotide therebetween. Assembly of the pairs of first and second polynucleotides with the insert polynucleotides as provided herein can be performed by either in vitro or in vivo cloning methods. For the assembly of large DNA molecules, the final steps of the assembly may be conducted in vivo, such as in a yeast host cell. The balance between use of in vitro and in vivo assembly steps can be determined by the practicality of the method with regard to the nature of the nucleic acid molecules to be assembled.
[0092] In one embodiment, assembly of the pairs of first and second polynucleotides with the insert polynucleotides is performed using an in vitro cloning method. The in vitro cloning method can be any in vitro cloning method that employs overlap assembly known in the art. The in vitro cloning method used in the methods provided herein can be selected from infusion cloning (Clontech.RTM.), Golden Gate Assembly, Gateway Assembly, Gibson Assembly, and NEB.RTM. HIFI assembly or any other suitable in vitro cloning method known in the art. Infusion cloning can entail mixing a first pool of pairs of first and second polynucleotides as provided herein and a second pool of insert polynucleotides as described herein with the infusion cloning reagent and then transforming the resultant assemblies into an E. coli cloning host cell. The in vitro cloning method can be any of the overlap assembly methods described in U.S. Pat. No. 8,968,999, which is herein incorporated by reference in its entirety. The in vitro cloning method can be any of the overlap assembly methods described in US20160060671, which is herein incorporated by reference in its entirety. The in vitro cloning method can be the Gibson assembly method described in Jun Urano, Ph.D. and Christine Chen, Ph.D., Gibson Assembly.RTM. Primer-Bridge End Joining (PBnJ) Cloning, Synthetic Genomics Application Note, which is herein incorporated by reference in its entirety. In one embodiment, a composition comprising pairs of first and second polynucleotides, insert polynucleotides and a cloning vector are joined using a 5'-3' exonuclease; and a strand-displacing polymerase also present in the composition. The composition can also comprise a buffer containing a potassium salt such as potassium chloride in a concentration range of 7 mM-150 mM, for example, 20 mM-50 mM. A sodium salt (e.g., sodium chloride) in the range of 10 mM-100 mM such as 20 mM may also be used in addition to potassium salt. In some embodiments, the composition does not contain a crowding agent such as polyethylene glycol (PEG), Ficoll, or dextran. In some embodiments, the composition comprises a single stranded (ss) binding protein. A ss DNA binding protein for use in the composition may be E. coli recA, T7 gene 2.5 product, RedB (from phage lambda) or RecT (from Rac prophage), ET SSB (extreme thermostable single-stranded DNA binding protein) or any other ss DNA binding proteins known in the art could be used in the composition. The inclusion of a ss binding protein can improve the efficiency of assembly particularly for nucleic acid fragments with longer overlap sequences (e.g. at least 20 nucleotides) than would be otherwise occur in the absence of ss binding protein as measured by colony number. In some embodiments, the composition does not contain a non-strand displacing polymerase.
[0093] In another embodiment, a composition comprising pairs of first and second polynucleotides, insert polynucleotides and a cloning vector are joined using an isolated non-thermostable 5' to 3' exonuclease that lacks 3' exonuclease activity, a crowding agent, a non-strand-displacing DNA polymerase with 3' exonuclease activity, or a mixture of said DNA polymerase with a second DNA polymerase that lacks 3' exonuclease activity, and a ligase. The composition can further comprise a mixture of dNTPs, and a suitable buffer, under conditions that are effective for joining the polynucleotides and the cloning vector. In some embodiment, the composition can further comprise a crowding agent. The crowding agent can be selected from polyethylene glycol (PEG), dextran or ficoll. In one embodiment, the crowding agent is PEG. The PEG can be used at a concentration of from about 3 to about 7% (weight/volume). The PEG can be selected from PEG-200, PEG-4000, PEG-6000, PEG-8000 or PEG-20,000. In some embodiments, the exonuclease of is a T5 exonuclease and the contacting is under isothermal conditions, and/or the crowding agent is PEG, and/or the non-strand-displacing DNA polymerase is PHUSION.RTM. DNA polymerase or VENTR.RTM. DNA polymerase, and/or a Taq ligase.
[0094] In one embodiment, assembly of the pairs of first and second polynucleotides with the insert polynucleotides is performed using an in vivo cloning method. The in vivo cloning method can be any in vivo cloning method known in the art. The in vivo cloning method can be a homologous recombination mediated cloning method. The in vivo cloning method used in the methods provided herein can be selected from E. coli (RecA-dependent, RecA-independent or Red/ET-dependent) homologous recombination, Overlap Extension PCR and Recombination (OEPR) cloning, yeast homologous recombination, and Transformation-associated recombination (TAR) cloning and gene assembly in Bacillus as described in Tsuge, Kenji et al. "One step assembly of multiple DNA fragments with a designed order and orientation in Bacillus subtilis plasmid." Nucleic acids research vol. 31, 21 (2003): e133, which is herein incorporated by reference.
Applications
[0095] The composition and assembly methods provided herein can be used to construct any desired assembly, such as plasmids, vectors, genes, metabolic pathways, minimal genomes, partial genomes, genomes, chromosomes, extrachromosomal nucleic acids, for example, cytoplasmic organelles, such as mitochondria (animals), and in chloroplasts and plastids (plants), and the like.
[0096] The compositions and assembly methods provided herein can be used to generate libraries of nucleic acid molecules, and methods to use modified whole or partial nucleic acid molecules as generated therefrom. The libraries can contain 2 or more variants, and said multiple variants, can be screened for members having desired characteristics, such as high production levels of desired products of interest, enhanced functionality of the product of interest, or decreased functionality (if that is advantageous). Such screening may be done by high throughput methods, which may be robotic/automated as provided herein.
[0097] The disclosure also further includes products made by the compositions and assembly methods provided herein, for example, the resulting assembled synthetic genes or genomes (synthetic or naturally occurring) and modified optimized genes and genomes, and the use(s) thereof.
[0098] The compositions and assembly methods provided herein can have a wide variety of applications, permitting, for example, the design of pathways for the synthesis of desired products of interest or optimization of one or more sequences whose gene products play a role in the synthesis or expression of a desired product. The compositions and assembly methods provided herein can also be used to generate optimized sequences of a gene or expression thereof or to combine one or more functional domains or motifs of protein encoded by a gene. The gene can be part of a biochemical or metabolic pathway. The biochemical or metabolic pathway can produce a desired product of interest.
[0099] The desired product of interest can be any molecule that can be assembled in a cell culture, eukaryotic or prokaryotic expression system or in a transgenic animal or plant. Thus, the nucleic acid molecules or libraries thereof that result from the deterministic assembly methods provided herein may be employed in a wide variety of contexts to produce desired products of interest. In some cases, the product of interest may be a small molecule, enzyme, peptide, amino acid, organic acid, synthetic compound, fuel, alcohol, etc. For example, the product of interest or biomolecule may be any primary or secondary extracellular metabolite. The primary metabolite may be, inter alia, ethanol, citric acid, lactic acid, glutamic acid, glutamate, lysine, threonine, tryptophan and other amino acids, vitamins, polysaccharides, etc. The secondary metabolite may be, inter alia, an antibiotic compound like penicillin, or an immunosuppressant like cyclosporin A, a plant hormone like gibberellin, a statin drug like lovastatin, a fungicide like griseofulvin, etc. The product of interest or biomolecule may also be any intracellular component produced by a host cell, such as: a microbial enzyme, including: catalase, amylase, protease, pectinase, glucose isomerase, cellulase, hemicellulase, lipase, lactase, streptokinase, and many others. The intracellular component may also include recombinant proteins, such as: insulin, hepatitis B vaccine, interferon, granulocyte colony-stimulating factor, streptokinase and others. The product of interest may also refer to a protein of interest.
Pathway Assembly
[0100] In one embodiment, the compositions and methods provided herein are used to assemble a gene or a variant thereof. The gene or variant thereof can encode a protein that is part of a metabolic or biochemical pathway. The variant can be a codon optimized version or mutated version of said gene. The metabolic or biochemical pathway can produce a product of interest as provided herein. In one embodiment, the gene sequence or variant thereof can be present as a payload sequence within an insert polynucleotide as provided herein. The pairs of first and second polynucleotides can comprise sequence such that when assembled with said insert polynucleotide can serve to facilitate targeting of and insertion into a locus in a genetic element (e.g., genome, plasmid, etc.) within a host cell using a gene editing method as provided herein. The locus can be a specific locus or a random locus. Alternatively, the pairs of first and second polynucleotides can comprise sequence that when assembled with said insert polynucleotide can serve to facilitate further assembly of the resultant assembly with other assemblies generated using the methods provided herein. The other assemblies can comprise one or more additional genes present within the same metabolic or biochemical pathway and in this way facilitate the assembly of said metabolic or biochemical pathway. All of the genes or variants thereof can be assembled using the technique described herein of overlapping sequences on a single vector for a particular metabolic or biochemical pathway, or independent vectors for each member of said pathway can be employed by mixing the vectors for each member in successive transformation mixtures. The assembly of the first and second polynucleotides with an insert polynucleotide can be accomplished via assembly overlap sequences present in each of the polynucleotides using the assembly overlap methods provided herein. The pairs of first and second polynucleotides can further comprise vector overlap sequence as provided herein to facilitate assembly into a suitable vector. The vector can be a replicating plasmid. In some cases, the first and/second polynucleotide can further comprise sequence of a regulatory or control element that can govern an aspect of the gene or variant thereof or the protein encoded thereby such as the transcription, translation, solubility, or degradation thereof. The regulatory or control element can be a promoter, terminator, solubility tag, degradation tag or degron.
[0101] In another embodiment, the gene sequence or variant thereof is spread across a pair of first and second polynucleotides and an insert polynucleotide located therebetween or spread across a first or second polynucleotide and an insert polynucleotide located therebetween. By suitable assembly overlap segments on each of the polynucleotides, a mixture containing all of the polynucleotides can be assembled in the correct order in a single reaction mixture using overlap assembly as provided herein. The resultant will be full-length coding sequences of the gene or variant thereof. The pairs of first and second polynucleotides can further comprise sequence such that when assembled with said insert polynucleotide can serve to facilitate targeting of and insertion into a locus in a genetic element (e.g., genome, plasmid, etc.) within a host cell using a gene editing method as provided herein. The locus can be a specific locus or a random locus. Alternatively, the pairs of first and second polynucleotides can further comprise sequence that when assembled with said insert polynucleotide can serve to facilitate further assembly of the resultant assembly with other assemblies generated using the methods provided herein. The other assemblies can comprise one or more additional genes present within the same metabolic or biochemical pathway and in this way facilitate the assembly of said metabolic or biochemical pathway. All of the genes or variants thereof can be assembled using the technique described herein of overlapping sequences on a single vector for a particular metabolic or biochemical pathway, or independent vectors for each member of said pathway can be employed by mixing the vectors for each member in successive transformation mixtures. The pairs of first and second polynucleotides can further comprise vector overlap sequence as provided herein to facilitate assembly into a suitable vector. The vector can be a replicating plasmid. In some cases, the first and/second polynucleotide can further comprise sequence of a regulatory or control element that can govern an aspect of the gene or variant thereof or the protein encoded thereby such as the transcription, translation, solubility, or degradation thereof. The regulatory or control element can be a promoter, terminator, solubility tag, degradation tag or degron.
[0102] In still another embodiment, the compositions and methods provided herein are used to assemble or combine nucleic acid sequence that encode motifs or domains of a target protein. The nucleic acid sequence encoding a particular motif or domain of a target protein can be spread across a pair of first and second polynucleotides and an insert polynucleotide located therebetween or spread across a first or second polynucleotide and an insert polynucleotide located therebetween. The nucleic acid sequence encoding a particular motif or domain of a target protein can be present on a first polynucleotide, while a second motif or domain of the target protein can be present on a second polynucleotide and an insert polynucleotide can be used to join said first and second motif or domain of the target protein using assembly overlap sequences present on each polynucleotide and overlap assembly methods as provided herein. In some cases, the insert polynucleotide can comprise a portion of the first and/or second motif or domain. In some cases, the insert polynucleotide can comprise a third motif or domain of the target protein. The pairs of first and second polynucleotides can further comprise sequence such that when assembled with said insert polynucleotide can serve to facilitate targeting of and insertion into a locus in a genetic element (e.g., genome, plasmid, etc.) within a host cell using a gene editing method as provided herein. The locus can be a specific locus or a random locus. The pairs of first and second polynucleotides can further comprise vector overlap sequence as provided herein to facilitate assembly into a suitable vector. The vector can be a replicating plasmid.
Gene Editing
[0103] As described herein, a composition comprising pairs of first and second polynucleotides as well as insert polynucleotides can be assembled into a library of nucleic acids comprising first and second polynucleotides with an insert polynucleotide therebetween that can be subsequently utilized to modify the genetic content of a host cell. As provided herein, the library of nucleic acids can comprise control elements (e.g., promoters, terminators, solubility tags, degradation tags or degrons), modified forms of genes (e.g., genes with desired SNP(s)), antisense nucleic acids, and/or one or more genes that are part of a metabolic or biochemical pathway. In one embodiment, the modification entails gene editing of the host cell. The gene editing can entail editing the genome of the host cell and/or a separate genetic element present in the host cell such as, for example, a plasmid or cosmid. The gene editing method that can utilize nucleic acid assemblies generated using the methods and compositions as provided herein can be any gene editing method or system known in the art and can be selected based on the host for which gene editing is desired. Non-limiting examples of gene editing include homologous recombination, CRISPR, TALENS, FOK, or other endonucleases.
[0104] Homologous Recombination
[0105] In one embodiment, the gene editing method is a homologous recombination based method known in the art. The homologous recombination based method can be selected from single-crossover homologous recombination, double-crossover homologous recombination, or lambda red recombineering. Further to this embodiment, the first polynucleotide and the second polynucleotide in a pair of first and second polynucleotides such that each comprise sequence directed to or complementary to a desired loci in a nucleic acid element (e.g., genome, plasmid or cosmid) of a host cell and thereby direct an insert polynucleotide located therebetween to a desired locus in the genetic element (e.g., genome, cosmid or plasmid) of the host cell. Accordingly, the sequence directed to or complementary to a desired loci present in the pair can be used to determine the location(s) in the genome, cosmid or plasmid that will be targeted for editing. As exemplified in FIG. 1, the sequence directed to or complementary to a desired loci can be located at or toward the proximal or 5' end of a first polynucleotide, while in the second polynucleotide the sequence directed to or complementary to a desired loci can be located at or near the distal or 3' end. In the first polynucleotide, the sequence directed to or complementary to a desired loci can be located upstream of an assembly overlap sequence present in the first polynucleotide and downstream of a vector overlap sequence, if present. In the second polynucleotide, the sequence directed to or complementary to a desired loci can be located downstream of an assembly overlap sequence present in the second polynucleotide and upstream of a vector overlap sequence, if present.
[0106] In one embodiment, for each pair in a pool containing pairs of first polynucleotide and second polynucleotides, the sequence that is complementary to a desired loci in a pair is complementary to a different target locus in a host cell as compared to each other pair in the said pool.
[0107] In another embodiment, for each pair in a pool containing pairs of first polynucleotide and second polynucleotides, the sequence that is complementary to a desired loci in a pair is complementary to the same target locus in a host cell as compared to each other pair in the said pool.
[0108] Loop-in/Loop-Out
[0109] In some embodiments, the present disclosure teaches methods of looping out selected regions of DNA from the host organisms. The looping out method can be as described in Nakashima et al. 2014 "Bacterial Cellular Engineering by Genome Editing and Gene Silencing." Int. J. Mol. Sci. 15(2), 2773-2793. Looping out deletion techniques are known in the art, and are described in Tear et al. 2014 "Excision of Unstable Artificial Gene-Specific inverted Repeats Mediates Scar-Free Gene Deletions in Escherichia coli." Appl. Biochem. Biotech. 175:1858-1867. The looping out methods used in the methods provided herein can be performed using single-crossover homologous recombination or double-crossover homologous recombination. In one embodiment, looping out of selected regions can entail using single-crossover homologous recombination.
[0110] In one embodiment, a composition provided herein comprises pairs of first and second polynucleotides (e.g., left/right homology arms), insert polynucleotides and a vector such that assembly of the pairs of first and second polynucleotides with an insert polynucleotide and a vector using an in in vitro or in vivo assembly method as provided herein generates loop out vectors. In one embodiment, single-crossover homologous recombination is used between a loop-out vector and the host cell genome in order to loop-in said vector. The vector could comprise a marker that facilitates selection of looped-out clones after the loop-in step. In another embodiment, double-crossover homologous recombination is used between a loop-out vector and the host cell genome in order to integrate said vector. The insert sequence within the loop-out vector can be designed with a sequence, which is a direct repeat of an existing or introduced nearby host sequence, such that the direct repeats flank the region of DNA slated for looping and deletion. The insert sequence could further comprise a marker that facilitates selection of looped-out clones. Once inserted, cells containing the loop out plasmid or vector can be counter selected for deletion of the selection region.
[0111] In one aspect provided herein, polynucleotides or polynucleotide libraries generated using the compositions and/or methods provided herein can be used in a gene editing method that can entail the use of sets of proteins from one or more recombination systems. Said recombination systems can be endogenous to the microbial host cell or can be introduced heterologously. The sets of proteins of the one or more heterologous recombination systems can be introduced as nucleic acids (e.g., as plasmid, linear DNA or RNA, or integron) and be integrated into the genome of the host cell or be stably expressed from an extrachromosomal element. The sets of proteins of the one or more heterologous recombination systems can be introduced as RNA and be translated by the host cell. The sets of proteins of the one or more heterologous recombination systems can be introduced as proteins into the host cell. The sets of proteins of the one or more recombination systems can be from a lambda red recombination system, a RecET recombination system, a Red/ET recombination system, any homologs, orthologs or paralogs of proteins from a lambda red recombination system, a RecET recombination system, or Red/ET recombination system or any combination thereof. The recombination methods and/or sets of proteins from the RecET recombination system can be any of those as described in Zhang Y., Buchholz F., Muyrers J. P. P. and Stewart A. F. "A new logic for DNA engineering using recombination in E. coli." Nature Genetics 20 (1998) 123-128; Muyrers, J. P. P., Zhang, Y., Testa, G., Stewart, A. F. "Rapid modification of bacterial artificial chromosomes by ET-recombination." Nucleic Acids Res. 27 (1999) 1555-1557; Zhang Y., Muyrers J. P. P., Testa G. and Stewart A. F. "DNA cloning by homologous recombination in E. coli." Nature Biotechnology 18 (2000) 1314-1317 and Muyrers J P et al., "Techniques: Recombinogenic engineering--new options for cloning and manipulating DNA" Trends Biochem Sci. 2001 May; 26(5):325-31, which are herein incorporated by reference. The sets of proteins from the Red/ET recombination system can be any of those as described in Rivero-Muller, Adolfo et al. "Assisted large fragment insertion by Red/ET-recombination (ALFIRE)--an alternative and enhanced method for large fragment recombineering" Nucleic acids research vol. 35, 10 (2007): e78, which is herein incorporated by reference.
[0112] Lambda RED Mediated Gene Editing
[0113] As provided herein, gene editing as described herein can be performed using Lambda Red-mediated homologous recombination as described by Datsenko and Wanner, PNAS USA 97:6640-6645 (2000), the contents of which are hereby incorporated by reference in their entirety.
[0114] To use the lambda red recombineering system to modify target DNA, a linear donor DNA substrate (either dsDNA or ssDNA) can be electroporated into E. coli expressing the set of proteins from the lambda red recombination system. The set of proteins from the lambda red recombination system can comprise the exo, beta or gam proteins or any combination thereof. Gam can prevent both the endogenous RecBCD and SbcCD nucleases from digesting the linear donor DNA (either dsDNA or ssDNA) introduced into a microbial host cell, while exo is a 5'-3' dsDNA-dependent exonuclease that can degrade linear dsDNA starting from the 5' end and generate 2 possible products (i.e., a partially dsDNA duplex with single-stranded 3' overhangs or a ssDNA whose entire complementary strand was degraded) and beta can protect the ssDNA created by Exo and promote its annealing to a complementary ssDNA target in the cell. Beta expression can be required for lambda red based recombination with an ssDNA oligo substrate as described at blog.addgene.org/lambda-red-a-homologous-recombination-based-technique-fo- r-genetic-engineering, the contents of which are herein incorporated by reference.
[0115] The linear donor DNA substrate (either dsDNA or ssDNA) can be an assembly comprising a pair of first and second polynucleotides with an insert polynucleotide located therebetween generated using the methods and compositions provided herein. The pair of first and second polynucleotides can comprise genomic targeting sequences that target said donor DNA substrate to a specific locus in the genome of the host cell. These enzymes then catalyze the homologous recombination of the substrate with the target DNA sequence. This means cloning occurs in vivo, as compared to restriction enzyme cloning where the genetic changes occur in a test tube. The donor DNA substrate only requires .about.50 nucleotides of homology to the target site for recombination. As described at blog.addgene.org/lambda-red-a-homologous-recombination-based-technique-fo- r-genetic-engineering, whether a linear dsDNA or ssDNA substrate is used can depend on the goal of the experiment. dsDNA substrate may be best for insertions or deletions greater than approximately 20 nucleotides, while ssDNA substrate may be best for point mutations or changes of only a few base pairs.
[0116] dsDNA substrate can be made using the compositions and methods provided herein such that the pairs of first and second polynucleotides comprise about 50 base pairs of homology to the targeted insert site on opposing terminal ends. The dsDNA insert polynucleotide present within the substrate can include: large insertions or deletions, including selectable DNA fragments, such as antibiotic resistance genes, as well as non-selectable DNA fragments, such as gene replacements and tags.
[0117] ssDNA substrates can be also be made using the compositions and methods provided herein such that the pairs of first and second polynucleotides comprise about 50 base pairs of homology to the targeted insert site on opposing terminal ends and can have the desired alteration(s) located in the center of the sequence (i.e., within the insert polynucleotide).
[0118] ssDNA substrate can be more efficient than dsDNA with a recombination frequency between 0.1% to 1%, and can be increased to as high as 25-50% by designing substrates that avoid activating the methyl-directed mismatch repair (MMR) system. MMR's job is to correct DNA mismatches that occur during DNA replication. Activation of MMR can be avoided by: 1) using a strain of bacteria that has key MMR proteins knocked out or 2) specially design ssDNA substrates to avoid MMR: 1) E. coli with inactivated MMR: Using E. coli with inactive MMR is definitely the easier of the two options, but these cells are prone to mutations and can have more unintended changes to their genomes. 2) Designing ssDNA substrates that avoid MMR activation: In one embodiment, a C/C mismatch at or within 6 base pairs of the edit site is introduced. In another embodiment, the desired change is flanked with 4-5 silent changes in the wobble codons, i.e. make changes to the third base pair of the adjacent 4-5 codons that alter the nucleotide sequence but not the amino acid sequence of the translated protein. These changes can be 5' or 3' of the desired change.
[0119] In one embodiment, the polynucleotides or polynucleotide libraries generated using the compositions and/or methods provided herein can be used in a gene editing method that is implemented in a microbial host cell that already stably expresses lambda red recombination genes such as the DY380 strain described at blog.addgene.org/lambda-red-a-homologous-recombination-based-technique-fo- r-genetic-engineering, the contents of which are herein incorporated by reference. Other bacterial strains that comprise components of the lambda red recombination system and can be utilized to generate the organism to be genotyped using an enrichment method provided herein (e.g., CS-seq or SG-seq) can be found in Thomason et al (Recombineering: Genetic Engineering in Bacteria Using Homologous Recombination. Current Protocols in Molecular Biology. 106:V:1.16:1.16.1-1.16.39) and Sharan et al (Recombineering: A Homologous Recombination-Based Method of Genetic Engineering. Nature protocols. 2009; 4(2):206-223), the contents of each of which are herein incorporated by reference.
[0120] As provided herein, the set of proteins of the lambda red recombination system can be introduced into the microbial host cell prior to implementation of any of the editing methods known in the art and/or provided herein. Genes for each of the proteins of the lambda red recombination system can be introduced on nucleic acids (e.g., as plasmids, linear DNA or RNA, a mini-k, a lambda red prophage or integrons) and be integrated into the genome of the host cell or expressed from an extrachromosomal element. In some cases, each of the components (i.e., exo, beta, gam or combinations thereof) of the lambda red recombination system can be introduced as an RNA and be translated by the host cell. In some cases, each of the components (i.e., exo, beta, gam or combinations thereof) of the lambda red recombination system can be introduced as a protein into the host cell.
[0121] In one embodiment, genes for the set of proteins of the lambda red recombination system are introduced on a plasmid. The set of proteins of the lambda red recombination system on the plasmid can be under the control of a promoter such as, for example, the endogenous phage pL promoter. In one embodiment, the set of proteins of the lambda red recombination system on the plasmid is under the control of an inducible promoter. The inducible promoter can be inducible by the addition or depletion of a reagent or by a change in temperature. In one embodiment, the set of proteins of the lambda red recombination system on the plasmid is under the control of an inducible promoter such as the IPTG-inducible lac promoter or the arabinose-inducible pBAD promoter. A plasmid expressing genes for the set of proteins of the lambda red recombination system can also express repressors associated with a specific promoter such as, for example, the lad, araC or cI857 repressors associated with the IPTG-inducible lac promoter, the arabinose-inducible pBAD promoter and the endogenous phage pL promoters, respectively.
[0122] In one embodiment, genes for the set of proteins of the lambda red recombination system are introduced on a mini-.lamda., which a defective non-replicating, circular piece of phage DNA, that when introduced into microbial host cell, integrates into the genome as described at blog.addgene.org/lambda-red-a-homologous-recombination-based-technique-fo- r-genetic-engineering, the contents of which are herein incorporated by reference.
[0123] In one embodiment, genes for the set of proteins of the lambda red recombination system are introduced on a lambda red prophage, which can allow for stable integration of the lambda red recombination system into a microbial host cell such as described at blog.addgene.org/lambda-red-a-homologous-recombination-based-technique-fo- r-genetic-engineering, the contents of which are herein incorporated by reference.
[0124] CRISPR Mediated Gene Editing
[0125] In one aspect provided herein, a genetic element (e.g., genome, cosmid, or plasmid) of a host cell can be modified by CRISPR.
[0126] The CRISPR/Cas system is a prokaryotic immune system that confers resistance to foreign genetic elements such as those present within plasmids and phages and that provides a form of acquired immunity. CRISPR stands for Clustered Regularly Interspaced Short Palindromic Repeat, and cas stands for CRISPR-associated system, and refers to the small cas genes associated with the CRISPR complex.
[0127] CRISPR-Cas systems are most broadly characterized as either Class 1 or Class 2 systems. The main distinguishing feature between these two systems is the nature of the Cas-effector module. Class 1 systems require assembly of multiple Cas proteins in a complex (referred to as a "Cascade complex") to mediate interference, while Class 2 systems use a large single Cas enzyme to mediate interference. Each of the Class 1 and Class 2 systems are further divided into multiple CRISPR-Cas types based on the presence of a specific Cas protein. For example, the Class 1 system is divided into the following three types: Type I systems, which contain the Cas3 protein; Type III systems, which contain the Cas10 protein; and the putative Type IV systems, which contain the Csf1 protein, a Cas8-like protein. Class 2 systems are generally less common than Class 1 systems and are further divided into the following three types: Type II systems, which contain the Cas9 protein; Type V systems, which contain Cas12a protein (previously known as Cpf1, and referred to as Cpf1 herein), Cas12b (previously known as C2c1), Cas12c (previously known as C2c3), Cas12d (previously known as CasY), and Cas12e (previously known as CasX); and Type VI systems, which contain Cas13a (previously known as C2c2), Cas13b, and Cas13c. Pyzocha et al., ACS Chemical Biology, Vol. 13 (2), pgs. 347-356. In one embodiment, the CRISPR-Cas system for use in the methods provided herein is a Class 2 system. In one embodiment, the CRISPR-Cas system for use in the methods provided herein is a Type II, Type V or Type VI Class 2 system. In one embodiment, the CRISPR-Cas system for use in the methods provided herein is selected from Cas9, Cas12a, Cas12b, Cas12c, Cas12d, Cas12e, Cas13a, Cas13b, Cas13c or homologs, orthologs or paralogs thereof.
[0128] CRISPR systems used in methods disclosed herein comprise a Cas effector module comprising one or more nucleic acid guided CRISPR-associated (Cas) nucleases, referred to herein as Cas effector proteins. In some embodiments, the Cas proteins can comprise one or multiple nuclease domains. A Cas effector protein can target single stranded or double stranded nucleic acid molecules (e.g. DNA or RNA nucleic acids) and can generate double strand or single strand breaks. In some embodiments, the Cas effector proteins are wild-type or naturally occurring Cas proteins. In some embodiments, the Cas effector proteins are mutant Cas proteins, wherein one or more mutations, insertions, or deletions are made in a WT or naturally occurring Cas protein (e.g., a parental Cas protein) to produce a Cas protein with one or more altered characteristics compared to the parental Cas protein.
[0129] In some instances, the Cas protein is a wild-type (WT) nuclease. Non-limiting examples of suitable Cas proteins for use in the present disclosure include C2c1, C2c2, C2c3, Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cash, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Cpf1, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm1, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx100, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, MAD1-20, SmCsm1, homologues thereof, orthologues thereof, variants thereof, mutants thereof, or modified versions thereof. Suitable nucleic acid guided nucleases (e.g., Cas 9) can be from an organism from a genus, which includes but is not limited to: Thiomicrospira, Succinivibrio, Candidatus, Porphyromonas, Acidomonococcus, Prevotella, Smithella, Moraxella, Synergistes, Francisella, Leptospira, Catenibacterium, Kandleria, Clostridium, Dorea, Coprococcus, Enterococcus, Fructobacillus, Weissella, Pediococcus, Corynebacter, Sutterella, Legionella, Treponema, Roseburia, Filifactor, Eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flaviivola, Flavobacterium, Sphaerochaeta, Azospirillum, Gluconacetobacter, Neisseria, Roseburia, Parvibaculum, Staphylococcus, Nitratifractor, Alicyclobacillus, Brevibacilus, Bacillus, Bacteroidetes, Carnobacterium, Clostridiaridium, Clostridium, Desulfonatronum, Desulfovibrio, Helcococcus, Leptotrichia, Listeria, Methanomethyophilus, Methylobacterium, Opitutaceae, Paludibacter, Rhodobacter, Sphaerochaeta, Tuberibacillus, and Campylobacter. Species of organism of such a genus can be as otherwise herein discussed.
[0130] Suitable nucleic acid guided nucleases (e.g., Cas9) can be from an organism from a phylum, which includes but is not limited to: Firmicute, Actinobacteria, Bacteroidetes, Proteobacteria, Spirochates, and Tenericutes. Suitable nucleic acid guided nucleases can be from an organism from a class, which includes but is not limited to: Erysipelotrichia, Clostridia, Bacilli, Actinobacteria, Bacteroidetes, Flavobacteria, Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Deltaproteobacteria, Epsilonproteobacteria, Spirochaetes, and Mollicutes. Suitable nucleic acid guided nucleases can be from an organism from an order, which includes but is not limited to: Clostridiales, Lactobacillales, Actinomycetales, Bacteroidales, Flavobacteriales, Rhizobiales, Rhodospirillales, Burkholderiales, Neisseriales, Legionellales, Nautiliales, Campylobacterales, Spirochaetales, Mycoplasmatales, and Thiotrichales. Suitable nucleic acid guided nucleases can be from an organism from within a family, which includes but is not limited to: Lachnospiraceae, Enterococcaceae, Leuconostocaceae, Lactobacillaceae, Streptococcaceae, Peptostreptococcaceae, Staphylococcaceae, Eubacteriaceae, Corynebacterineae, Bacteroidaceae, Flavobacterium, Cryomoorphaceae, Rhodobiaceae, Rhodospirillaceae, Acetobacteraceae, Sutterellaceae, Neisseriaceae, Legionellaceae, Nautiliaceae, Campylobacteraceae, Spirochaetaceae, Mycoplasmataceae, and Francisellaceae.
[0131] Other nucleic acid guided nucleases (e.g., Cas9) suitable for use in the methods, systems, and compositions of the present disclosure include those derived from an organism such as, but not limited to: Thiomicrospira sp. XS5, Eubacterium rectale, Succinivibrio dextrinosolvens, Candidatus Methanoplasma termitum, Candidatus Methanomethylophilus alvus, Porphyromonas crevioricanis, Flavobacterium branchiophilum, Acidomonococcus sp., Lachnospiraceae bacterium COE1, Prevotella brevis ATCC 19188, Smithella sp. SCADC, Moraxella bovoculi, Synergistes jonesii, Bacteroidetes oral taxon 274, Francisella tularensis, Leptospira inadai serovar Lyme str. 10, Acidomonococcus sp. crystal structure (5B43) S. mutans, S. agalactiae, S. equisimilis, S. sanguinis, S. pneumonia; C. jejuni, C. coli; N. salsuginis, N. tergarcus; S. auricularis, S. carnosus; N. meningitides, N. gonorrhoeae; L. monocytogenes, L. ivanovii; C. botulinum, C. difficile, C. tetani, C. sordellii; Francisella tularensis 1, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Microgenomates, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, Porphyromonas macacae, Catenibacterium sp. CAG:290, Kandleria vitulina, Clostridiales bacterium KA00274, Lachnospiraceae bacterium 3-2, Dorea longicatena, Coprococcus catus GD/7, Enterococcus columbae DSM 7374, Fructobacillus sp. EFB-N1, Weissella halotolerans, Pediococcus acidilactici, Lactobacillus curvatus, Streptococcus pyogenes, Lactobacillus versmoldensis, and Filifactor alocis ATCC 35896. See, U.S. Pat. Nos. 8,697,359; 8,771,945; 8,795,965; 8,865,406; 8,871,445; 8,889,356; 8,895,308; 8,906,616; 8,932,814; 8,945,839; 8,993,233; 8,999,641; 9,822,372; 9,840,713; U.S. patent application Ser. No. 13/842,859 (US 2014/0068797 A1); U.S. Pat. Nos. 9,260,723; 9,023,649; 9,834,791; 9,637,739; U.S. patent application Ser. No. 14/683,443 (US 2015/0240261 A1); U.S. patent application Ser. No. 14/743,764 (US 2015/0291961 A1); U.S. Pat. Nos. 9,790,490; 9,688,972; 9,580,701; 9,745,562; 9,816,081; 9,677,090; 9,738,687; U.S. application Ser. No. 15/632,222 (US 2017/0369879 A1); U.S. application Ser. No. 15/631,989; U.S. application Ser. No. 15/632,001; and U.S. Pat. No. 9,896,696, each of which is herein incorporated by reference.
[0132] In some embodiments, a Cas effector protein comprises one or more of the following activities:
[0133] a nickase activity, i.e., the ability to cleave a single strand of a nucleic acid molecule;
[0134] a double stranded nuclease activity, i.e., the ability to cleave both strands of a double stranded nucleic acid and create a double stranded break;
[0135] an endonuclease activity;
[0136] an exonuclease activity; and/or
[0137] a helicase activity, i.e., the ability to unwind the helical structure of a double stranded nucleic acid.
[0138] In aspects of the disclosure the term "guide nucleic acid" refers to a polynucleotide comprising 1) a guide sequence capable of hybridizing to a target sequence (referred to herein as a "targeting segment") and 2) a scaffold sequence capable of interacting with (either alone or in combination with a tracrRNA molecule) a nucleic acid guided nuclease as described herein (referred to herein as a "scaffold segment"). A guide nucleic acid can be DNA. A guide nucleic acid can be RNA. A guide nucleic acid can comprise both DNA and RNA. A guide nucleic acid can comprise modified non-naturally occurring nucleotides. In cases where the guide nucleic acid comprises RNA, the RNA guide nucleic acid can be encoded by a DNA sequence on a polynucleotide molecule such as a plasmid, linear construct generated using the methods and compositions provided herein.
[0139] In some embodiments, the guide nucleic acids described herein are RNA guide nucleic acids ("guide RNAs" or "gRNAs") and comprise a targeting segment and a scaffold segment. In some embodiments, the scaffold segment of a gRNA is comprised in one RNA molecule and the targeting segment is comprised in another separate RNA molecule. Such embodiments are referred to herein as "double-molecule gRNAs" or "two-molecule gRNA" or "dual gRNAs." In some embodiments, the gRNA is a single RNA molecule and is referred to herein as a "single-guide RNA" or an "sgRNA." The term "guide RNA" or "gRNA" is inclusive, referring both to two-molecule guide RNAs and sgRNAs.
[0140] In one embodiment, an assembly comprising a pair of first and second polynucleotides with an insert polynucleotide located therebetween generated using the methods and compositions provided herein is a guide RNA (gRNA). In some cases, the methods provided herein are used to generate a library of gRNAs.
[0141] The DNA-targeting segment of a gRNA comprises a nucleotide sequence that is complementary to a sequence in a target nucleic acid sequence. As such, the targeting segment of a gRNA interacts with a target nucleic acid in a sequence-specific manner via hybridization (i.e., base pairing), and the nucleotide sequence of the targeting segment determines the location within the target DNA that the gRNA will bind. The degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences. In some embodiments, a guide sequence is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 75, or more nucleotides in length. In some embodiments, a guide sequence is less than about 75, 50, 45, 40, 35, 30, 25, 20 nucleotides in length. In aspects, the guide sequence is 10-30 nucleotides long. The guide sequence can be 15-20 nucleotides in length. The guide sequence can be 15 nucleotides in length. The guide sequence can be 16 nucleotides in length. The guide sequence can be 17 nucleotides in length. The guide sequence can be 18 nucleotides in length. The guide sequence can be 19 nucleotides in length. The guide sequence can be 20 nucleotides in length.
[0142] The scaffold segment of a guide RNA interacts with a one or more Cas effector proteins to form a ribonucleoprotein complex (referred to herein as a CRISPR-RNP or a RNP-complex). The guide RNA directs the bound polypeptide to a specific nucleotide sequence within a target nucleic acid sequence via the above-described targeting segment. The scaffold segment of a guide RNA comprises two stretches of nucleotides that are complementary to one another and which form a double stranded RNA duplex. Sufficient sequence within the scaffold sequence to promote formation of a targetable nuclease complex may include a degree of complementarity along the length of two sequence regions within the scaffold sequence, such as one or two sequence regions involved in forming a secondary structure. In some cases, the one or two sequence regions are comprised or encoded on the same polynucleotide. In some cases, the one or two sequence regions are comprised or encoded on separate polynucleotides. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self-complementarity within either the one or two sequence regions. In some embodiments, the degree of complementarity between the one or two sequence regions along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, at least one of the two sequence regions is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length.
[0143] A scaffold sequence of a subject gRNA can comprise a secondary structure. A secondary structure can comprise a pseudoknot region or stem-loop structure. In some examples, the compatibility of a guide nucleic acid and nucleic acid guided nuclease is at least partially determined by sequence within or adjacent to the secondary structure region of the guide RNA. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid guided nuclease is determined in part by secondary structures within the scaffold sequence. In some cases, binding kinetics of a guide nucleic acid to a nucleic acid guided nuclease is determined in part by nucleic acid sequence with the scaffold sequence.
[0144] A compatible scaffold sequence for a gRNA-Cas effector protein combination can be found by scanning sequences adjacent to a native Cas nuclease loci. In other words, native Cas nucleases can be encoded on a genome within proximity to a corresponding compatible guide nucleic acid or scaffold sequence.
[0145] Nucleic acid guided nucleases can be compatible with guide nucleic acids that are not found within the nucleases endogenous host. Such orthogonal guide nucleic acids can be determined by empirical testing. Orthogonal guide nucleic acids can come from different bacterial species or be synthetic or otherwise engineered to be non-naturally occurring. Orthogonal guide nucleic acids that are compatible with a common nucleic acid-guided nuclease can comprise one or more common features. Common features can include sequence outside a pseudoknot region. Common features can include a pseudoknot region. Common features can include a primary sequence or secondary structure.
[0146] A guide nucleic acid can be engineered to target a desired target sequence by altering the guide sequence such that the guide sequence is complementary to the target sequence, thereby allowing hybridization between the guide sequence and the target sequence. A guide nucleic acid with an engineered guide sequence can be referred to as an engineered guide nucleic acid. Engineered guide nucleic acids are often non-naturally occurring and are not found in nature.
[0147] In some embodiments, the present disclosure provides a polynucleotide encoding a gRNA generated using the compositions and methods provided herein. In some embodiments, the composition comprising a pair of first and second polynucleotides and an insert polynucleotide further comprises an expression vector such that assembly of the pair of first and second polynucleotides with the insert polynucleotide and expression vector generates an expression vector comprising a gRNA-encoding nucleic acid.
[0148] In another embodiment, an assembly comprising a pair of first and second polynucleotides with an insert polynucleotide located therebetween generated using the methods and compositions provided herein is a donor DNA sequence. In some cases, the methods provided herein are used to generate a library of donor DNA sequences. The donor DNA sequence can be used in combination with a guide RNA (gRNA) in a CRISPR method of gene editing using homology directed repair (HDR). The CRISPR complex can result in the strand breaks within the target gene(s) that can be repaired by using homology directed repair (HDR). HDR mediated repair can be facilitated by co-transforming the host cell with a donor DNA sequence generated using the methods and compositions provided herein. The donor DNA sequence can comprise a desired genetic perturbation (e.g., deletion, insertion, and/or single nucleotide polymorphism) as well as targeting sequences derived from the first and second polynucleotides. In this embodiment, the CRISPR complex cleaves the target gene specified by the one or more gRNAs. The donor DNA sequence can then be used as a template for the homologous recombination machinery to incorporate the desired genetic perturbation into the host cell. The donor DNA can be single-stranded, double-stranded or a double-stranded plasmid. The donor DNA can lack a PAM sequence or comprise a scrambled, altered or non-functional PAM in order to prevent re-cleavage. In some cases, the donor DNA can contain a functional or non-altered PAM site. The mutated or edited sequence in the donor DNA (also flanked by the regions of homology) prevents re-cleavage by the CRISPR-complex after the mutation(s) has/have been incorporated into the genome.
Host Cells
[0149] As provided herein, the libraries of nucleic acid constructs generated using the compositions and/or methods provided herein can be used to edit or modify a genetic element (e.g., genome, cosmid or plasmid) of a host cell or engineer the host cell via introducing (e.g., transforming or transducing) one or more genetic element(s) (e.g., plasmid or cosmid) into said host cell. The genomic engineering or editing methods can be applicable to any organism where desired traits can be identified in a population of genetic mutants. The organism can be a microorganism or higher eukaryotic organism.
[0150] Thus, as used herein, the term "microorganism" should be taken broadly. It includes, but is not limited to, the two prokaryotic domains, Bacteria and Archaea, as well as certain eukaryotic fungi and protists. However, in certain aspects, "higher" eukaryotic organisms such as insects, plants, and animals can be utilized in the methods taught herein.
[0151] Suitable host cells include, but are not limited to: bacterial cells, algal cells, plant cells, fungal cells, insect cells, and mammalian cells. In one illustrative embodiment, suitable host cells include E. coli (e.g., SHuffle.TM. competent E. coli available from New England BioLabs in Ipswich, Mass.).
[0152] Other suitable host organisms of the present disclosure include microorganisms of the genus Corynebacterium. In some embodiments, preferred Corynebacterium strains/species include: C. efficiens, with the deposited type strain being DSM44549, C. glutamicum, with the deposited type strain being ATCC13032, and C. ammoniagenes, with the deposited type strain being ATCC6871. In some embodiments, the preferred host of the present disclosure is C. glutamicum.
[0153] Suitable host strains of the genus Corynebacterium, in particular of the species Corynebacterium glutamicum, are in particular the known wild-type strains: Corynebacterium glutamicum ATCC13032, Corynebacterium acetoglutamicum ATCC 15806, Corynebacterium acetoacidophilum ATCC 13870, Corynebacterium melassecola ATCC17965, Corynebacterium thermoaminogenes FERM BP-1539, Brevibacterium flavum ATCC14067, Brevibacterium lactofermentum ATCC13869, and Brevibacterium divaricatum ATCC14020; and L-amino acid-producing mutants, or strains, prepared therefrom, such as, for example, the L-lysine-producing strains: Corynebacterium glutamicum FERM-P 1709, Brevibacterium flavum FERM-P 1708, Brevibacterium lactofermentum FERM-P 1712, Corynebacterium glutamicum FERM-P 6463, Corynebacterium glutamicum FERM-P 6464, Corynebacterium glutamicum DM58-1, Corynebacterium glutamicum DG52-5, Corynebacterium glutamicum DSM5714, and Corynebacterium glutamicum DSM12866.
[0154] The term "Micrococcus glutamicus" has also been in use for C. glutamicum. Some representatives of the species C. efficiens have also been referred to as C. thermoaminogenes in the prior art, such as the strain FERM BP-1539, for example.
[0155] In some embodiments, the host cell of the present disclosure is a eukaryotic cell. Suitable eukaryotic host cells include, but are not limited to: fungal cells, algal cells, insect cells, animal cells, and plant cells. Suitable fungal host cells include, but are not limited to: Ascomycota, Basidiomycota, Deuteromycota, Zygomycota, Fungi imperfecti. Certain preferred fungal host cells include yeast cells and filamentous fungal cells. Suitable filamentous fungi host cells include, for example, any filamentous forms of the subdivision Eumycotina and Oomycota. (see, e.g., Hawksworth et al., In Ainsworth and Bisby's Dictionary of The Fungi, 8.sup.th edition, 1995, CAB International, University Press, Cambridge, UK, which is incorporated herein by reference). Filamentous fungi are characterized by a vegetative mycelium with a cell wall composed of chitin, cellulose and other complex polysaccharides. The filamentous fungi host cells are morphologically distinct from yeast.
[0156] In certain illustrative, but non-limiting embodiments, the filamentous fungal host cell may be a cell of a species of: Achlya, Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Cephalosporium, Chrysosporium, Cochliobolus, Corynascus, Cryphonectria, Cryptococcus, Coprinus, Coriolus, Diplodia, Endothis, Fusarium, Gibberella, Gliocladium, Humicola, Hypocrea, Myceliophthora (e.g., Myceliophthora thermophila), Mucor, Neurospora, Penicillium, Podospora, Phlebia, Piromyces, Pyricularia, Rhizomucor, Rhizopus, Schizophyllum, Scytalidium, Sporotrichum, Talaromyces, Thermoascus, Thielavia, Trametes, Tolypocladium, Trichoderma, Verticillium, Volvariella, or teleomorphs, or anamorphs, and synonyms or taxonomic equivalents thereof.
[0157] Suitable yeast host cells include, but are not limited to: Candida, Hansenula, Saccharomyces, Schizosaccharomyces, Pichia, Kluyveromyces, and Yarrowia. In some embodiments, the yeast cell is Hansenula polymorpha, Saccharomyces cerevisiae, Saccaromyces carlsbergensis, Saccharomyces diastaticus, Saccharomyces norbensis, Saccharomyces kluyveri, Schizosaccharomyces pombe, Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia kodamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia quercuum, Pichia pijperi, Pichia stipitis, Pichia methanolica, Pichia angusta, Kluyveromyces lactis, Candida albicans, or Yarrowia lipolytica.
[0158] In certain embodiments, the host cell is an algal such as, Chlamydomonas (e.g., C. Reinhardtii) and Phormidium (P. sp. ATCC29409).
[0159] In other embodiments, the host cell is a prokaryotic cell. Suitable prokaryotic cells include gram positive, gram negative, and gram-variable bacterial cells. The host cell may be a species of, but not limited to: Agrobacterium, Alicyclobacillus, Anabaena, Anacystis, Acinetobacter, Acidothermus, Arthrobacter, Azobacter, Bacillus, Bifidobacterium, Brevibacterium, Butyrivibrio, Buchnera, Campestris, Campylobacter, Clostridium, Corynebacterium, Chromatium, Coprococcus, Escherichia, Enterococcus, Enterobacter, Envinia, Fusobacterium, Faecalibacterium, Francisella, Flavobacterium, Geobacillus, Haemophilus, Helicobacter, Klebsiella, Lactobacillus, Lactococcus, Ilyobacter, Micrococcus, Microbacterium, Mesorhizobium, Methylobacterium, Mycobacterium, Neisseria, Pantoea, Pseudomonas, Prochlorococcus, Rhodobacter, Rhodopseudomonas, Roseburia, Rhodospirillum, Rhodococcus, Scenedesmus, Streptomyces, Streptococcus, Synechococcus, Saccharomonospora, Staphylococcus, Serratia, Salmonella, Shigella, Thermoanaerobacterium, Tropheryma, Tularensis, Temecula, Thermosynechococcus, Thermococcus, Ureaplasma, Xanthomonas, Xylella, Yersinia, and Zymomonas. In some embodiments, the host cell is Corynebacterium glutamicum.
[0160] In some embodiments, the bacterial host strain is an industrial strain. Numerous bacterial industrial strains are known and suitable in the methods and compositions described herein.
[0161] In some embodiments, the bacterial host cell is of the Agrobacterium species (e.g., A. radiobacter, A. rhizogenes, A. rubi), the Arthrobacter species (e.g., A. aurescens, A. citreus, A. globformis, A. hydrocarboglutamicus, A. mysorens, A. nicotianae, A. paraffineus, A. protophonniae, A. roseoparaffinus, A. sulfureus, A. ureafaciens), the Bacillus species (e.g., B. thuringiensis, B. anthracis, B. megaterium, B. subtilis, B. lentus, B. circulars, B. pumilus, B. lautus, B. coagulans, B. brevis, B. firmus, B. alkaophius, B. licheniformis, B. clausii, B. stearothermophilus, B. halodurans and B. amyloliquefaciens. In particular embodiments, the host cell will be an industrial Bacillus strain including but not limited to B. subtilis, B. pumilus, B. licheniformis, B. megaterium, B. clausii, B. stearothermophilus and B. amyloliquefaciens. In some embodiments, the host cell will be an industrial Clostridium species (e.g., C. acetobutylicum, C. tetani E88, C. lituseburense, C. saccharobutylicum, C. perfringens, C. beijerinckii). In some embodiments, the host cell will be an industrial Corynebacterium species (e.g., C. glutamicum, C. acetoacidophilum). In some embodiments, the host cell will be an industrial Escherichia species (e.g., E. coli). In some embodiments, the host cell will be an industrial Envinia species (e.g., E. uredovora, E. carotovora, E. ananas, E. herbicola, E. punctata, E. terreus). In some embodiments, the host cell will be an industrial Pantoea species (e.g., P. citrea, P. agglomerans). In some embodiments, the host cell will be an industrial Pseudomonas species, (e.g., P. putida, P. aeruginosa, P. mevalonii). In some embodiments, the host cell will be an industrial Streptococcus species (e.g., S. equisimiles, S. pyogenes, S. uberis). In some embodiments, the host cell will be an industrial Streptomyces species (e.g., S. ambofaciens, S. achromogenes, S. avermitilis, S. coelicolor, S. aureofaciens, S. aureus, S. fungicidicus, S. griseus, S. lividans). In some embodiments, the host cell will be an industrial Zymomonas species (e.g., Z. mobilis, Z. lipolytica), and the like.
[0162] In some embodiments, the host cell will be an industrial Escherichia species (e.g., E. coli).
[0163] Suitable host strains of the E. coli species comprise: Enterotoxigenic E. coli (ETEC), Enteropathogenic E. coli (EPEC), Enteroinvasive E. coli (EIEC), Enterohemorrhagic E. coli (EHEC), Uropathogenic E. coli (UPEC), Verotoxin-producing E. coli, E. coli O157:H7, E. coli O104:H4, Escherichia coli O121, Escherichia coli O104:H21, Escherichia coli K1, and Escherichia coli NC101. In some embodiments, the present disclosure teaches genomic engineering of E. coli K12, E. coli B, and E. coli C.
[0164] In some embodiments, the host cell can be E. coli strains NCTC 12757, NCTC 12779, NCTC 12790, NCTC 12796, NCTC 12811, ATCC 11229, ATCC 25922, ATCC 8739, DSM 30083, BC 5849, BC 8265, BC 8267, BC 8268, BC 8270, BC 8271, BC 8272, BC 8273, BC 8276, BC 8277, BC 8278, BC 8279, BC 8312, BC 8317, BC 8319, BC 8320, BC 8321, BC 8322, BC 8326, BC 8327, BC 8331, BC 8335, BC 8338, BC 8341, BC 8344, BC 8345, BC 8346, BC 8347, BC 8348, BC 8863, and BC 8864.
[0165] In some embodiments, the present disclosure teaches host cells that can be verocytotoxigenic E. coli (VTEC), such as strains BC 4734 (O26:H11), BC 4735 (O157:H-), BC 4736, BC 4737 (n.d.), BC 4738 (O157:H7), BC 4945 (O26:H-), BC 4946 (O157:H7), BC 4947 (O111:H-), BC 4948 (O157:H), BC 4949 (O5), BC 5579 (O157:H7), BC 5580 (O157:H7), BC 5582 (O3:H), BC 5643 (O2:H5), BC 5644 (O128), BC 5645 (O55:H-), BC 5646 (O69:H-), BC 5647 (O101:H9), BC 5648 (O103:H2), BC 5850 (O22:H8), BC 5851 (O55:H-), BC 5852 (O48:H21), BC 5853 (O26:H11), BC 5854 (O157:H7), BC 5855 (O157:H-), BC 5856 (O26:H-), BC 5857 (O103:H2), BC 5858 (O26:H11), BC 7832, BC 7833 (Oraw form:H-), BC 7834 (ONT:H-), BC 7835 (O103:H2), BC 7836 (O57:H-), BC 7837 (ONT:H-), BC 7838, BC 7839 (O128:H2), BC 7840 (O157:H-), BC 7841 (O23:H-), BC 7842 (O157:H-), BC 7843, BC 7844 (O157:H-), BC 7845 (O103:H2), BC 7846 (O26:H11), BC 7847 (O145:H-), BC 7848 (O157:H-), BC 7849 (O156:H47), BC 7850, BC 7851 (O157:H-), BC 7852 (O157:H-), BC 7853 (O5:H-), BC 7854 (O157:H7), BC 7855 (O157:H7), BC 7856 (O26:H-), BC 7857, BC 7858, BC 7859 (ONT:H-), BC 7860 (O129:H-), BC 7861, BC 7862 (O103:H2), BC 7863, BC 7864 (Oraw form:H-), BC 7865, BC 7866 (O26:H-), BC 7867 (Oraw form:H-), BC 7868, BC 7869 (ONT:H-), BC 7870 (O113:H-), BC 7871 (ONT:H-), BC 7872 (ONT:H-), BC 7873, BC 7874 (Oraw form:H-), BC 7875 (O157:H-), BC 7876 (O111:H-), BC 7877 (O146:H21), BC 7878 (O145:H-), BC 7879 (O22:H8), BC 7880 (Oraw form:H-), BC 7881 (O145:H-), BC 8275 (O157:H7), BC 8318 (O55:K-:H-), BC 8325 (O157:H7), and BC 8332 (ONT), BC 8333.
[0166] In some embodiments, the present disclosure teaches host cells that can be enteroinvasive E. coli (EIEC), such as strains BC 8246 (O152:K-:H-), BC 8247 (O124:K(72):H3), BC 8248 (O124), BC 8249 (O112), BC 8250 (O136:K(78):H-), BC 8251 (O124:H-), BC 8252 (O144:K-:H-), BC 8253 (O143:K:H-), BC 8254 (O143), BC 8255 (O112), BC 8256 (O28a.e), BC 8257 (O124:H-), BC 8258 (O143), BC 8259 (O167:K-:H5), BC 8260 (O128a.c.:H35), BC 8261 (O164), BC 8262 (O164:K-:H-), BC 8263 (O164), and BC 8264 (O124).
[0167] In some embodiments, the present disclosure teaches host cells that can be enterotoxigenic E. coli (ETEC), such as strains BC 5581 (O78:H11), BC 5583 (O2:K1), BC 8221 (O118), BC 8222 (O148:H-), BC 8223 (O111), BC 8224 (O110:H-), BC 8225 (O148), BC 8226 (O118), BC 8227 (O25:H42), BC 8229 (O6), BC 8231 (O153:H45), BC 8232 (O9), BC 8233 (O148), BC 8234 (O128), BC 8235 (O118), BC 8237 (O111), BC 8238 (O110:H17), BC 8240 (O148), BC 8241 (O6H16), BC 8243 (O153), BC 8244 (O15:H-), BC 8245 (O20), BC 8269 (O125a.c:H-), BC 8313 (O6:H6), BC 8315 (O153:H-), BC 8329, BC 8334 (O118:H12), and BC 8339.
[0168] In some embodiments, the present disclosure teaches host cells that can be enteropathogenic E. coli (EPEC), such as strains BC 7567 (O86), BC 7568 (O128), BC 7571 (O114), BC 7572 (O119), BC 7573 (O125), BC 7574 (O124), BC 7576 (O127a), BC 7577 (O126), BC 7578 (O142), BC 7579 (O26), BC 7580 (OK26), BC 7581 (O142), BC 7582 (O55), BC 7583 (O158), BC 7584 (O-), BC 7585 (O-), BC 7586 (O-), BC 8330, BC 8550 (O26), BC 8551 (O55), BC 8552 (O158), BC 8553 (O26), BC 8554 (O158), BC 8555 (O86), BC 8556 (O128), BC 8557 (OK26), BC 8558 (O55), BC 8560 (O158), BC 8561 (O158), BC 8562 (O114), BC 8563 (O86), BC 8564 (O128), BC 8565 (O158), BC 8566 (O158), BC 8567 (O158), BC 8568 (O111), BC 8569 (O128), BC 8570 (O114), BC 8571 (O128), BC 8572 (O128), BC 8573 (O158), BC 8574 (O158), BC 8575 (O158), BC 8576 (O158), BC 8577 (O158), BC 8578 (O158), BC 8581 (O158), BC 8583 (O128), BC 8584 (O158), BC 8585 (O128), BC 8586 (O158), BC 8588 (O26), BC 8589 (O86), BC 8590 (O127), BC 8591 (O128), BC 8592 (O114), BC 8593 (O114), BC 8594 (O114), BC 8595 (O125), BC 8596 (O158), BC 8597 (O26), BC 8598 (O26), BC 8599 (O158), BC 8605 (O158), BC 8606 (O158), BC 8607 (O158), BC 8608 (O128), BC 8609 (O55), BC 8610 (O114), BC 8615 (O158), BC 8616 (O128), BC 8617 (O26), BC 8618 (O86), BC 8619, BC 8620, BC 8621, BC 8622, BC 8623, BC 8624 (O158), and BC 8625 (O158).
[0169] In some embodiments, the present disclosure also teaches host cells that can be Shigella organisms, including Shigella flexneri, Shigella dysenteriae, boydii, and Shigella sonnei.
[0170] The present disclosure is also suitable for use with a variety of animal cell types, including mammalian cells, for example, human (including 293, WI38, PER.C6 and Bowes melanoma cells), mouse (including 3T3, NS0, NS1, Sp2/0), hamster (CHO, BHK), monkey (COS, FRhL, Vero), and hybridoma cell lines.
[0171] In various embodiments, strains that may be used in the practice of the disclosure including both prokaryotic and eukaryotic strains, are readily accessible to the public from a number of culture collections such as American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen and Zellkulturen GmbH (DSM), Centraalbureau Voor Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL).
[0172] In some embodiments, the methods of the present disclosure are also applicable to multi-cellular organisms. For example, the platform could be used for improving the performance of crops. The organisms can comprise a plurality of plants such as Gramineae, Fetucoideae, Poacoideae, Agrostis, Phleum, Dactylis, Sorgum, Setaria, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Poa, Festuca, Stenotaphrum, Cynodon, Coix, Olyreae, Phareae, Compositae or Leguminosae. For example, the plants can be corn, rice, soybean, cotton, wheat, rye, oats, barley, pea, beans, lentil, peanut, yam bean, cowpeas, velvet beans, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, sweet pea, sorghum, millet, sunflower, canola or the like. Similarly, the organisms can include a plurality of animals such as non-human mammals, fish, insects, or the like.
[0173] Transformation of Host Cells
[0174] In some embodiments, the constructs generated by the methods of the present disclosure may be introduced into the host cells using any of a variety of techniques, including transformation, transfection, transduction, viral infection, gene guns, or Ti-mediated gene transfer. Particular methods include calcium phosphate transfection, DEAE-Dextran mediated transfection, lipofection, or electroporation (Davis, L., Dibner, M., Battey, I., 1986 "Basic Methods in Molecular Biology"). Other methods of transformation include for example, lithium acetate transformation and electroporation See, e.g., Gietz et al., Nucleic Acids Res. 27:69-74 (1992); Ito et al., J. Bacterol. 153:163-168 (1983); and Becker and Guarente, Methods in Enzymology 194:182-187 (1991). In some embodiments, transformed host cells are referred to as recombinant host strains.
Automation
[0175] In one embodiment, the compositions and methods provided herein are incorporated into a high-throughput (HTP) method for genetic engineering of a host cell. In another embodiment, the methods provided herein can be a molecular tool that is part of the suite of HTP molecular tool sets described in PCT/US18/36360, PCT/US18/36333 or WO 2017/100377, each of which is herein incorporated by reference, for all purposes, to create HTP genetic design libraries, which are derived from, inter alia, scientific insight and iterative pattern recognition. The compositions and methods provided herein can be used to generate libraries for use in high-throughput methods such as those described in PCT/US18/36360, PCT/US18/36333 or WO 2017/100377. Examples of libraries that can be generated using the methods provided herein can include, but are not limited to promoter ladders, terminator ladders, solubility tag ladders or degradation tag ladders. Examples of high-throughput genomic engineering methods that can utilize the compositions and methods provided herein can include, but are not limited to, promoter swapping, terminator (stop) swapping, solubility tag swapping, degradation tag swapping or SNP swapping as described in PCT/US18/36360, PCT/US18/36333 or WO 2017/100377. The high-throughput methods can be automated and/or utilize robotics and liquid handling platforms (e.g., plate robotics platform and liquid handling machines known in the art. The high-throughput methods can utilize multi-well plates such as, for example microtiter plates.
[0176] In some embodiments, the automated methods of the disclosure comprise a robotic system. The systems outlined herein are generally directed to the use of 96- or 384-well microtiter plates, but as will be appreciated by those in the art, any number of different plates or configurations may be used. In addition, any or all of the steps outlined herein may be automated; thus, for example, the systems may be completely or partially automated. The robotic systems compatible with the methods and compositions provided herein can be those described in PCT/US18/36360, PCT/US18/36333 or WO 2017/100377.
Kits
[0177] Also provided by the present disclosure are kits for practicing the methods for generating nucleic acid assemblies or libraries derived therefrom as described above. The kit can comprise a mixture containing all of the reagents necessary for assembling ssDNA molecules (e.g., oligonucleotides) or dsDNA molecules. In certain embodiments, a subject kit may contain: i. a pool of first polynucleotides containing pairs of first and second polynucleotides, ii. a second pool of insert polynucleotides, and (iii) optionally, a suitable cloning vector for propagation of the generated assemblies in a suitable host cell. In some cases, the kit includes a positive control.
[0178] In one embodiment, the kits provided herein further comprise a 5'-3' exonuclease, and a strand-displacing polymerase. In another embodiment, the kits provided herein further comprise a 5'-3' exonuclease, a ligase and a strand-displacing polymerase. In a still further embodiment, the kits provided herein comprise a single-stranded (ss) binding protein. The ss binding protein can be an extreme thermostable single-stranded DNA binding protein (ET SSB), E. coli recA, T7 gene 2.5 product, phage lambda RedB or Rac prophage RecT.
[0179] In a separate embodiment, the kits provided herein further comprise a 5' to 3' exonuclease that lacks 3' exonuclease activity, a crowding agent, a thermostable non-strand-displacing DNA polymerase with 3' exonuclease activity, or a mixture of said DNA polymerase with a second DNA polymerase that lacks 3' exonuclease activity, and an isolated thermostable ligase, in appropriate amounts. The crowding agent can PEG, dextran or ficoll. For example, the kit may contain T5 exonuclease, PEG, PHUSION.RTM.. DNA polymerase, and Taq ligase. In another example, the kit comprises: Exonuclease III, PEG, AMPLITAQ GOLD.RTM. DNA polymerase, and Taq ligase.
[0180] Any of the kits provided herein may also contain other reagents described above and below that may be employed in the method, e.g., a mismatch repair enzyme such as mutHLS, cel-1 nuclease, T7 endo 1, uvrD, T4 EndoVII, E. coli EndoV, a buffer, dNTPs, plasmids into which to insert the synthon and/or competent cells to receive the plasmids, controls etc., depending on how the method is going to be implemented.
[0181] The components of the kit may be combined in one container, or each component may be in its own container. For example, the components of the kit may be combined in a single reaction tube or in one or more different reaction tubes.
[0182] In addition to above-mentioned components, the subject kit further includes instructions for using the components of the kit to practice the subject method. The instructions for practicing the subject method are generally recorded on a suitable recording medium. For example, the instructions may be printed on a substrate, such as paper or plastic, etc. As such, the instructions may be present in the kits as a package insert, in the labeling of the container of the kit or components thereof (i.e., associated with the packaging or subpackaging) etc. In other embodiments, the actual instructions are not present in the kit, but means for obtaining the instructions from a remote source, e.g. via the internet, are provided. An example of this embodiment is a kit that includes a web address where the instructions can be viewed and/or from which the instructions can be downloaded.
[0183] Compositions, kits and methods for assembling pairs of polynucleotides in a first pool and insert polynucleotides in a second pool as described herein result in a product that is a dsDNA that can serve as a template for PCR, RCA or a variety of other molecular biology applications including direct transformation or transfection of a competent prokaryotic or eukaryotic host cell.
EXAMPLES
[0184] The present disclosure is further illustrated by reference to the following Examples. However, it should be noted that these Examples, like the embodiments described above, are illustrative and are not to be construed as restricting the scope of the disclosure in any way.
Example 1--Proof of Principle of Method for Multiplexed Assembly of DNA Fragments into Deterministic Library of Plasmids
[0185] Objective
[0186] This example describes the use of an in vitro assembly reaction as shown schematically in FIG. 1 to deterministically join a pool comprising precisely designed DNA parts comprising 4 components to generate a library of desired plasmids.
[0187] Methods/Results
[0188] Insert sequences (i.e., payloads) were synthesized on either a column or an array to generate a pool of column-synthesized payloads and a pool of array-synthesized payloads, each containing a mixture of the 7 payload sequences shown below.
TABLE-US-00001 >pMB070_promoter (SEQ ID NO. 1) ACCGTGCGTGTTGACAATTTTACCTCTGGCGGTGATACTGGTTGCATGT ACTAAGGAGGTTGT >b2405_promoter (SEQ ID NO. 2) ATGTCGGATATCTGGTGGTGAAATACTTTATGCCATGATAATTTAATA CGATGTATTTATTATATGGAGCACTTAATT >b0605_promoter (SEQ ID NO. 3) TAATGGAAACGCATTAGCCGAATCGGCAAAAATTGGTTACCTTACATC TCATCGAAAACACGGAGGAAGTATAG >pMB043_promoter (SEQ ID NO. 4) ACCGTGCGTGTTGACTATTTTACCTCTGGCGGTTAGAGTTAACATCCTA CAAGGAGAACAAAAGC >pMB071_promoter (SEQ ID NO. 5) ACCGTGCGTGTTGACTTAAATACCACTGGCGGTGATAATGGTTGCATG TACTAAGGAGGTTGT >b0159_promoter (SEQ ID NO. 6) CTCTCCCGCGTGAGAAATACGCTTCCCCGTAAGCGCATGGTAAACTAT GCCTTCAAATCGGGCTTATCGCGAGTAAATCT >pMB090_promoter (SEQ ID NO. 7) ACCGTGCGTGTTTACAATTTTACCTCTGGCGGTGATAATTAACATCCTA CAAGGAGAACAAAAGC
[0189] Separately, 6 pools comprising pairs of left and right homology arms were generated (i.e., loci pool numbers referenced in FIG. 4). Each loci pool contained a number of homology arms that each comprised sequence complementary to a separate locus in the genome of a Escherichia coli host cell; the number of unique loci (i.e., number of pairs of homology arms) is given below the plot for each pool in FIG. 4 (i.e., `Loci in Pool` in Table below graph). The sequences of each homology arm in each pair from each pool are SEQ ID NOs 8-179.
[0190] The pool of column-synthesized payloads and the pool of array-synthesized payloads were each separately mixed with the specific loci pool designated in FIG. 4. It should be noted that each mixture contained a molar ratio of left homology arm:payload:right homology arm was roughly 1:10:1. There was an excess of the payload because the payload pools contained oligonucleotides corresponding to 100-500 unique left and right homology arms, while only 10-19 homology arms were assembled in a given reaction given that a certain fraction of insert oligonuclotides would be inert in the reaction. The mixture further comprised the NEB Hifi DNA assembly master mix and a cloning vector for propagation in an E. coli cloning strain. Each mixture contained about 0.05 pmol of the respective loci pool, 0.2-1 pmol of the respective payload pool, and 0.0125 pmol of the cloning vector, theoretically resulting in 0.0125 pmol of final assembled product. Once assembled, each of the mixtures was subjected to the NEB Hifi DNA Assembly protocol for in vitro overlap assembly and propagated in an E. coli cloning strain. The number of unique loci (pairs of homology arms), payloads, and total possible constructs in each library is given in the table in FIG. 4.
[0191] Following propagation, at least 100 colonies per assembly were picked separately into liquid culture and grown overnight. The liquid culture was used as template for a rolling-circle amplification (RCA) of the entire cloned plasmid. The RCA product was fragmented using a Tn5 transposase fragmentation and adapter-ligation kit (Nextera, Illumina). Sample-specific indexing barcodes were then added via PCR, and plasmids from the libraries were the pooled and column purified. Library molarity was assessed with a Tapestation instrument and plasmids from the libraries were loaded onto a MiSeq instrument (300 cycle kit) for sequencing. The number of plasmids sequenced for each assembly in shown in FIG. 4.
[0192] To determine what assembly had been generated in the pool of assemblies, algorithms were used to search for unique 20-mer sequences for each junction between parts in each unique assembly in the raw sequencing reads in order to identify which DNA sequence was assembled. The reads were then mapped to the corresponding reference sequence for each sample in order to determine the full-length products created. In the plot in FIG. 4, `sequence-perfect` means that all four parts (vector backbone, left and right homology arms, and payload) were assembled together and there were no mutations in the plasmid. `Correct assembly with mutations` indicates the presence of all four parts in the correct arrangement but with one or more point mutations in the plasmid. `Misassembly` indicates plasmids with mispaired homology arms, or part(s) or portions of parts not present.
[0193] Conclusion
[0194] The results shown in FIG. 4 indicate that the process depicted in FIG. 1 can be successfully utilized to generate deterministic libraries of DNA assemblies.
Example 2--Proof of Principle of Method for Multiplexed Deterministic Assembly with Large Payloads Using Circular-Permuted Payload
[0195] Objective
[0196] This example describes the use of an in vitro assembly reaction as shown schematically in FIG. 1 to deterministically join a pool comprising precisely designed DNA parts including a circular-permuted payload (insert) whose preparation is described in FIG. 3.
[0197] Methods/Results
[0198] Insert was prepared by amplifying a template comprising the payload sequence (.about.2670 bp) using a pooled forward primer comprising, from 5' to 3' end, an assembly overlap to the right side of the payload, 53 bp of HOM2, an I-SceI restriction endonuclease recognition site, 53 bp of HOM1, and a primer binding site at the left side of the payload. The amplification product was excised from an agarose gel using a Qiagen gel extraction kit. The excised product was circularized in an NEB HiFi assembly reaction, purified via paramagnetic bead-based clean-up (e.g., an AXYPREP mag bead cleanup), and linearized using I-SceI (the `circular permutation`).
[0199] Separately, left and right homology arms that each comprised sequence complementary to a separate locus in the genome of a Saccharomyces cerevisiae host cell were amplified from genomic DNA.
[0200] The circular-permuted pooled payload and the set of left and right homology arms were combined with a cloning vector and assembled using an NEB HiFi reaction. The mixture contained a molar ratio of left homology arms:pooled insert:right homology arms of roughly 1:5:1. An excess of the insert was used because the pooled insert contained 54 unique sequences compared to the ten pairs of left/right homology arms used in the assembly. The mixture contained about 16 fmol of the hom arm pools, 80 fmol of the payload pool, and 2.5 fmol of the cloning vector, theoretically resulting in 2.5 fmol of final assembled product. Once assembled, the mixture was subjected to the NEB Hifi DNA Assembly protocol for in vitro overlap assembly and propagated in an E. coli cloning strain.
[0201] Following propagation, several colonies were picked separately into liquid culture and grown overnight. The liquid culture was used as template for a rolling-circle amplification (RCA) of the entire cloned plasmid. The RCA product was sequenced with two primers (one forward and one reverse) by Sanger sequencing. The primers were designed to bind on the cloning vector and read into the hom arms and payload.
[0202] To determine what assembly had been generated in the pool of assemblies, algorithms were used to search for unique 20-mer sequences in each unique assembly in the sequencing reads. The reads were then aligned to the corresponding reference sequence for each sample in order to verify the intended junctions were created, indicating the plasmid had assembled correctly. In FIG. 5, the long bars at the top indicate the structure of the plasmids to be assembled in the pool, and the shorter bars below represent Sanger sequences aligned to the corresponding reference sequence for three separate samples from the pool of assemblies. Faint vertical lines at the inner ends of the reads represent expected sequencing artifacts at the tail ends of Sanger reads. The data indicate that all of the intended junctions were assembled.
[0203] Conclusion
[0204] The results shown in FIG. 5 indicate that the processes depicted in FIG. 1 and FIG. 3 can be successfully utilized to generate deterministic libraries of DNA assemblies incorporating long payloads (e.g., >200 bp).
Example 3--Proof of Principle of Method for Multiplexed Deterministic Assembly with Large Payloads Using PCR-Amplified Payload
[0205] Methods/Results
[0206] FIG. 6 shows total success rates for pooled assemblies for which payload containing parts were created via PCR using primers that append the assembly overlaps from templates derived from a host genome. Amplified payloads ranged from 182-213 bp in length. PCR-amplified parts with appended 25 bp assembly overlaps were purified via magnetic bead-based protocol and subsequently normalized along with left and right parts containing homology arms to 0.25 picomoles total for payload, left and right parts separately and 0.05 picomoles of vector backbone before performing assembly using NEB's HIFI assembly master-mix and subsequently electroporated into electrocompetent cells. Success rates were calculated from the percentage of plasmids that were recovered and passed NGS-QC relative to those that were attempted to be created. Success rates were based on recovering and sequencing the following numbers of unique plasmids for each pool: pool 1: (48/70 plasmids), pool 2: (47/70 plasmids), pool 3: (46/70 plasmids), pool 4 (56/70 plasmids), pool 5: (37/49 plasmids). For pools 1:4, 7 promoter payloads were targeted to 10 Loci, and for pool 5 7 promoter payloads were targeted to 7 loci. Collectively across all five pools we 234/329 or 71.12% of plasmids were created and NGS-confirmed.
[0207] Conclusion
[0208] The results shown in FIG. 6 indicate that the process depicted in FIG. 1 can be successfully utilized with PCR-amplified insert fragments to generate deterministic libraries of DNA assemblies incorporating long payloads (e.g. >200 bp).
TABLE-US-00002 SEQUENCES OF THE DISCLOSURE WITH SEQ ID NO IDENTIFIERS NUCLEIC ACID SEQ Name ID NO: Description pMB070_promoter 1 Payload sequence b2405_promoter 2 Payload sequence b0605_promoter 3 Payload sequence pMB043_promoter 4 Payload sequence pMB071_promoter 5 Payload sequence b0159_promoter 6 Payload sequence pMB090_promoter 7 Payload sequence pool_1_b3748_left 8 Pool 1-Left Homology Arm pool_1_b3748_right 9 Pool 1-Right Homology Arm pool_1_b0388_left 10 Pool 1-Left Homology Arm pool_1_b0388_right 11 Pool 1-Right Homology Arm pool_1_b4348_left 12 Pool 1-Left Homology Arm pool_1_b4348_right 13 Pool 1-Right Homology Arm pool_1_b1982_left 14 Pool 1-Left Homology Arm pool_1_b1982_right 15 Pool 1-Right Homology Arm pool_1_b4367_left 16 Pool 1-Left Homology Arm pool_1_b4367_right 17 Pool 1-Right Homology Arm pool_1_b2285_left 18 Pool 1-Left Homology Arm pool_1_b2285_right 19 Pool 1-Right Homology Arm pool_1_b2405_left 20 Pool 1-Left Homology Arm pool_1_b2405_right 21 Pool 1-Right Homology Arm pool_1_b0495_left 22 Pool 1-Left Homology Arm pool_1_b0495_right 23 Pool 1-Right Homology Arm pool_1_b1646_left 24 Pool 1-Left Homology Arm pool_1_b1646_right 25 Pool 1-Right Homology Arm pool_1_b3189_left 26 Pool 1-Left Homology Arm pool_1_b3189_right 27 Pool 1-Right Homology Arm pool_2_b3125_left 28 Pool 2-Left Homology Arm pool_2_b3125_right 29 Pool 2-Right Homology Arm pool_2_b3787_left 30 Pool 2-Left Homology Arm pool_2_b3787_right 31 Pool 2-Right Homology Arm pool_2_b1948_left 32 Pool 2-Left Homology Arm pool_2_b1948_right 33 Pool 2-Right Homology Arm pool_2_b2790_left 34 Pool 2-Left Homology Arm pool_2_b2790_right 35 Pool 2-Right Homology Arm pool_2_b3197_left 36 Pool 2-Left Homology Arm pool_2_b3197_right 37 Pool 2-Right Homology Arm pool_2_b3791_left 38 Pool 2-Left Homology Arm pool_2_b3791_right 39 Pool 2-Right Homology Arm pool_2_b4260_left 40 Pool 2-Left Homology Arm pool_2_b4260_right 41 Pool 2-Right Homology Arm pool_2_b0071_left 42 Pool 2-Left Homology Arm pool_2_b0071_right 43 Pool 2-Right Homology Arm pool_2_b1687_left 44 Pool 2-Left Homology Arm pool_2_b1687_right 45 Pool 2-Right Homology Arm pool_2_b1006_left 46 Pool 2-Left Homology Arm pool_2_b1006_right 47 Pool 2-Right Homology Arm pool_3_b0335_left 48 Pool 3-Left Homology Arm pool_3_b0335_right 49 Pool 3-Right Homology Arm pool_3_b1940_left 50 Pool 3-Left Homology Arm pool_3_b1940_right 51 Pool 3-Right Homology Arm pool_3_b0109_left 52 Pool 3-Left Homology Arm pool_3_b0109_right 53 Pool 3-Right Homology Arm pool_3_b3399_left 54 Pool 3-Left Homology Arm pool_3_b3399_right 55 Pool 3-Right Homology Arm pool_3_b2478_left 56 Pool 3-Left Homology Arm pool_3_b2478_right 57 Pool 3-Right Homology Arm pool_3_b0320_left 58 Pool 3-Left Homology Arm pool_3_b0320_right 59 Pool 3-Right Homology Arm pool_3_b4521_left 60 Pool 3-Left Homology Arm pool_3_b4521_right 61 Pool 3-Right Homology Arm pool_3_b2260_left 62 Pool 3-Left Homology Arm pool_3_b2260_right 63 Pool 3-Right Homology Arm pool_3_b4169_left 64 Pool 3-Left Homology Arm pool_3_b4169_right 65 Pool 3-Right Homology Arm pool_3_b2405_left 66 Pool 3-Left Homology Arm pool_3_b2405_right 67 Pool 3-Right Homology Arm pool_4_b3493_left 68 Pool 4-Left Homology Arm pool_4_b3493_right 69 Pool 4-Right Homology Arm pool_4_b0479_left 70 Pool 4-Left Homology Arm pool_4_b0479_right 71 Pool 4-Right Homology Arm pool_4_b2470_left 72 Pool 4-Left Homology Arm pool_4_b2470_right 73 Pool 4-Right Homology Arm pool_4_b1451_left 74 Pool 4-Left Homology Arm pool_4_b1451_right 75 Pool 4-Right Homology Arm pool_4_b1981_left 76 Pool 4-Left Homology Arm pool_4_b1981_right 77 Pool 4-Right Homology Arm pool_4_b0237_left 78 Pool 4-Left Homology Arm pool_4_b0237_right 79 Pool 4-Right Homology Arm pool_4_b2497_left 80 Pool 4-Left Homology Arm pool_4_b2497_right 81 Pool 4-Right Homology Arm pool_4_b4260_left 82 Pool 4-Left Homology Arm pool_4_b4260_right 83 Pool 4-Right Homology Arm pool_4_b1412_left 84 Pool 4-Left Homology Arm pool_4_b1412_right 85 Pool 4-Right Homology Arm pool_4_b4139_left 86 Pool 4-Left Homology Arm pool_4_b4139_right 87 Pool 4-Right Homology Arm pool_4_b2039_left 88 Pool 4-Left Homology Arm pool_4_b2039_right 89 Pool 4-Right Homology Arm pool_4_b4473_left 90 Pool 4-Left Homology Arm pool_4_b4473_right 91 Pool 4-Right Homology Arm pool_4_b3510_left 92 Pool 4-Left Homology Arm pool_4_b3510_right 93 Pool 4-Right Homology Arm pool_4_b1007_left 94 Pool 4-Left Homology Arm pool_4_b1007_right 95 Pool 4-Right Homology Arm pool_4_b3058_left 96 Pool 4-Left Homology Arm pool_4_b3058_right 97 Pool 4-Right Homology Arm pool_4_b2688_left 98 Pool 4-Left Homology Arm pool_4_b2688_right 99 Pool 4-Right Homology Arm pool_4_b1716_left 100 Pool 4-Left Homology Arm pool_4_b1716_right 101 Pool 4-Right Homology Arm pool_4_b3071_left 102 Pool 4-Left Homology Arm pool_4_b3071_right 103 Pool 4-Right Homology Arm pool_4_b2139_left 104 Pool 4-Left Homology Arm pool_4_b2139_right 105 Pool 4-Right Homology Arm pool_5_b2434_left 106 Pool 5-Left Homology Arm pool_5_b2434_right 107 Pool 5-Right Homology Arm pool_5_b2037_left 108 Pool 5-Left Homology Arm pool_5_b2037_right 109 Pool 5-Right Homology Arm pool_5_b2451_left 110 Pool 5-Left Homology Arm pool_5_b2451_right 111 Pool 5-Right Homology Arm pool_5_b1902_left 112 Pool 5-Left Homology Arm pool_5_b1902_right 113 Pool 5-Right Homology Arm pool_5_b4310_left 114 Pool 5-Left Homology Arm pool_5_b4310_right 115 Pool 5-Right Homology Arm pool_5_b0676_left 116 Pool 5-Left Homology Arm pool_5_b0676_right 117 Pool 5-Right Homology Arm pool_5_b1497_left 118 Pool 5-Left Homology Arm pool_5_b1497_right 119 Pool 5-Right Homology Arm pool_5_b0183_left 120 Pool 5-Left Homology Arm pool_5_b0183_right 121 Pool 5-Right Homology Arm pool_5_b3631_left 122 Pool 5-Left Homology Arm pool_5_b3631_right 123 Pool 5-Right Homology Arm pool_5_b3791_left 124 Pool 5-Left Homology Arm pool_5_b3791_right 125 Pool 5-Right Homology Arm pool_5_b0438_left 126 Pool 5-Left Homology Arm pool_5_b0438_right 127 Pool 5-Right Homology Arm pool_5_b1981_left 128 Pool 5-Left Homology Arm pool_5_b1981_right 129 Pool 5-Right Homology Arm pool_5_b1709_left 130 Pool 5-Left Homology Arm pool_5_b1709_right 131 Pool 5-Right Homology Arm pool_5_b2176_left 132 Pool 5-Left Homology Arm pool_5_b2176_right 133 Pool 5-Right Homology Arm pool_5_b2168_left 134 Pool 5-Left Homology Arm pool_5_b2168_right 135 Pool 5-Right Homology Arm pool_5_b1872_left 136 Pool 5-Left Homology Arm pool_5_b1872_right 137 Pool 5-Right Homology Arm pool_5_b1203_left 138 Pool 5-Left Homology Arm pool_5_b1203_right 139 Pool 5-Right Homology Arm pool_5_b2231_left 140 Pool 5-Left Homology Arm pool_5_b2231_right 141 Pool 5-Right Homology Arm pool_5_b1622_left 142 Pool 5-Left Homology Arm pool_5_b1622_right 143 Pool 5-Right Homology Arm pool_6_b1857_left 144 Pool 6-Left Homology Arm pool_6_b1857_right 145 Pool 6-Right Homology Arm pool_6_b4024_left 146 Pool 6-Left Homology Arm pool_6_b4024_right 147 Pool 6-Right Homology Arm pool_6_b3942_left 148 Pool 6-Left Homology Arm pool_6_b3942_right 149 Pool 6-Right Homology Arm pool_6_b0592_left 150 Pool 6-Left Homology Arm pool_6_b0592_right 151 Pool 6-Right Homology Arm pool_6_b1415_left 152 Pool 6-Left Homology Arm pool_6_b1415_right 153 Pool 6-Right Homology Arm pool_6_b1762_left 154 Pool 6-Left Homology Arm pool_6_b1762_right 155 Pool 6-Right Homology Arm pool_6_b3414_left 156 Pool 6-Left Homology Arm pool_6_b3414_right 157 Pool 6-Right Homology Arm pool_6_b4374_left 158 Pool 6-Left Homology Arm pool_6_b4374_right 159 Pool 6-Right Homology Arm pool_6_b2917_left 160 Pool 6-Left Homology Arm pool_6_b2917_right 161 Pool 6-Right Homology Arm pool_6_b0346_left 162 Pool 6-Left Homology Arm pool_6_b0346_right 163 Pool 6-Right Homology Arm pool_6_b3966_left 164 Pool 6-Left Homology Arm pool_6_b3966_right 165 Pool 6-Right Homology Arm pool_6_b0406_left 166 Pool 6-Left Homology Arm pool_6_b0406_right 167 Pool 6-Right Homology Arm pool_6_b0652_left 168 Pool 6-Left Homology Arm pool_6_b0652_right 169 Pool 6-Right Homology Arm pool_6_b1493_left 170 Pool 6-Left Homology Arm pool_6_b1493_right 171 Pool 6-Right Homology Arm pool_6_b4159_left 172 Pool 6-Left Homology Arm pool_6_b4159_right 173 Pool 6-Right Homology Arm pool_6_b3795_left 174 Pool 6-Left Homology Arm pool_6_b3795_right 175 Pool 6-Right Homology Arm pool_6_b4246_left 176 Pool 6-Left Homology Arm pool_6_b4246_right 177 Pool 6-Right Homology Arm pool_6_b4440_left 178 Pool 6-Left Homology Arm pool_6_b4440_right 179 Pool 6-Right Homology Arm
Numbered Embodiments of the Disclosure
[0209] Other subject matter contemplated by the present disclosure is set out in the following numbered embodiments:
[0210] 1. A composition comprising a mixture of polynucleotides, the mixture comprising:
[0211] a first pool containing pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide; and
[0212] a second pool of insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3' end that is complementary to a 5' end of a second polynucleotide in a pair of polynucleotides from the first pool.
[0213] 2. The composition of embodiment 1, further comprising a cloning vector, wherein, for each pair in the first pool, a 5' end of the first polynucleotide and a 3' end of the second polynucleotide comprises sequence complementary to the cloning vector.
[0214] 3. The composition of embodiment 2, wherein each polynucleotide from the first pool is selected such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector.
[0215] 4. The composition of embodiment 3, wherein the specified threshold is between 5 and 15 contiguous nucleotides.
[0216] 5. The composition of any one of embodiments 1-4, further comprising a polymerase.
[0217] 6. The composition of embodiment 5, wherein the polymerase is strand-displacing or non-strand displacing.
[0218] 7. The composition of embodiment 6, wherein the polymerase is non-strand displacing and the composition further comprises a crowding agent.
[0219] 8. The composition of embodiment 7, wherein the crowding agent is polyethylene glycol (PEG).
[0220] 9. The composition of embodiment 8, wherein the PEG is used at a concentration of from about 3 to about 7% (weight/volume).
[0221] 10. The composition of embodiment 8 or 9, wherein the PEG is selected from PEG-200, PEG-4000, PEG-6000, PEG-8000 or PEG-20,000.
[0222] 11. The composition of embodiment 6, wherein the polymerase is strand displacing and the composition further comprises a single-stranded binding protein.
[0223] 12. The composition of embodiment 11, wherein the single strand DNA binding protein is an extreme thermostable single-stranded DNA binding protein (ET SSB), E. coli recA, T7 gene 2.5 product, phage lambda RedB or Rac prophage RecT.
[0224] 13. The composition of any one of the above embodiments, further comprising a 5'-3' exonuclease.
[0225] 14. The composition of any one of the above embodiments, further comprising a ligase.
[0226] 15. The composition of any one of the above embodiments, wherein each pair in the first pool is double-stranded DNA (dsDNA) or single-stranded DNA (ssDNA).
[0227] 16. The composition of any one of the above embodiments, wherein each insert polynucleotide in the second pool is dsDNA or ssDNA.
[0228] 17. The composition of any one of the above embodiments, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a target genomic locus in a host cell.
[0229] 18. The composition of any one of embodiments 1-16, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a gene that is part of a metabolic pathway.
[0230] 19. The composition of any one of embodiments 1-18, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a functional domain or one or more proteins.
[0231] 20. The composition of any one of the above embodiments, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide are linked together in a single construct, wherein the single construct comprises one or more recognition sequences for one or more site-specific nucleases between the first polynucleotide and the second polynucleotide.
[0232] 21. The composition of embodiment 20, wherein the one or more recognition sequences for one or more site-specific nucleases comprise a homing endonuclease recognition sequence.
[0233] 22. The composition of any one of the above embodiments, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool.
[0234] 23. The composition of any one of the above embodiments, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises about 25 nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool.
[0235] 24. The composition of any one of the above embodiments, wherein each insert polynucleotide in the second pool comprises one or more payload sequences located between the first assembly overlap sequence and the second assembly overlap sequence.
[0236] 25. The composition of embodiment 24, wherein the one or more payload sequences are selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags, terminators, unique identifier sequence or portions thereof.
[0237] 26. The composition of embodiment 17, wherein each pair of first and second polynucleotides in the first pool comprises sequence corresponding to a different target genomic locus in a host cell as compared to each other pair in the first pool.
[0238] 27. The composition of embodiment 17, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to the same target genomic locus in a host cell.
[0239] 28. The composition of any one of embodiments 24-27, wherein each payload sequence in the insert polynucleotides in the second pool is different from the payload sequence in each other insert polynucleotide in the second pool.
[0240] 29. The composition of any one of embodiments 24-27, wherein each payload sequence in the insert polynucleotides in the second pool is the same as the payload sequence in each other insert polynucleotide in the second pool.
[0241] 30. A method for generating libraries of polynucleotides, the method comprising:
[0242] (a) combining a first pool of polynucleotides and a second pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, wherein the second pool contains insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to a 3' end of a first polynucleotide and a second assembly overlap sequence at its opposing 3' end that is complementary to a 5' end of a second polynucleotide in a pair of polynucleotides from the first pool; and
[0243] (b) assembling the first pool and the second pool into a library of polynucleotides, wherein each polynucleotide in the library comprises an insert polynucleotide from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool, wherein the assembling is performed via in vitro cloning methods or in vivo cloning methods.
[0244] 31. The method of embodiment 30, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool.
[0245] 32. The method of embodiment 30 or 31, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises about 25 nucleotides that are complementary to the 3' end of a first polynucleotide and the 5' end of a second polynucleotide, respectively, in a pair of polynucleotides from the first pool.
[0246] 33. The method of any one of embodiments 30-32, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide are linked together in a single construct, wherein the single construct comprises one or more recognition sequences for one or more site-specific nucleases between the first polynucleotide and the second polynucleotide.
[0247] 34. The method of embodiment 33, wherein the one or more recognition sequences for one or more site-specific nucleases comprises a homing endonuclease recognition sequence.
[0248] 35. The method of embodiment 33, wherein the linked single construct is produced by joining individual first and second polynucleotides via splicing and overlap-extension PCR (SOE-PCR), restriction-ligation, blunt-end ligation, overlap-based assembly method, recombination-based method, or any other enzymatic or chemical method of joining the first and second polynucleotides, or by synthesizing the single-construct directly.
[0249] 36. The method of any one of embodiments 30-32, further comprising combining a cloning vector with the first pool and the second pool during step (a), wherein opposing ends of the cloning vector comprise sequence complementary to a 5'end of the first polynucleotide and a 3' end of the second polynucleotide for each pair in the first pool.
[0250] 37. The method of any one of embodiments 30-32, further comprising combining a cloning vector with the first pool prior to step (a), wherein opposing ends of the cloning vector comprise sequence complementary to a 5' end of the first polynucleotide and a 3' end of the second polynucleotide for each pair in the first pool.
[0251] 38. The method of embodiment 36 or 37, wherein the cloning vector and the 5'end of the first polynucleotide and the 3'end of the second polynucleotide in each pair from the first pool comprise one or more recognition sequences for one or more site-specific nucleases.
[0252] 39. The method of embodiment 38, further comprising generating single-stranded complementary overhangs between the opposing ends of the cloning vector and the 5'end of the first polynucleotide and the 3'end of the second polynucleotide in each pair from the first pool by adding the one or more site-specific nucleases for the one or more recognition sequences.
[0253] 40. The method of embodiment 39, further comprising ligating the single-stranded complementary overhangs between the opposing ends of the cloning vector and the 5'end of the first polynucleotide and the 3' end of the second polynucleotide in each pair from the first pool.
[0254] 41. The method of any one of embodiments 36-40, wherein step (b) results in a circular product comprising an insert polynucleotide from the second pool, a first and second polynucleotide from a pair from the first pool and the cloning vector.
[0255] 42. The method of any one of embodiments 36-41, wherein the first pool is generated by selecting pairs of polynucleotide sequences from a larger set of such sequences such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector.
[0256] 43. The method of embodiment 42, wherein the specified threshold is between 5 and 15 contiguous nucleotides.
[0257] 44. The method of any one of embodiments 30-43, wherein the assembly is an in vitro cloning method, wherein the mixture of the first pool and the second pool is heated to partially or fully denature polynucleotides present in the first and the second pools, then cooled to room temperature before assembly.
[0258] 45. A method for generating libraries of polynucleotides, the method comprising:
[0259] (a) amplifying via polymerase chain reaction (PCR) a first pool of polynucleotides, wherein the first pool contains pairs of polynucleotides, wherein each pair in the first pool contains a first polynucleotide and a second polynucleotide, and wherein each first polynucleotide and each second polynucleotide in a pair comprises a 5' end and a 3' end, wherein the amplifying introduces a common overlap sequence comprising one or more recognition sequences for one or more site-specific nucleases onto the 5' end of a first polynucleotide and the 3' end of a second polynucleotide in a pair from the first pool;
[0260] (b) assembling each pair of first polynucleotides and second polynucleotides from the first pool into a single nucleic acid fragment by utilizing common overlap sequence, wherein the single nucleic fragment for each pair comprises a first polynucleotide and second polynucleotide separated by the common overlap sequence from the 5' end of the first polynucleotide and the 3' end of the second polynucleotide, and wherein the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic fragment for each pair are located on opposing terminal ends of the single nucleic acid fragment, distal to the one or more site-specific nuclease recognition sequence(s);
[0261] (c) combining the single nucleic acid fragments for each pair with a second pool containing insert polynucleotides, wherein each insert polynucleotide in the second pool comprises a first assembly overlap sequence at its 5' end that is complementary to the 3' end of the first polynucleotide present within the single nucleic acid fragment and a second assembly overlap sequence at its opposing 3' end that is complementary to the 5' end of the second polynucleotide present within the single nucleic acid fragment;
[0262] (d) assembling the first pool and the second pool into a third pool of circularized products, wherein the assembling is performed via in vitro or in vivo overlap assembly methods, and wherein each circularized product in the third pool comprises an insert sequence from the second pool and a pair of first polynucleotides and second polynucleotides from the first pool;
[0263] (e) linearizing each circularized product in the third pool via digestion by one or more site-specific nuclease(s) that recognizes the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence in each of the circularized products in the third pool; and
[0264] (f) assembling the linearized products into cloning vectors by in vitro or in vivo cloning methods.
[0265] 46. The method of embodiment 45, wherein the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence is a homing nuclease recognition sequence.
[0266] 47. The method of embodiment 45 or 46, wherein the one or more site-specific nuclease(s) for the one or more site-specific nuclease recognition sequence(s) located between the first polynucleotide sequence and the second polynucleotide sequence is a homing endonuclease.
[0267] 48. The method of any one of embodiments 45-47, wherein the common overlap sequence comprises an assembly overlap sequence of at least 1 nucleotide and the assembly in step (b) is performed by an overlap-based DNA assembly method.
[0268] 49. The method of any one of embodiments 45-47, wherein the common overlap sequence comprises an assembly overlap sequence of from 10-25 nucleotides and the assembly in step (b) is performed by an overlap-based DNA assembly method.
[0269] 50. The method of embodiment 48 or 49, wherein the overlap-based DNA assembly method is selected from SOE-PCR or an in vitro overlap-assembly method.
[0270] 51. The method of embodiment 50, wherein the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 5' end of the first polynucleotide is complementary to the one or more site-specific nuclease recognition sequence(s) present in the common overlap sequence on the 3' end of the second polynucleotide in each pair, and wherein the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails performing SOE-PCR.
[0271] 52. The method of any one of embodiments 45-47, wherein the utilizing the common overlap sequences of the first and second polynucleotides in each pair in step (b) entails digesting the one or more site-specific nuclease recognition sequences present in the common overlap sequence on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair with one or more site specific nucleases for the one or more site-specific nuclease recognition sequences to generate single-stranded overhangs on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair that comprise complementary sequence; and ligating the complementary sequence present on the single-stranded overhang on the 5' end of the first polynucleotide and the 3' end of the second polynucleotide in each pair.
[0272] 53. The method of any one of embodiments 45-52, wherein the assembling of step (d) is performed using an overlap-based DNA assembly method.
[0273] 54. The method of embodiment 53, wherein the overlap-based DNA assembly is selected from SOE-PCR and an in vitro overlap-assembly method.
[0274] 55. The method of any one of embodiments 45-52, wherein the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair comprise an additional set of one or more site-specific nuclease recognition sequences and the first assembly overlap sequence and the second assembly overlap sequence in each insert polynucleotide in the second pool comprise one or more site-specific nuclease recognition sequences.
[0275] 56. The method of embodiment 55, wherein the assembling in step (d) entails digesting the additional one or more site-specific nuclease recognition sequences present on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool with one or more site specific nucleases for the additional one or more site-specific nuclease recognition sequences on the 3' end of the first polynucleotide and the 5' end of the second polynucleotide in the single nucleic acid fragment in each pair and the one or more site-specific nuclease recognition sequences present in the first and second assembly sequences in each insert polynucleotide from the second pool to generate a single-stranded overhang on the 3' end of the first polynucleotide that comprises sequence complementary to sequence present on a single-stranded overhang on the 5' end of the first assembly sequence of an insert polynucleotide from the second pool and a single stranded overhang on the 5' end of the second polynucleotide that comprises sequence complementary to a sequence present on a single-stranded overhang on the 3'end of the second assembly sequence of the same insert polynucleotide from the second pool; and ligating the complementary sequence present on the single-stranded overhangs.
[0276] 57. The method of any one of embodiments 45-56, wherein the cloning vectors of step (f) comprise one or more site-specific nuclease recognition sequences.
[0277] 58. The method of embodiment 57, wherein the assembling in step (f) entails digesting the one or more site-specific nuclease recognition sequences in the cloning vectors with the one or more site-specific nucleases for the one or more site-specific nuclease recognition sequences recognition sequences present in the cloning vectors, wherein the digesting generates single-stranded overhangs on opposing ends of the cloning vectors, wherein the single-stranded overhang on one of the opposing ends of the cloning vector comprises sequence complementary to an end of the linearized product generated in step (e) and the single-stranded overhang on the other of the opposing ends of the cloning vectors comprises sequence complementary to an opposing end of the linearized product generated in step (e); and ligating the complementary sequences present on the single-stranded overhangs of the cloning vectors and the linearized products from step (e).
[0278] 59. The method of any one of embodiments 45-58, wherein the first pool is generated by selecting pairs of polynucleotide sequences from a larger set of such sequences such that no polynucleotide from the first pool shares common sequence with any other polynucleotide from the first pool beyond a specified threshold, excluding designed assembly overlap sequences between the pairs of polynucleotides of the first pool and the insert polynucleotides of the second pool, or the pairs of polynucleotides of the first pool and the cloning vector.
[0279] 60. The method of embodiment 59, wherein the specified threshold is between 5 and 15 contiguous nucleotides.
[0280] 61. The method of any one of embodiments 45-60, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises 1 or more nucleotides that are complementary to the opposing terminal ends of the single nucleic acid fragment.
[0281] 62. The method of any one of embodiments 45-61, wherein the first assembly overlap sequence and the second assembly overlap sequence on each insert polynucleotide in the second pool comprises about 25 nucleotides that are complementary to the opposing terminal ends of the single nucleic acid fragment.
[0282] 63. The method of any one of embodiments 30-62, wherein, prior to step (a), the first pool of polynucleotides is generated by combining a mixture containing each first polynucleotide from the pairs of polynucleotides with a mixture containing each second polynucleotide from the pairs of polynucleotides.
[0283] 64. The method of any one of embodiments 30-63, wherein each pair in the first pool is double-stranded DNA (dsDNA) or single-stranded DNA (ssDNA).
[0284] 65. The method of any one of embodiments 30-43, wherein each insert polynucleotide in the second pool is dsDNA or ssDNA.
[0285] 66. The method of any one of embodiments 30-65, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a target genomic locus in a host cell.
[0286] 67. The method of any one of embodiments 30-65, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a gene that is part of a metabolic pathway.
[0287] 68. The method of any one of embodiments 30-65, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprise coding sequence corresponding to a functional domain or one or more proteins.
[0288] 69. The method of any one of embodiments 30-68, wherein each insert polynucleotide in the second pool comprises one or more payload sequences located between the first assembly overlap sequence and the second assembly overlap sequence.
[0289] 70. The method of embodiment 69, wherein the one or more payload sequences are selected from promoters, genes, regulatory sequences, nucleic acid sequence encoding degrons, nucleic acid sequence encoding solubility tags, terminators, unique identifier sequence or portions thereof.
[0290] 71. The method of embodiment 66, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to a different target genomic locus in a host cell as compared to each other pair in the first pool.
[0291] 72. The method of embodiment 66, wherein, for each pair in the first pool, the first polynucleotide and the second polynucleotide comprises sequence corresponding to the same target genomic locus in a host cell.
[0292] 73. The method of any one of embodiments 69-72, wherein each payload sequence in the insert polynucleotides in the second pool is different from the payload sequence in each other insert polynucleotide in the second pool.
[0293] 74. The method of any one of embodiments 69-72, wherein each payload sequence in the insert polynucleotides in the second pool is the same as the payload sequence in each other insert polynucleotide in the second pool.
[0294] 75. The method of any one of embodiments 30 or 69-74, wherein each insert polynucleotide in the second pool is generated by:
[0295] (i) performing a polymerase chain reaction (PCR) on a mixture comprising the payload sequence, a forward primer and a reverse primer, wherein the forward primer comprises from 5' to 3', a short stretch of one or more nucleotides complementary to the payload sequence, the first assembly overlap sequence, one or more recognition sequences for one or more site-specific nucleases, the second assembly overlap sequence and a second stretch of one or more nucleotides complementary to the payload sequence and wherein the reverse primer comprises sequence complementary to the payload sequence or to other sequence downstream of the payload sequence, wherein the PCR generates a PCR product comprising from 5' to 3', the short stretch of nucleic acid complementary to the payload sequence, the first assembly overlap sequence, the one or more site-specific nuclease recognition sequence(s), the second assembly overlap sequence and the payload sequence;
[0296] (ii) circularizing the PCR product via an assembly method selected from the group consisting of splicing and overlap-extension PCR (SOE-PCR), restriction-ligation, blunt-end ligation, overlap based assembly method and recombination-based method, or any other enzymatic or chemical method of joining two DNA molecules; and
[0297] (iii) linearizing the circularized PCR product with one or more site-specific nuclease(s) that recognize the one or more site-specific nuclease recognition sequence(s), thereby generating the second pool of polynucleotides.
[0298] 76. The composition or method of any one of the above embodiments, wherein the site-specific nuclease(s) is one or more of restriction endonuclease(s), Type IIs endonuclease(s), homing endonuclease(s), RNA-guided nuclease(s), DNA-guided nuclease(s), zinc-finger nuclease(s), TALEN(s) or nicking enzyme(s).
[0299] The various embodiments described above can be combined to provide further embodiments. All of the U.S. patents, U.S. patent application publications, U.S. patent application, foreign patents, foreign patent application and non-patent publications referred to in this specification and/or listed in the Application Data Sheet are incorporated herein by reference, in their entirety. Aspects of the embodiments can be modified, if necessary to employ concepts of the various patents, application and publications to provide yet further embodiments.
[0300] These and other changes can be made to the embodiments in light of the above-detailed description. In general, in the following claims, the terms used should not be construed to limit the claims to the specific embodiments disclosed in the specification and the claims, but should be construed to include all possible embodiments along with the full scope of equivalents to which such claims are entitled. Accordingly, the claims are not limited by the disclosure.
INCORPORATION BY REFERENCE
[0301] All references, articles, publications, patents, patent publications, and patent applications cited herein are incorporated by reference in their entireties for all purposes. However, mention of any reference, article, publication, patent, patent publication, and patent application cited herein is not, and should not be taken as an acknowledgment or any form of suggestion that they constitute valid prior art or form part of the common general knowledge in any country in the world.
Sequence CWU
1
1
179163DNAArtificial SequencepMB070_promoter 1accgtgcgtg ttgacaattt
tacctctggc ggtgatactg gttgcatgta ctaaggaggt 60tgt
63278DNAArtificial
Sequenceb2405_promoter 2atgtcggata tctggtggtg aaatacttta tgccatgata
atttaatacg atgtatttat 60tatatggagc acttaatt
78374DNAArtificial Sequenceb0605_promoter
3taatggaaac gcattagccg aatcggcaaa aattggttac cttacatctc atcgaaaaca
60cggaggaagt atag
74465DNAArtificial SequencepMB043_promoter 4accgtgcgtg ttgactattt
tacctctggc ggttagagtt aacatcctac aaggagaaca 60aaagc
65563DNAArtificial
SequencepMB071_promoter 5accgtgcgtg ttgacttaaa taccactggc ggtgataatg
gttgcatgta ctaaggaggt 60tgt
63680DNAArtificial Sequenceb0159_promoter
6ctctcccgcg tgagaaatac gcttccccgt aagcgcatgg taaactatgc cttcaaatcg
60ggcttatcgc gagtaaatct
80765DNAArtificial SequencepMB090_promoter 7accgtgcgtg tttacaattt
tacctctggc ggtgataatt aacatcctac aaggagaaca 60aaagc
6581050DNAArtificial
Sequencegene editing homology arm pool_1_b3748_left 8caaatctcga
agatttcttt catgcggtgg gagatataga caataccgcg gccttgcgat 60ttcagctcgc
ggatgacgcg gaacagggat tcggtttcgg tatcggtcag cgcatcggtc 120ggttcatcca
taatgatgac tttcgactca aagctcagca ctttggcgat ttcaaccatt 180tgctggtcac
cgatggaaag atcgcccacc agcttgtcgc ttttaaagcg caggttaagt 240ttagccagca
atttatccgc ttcggcatac atggttttcc agtcaatttt gccaaagcga 300ttaacaaact
cacgaccgag gaaaatgttt tcggcaatgg tcaactgcgg gatcaggttc 360agttcctgat
ggataatccc aatcccggct tcctgggaag attttggccc ggtaaatgtc 420gtttctttcc
ccagccataa aagcgtaccg gcatcgcgag tatagatgcc agtaagcact 480ttcatcatgg
tggatttacc cgcgccgttt tcgcccacca gcgccatcac gcggcccgga 540tagacattta
acgctgcgcc cgagagggct tttacgcccg ggaaggcttt atcgatgcct 600ttaagctgaa
gtaatgcttc catgacggcc tcagaacgtc acgccagcac agagaatgat 660attcgcatac
ggagaacatt ctccgctgcg aattaccgcc tgactttctg cggtttgttg 720tttgaattgt
tcatgcgtgg tgtaacgaat ttcaatggta tttccctggt gtttttgcag 780ctgctcaagg
tgagtgagca acgtttcgtg gagttgcgga ttatggtgtt tgatctcttc 840cgcgataatg
gccgcctcga cctgcatttc atttgtgacg acgcccagca cctgcataaa 900agaaggtaca
ccctgggtta atgccatatc gatacgcgtt gtacttttgg ggatgggtaa 960accagcatca
cacaccacca gcgtatcggt atgtcccaga cgggagatca ccgatgaaat 1020atcagaatta
agaacggtgc cttttttcat
105091047DNAArtificial Sequencegene editing homology arm
pool_1_b3748_right 9caacctcgaa acgttttaca tggtgattaa ccatgaaaac
aaaaacgccc ccttatgaag 60aaaggaggcg tctggcgtta gatttcgacc tgagtaccca
gttcgataac cctgtttggc 120gggatttcaa attgatctgg tgcacgcagc gcattacgtt
gcagcagcaa gtacagcttg 180ccgcgcagac gcaaatacca cgggcgtttg ccgaggatca
acgactcatg cgacataaag 240aaggaggttt ccatcatccg gcaacttaat ccttccagac
cgcagcggtg gaaaacttct 300tctacgtttg gcgtttctcg ccaaccataa cttgccacca
cgcgccagaa agtgggcgac 360agttgttcaa tctgtacccg acggacgtta tggacatatg
gagcgtcttc ggtgcgcaga 420gttaacagaa tcacccgctc atgcaatacc ttgttatgtt
taaggttatg catcagcgca 480aagggaatga cgttgattgc acgcgacata tacaccgcgg
tcccgggcac gcgaacaggc 540ggtgatttct ccagcgaagc aatcatcgct tccagagagt
taccatgttc atgcatccgc 600cgcagcaagc ggaaacgctc gctcttccag gtggtcatca
cgataaacat cacagtaccg 660aggctcaatg gcaaccagcc gccggagagc agtttatcga
ggttagcggt gaacaatgga 720atatcgacac aaaggaaagc aatcaggatc agcgcaacaa
aatacttatt ccagtgccag 780ttctgacgtg ccacggtagt cgagagaata gacgtcagca
ccatggttcc ggtcaccgca 840atcccgtacg ccgccgccag gttgctggag tgctcaaagc
tgacaatcac aatcacgacc 900gcgacataga gcatccagtt cacaaacgga atatagattt
gccctgactc catttcggag 960gtgtgaataa tgcgcatcgg cgacaaatat cccagacgta
ccgcctgacg cgtcaatgag 1020aagacgccag agataaccgc ctgcgag
1047101049DNAArtificial Sequencegene editing
homology arm pool_1_b0388_left 10cgcttcgcca aagcagcgga aaatattcag
cacatcagcg gtatcttctg gggttaccag 60cgcgtttggc gttacggagg tcatcccggc
attaaccagt gcgggagtgt tcggcatggc 120gcggataatt ttccggtcat ggcccagcgc
gcgggcaagc tggtcgagcg tgacacctgc 180agcaatagaa acgaccagag agtctttatt
caggctggag gtgatttcgc taagcacttt 240aatcatgatg ccaggtttaa cggcagcaaa
aatgatgtcg gcgatttgcg ccacttcttg 300cgccgattct gcggcgttga tgccgaactg
gtcatgcagg gcggcgactt tatccgggga 360gggggtgtat acccagattt gccctggaag
cacctgaccg ctggcaatca gaccgccgag 420aatggctttt cccatattgc cgcagccaat
aaaaccgatt ttcttttcca ttgcctcact 480cctgccgtga aattcattgt tttgataatc
gctggcagaa gcataaacag aactatgccg 540gaaggcaaaa gcgcgacaca atagaggatt
acccaacaaa ggatgacttt atgacaattt 600gggtggatgc cgacgcgtgt cccaatgtaa
ttaaagagat tttgtatcgc gcggcggaac 660gtatgcagat gccgctggta ctggtagcaa
accagagttt acgcgtgccg ccatcgcgat 720ttattcgtac gctgcgcgtc gcggcaggtt
tcgacgttgc cgataacgaa attgtccggc 780agtgtgaagc gggcgatttg gtgatcaccg
cagatatacc tttggctgct gaagccatcg 840agaaaggcgc tgcggcgctt aatccgcgcg
gcgaacgtta cacgccagcg accattcgtg 900agcgcctgac gatgcgcgat tttatggata
ccttacgtgc cagtgggatc cagaccggcg 960gaccagatag cctttcacaa cgtgaccgcc
aggcctttgc cgcggagctg gagaagtggt 1020ggctggaagt gcaacgtagt cgtggctaa
1049111046DNAArtificial Sequencegene
editing homology arm pool_1_b0388_right 11atgacacaac ctctttttct
gatcgggcct cggggctgtg gtaaaacaac ggtcggaatg 60gcccttgccg attcgcttaa
ccgtcggttt gtcgataccg atcagtggtt gcaatcacag 120ctcaatatga cggtcgcgga
gatcgtcgaa agggaagagt gggcgggatt tcgcgccaga 180gaaacggcgg cgctggaagc
ggtaactgcg ccatccaccg ttatcgctac aggcggcggc 240attattctga cggaatttaa
tcgtcacttc atgcaaaata acgggatcgt ggtttatttg 300tgtgcgccag tatcagtcct
ggttaaccga ctgcaagctg caccggaaga agatttacgg 360ccaaccttaa cgggaaaacc
gctgagcgaa gaagttcagg aagtgctgga agaacgcgat 420gcgctatatc gcgaagttgc
gcatattatc atcgacgcaa caaacgaacc cagccaggtg 480atttctgaaa ttcgcagcgc
cctggcacag acgatcaatt gttgattttc gagcgcctat 540acttaacgtt catcccgtga
aataaggaag aacgatgcca acgaaaccgc cttatcctcg 600tgaagcatat atagtgacga
ttgaaaaagg aaagccagga cagacggtaa cctggtacca 660actcagagcc gatcatccta
aaccagactc gttgatcagt gaacatccga ccgctcagga 720agcgatggat gcgaaaaaac
gctatgagga ccctgacaaa gagtgaccgc atcagactgc 780tcggaaggga ttctgagtgc
cactacaagg gatctgcgtc acatttttca taattcatgt 840ttttctaata attagaatat
taaacaataa caatccatta ctggaatcat ttggaatctt 900tacattatgc cgtgcacgtc
tgctgctacg ctttttgtca tttgtagcac aagtaagtgt 960cagcagtggt gcttcacact
tgcccggtaa ttaacgacga aagaaaagta aggtggatga 1020acaatgagtg cgtcgttggc
gatcct 1046121050DNAArtificial
Sequencegene editing homology arm pool_1_b4348_left 12gcagttcatc
attgcatttc gtgctgaggg ggatgaaaaa aatatttcga tatattctgg 60taaagcatct
ttggttaatc gagctcgaat aagtttatca ggatatagca aattttgatg 120ttgtaatttt
ttcaataacc cacaaacacc aacaaattct aaacttccgt tatagcgagt 180aaataaaaga
tctccatctt gtaatttgtg gcggtttagt tcactttctg aacattctag 240aaaccgaata
tcgttttgat ctacatggcc agcacgtaca gaactaatgc gtagtattgg 300atgaccaaca
ccactttcat ttggctttga tgaaagacca ttacgtaatt cagttaagat 360agattcaaaa
tttaacttct taaatacaga atgttgcggc tcaaaattac gccatttttc 420tgtcaatttt
ccattaactg cgccccccaa taccgcttga cgaaaacgtt tcaggatttg 480tgggatttgc
tcaaaacgtg ctttggtgct gtctacctgc gccagcagcg tatcgagttt 540ttcagcgatg
attttttgtt cggcaagtgg tgggattggt atatttatca aatcaaagct 600tgccggctta
atattattaa tatttgcacc agcagaaagt gatgaaattt tgtttcgata 660aagagaagat
tttgtgaaat gagcaataaa accagaaaat ataagttttt caggacgtaa 720tacaccgcaa
aatgcgccga aactacattc aaatggtaga tgctgatgtg cggatttacc 780aactacggat
ttgctccctg atgacattgc aataacaata tcttcaggag atattttttg 840actttcttta
acaagatttt taggaacaaa aaccaagtcc gtagtatcaa acttgccatt 900ctgaatattg
ttcgcacgga taagaggcaa ataatcatct tttagataat ttattgcctg 960ctctttttta
tacgttactc ctcggattag agttgtgacc gtagatactg gggcgataac 1020ccacccctcc
ggcaatttcc ccgcactcat
1050131050DNAArtificial Sequencegene editing homology arm
pool_1_b4348_right 13tccttcaccc caccaaacgc ttcttccagc aactgacgct
gcaaatcggc ctcatcgctc 60gcccccagtt cacgcatcag cgcatccagt tcagacagcg
cctgtaccag ttcgcccatc 120gcttctgccg ctaatacatc cggctccggc aggctgtcgg
catcaatact gtctttatct 180ttcagccagg agatatccag cgaatcggat tttgcggtgc
ggatccactc acggctgaac 240ttgcgccagc ggctggtagc aagatgctgg tcggtgtttt
tgttctcttc gctgtcggca 300acttccgtct cttcggcgtt aaaactccat tcaccttcag
tgcgcgggct taaaccgtgc 360gggtcttcgc catacacgcg ctcaaacggc tgcaaatgct
cgtcggtaaa cggtgtgcgc 420ttgccgaaac tcggcatatt ggtacgcagg tcatacaccc
acacatcatc ggtacagttc 480ttatcctgat tcgggttcgc caccgtccct ttggtaaaga
acagcacgtt ggtcttcacg 540ccctgagcgt aaaaaatacc ggtcggcaga cgcagaatgg
tgtgcagatg acacttatcc 600atcaggtcac gacgaatgtc ggtgcctttg ccgccttcaa
acagcacgtt atccggcacc 660accaccgccg cacgaccgcc gggatgcagc gtttcgataa
tatgctgcat aaagcacaac 720tgtttgttgc tggtcgggtg aacaaaggtg cgggtaatgt
tggtgcctgc ggcgctgcca 780aacggcgggt tagtggcgac aatatgcgcc ttcggcaggt
tttcaccgtc gctacccaga 840gtgttgccca gacggattgc gccgccgtgg tcgaggttgc
cttcaatatc gtgcagcagg 900cagttcatca gtgccagacg acgggtgccg ggcaccagtt
cgaggccgat aaacgcgcgg 960tggatctgga aatcctgcgt gtcgccatca aggtcgtcca
gatcattggt ttgcgactta 1020acatagcggt cggcttcaat caaaaagccc
1050141050DNAArtificial Sequencegene editing
homology arm pool_1_b1982_left 14taagcgcatg ttaatgctga ccgtctggat
gatgggcatc gcgacagcct tgattggtat 60tcttccttca ttctcgacca ttgggtggtg
ggcacctatt ttgctggtga cactgcgtgc 120cattcaggga tttgcagtcg gcggcgaatg
gggaggcgcg gcgttgcttt ccgttgaaag 180tgcaccgaaa aataaaaaag ccttttacag
tagcggtgta caagttggct acggtgtagg 240tttactgctt tcaaccggac tggtttcatt
gatcagtatg atgacgactg acgaacagtt 300tttaagctgg ggctggcgca ttcctttcct
gtttagcatc gtactggtac tgggagcatt 360gtgggtgcgc aatggcatgg aggagtccgc
ggaatttgaa caacagcaac attatcaagc 420tgccgcgaaa aaacgcatcc cggttatcga
agcgctgtta cgacatcccg gtgctttcct 480gaagattatt gcgctacgac tgtgcgaatt
gctgacgatg tacatcgtta ctgcctttgc 540acttaattat tcaacccaga atatggggct
accgcgcgaa cttttcctta atattggttt 600gctggtaggt ggattaagct gcctgacaat
tccctgtttt gcctggcttg ccgatcgttt 660tggtcgccgt agggtttata tcacaggtac
gttaatcgga acgttgagcg catttccttt 720ctttatggcg cttgaagcac aatctatttt
ctggatagtt ttcttctcca taatgctggc 780aaacattgcg catgacatgg tggtgtgtgt
gcaacaaccg atgtttaccg aaatgtttgg 840tgccagttat cgctatagtg gcgctggagt
cggttatcag gttgccagtg tggttggcgg 900tggatttaca ccttttattg ccgctgcact
catcacttac tttgccggga actggcatag 960cgtcgccatt tatttgctgg ctggatgcct
gatttccgca atgaccgctt tgttgatgaa 1020agacagtcaa cgcgcttgat agcctggcga
1050151050DNAArtificial Sequencegene
editing homology arm pool_1_b1982_right 15atgaataata agggctccgg
tctgacccca gctcaggcac tggataaact cgacgcgctg 60tatgagcaat ctgtagtcgc
attacgcaac gccattggca actatattac aagtggcgaa 120ttacctgatg aaaacgcccg
caaacaaggt ctttttgtct atccatcact gaccgtaacc 180tgggacggta gcacaaccaa
tccccccaaa acgcgcgcat ttggtcgctt tacccacgca 240ggcagctaca ccaccacgat
tactcgccct actctctttc gttcgtatct taatgaacaa 300cttacgttgc tgtatcagga
ttatggtgcg catatctcag tgcaaccctc gcagcatgaa 360atcccttatc cttatgtcat
cgatggctct gaattgacac ttgatcgctc aatgagcgct 420gggttaactc gctacttccc
gacaacagaa ctggcgcaaa ttggcgatga aactgcagac 480ggcatttatc atccaactga
attctccccg ctatcgcatt ttgatgcgcg ccgcgtcgat 540ttttccctcg cacggttgcg
ccattatacc ggtacgccag ttgaacattt tcagccgttc 600gtcttgttta ccaactacac
acgttatgtg gatgaattcg ttcgttgggg atgcagccag 660atcctcgatc ctgatagtcc
ctacattgcc ctttcttgtg ctggcgggaa ctggatcacc 720gccgaaaccg aagcgccaga
agaagccatt tccgaccttg catggaaaaa acatcagatg 780ccagcatggc atttaattac
cgccgatggt cagggtatta ctctggtgaa tattggcgtg 840ggaccgtcaa atgctaaaac
catctgcgat catctggcag tgctacgccc ggatgtctgg 900ttgatgattg gtcactgtgg
cggattacgt gaaagtcagg ccattggcga ttatgtactt 960gcacacgctt atttacgcga
tgaccacgtt cttgatgcgg ttctgccgcc cgatattcct 1020attccgagca ttgctgaagt
gcaacgtgcg 1050161050DNAArtificial
Sequencegene editing homology arm pool_1_b4367_left 16cgttaagccc
gacggagaga gaaatattgc gctttatgtc gcgtggctac tcaatgacac 60aaattgccga
gcagcttaaa cgcaatatca aaacgatccg tgcacataaa tttaatgtga 120tgtcgaaact
gggcgtcagt tctgacgcag ggttgttgga ggccgcagat attctgttat 180gtatgcggca
ttgcgaaaca agtaatgtgt tgcatcccta ttaatccgca tgatgccggg 240tttacttccc
cggcagtgct ttcatttcag cgtacaatcg ccacattgct gcacatccgg 300taagcgataa
cgctggcagc aagtgcggcg caccagcagg ccgtcgcgca gtaccacggt 360acgccagagt
ggattatctt caccgttcgt gagcgttttc tcaaaaaaga gggcatggcg 420cagcgattca
acagtagcct cgccgagcag ttgcttcatc tcagtgagat accagttgat 480caaataaccg
gtattactcc agataagttt gccgttgatc tctccggtcg cttctagtgc 540ttgcacaacc
ggaaccagcg cctggctgat taacgtttcc attcgatgct gcggcgaatg 600tggtgttgcg
tttttatctt cacacacatc gacccagaaa caggcgacgc gtccggtttc 660gtgaaactca
gcatggaaat gttccggcga cacatctaat gccttttcct gcgtcagtag 720cgccagcatt
aatggtggca ccatcaggcc gatataccat tgtgcccata gtgagatcag 780cggtttgttc
tcgcggatca tcatcggttg gttgcgatag atatgatcgg aatagaccgc 840cagcagagaa
cttagcacat tcggtgatga ccattgcgcc agcgtcatgg cgttaagtgg 900ggcaggttca
tccaggcgga taaactccag caaatgttca cgatgttttg cgatcgtcgc 960ccgcacggct
tgcgcaagcg tgggatcctg cggctggaga tgcgttcgcc agatgacatc 1020ttcatagagc
ggtgcggaac gataggccat
1050171050DNAArtificial Sequencegene editing homology arm
pool_1_b4367_right 17aatcgggata gtaatctaaa tgataatgat tgctaatcat
agcgataggt ttacccgata 60gcaagggatt tatctggctt gcaaatgata aaaattatca
tatgatattg gttatcatta 120tcaatgaaag agatgaaatc atgttgcaac gtacgctggg
cagtggctgg ggagtgttgc 180tgccgggatt gctgattgca gggctgatgt atgcggattt
atcgtcagat cagtggcgga 240ttgtcattct gatgggatta gtattgacgc cgatgatgct
gtatcacaaa cagttgcggc 300attacatttt gctaccatcg tgcctggcac ttattgctgg
catcatgctg atgataatga 360atttgaatca gggatgaaaa atcaaggaag aaacaagaaa
ggaagtaaag ataattggtg 420cgaggggggg gacttgaacc cccacgtccg taaggacact
aacacctgaa gctagcgcgt 480ctaccaattc cgccaccttc gcacagtcat cttacttttt
ttgatatcgc ctcgtttggt 540gcgagggggg ggacttgaac ccccacgtcc gtaagaacac
taacacctga agctagcgcg 600tctaccaatt ccgccacctt cgcccagtgc gagcaatatc
aacgtggttt ttggtgcgag 660gggggggact tgaaccccca cgtccgtaag gacactaaca
cctgaagcta gcgcgtctac 720caattccgcc accttcgcat accatcaatt cttaaaaaga
attgctacca cggaggcgca 780ttctagtggt tttcagcttt tcgtcaatag ttaattatcg
acagaggtgt aattgctgga 840aaaatgtcca tcaggaaact agcgtgcagg tttggtatgc
atgcgggggc agatgccaga 900tgcgacgctg gcgcgtctta tctggcctac gaagggctaa
cgtgcaggtt ttgtaggtcg 960gataaggcgt tcacgccgca tccgacacgg tattcggcga
gataattaac ctttcttcgc 1020ctggcgggtc ataatggcgc gatacacctt
1050181050DNAArtificial Sequencegene editing
homology arm pool_1_b2285_left 18ctgtttcttc cccgcagatg tagcgccctg
ccccggtatg gacgaacagt tcgaaatcga 60aacctgttcc cataatgttt ttgccaagca
gacccgcttc ggtggcttcg gcaatggcac 120ggcgcagatt aactgccgct tcgatatatt
cgccacgcag gaagatgtag ccacggtaag 180ctttcagcgc aaacgcggag atgagcatac
cttccaccag caggtgcggc agttgctcca 240tcaacaggcg gtctttatag gtgcccggct
ccatttcatc ggcattacac agcaggtaac 300ggatgttcat ggattcgtct ttcggcatca
ggctccattt caggccagtc gagaagcccg 360cgccgccgcg ccctttcaga ccagcgtctt
ttacctgatt aacgatttcg tccggagaca 420gcccggtcag cgccttacgc gcgccttcgt
aaccgttttt gctgcggtat tcgtccagcc 480acactggctg tttgtcatcg cgcagacgcc
aggtcagcgg atgcgtttcg ggagtacgga 540taatgttttt catttatacc gctccagcag
ttcagggatc gcttccgggg tcagatgcgc 600gtgagtgtcc tcatcgatca tcatgtttgg
ccctttatca cagttcccca ggcagcaagt 660tggcagcagc gtaaagcggc catcaaatgt
cgtttgccct ggtttgatgt tcagcttttt 720ctcgagcgcc gcctgaatac cctgataacc
gttgatatga cagaccacgc tgtcacaata 780acggatcaca tggcgaccaa ccggctggcg
gaagatctga ctgtagaacg ttgccacacc 840ttcgacgtcg cttgccggaa tacccagcac
atcggcgatc gcgtggatcg caccatccgg 900cacccagcca cgctgcttct gaacgatttt
cagcgcttca atggacgccg cacgcgggtc 960ttcgtagtgg tgcatctcgt gctcgatcgc
ttcacgctct gccgcactca gctcaaaagc 1020ctcggtttgt ggttgttgat tctcgtgcat
1050191050DNAArtificial Sequencegene
editing homology arm pool_1_b2285_right 19aattagcggt ccacatctga
cataacaaaa tcgatactgc ccagataaac aatcaggtca 60gacaccaggc tgccgcggat
cgccgccgga atttgctgca aatgcgcaaa gctcggggta 120cgaacacggg tgcggtaact
catggtgctg ccgtcgctgg tcaggtagta actgttgatc 180cctttggtcg cctcaatcat
ctggaaagat tcattggcag gcatcaccgg accccacgac 240acttgcagga agtgggtgat
cagggtttcg atatgttgca gcgtgcgctc tttcggcggc 300ggcgtggtca gcgggtgatc
cgctttgaac gggccttccg gcatgttgtt gaggcactgc 360tcaagaatgc gcagactctg
gcgcagctct tccactttaa gcattacgcg ggtgtagcag 420tcagaaacgc caccacccac
cgggatttca aagtcgaagt tttcatagcc agaataagga 480cgcgccttac gcacgtcgaa
gtcgatcccg gtagcacgca ggcccgcgcc agtggtgccc 540cactccagcg cctctttcgc
gccataggcg gcaacgccct gggaacgacc tttcagaatg 600gtgttttgca gcgccgcttt
ctcgtaagac gccagacgtt tcggcatcca gtcgaggaac 660tcacgcagca ggcgatccca
gccgcgcggc aggtcgtgcg ctacgccgcc aatacggaac 720cacgccgggt gcatacggaa
accagtgatt gcttccacca gatcgtaaat tttctgacga 780tcggtaaagg cgaagaacac
tggcgtcatt gcgccgacgt cctgaataaa ggtcgagata 840tacagcaggt gactgttgat
gcggaacagt tcggagagca taacgcgaat gacgttaacg 900cgatccggca cggtgatccc
ggccagtttc tctaccgcca gcacgtaagg catttcgtta 960acgcagccgc cgaggtattc
gatacggtca gtatacggaa tgtagctgtg ccaggactgg 1020cgttcgccca ttttctccgc
accacggtgg 1050201025DNAArtificial
Sequencegene editing homology arm pool_1_b2405_left 20gagcgaagcg
agtcatcctg cacgacccac caatgtaaaa aagcgcccta aaggcgcttt 60tttgctatct
gcgatttgcg aaattgcctg atgcgcttca cttagcagac tactatttcc 120ggcaattcct
gtctcctcac ctactgtgtc aatgcagcca acagcttaac catcgcgggc 180gtcacctgct
gtgtttcata aacaatatat aaatctgcag ggatgcgctg tttgagcgga 240cggaaaatga
cacctggcca gttcatttgt gcgtagctgt ccgctatcaa tgtgatacca 300atgcccatac
tgaccatagc gagtaccgtt tgcggttcat taacttcgcg aataacaacc 360ggtgaaaatc
ccacctgctg gcaaactcgc tgcaaaaaat cccagtcagt gtaaacgggc 420ggcattgtaa
caaaatactc gtcacgtagc gcttccagcg ggacggtgga aaatgatgag 480agatgatgct
cttcaggcat cgccaccaga aacgccgatt catgcaaccg taagctggta 540aaaccagtcg
gtggttctgt cgccattcgc cagatcccgg catcaagttc gcggcgttcc 600agcaaggcca
tttgcatcgc gggcatcttt tcgcgaaaaa gaacgtcaac gttaggattt 660tccctgagga
atcgccgcat aaccgggcgc atccgtcccc acattgccgt tcccactacg 720ccgagttcaa
tccgccctgc ttctccccga cctatttgtt caatccgagc caatacatta 780ttagcattca
ccagcaatcg acgcgattct tccatcaaga ttttgcccgc gtgtgtcagt 840acgacgctgc
gcgaatggcg aataaaaagc tgcgtgccga gttgattttc cagctcttta 900atatgaatgc
tgagcggagg ctgagacata tttaaacgcg ctgctgcgcg gccaaaatgc 960aactcttccg
ctacggcaag aaaataacgg agcaacttaa gatctgttct gtatacgcgt 1020tccat
1025211050DNAArtificial Sequencegene editing homology arm
pool_1_b2405_right 21aacaaagcac caataccaaa accaacgccg gaagaaaata
aaatatcttt cactaattaa 60cctttatcat aaaagcagct ctgaagagca gagccgcgaa
tccttttaat gagtcaccgc 120tcgatgcttt atcttttcag ggtcatgatt atatttaaac
ccaaagaaaa atatcactgc 180gagaaaaaga gcatatcctg caaacaccag ccagatagtt
tgccagtctt ttacgccatc 240caccgaaaag taatctactg ccatgccact cagaatcgag
ccaacccatg cgccgacacc 300atttaccatg gtcataaaga gcccctgcgc gctggcacga
atgctggaat caacttcctg 360ttcgacaaat accgaaccag aaatattgaa gaaatcgaat
gcacagccat aaacaatcat 420cgacagcagc agcaaaataa atccggttgt tgacggatcg
ccataggcga agaagccaaa 480gcgcagcgtc caggccacca tactcatcag catgacggtt
ttaatgccaa atcgctttaa 540aaagaatggg atagtcagta taaagcccac ttctgccatc
tgtgaaactg acagtaaaat 600ggagggatat ttcaccacaa aactgtcagc aaactccggg
ttacgggcga aatcatgtag 660gaacggatta ccaaaaacgt tggtaatttg cagtaccgca
cccagcatca tggcaaagag 720gaaaaagatg gccatgcgtg gatttttaaa cagcacgaag
gcatccagac ccagcttgct 780ggcaagcgat gtggtcgctt ttttctccgc aaccggaatc
ttcggcaaag tcagcgcata 840agccgacagc agcaatgacg caccggacgc gatatacagc
tgcagactac tcaattccag 900atgcagcagg cttactgccc acatcgcgac aatgaacccc
accgtaccaa aaacgcgaat 960gggcgggaaa gcggtcaccg ggtcaagccc tgcctgggca
agacaggaat aagagacgct 1020gttcgataac gcaatagtcg gcataaacgc
1050221050DNAArtificial Sequencegene editing
homology arm pool_1_b0495_left 22cgacccggcg tggagattaa tcccatcacc
gatgatgtca tcacaatacg cccttcaccg 60tgcggtaaca tcgcgggtaa caggcgcatg
gtgagctggt gtgcgccgaa aaagttggcg 120gaaaactgct gttccatctg cgcacggctg
atggtggaaa gggggccata catgccgaat 180ccggcattgt taaagatccc atacagacaa
ttatcggtca gggcgatcac ctcgtcggct 240gcgcgatcaa cactttctgg tgaatccaga
tcgatcaaca cgccggtaaa tcccatgctg 300ttcatgcgct caacatcatc cggtttccgg
caacctgcca gcacatgaaa accctggcgt 360tttaattcga gcgcgctttc caggccaatt
ccactggaac atccggtaat taagaccgat 420ttttgcataa ctttacctgt caggatctcc
gttgctttat gagtcatgat ttactaaagg 480ctgcaactgc ttcgccatcc agtcggcaat
aaacggctgg gcgtcgcggt tgggatgaat 540accgtcatcc tgcatccatt gtggcttgag
gtagacctct tccataaaaa agggcagcag 600cggaacatca aactctttgg cgagtttggg
gtaaatggcg ctaaaggctt cattataacg 660gcgaccatag tttgcaggca gacgtatttg
cattaacaat ggttcagcgt tggcggcttt 720gacatcctgc aaaatctggc gcagcgtttg
ctcggtttgc tgtggctgaa aaccacgcaa 780accgtcattg ccgcccagtt caaccagcac
ccaacgcggc tgatgctgtt tcagcagagc 840cggaaggcgc gccagtcctt gttgcgaggt
gtcgccgctg atgctggcat taactaccga 900cgttttactc tgccacttat cattcaacaa
ggcaggccag gccgcgctgg cagacattcg 960atacccggcg ctcaggctat cacccagaat
caataacgtg tccgctgcgg cggcacggaa 1020ggttaacagg accaggaaca ggaagggcaa
1050231046DNAArtificial Sequencegene
editing homology arm pool_1_b0495_right 23atgccagcgg aaaacattgt
tgaagttcat catcttaaga agtccgtcgg tcagggggag 60catgaactct ccatcctcac
cggagttgag ctggttgtca aacgtggcga gaccatcgca 120ctggtgggcg agtcgggatc
gggtaagtca accttgctgg cgatcctcgc cgggcttgat 180gacggcagca gtggcgaagt
gagtctggtg ggacaaccgc tacataatat ggacgaagaa 240gcgcgggcaa agttgcgcgc
gaagcacgtc ggctttgttt ttcagtcatt tatgttaatt 300cctaccctta acgcgctgga
aaacgtcgag cttccggctc tgctgcgcgg tgagagtagc 360gcggaaagtc gtaacggggc
gaaagcgttg ctcgaacagt tagggctggg taaacgtctg 420gatcatcttc cggcacagct
ttccggcggt gaacagcaac gagtggcgct ggcacgagcc 480tttaatggtc gacctgatgt
gctgtttgcc gacgaaccca ccggcaacct tgaccgccag 540acgggcgata aaattgccga
cctgctgttt tccctcaacc gtgaacatgg caccacgttg 600attatggtga cccacgacct
gcaactggcg gcacgctgcg accgctgctt acggctggtg 660aacgggcagt tgcaggagga
agcatgattg cacgttggtt ctggcgcgaa tggcgttcgc 720cgtcgctatt aattgtctgg
ctggcgctaa gcctggcggt ggcctgcgtg ctggcgctgg 780gcaatatcag cgatcgcatg
gagaagggct taagccagca aagccgtgag tttatggcgg 840gcgatcgggc gttgcgcagt
tcacgcgaag tgccgcaagc gtggctggag gaagcgcaaa 900agcgcggcct gaaagtcggc
aagcagctga ctttcgccac aatgaccttt gcaggcgaca 960caccgcagct ggcgaacgtc
aaagcggtgg atgatatcta cccgatgtat ggcgatctgc 1020aaactaatcc ccctggcctg
aaaccg 1046241050DNAArtificial
Sequencegene editing homology arm pool_1_b1646_left 24aaaatcgttg
gcgtcgcgtt ggcgtggtta gcgttcgcca ttctgcgtcc aggatcggat 60gctcgtaaaa
gccgccgcca tattcgcgcg ctgcgccggg attttgtcga tcagctaagc 120cgccatccaa
cactgagtga aagcgaattt gaatcgctca cttatcatca cgtcagtcag 180ttgagtaaca
gccaggatgc gctggctcgc cgttggttat tacgctgggg tgtagtgctg 240ctgaactgtt
ctcatgttgt ctggcaattg cgcgactggg aatcgcgttc cgatccgtta 300tcgcgagtac
gggataactg tatttcactg ttgcggggag tgatgagtga gcgtggcgtt 360cagcaaaaat
cactggcggc cacacttgaa gaattacagc ggatttgcga cagccttgcc 420cgtcatcatc
aacctgccgc ccgtgagctg gcggcaattg tctggcggct gtactgctcg 480ctttcgcaac
ttgagcaagc accaccgcaa ggtacgctgg cctcttaatt acttaattac 540accacaggca
tagcgttcac cgccaccgcc cagcggttta ggttgatcgg acatattatc 600gccgccaacg
tggaccatca gcgctttgtc tttgatttca tccagtgatt tcagacgagg 660cgcgatgacg
gcatcggtag ctttgccgtc attattgacg accagtgcag gcagatcgcc 720taaatgcccg
gcaccttctg gcccttcatg tttaccggta ttttgtggat caagatgccc 780gcctgcggat
tccgcggcgc tggctttgcc atctttggtg gctggctggc agcttccttt 840ggcatgaata
tggaagccat gttcaccggg gggtaatgct ttcagatcgg gcgaaaactc 900cagaccttta
tcggtttcag taatggtgac gctaccaatt gactgcccta ccccttgcga 960cgtgacgagg
ttcatctcga ctttttcact ggcagcttgt gcgccggttg caacaaccag 1020cgccagaata
gccagactaa aacgtttcat
1050251050DNAArtificial Sequencegene editing homology arm
pool_1_b1646_right 25ttacggtacg tcgtacccca gtgccgcttt acggatacga
aaccattgtt gacgggtcat 60tttcagtgtt tctgcttcga cagctgcccg tacgcgctca
attttacctg aaccgataat 120tggcagcggc tgcgatggta aacgtaatac ccaggcgtaa
accacctgtt caatcgagcc 180cgcgtttaac tcctctgcca ccacagccag ttcatcacgc
agcggctgga aataatcatc 240attaaacaga cgaccaccac caaggcagga ccacgccatc
ggacgaacac gcagttgttg 300tagttggtcg agcgtgccat ccagcagtaa cggctgatgc
accggggata tttccacctg 360attagtggca agggtaaacg gcagacgtga ttgcaacagg
gcaaattgcg caggcgtaaa 420gttcgatacg ccaaaatgac gcactttgcc gctctgatgc
agatgtttga acgcgtccgc 480cacttcatcg gcatccatta acgggtctgg tcggtggatt
aacagcaaat ccagatgatc 540ggtcgcgaga ttaattagcg actgttcggc gctcttaatg
atgtgatcgc ggtcagtgat 600gtaatgacca atgacgtttt cttcacgcgc ggtcgtcgcg
ataccgcatt tactgacgat 660ttccatccgt tcacgcaggt gaggtgccag tttcagtgcc
tcgccaaacg ccgcttcgca 720ctgatagcca ccataaatat cagcatggtc cacggtggtc
acgccgagat ccagatgctc 780ttcaataaaa ctgaccagct ggcgggcgga catattccag
tccatcaatc gccagtagcc 840catcacaaaa cgggaaaact ccgggccttg cggcgcaata
gtaatacgct gaaccataat 900cgcttcctct tatcagatat gagaggagta tacgcaagat
taggttcaaa agagtgatgg 960ttgctccggt tcgtctgatg acgctggctt atttgcgcgt
aatttgcgca ttaatcgctg 1020ccgacaaagg cgcagcacct cttgtttttc
1050261050DNAArtificial Sequencegene editing
homology arm pool_1_b3189_left 26attgctttcg atttcggcgt gcgcgcccat
acggctcagc tctggcacat gcataaagcg 60gttttcaaag accgtttcgg tgataaaccc
ggtcccttct gccaccaggt tcaacagcgt 120gaactgggcc tgcatatcgg tcgggaatgc
cggatgcggc gcggtacgta cgttaacagc 180cttcggacgt ttgccatgca tatccaggct
aatccagtct tcgccgactt cgatgtccgc 240tccagcgtca cgcagtttcg ccagcacggc
gtcgagagta tctggctgcg cgttacggca 300gataattttg ccgcgagaaa tcgccgccgc
caccaggaaa gtaccggttt cgatacgatc 360cggcagaacg cgatagacac cgccgcctaa
acgttccaca ccttcgatga cgatacgatc 420ggtgccctga ccgctaattt tcgcacccag
cgtaatcagg aagttcgcgg tatcgacgat 480ttccggttca cgcgctgcgt tttcaataat
cgtggtgcct tccgccaggg ttgcagcaca 540catgatggtc accgttgcgc caacgctgac
tttatccatc acgatatgtg cacctttcaa 600acgaccatcg acggaagctt taacgtaacc
ttcttccagt ttgatggtcg cgcctaattg 660ttcgaggcca gaaatgtgta gatcaaccgg
acgcgcaccg atcgtacaac cgccaggtag 720tgaaacttgc ccctgaccaa agcgcgctac
cagcggcccc agcgcccaga tagaagcacg 780catggtttta accagatcgt aaggtgcgca
gaatacatta acgtcgcggg catcaatatg 840cacagaacca ttacgttcta ctttcgcacc
cagctggctt agcagcttca ttgatgtatc 900gacgtctttc agtttcggga cgttctggat
ctctaccggt tcttccgcca gtagtgcggc 960aaaaaggata ggcagagcag catttttagc
gccggaaatt gtgacttcgc cctggagctt 1020cgttggcccc tgaacacgaa atttatccat
1050271050DNAArtificial Sequencegene
editing homology arm pool_1_b3189_right 27ttagtttgtt ctcagttaac
aattcatatc cgctaccggc gaatcgccca tagctcaaaa 60gccgttcagt ttgcgatcgc
gcgcccactc cgcaggggta tacgctttga tcgacacagc 120atgaatgcgg ttatccgcaa
tatattccat cagcggacca tagaccgtct gctgtttttt 180aacccgactc atgccgtcaa
acaactcacc cacggcaata acctgaaagt ggctgccatc 240gccggaaacg tggacttcct
ggagggagag agcgttcatc aacacgctct gaatttcatt 300attttccatg ggatcttcaa
tcatcagtta ataaaccagc gaaacatctt agagcaaagt 360tgcgctggca taaataagca
aaaagcctcg ctgataaatc agacaaggct cgacttgcag 420gcaggtttgc cggacaggcg
gttaacgcca tatccggcct gaaaaaattt aacgaggcag 480aacatcagca ggcaaattat
acaatttcgc cagggtatac actttgtcgt ttaccccctg 540aagcgtcaca ttgttgccct
gctttttcgc cagatcgata agatggagca gcagtgccag 600tccccccgta tccacgcggg
agacacggct aagatcgatg caggtaatcc ccttcaccgc 660ttcctcacgc atttcccaaa
gcggtagcaa aacgtcctga tccagctctc cggataacgc 720cagcgtgtca cccgtctgca
tccagctcag tgactcgctc attatttttt ctcttccaga 780gtgattttct gttgagaaat
cgatttcagt tgcgcagtca ggccgtcgat acctttggta 840cgcagcagcg ttccccactc
gttttgtttg gtggtgatca tactgacgcc ttcagcaatc 900atgtcgtaag cctgccaatt
gcccgtctgg gagtttttac gccactggaa gtccagacgc 960accggcggac ggccattcgg
gtcaataatg gtaacgcgaa taggcacaat ggttttatcg 1020cccagcggct gttctggcgc
aatctgatag 1050281048DNAArtificial
Sequencegene editing homology arm pool_2_b3125_left 28gcgcaacctc
gctggcagat aaactttctt tataagagtc tggggcgatt acgattttca 60tacctatgcc
tgttaccaca tgacgccgga gggcgtttct cttattcggc ctggattcca 120ggcccggatt
gcaatacgcc atccgggcac gacgtcatta acgagtaact tcgactttcg 180ccagtttttc
gtagtagcac gccagggcgc tatgatccgc cgttcctaaa ccatctgctc 240gcagtgcctg
catcatctcc ataaccgcag ctgtgagcgg cagttgtgcg ccgacgccgt 300gagaagtatc
cagcgcattc gccagatcct taatatgcag atcaatacgg aagcccggct 360tgaagttgcg
gtccatcacc atcggcgctt tggcatccag cacggtactg cccgccagtc 420caccgcgaat
tgcctgataa accaggtccg ggttaacgcc cgctttagtt gccagcgtta 480acgcttctga
catcgcggca atattcagcg ccacaatgac ctgatttgcc agtttggtga 540cgttacctgc
accgatttcc ccggtatgca ccacggaacc cgccatcgct ttcatcaaat 600catagtattt
gtcgaaaata gccttgtcgc cgcccaccat cactgacagc gtaccgtcga 660tggctttcgg
ttcaccgccg ctcaccggag catccagcat atcaatgcct ttcgctttca 720gcgcttcgct
gatttcacgg cttgccagcg gtgcgataga actcatatcg atcaataccg 780tacctggctt
cgcgccttca ataatgccat tctcacccag cgccacctct ttcacatgag 840gggagtttgg
cagcatggtt atgatgacgt cgcactgttc agcgatcgct ttagccgtag 900acgctgtttc
tgcacctgca gcaatcacgt cagcaatagc ttctgggtta cggtcagcaa 960ccaccagcga
gtaacctgct ttcagaaggt ttttactcat tggtttaccc ataatcccca 1020ggccaataaa
accaactttc atagtcat
1048291050DNAArtificial Sequencegene editing homology arm
pool_2_b3125_right 29atcaatcatc tctcttgttg cggtggtggt tattttttaa
aggtatcagc cagtttctga 60gtggcagagc ggaagacgcc gagatcgctg ccgacagcca
caaacgtcgc gccccattcc 120agataacgac gcgcatcggc ttcgaccggc gcgaggatac
cgctgggttt gccgtgcgcg 180ctggcacggt taaaaatgtg ctgaattgct ttttgtacat
ccgggtgtga tgcattgccg 240agatggccta atgccgcggc cagatcgctg gggccgacga
agatgccgtc tacgccttcg 300gtagcggcaa tggcatcgac gttatctacg ccctgctgac
tttctatctg gaccagaata 360gtgatgttct tgttcgactg agcgaaataa tccgccacgg
tgccaaacat attggcgcgg 420tgagaaacgg agacgccgcg aatgccttcc ggtgggtaac
gggttgatgc caccgccagc 480tctgcttcct cttttgtttc tacaaaagga atcaggaagt
tatagaaacc gatatccaga 540agacgcttaa taattaccgg ctcgttggtc ggcactcgca
ctactggcgc gctggcgctg 600cctttcaagg ccattaactg cggaataaac gtggagatat
cgtttggcgc atgttcgcca 660tccagcacca gccagtcaaa cccagccaaa ccaagaactt
cagtgctaat cgggttagag 720agtgctgacc agcaaccaat ttgtacctgt ttcgcagcca
gtgcggcttt gaatttattc 780gggaaaacat cgttattcat cgcttatacc tttgcttatt
tctgcaattc catacgttta 840atgtcgccaa ctacgaagag gtagcagacc atcgccatca
gcgctgaaca tcccacgaaa 900accagtgctg cattgaagga gtgcagttca cttaccaggt
agccaatcac cagtggagtg 960acaatggagg caacattgcc aaagacgtta aagacgccgc
cgcagaggcc aacaatctct 1020ttcggcgcgg tgtcagaaat caccggccag
1050301050DNAArtificial Sequencegene editing
homology arm pool_2_b3787_left 30ggcgatcagt tcagcgattt ccatcgccgg
gctttcgcgc aggtcatcaa tattcggttt 60aaacgccaga ccaaagcagg cgattttcag
ttcactggcg cgtttatcgg tagccgccag 120gcaatcagcc accgccgctt tcacctgatc
gataacccag aacggtttgt gatcgttcac 180ttcgcgcgcg gtacggataa gccgcgcctg
ctgggggttc tgtgccacga taaaccacgg 240atcaacagca atgcagtgac cgcccacgcc
agggccaggc tgaagaatat taacgcgagg 300gtgacgattc gccaggcgaa tcagttccca
gacgttaatc ccctgatcgg cacaaatcag 360cgacaattca ttagcaaaag cgatattcac
atcgcggaag ctgttttcgg tgagcttaca 420catttccgcc gtccgcgagt tagtgacgac
acactcacct tcgaggaaaa ttttgtacag 480ttcgctggcg cgggccgaac aaaccggcgt
cataccacca atcacgcgat cgtttttaat 540cagctcgacc attacctgtc ctggtaacac
gcgttccggg cagtaagcaa tgttgacgtc 600cgcctgctcg cccacctgct gcgggaaagt
gagatccgga cgcatctctg ctaaccattc 660tgccatcttc tcggttgacc ccaccggcga
ggtggattca aggatcacca gcgcgccttt 720tttcagcact ggcgcaatgg agcgagcagc
cgattcaacg taggtcatat ctggctcatg 780atcgccctta aacggcgtgg gtacagcaat
cagccaggca tccgcttcaa ctggcgtcgt 840gctcgctcgt aaaaaaccgc cttctacggc
agtttttact acactcgcca aatcaggttc 900gacgatatgg atttcgccac gattgatggt
atcaaccgca tgttggttga tatcgacacc 960aattacctgt ttttgccgtg aggcaaacgc
tgctgccgtt ggcagcccga tataacccag 1020tccgataaca gaaatggtcg caaaactcat
1050311050DNAArtificial Sequencegene
editing homology arm pool_2_b3787_right 31agtgatatcc gattattttt
taacgcttcc agaatgcgag agcatgcctg accatcacca 60tacgggttat gggcgcggct
catagcttga tattcgtttt cgtcttttaa aagacgcgtc 120acttcctcga caattcgctg
cttatccgtg cctaccagac gcaccgtacc cgccgtcacc 180gcttccggac gctcagtggt
atcgcgcatc accagcacag gtttccccag cgaaggcgct 240tcttcctgaa tgccgcctga
gtcggtcaaa atcagccagg cgtggttcat cagccagaca 300aacggtaaat actcctgggg
atcgatcaga atgacatttt tcacatgccc cagaatgcga 360ttgaccggtt ctctgacgtt
cgggttgaga tgcaccggat agacaatctg gatgtcctgg 420tgcgtggtgg cgatgtctgc
cagcgcgtgg cagatttctt caaagccacg accgaaactc 480tcacgcctgt gaccggtcac
cagaatcatc tttttatcgg ggtcgataaa cgggtaattt 540gccgccagtt ctgaacgcag
cttgtcgctg ctcatcacct ggtcacgcac ccataacagt 600gcatcaatga ctgtattacc
ggtaatgaag attcggctat ccgcaacgtt ttcacgcagc 660aagttttgcc gggaagtttc
ggttggagag aagtgataca tcgccagatg cccggtcaat 720gtacggttag cctcttccgg
ccacggcgaa tagagatcgc ccgtgcgcag accagcctca 780acgtgaccaa caggaatacg
ctgataaaac gccgccaggc tggttgccag cgtcgtcgtc 840gtatcgccgt gaaccagcac
gacgtctggt ttgaactcgg caagaatagg ttttagccct 900tccagaatcc gacaggttat
ctctgtcagg ccctgtcctg gctgcattat gttgagatcg 960tagtcaggta caatggaaaa
gagtttcagc acctgatcga gcatctcccg atgctgcgca 1020gtgacgcaaa ctttagcctc
aaaaaaagga 1050321050DNAArtificial
Sequencegene editing homology arm pool_2_b1948_left 32aactggcgcg
ataacctggt gcgccaggtg cagcattcac agctggagct ggtcgccaac 60tttgccgata
tctcgctacg cctgtcgcag attttaaaac tgaaccccgg cgacgtcctg 120ccgatagaaa
aacccgatcg catcatcgcc catgttgacg gcgtcccggt gctgaccagt 180cagtatggca
ccctcaacgg tcagtatgcg ttacggatag aacatttgat taacccgatt 240ttaaattctc
tgaacgagga acagcccaaa tgagtgacat gaataatccg gccgatgaca 300acaacggcgc
aatggacgat ctgtgggctg aagcgttgag cgaacaaaaa tcaaccagca 360gcaaaagcgc
tgccgagacg gtgttccagc aatttggcgg tggtgatgtc agcggaacgt 420tgcaggatat
cgacctgatt atggatattc cggtcaagct gaccgtcgag ctgggccgta 480cgcggatgac
catcaaagag ctgttgcgtc tgacgcaagg gtccgtcgtg gcgctggacg 540gtctggcggg
cgaaccactg gatattctga tcaacggtta tttaatcgcc cagggcgaag 600tggtggtcgt
tgccgataaa tatggcgtgc ggatcaccga tatcattact ccgtctgagc 660gaatgcgccg
cctgagccgt tagtgatgaa taaccacgct actgtgcaat cttccgcgcc 720ggtttctgct
gcgccactgc tgcaggtgag cggcgcactc atcgccatta ttgccctgat 780cctcgctgct
gcctggctgg taaaacggtt gggatttgcc cctaaacgca ctggcgttaa 840cggtctgaaa
attagcgcca gtgcttcact gggcgcgcgt gaaagggttg tggtggtcga 900tgtggaagat
gcacggctgg tgctcggcgt taccgcaggt caaatcaatc tgctgcataa 960acttccccct
tctgcaccaa cggaagagat accgcagacc gattttcagt cggtcatgaa 1020aaatttgctt
aagcgtagcg ggagatcctg
1050331050DNAArtificial Sequencegene editing homology arm
pool_2_b1948_right 33atgcgtcgtt tattgtctgt cgcacctgtc cttctctggc
tgattacgcc cctcgccttc 60gcgcaactgc cgggtatcac cagccagccg ctgcctggcg
gtggacaaag ctggtcgctc 120ccggtgcaga cgctggtgtt catcacctcg ttgacgttta
ttccggcaat tttactgatg 180atgaccagtt tcacccgcat catcattgtt tttggtttat
tgcgtaacgc gctgggaaca 240ccctccgcgc cacctaacca ggtattgctg gggctggcac
tgtttttgac cttttttatt 300atgtcaccgg tgatcgacaa aatttatgta gatgcgtacc
agccattcag cgaagagaaa 360atatcaatgc aggaggcgct ggaaaaaggg gcgcagccgc
tgcgtgagtt tatgctgcgt 420cagacccgtg aggcagattt agggttgttt gccagactgg
cgaataccgg cccgttgcag 480ggacctgaag ccgtgccgat gcgcattttg ctcccggcct
acgtgaccag cgagttgaaa 540accgcatttc agataggctt cacgattttc atcccttttt
tgattatcga cctggtgata 600gccagcgtgt tgatggcatt ggggatgatg atggttcccc
cagccaccat tgctctgccc 660tttaaactga tgctgtttgt actggtggat ggctggcaat
tgctggtcgg ttcgctggcg 720cagagctttt acagctagag aggcaaaatg acacctgaat
cggtcatgat gatggggact 780gaagcgatga aagtcgcgct ggcactggct gccccgctat
tgttggtagc gttggtcacg 840ggccttatca tcagtatttt gcaggccgcc acgcagatta
acgaaatgac gctgtcgttt 900attccgaaaa tcatcgccgt atttatcgcc attattattg
ccggaccgtg gatgctcaat 960ctgttgctgg attacgtccg caccttgttc actaacctgc
cgtatatcat cgggtagccg 1020tactatgttg caggtgacaa gcgaacaatg
1050341050DNAArtificial Sequencegene editing
homology arm pool_2_b2790_left 34ccacgggatc aaggccgata tctttggcca
tttccgaacc ggcgatagag agcgtagcgc 60ggtcgccgta gttgaaggat gtgacgataa
acaacatcac cactatccag taacgagcat 120ttgtgcgttt ttccacactg ctcgcagcct
gacttaaaga actcattgtt gcactcctga 180aaattcgcgt tagccacgct cactctggac
tgcgacatcg ccaggaaatc agaggtgacg 240tagggtgttt tttgccgttt ttataggtcg
ttcgccgaat acggcgcgtg tttatatctg 300gcaatagcag tataaaaagc gcgccatagc
ggctcaccgt gcaacaacac aacattaatg 360cgttcaatga ggcccgattt tggcattagc
cctggacggt ggaatccact tcacggaaat 420gaaaacaaga acaagaaagg aagggttaaa
acgaagaaat aaagaagagt atgaaatgga 480tcgcttgact ccaggcaaac gccagtaaaa
atccgcgcta tgaagcagtt tttactggca 540tttgcctgaa aagattcgat tcagcaccgc
taaaacgaca tttaccgctc gctgaacata 600tcaggacaac agcgtgcccc actgttcgac
ccacggattt gattccgttt ccggttccgg 660gttctcactg gcgtcaatca acagcatttc
gccaacccgc tgagcgctct gttcctgcaa 720caaggcatcg aactgtttgc cgccattgca
gaaattcaca taactactgt cgccgagcgc 780aatcacgcca taacgcagat tcggctggaa
gcccagacta tctttgattc cctgaaagag 840tggcacaatg ctatcaggaa ggtcgccctg
cccggtcgtg gacgtaacca ccagaacata 900cttatcctga tagggcagcc agtcgcttaa
ttcaggatct tcaaataccg ttgctttgtg 960gccctgcgcg gtcagaatcg cttccgcttc
ttcggccact aacagtgaat tcccgtacat 1020ggtgccgaca aaaataccaa tttccgccat
1050351050DNAArtificial Sequencegene
editing homology arm pool_2_b2790_right 35accgttttct ccctggatta
ggaacttatc tctccatcct gacccgatgg cgcactgaac 60tcaacccttt cattttcagg
aagcagaccg cgccagccaa attgtgataa cgcctgcatc 120caggtgtcgt ccaggcccgc
gtgaatagtc agcggctcgc cagtaaaagg atgcgtcaat 180gacaactgac tggcatgtag
cattaaccgc tggaggccaa aatgctcagc accgctgcga 240ttctggcgta aatcgccatg
tttgctatcg ccaataatcg gatgacgcaa atgggcaaga 300tgtcggcgaa gctgatgttt
gcgtccggtt ttcggctcca gttccaccag gccgtagcgc 360gtggtcgggt aacgtccggt
cgctaccggc atttctacgg tcgccagacc gcgataatgc 420gtcactgctg gctgcgggcc
tttatcttcg cgggcaaatt tatcagcgat tttgtccagt 480tcttccacca gtggataatc
cagcaccgct tcttccatca accagccgcg cacaatcgca 540tggtaacgtt tctggatttg
gtgctgttca aactgttgtg ccagcagccg tccggcctcg 600ctggataatc ccatcaacaa
cacaccagaa gtgggtcggt ccagacgatg agcagtaaaa 660acatgctggc ctatctggtc
acgcacggtt tgcatgacca ctactttctc gtcgcgatcc 720agccagctgc ggtgaaccag
ccagccggag ggtttattta ccgcaaccag ccattcatcc 780tgatagagta tttccagcat
tagctcgcat catccgcaaa aagagcatcc agtttttcca 840gctcagccag aataagcgcg
cgttgcggat ggtccgtcgc cagcgccatt tcataatagg 900gtgcaacggc aaaagcgccc
ggtaacggct gtttgttatc taacaaatcg tgcattcgcg 960ggatcagcac ccactgcaac
cactccagtg gttccatggt gtccataaag aacggttggg 1020tactattaaa ttgatgcggc
tggggttcat 1050361050DNAArtificial
Sequencegene editing homology arm pool_2_b3197_left 36ttacggcttt
ctgaaaatct tcagcggacc ggcgagtata cctgaagaaa ggacgttaga 60tgcttttagc
tacggcactg ttaattgttg gtttactttt ggtcgtttac agtgccgacc 120gcctggtttt
tgccgcgtct attctttgcc gaacctttgg catcccgccg ctgatcatcg 180gcatgacggt
ggtcagtatt ggtacatcgt taccagaagt catcgtctcg cttgccgcgt 240ctctgcacga
acaacgcgat ttagccgttg gtacagccct cggctcaaac attatcaata 300tattgctgat
cctcggtctg gctgcgctgg ttcgtccttt taccgtccat tctgatgttc 360tacgccgtga
attaccctta atgttgttgg tcagcgtggt ggccggttcc gtactctatg 420acggacaact
tagtcgcagc gatggtatct ttctcctctt tctggctgtg ctatggctgc 480tgttcattgt
taaacttgca cgtcaggctg aacgtcaggg gactgacagc ctgaccagag 540agcagcttgc
agagctgccg cgtgacggcg gattgcccgt cgcgttttta tggctcggca 600ttgcgcttat
catcatgcca gtggccacgc ggatggtggt tgataacgcc acggtgctgg 660cgaattactt
tgccatcagc gagttgacga tgggtctgac ggcaattgct atcggaacca 720gcctgccgga
actggcaacc gcaatagcgg gggttcgcaa aggtgaaaac gacattgctg 780tcggaaatat
cattggcgca aacattttta atattgtcat cgtgttgggt ttacccgcgc 840tgataacgcc
aggagagatt gatccactgg cgtacagtcg tgactacagc gtgatgttgc 900tggtgagcat
tatttttgcg ttgctgtgct ggcggcgctc cccgcaaccg ggccgtggtg 960taggggtatt
attaactggc ggatttatcg tatggctggc gatgttgtac tggttatcgc 1020caatactcgt
tgaataactg gaaacgcatt
1050371043DNAArtificial Sequencegene editing homology arm
pool_2_b3197_right 37atgtcgcacg tagagttaca accgggtttt gactttcagc
aagcaggtaa agaagtcctg 60gcgattgaac gtgaatgcct ggcggagctt gatcaataca
tcaatcagaa tttcacgctt 120gcctgtgaaa agatgttctg gtgtaaaggg aaagttgtcg
tcatggggat gggaaaatcg 180gggcatattg ggcgaaaaat ggcggcaacg tttgccagca
ccggtacacc ttcatttttc 240gtccatcctg gtgaagccgc gcatggtgat ttaggcatgg
ttaccccaca ggatgtggtg 300attgctatct ctaactctgg tgaatccagc gaaatcacgg
ccttaattcc agtgcttaag 360cgtcttcacg taccgttaat ctgcatcacc ggtcgcccgg
agagcagcat ggcgcgcgcc 420gcagatgtgc atctgtgtgt taaagtagcg aaagaagcct
gtccgttagg gctggcaccg 480accagcagca ccaccgccac gctggttatg ggcgatgccc
tcgctgtcgc gctgttaaaa 540gcacgcggct ttactgctga agattttgcg ctctcacacc
caggcggcgc actgggtcgt 600aaacttctgc tgcgcgtaaa cgatattatg catacgggcg
atgagatccc gcatgttaag 660aaaacggcca gtctgcgtga cgcgttgctg gaagttaccc
gcaaaaatct tggtatgact 720gtcatttgcg atgacaatat gatgattgaa ggcatcttta
ccgacggtga tttacgccgt 780gtcttcgata tgggcgtgga tgttcgtcag ttaagtattg
ccgatgtgat gacgccgggg 840ggaatacgtg tgcgccctgg cattctggcc gttgaggcac
tgaacttaat gcagtcccgc 900catatcacct ccgtgatggt tgccgatggc gaccatttac
tcggtgtgtt acatatgcat 960gatttactgc gtgcaggcgt agtgtaaaga ttcaaggata
aacaacaatg agcaaagcag 1020gtgcgtcgct tgcgacctgt tac
1043381047DNAArtificial Sequencegene editing
homology arm pool_2_b3791_left 38gcgctcgctc tctttggtgg tgtagcgatc
ttcaccgtgg aactcaccaa agtgttcccc 60cgcagggcaa ccgtgcagcg gaatgtaatg
aaacaccgcc atgatttccg cttctttcag 120aaagttaatc aacgcgctcc ggtcatcaat
atcccgcagt ttaatgtaga acatatgcgc 180gttctgcacg cagccatcgg gaatcgacgg
cagctcgata cgcccggctt tcgccagagg 240cgctaacgca tcgtagtagt tttgccacag
cgccagacgt tgctggttga tacgatccgc 300tgcttccagt tgcgcccaca ggtatgcagc
ttgcagatcg gacatcaaat agctggagcc 360aatatcgcgc caggtatatt tatcgacctg
accacggaag aactggctgc ggttagtgcc 420cttttcacgg atgatctcgg ctcgttcgat
taacgcttta tcgttaatca gcgtcgcgcc 480gccttcaccg cccgccgtgt agtttttggt
ttcatggaag ctaaagcagc caatatgacc 540aatggttccc agtgcacgcc ctttgtaagt
ggacatcacg ccctgagcgg catcttctac 600cacaaacaaa ttatgctttt tcgccaacgc
cataatggtg tccatttcgc aggccacacc 660cgcgtaatgg accggcacga taacgcgcgt
tttgtcggtg atcgccgctt caatcagcgt 720ttcgtcgatg ttcatggtgt ccgggcgaac
atccacaaaa acgatttttg cgccacgcag 780cacaaaggca ttggcggtgg agacaaaggt
gtagctcggc atgatcactt catcgccagg 840ctggatatcg agcagcagcg ccgccatctc
cagcgaagcg gtgcaggacg gcgtcagtaa 900cactttggcg ctgccaaaac gttgctccag
ccactgctgg cagcgacggg taaaaccgcc 960atcgccacac agtttgccgc tacccattgc
cgactgcata tagtcgagtt cggttcccac 1020caccggcggt gcgttaaatg gaatcat
1047391048DNAArtificial Sequencegene
editing homology arm pool_2_b3791_right 39gtgatcacct gtataaccag
tacgcggtgc tttctacatt cgcaccactt tgtatgtatc 60gtttaagcgc ggcggtgttg
cccatttggg tcgccacccg caaagttgtt ttaccgcgag 120catacgccca gtttagcgcc
gtttgcatca gctcagcacc tgcaccgcgt ccagccagca 180ggccaattcg cgcatctgtc
gcattgagtt cccgtaaaga gacatagccg cgaatatcgc 240cggacgccgc acgtaaaatc
agacattgat gatcaaaggt gccgcgcacg gcattttcaa 300tccactgtgc ataaaagcga
ctgctggcgt caggcgcata ccacggcgca cgaaaacggc 360tttgcgcaaa tgcggcgctg
gctaactgac gtaatgcggg aatatcggtc tcttgtgcca 420ctacagcacc gctatcactg
gcattgttca cgggtagcgc caaatcaact tcaccttcta 480ccagggagaa tcccagctgt
tgcagggcat ccagttcacc cgtatttgat gccgcaattt 540tggcctgcac ccgtgaccac
ggcgctaacg cgtctggcgt caggagcggt gcttcagacg 600taatgcgcac gatggcgctg
ttaacaccaa agaaggcgtt ttcccaggtt agtggctcaa 660tactggcgcg gacgggcacg
aagtaactcc agcagatatt ggccgtagcc agttttcgct 720aatgaactgg cagcacgctt
cacaccctcg tcatcgagcc agccgttacg ccaggcaatc 780tcttccaggc aggcaatctt
aaagccctgg cgtttttcca ccgtctgtac aaaggtgctg 840gcttcaatca ggctgtcgtg
agtgccggta tccagccagg caaatccgcg cccgagcagt 900tcaacggtca ggttgcccgc
ctcgaggtac atctggttga tggaggtaat ctccagttca 960ccacgctccg acggcttcac
ctgctttgcg tactccacga ctttactgtc gtagaaataa 1020agcccggtca ccgcccagtt
tgacttcg 1048401050DNAArtificial
Sequencegene editing homology arm pool_2_b4260_left 40gttcagcact
tcaacggttt gaccggacat ggtggttaac acatcgcccg gacgataggc 60tcgtccgcca
ggcatgtttt cgcagcctgc caacacgccg ataacgttaa tcggcagttg 120tagctccgcg
accatccgca tcacgccgta aaccgctgcc gcaccgcaca tatcgtactt 180catctcatcc
atgccttctg aaggcttgat cgagataccg ccggagtcga aggttaaacc 240tttacccacc
agcacgattg ggcgtgcatc ttccgacgcg ttgcctttgt actcaatcac 300cgacatcagc
gattcgtttt gcgaaccctg accgaccgcc agataggaat gcatccccag 360ctctttcatc
tgctgttcgc cgataacgcg ggtgatgaca ttcttgctgt agctgtcagc 420cagctggcgc
gcttgtgaag cgaggtaagc ggcgttacag atattcggcg gcatattgcc 480gagatctttt
gctgctttaa tcccggcggc aatcgccaga ccgtgctgga tcgcgcgctc 540accgctggtc
agttcacggc gggtcggcac gttgaacacc atcttacgca gcggacgacg 600cggttcgctc
ttgttcgttt tcagctgatc gaaactgtag agcgtctctt ttgccgtctc 660gacagcctga
cgcactttcc agtagttgtt acggccttta acgtgcagct cagtcagaaa 720gcagaccgct
tccattgagc cagtatcatt cagcgtatta atggttttct gaataacctg 780cttgtactga
cgctcatcca gctcacgttc tttgccgcaa ccaataagga gaattcgctc 840ggaaagtaca
ttcggaacat ggtgcagcaa caatgtctgc cccggttttc cttccagttc 900gccccgacgt
agcagggcgc tgatgtaccc atcgctgatt ttatcgagct gttctgcaat 960cggagaaagg
cgacgtggtt cgaagacgcc cacgacgatg caggcactcc gctgtttctc 1020cgggctaccg
ctttttacac taaactccat
1050411049DNAArtificial Sequencegene editing homology arm
pool_2_b4260_right 41tttacgggcg tatttaaagt gataatcata agatatctgg
tgcgggagac gctcaaaagc 60cagctggcga tactcttcat cttgcttttg atcttcttct
gtcaaaagtt agtgaggatc 120ctcggcgcag cggttgacgg cgatattccg gcgaatctgg
tgctctccct tctcgggttg 180ggcgtgccgg aaatggcgca gcttatcctg ccattaagcc
tgttcctcgg gctgctgatg 240acgctgggca aactgtatac cgaaagtgaa attacggtaa
tgcatgcctg cggcctgagc 300aaagcggttc tggtgaaagc ggcaatgatc cttgcggtat
tcacggcaat cgtagcggcg 360gttaacgtga tgtgggcggg accgtggtca tcgcgtcatc
aggatgaagt gttagcagaa 420gcgaaagcga accctggcat ggcggcgctg gcgcaagggc
aattccagca agcgactaat 480ggcagctcgg tgctgttcat cgaaagcgtt gacggcagcg
atttcaaaga tgtgttcctc 540gcgcaaattc gaccaaaagg taatgcacgt ccttctgtgg
tggtggccga ttccggacat 600ttaacccagc tgcgcgacgg ctcccaggtc gtcactctca
accagggaac gcgcttcgaa 660ggcactgcat tgttacgtga tttccgcatt acggacttcc
aggattatca ggcgatcatt 720ggtcaccagg cggtggcgct cgacccgaac gataccgacc
agatggacat gcgcacattg 780tggaacactg acaccgatcg tgctcgcgca gaactgaact
ggcgtatcac gttggtattc 840accgtgttta tgatggcact tatggtcgta ccgctgagcg
tggttaaccc acgtcaggga 900cgcgtactgt cgatgctgcc agccatgctg ctgtatctac
ttttcttcct gatccagacc 960tccctgaaat cgaacggcgg taaaggtaag ctggacccga
cgctgtggat gtggaccgtt 1020aacctgattt atctggcttt agcgattgt
1049421050DNAArtificial Sequencegene editing
homology arm pool_2_b0071_left 42ggactggaaa taccagcaat gattctggct
ggctactatg tcaaacgtta tggtaagcgg 60cgaatgatgg tcatagcagt ggcggcagga
gtactgtttt acaccggatt gattttcttt 120aatagccgta tggcgttgat gacgctgcaa
ctttttaacg ctgtatttat cggcattgtt 180gcgggtattg ggatgctatg gtttcaggat
ttaatgcctg gaagagcggg ggcagctacc 240accttattta ctaacagtat ttctaccggg
gtaattctgg ctggcgttat tcagggagca 300attgcacaaa gttgggggca ctttgctgtc
tactgggtaa ttgcggttat ttctgttgtc 360gcattatttt taaccgcaaa ggttaaagac
gtttgatgac gtggacgata gcggaaagcc 420cggtcatttg accgggcaag gggattaatt
cataaacgca ggttgttttg cttcataagc 480ggcaatggcg tcgtcgtgct gcaaggtaag
cccaatactg tccagaccgt tcatcatgca 540gtggcggcgg aaggcatcga tggtaaagcg
ataggttttc tctcccgctt tcacctcttg 600cgcttccaga tccacgtcga aatggatccc
cggattagct ttcaccagcg caaacagttc 660gtccacttct gcatcgctta atttcaccgg
cagcagctgg ttgttaaagc tattgccgta 720gaagatgtca gcaaaactcg gcgcaatcac
cactttaaaa ccgtagtcgg tcaatgccca 780gggcgcgtgc tcacgcgaag agccacagcc
gaagttttct cgtgccagca aaatggaagc 840gccctgatac tgcgggaagt tcagcacgaa
gtccgggttt ggctgttggc ctttttcatc 900cagaaaacgc cagtcgttaa acagatgcgc
gccaaaaccc gtacgggtca ctttctgcaa 960aaactgtttc gggatgattg catcggtatc
gacattggcg gcatccagcg gaaccaccag 1020gcctgtgtgt ttgataaatt tctctgccat
1050431050DNAArtificial Sequencegene
editing homology arm pool_2_b0071_right 43ggtgtgctcc ttatttaatg
ttgcgaatgt cggcgaaatg tccggtcaca gcagcagcgg 60cagccattgc cgggctgacc
agatgcgtgc gcccgccgcg cccctggcgg ccttcaaagt 120tacggttgct ggtggaggca
caacgttcgc ccggattcag acggtcgttg ttcatcgcca 180gacacattga gcagccaggc
aagcgccatt caaaaccggc ttcaataaag attttatcca 240gaccttccgc ttccgcctgg
gcttttaccg ggccagagcc gggaaccacc agtgcctgca 300cgcctggcgc gacttttcgc
cctttggcga tctccgctgc cgcgcgtaaa tcttcaatgc 360gcgagttggt acaggaaccg
ataaacactt tgtcgatagc cacttcggtc agcggaatac 420ccggtttcag ccccatatag
gccagcgctt tttctgccga cgcgcgttca accggatcgg 480caaacgaagc cggatcggga
atattgtcgt tcacggaaat cacctggccg ggattggtgc 540cccaggtgac ctgcggtgaa
atttcttctg cttgcagagt gacaacggta tcgaaagttg 600cgccttcgtc ggtttgcagg
gttttccagt aggcaacggc gtcgtcgaaa tctttgcctt 660tcggcgcatg cagacggcct
ttgacatagt taaaggtggt ttcgtccggt gcaaccagac 720cggcttttgc gcccatttcg
attgccatat tgcacagggt catacgacct tccatgctta 780aatcacggat tgcttcgccg
caaaactcca ccacatgccc ggtgccgcct gcgctaccgg 840ttttaccgat aattgccagc
acgatatctt ttgcggtaat gcccggcgcg gctttgccct 900ggacttcaat tttcatggtt
tttgcgcggc cctgtttcag ggtttgcgtt gccagtacgt 960gttcaacttc ggaagtgccg
ataccaaagg ccagtgcgcc aaacgcgccg tgggtggcgg 1020tatgcgagtc gccgcagaca
atggtcatcc 1050441050DNAArtificial
Sequencegene editing homology arm pool_2_b1687_left 44aatgttcagc
ccgagcatct cttggtcagg cacatcggta atcaactcgc tgacggaatg 60ccagacaata
tcttcccgcg ccagattcag cacttttgag tccaccgtct ctaccgaaag 120cgcccgcgcc
tcaaccataa acggcgcgtt acgcagcgca gagtcaaaag agtcatattt 180gacgttcacc
agacggcgca ctttaggcaa gcgtgtaata tccagccgcg cttcggtaat 240aaaggccagc
gtcccttctg aacccgtcag aatgcgcgtc aggtcgaact cggtcatctc 300atcgttaaag
acatgacgca gatcgtaacc ggtaagaaag cggttaagtt tggggaagtt 360gtcgataatt
aactggcgtt gctgacggca acgttgataa accgtgttat aaattcgccc 420gattgtggta
ttggatttac ccagcgtttc cgccaattcg acgggtaaag gttgcgtatc 480gagaatatcg
ccccccaaca acaccgcgcg tacgccaagt acgtgatctg acgttttgcc 540atagaccagc
gatccctgac cggatgcatc ggtattgatc atcccaccga gcgttgcccg 600gttgctggtc
gaaagttccg gcgcaaaaaa gtagccgaac ggtttcaggt actgattgag 660ttgatctttt
atcaccccgg cctcaacgcg cacccagccc tcttcagggt taatttcgat 720gatgcggttc
atatggcggg acatatcaac aataatcccc tggttgagcg cctgaccgtt 780agtgccggtg
ccgccgccgc ggggggtaaa gatcagcgat gaatagcgtt cctgcgcggc 840aagacgggcg
atcagcgcca catctgcggt tgaacgcgga aataccaccg catcggggag 900aagttggtaa
atactgttgt cggtcgacat tgtcagacga tcggcataac ttgtcgccgt 960atcgccggta
aaaccttgtt gctccagctc ttgcaaaaaa ttaagcacca gttgaacgac 1020gccgggtgcc
tgggaaatct gtggaatcat
105045979DNAArtificial Sequencegene editing homology arm
pool_2_b1687_right 45tatattgacc ctttcctgcg gtctgtgatg taggtcgata
cactattctt tcaggctgct 60gcaatagcgc actgaaaggt gatgtttgtt tactctatgg
atttcgtgtt gcaggaaggc 120ggcaagcgag tgaactccag gagcttacaa tagtaagtga
ctggggtgaa cgaacgtagc 180cgcagcacat gcaacttgaa atacgacgag taaatcgttt
gcgtgttgcc tgagttgttg 240taccacattt ttttctaaca cgcccatcag aattaagggc
agaatcggcc tgttaaaaac 300cgctgaaatt gctcatcatt atgcaggtga gtttcgcgtg
ttcacgtcgc gtcgacgatt 360tgacgcacaa aaaaggtgaa aagtagttat ggtaaatgtt
cgtcagccca gggatgtcgc 420acaaattctg ctttcggtgc tgtttttagc catcatgatt
gtggcatgtc tgtggattgt 480tcaacccttt attctcggct ttgcatgggc cggtacggtg
gttatcgcca cctggccggt 540attgttacgt ttgcaaaaga tcatgtttgg ccgccgctct
ctcgccgttc tggtgatgac 600gctgttatta gtgatggtgt ttatcatccc tattgctttg
ctggttaaca gtatcgtcga 660cggcagcggc ccgctaatta aagccatttc cagcggtgac
atgacgttac ccgatctggc 720gtggcttaat accattccgg tgattggcgc gaagctgtat
gcaggctggc acaacttgct 780ggatatgggg gggacggcga tcatggcgaa agtccgccct
tatattggca ccaccaccac 840ctggttcgtt gggcaggcgg cgcatatcgg gcgctttatg
gtgcattgtg cgctgatgct 900tctcttcagt gccctgctgt actggcgcgg tgaacaggtg
gcacaaggca ttcgccattt 960tgcaacccgt ctggcaggc
979461050DNAArtificial Sequencegene editing
homology arm pool_2_b1006_left 46ggccggaatg gtatgaatca gcgcaccaaa
tttcggtgaa aagcccaaca gcatggcgat 60gacggcagca gcaacaaaca ccagcgtcga
gtagactttg gtcacggcca tcacaccgat 120attttcagca taggtggtca cgccgctacc
gccgacagag ccggaaagca tcgttgccag 180accatcgcct acgaatgccc gccccatata
cgggtccata ttgcgtccgg tcatcccggc 240gactgccttg agatgaccta agttttccgc
caccagaatc accgccacgg gcgcaatcag 300catcattgcc tgaccattaa aagcaggagt
ggaaaaatgt ggcagaccga accaggcagc 360atggctgacg agagtaaaat cgacggcttt
tcccagccct aaaacgttgg tcatcacgcc 420atacagcaga caggcgacaa ttaatcctac
gagaatcaat aaccgctgga tcatgccacg 480ggtaaacacc gccaccagcc caatacacag
caccgtcatt accgccatcc agctatcaaa 540ggccgaagcc gatacacttt tcactgcgat
aggcgctaag ttcaggccaa tcgccatcac 600caccgcaccc gtcaccaccg gcggcatcag
tcgttcaatc cagcgcgtac cgattttcat 660caccaccagg ccaatgacgg tataaaccag
cccacaggcg ataatcccgc ccagcgcaat 720gctgatattc gggttaatgc cctgaccgtt
aaagcccgtc gcggcgatca ccacgccgac 780aaaagccgcg ctggagccga gataactggg
gacgcgcccg ccggtaataa agaaaaacag 840taacgtgccg atccccgaca ttaaaatgga
aagattggga tccagcccca tcagaatcgg 900cattaacacc gtcgcgccaa acatcgccac
cgcgtgttga acgcccatta ctgccgtctg 960agcaaacggc aatcgttcat ccggcgcgac
cacgccgctc tctgtagagg tcgattttaa 1020ctgccagtga ggaaaaccga acattgccat
1050471044DNAArtificial Sequencegene
editing homology arm pool_2_b1006_right 47cagctgtctc cttaaggagg
ttaacaagca gggcgcatca gcgcgtgata actgcgatcg 60aaccacacca gcccgtaggg
tgtggtgtga cgatgaatcg cttcgatggc gcaaaacaga 120atgtcgtggg tgccgacgct
caccacctgg ctgatacggc agtcaaacga aaccagagcc 180tcttccagtt gcgggcatcc
ggtcaccccc gtctgccagc gggcggcggc aaagcggtgt 240tccatgggcg ttttgccgcc
aaaaaggttt gaaagcggct cctgcccggc gctaagtgta 300tttacacaca gcgttcgatt
ttcattgaat gccggccaga cggacgcccc acgattcagg 360cacaccagta atgtgggcgg
cgtatcggtc acactgcaga cggcgctggc ggtgaacccg 420gcgcgcccgg ctggaccgtc
cgtggtgata atattgaccg ccgcgcccat gcaggacatc 480gcatcgcgaa aagtttgttg
atcgacaatg ttcatagttt gctccttaca acagcccgca 540ggcttcttca aaggacagac
gtggcaggcg cgcataaagc ttgctgctat cgccatagcc 600gatattaatc agcagattgc
tcttcagcgt gctgcccgta aaaaaggcgt cgtccacgtg 660ttgacggtca aagcccgaca
tcgggccggt atccagtccc agcgcccggc aggcgacgat 720cagataggcc gcctgcatgg
aactgttgcg aaacgctgtt tcttcggcaa gttgtgggct 780ggaggtaaac caactgcggg
catcaccgtg gggaaacagt agtggtaacc gttcataaaa 840ttcactgtcc caggcgacga
tagcggtgac gggcgcggtc agggtttttt gcagattgcc 900gctggaaagt gccgggcgca
gacgttcttt tccttctgcc gtgcgggtaa acacgatccg 960tgccggagaa cagttagctg
atgtcggccc ccatttcatc agggcataaa tctcccgtaa 1020cgtctcatcg ctgacgggtg
tctc 1044481050DNAArtificial
Sequencegene editing homology arm pool_3_b0335_left 48atgatcaaag
cccatgaaat tcagggctgc atcgcgctgg aaaactcctt taaccgcgtc 60ggcctcgacc
acgttctgtt agtgaaagtg gcttccaccg ccgtggtcgc cgaaatgctc 120ggcctgaccc
gcgaggaaat tctcaacgcc gtttcgctgg cgtgggtgga cggtcagtcg 180ctgcgcacct
atcgccatgc gccgaacacc ggcacgcgta aatcctgggc ggcgggcgat 240gccacttccc
gcgcggtacg tctggcactg atggcgaaaa cgggcgaaat gggttacccg 300tcagccctga
ctgcgccggt gtggggcttc tacgacgtct cctttaaagg tgaatcgttc 360cgcttccagc
gcccgtacgg ttcctacgtt atggaaaatg tgctgttcaa aatctccttc 420ccggcggagt
tccactccca gacggcagtt gaagcagcga tgacgctcta tgaacagatg 480caggcagcag
gcaaaacggc ggcggatatc gaaaaagtga ccattcgcac ccacgaagcc 540tgtattcgca
tcatcgacaa aaaagggccg ctcaataacc cggcagaccg cgatcactgc 600attcagtaca
tggtggcgat cccgctgcta ttcgggcgct taacggcggc agattacgag 660gacaacgttg
cgcaagataa acgcattgac gccctgcgcg agaagatcaa ttgctttgaa 720gatccggcat
ttaccgctga ctaccacgac ccggaaaaac gcgccatcgc caatgccatt 780acccttgagt
tcaccgacgg cacacgattt gaagaagtgg tggtggagta ccccattggt 840catgctcgcc
gccgtcagga tggtattccg aaactggtcg ataaattcaa aatcaatctc 900gcgcgccagt
tcccgactcg ccaacagcag cgcattctgg aggtttctct cgacagagct 960cgcctggaac
agatgccggt caatgagtat ctcgacctgt acgtcattta agtaaacggc 1020ggtaaggcgt
aagttcaaca ggagagcatt
1050491050DNAArtificial Sequencegene editing homology arm
pool_3_b0335_right 49atgtctttta gcgaatttta tcagcgttcg attaacgaac
cggagcagtt ctgggccgag 60caggcccggc gtattgactg gcagacgccc tttacgcaaa
cgctcgatca cagcaatccg 120ccgtttgccc gttggttttg tgaaggccga accaacttgt
gccacaacgc catcgaccgc 180tggctggaga aacagccaga ggcgctggcg ctgattgccg
tctcttcgga aacagaagaa 240gagcgcacct ttacctttcg tcagctgcat gacgaagtga
acgcggtggc ctcaatgttg 300cgttcattgg gtgtgcagcg cggcgatcgg gtgctggtgt
atatgccgat gattgccgaa 360gcgcatatta ctctgctggc ctgcgcgcgc attggcgcta
ttcactcggt ggtgtttggt 420ggatttgcct cgcacagcgt ggcggcgcga attgatgacg
ctaaaccggt gctgattgtc 480tcggctgatg ccggagcgcg cggtggcaaa atcattccct
ataaaaaatt gctcgacgat 540gcgataagtc aggcgcagca ccagccacgc catgttttgc
tggtggatcg cgggctggcg 600aaaatggcgc gcgtcagcgg gcgggatgtc gatttcgcgt
cgttgcgcca tcaacacatc 660ggcgcgcggg taccggtggc gtggctggaa tccaacgaaa
cctcctgcat tctctacact 720tccggcacga ccggcaaacc taaaggcgtg cagcgtgacg
tcggcggata tgcggtggcg 780ctggcgacct cgatggacac catttttggc ggcaaagcgg
gcagcgtgtt cttttgcgca 840tcggatatcg gctgggtggt ggggcattcg tatatcgttt
acgcgccgct gctggcgggg 900atggcgacta tcgtttacga aggattgccg acctggccgg
actgcggcgt gtggtggaca 960atcgtcgaga aatatcaggt tagccggatg ttctcagcgc
cgaccgccat tcgcgtgctg 1020aaaaaattcc ctaccgctga aattcgcaaa
1050501050DNAArtificial Sequencegene editing
homology arm pool_3_b1940_left 50aatgtctgat aacgatccgc gcgtggtggc
gctggtcatt cgccagtgga taaataacga 60tcatgagtaa cctgacaggc accgataaaa
gcgtcatcct gctgatgacc attggcgaag 120accgggcggc agaggtgttc aagcacctct
cccagcgtga agtacaaacc ctgagcgctg 180caatggcgaa cgtcacgcag atctccaaca
agcagctaac cgatgtgctg gcggagtttg 240agcaagaagc tgaacagttt gccgcactga
atatcaacgc caacgattat ctgcgctcgg 300tattggtcaa agctctgggt gaagaacgtg
ccgccagcct gctggaagat attctcgaaa 360ctcgcgatac cgccagcggt attgaaacgc
tcaactttat ggagccacag agcgccgccg 420atctgattcg cgatgagcat ccgcaaatta
tcgccaccat tctggtgcat ctgaagcgcg 480cccaagccgc cgatattctg gcgttgttcg
atgaacgtct gcgccacgac gtgatgttgc 540gtatcgccac ctttggcggc gtgcagccag
ccgcgctggc ggagctgacc gaagtactga 600atggcttgct cgacggtcag aatctcaagc
gcagcaaaat gggcggcgtg agaacggcag 660ccgaaattat caacctgatg aaaactcagc
aggaagaagc cgttattacc gccgtgcgtg 720aattcgacgg cgagctggcg cagaaaatca
tcgacgagat gttcctgttc gagaatctgg 780tggatgtcga cgatcgcagc attcagcgtc
tgttgcagga agtggattcc gaatcgctgt 840tgatcgcgct gaaaggagcc gagcagccac
tgcgcgagaa attcttgcgc aatatgtcgc 900agcgtgccgc cgatattctg cgcgacgatc
tcgccaaccg tggtccggtg cgtctgtcgc 960aggtggaaaa cgaacagaaa gcgattctgc
tgattgtgcg ccgccttgcc gaaactggcg 1020agatggtaat tggcagcggc gaggatacct
1050511050DNAArtificial Sequencegene
editing homology arm pool_3_b1940_right 51atgtctgata atctgccgtg
gaaaacctgg acgccggacg atctcgcgcc accacaggca 60gagtttgtgc ccatagtcga
gccggaagaa accatcattg aagaggctga acccagcctt 120gagcagcaac tggcgcaact
gcaaatgcag gcccatgagc aaggttatca ggcgggtatt 180gccgaaggtc gccagcaagg
tcataagcag ggctatcagg aaggactggc ccaggggctg 240gagcaaggtc tggcagaggc
gaagtctcaa caagcgccaa ttcatgcccg gatgcagcaa 300ctggtcagcg aatttcaaac
tacccttgat gcacttgata gtgtgatagc gtcgcgcctg 360atgcagatgg cgctggaggc
ggcacgtcag gtcatcggtc agacgccaac ggtggataac 420tcggcactga tcaaacagat
ccaacagttg ttgcagcaag aaccgttatt cagcggtaaa 480ccacagctgc gcgtgcaccc
ggatgatctg caacgtgtgg atgatatgct cggcgctacc 540ttaagtttgc atggctggcg
cttgcggggc gatcccaccc tccatcctgg cggctgtaaa 600gtctccgccg atgaaggcga
tctcgacgcc agtgtcgcca ctcgctggca agaactctgc 660cgtctggcag caccaggagt
ggtgtaatga ccacgcgcct gactcgctgg ctaaccacgc 720tggataactt tgaagccaaa
atggcgcagt tgcctgcggt acgtcgctac gggcgattaa 780cccgcgctac cgggctggtg
ctggaagcca ccggattaca attgccgctc ggcgcaacct 840gtgtcattga gcgccagaac
ggcagcgaaa cgcacgaagt agaaagcgaa gtcgttggct 900ttaacggtca acggctgttt
ttaatgccgc tggaggaagt cgaaggtgtc ctgcccggcg 960cgcgtgttta tgccaaaaac
atttcggcag aagggctgca aagcggcaag cagttgccgc 1020tcggtccggc gttattaggt
cgcgttctgg 1050521050DNAArtificial
Sequencegene editing homology arm pool_3_b0109_left 52cagcacaggc
gaggggcaaa aaacgaaacg ggaaagcaga ttccgaggtt ttttatttcg 60ttgcagcgaa
agacaagaaa tttgcgaggc gttacgaaga aagttgggga aggggagatt 120atccgcccgc
gatggagcgg ataaatctgt caactattag cgaaaacgca ttgaaaggtc 180gagtgcttgt
acgtgtttag ttagcgcacc gacggagata aagtccacgc ccgtttcggc 240aaattcacgc
agtgttttgt cagtgacgtt gccagacact tccagtagcg ccttgccgtt 300ggtgcgtttg
acggcttcgc gcatctgttc tgtttcgaag ttatccagca tgatgatatc 360ggctcctgct
ttcagggctt catcaagttc ttccagattc tctacttcga cttctactgg 420cgcatccggg
tgcagccagg acgctttttc gaccgcctgg cgcactgagc cggaggcaat 480aatatggttt
tctttgatca ggaaggcatc agaaagcccc agacggtgat tcgctccgcc 540gccgcaaagt
accgcgtatt tcagagctga acgcaggccg ggtaaggttt tgcgcgtatc 600caacaactgc
gtgttggtgc cttccagcaa ttcgacatag tggcgtacct tactggcaac 660tcctgaaagg
gtttgcacaa aattaagcgc agtgcgttcg cccgttaaca gcacgcggga 720tgggccttca
agttcgaaca aggattgatt ggcattgatg acatcgccgt catccacatg 780ccagattatg
gtgacatcgt cgcctgccag ttgaataaac acctcttcaa cccagcgttt 840gccgcaaaag
acgccattct cgcgggtgat caccgtggca tgagagcgag aattttccgg 900taaaagtttt
gccgtaatat cattgttggc atcgactgtt ccgcctaaat cttcccgcag 960cgcctgggcc
accgcgccgg ggatatcgag attaatgcgt tccagcagct cgtcacgtcg 1020ggtgtcaggg
ttatagcggc gaggcggcat
1050531050DNAArtificial Sequencegene editing homology arm
pool_3_b0109_right 53gttaaaactc cagatagcta acgaatcata aggtagaaac
atgctactct gaaccgggta 60ttagcaccac atataaggag atcctgcatg ttgttagaac
aggggtggct ggttggcgcg 120cgccgcgttc cctcaccaca ttacgattgc cgcccggatg
acgaaacacc caccctgctg 180gtggtgcaca atattagcct gccgccaggc gagtttggcg
gtccgtggat cgacgcatta 240ttcactggaa ctattgatcc gcaggcacat cctttctttg
ctgagatcgc ccatttgcgc 300gtctccgctc actgtttgat tcgccgtgat ggtgaaatag
tccagtatgt tcctttcgat 360aaacgtgcat ggcatgcggg agtctctcag tatcaggggc
gcgaacgctg caatgatttt 420tctattggga ttgagcttga aggcaccgat acgctggcgt
ataccgatgc gcagtatcaa 480cagcttgcgg cggttacgcg ggcactgatt gattgctatc
cggatatcgc taaaaacatg 540acgggccatt gtgatattgc gccggatcgg aaaaccgatc
ccggtcctgc atttgattgg 600gcacggtttc gtgtgctggt cagcaaggag acaacatgac
gctatttaca accttactgg 660tgttaatttt cgagcgcctg tttaagttgg gcgagcactg
gcagcttgat catcgtcttg 720aagcgttctt tcggcgggtg aaacattttt ctctcgggcg
cacgttaggc atgaccatta 780ttgcgatggg cgtgactttt ttactgttac gcgcattgca
gggagtattg ttcaacgttc 840ccacgctact ggtgtggctg ctgattggtt tgctgtgtat
tggcgcaggt aaagttcgtc 900ttcattatca tgcttatctg acagctgctt cacgtaatga
tagccatgcc cgtgccacga 960tggctggcga actcaccatg attcacggcg tcccggcagg
ctgcgacgaa cgtgagtatt 1020tgcgtgagct gcaaaatgca ttgctgtgga
1050541048DNAArtificial Sequencegene editing
homology arm pool_3_b3399_left 54gtcgcggcgc tcttttttgt ccgggcgtcg
gtccgggtgc ggcatggtta aggcattaag 60tttacgtgcc agcgccattt tttcgcgttt
ctctacactt tccgcagtct cttcatacag 120caaggctgcc tcgctggcgg ggcgacgctg
ttcagtaatc gcctttacaa tcaccgtgcg 180ttcgtcattt ccctggcgca gagtgagcgt
ggcattcagc tcgacgattt tgctcggctt 240gctgcgctgc ccgttgtaat gcaccttacc
gccttcaatc atttcacggg ccagcgcgcg 300ggttttataa aaacgggcag cccatagcca
tttatccagt cgaacctcaa cagcaggttt 360ctctttcatg gcgtctcctt cacattagcg
aggggatcag gcggcggtag tcattcagtg 420acggatggcg ttgatactgt ttctcggcaa
tcccggaatc aggattagtc acgccgaggc 480agtaacgaat accaaattgc gcggcagcat
cgagaatcgc ttcgctgtca tcaataaaca 540gcgttctttc agctttcaga cccgtagctt
cggccaccgc atgccataac cgctgatcct 600ctttcggata accaaatgtg tgggtggaaa
gtaataaatc aaggtgtgcg tccagaccgg 660tatgctcaag ttttaccgcc aggttgtgcg
gatgcgcatt ggtgagcaaa attcgctgct 720taccgctggc tttcagtgcc tcaagaaacg
gaatggtatc ttcacgcagt acggcacgcg 780gtcccatctc ggtggtcatc gcacagatat
ccagacccag ttgctcactc cagtaatcaa 840gacagtacca gtttagcgta tgctgtacgt
cgtgatattg ctggcgcata tattccatcg 900cttcctgtgg cgtaaccccg tttttcgcgc
cccatgtttc aggcaccagc ttttgccaga 960aatagttatc gaaggcgagg tcgagcaacg
tgccgtccat atccagcaga acggtatcta 1020cgtcctgcca ggcaatgttg atatgcat
1048551034DNAArtificial Sequencegene
editing homology arm pool_3_b3399_right 55gagggaaatc tccagagtga
agcaatttgc gcgacagggt agcataacct gccgcgcaaa 60cgtgttattc gataaggctt
tctgaagggg tgatcagttg cgggttcagg cagctttcat 120aataggcctg aatctccatc
atgcgagtgc gatgacgctg gtaacgacgc caggcctgta 180cgccattgta aatggcgcta
cccagcatcg tcagtagcag caatgtggtg ctgagatagc 240gccacaggcc ggaacgatcc
gggatcggat gaaggccaat atgctgcgta ccattggcgt 300cagtgaagat ttttgtgacg
ataccctcgg cgttaaacgg cgtatgcatc agcatttgtg 360ccagtttctg gaaagcgttc
cactgttctt gcggcgggta gtcgtaaagt gatgccgaag 420gccagggctg atcaacaaaa
tcgctgcctt cgtcgctgac aatcaggaat ccgccggggg 480ccggactgtt tagtgattgt
gccgctcgcg ccgtttcatg cgtgataaac ggcgcggtgg 540aggttgccac caggttatcc
agcgattccg cactcaccgg gcgtaataac acattcacgc 600catcgagctt cccggcgttg
gcgcgtttta ccagcgcgtc ccagtcttta ctgttgccga 660gattgaccag cgcatttttc
agtcgcacac agtcatcttt ggcagaacat aaatccgctg 720tcttcagtac aatgtcgcca
aaatcatcaa gcaataccat gccggatttt tgaattgctg 780aacgtaatga ggcactgacg
cgagattcat cttccggttt agggtgcagc tggcgattaa 840ctgcttcagt caatgccgtc
gctttgttga ccagttcaga ttctggtaat ggcaatgagc 900gggcgtcgtt ccagatgatc
tgcgagcagt caaacggtaa aaaaggtgaa ttggttttcg 960cgctccaggt tccggaagtt
cgaatattac acattcccgt accgctaata cgcaatgtat 1020cgcctacccg cacg
1034561031DNAArtificial
Sequencegene editing homology arm pool_3_b2478_left 56gtgccgcttc
caggtaggct tcatcaccac tgacctgacg cttatagcgt gagtcagaac 60tacaggcagc
gagtaataaa acaagcgaaa cacccgcaac ctttgccagg cgcgactttt 120gaacagagta
agccatcaaa tctccctaaa ctttacagca aaccggcatg cttaagcgcc 180gctctgaccg
tctcacgacc actgtcggtg attggtgtca ttggcaggcg cagcgtatcg 240gtcgccacaa
gacccagttc cttacatgcc catttcaccg ggattggatt gggttcgaca 300aatagtttgt
tgtgtaatgg catcagacgc tgattaataa cgcgtgcctc ggcaaaatgc 360ccttctgctg
ccagtttgca catctgggcc atatcacgcg ctgcgacgtt agccgtaacg 420gaaataaccc
catgaccgcc caattgcatg aagtccagcg cgctcgcatc atcgccgctc 480agcagaacaa
aatcatctga aaccagctct ttgatctggt ttacacgcgt taagttccct 540gttgcctctt
tgattccgat aatatttttt actttcgcca gacggcccac cgtttccggg 600agcagatcgc
agccagtacg ggacggcaca ttatacagaa tttgcggcag gtcagtatgc 660tcagcgatgg
ctttgaaatg ctgatacaaa ccttcttgcg acggacgatt gtagtaaggg 720gttaccgtca
ggcagccgac gataccactg tcattgaagc gctgcgtcag gctaatggct 780tccgcagtag
cgttagcgcc ggtcccggca attaccggaa tgcgcccatc agccagatcc 840agcgtcatca
tcaccacatc agcatgttcg tcatgattta aggtagcgga ctcgccagtg 900gtgccaacag
aaacgatcgc cgaagtaccg ctggcgacat gataatcaat cagttttttc 960aagctagccc
gacagacatt acctttttca tccatcggag taacaatcgc gacaatactt 1020cccgtgaaca t
1031571050DNAArtificial Sequencegene editing homology arm
pool_3_b2478_right 57gggccatcct ctgtgcaaac aagtgtctca atggtacgtt
tggtatggca ttaaaagcaa 60gcagacagaa ccgttctgat tgttgtatgc atgttttttt
tatgctttcc ttaagaacaa 120ctcacccctt aaaggaataa ccagtttgac actgtcatcg
caacattatc tggtgatcac 180tgcgttgggt gccgatcgcc ctggaattgt gaacaccatc
acccgtcatg tcagtagttg 240cggctgtaat attgaagaca gtcgcctggc gatgctggga
gaagagttca cgtttattat 300gctgctttcc ggttcatgga atgccattac tctgattgaa
tcaacgttac cgttgaaagg 360tgccgaactg gatcttttaa tcgtgatgaa gcgcacgacg
gcgcgtccgc gtccgccaat 420gccagcatct gtctgggttc aggtcgatgt ggcagactcc
ccgcatttaa ttgaacgctt 480cacagcactt ttcgacgcgc atcatatgaa cattgcggag
ctggtgtcgc gcacgcaacc 540tgctgaaaat gaacgggctg cgcagttgca tattcagata
accgcccaca gccccgcatc 600tgcggacgca gcaaatattg agcaagcgtt caaagcccta
tgtacagaac tcaatgcaca 660aggcagtatt aacgtcgtca attattccca acatgatgaa
caggatggag ttaagtaatg 720aatccactga aagccggtga tatcgcaccg aaatttagct
tgccggatca agacggagaa 780caagttaatt tgaccgactt ccagggacag cgtgttctgg
tttatttcta cccgaaagcc 840atgacccccg gctgtaccgt acaggcctgc ggcttacgcg
ataacatgga tgagttgaaa 900aaagcgggcg ttgatgtgct gggtatcagc accgataaac
ccgaaaaact ctcccgtttt 960gcggaaaaag agctgcttaa ctttacgctc ctgtctgatg
aggaccacca ggtgtgcgaa 1020caattcggcg tctggggtga aaagtccttc
1050581050DNAArtificial Sequencegene editing
homology arm pool_3_b0320_left 58atgatggtgg tattaaacag caggcgattg
tgcagttatt gctggaacac ggtgccagcc 60cgcatctgac cgataaatat ggcaaaacgc
cactggaact ggcgcgggaa cggggctttg 120aagagattgc gcagttactg attgccgcag
gtgcataaac cgggaggctt gctatcaaca 180caccagaaag acggtgtgtg tgggcgctaa
ctgcggatgc ggattttctg gcgcagcggg 240ggcaaggaca ggttgaacag gtctttgcca
gagcggtaaa tatcgcactc ccggctcgcc 300agcagttgct gacgctgctt tgtgaagagt
acgacaatgc gccaaacagt tgtcggttgg 360cactcactca ctttgatgat ctgttccggc
atggtgataa ggttcagttt gacgatcaag 420gtattacggt tggtcaacat cttcatatag
agatgagtcg ttgtcggcgt tggctgtccc 480caaccttgca aatgaccgct gtgaattttc
accttatcgc ctggctacag tggcacgaca 540ttattcatca gcacctgggg gaaaatgaaa
ccctgtttaa ttatcgcggc gataatccgt 600tttatcaggc gttaaataaa gaattacata
ttaaacgacg ggcagttatt caggccgtaa 660acgataaaca aaatatcgcc tcagcggtcg
ccagtatgat ggggttaggg attggcctta 720cgccatcagc cgacgattat ttaacaggtc
tggcgcttat tttatttatt cccgggcatc 780cggcggaaaa atacaaagag gaattttatc
tcggtctgca acgcggcaaa aataatacca 840cattattaag tgccataacg ctggaagccg
cattacaaca acgctgccgg gaaaatattc 900atcgttttat tcacaacatt atttatgaca
tccctgggaa cgcaactcag gcaatagaaa 960aaattaaaca tattggctcc agttccggct
gcgacatgct gtatggcatg gccgatggtt 1020gtgcgctgag ccaaacctac ggagggaatt
1050591046DNAArtificial Sequencegene
editing homology arm pool_3_b0320_right 59atgtcagtta aaatagtcat
taaaccgaat acctattttg attctgtctc gctgatgtct 60atctccacgc gtgcaaataa
actcgacggc gtcgagcagg catttgtggc gatggcgacc 120gaaatgaata aaggcgtgct
gaagaattta ggactgctga cgccggagct ggagcaggcg 180aaaaacggcg acctgatgat
tgtcatcaat ggtaaatcgg gtgcggacaa cgagcagtta 240ctggtggaga ttgaagaact
gttcaacacc aaagcgcaaa gcggctcgca cgaggcgcgt 300tacgccacta ttggcagcgc
caaaaagcat atcccggaaa gtaacctggc ggtgatttcg 360gtcaacggtc tgtttgccgc
tcgcgaagcg cgtcaggcgc tgcaaaacga tctcaacgtg 420atgctgtttt ccgataacgt
ctcagttgaa gatgaactgg cgctcaagca actggcccac 480gaaaaagggc tgctgatgat
ggggccagac tgtggcacgg cgattatcaa cggcgcggcg 540ctctgttttg gtaacgccgt
gcgtcgcggc aacatcggta ttgttggcgc atccggcacc 600ggcagtcagg agttgagcgt
ccgcattcat gaatttggcg gcggcgtttc gcaactgatt 660ggcaccggcg ggcgcgacct
gagcgagaaa atcggcggcc tgatgatgct cgacgccatc 720gggatgctgg aaaacgatcc
gcaaactgaa atcattgcgc ttatctccaa accgcctgcg 780cctgcggtgg cccgcaaagt
gctggaacgt gcgcgcgcct gccgcaagcc ggtggtcgtc 840tgcttcctcg atcgtggcga
aacgccagtg gatgagcagg ggctacagtt tgcccgcggc 900accaaagagg cagcgctaaa
agcggtgatg ctctccggcg tgaaacagga aaatctcgac 960ctgcatacgc ttaaccagcc
gttgattgcg gatgtgcgtg cgcgtctgca accgcagcag 1020aaatacattc gtggcctgtt
ctgcgg 1046601050DNAArtificial
Sequencegene editing homology arm pool_3_b4521_left 60ccggagattg
ctcaattttt aaatcacggc tggcaacgct ggcattaccc attaccgcaa 60caatttctgc
aacctgtgcg ctgtcagttt ttgccatttc gttggcttct gcgcaagtaa 120tataggtttc
tgacggcaaa ccgtttttaa tattgtagtc ctgcgcccag gtcattggtg 180cgaaaacaaa
caggcccgcc agtaaagcaa attttttcat catcattcct tatttcattt 240tacccagaat
tgcaccaccc gtaccgccaa tcacggcacc tttaatcgcc ccttcgaggc 300cattgccggt
cagaacgcca gtgacagcac caacggcggc acccactttt gcacctttac 360gcgcattttt
accgtcgcgg cctttttctg ttactgcacc aacaccagcg ccaacagctg 420cacctttcag
tacgccatta acaccattgc cagtaagtaa accaacgcct gcgcctagca 480atgcaccttt
cgtggtgcgg ttcatatccg ccatcgctgg cgtggagcag aacaatgctg 540agataagccc
gaaggcaagt atttttttct tcaacttaga tgtccggtat taagtaagtt 600gcacacacaa
taatttcgtc ttcaattaag atctgcttaa ctaaagaacg ctcgctatta 660ttcagataat
tcaaaatgag cgtggctgtg atgataggaa ttatgttttt tacgtgaatg 720agaataatct
taaatgagga ataactcatt gattgacaat atttttattc aagaagtgtc 780attgactgtt
aacgcaatgt tgtaaaggta agataatctg atttatcaat attattgtgt 840gatttttatg
tgagcagaag atattcatca gcaacgatta cattagtcat tttattttgc 900cgacggcctc
attgtcgaaa gataagcgta cgacagtatt atcagaaaag agtgattttt 960tatccaacta
cacttcagcg cactgcgtgt aaaaaatgcc tctttcttat gcgggatatc 1020atcatttcat
catgatgtct ttggtgagcg
1050611049DNAArtificial Sequencegene editing homology arm
pool_3_b4521_right 61gtgaacacaa tacacctgcg ctgtctcttc aggatgaatc
ccctggtctg gtgcctgcgg 60gctgatgttg cagcagagct taggtcactt agacgctact
atcatttatc caatggcatg 120gaatcgaaat cagtcgatac ccgcagtata tatcgtgaac
tgggtgcaac gctgagttac 180aacatgcgcc tggggaacgg tatggaaatg aaccctggct
gaaggcggct gtgcgcaaag 240aatttgtcga tgataaccgg gtgaaggtga ataatgacgg
taatttcgtc aatgatttgt 300cgggcagacg tggaatatac caggcagcta ttaaagcctc
attcagcagt acgtttagcg 360gacatctcgg ggtggggtat agccatggtg ccggtgtgga
atccccgtgg aacgcggtgg 420ctggtgtgaa ctggtcgttc tgaccatcaa cgattaaact
gcgcttcggc gcagttttcg 480tttacaggat gttgaaaggg aaaattctgg ggcaaaaaaa
gcccgccagt tacggcggga 540aacctcatcc tatgggagaa caatgaataa tgaaattgcg
gggttatcat ctcccagtat 600atccatacta acaataaggt tatttactca accaggcata
aacattttgt tttgtgcgtg 660ggaacagcct taaggtgtaa agggggaggt ggaaatagca
atgaggagta tcagcaagaa 720tactcgccgc tttaccacaa cgtggatgag agggatgaaa
aactcaaggc agagataact 780ctgccttgaa gataaatgcg cttttacagc gggcttattt
cagctcttct gcttccggta 840aggtcacgtt cagctcaaga atagaaatat cgccatcttt
ttgctcaagc tgtacggtta 900ccatctcagg atcaatttgt acgtatttac aaatgacctc
aagaatatct ttacgcaact 960gcggcagata atgcggttct gcatcgctgc gacggcgttc
agcaacaata atctgcagcc 1020gttcttttgc aatgttggct gtgtttttc
1049621050DNAArtificial Sequencegene editing
homology arm pool_3_b2260_left 62acgtccgaca atggtcagct tgccattatg
catctcaccg cgatcgcgcg tagcgtacca 60gccttcgtca ttaaccagtg aaaccagttg
cccgttacgc cagtaacctt ctgccatact 120ggcagcccgc agccacactt cattattaac
gattttcact tcccgacccg gcagcggcga 180accaacgtct gccaggccgt cggcttcttt
cgcacacacc gtggaggcaa actcggtcag 240accatagccg caaaagcaac gaatcccctg
ctcgcgcgcc tgttccgtca actcgaccgg 300gatagccgcg ccgccaagta acaccgcttt
cagggaaacg gaactacggt taaccagcaa 360acgccagagt tgtgttggca ccagtgaagc
gtgagtacag cctgccagca tttgctccaa 420tggctgttta tcacgtaccg tcatccgcgc
accagcgtat aaccagcgcc acataattcc 480ctgaccggag acgtgaaaca gcggtaaaga
gagcaaccaa tcatcgtgat cgccaaacgg 540aatcagcgat aacacacctt gcgcactggc
aagatgggcc tgataagtat ggacagcggc 600tttcggcaaa ccggtagaac ccgaggtcaa
cgtcattgag cacagacgcg tcggctgcca 660cgtagcggca tgtgcgcctt caaccagctg
aatgtgcagc gacgttaatg ccggaaacgt 720gttttcccca tccggcacca gagcaaattg
cagcgtcaga ttgggcagca attcttcaag 780caacggttgc ggcagctgag ggttcacggg
caacacccgc gccccgcatt gcagtaacgc 840cagccaggcg agcagcgttt gcggcgtatt
ccacgcccgc aacatcacgc cgctgccctc 900aaccaccccc tgcaccgcaa atccggaggc
taattcatcg acgcgagcac aaagctcgcg 960ccagttgagt tgctcgtcat taagacgtaa
ggcgatggtt tctccccgca cttgccgcca 1020gtgacgccac ggccagtcag agaagatcat
1050631050DNAArtificial Sequencegene
editing homology arm pool_3_b2260_right 63aacaaccgct ccagtgcatc
aacttccacg acaggcagcg tgctacccgg ccagcgacgt 60acctgctgcg cctgcatcag
atccagcgtg tccagccctg gaatggtgtc cggcgttaac 120caggcggcaa tccgcgccag
ttgcgttaag cctaagctcg attcaatgga agaactgatc 180accgccgtca gccccagcgc
gtgcgccgcc tgtacctgct cgcgtacttt ttccagactg 240cccgtgagcg tgggtttgat
aactaccgcg cgcacgccct cttcagccac aaaggcaaaa 300tccggctcgc gcaggctttc
atcccaggca atggcaatgc cggtttcacg ggcaaacgct 360cgcgaatcat cgcgggtttt
gcacggctct tcgagaaacg cgatgcggtc gcgataatcc 420gggttaacgt atttggcaaa
ctgctgacct ttcagcggtg tccaggcgcg atttgcgtca 480agacgcaaat gcagatccgg
aattgcctcc agcaacagat tcaccaccat gccgtcgcgc 540accgcttcgt acaatcccac
tttgaccttc gccactttct cgcctggcat atctgcaagt 600ttgaggatca gatcgtccgg
atcgccatta cacagcggtg ccgcacggta gttggctgct 660tgcggcaacg tatctgtcag
ttctgccaat gcacagctta cgccaaaggc cacggaaggc 720atctgcggta gctcgcaatc
gcctgccagc cagttattta cccaggcaag cagcacactt 780tgcgcctctt cccaggtttc
ctgactgaag cccggcagtg gggagatctc cccccaccct 840tcgcgctcgc cttcacgcag
gcaaacatac agcccgtcgc gggtttttaa ccgcctgtcg 900cgcagaacca cccccgcgtc
catggggatc tgccagcggt atacctgcgc gctacgcatt 960acggattccg tttgaatttg
ctgaagtcag gctgacgttt ctggttgaag gcgttgcgac 1020cttcctgacc ttcttccgtc
atgtagaaca 1050641047DNAArtificial
Sequencegene editing homology arm pool_3_b4169_left 64tggaatgggc
cgatgtggtg gtgattggtc ccggtctggg ccagcaagag tgggggaaaa 60aagcactgca
aaaagttgag aattttcgca aaccgatgtt gtgggatgcc gatgcattga 120acctgctggc
aatcaatccc gataagcgtc acaatcgcgt gatcacgccg catcctggcg 180aggccgcacg
gttgttaggc tgttccgtcg ctgaaattga aagtgaccgc ttacattgcg 240ccaaacgtct
ggtacaacgt tatggcggcg tagcggtgct gaaaggtgcc ggaaccgtgg 300tcgccgccca
tcctgacgct ttaggcatta ttgatgccgg aaatgcaggc atggcgagcg 360gcggcatggg
cgatgtgctc tctggtatta ttggcgcatt gcttgggcaa aaactgtcgc 420cgtatgatgc
agcctgtgca ggctgtgtcg cgcacggtgc ggcagctgac gtactggcgg 480cgcgttttgg
aacgcgcggg atgctggcaa ccgatctctt ttccacgcta cagcgtattg 540ttaacccgga
agtgactgat aaaaaccatg atgaatcgag taattccgct ccctgatgag 600caggcaacat
tagacctggg cgagcgggta gcgaaagcct gcgatggcgc aaccgtaatc 660tatctgtatg
gcgatttagg cgcaggtaaa accaccttta gccggggctt tttacaggct 720ctgggtcatc
agggtaatgt caaaagcccc acttatacgc tggtcgaacc ctatacgctc 780gacaacttaa
tggtctatca ctttgatttg taccgccttg ccgatcccga ggagctggag 840tttatgggga
tccgcgatta ttttgccaac gatgccatct gcctggtgga gtggccacaa 900caaggtacag
gtgttcttcc tgacccggat gtcgaaatac acattgatta tcaggcacaa 960ggccgtgagg
cgcgcgtgag tgcggtttcc tctgcgggtg aattgttgct ggcgcgttta 1020gccggttaac
ctttgaaagg tggcggg
1047651050DNAArtificial Sequencegene editing homology arm
pool_3_b4169_right 65atgatgtatc gcatcagaaa ttggttggta gcgacgctgc
tgctgctgtg cacgccggtg 60ggtgccgcga cgctctctga tattcaggtt tctaacggta
atcaacaggc gcggataacg 120ttgagtttta ttggcgatcc tgattatgcg tttagccatc
aaagcaaacg caccgtggcg 180ctcgatatca aacaaacggg cgtgattcag ggactgccgt
tgttgttcag cggcaataat 240ctggtgaagg cgattcgctc tggaacgcct aaagatgcac
aaacgctacg gctggtggtc 300gatcttaccg aaaacggtaa aaccgaagcg gtgaagcggc
agaatggcag caattacact 360gtcgtcttta cgattaacgc cgatgtgccg ccaccgcctc
ctccgccgcc cgtggttgcg 420aaacgcgttg aaacgcctgc ggttgtcgca ccgcgcgtca
gcgaaccggc gcgcaatccg 480tttaaaacgg aaagtaaccg cactacgggt gttatcagca
gtaatacggt aacgcgtccg 540gcagcgcgcg cgacggctaa cactggcgat aaaattatca
tcgctattga tgccggacac 600ggcggtcagg accctggcgc tatcggcccc ggtggtacgc
gggagaaaaa tgtcaccatc 660gccatcgcgc gtaaattgcg tactttgctc aatgacgatc
cgatgtttaa aggcgtttta 720acccgtgacg gggattactt tatctcggtg atggggcgca
gtgatgtggc acgtaagcaa 780aacgccaatt tcctcgtgtc gattcacgct gatgccgcac
cgaaccgcag tgcgactggc 840gcttccgtat gggtgctctc taaccgtcgc gccaacagtg
aaatggccag ctggctggag 900cagcacgaga aacagtcgga gctgctgggt ggggcgggtg
atgtgctggc gaacagtcag 960tctgacccct atttaagcca ggcggtgctg gatttacagt
tcggtcattc ccagcgggta 1020gggtatgatg tagcgaccag tatgatcagt
1050661025DNAArtificial Sequencegene editing
homology arm pool_3_b2405_left 66gagcgaagcg agtcatcctg cacgacccac
caatgtaaaa aagcgcccta aaggcgcttt 60tttgctatct gcgatttgcg aaattgcctg
atgcgcttca cttagcagac tactatttcc 120ggcaattcct gtctcctcac ctactgtgtc
aatgcagcca acagcttaac catcgcgggc 180gtcacctgct gtgtttcata aacaatatat
aaatctgcag ggatgcgctg tttgagcgga 240cggaaaatga cacctggcca gttcatttgt
gcgtagctgt ccgctatcaa tgtgatacca 300atgcccatac tgaccatagc gagtaccgtt
tgcggttcat taacttcgcg aataacaacc 360ggtgaaaatc ccacctgctg gcaaactcgc
tgcaaaaaat cccagtcagt gtaaacgggc 420ggcattgtaa caaaatactc gtcacgtagc
gcttccagcg ggacggtgga aaatgatgag 480agatgatgct cttcaggcat cgccaccaga
aacgccgatt catgcaaccg taagctggta 540aaaccagtcg gtggttctgt cgccattcgc
cagatcccgg catcaagttc gcggcgttcc 600agcaaggcca tttgcatcgc gggcatcttt
tcgcgaaaaa gaacgtcaac gttaggattt 660tccctgagga atcgccgcat aaccgggcgc
atccgtcccc acattgccgt tcccactacg 720ccgagttcaa tccgccctgc ttctccccga
cctatttgtt caatccgagc caatacatta 780ttagcattca ccagcaatcg acgcgattct
tccatcaaga ttttgcccgc gtgtgtcagt 840acgacgctgc gcgaatggcg aataaaaagc
tgcgtgccga gttgattttc cagctcttta 900atatgaatgc tgagcggagg ctgagacata
tttaaacgcg ctgctgcgcg gccaaaatgc 960aactcttccg ctacggcaag aaaataacgg
agcaacttaa gatctgttct gtatacgcgt 1020tccat
1025671050DNAArtificial Sequencegene
editing homology arm pool_3_b2405_right 67aacaaagcac caataccaaa
accaacgccg gaagaaaata aaatatcttt cactaattaa 60cctttatcat aaaagcagct
ctgaagagca gagccgcgaa tccttttaat gagtcaccgc 120tcgatgcttt atcttttcag
ggtcatgatt atatttaaac ccaaagaaaa atatcactgc 180gagaaaaaga gcatatcctg
caaacaccag ccagatagtt tgccagtctt ttacgccatc 240caccgaaaag taatctactg
ccatgccact cagaatcgag ccaacccatg cgccgacacc 300atttaccatg gtcataaaga
gcccctgcgc gctggcacga atgctggaat caacttcctg 360ttcgacaaat accgaaccag
aaatattgaa gaaatcgaat gcacagccat aaacaatcat 420cgacagcagc agcaaaataa
atccggttgt tgacggatcg ccataggcga agaagccaaa 480gcgcagcgtc caggccacca
tactcatcag catgacggtt ttaatgccaa atcgctttaa 540aaagaatggg atagtcagta
taaagcccac ttctgccatc tgtgaaactg acagtaaaat 600ggagggatat ttcaccacaa
aactgtcagc aaactccggg ttacgggcga aatcatgtag 660gaacggatta ccaaaaacgt
tggtaatttg cagtaccgca cccagcatca tggcaaagag 720gaaaaagatg gccatgcgtg
gatttttaaa cagcacgaag gcatccagac ccagcttgct 780ggcaagcgat gtggtcgctt
ttttctccgc aaccggaatc ttcggcaaag tcagcgcata 840agccgacagc agcaatgacg
caccggacgc gatatacagc tgcagactac tcaattccag 900atgcagcagg cttactgccc
acatcgcgac aatgaacccc accgtaccaa aaacgcgaat 960gggcgggaaa gcggtcaccg
ggtcaagccc tgcctgggca agacaggaat aagagacgct 1020gttcgataac gcaatagtcg
gcataaacgc 1050681050DNAArtificial
Sequencegene editing homology arm pool_4_b3493_left 68agtgtcagaa
acgcacagca taatgcggcg catctggcta cgttgatcaa gcgacagctt 60gtcgtagctt
tccacatcgg tggtcaacat acctttcagg cggttgagcg cgttaatggt 120attcgacgga
tggcagtgga actccgcagg ttgcgttgcg ccagcttccg gagccggtac 180taactgatca
gcaccggtag cctgtttgag cagcgcagga tgctgctcaa agtaagcttc 240gacgttgttg
atggcatcac gggtacgggt gatttcgtag ccagtggcat tcatgttcac 300cacgaagcct
gctggcgcga cgccaatcaa taccaacata accagaccaa tgcctttctg 360accatcgttc
gcgccgtgcg aaaacgccac gccgatagcg gaaaggatca gcgcaatacg 420cgtccagaac
ggcggctttt tcttgccgtc tttcttttca cgctccgctg gggtcaggtg 480gatacgggcg
cgtttcttgg tgccgctcca gtagcgacgc agcaagaaaa tcagaccgcc 540agcaaacacc
aggccgacaa taggggaaac gatcagagaa ccgaaaatac ttaatacttt 600cgggatattg
agtgcatcca ccactgacgt cccggtcatc aacgcattgg ttaaaccaat 660cccgatgatc
gcgccaatca gcgtatgaga gctggatgca ggtaaaccaa agtaccaggt 720acccaggttc
cagataatcg ccgccagcaa catagagaac accatggcaa ggccatgaga 780cgatcccata
ttaagcagca gatccgtcgg cagcatatgc acaatggcat aggcaacact 840cagaccaccc
agcaaaacac ccaaaaagtt gaataccgcc gccataacca cggcgagctg 900agaacgcatc
gcgcgggtat agataacggt tgccacggcg ttggctgtgt catggaaacc 960attgatggct
tcgtagaaca gcacaaaagc cagtgcaagc aataataaca gcccggtatg 1020caaatccagg
ccagcaaaca aatgtagcat
1050691050DNAArtificial Sequencegene editing homology arm
pool_4_b3493_right 69ttgccccctg tatggatttc actcaaaaaa taattatctt
atataattca ggcaaatact 60tccttttagt aatattgatg ctggtgcgac cactgaggaa
tctttacaat tcacgcccgt 120tttttctaag aggagcgcaa cgtggaaagg tttgatgcca
ttattatagg cgctggtgcg 180gcgggtatgt tctgttctgc gctggcaggt caggcaggac
gccgggttct gctgatcgat 240aatggtaaaa aaccagggcg caaaatcctt atgtctggcg
gtgggcgctg caactttacc 300aacctttatg tcgaaccagg cgcttatctg agccagaatc
cgcatttttg taagtctgca 360ctcgcacgtt ttacccagtg ggatttcatt gatctggtca
ataaacacgg catcgcctgg 420cacgagaaaa cgttagggca actcttctgc gatgactccg
cgcagcagat tgtcgacatg 480ctggtggatg agtgcgagaa gggcaatgtg accttcagat
tgcgtagcga agtgctgagt 540gtggcgaagg atgaaacagg cttcacgctt gatctgaacg
gcatgactgt cggttgcgaa 600aagctggtca tcgcgactgg tgggctgtca atgccggggc
tgggcgcgtc gccgtttggt 660tataagattg ccgaacaatt tggcctcaac gtgctgccga
cccgcgcggg tctggtgcca 720ttcactctgc ataaaccgtt gctcgaagag ttacaggtgc
tggcgggcgt ggcggtgcct 780tccgtgatta ccgctgaaaa cggcaccgtt ttccgtgaga
acttactctt cacccaccgc 840ggcttgtctg gaccggcggt gttgcagatt tcaagctact
ggcaaccggg ggaatttgtc 900agcatcaatc tgctaccgga tgtggacctc gaaaccttcc
tgaatgagca gcgtaacgca 960catccgaatc aaagcctgaa aaacacactg gcggttcatc
taccgaagcg gttggttgaa 1020cgcttacagc aactcgggca aatcccggat
1050701050DNAArtificial Sequencegene editing
homology arm pool_4_b0479_left 70cataccgata cgtcctggaa gcagctcctg
agcgtagacc agaatggcag agaatgccga 60agcgaggata aatccaataa tcaccgttaa
aacccccgtc cagtgcaggc tggcgtaggg 120taaaatcagc gtaaacggcg caacgccgag
gatagagccc caaatcacat atttccgccc 180aattttatcc cctacaggcc cgccgatcac
cgtacctgcc gcaacggcaa acaggaaggc 240aaacagatga agctgagcat tctggataga
taatccgaat ttttgcatca gataaaaggt 300gtaatagctg ctgatgctcg ccatatagaa
atatttcgag aaaatgagga ttaacagaat 360gctgaccgcc agtacaactt tattgcgcgg
cagtggattg ataatcgtcg ctttgggttt 420tcctttattc attcggtgct gtgccgagta
ccaacggctg atttgcgcca acaccacgat 480cgccagcagt gccgcaagca caaaccaggc
aacgttgcct ttgccataag gcgcgataat 540caccgccgcc agcaagggtc ccagggaact
gccaaagttg ccgccgacct gaaagataga 600ttgcgccagg ccatgccgcc cgccggaagc
catacgggcc acgcgagaag attccggatg 660aaagaccgat gaaccggtac cgaccagcgc
cgccgccagc agaactgcgc caaaactgcc 720cgccagcgca agcagcacca gaccgcttaa
ggtaaagcac atgccaattg gcaacgacca 780tggcatcgga tatttatcgg tccagtagcc
gaccactggt tgcagtagcg aagaggcgag 840ctggaaggtg agggttatca tgccaatctg
cataaatgtc agagaaaatt ctgactgaag 900cagcggataa atcgccagaa tcagcgattg
gatcatgtcg ttcagcagat gtgagaggct 960gatagcacct aaaataccaa acgatgttcg
ggccttggtc gttgacgcag ccgcgcccgc 1020cacaggctgg ggttgttcac tcattgccat
1050711050DNAArtificial Sequencegene
editing homology arm pool_4_b0479_right 71aggaaagtca ctttttcagg
gttgcgatgt aaagaatgat cttatttgtg attattacca 60gactaacata cctgtatgcg
tcgtctgaag gaagtctcaa cgccgaatac agaatttcta 120atctggatgc agatttatct
tcaccggacg cagacttgtc tatgatgtcg cgtcatacta 180tttttcaaca cgttgaaatc
aggtcaggga gagaagtatg aaattattgc agcggggcgt 240ggcgttagcg ctgttaacca
catttacact ggcgagtgaa actgctctgg cgtatgagca 300ggataaaacc tacaaaatta
cagttctgca taccaatgat catcatgggc atttttggcg 360caatgaatat ggcgaatatg
gtctggcggc gcaaaaaacg ctggtggatg gtatccgcaa 420agaggttgcg gctgaaggcg
gtagcgtgct gctactttcc ggtggcgaca ttaacactgg 480cgtgcccgag tctgacttac
aggatgccga acctgatttt cgcggtatga atctggtggg 540ctatgacgcg atggcgatcg
gtaatcatga atttgataat ccgctcaccg tattacgcca 600gcaggaaaag tgggccaagt
tcccgttgct ttccgcgaat atctaccaga aaagtactgg 660cgagcgcctg tttaaaccgt
gggcgctgtt taagcgtcag gatctgaaaa ttgccgttat 720tgggctgaca accgatgaca
cagcaaaaat tggtaacccg gaatacttca ctgatatcga 780atttcgtaag cccgccgatg
aagcgaagct ggtgattcag gagctgcaac agacagaaaa 840gccagacatt attatcgcgg
cgacccatat ggggcattac gataatggtg agcacggctc 900taacgcaccg ggcgatgtgg
agatggcacg cgcgctgcct gccggatcgc tggcgatgat 960cgtcggtggt cactcgcaag
atccggtctg catggcggca gaaaacaaaa aacaggtcga 1020ttacgtgccg ggtacgccat
gcaaaccaga 1050721032DNAArtificial
Sequencegene editing homology arm pool_4_b2470_left 72ccaagcgtcg
tctggaggtg ttgtatcagt gttcgcaggc gctgaacact agccagattg 60atgtgcattg
tttccgccat attttgcaga ttgttcgcga caatgaagcg gctgaatatc 120tggagttaaa
tgtcggtgaa aactggcgga ttagcgaagg gcaaccaaac ccggaattgc 180cgatgcagat
tttaccggtg acaatgcaag agacggttta cggcgaactg cactggcaaa 240atagtcacgt
ttcatcatca gaaccgctgc ttaacagcgt ttcgtcgatg ctgggacgcg 300gtttgtactt
taatcaggcg cagaagcatt ttcagcaatt attgttgatg gaagaacgtg 360cgaccatcgc
ccgcgaattg cacgactcgc tggctcaggt actttcttac ttacgtatcc 420agttgacgtt
actgaagcgt tcgataccgg aagataacgc caccgcacaa agtatcatgg 480ccgatttttc
ccaggcgttg aatgatgctt atcggcagtt acgcgagctg ttgactactt 540tccgcctgac
gctgcagcag gcggatctcc cctccgcatt gagggaaatg ctggatacgt 600tacaaaatca
aaccagcgcc aaactgaccc tcgactgccg tctgccaacc ctggcactgg 660atgcgcaaat
gcaggtgcat ttgttgcaaa ttattcgcga agcggtgctg aatgcgatga 720agcacgccaa
cgccagcgaa atcgccgtca gttgcgtcac cgcgccggac ggcaatcaca 780cggtttatat
ccgtgataac gggattggta tcggtgaacc gaaagaaccc gaaggtcatt 840atggtctgaa
tatcatgcgc gaacgcgcgg aacggctagg tgggacgctg actttttcac 900aaccttccgg
cggcggcacg ttagtgagta ttagctttcg ctctgcggag ggtgaggaaa 960gtcagttaat
gtaatgcctc ctactgacca aagaatactt gcacttaagg ttcagtataa 1020aagggcatga
ta
1032731050DNAArtificial Sequencegene editing homology arm
pool_4_b2470_right 73atggcgaatt tctttattga tcgccccatt tttgcctggg
tgctggcaat cctgttgtgt 60ctgacaggta ccctggcgat tttttcattg cccgttgaac
aataccccga tctcgcgcca 120ccgaatgtgc gagtgaccgc taactatccc ggcgcatcgg
cccagacgct ggaaaacacc 180gtgacccagg ttatcgagca aaatatgacc ggcctcgata
atctcatgta tatgtcatct 240cagagcagtg gcaccggtca ggcatctgtc actttaagtt
ttaaagcagg caccgatccg 300gacgaagccg tgcagcaagt acaaaaccag ctgcaatcag
ccatgcgaaa gttaccgcag 360gcggtgcaaa atcagggcgt gacggtgcgt aaaaccggcg
ataccaacat tctgaccatt 420gccttcgtct ctaccgatgg ttcgatggat aaacaggata
ttgctgatta tgttgccagt 480aatattcagg acccgttaag ccgcgtgaat ggcgtcgggg
atatcgatgc ctatggttcg 540caatattcca tgcgtatctg gctggacccg gcgaaactca
acagtttcca gatgacggct 600aaagatgtca ctgatgccat tgagtcacag aacgcgcaga
ttgcggttgg gcaacttggt 660ggtacacctt ccgtcgataa gcaggcgctc aacgccacca
ttaacgccca gtcactgctg 720caaacaccag aacagttccg cgatatcacc ttgcgggtca
atcaggacgg ctcagaggta 780aggctgggcg atgtcgccac cgtcgaaatg ggggcggaga
aatacgatta tcttagccgc 840ttcaatggta agccagcctc cgggctgggg gtaaaactgg
cctccggcgc taacgaaatg 900gcgacagcgg agctggtgct caatcgtctc gacgagctgg
cgcagtattt cccgcatgga 960ctggaataca aggtggcgta tgaaaccacc tcgtttgtta
aagcctccat tgaagacgtg 1020gtgaaaacgc tgctggaagc tatcgctctg
1050741039DNAArtificial Sequencegene editing
homology arm pool_4_b1451_left 74ccagttcacc acggtgtgtc cagcggctgt
ctattccctg gtaatggcgt tgcagggtaa 60tcacgccgcc cgcatgtgac gggttaagtt
gtggtgccat gggtattgac tggtactggg 120tcgtttctcg ctctccggca tacatcatca
cactcatatc atcccgcgaa ctcaggctac 180gctcatagcg caacccagcc tgagtttgct
tgatggtttt tcgcgtgtcg tactgttctg 240cacgaggcgc ttgttgtgga ttagccttcc
attctgcttt ggttagccca cctgggtcat 300ctgctttgat atccacacta ttgaaaatca
gacttaattt gctggcttca tcaatgcgta 360cgcccagttt ggcattggct aaatttttct
gtgcgccact atggtcacga tagccgtggg 420tcgtaaaacg cgtggttgag acggtgtaat
cgacatcgcc aggctgtgtg ccgtctcccg 480ttgcgcccgt tgctttcagc ccatagcgcc
agctgccaaa actgccgtag taactactgg 540cttcaatggt tggtggctgt tgtccggtct
gggtggtgac attcattacc ccaccagacg 600cgttgccata cagggcagag aaggggccac
gcagcacttc cacattttgc acactgctta 660aatcgatgtt ggatgtttgc ccttgcccgt
cgggcatggt ggcgggaata ccgtccacat 720acaggcgaat accgcgaata ccgtaagtgg
agcgggagcc aaatccgcga atcgacagct 780gtaaatcttg cgcatagttc tgccggtttt
gtacctgcaa accaggcacg ccggtcagtg 840attcggacaa gttaatgcgc ggtgttgcca
ggcgcatctc ctcgccatcc accacgctta 900ctgctgctgg ggtatccagt tctgaaacca
cctgcggtgc ggcactgaca atcatagtct 960gttcatcagc ggcaaaaaca acgggggaaa
ggacaagcag tgcgggcaaa acggtctgtc 1020ggacggaaaa aatcttcat
1039751050DNAArtificial Sequencegene
editing homology arm pool_4_b1451_right 75gaaaaaagcc aggttaagaa
tgggaaaacg ccgtcatggt aatgaaattg taaatttatg 60gaaaatgaaa cggcacaata
cgttaagtaa ttgagaaaat tgtagtcgta acggcaagaa 120atgctccaca tttgagaaaa
taatgattac cattcccatt tataacaaga gcgtaacgat 180gattacgctt agcgaagcat
tgtgaagcag caaaaatatc ggttcatcaa agggagtcgt 240catgcattta cgtcatctgt
tttcatcgcg cctgcgtggt tcattactgt taggttcatt 300gcttgttgtt tcatcattca
gtacgcaggc cgcagaagaa atgctgcgta aagcggtagg 360taaaggtgcc tacgaaatgg
cttatagcca gcaagaaaac gcgctgtggc tcgccacttc 420gcaaagccgc aaactggata
aaggtggcgt ggtttatcgt cttgatccgg tcactctgga 480agtgacgcag gcgatccata
acgatctcaa gccgtttggt gccaccatca ataacacgac 540tcagacgttg tggtttggta
acaccgtaaa cagcgcggtc acggcgatag atgccaaaac 600gggcgaagtg aaaggccgtc
tggtgctgga tgatcgtaag cgcacggaag aggtgcgccc 660gctgcaaccg cgtgagctgg
tagctgacga tgccacgaac accgtttaca tcagtggtat 720tggtaaagag agcgtgattt
gggtcgttga tggcgggaat atcaaactga aaaccgccat 780ccagaacacc ggtaaaatga
gtaccggtct ggcgctggat agcgaaggca aacgtcttta 840caccactaac gctgacggcg
aattgattac catcgacacc gccgacaata aaatcctcag 900ccgtaaaaag ctgctggatg
acggcaaaga gcacttcttt atcaacatta gccttgatac 960cgccaggcag cgtgcattta
tcaccgattc taaagccgca gaagtgttag tggtcgatac 1020ccgtaatggc aatattctgg
cgaaggttgc 1050761050DNAArtificial
Sequencegene editing homology arm pool_4_b1981_left 76ggtgtcagga
gttattgcga tataagcatc ttttatgatt gctgctgaac gtttaatcga 60gggtggtaag
gataaacggt agacattatt ataacaatcc actaatgccc tggctttatc 120ttcacctttg
ggtccatgaa cgatcactat tggtatatct gtttcacttt aaatttttgc 180tattagattt
tctgcaatcg ataatgaaaa tgtacgttcc tgcgagctac cttctaaatt 240gaacgcaatg
taagatccta acgatcgcat ttcctcgcgc acctcatcga gtacatcctc 300acttagtggc
aattcatata ttggcctgac tgctggaaaa cccgcctcac gcatcataaa 360tgcccatgtc
ataggtacgg gagcccggag tttctgatcc atactggacg cgttcttgca 420caaaggggag
aagcaattca tggttatacc aacaacctga aaattcgttt ttgctttcaa 480ctgactgata
aataacatcg ttttcaggtt ctttttacgc atcccctcaa tgcaaagatc 540cggcgtaccg
tattgctgtg ttatgttctt tgctaaatct tttatttctt ttaatgttgc 600gtgatcctgc
atagtcattg tgactaatgt taatttagtc tgttcaagtt taagcgcatt 660aaagacttct
aaattaattg tcgacgttac aattaaaaga tgcttaattt tatgcaattc 720aagcgcccga
ataacaggaa agatggccat agcatcgcca atctgatcgg gaatatggat 780gacaacaaag
tctgtttttt caatattgaa attataagct ttataatcgt agtaactaaa 840tgcaatacgt
ctcaacaatg atgctaaaaa catacctaac ctcgcctccc tactggttat 900aatgcaatgc
agtctatcag actcatcagg gtgccatttt gtgcatatgc ggacttttat 960gtttcatatc
tctaacctgt gggtcctctg cttaatcctt aaacaacacc agcaactcct 1020gcgctttcat
cttccatcga atttttcatg
1050771036DNAArtificial Sequencegene editing homology arm
pool_4_b1981_right 77atggactcca cgctcatctc cactcgtccc gatgaaggga
cgctttcgtt aagtcgcgcc 60cgacgagctg cgttaggcag cttcgctggt gccgtcgtcg
actggtatga ttttttactc 120tatggcatca ccgccgcact ggtgtttaat cgcgagtttt
tcccgcaagt aagcccggcg 180atgggaacgc tcgccgcatt tgctaccttt ggcgtcggat
ttcttttccg tccgctcggc 240ggtgtcattt tcggtcactt tggcgaccga ctgggacgta
agcgcatgtt aatgctgacc 300gtctggatga tgggcatcgc gacagccttg attggtattc
ttccttcatt ctcgaccatt 360gggtggtggg cacctatttt gctggtgaca ctgcgtgcca
ttcagggatt tgcagtcggc 420ggcgaatggg gaggcgcggc gttgctttcc gttgaaagtg
caccgaaaaa taaaaaagcc 480ttttacagta gcggtgtaca agttggctac ggtgtaggtt
tactgctttc aaccggactg 540gtttcattga tcagtatgat gacgactgac gaacagtttt
taagctgggg ctggcgcatt 600cctttcctgt ttagcatcgt actggtactg ggagcattgt
gggtgcgcaa tggcatggag 660gagtccgcgg aatttgaaca acagcaacat tatcaagctg
ccgcgaaaaa acgcatcccg 720gttatcgaag cgctgttacg acatcccggt gctttcctga
agattattgc gctacgactg 780tgcgaattgc tgacgatgta catcgttact gcctttgcac
ttaattattc aacccagaat 840atggggctac cgcgcgaact tttccttaat attggtttgc
tggtaggtgg attaagctgc 900ctgacaattc cctgttttgc ctggcttgcc gatcgttttg
gtcgccgtag ggtttatatc 960acaggtacgt taatcggaac gttgagcgca tttcctttct
ttatggcgct tgaagcacaa 1020tctattttct ggatag
1036781050DNAArtificial Sequencegene editing
homology arm pool_4_b0237_left 78cacaccgacg ttcagggagg tttcaaccac
acctttggct acatcggagt tacgaatcac 60accgttcggg gtggcgttca gcagacgaat
aaaggtatcg cgagatttcg caatcagggc 120agctttatcg ttcgctacag agtccagcaa
caaggccaga tttttctctt tttctgccag 180ctcgtttttc aggatctcct gataggtatt
caccagagat ttcaggacgt cgactttatc 240agctgcgaca gcaatggtcg caaaggcttc
acgcgggatg gcgttacgca gtgtgccgcc 300gttgaaatcg ataaggcgca gatccagttc
ttccgcatga cccgccagga agcgcaccag 360cagtttgttg gcattaccca gcccaacgtg
gatttccccg ccggagtgac cgcctttcag 420accttttaag gttaacttga aggtttcaaa
accagctgga accgcttcac gatctaaatg 480caggttggag gtgaagtcga taccccccgc
acaacccatg tagatttcac cttcttcttc 540ggagtcggtg ttaatcagaa tatcagcctg
caaccagttg ccctgtaagc cgaacgcacc 600gtccataccg gcttcttcgg tcatggtcag
cagcacttcc agcgggccgt gaaccacgtt 660ttcgtcagcc agaaccgcca gcgcagaggc
cataccaatg ccgttatccg cacccagcgt 720ggtgccgcgc gctttaaccc attcgccatc
aatataaggc tggataggat ctttcgtgaa 780gtcatgcacg gtgtcgttat ttttctgcgg
caccatatcg aggtgggcct gtaagacgac 840cggtttacga ttttccatac ctgcggtagc
aggtttacga atcaggatat tacctacctg 900atcgcgttcg acatggaaac ctttctcttt
tgcccaacca acaatgtatt cagcgagttg 960ctcttcatga taggacgggt gaggaataga
acagattttg gcaaaaatat cccacagcgg 1020ctgtggagat aattgagaca gttcagacac
1050791039DNAArtificial Sequencegene
editing homology arm pool_4_b0237_right 79catgtgaaat actggttttt
agtgcgccag atctctataa tctcgcgcaa cctattttcc 60cctcgaacac tttttaagcc
gtagataaac aggctgggac acttcacatg agcgaaaaat 120acatcgtcac ctgggacatg
ttgcagatcc atgcacgtaa actcgcaagc cgactgatgc 180cttctgaaca atggaaaggc
attattgccg taagccgtgg cggtctggta ccgggtgcgt 240tactggcgcg tgaactgggt
attcgtcatg tcgataccgt ttgtatttcc agctacgatc 300acgacaacca gcgcgagctt
aaagtgctga aacgcgcaga aggcgatggc gaaggcttca 360tcgttattga tgacctggtg
gataccggtg gtactgcggt tgcgattcgt gaaatgtatc 420caaaagcgca ctttgtcacc
atcttcgcaa aaccggctgg tcgtccgctg gttgatgact 480atgttgttga tatcccgcaa
gatacctgga ttgaacagcc gtgggatatg ggcgtcgtat 540tcgtcccgcc aatctccggt
cgctaatctt ttcaacgcct ggcactgccg ggcgttgttc 600tttttaactt caggcgggtt
acaatagttt ccagtaagta ttctggaggc tgcatccatg 660acacaggcaa acctgagcga
aaccctgttc aaaccccgct ttaaacatcc tgaaacctcg 720acgctagtcc gccgctttaa
tcacggcgca caaccgcctg tgcagtcggc ccttgatggt 780aaaaccatcc ctcactggta
tcgcatgatt aaccgtctga tgtggatctg gcgcggcatt 840gacccacgcg aaatcctcga
cgtccaggca cgtattgtga tgagcgatgc cgaacgtacc 900gacgatgatt tatacgatac
ggtgattggc taccgtggcg gcaactggat ttatgagtgg 960gccacccagg cgatggtgtg
gcaacaaaaa gcctgtgcgg aagacgatcc gcaactcagt 1020ggtcgtcact ggctgcatg
1039801050DNAArtificial
Sequencegene editing homology arm pool_4_b2497_left 80aataccggaa
gcaccgatga caccataaag cagcagcgaa acgccgccca tcaccggcaa 60tgggatcatc
tggatagcgg cagccagttt accgacgcag gaaagcagga tagcgaaaat 120cgccgccccg
ccgataaccc aggtactgta aacacgggtg atcgccatca cgccaatgtt 180ttctccgtaa
gtagtatttg gcgtagagcc aaagaagccg gaaatcacgg tcgacaagcc 240attagcaaac
atcgaacggt gcagacctgg atcgcgcagc agatcttttt tgacgatatt 300agccgttact
accaggtgcc ctacgtgttc ggcaataacc actaacgccg ctggcagaat 360agtcagaatg
gcaaaccact cgaagcgcgg cgtatagagg gttggcagcg caaaccagtg 420agcattaata
atcggcgtgg tatcgacaat tcccattgcg aaagagagcg cgtaccccac 480cagcacgcca
attaaaatcg ggataattgc caggaaacca cgaaacagca cggaacctaa 540aaccgtgacc
gccagggtgg taatagagat gatgatggtt ttggagtctg gcgtttgccc 600ttcagccggg
agtaaacccg ccataccggc agctacgccc gccagctcca gaccgatgac 660ggcaacgatt
gcgcccattg ccgcaggtgg aaacagcacg tccagccagc cggtccccgc 720tttcttcacg
ataaaagaaa ccaggcagaa cagcacgccg cacataataa agccgcccag 780cgcgacttca
taccctaacg gcaacagtaa caataccggt gaaataaagg caaagctgga 840accaagataa
gccggaattt tccctttaca gatgaagaga tacagcagcg ttccaatacc 900gttaaataac
agtacagtcg ccgggttaat atgaaataag acgggcacca ggacggttgc 960accaaacatg
gcgaacaaat gttgcaaact aagcgggatt gtctgtaaaa gtggcggtct 1020ttcactcacc
ccgatagcac ggcgcgtcat
1050811050DNAArtificial Sequencegene editing homology arm
pool_4_b2497_right 81agtattatcc tctgtattat gtgttatagg cgctttactc
aaaaaaaagc cgactcttaa 60agtcggcttt aattattttt attctttatt tcgtaccaaa
gattttgtca ccggcatcgc 120cgaggcccgg aataatgtat ccgtgctcgt tcagtccctg
atcaatcgat gcggtataca 180gttcgacgtc cgggtgcgct ttttccagcg cagcgatacc
ttctggcgca gctaccagca 240ccagaacttt gatgctgctg cagcccgctt ttttcagcag
gtcgatggtc gcgataacgg 300aaccaccggt tgccagcatt gggtcaacga tcagcgccat
acgctcatcg atgttagaaa 360ccagtttctg gaagtacggt accggctcca gcgtttcttc
attacggtac ataccgacaa 420cgctgatgcg cgcgctcgga acgttttcca gcacaccgtc
catcatacca agacccgcac 480gcagaattgg cacaacggta attttcttac ctttgatctg
gtcgatttct accgggccgt 540tccagccttc gatagttact ttttccgttt cgaggtcggc
ggtcgcttcg taagtcagca 600ggctacccac ttcggaagcg agttcgcgaa agcgcttggt
gctgatatct tgctcacgca 660tcagtcccag cttgtgtttg acgagtgggt gtttgacttc
cacgatcttc atactctttc 720tcctttgagg ggcagccaca aaaaaaatcg acggattata
cctcctttct tcaaggcggc 780aatattcttt tcgttgactt tagtcaaaat gataacggtt
tgagataaag ttattttata 840ttcagatggt tatgaaagaa gattattcca tccgaaaact
aacctttacc ctggcacaag 900tcttctttcg ccgcgcgcct ggggaaaaga cgtgcaaaaa
ggttgtgtaa agcagtctcg 960caaacgtttg ctttccctgt tagaattgcg ccgaatttta
tttttctacc gcaagtaacg 1020cgtggggacc caagcagtga ccgataaaac
1050821050DNAArtificial Sequencegene editing
homology arm pool_4_b4260_left 82gttcagcact tcaacggttt gaccggacat
ggtggttaac acatcgcccg gacgataggc 60tcgtccgcca ggcatgtttt cgcagcctgc
caacacgccg ataacgttaa tcggcagttg 120tagctccgcg accatccgca tcacgccgta
aaccgctgcc gcaccgcaca tatcgtactt 180catctcatcc atgccttctg aaggcttgat
cgagataccg ccggagtcga aggttaaacc 240tttacccacc agcacgattg ggcgtgcatc
ttccgacgcg ttgcctttgt actcaatcac 300cgacatcagc gattcgtttt gcgaaccctg
accgaccgcc agataggaat gcatccccag 360ctctttcatc tgctgttcgc cgataacgcg
ggtgatgaca ttcttgctgt agctgtcagc 420cagctggcgc gcttgtgaag cgaggtaagc
ggcgttacag atattcggcg gcatattgcc 480gagatctttt gctgctttaa tcccggcggc
aatcgccaga ccgtgctgga tcgcgcgctc 540accgctggtc agttcacggc gggtcggcac
gttgaacacc atcttacgca gcggacgacg 600cggttcgctc ttgttcgttt tcagctgatc
gaaactgtag agcgtctctt ttgccgtctc 660gacagcctga cgcactttcc agtagttgtt
acggccttta acgtgcagct cagtcagaaa 720gcagaccgct tccattgagc cagtatcatt
cagcgtatta atggttttct gaataacctg 780cttgtactga cgctcatcca gctcacgttc
tttgccgcaa ccaataagga gaattcgctc 840ggaaagtaca ttcggaacat ggtgcagcaa
caatgtctgc cccggttttc cttccagttc 900gccccgacgt agcagggcgc tgatgtaccc
atcgctgatt ttatcgagct gttctgcaat 960cggagaaagg cgacgtggtt cgaagacgcc
cacgacgatg caggcactcc gctgtttctc 1020cgggctaccg ctttttacac taaactccat
1050831049DNAArtificial Sequencegene
editing homology arm pool_4_b4260_right 83tttacgggcg tatttaaagt
gataatcata agatatctgg tgcgggagac gctcaaaagc 60cagctggcga tactcttcat
cttgcttttg atcttcttct gtcaaaagtt agtgaggatc 120ctcggcgcag cggttgacgg
cgatattccg gcgaatctgg tgctctccct tctcgggttg 180ggcgtgccgg aaatggcgca
gcttatcctg ccattaagcc tgttcctcgg gctgctgatg 240acgctgggca aactgtatac
cgaaagtgaa attacggtaa tgcatgcctg cggcctgagc 300aaagcggttc tggtgaaagc
ggcaatgatc cttgcggtat tcacggcaat cgtagcggcg 360gttaacgtga tgtgggcggg
accgtggtca tcgcgtcatc aggatgaagt gttagcagaa 420gcgaaagcga accctggcat
ggcggcgctg gcgcaagggc aattccagca agcgactaat 480ggcagctcgg tgctgttcat
cgaaagcgtt gacggcagcg atttcaaaga tgtgttcctc 540gcgcaaattc gaccaaaagg
taatgcacgt ccttctgtgg tggtggccga ttccggacat 600ttaacccagc tgcgcgacgg
ctcccaggtc gtcactctca accagggaac gcgcttcgaa 660ggcactgcat tgttacgtga
tttccgcatt acggacttcc aggattatca ggcgatcatt 720ggtcaccagg cggtggcgct
cgacccgaac gataccgacc agatggacat gcgcacattg 780tggaacactg acaccgatcg
tgctcgcgca gaactgaact ggcgtatcac gttggtattc 840accgtgttta tgatggcact
tatggtcgta ccgctgagcg tggttaaccc acgtcaggga 900cgcgtactgt cgatgctgcc
agccatgctg ctgtatctac ttttcttcct gatccagacc 960tccctgaaat cgaacggcgg
taaaggtaag ctggacccga cgctgtggat gtggaccgtt 1020aacctgattt atctggcttt
agcgattgt 1049841050DNAArtificial
Sequencegene editing homology arm pool_4_b1412_left 84aatgactgct
ggtgtttatt taggggcgtt tccacgacat attccggcac agaatgcggt 60tctggacgtc
acctttgaat tccctcgcgg acgagccaca aaagatcgac tctatttttg 120tgtaccgatg
ctggatctgg tggttccgga agagggggag ctccgacagg ccgtggcgat 180gctggaaaca
ttacgcgaag agcaaggcag cgttctggtc cattgtgcat tgggattatc 240gcgcagtgcg
ctggtggtgg cggcatggtt gttatgttac ggacactgta aaaccgttaa 300tgaagcgatt
agctatattc gagccagacg cccgcagatt gtgctgacag acgagcacaa 360agcgatgctg
agattatggg aaaacaggta agtggattga gatgtggact gaatatctac 420agtccacatc
aagaccgtgt ccggttatgc agaaacaatg ctgtcgatgg ctgcttttgc 480gtcagactgt
gctttcgctg ccatttccgg accgtatgcg atcccttcgg cgaagacaaa 540tttcacatcg
gtaatgccga taaagccgag gaacgtggac agatacggcg tcaccaggtc 600cgttggtcca
tctttgtgga tcccgccgcg gctggtaata acgatggctt ttttacccgt 660taccagacct
tccggaccgt tctcggtata gcggaaagta acgcctgcgc gggcaaccag 720gtcaaaatag
tttttcaact gagtcgagat gttgaagtta tacatcggtg ccgcaataac 780gataacgtcg
tgggctttca gctcggcaat caactcatcg gaaagtgcca gagcttcctg 840ctgacgcgga
gtcagcggcg catcgctcgg acgcagagcg ccaaccagtt cgccatccag 900taccggaatc
ggatttgcag ccaggtcgcg aacggtgatt tcatcagcgg agtgcttttc 960gcgccattgt
tcaacaaaat aatcggacaa ctgattagac tgagagtacc ctgccaggat 1020gctggattta
agaactaata ccttgctcat
1050851050DNAArtificial Sequencegene editing homology arm
pool_4_b1412_right 85gaatttattg aacaacgcat agaaagccgc gatgtggtac
tctatatcta tcatttaaaa 60gaaaattaat caggcagact actgcccact aacgttatga
cagaacaaca aaaattgacc 120tttacggcct tgcagcagcg gctggattcg ctgatgctgc
gtgacagact gcgtttttct 180cgccgtctgc acggcgtgaa gaaggttaaa aatcctgatg
cacaacaggc cattttccag 240gagatggcga aagagattga ccaggcggca gggaaagtcc
tgctgcgtga agcggcacga 300ccggaaatta cttatcctga caatttaccg gttagtcaga
aaaaacagga cattctcgaa 360gcgattcgtg atcaccaggt ggtgatcgtc gccggggaaa
cgggttctgg taaaacgact 420cagttaccga aaatctgtat ggagctgggg cgcgggatta
aaggactgat cggccatacc 480cagccgcgtc gtctggcggc aagaacagtg gcgaaccgta
ttgcggaaga gctgaaaacg 540gagccgggcg gttgcatcgg ttacaaagtg cgtttcagcg
atcacgtaag tgataacacg 600atggtcaagc tgatgaccga cggtatcctg ctggcggaga
tccagcaaga ccgcctgctg 660atgcagtacg acactatcat tattgacgaa gcgcacgaac
gcagcctgaa tatcgatttt 720ttgctcggct atttgaaaga gttgctgccg cggcgtcctg
acctaaaaat cattatcact 780tccgcgacta tcgacccgga acgcttttcg cgccacttta
ataatgcgcc gattattgaa 840gtctccggtc ggacctatcc ggtggaagtg cgctatcgcc
cgattgttga agaagccgat 900gacaccgagc gcgatcagtt gcaggcgatt tttgacgccg
tagacgaact gagtcaggaa 960agccatggcg acattctgat ctttatgagc ggcgagcggg
aaatccgcga taccgccgat 1020gcgctgaaca agctgaactt acgccatacc
1050861050DNAArtificial Sequencegene editing
homology arm pool_4_b4139_left 86aacagtggtg tcgttaccga tgactttgaa
gcatacctgg ttaaccactt ccggaacaac 60cgggtttact ttagctggca tgatggaaga
gcccgcctgc agttccggca ggttgatctc 120gttcaggccg gcacgtgggc ctgaagagag
caagcgcagg tcgttacaga ttttggacat 180cttcacagcc aggcgtttca gcgcgccgtg
aaccataaca taagcgccgc agtcagaggt 240cgcttcgatc aggtcttcag ccggtacgca
tgggaagcca gtaacttcag ccagtttttt 300cactgccagc ggagagtact ctttcggcgt
gttcagacca gtaccgattg ctgttgcacc 360aaggttaact tccagcagca gttcagcggt
acgttggatg tttttcactt cttctttcag 420caggatgctg aaagcgcgga attcctgacc
gagggtcatc ggtactgcgt cctgcagctg 480ggtacgaccc attttcagga tgtcctggaa
ttcgacagct ttacgttcaa agccttcacg 540cagttggtta atcgcatcta ccagcttaat
cagggaagag taaactgcga tacggaaacc 600ggtcgggtag gcgtcgttag tggactgaca
tttgttaaca tggtcgttcg ggttcaggta 660ctgatattca cctttttggt gacccatcag
ttccagaccg atattggcca gcacttcgtt 720ggtgttcatg tttacggaag tacctgcgcc
gccctggtag acgtctaccg ggaactgatc 780catgcatttt ccgttgttca ggacttcatc
acatgcggca atgatggcat tcgctacact 840tttaggaatg gtttgcagct ctttgtttgc
catagctgcg gcttttttaa ccattaccat 900accgcgaaca aattcaggaa tatcactgat
tttgttgttg ctgatataga agttttcaat 960cgctctcaga gtgtgaacac catagtaggc
atcagctgga acttccctgg tacccaacag 1020atcttcttcg atacgaatgt tgtttgacat
1050871041DNAArtificial Sequencegene
editing homology arm pool_4_b4139_right 87ttttacccct taattattaa
tttgtgaaat agatcaccgc tttgggatta ctaccaaaaa 60tagttgcgca aacatcttga
aattttgcta atgaccacaa tataagctaa acgcgattcg 120caacccattc aggtagccgg
ggttaaccgg ctgctattac aggagaaacc tttgcgctgg 180ttacctttta ttgccatttt
cctttatgtc tatattgaga tttcaatctt tattcaggtt 240gcccatgtat tgggggtatt
gctgaccctc gtgctggtta tattcacgtc agttatcggt 300atgtcactgg tacgtaacca
gggctttaag aatttcgtgc tgatgcagca aaaaatggcg 360gcgggtgaaa acccagcggc
ggagatgatt aaaagtgttt cgctgatcat tgctggtttg 420ctgcttttat taccgggctt
ttttaccgac ttcctcggtc ttctactttt attgccgccg 480gtgcaaaagc atctgacagt
gaagttgatg ccgcatttgc gcttttctcg catgcctggc 540ggcggtttta gcgccgggac
cggtggcggt aatacttttg atggtgagta ccagcgaaag 600gatgatgagc gcgaccgcct
tgatcataaa gacgatcgcc aggattaatg tcgaaacgcc 660ggattatgtg gttatgccat
tttccggcgt ttttcgtttt ggcagcaaca gccataaccc 720cgccagcatg atcagcgcat
agagactttt ccagccgacc attgccagta acagaacgca 780taacagcccg ccaaccaccg
ccagtagtcg ataacgtcct tgcaataatt tacagcctgc 840cagcatgcat aacagataaa
tcataataaa gatgccattg gcataaataa taagagcgtc 900cagattgatc tctaaagcat
gaatcaccaa agtgctcacc acacagcagc cgagcaccgc 960attgagggca ttattcggga
tatggcgaga agagaggcgt gccaggtagt ggtcaggatt 1020atgttgcgcc tgcgaccaga c
1041881044DNAArtificial
Sequencegene editing homology arm pool_4_b2039_left 88ttgatgaacg
tgagtgattg tcttgaacaa agctgaccgg atagcctaga atatgttcaa 60atgctgattg
attaaagctc tcataaaaga aacctctatc atcaccaaat actcttggct 120ccagaattag
cacatcttca atttcagttc taatcacatt cattaatttg aatccttcgt 180cattttataa
agatactgcc cataattatt ctttattagt ggtacagcta attttcttac 240ttgctcaaca
tcaataaaac ctttacgaaa tgcaatctct tcaggacagg aaaccttcaa 300tccctggcgc
tcttcaattg tcgcaataaa attacttgct tctatcagac tctgatgagt 360ccccgtgtcc
agccacgcgt agccacgccc catcatcgcg acagacagac gtccctgctc 420aagataaata
cggttaatat ctgtaatttc taactcacca cgtgcagacg gcttcaagtt 480tttcgccatc
tgaaccacgt cgttatcata aaagtacaga cctgtaacgg cgtaattact 540ctttggttct
aacggttttt cttccagact gattgccgta ccgtttttat caaactcaac 600gacaccatag
cgttctggat cattaacgtg ataggcaaat accgttgcac cactttcttt 660gttaacagcg
gcctccatta gcttcggcag atcgtgaccg taaaagatat tatcaccaag 720aaccaaagca
caatcatcac caccaataaa ctcttcaccg atgataaatg cctgcgcgag 780gccatctggg
ctaggttgca ctttgtactg aagattcagg ccccactggc taccgtcacc 840cagcaattgt
tgaaaacgag gagtatcctg aggtgtactg ataatcaaaa tatcgcgaat 900acccgccaac
atcagtgtag agagcgggta atagatcatc ggtttatcat aaataggtaa 960tagctgttta
ctgacagcca tagtcacagg ataaagacgt gtaccagaac cacccgctaa 1020aataatacct
ttacgcattt tcat
1044891050DNAArtificial Sequencegene editing homology arm
pool_4_b2039_right 89ttcatcattc cttttaattc atcttgctcc accatcacga
acaagatgca aaaactatta 60aattgctgta gtcgtaaata attcattgag cattcgtttc
acgccaacct gccagtcagg 120caagacaagc gcaaagttct gctgaaattt ttctgtatta
aggcgagagt tatgtggacg 180acgagctggt gtaggatagg ctgttgttgg tactgcgttg
agcttgttga gtgcaagggg 240aatgcctgct ttgcgcgcct cttcaaaaac cagcgcagca
taatcgtacc aggttgtggt 300accactggct accaaatggt acaagcctgc gacatccggt
ttattcagtg cgacacgaat 360ggcatgtgct gtacaatcag ccagcagttc agcacctgtt
ggcgcaccaa actgatcgtt 420aataaccgct aattcttcac gctcttttgc cagacgtaac
atcgttttgg cgaagttatt 480tccttttcct gcatagaccc agctggtccg gaaaataaga
tgcttcgcgc aatattcctg 540taacgctttt tctccggcta acttggtttc accgtaaaca
tttagtggtg cggttgcatc 600cgtctccagc catggcatat cgccatttcc agggaagacg
taatcagtcg agtaatggat 660aacccaggct ccaacttcat ttgctgcttt cgcaatcgct
tcgacacttg ttgcgttaat 720taattgtgca aactccggtt ctgattctgc tttgtctact
gcggtgtgag cggctgcatt 780gacaataata tccggccgaa tgcttcttac ggtttcagct
acaccttcag gattactaaa 840atcaccgcaa taatcagtag agtgaacatc aaaagcaatc
aaattaccca aaggtgccag 900agcacgctgt agttcccaac ctacctgccc tgttttgcca
aaaaggagga tattcattac 960tggcggccct catagttctg ttcaatccac gattgatagg
caccactttt cacattatca 1020acccattttg tattggacag gtaccattcc
1050901050DNAArtificial Sequencegene editing
homology arm pool_4_b4473_left 90catcacccgc ccatagtgag cgttggattg
gtgcagcacc gcgccagcgt ggcagcagtg 60aaccatgaac gttgatacag ccaagacgcg
gcatctccag cactgctttc ggcagaatta 120aaccataggc gacgacgacc ataacatcag
cctgcagttc ggcgaccagt tgctggtttt 180cttgtggacg cagggaaaca ggttgaaaaa
cgggcagacc tttttcctca gccagaactt 240taaccgggct gggcatcagt tttttaccgc
gtcctgccgg tcggtctggc tgggtgaaca 300cgccaacgac gttatgacca gaagacaaca
gcgcgtcgag atgacgcgct gcaaagtcag 360gtgtacccgc aaaaataata cgtagtgatt
ctgacacgtt agttcttatc cttaagcccg 420ggctttcaga cgatccagtt tttcaacttt
ctgacgaata cgttgttgtt tcagcggtga 480cagataatcc ataaacagtt tgccgaccag
gtgatccatc tcatgctgaa tacagatggc 540taacagaccg tctgcttcca gttcaaatgg
tttaccgtcg cggtcaaggg cgcgaatttt 600aactttctct gcgcgcggca ctaaagcacg
ttgttcaggg atcgacaggc aaccttcttc 660aatgcctgtt tcgccgcttt tttctaaaag
ctctggattg attaacacta gccgttcgtc 720acggttttcc gaaacatcaa taacaatgat
acgttgatgg atatcaacct gggttgccgc 780caggccaata ccttcttctg cgtacatcgt
ctcgaacata tcatcgacga tacgctgaat 840ttctgcattc acttcttcta ccggtttagc
aactttgcga agccgctcgt ccggaatatg 900taacacttgc aaaactgaca taaatctcca
gagatgtgtt caggagttag aaagattatt 960tcttctattc tagacaaatc cccctctgat
tgacagcatc actgaccaat cgcaaagatt 1020gctaaggctg cttatggcag ggagataagg
1050911050DNAArtificial Sequencegene
editing homology arm pool_4_b4473_right 91atggtcgata cagaaatttg
gctgcgttta atgagtatca gcagcttgta cggcgatgat 60atggtccgta tcgctcactg
ggtggcaaaa cagtcgcata ttgatgcggt tgtattgcag 120caaacagggc ttacattgcg
gcaggcacaa cgctttcttt catttccacg aaagagtatc 180gaaagctcac tttgttggtt
ggagcaaccc aaccatcatt taattcctgc ggacagcgaa 240ttttatcctc ctcaacttct
ggcgacgaca gattaccccg gcgcactgtt tgttgaagga 300gaactgcacg cgctgcattc
atttcagctt gccgtagtgg ggagtcgggc gcattcatgg 360tatggcgagc gatggggacg
attattttgc gaaactctgg cgacgcgtgg agtgacaatt 420acgagtggac tggcgcgtgg
aatcgatggt gtagcgcata aagcagcctt acaggtaaat 480ggcgtcagca ttgctgtatt
ggggaatgga cttaatacca ttcatccccg ccgtcatgcc 540cgactggctg ccagtctgct
tgaacagggg ggcgctctcg tctcggaatt tcccctcgat 600gttccacccc ttgcttacaa
tttcccacga agaaatcgca ttatcagtgg tctaagtaaa 660ggtgtactgg tggtggaagc
ggctttgcgt agtggttcgc tggtgacagc acgttgtgcg 720cttgagcagg ggcgagaagt
ttttgccttg ccaggtccaa tagggaatcc gggaagcgaa 780gggcctcact ggttaataaa
acaaggtgcg attcttgtga cggaaccgga agaaattctg 840gaaaacttgc aatttggatt
gcactggttg ccagacgccc ctgaaaattc attttattca 900ccagatcagc aagacgtggc
attgccattt cctgagctcc tggctaacgt aggagatgag 960gtaacacctg ttgacgtcgt
cgctgaacgt gccggccaac ctgtgccaga ggtagttact 1020caactactcg aactggagtt
agcaggatgg 1050921046DNAArtificial
Sequencegene editing homology arm pool_4_b3510_left 92aatcaacgcc
agttgcaaaa aaatgaaccc ggattactta tttaaccgcc catagcatca 60acgcttacgg
aactctttgt tcactttccc acagcacttt taaaacatta aacctgacta 120cggaaaatat
cagccatgat aatgtttgaa atggctaatt tgccatagag tgaaaaaaat 180tagatgaaat
tcagtaggtt gaaataatca ctagcaggta attatttcaa tgatagtgcg 240caattgatct
acaacactgc gtagcggaga gagtattaat cggatcatag tcacatcaag 300tgactatgat
ccgggtgaca accggggtaa ttattgctgc ttaacgaaca aactggcgaa 360gctgaacagg
ctggcggcgc taaatatcag ttcaattccc accagtgtgg aaaccagcgt 420tacagacacc
atcggcgttg caccaaggaa tatccaggca atgacgatat ccagcacacc 480aataacgagc
tgtagccagc tgcctttcat tgaacgctga cgataccaac tcatcaggcg 540aataacccct
gcaacacaga acaaaccggc aataaatgcc gcaatggcaa aaatgcccag 600ctccggtgcg
cggatgaaga aatagccgat caataaatag gcgactgcga cgaggaaacc 660ggataatacc
ggccagaaat tatgactgcg gttgctgaat aacccgacaa taagcgcaat 720acccgagcag
attaataatg cacccactac tgtgcttaaa atatcgccag agacgaacgg 780gaaactgata
cacagcaacc cgacgataaa cagcagcacg gcaataaact ggattgctct 840gcgatgtttt
ttaagcatct ccagatcaaa cttcaaaatt gttgccttat ctatatataa 900catagaacca
ccctataaaa ttaagaagaa aatcccctgc tatcaatcta tgccaaaaac 960gcgtctaaga
atgcagtcga tttaataaaa atttcctaat tgcagtatct gatgcatctg 1020taactcattg
tattgaaata aaaata
1046931045DNAArtificial Sequencegene editing homology arm
pool_4_b3510_right 93atgaaaaaag tattaggcgt tattcttggt ggtctgcttc
ttctgccagt tgtgagcaat 60gcagcggatg cgcaaaaagc agctgataac aaaaaaccgg
tcaactcctg gacctgtgaa 120gatttcctgg ctgtggacga atccttccag ccaactgcag
ttggttttgc tgaagcgctg 180aacaacaaag ataaaccaga agatgcggtt ttagatgttc
agggtattgc aaccgtaacc 240ccagctatcg ttcaggcttg tactcaggat aaacaagcca
actttaaaga taaagttaaa 300ggcgaatggg acaaaattaa gaaagatatg taattccggg
aatgcgttac atcgtacttc 360cttgcatatt gaacaggccg gaatatcttc tttaaaagca
gctattcctc ctgttcatat 420ataatctcta tattgaatgg gttacaaaat gaatatttca
tctctccgta aagcgtttat 480ttttatgggc gctgtagcgg ctttgtcact ggtgaacgca
caatctgcgt tggcagccaa 540tgaatccgct aaagatatga cctgccagga atttattgat
ctgaatccaa aagcaatgac 600cccggttgca tggtggatgc tgcatgaaga aacagtatat
aaaggtggcg ataccgttac 660tttaaatgaa accgatctca ctcaaattcc taaagtgatc
gaatactgta agaaaaaccc 720gcagaaaaat ttgtatacct tcaaaaatca agcatctaat
gacttgccga attaatgagg 780tgcaagtaaa aaggagtagc aagttgagcc atcttgctgc
tcctttttgc atttttatat 840gacagcagaa tttattatac gtcttatact cgcggcaatt
gcctgtggcg ctattggcat 900ggaaaggcaa atgcgcggca aaggagcagg gttacgcaca
catgtattaa ttggcatggg 960aagcgccctg tttatgattg tttcgaaata tggttttgct
gacgtgctgt ctttagatca 1020cgtcggactc gaccccagcc gtatt
1045941050DNAArtificial Sequencegene editing
homology arm pool_4_b1007_left 94tcattaccgc catccagcta tcaaaggccg
aagccgatac acttttcact gcgataggcg 60ctaagttcag gccaatcgcc atcaccaccg
cacccgtcac caccggcggc atcagtcgtt 120caatccagcg cgtaccgatt ttcatcacca
ccaggccaat gacggtataa accagcccac 180aggcgataat cccgcccagc gcaatgctga
tattcgggtt aatgccctga ccgttaaagc 240ccgtcgcggc gatcaccacg ccgacaaaag
ccgcgctgga gccgagataa ctggggacgc 300gcccgccggt aataaagaaa aacagtaacg
tgccgatccc cgacattaaa atggaaagat 360tgggatccag ccccatcaga atcggcatta
acaccgtcgc gccaaacatc gccaccgcgt 420gttgaacgcc cattactgcc gtctgagcaa
acggcaatcg ttcatccggc gcgaccacgc 480cgctctctgt agaggtcgat tttaactgcc
agtgaggaaa accgaacatt gccatcagct 540gtctccttaa ggaggttaac aagcagggcg
catcagcgcg tgataactgc gatcgaacca 600caccagcccg tagggtgtgg tgtgacgatg
aatcgcttcg atggcgcaaa acagaatgtc 660gtgggtgccg acgctcacca cctggctgat
acggcagtca aacgaaacca gagcctcttc 720cagttgcggg catccggtca cccccgtctg
ccagcgggcg gcggcaaagc ggtgttccat 780gggcgttttg ccgccaaaaa ggtttgaaag
cggctcctgc ccggcgctaa gtgtatttac 840acacagcgtt cgattttcat tgaatgccgg
ccagacggac gccccacgat tcaggcacac 900cagtaatgtg ggcggcgtat cggtcacact
gcagacggcg ctggcggtga acccggcgcg 960cccggctgga ccgtccgtgg tgataatatt
gaccgccgcg cccatgcagg acatcgcatc 1020gcgaaaagtt tgttgatcga caatgttcat
1050951049DNAArtificial Sequencegene
editing homology arm pool_4_b1007_right 95agtttgctcc ttacaacagc
ccgcaggctt cttcaaagga cagacgtggc aggcgcgcat 60aaagcttgct gctatcgcca
tagccgatat taatcagcag attgctcttc agcgtgctgc 120ccgtaaaaaa ggcgtcgtcc
acgtgttgac ggtcaaagcc cgacatcggg ccggtatcca 180gtcccagcgc ccggcaggcg
acgatcagat aggccgcctg catggaactg ttgcgaaacg 240ctgtttcttc ggcaagttgt
gggctggagg taaaccaact gcgggcatca ccgtggggaa 300acagtagtgg taaccgttca
taaaattcac tgtcccaggc gacgatagcg gtgacgggcg 360cggtcagggt tttttgcaga
ttgccgctgg aaagtgccgg gcgcagacgt tcttttcctt 420ctgccgtgcg ggtaaacacg
atccgtgccg gagaacagtt agctgatgtc ggcccccatt 480tcatcagggc ataaatctcc
cgtaacgtct catcgctgac gggtgtctcc cgccagccgt 540tgtgagtgcg ggcatcggtg
aacagggtgc taagcgcacc tgggctaacg gcttcgttca 600tagcaattcc ttacagggcg
gcttcacggt gatgtaacag gctggcaagc ccgttgagta 660acagagcatt aaacgtttcg
ggatcggtca cgttgcaggc gtgtccgcca tagggcatca 720ccattttctg gctatcgggc
agggcggcat gaagttcact ggaacatgct gttggcacca 780gcagatcatc actggcgcag
atgatttgca ccgggcagcg gatgcgatcc gcatggtgac 840taaagtcagc gcgtttgagg
gcgttaagtc gacgcagtaa attatttttg ccctgaaaat 900gcgccagtgc cagcgcgtct
tctgcctcca ggcgaggtgc gcgggccgcc atccagtcgg 960cgggatagag gaacaacggc
tgcgcttcca cccatgcctg cgcgccgccg ctatacagta 1020atcgttcgcg aacctgaaaa
cagcggcgc 1049961050DNAArtificial
Sequencegene editing homology arm pool_4_b3058_left 96ggaacgaaaa
ctcggaagca gcgtaacggc tcacccccat cagcatccca cctgaaatgg 60tcgccccgga
acgggagaaa cccggccaca gcgccagaca ctggaaacag ccaatcataa 120atgcctgacg
ataggtcata tcatcaagac ccggcgcacg cggctctttc ggcttcaggc 180attcggcggc
aatcagcaac aaaccgccaa cgaccagcgc atacatcaca tttatcgggt 240taaacaatga
cttaatcgtg tcgtggaaca acagccccaa taccaccgcc ggaatcatcc 300ccagcaaaat
gtggatcagc gttaaacgac ctttgctttc accttcgtgc tgcaacgggc 360ggccaaagtg
gatgccaatc aggccaaaca gacgccgcca gaacatcact actaccgcca 420gaattgatcc
taactggatc acaacttcaa aggttttcgc cgtgtcgccc tcaaacccca 480acaagtgacc
gacaataatc atatggcccg tgctggatac cggcagaaat tctgtcaatc 540cttcgaccac
acccaatatt gccgctatca gcagcgagtg catatcgctc atcaataaac 600ccctaaatta
ttaaaatgta ccgcttgtcc gaactactgc gtatgaccag gttataaccg 660tttggtttaa
cagctgtaaa attaattatt ttctttcaga ttattgccac gctcaatgat 720tacgccaaca
ttcgccgccc gcgccactgc gcctggcttg ctgagtttga tacgcaccca 780cggcgagttg
aagcgtgcta acagcagctc cgccacctct tcagccacgc gttccaccag 840cgcaaaacgc
gccccctcga cgtggctgac caccgtttct gcaatgtcag cgtaactgag 900gcaatccgcc
acatcatcac ttttcgccgc tttacggtta tcccacgcca tttcgatatc 960gaacactaac
ttctgttcga tggtctgttc ccagtcgtaa acaccaatag tggtgattac 1020cgaaagttgc
tctataaata caatatccat
1050971050DNAArtificial Sequencegene editing homology arm
pool_4_b3058_right 97cacgtcctgc ctgctttttg gctaaccgga taccacttcc
ggcgaaatgt gcgtattatc 60cacagattca tcgttgaaca cgaattttca aaacggaaca
gcttatgagt gcaatcgcgc 120ctggaatgat cctcatcgcg tacctctgcg gctccatttc
cagtgccatt ctggtttgcc 180gcttgtgtgg gctgcccgat ccgcgaacca gcggctccgg
caatccaggc gcaaccaatg 240tgttacgtat cggtggcaag ggagcagccg tagcagtact
gattttcgac gttctgaaag 300gaatgttgcc cgtctggggc gcgtatgaat taggtgtcag
ccccttctgg ctaggcttaa 360ttgccatcgc cgcctgtctt ggacacatct ggcccgtttt
cttcggattt aaaggaggaa 420aaggcgttgc taccgctttt ggtgccatcg cacccattgg
ctgggatctc accggagtaa 480tggcgggaac ctggttactg accgtgctat tgagcggata
ctcgtcgctg ggagcgattg 540tcagtgcact gattgctccg ttttatgtct ggtggtttaa
gccacaattc accttcccgg 600tttcgatgct ctcttgcctg atcctgctgc gtcatcatga
caacatccaa cgtctgtggc 660gtcgtcagga gacaaaaatc tggacgaaat tcaaaagaaa
gcgcgaaaag gatcccgagt 720gatttctggt ggatctacat gacctgatag ccttcatcgg
gcttgcccag ccgttgctgg 780caccatgccg ccagaaattc cacgcagaca cgtaatttca
tgctgcgata gagcggctcc 840cggtaaacag cccagatatt ggcgctttgt gcatactctg
gcaatacttg caccagtttg 900ccactctcca gaaacggcaa cacatcccac tcggaacgca
gcataatccc tttgccctcc 960agcgcccatt gcagcacaat ctcgccgcta ttggaggaaa
gatgcccgct taccttcacc 1020gattttttct cctgaccgtt ccccaactcc
1050981050DNAArtificial Sequencegene editing
homology arm pool_4_b2688_left 98gaggaatcgc acctgctgtt catctacacc
aatcggcgag aacgggttga tgtccagcga 60acgcacttca atatattcaa tgccgccacg
taacagcgca tcagaaggcg actcgccgct 120gcgggtaacg cgttttggac gaatcggcgc
gtacagttcg ttttcaatct gcaacacgtt 180gctgttgatt tgcagcctct taccgtcttt
ctcaatacca atcttcgcgt actcttccga 240tggcgttttg attgcctgtt taaggcccgc
tacgtactcg taaagatcgt tgaaggtaat 300accaagattg ctttgcgatt tattggtata
gccgagatcg ctcaaacgaa gagaggtcgc 360atacggcagg taatacatac cgcactcggt
tttctcaaac ggcagcgacg ttggttttcc 420ttgcaggaaa gaagaacaaa tcgccggaga
tgcaccaaac agataaggaa tgacccaacc 480gaaacgatag taattgcgga taacgcggaa
atagcccgca gaaattttct ctttggcatc 540agcgcccgag atatcaccgc acttcgcttg
ccagaatgcc attggcaaag agaaattgta 600gtgcacgccg gaaatggttt gcatcagcgc
gccgtagcga tttttcagcc cttcacgata 660cagcgtttta aagcgtccgg tgttagaagt
gccgtactgt gccagttcga tgtcctgacc 720ttctgcgatg tagcatggca tacttaacgg
ccacatccgc tcatcgccca tattgcgcgc 780cgtataacga tgcagatcgc gcataaaggt
cagcatatgt tcaatatcac catccactgg 840tgtaatgaat tccagcaatg cttccgcaaa
atcggtagta atccatttgt gcgtcagtgc 900ggaacctaat gcttcaggat gacctgttgt
tgccagtgtg ccatcagcat taacacgcaa 960agtttcgcgc tccagcccac gctgtatccc
ctttaacgcc tgaggatgtt tttccagcca 1020ggccagcgcc tgtgatacgt ccgggatcaa
1050991039DNAArtificial Sequencegene
editing homology arm pool_4_b2688_right 99attgacctcc cgcctgtcaa
aatcgtttta attagcataa ctgtaatggt gaccatatgt 60gcaggcctac aattagtgcc
accacatcat gccctgaacg gtcgctgctg caactgcaac 120atagcgtaac gctttaccaa
ggcacaaaaa aaagattacc ggtccccacg agatgcgcat 180ccatcccgct aacagacaca
gtaaatcgcc aaccacgggc atccagctta ataatagcgt 240gactgcacca tagcgtttca
gccagccggt agctttctct tgccagcgcg atgttttacg 300caatggaaag aaacgcccaa
ggataacgtt agttaaccct ccaaggctat tacccattgt 360tgctgttaaa actaaaaccc
agggatgact gatcccggaa agcaacattg ccaccagcac 420gacttcggag ttgccgggta
atagtgtagc gctgagaaaa ctactggcaa acaacgagaa 480aagcgataac gcttcactca
cagcaagcga acatccacgg cgtccatgcc tgctgcacgg 540gccgcctgaa taccgaaatc
ggcatcttca aagaccacac actgcgtcgg ttgcacgccc 600atacgctgcg cgcacaacaa
aaatgtgtct ggcgcgggtt tatggtgttt gacgtgatcg 660gcagcgacga cggcgtcaaa
ataatggcgt aatcccaggt gcgccagcaa tgcctcagcg 720atggcgcttt cactccccgt
tcctacagcc attgggcgac gaccatgcca acttttcacc 780acatcaacaa gaggaagcgg
ttcgacgcta tccagcagca tacttcttac tgcttctgtt 840ttttcacgcg ctaacgcatg
cgggtcgaga tcggcctgat tcagctcaat aattgcctga 900gcaatacgcc aggtgggcga
tccattaagc gcaatcatcg cctgaatatc gtactgaaga 960ccgtagtgcc ctaatacttc
gcgccacgct ttacggtgcg taggctccgt atccaggatt 1020gtgccatcca tatcaaaaa
10391001050DNAArtificial
Sequencegene editing homology arm pool_4_b1716_left 100agcgctttcc
agttccgctt tacgcgcatt cagcgcctgc tgaacctgct ctttcgcttc 60gttgataacc
gcaccagctg ccggacgctc ttctggcggc agctcacgca gggtcgtcat 120ctgaagggtt
aagtgccctt ttttacccaa atattcgacg cgcacattat ctaacgcggc 180aacatctgac
gcctggctaa tggccgcctt cgcactggca accagttctg cgagatgtga 240catggttttc
ctcattgtgt cagtggtgac actggttcgt tggacttaga gcctatccca 300tcaggctatt
ttacttgcca ttttggtccc gggctgtgct cgaaattctc acgtacttaa 360atacgctccg
gtttctccgc gctggccgtg tccagtctgg ctgcgacaat tacacctgat 420gagacaggct
ttttattttt caaaacgcgc atacaaaaaa agcctccact gggaggcttt 480caggcgctgt
tttccgtttc tcttctcacg cgctagcctc ctggattcag gtgctaaagt 540aaaaaaagaa
gcggaaaata gcagcattca ttgcttgcgt taccttttgg tactcttcaa 600aagaccttta
ttgaaaaggc tacggcgata aaagtcaatg ttttgatggc gttgaaacga 660aaagagggag
actagctccc tctttcaact ggcttatgcc agagctgctt tcgctttttc 720aaccagagcg
gtgaacgcta ctttgtcgaa tactgcgata tcagccagga tcttacggtc 780gatttcaaca
gaggcttttt tcaggccatt gatgaatttg ctgtaagaaa taccgttctg 840acgtgctgct
gcgttgatac gcgcaatcca cagttgacgg aactgacgct tacgttgacg 900acggtcacgg
taagcatact gaccagcttt gataacagcc tggaaggcaa cgcggtatac 960gcgagaacgc
gcaccgtagt agcctttagc ttgtttcaaa attttcttgt gacgtgcacg 1020tgcaataaca
ccacgtttta cgcgagccat
10501011049DNAArtificial Sequencegene editing homology arm
pool_4_b1716_right 101ttatgcgtac ggcaggcacg cgattaccag gcccagatcg
cctttggaaa ccatggcttt 60cggacgcagg tgacgtttac gtttggtcgc ttttttggtc
agaatgtgac gcaggttagc 120gtgcttgtgc ttaaaaccac ctttaccggt ttttttgaag
cgcttagcag caccgcgtac 180ggtcttaatt tttggcattt taataacttc cacttcgcat
tgttaataaa cgaaacaaag 240gcgaacaaag cctgtgaagc ccgaaggctc cacagacagt
gctacttgaa ggccttactg 300tttcttctta ggagcgagca ccatgatcat ctggcggcct
tcgatcttcg ttgggaagga 360ttcgaccact gccagttctt gcaaatcgtc tttcacgcga
ttaagcactt ccataccgat 420ttgctggtgc gccatctcac gaccgcggaa acgcagcgtg
attttggctt tatcaccctc 480ttcgagaaag cgaatcaggc tgcggagttt tacctgatag
tcgccttcat ctgtaccagg 540acggaattta atttccttaa cctggataac tttttgcttt
ttcttctgtt ccttagaaga 600cttgctcttt tcatagagga atttgccgta atccattata
cgacaaaccg gcggctcggc 660gttagggctg atctcgacta agtctactcc ggcttcttct
gctttctcca gagcttctct 720cagactcaca ataccaagct gctcgccttc cagacctgtt
aagcgaactt cctgggcgcg 780aatttcgcca ttgatacggt tagggcgcgc cgtttgaact
cgttttccgc ctttaatacc 840ttattcctcc aattgtttaa gactgcggct gcgaatctct
tgttgcagct tctcgatcac 900ttcatttacg tccatgcttc ccaggtcttt accacggcgg
gtgcgaacgg caactttgcc 960tgattccacc tctttatcac cacagaccag catatatggg
acgcgacgca aagtgtgctc 1020gcggatttta aagccaatct tctcatttc
10491021050DNAArtificial Sequencegene editing
homology arm pool_4_b3071_left 102gtcgctggtt caagtccagc aggggccacc
agatatagca aaggctgacg agaaatcgtc 60agcctttttc tttttatata tcagttactt
tgcgtgccag taagccgctg cacgtacccg 120ctgtgggtca tactgttccg cttcaaagcg
gcggcttaaa ttcttaacga ctttaccttc 180gccggttatc cagatgaagt aatcatcggc
agggatttgc atctgcgcca gacgcgcatc 240taccgcctgc tcatcatgtg ccagccattc
gatattaaaa ccatcaaggt gcgcgagata 300atcctgacag gcgttatccc gcacgctaac
cagcgcacta acttgcggtt taacggcaag 360tttgctcaac gtttccaggc ggcggcgcaa
tgcaggcatt ccggattcat cgcagacata 420cagctgatac gcgtaatctt ccggcaccac
cagcgaaccg cgcggacctg ccaccgtaag 480tttatcgccc ggttgcgcct gcatcgccca
gccgctggcg accccaccgt cgtgaataaa 540gaaatcaatc gccagttcat ggcgtagttc
gtcatacagc ggcgtatagt cacgcgacgg 600tgggcgtggt ccttccggcc agacgatgcc
ctcttccgtt accgttggcg gcacaaagtg 660agcgtcaggt tgaggaaaga agagtttgct
gtgatcgtca aagccacgcg atgtaaaacc 720gtccagcgcc tcgccgccga ggacaatgcg
ctgaaaaccg gcgctgatgc gctcaacgcg 780taacacagtc agttcacgga agcgcagatc
attgcgaacg cgctgcgggt agcggggggt 840gttattcatt gttatcgcct tcgtgatggt
aatcagatat atctaaataa aactcgcaaa 900tgataatgat tgttaatcat gataaatgca
agcgatttgt agaactgata tgtctatagt 960ctgataagac gaaccgcctc ttctcaggca
tcattactca acgccggatg cggcgtgaac 1020gccttatccg gcctacgtgt gagatgagtc
10501031050DNAArtificial Sequencegene
editing homology arm pool_4_b3071_right 103atgagccatc atcacgaagg
gtgttgtaaa catgaaggcc agccacgcca tgagggctgc 60tgcaaaggtg agaagtcaga
acacgagcac tgcggacacg gtcaccagca tgaacacggt 120caatgctgcg gtggtcgcca
cggtcgcggc ggcggtcgtc ggcaacgttt ctttggtcac 180ggtgaattac gtctggtgat
tctggatatt ctctcgcgcg atgacagcca cggttacgaa 240ttgattaaag cgattgagaa
tctaacccag gggaattaca ccccaagccc gggcgtcatc 300tacccaacgc tggattttct
gcaggagcag tcgctgatta ccatccgcga agaggaagga 360ggtaagaagc agattgcgct
gaccgaacaa ggcgcgcagt ggctggaaga aaaccgcgaa 420caggtggaga tgattgaaga
acgcatcaaa gcgcgttgcg ttggcgcggc gctgcgccag 480aacccgcaaa tgaagcgggc
gctggataat tttaaagcgg tgctggattt acgcgtcaac 540cagagcgata tcagtgatgc
acaaataaaa aagatcattg cggtgatcga ccgcgccgct 600tttgatatta cgcaactgga
ttaatcgccg catccgccag tggcgcggtg caattgccgg 660atgcgacgct tgacgcgcct
tatccggcct acacccgcta cacaccccgc aggcctgata 720agatgcgcca gcatcgcatc
aggcattgtg ctccaaccgc cggatccggc ataccgatta 780atgcagtacc gtcaccgcgt
cttccagtcg gctggcgcgg tgtttcacca tcgccgacac 840ctgcgcactc tcttccacca
gctcggcatt tttctgggtg atcaggttaa gctcatccac 900tgcacgggtc aggctggaaa
gcccatcggc ctgttccagc gttgaatggc taatctgggc 960gatcaactgg gtgacgtttt
tcacctgtgc cacaatatct tccatcgtcc gtccggcggc 1020gtgtacctgc tgcgaaccgg
attgcacctt 10501041050DNAArtificial
Sequencegene editing homology arm pool_4_b2139_left 104cttttgcgat
atcgagattg gcgttaagac gaccactatc gaaaatgggt agcgtcaggc 60ctgccgtaac
gcccatttgc tgcgcggaat gacggaacag atcgcttaag tgcaacgcat 120cctgttgcag
gaaggccatc aggttgatgt caggataaaa tgccgctttt gccgcatcaa 180tggtgcttag
cgatgactca acgtaccagt gcgccgcctg caaatctgcc cgccgggcca 240gtaaggagta
ccccagttca tcaggaagct ggcttgccac tttcggcaac gcgaccggtt 300taagcttcaa
tgactttgtc tggttatttg taagtgcgct taaccgtgcc tcaataattt 360tcattttccc
cgcgacatcg ttgagctgct gccgggtttt gctggcatta atatcggttt 420ccacaccttc
aactgaagaa gtaatcccgt tctgatatag ctggcgatcg gtcgcgataa 480tggtgttctg
ctctttttct atttgctgca agaccgtgtt taacgccgcc tgggtttgcc 540actcccagta
caggcgggct acgctgccag ccagcaattg gcgggtttgc tcgcgttccg 600ccgcccgtgc
tttaaccgta cccaggcggg cagtaacctc cgcccgattc tttccccaga 660tatcgagatg
ccagcccgcc gttaagccaa aagtaccgtt ggtgtaccac gggccggtcg 720tacctgcggc
cggatcgttc agagcaaacg gccccattaa gccttctgcc gacatttttt 780gccgctccat
atccgccgaa aagtcgatct gcggaccatc ctgagtggca actgccttcg 840cctgggcttc
agctagctga atgcgctgtt cagccacctg catatccggt gcgttctgta 900gtgcattgtt
aattaaggaa gtgagttgat tatcgtgata ctccagccac cattggctgt 960ctggccaacc
attttcagcg ccgtgggtaa tgcggtgtca acttgtgcag cgggcgtttg 1020ctggcttaac
gcctggcggg tttcatgcat
10501051047DNAArtificial Sequencegene editing homology arm
pool_4_b2139_right 105aggcgcacac ccggccagca tcagtaacag cggaaaacag
gcgatggctg gataaaagga 60atcacgattc atgggggaat aatcaggtaa gaaaaggtgc
gcggagatta ccgtgtgttg 120cgatatattt tttagtttcg cgtggcaata catcagtggc
aataaaacga catatccaga 180aaaatataca ctaagtgaat gatatcttcc gatttatctt
aatcgtttat ggataacggc 240aaagggcttc gttttttcct atacttattc agcactcaca
aataaaggaa cgccaatgaa 300aattatactc tgggctgtat tgattatttt cctgattggg
ctactggtgg tgactggcgt 360atttaagatg atattttaaa attaattaat gtcatcaggt
ccgaaaataa cgagaatatt 420tcagtctctc atcctgttgc gctcctgtca tgtgcattgc
ttcatataat cactggcgca 480aggagcgcgc agggggcggc caatcgccgc cgccccctgc
acccccgggc tctggcgaac 540aaaatcgccg ctgcgcggtg ccctcggctt atcccttacg
gctaccgggt cgggcgcgag 600gtaacatccc tgtaaaacgc gccctcagcc cacatccatg
tgggctgccc cggccttcag 660ggaacgcctc ggcaattttg acgccaccaa acaaccgtgc
ggcctattga taaagagcta 720acacattgtc aaaaaacatc actatggttt tttagagttt
ctcgatatca attgcctgaa 780tagcccttgc aatatcaggg gaattattca acacccgaac
atgctgaaat aattccgttg 840cttcatcgta ttctttacgc aaataactca accactgttt
aatccgcgca acgtgatata 900acccggtatc gccctgcttt tccagacggg tatatttttg
cagcaaagca accacctccg 960gccacggcat tcgcggttcg ttatatttta ccacccggct
caggttggga atattgagcg 1020ccccgcgacc aatcatcact gcgtcgc
10471061050DNAArtificial Sequencegene editing
homology arm pool_5_b2434_left 106tcgctcactt tggtggacga cccaaaccag
ttccacgggt tagcggcaga ccagttaact 60gacgacatcg tggaacagcc ggtcagcatc
aatggcatag cgcataacat taaacgcagc 120gatttcatgt cacttccttt ggttattcaa
taacgttgct tggagtgcaa attcaccaaa 180aagtgccgtt attcttttac ttcataaaaa
caagcgcgta aacgtcgatt ggtcagcagc 240cagattaacg ccacaatatc cgccactacc
agcgccagac cgataccgct aacggattca 300ccgttcagcc acagccacgg ttgccagcaa
agtaaaacca cctgcgccaa cagcaacaga 360aaatagagta cacgccaggt gcgaggaaac
gtagcccgcc gaccgctcaa caaaaacgcc 420agcaccgccg gaatgccagg aatcagcccc
agccagaaat tatcgtgatc gggataaaac 480agatttagca gcgcagtacc ctgctcgcgc
gacgcaccgg caatgacaaa cagcacccag 540gttcgcgcct gaagcaatag cacaagccag
aagagcaagg gtaaacgcag gcgaccgtgc 600gcatcataat ggacaggatg aaactcagta
ctcttcatct tcaatcaaac gcttacccag 660actcagcacg tcggcgtgtt catatcccag
gcgttcatac attccgagca ccatgtcgtt 720atcttccggc acattgatct gaattttcgg
gcagccacga gcaatcagct ttttctccag 780ccgattaagc aacgcattgg caatcccacg
cccacgaaac tctggatgca cgccaagata 840ataagcagac ccgcgatgcc cgtcataacc
gcccatcacc gttccgacca cgtcaccgtt 900tacctcagcg accaaaaaca aactgacgtc
atggttcatc ttacgctcga tgtccatttc 960cggatcgttc cacggacgca gcaagtcgca
acgctcccaa agggtgatga cctcttcgaa 1020atcttcctgg cgaaatacgc gtatctccat
10501071050DNAArtificial Sequencegene
editing homology arm pool_5_b2434_right 107ggtattcgtt acctttttgc
gggttaaaag gctgattatg gcgtgaacgg tcgaattagc 60caatatctga cgaaaatcgg
ttgaaaaagt ggcataatgg ggagttgtca actattgaaa 120tgaaaagtaa aacaattctc
aacagcaaac cgtcgtaacg gattacgcga tacgatataa 180catctggaac tttattatta
caactcaggc cgtatgagca cttttaaacc actaaaaaca 240ctcacttcgc gccgccaggt
gctgaaagcc ggtttggctg ccctgacgtt gtcaggaatg 300tcgcaagcca tcgccaaaga
cgaactttta aaaaccagca acggacacag caagccgaaa 360gccaaaaaat ctggcggcaa
acgtgtcgtt gttctcgatc caggtcacgg cggaattgat 420accggagcga tcggacgcaa
cggttcgaaa gaaaaacatg tggtgctggc gattgctaaa 480aacgtccgtt ccattttgcg
taatcatggg attgatgcgc gtttaacgcg ttctggcgat 540acgtttatcc cactttacga
tcgcgttgaa atcgcccata aacatggcgc agatctgttt 600atgtcaattc atgccgatgg
ctttaccaac ccgaaagctg ccggtgcttc ggtatttgcc 660ctctctaacc gtggggcaag
tagcgcaatg gcgaaatacc tgtctgaacg cgaaaaccgc 720gccgatgaag ttgccggtaa
aaaggcgact gacaaggatc acctattgca acaagtgctg 780tttgatctgg tgcaaacaga
taccattaaa aatagtctga cgctcggctc gcatattctg 840aagaagatta agccggtgca
taaactgcac agccgcaaca ccgaacaagc ggcatttgtg 900gtgttgaaat caccgtcggt
tccttcggtg ctggtggaaa cctcgtttat caccaacccg 960gaagaagaac ggctgttagg
cacggcggcg tttcgtcaga aaatcgccac agcgattgct 1020gaaggcgtga tcagttattt
ccactggttc 10501081028DNAArtificial
Sequencegene editing homology arm pool_5_b2037_left 108aatgaaataa
gaaaaggcag aggcgatata atcattagca caatcactgc attatcatat 60cccggcccta
tacttatttt tactagtata gatgcaccca agagcagaat taatgaaaaa 120gcaccaccaa
tcaaactcaa gcaggtcaat gattttttaa ttaaaatcac acccttcaca 180cgattaagaa
caagcgtact tgatattctt gggtatattg cttgggtgat aggatttaat 240agcccttgaa
gcgcgtttct tatagtattg gccgcattaa aattccctac ggacgttggt 300ccagatataa
atcccaggat aataactatt cccgtagaat ataaactaat agcagatgtg 360gaaataaaaa
catgaaaacc gtctgctaaa gatcgacgca cattatgtaa tgatagcgta 420actttaccaa
tccaaccttc atgaacaacg atagctagtg caataattcc agcaaccaga 480tttgcacttg
actgaataaa accggcaatt gctatatctg actttgtgtt cacaaaaata 540aatgttagag
ggataatagc caagcgggat aaaatactac ttaaagtcag ccatttcatt 600ttttcttttc
cctgaaacag ccagataggg tagattaaat tcccgactaa tgcaggaaca 660aacgaccata
taattacggc atgcttgtta tattcaggaa caagcaaggt catcgacgtt 720aagaaaatca
atgtaatgac gataagaact atttttgaaa atatcaccgc ccaaaaaata 780gacgttactt
tatctttact atctgctgct ttggcaatac tctgagttgc tgtgagattg 840aaaccatatt
caacaaacat tatcatatat agcatagtcg cttggcaaaa accgaatata 900ccgaaatttt
caggaccaag tgttcttaca agatatggaa atgtaagcaa tggtaaaaga 960taattgctac
cttgaacgac agccagatat ataacgtttc ttcttaaaga taatttattc 1020gtattcat
10281091050DNAArtificial Sequencegene editing homology arm
pool_5_b2037_right 109gcaattaatt ttaatctgat aagctcatct aacgtaaaga
gcctttcatc ttttggcgaa 60aggattaacc ctgatgtttg gggccaatca attgcaatgc
gttcatcatt ccaacatatt 120ccacaatcgc tttcaggatg ataatagttt gtagttttat
attgaaattc agcgatatca 180gacagaacca aaaagccatg agcaaaccct tttggtatcc
acaactgctg cttattatca 240gctgaaagca gaacaccaac ccatttacca aaggataccg
aattgggtcg aatatcaaca 300gcaacatcaa aaactgctcc atgagtgcag cgtacaagtt
tatcttgtgc gtactcgccg 360cgttgaaagt gaaggcctct gagtacattt tttgatgaac
gtgagtgatt gtcttgaaca 420aagctgaccg gatagcctag aatatgttca aatgctgatt
gattaaagct ctcataaaag 480aaacctctat catcaccaaa tactcttggc tccagaatta
gcacatcttc aatttcagtt 540ctaatcacat tcattaattt gaatccttcg tcattttata
aagatactgc ccataattat 600tctttattag tggtacagct aattttctta cttgctcaac
atcaataaaa cctttacgaa 660atgcaatctc ttcaggacag gaaaccttca atccctggcg
ctcttcaatt gtcgcaataa 720aattacttgc ttctatcaga ctctgatgag tccccgtgtc
cagccacgcg tagccacgcc 780ccatcatcgc gacagacaga cgtccctgct caagataaat
acggttaata tctgtaattt 840ctaactcacc acgtgcagac ggcttcaagt ttttcgccat
ctgaaccacg tcgttatcat 900aaaagtacag acctgtaacg gcgtaattac tctttggttc
taacggtttt tcttccagac 960tgattgccgt accgttttta tcaaactcaa cgacaccata
gcgttctgga tcattaacgt 1020gataggcaaa taccgttgca ccactttctt
10501101050DNAArtificial Sequencegene editing
homology arm pool_5_b2451_left 110cagatccgtt tcatcaatcg ggatcgccac
cggcaaattg cgcagcggca gttgtacgcc 60ctcaagccag attgtgctgc cagagagcga
aagggtatgc gcgcccgcgc caatcaccgt 120ggcgcgcacg gtttgcgccg gaaactgtac
gttcatctca cgcaggcgcg gatggtcatg 180cagcgcagtt gccagcagcg ggccaatatc
ggcaaaacag aacgggtcgg cgggctggtg 240gcgataacat tcgcccacgc cgccagaaag
cgtaatgatt tcgggcgtaa cacctgcggg 300cagcaaaccg gtttgcatca atgcctgcgc
gagcggtgag agcgttccgt caatcacttc 360gacaatcagt tctgccatcc gccgggtcac
ctgcaccagc tgcgcgccgg tcagcgaacg 420ggcgtcggtg cctgcaccga agcactcatc
cacaatcatc tgccccggtt tatgagcgta 480aaccacgcgc ccgtggctgt cggtttccag
caggcgacca ccgacgttga ggcaggcagt 540gccgctgatt tttccggcat cgaacagggc
gtagttcgcg gtgccaccgc cgatgtcgat 600attcagtacc cgacacagcc gttgttcaga
aagggtttgt gccccggctc cgtgaccggc 660gatcacggat tcgaggtgcg gcccggcgct
ggcaacgaca aaatcgccca gcgactgaga 720gagcgccatc accgccgggc gagcattgcg
ggttttcgcg ctttcaccgg tgatgatgat 780ggcaccagaa tcaacgcttt ccggctcaat
acccgcagca tgatattgct cgagtattaa 840ggttttcagt tccgcttctt ttaaaccgcc
ctgtttatcg acaggggtaa agaacaccgg 900actttgccag ctaatttcgc gtttaatgaa
ttcgtagcgc ggcacctgcg acaccgccgc 960acggttaacc agctccagcc gggagaaaat
cacctgggtg gtggtggtgc cgatatcgat 1020accgacgctc aatagctggc gagtgttcac
10501111046DNAArtificial Sequencegene
editing homology arm pool_5_b2451_right 111gattgtgcct ccgcttcggt
tttagtcgcg gtcgcgtctt cttttggcac cagcatcatc 60gccacgccaa tcgccgttac
gccgccgatc aacttgccga caatcatcgg gaagatcatg 120gcgttcatgt tggcagcggc
gaagcctaag tggtcgccca gggcgaaagc agcggaaacg 180gcgaaggcgc agttgatgac
tttgccgcgg gtatccatct gcttcatcat gccgaacatc 240gggatgttgt tggcaagcgt
tgccaccatg ccggctgccg cgatgttgtt catattcagt 300actttaccga cgctcatcag
cggtttttca aaccagcgag tcagcagcag caccatcgga 360tacgccccta acagaacgca
ggagatagaa ccgataactt caatggcgcg catcacctca 420ccgggtttat cgccaggggc
cataaagata ggatccagac cggggatcag ttcccagcca 480agcaggaatt tcactaccgc
agcggcaaga ccgagggtga tcaatgcaac gaggaatttg 540gcgaagatct ggaagccgtt
gatcattttt tccgggatga atttcagccc cagcgccacc 600agaatcgcaa caatgatcac
cgggatcatg ttcatcagga tcagggcgaa agtgaattcc 660accggctggc cgttgatctg
cacaccggag tacatagcaa ccagaccacc agcgatacaa 720ccaatcggaa tggtcacaat
gcccgccagc acgccgagcg ccagataacg acggtcagaa 780ggttcgataa tgccgagcgc
caccggaatg gaaaacacaa tcgttggccc catcatcgac 840ccgagaatta acccagagta
tagccacgcg gctacgtcgc cgcccgccag ctctttggcg 900aggaagaagc cgcccatatc
gcacgccagc agtgttccgg cgaacatcga tgggttagcg 960ccgagcattt cgtaaaccgg
aataattacc ggcccgagaa cgtgagccag taccggtgcc 1020agcgcggtca taccgaccat
cgccag 10461121050DNAArtificial
Sequencegene editing homology arm pool_5_b1902_left 112acatcccagc
cacgtttctg catctcttta tacagttcct ggccctgacg ttcgccaatt 60ttagtcgccg
ccatcatcac cagcggaacg gtatccattg gcttaccttt ggcgttaaca 120aactggtcat
ccacggcaat gactttcata tcgtagccac gcgctttcgc gacgatggca 180gagccgagtt
tggggtccgg agtacaaata acgaaacctt ttgcgccact ggcagccagg 240ctgtcgatcg
cgttcaatgt tttttcgcca tccggcacgg caatcttaat aacctcaaac 300cctaaatcct
tcccggcttt atcggcaaac ttccattcgg tctggaacca cggctcttcc 360ggttgcttca
ccagaaaacc gagcttcagg ttctccgcca tagcggattg tgacataacg 420gctgccagac
caatggctgc cagggcttta gtaaatttgt gcatggttct ctccagcttt 480agtgtcgttt
tgtgtagggc aaaaacgaat gacattcgtt aaattaatcg gaaaacaaag 540cattaccttt
taactaaaag ataagtgact gtgttgacat agttttagcg agaaattaat 600tctccatagg
agagcaatat cacatcgcag aattacagtg agaacgtgca taaatttagc 660gggaaaagac
ataagggaaa gccaatttgt cagacaaatt gtcgaatgca cagcagatta 720atccataaga
ttagcctgga aatccttgtt gtctttggta cccatgcggg atgtcttctt 780tttaaccagt
caataggccg cattacctgg cgttgagttt ttgaaatggt gtaataaccg 840caactcaaag
atgtggaaaa tgcacgtcat tcatttcgtc attaattatc actgtgctca 900ttaattaaca
gaacacgtat aatgagagcc atctcgcaaa aatgaaaaaa cgttttataa 960aatcatcact
tcatcatgaa ttcaaattca ttgattaata tcaacaagat acaaaaagca 1020ctatcattaa
aattcattgc agttacattg
10501131048DNAArtificial Sequencegene editing homology arm
pool_5_b1902_right 113atggcaaccg ctggaatgct tctcaaactc aactctcaaa
tgaaccgcga gttttacgca 60tccaatctct accttcacct gagtaactgg tgttctgaac
agagtctgaa cggcaccgcc 120actttccttc gcgcccaggc acagagtaat gtgacccaaa
tgatgcgcat gtttaacttt 180atgaagagtg tcggcgctac ccccatcgtt aaagccattg
atgttcccgg tgaaaaactg 240aactctctgg aagaactgtt ccaaaaaacg atggaagaat
acgagcaacg ttctagtacg 300ttggcacagt tagccgatga agcgaaagaa ctgaatgatg
attcaaccgt caatttcctg 360cgcgatctgg aaaaagaaca gcagcatgat ggtctgttgc
tgcaaaccat tcttgatgaa 420gtgcgcagtg cgaaacttgc gggtatgtgc cctgtgcaga
ccgaccaaca tgttctgaat 480gtcgtgtcac accagctgca ttgatcatca tcggcgctaa
tgcattgcgc cgatgaaggt 540tttgagaaac cgctgcctca tctgtttgaa gcagcggttt
ttttaatggg attcaccctg 600tggggtaaac ttgagttcaa taagcgcgat ggctttttgg
attgcccgca tggtgaccgg 660gtctgcggcg gcgggatggt tagtaaagtc gatattcttc
agctgactgg acattttttc 720acgaacttca acgggcgcga ttacatcgag aacatccaga
atttgtttga taaccaactg 780gcaagcaacc acatcagaaa ccaattcctg atcggcattc
agcggctggg acatcgtaaa 840ctcctgatag cattttgaaa gccgttatag tagcgacttc
acatcttcag cgatagtcac 900atccaccgtc atcaggacac aaaaaaacct gccggagcag
gttttttgtt atcggaacat 960attgcctggc ggtacgtctt tgaacgtctt gcaatagtta
ttgaacatac ttttcaggat 1020tttgcgcagt ttcatcgcgg cactccga
10481141049DNAArtificial Sequencegene editing
homology arm pool_5_b4310_left 114gctttgccgc ctgcagtttc accgccaata
atcaatagac tattattcca gggcaatgat 60actccgtagg cccgaccttg cgataattca
cccgatttat cccatttccc gttatgccaa 120agatgaatat cagtgctata tgattttttc
aggccttcat gcgcatagtt cttaccgttc 180tggtaatttt ctcgtgaacc tttgaatccg
gcccctccgg caaatataag agaatcattg 240cttatccccg caaaaccgcc agctacgcca
tctggtgatg agacgggagc aagcttattc 300cattttaaat tattaccggt gaaatcaagt
tcaaatacgg catccgttcg caatcctggt 360ttggcttcgc cattaataag ccaggtttta
tcacctttat tcacaaccgc cgcaccagcc 420gttccgtacc agggcgattc gccagcgtaa
ctccattgct gtgttgaggg atcaaaagac 480aacagaaact tattgaagaa ataatcttct
gcttttttgt caaaatagtg agcattgatt 540ttatctatag cggttgaatc ttttccagcc
tcgttgagat cttcaaaata gccattgaag 600atattctggt taacaccacc agtaacataa
gccttgccgt tgtgtacaaa agtcacatgg 660cccgccatgc ccatcggcgc gtgcgacatc
aatttaaccc aactattggt tttggggttg 720tatttgtgta cgtcattaaa tacctgagtc
aagccctcgc tgtttttgcc aatgccgcca 780aacacataca gattgccatc aataaatgca
gaggttgctt gatctcttgg tccgccaggg 840aatgcagcta acgctgtcca ttttttatct
ttggcctgtg tatccagctt gtaccatgcc 900gtacctgcgc tacctaaacc aatgtagaca
gtgtcgttat caattgctcc ggtaccactt 960ttaaatggca caggagtttc cggtaataca
gacgcgtttg cggcaaatga agccatcatg 1020atagcaagcg ccgttattgt tttattcat
10491151021DNAArtificial Sequencegene
editing homology arm pool_5_b4310_right 115tgtgactgtc tcctgtctac
tacagtttaa atgacacacc aatgcgataa ctgttttccg 60ataaattatc tctgccgttg
taaacaccct gacggtcaag gtagtcatat tctatgtatg 120gcgtaatatc gggcgtcata
tggtattgta gaacaaatgc attttccgtc gcccatttct 180tatggtttgc atagcgataa
tcgttctgtt tgctgtatag cgtcgtttgc catgcgaagg 240tgaaatcact attaatatgg
taagtgacat atccatccca acgatgaacg ttatcacgag 300acatatcacc ggataagtct
tgttgtcggt aagctttcca gtcgtaacga tagcgaatgc 360caaaattaag atcttttgtc
gcgtcccagg acagttttac gtagggtccg tagcgtgtgc 420cgttgctgct aaaatgcgtt
aacattcccg ggcgcaccgt ccattgatca tcaagtttaa 480tcgcgtaatt aacttcaacc
tgaacatcat tgagtgcggc attttccttt ttattatcat 540gaatggtatt ccaggtatta
ctttccatgc ttgcccacca tccattttgc catccctcac 600tgactttgag tcgagtctca
taggcgtggc ttccactacg atatccacca cgtacgtcca 660gtgtcgcagc ctgagaaatt
aatggggacg aaaagcacag taataatacg ccagaaagta 720ttttagcctt tttcataaat
ttcactcatt tgtaggatac agaaagcaat acaaagcccg 780cataaacaat tagcatttat
gttgtgtaat atttttttgc caggcttata gtgtctttgg 840caaccggtag ctgtatttta
tatttttttg tataaggtct cctgtgaaaa atctcttttc 900acattattta aataaacaga
gatccagatt aaatacctga gtataaaatc tcttctgatg 960tttaattgat ttgaatgttc
gtaagctata tcacttactc aatccatttt acccagagtc 1020a
10211161050DNAArtificial
Sequencegene editing homology arm pool_5_b0676_left 116cactttatcg
gcttcggtga tttcaccggc aataacaatt ttttgcggat taaataagtt 60gatagcaatg
gcgatggttt tacccagatg acgaccgaca tactcaatta cttccgacgc 120cagactatcg
cctttgttcg cggctttgca gatagttttg atggtgcagt cgtccagcgg 180cacgcggctc
tggtagccct gctttaacag attcaacacc cgttgttcaa tggcagcgtt 240ggcagcgata
gtttccaggc agccaaagtt gccgcagtgg cagcgttcac ccagcggttc 300gacctgaata
tggccaattt caccgacgtt gccgttgcgg ccaataaaaa tgcgcccgtt 360agagataatc
ccggccccgg ttccgcgatg gacacgcacc agaatggagt cttcgcaatc 420ctgacttgca
ccgaagtagt gctccgccag cgccagacta cggatatcgt gaccaacgaa 480acaggtcact
ttaaaacgtt cttccagagc ttctaccagc ccccagtttt ctacctgaat 540atgcggcatg
taatgaattt tgccgctgtc cgggtcaaca agccctggca ggatcaccga 600aatcgcgatc
agctcgcgca gtttgcgctg gtagctatca ataaactgag caatggcatt 660caacagggca
tgttccagcg tttgctgggt acgttccggc agcgggtaat gttcttctgc 720cagcactttg
ctgctgagat caaacagagt gatggtggcg tcatgacgac caagccgtac 780gccgattgcg
tggaaattgc gggtttcggt gacgatggag atagcgcggc ggcccccggt 840ggaggcctgc
tgatcaactt ctttgatcag cccgcgttcg ataagctgac gcgtaatttt 900ggttacgctg
gcgggggcaa gctggctttg ctcggcaatc tgaatccgcg agattggccc 960gtactggtca
atcaggcgat aaaccgccgc gctgttaagc tgttttacga gatcaacatt 1020acctatctga
gcttgtccgc ctggtgtcat
10501171050DNAArtificial Sequencegene editing homology arm
pool_5_b0676_right 117actttctctt attgagttac gacctcgtta ccgttaacga
tggtcttggt gattttaaaa 60tcaggtgtga atgcagtcag gttggctact ttacctgcgg
cgagtgtgcc gagacgtttc 120tcaacgccaa tcgcacgcgc cggatagagc gtcgccatac
gtagcacttc atccagtgcg 180ataccgcaat gttcgaccag attacgcacg ccttcaatca
tggttaagga tgaaccgctt 240aacgtaccgt tctcatccac acaaagtccg ttacggtagt
atattgtttt acccgcaaaa 300atgaactgtt caatgttggc acctgctggc gcggtggcgt
cagtaaccag acacagtttg 360tcgcctttca gacgtttagc gttgcgaatg ttggcgtaat
caacatgcag gccatcagca 420ataataccgc aataaatgtc agcttcgtcg aggatcgcgc
ccgccaggcc aggttcacga 480ccggtaatat acggcatcgc gttgtacaga tgggtggcaa
aggtaatccc cgcgcggaaa 540ccggcttttg cttctttcaa cgtcgcgttg gagtgaccgg
cagaaaccac aatcccggca 600tttgccagtt tgctgatgac ttccgcagga accatttccg
gtgccagggt cactttggta 660atgacgtcgg cgttttcaca caggaaatcg accagcgcgg
catcaggctt acgcacaaaa 720ttcggattat gggtgccttt ttttaccaga ttcagccacg
gaccttccag atgcagacct 780aacgcctgat tcggatgttt tgccaggtac tcgcgcataa
cgcgcacgcc ctgtttcatc 840agctcatcgc tggtggtgat aagcgtcggc agatagttag
tacagcctga tttctcattg 900gctttctgca tgatttccag cgtttccacg ctgaccgctt
cagcggtgtc gttaaactgt 960acgccgccgc agccgtttaa ctgcacatcg ataaaaccgg
gggagagaat ggccccgttc 1020agtgaacgtt gttcgatctc tggcggcagt
10501181007DNAArtificial Sequencegene editing
homology arm pool_5_b1497_left 118actttagtaa tacgatgctt aggacaaccg
ccattgcaga taggtttata tgcacattgc 60tgacatttcg ctggaatccg ttttttttgc
gctgtcagtt gtacactgtt catcgttttg 120agttcagatt tattaatgtt tccaattttg
tactgtggat agacaaaatg gtcgcattcg 180taaatgtctc cattactttc aacaaccaga
ttatccttgc aggactcctg gaaaatacaa 240ctggtatgcc cattccccaa aaaacggctg
acaaagcttt caaactgacg gatgaaaatt 300tcacccacat cgtttttaac ccattgcata
aaaatggttg acataaactt gccataagcc 360gtgggaggca cagaaaaatc aatgatacgg
aatgtgttct cactatgacc actgaaatca 420atattcggcg tcccggtttc tagcaattcg
ataaattgca tatgtttact gccgatagat 480tttaaaaaat gataaacctc aagagggtaa
tggacattaa cgttattaat gacggttaac 540gtattaaact ctacttgata tgatttcaga
cgctcgatgg ctgctatcac ttttgcaaaa 600gtaccgttac ctgaattact gcgtctgtaa
cggtcatgta actcctgggg gccatcgatc 660gagataccaa ccagaaattc atgttctttg
agaaaggcac accattcatt attcaataaa 720atgccattcg tttgtaatgc attaaaaata
cgtttttggc ctgcatagcg ttgttgatag 780tgaataactt tacggaaaaa atccaggcca
gccagagtgg gttcaccgcc ttgccaggta 840aaatagacct gattgccaga cgctgcgata
tattgtttga tgaactcttt cagagtgctg 900tcatccatcc atttttcatg agtaaactgc
gactcttttt caaggtaaaa acagtaatca 960catttgagat tacattgaaa actggagggc
ttggctgtaa cgtgcat 10071191050DNAArtificial Sequencegene
editing homology arm pool_5_b1497_right 119cgctatctcg ctcaataagg
cggcggaaaa atccgccgca tgaaggttta gttatttcgc 60ttcgcttagt gctttcttga
tattgttaaa cttctcctga tttacctcgc taagcggtgg 120ctggctgctg tcgataaact
ctcttaccac gccttgcatc tctttaacga cctgcggatt 180ggcggcggca aggttatctt
tttgctgtag atccgtcagt ttgtagagac ctaactgatt 240gttttctact gtatagacaa
gcgaataatc gttatttctc accgtataag agaattggct 300taagtcctca gtgttggggt
tatgcgggta atcgtctgac tgatggcgaa caaatttgtg 360gtaattatcc cagaatggaa
tattttcctc gtcaaaccag tgagaataag aggttatcca 420ggtcagattt ttatgtggct
cgccttgttt cttatcttgc aaccagggca gcaaggaaac 480gccatccagc ttaaggtctt
ttggaatgct gatatcggct gcatcaagag ctgtcgggta 540gaaatccatt gcggaaatca
gcttgtcata attaccgggt tgaagttttc ctttccacca 600cataaacatt ggggtgtgag
taccgccagg ataggtctga ctcttatagc ctttttgcgc 660cccgttcagc ggcagaggac
catcgataac cgcaccatta tcggaggtaa agagaataat 720tgtattgtca tactgtccgt
ttttcttcag ttgttcgaga atgcgtttta caccctgatc 780aacagaataa acggaagcgt
agtagttatc tgctgtttga ctaccggtat taaattgctt 840ctgatattga tccggtgcag
gattatcatt tggcaggtgc ggagcattat aagccaggta 900aagcataaaa ggctggtcaa
gtgttttggc acgatcaaca acgccaattg cctcatcggt 960taactgatcg ctgatataac
cttttgcggg gacacgttca cgatttttga acagtgaagg 1020ggagttgtaa tatgccgttc
ctgcagcgtg 10501201045DNAArtificial
Sequencegene editing homology arm pool_5_b0183_left 120ccaacgcccg
ctttgttggt gttgccgggc cacgaatgca ggctgaaggc tgcgaagcct 60ggtacgaaat
ggaagaactg gcggtgatgg gcattgttga agtgctcggt cgtctgcgtc 120gcttactgca
tattcgtgcc gatctgacaa agcgttttgg cgaactgaag ccagatgttt 180ttgttggtat
tgatgcgcct gacttcaata ttactcttga aggtaacctc aaaaagcagg 240gtatcaaaac
cattcattac gtcagtccgt cagtctgggc gtggcgacag aaacgtgttt 300tcaaaatagg
cagagccacc gatctggtgc tcgcatttct gcctttcgaa aaagcgtttt 360atgacaaata
caacgtaccg tgccgcttta tcggtcatac catggctgat gccatgccat 420tagatccaga
taaaaatgcc gcccgtgatg tgctggggat ccctcacgat gcccactgcc 480tggcgttgct
accggggagc cgtggtgcag aagttgaaat gcttagtgcc gatttcctga 540aaacggccca
gcttttgcgc cagacatatc cggatctcga aatcgtggtg ccactggtga 600atgccaaacg
ccgcgagcag tttgaacgca tcaaagctga agtcgcgcca gacctttcag 660ttcatttgct
ggatgggatg ggccgtgagg cgatggtcgc cagcgatgcg gcgctactgg 720cgtcgggtac
ggcagccctg gagtgtatgc tggcgaaatg cccgatggtg gtgggatatc 780gcatgaagcc
ttttaccttc tggttggcga agcggctggt gaaaactgat tatgtctcgc 840tgccaaatct
gctggcgggc agagagttag tcaaagaatt attgcaggaa gagtgtgagc 900cgcaaaaact
ggctgcggcg ctgttaccgc tgttggcgaa cgggaaaacc agccacgcga 960tgcacgatac
cttccgtgaa ctgcatcagc agatccgctg caatgccgat gagcaggcgg 1020cacaagccgt
tctggagtta gcaca
10451211040DNAArtificial Sequencegene editing homology arm
pool_5_b0183_right 121atgatcgaat ttgtttatcc gcacacgcag ctggttgcgg
gtgtggatga agtcggacgc 60gggccgttag ttggcgcggt cgtcaccgct gcggtgatcc
ttgacccggc gcgcccgatt 120gccgggctga atgattccaa aaagctgagc gaaaaacgcc
gtctggcgct ctatgaagag 180atcaaagaga aagcgttgag ctggagtctg ggccgcgcgg
aaccccacga aatcgacgag 240ctgaacattc ttcatgcgac catgctggcg atgcagcgtg
ccgtcgctgg gctgcatatt 300gcgccggaat atgtgttgat tgatggtaac cgctgcccga
aattaccgat gcctgcgatg 360gctgtggtga aaggcgatag ccgcgtaccg gaaatcagtg
ccgcgtctat cctggcgaaa 420gtgacgcgtg acgccgaaat ggcggcgctg gatattgttt
tcccgcaata tggttttgcc 480caacacaaag ggtacccaac cgcttttcat ctggaaaaac
tggctgaaca cggcgcgacc 540gaacaccatc ggcgcagctt tgggcctgtc aaacgcgcac
tgggacttgc gtcctgattc 600ttgtgtcgag attaagtaaa ccggaatctg aagatgtctg
aaccacgttt cgtacacctg 660cgggtgcaca gcgactactc gatgatcgat ggcctggcca
aaaccgcacc gttggtaaaa 720aaggcggcgg cgttgggtat gccagcactg gcgatcaccg
atttcaccaa cctttgtggt 780ctggtgaagt tctacggagc gggacatggc gcagggatta
agcctatcgt cggggcagat 840tttaacgtcc agtgcgacct gctgggtgat gagttaaccc
acctgacggt actggcggcg 900aacaataccg gctatcagaa tctgacgttg ctgatctcaa
aagcgtatca gcgcgggtac 960ggtgccgccg ggccgatcat cgatcgcgac tggcttatcg
aattaaacga agggttgatc 1020cttctttccg gcggacgcat
10401221047DNAArtificial Sequencegene editing
homology arm pool_5_b3631_left 122aatcctggtc ataaagatgc gatatcatgg
ggatatgtta ttaactactc ctgtcatcag 60tacgctcaag cagaattatc ctgatgcaaa
aatcgatatg ctgctttatc aggacaccat 120ccctattttg tctgaaaacc cggaaattaa
tgcgctctat gggataagca ataaaggtgc 180gggaactttc gataaaatta aaaatgtgct
ttcgttgata aaaactctgc gtgcgaataa 240ttatgacctg gtcattaatc ttacggatca
gtggatggtg gcgctgctgg tacgttgttt 300acctgcacgg atgaaaatat cgcaacttta
tggtcatcgg cagcatggta tttggaaaaa 360aagcttcaca cacttagcgc caatacacgg
tacacatatt gttgagcgta atttatcggt 420ccttgagcca ttaggtatta ccgatttcta
caccgacaca acaatgagtt acgccgaaga 480ttgctggaag aagatgcgcc gggaattaga
tgccctgggc gtaaaagatc attatgttgt 540catccaaccg acagcgcgtc agatatttaa
gtgttgggat aacgataaat tttctaaggt 600tatcgatgcg ctgcaacagc gaggctatca
ggttgtgcta acctgtgggc cctcggcaga 660tgatctcgct tgtgtagatg agattgcacg
aggttgcgaa acaaaaccca ttactggcct 720tgcaggtaaa acacgttttc ctgaactggg
tgcattaatt gatcatgcag tgctttttat 780tggtgtggat tctgcgccgg gacatattgc
agcggcagtg aaaacgccag tcattagtct 840atttggtgca acggatcacg tattctggcg
tccctggacc gagaatatta ttcaattctg 900ggcggggaat tatcagaaaa tgccgacccg
gcatgaactt gaccgcaaca aaaaatatct 960ttctgttatc ccagcggagg atgtgatcgc
cgctacggaa aagctgttgc cagaagatgc 1020cccttcagct gacaggaatg cacaatt
10471231041DNAArtificial Sequencegene
editing homology arm pool_5_b3631_right 123atgatcgtgg cgttttgttt
atataaatat tttccatttg gtgggcttca acgtgacttt 60atgcgcattg catcaacagt
tgccgcacgg ggccaccatg ttcgggtata tacacagtcg 120tgggaaggcg attgcccgaa
agcatttgag cttattcagg tgccagttaa gtcccatacc 180aaccatggac gcaatgcaga
atattatgcc tgggtacaaa atcatctcaa agagcatccc 240gcagatcgcg ttgttgggtt
taataagatg cctggcctgg atgtttattt tgccgctgat 300gtttgttacg ccgagaaagt
tgcgcaagaa aaaggttttt tatatcgttt aacatcacga 360tatcgccatt atgccgcatt
tgagcgagcg actttcgagc agggtaaatc gacgaaactt 420atgatgctga ccgataagca
aatcgccgat ttccagaagc attatcaaac tgaacctgaa 480cgttttcaaa ttcttcctcc
cggtatttat ccggacagaa aatacagtga gcaaatccca 540aacagccgtg aaatttatcg
ccagaaaaat ggcataaaag agcaacaaaa cttattactg 600caggttggat cagattttgg
ccgtaaaggt gtagatcgct caattgaagc tttggcatcg 660ttaccggaat cattacgtca
caatacgctt ttatttgttg ttggtcagga taagccgcga 720aaatttgaag cgctggcaga
aaaactcggc gtgcggagca atgtgcattt cttctccggt 780cgcaatgatg tgtcagaatt
aatggcagcc gctgatttat tactgcatcc cgcttatcag 840gaagccgcgg gtatcgttct
tctagaagcg atcactgctg ggttacctgt tttaacaaca 900gcggtatgtg ggtacgcgca
ttatattgcg gatgccaatt gtggaacggt catcgctgaa 960cctttctctc aggaacaatt
aaatgaagtt ttacgtaaag cgttaactca gtcgccattg 1020cgaatggcct gggcggagaa t
10411241047DNAArtificial
Sequencegene editing homology arm pool_5_b3791_left 124gcgctcgctc
tctttggtgg tgtagcgatc ttcaccgtgg aactcaccaa agtgttcccc 60cgcagggcaa
ccgtgcagcg gaatgtaatg aaacaccgcc atgatttccg cttctttcag 120aaagttaatc
aacgcgctcc ggtcatcaat atcccgcagt ttaatgtaga acatatgcgc 180gttctgcacg
cagccatcgg gaatcgacgg cagctcgata cgcccggctt tcgccagagg 240cgctaacgca
tcgtagtagt tttgccacag cgccagacgt tgctggttga tacgatccgc 300tgcttccagt
tgcgcccaca ggtatgcagc ttgcagatcg gacatcaaat agctggagcc 360aatatcgcgc
caggtatatt tatcgacctg accacggaag aactggctgc ggttagtgcc 420cttttcacgg
atgatctcgg ctcgttcgat taacgcttta tcgttaatca gcgtcgcgcc 480gccttcaccg
cccgccgtgt agtttttggt ttcatggaag ctaaagcagc caatatgacc 540aatggttccc
agtgcacgcc ctttgtaagt ggacatcacg ccctgagcgg catcttctac 600cacaaacaaa
ttatgctttt tcgccaacgc cataatggtg tccatttcgc aggccacacc 660cgcgtaatgg
accggcacga taacgcgcgt tttgtcggtg atcgccgctt caatcagcgt 720ttcgtcgatg
ttcatggtgt ccgggcgaac atccacaaaa acgatttttg cgccacgcag 780cacaaaggca
ttggcggtgg agacaaaggt gtagctcggc atgatcactt catcgccagg 840ctggatatcg
agcagcagcg ccgccatctc cagcgaagcg gtgcaggacg gcgtcagtaa 900cactttggcg
ctgccaaaac gttgctccag ccactgctgg cagcgacggg taaaaccgcc 960atcgccacac
agtttgccgc tacccattgc cgactgcata tagtcgagtt cggttcccac 1020caccggcggt
gcgttaaatg gaatcat
10471251048DNAArtificial Sequencegene editing homology arm
pool_5_b3791_right 125gtgatcacct gtataaccag tacgcggtgc tttctacatt
cgcaccactt tgtatgtatc 60gtttaagcgc ggcggtgttg cccatttggg tcgccacccg
caaagttgtt ttaccgcgag 120catacgccca gtttagcgcc gtttgcatca gctcagcacc
tgcaccgcgt ccagccagca 180ggccaattcg cgcatctgtc gcattgagtt cccgtaaaga
gacatagccg cgaatatcgc 240cggacgccgc acgtaaaatc agacattgat gatcaaaggt
gccgcgcacg gcattttcaa 300tccactgtgc ataaaagcga ctgctggcgt caggcgcata
ccacggcgca cgaaaacggc 360tttgcgcaaa tgcggcgctg gctaactgac gtaatgcggg
aatatcggtc tcttgtgcca 420ctacagcacc gctatcactg gcattgttca cgggtagcgc
caaatcaact tcaccttcta 480ccagggagaa tcccagctgt tgcagggcat ccagttcacc
cgtatttgat gccgcaattt 540tggcctgcac ccgtgaccac ggcgctaacg cgtctggcgt
caggagcggt gcttcagacg 600taatgcgcac gatggcgctg ttaacaccaa agaaggcgtt
ttcccaggtt agtggctcaa 660tactggcgcg gacgggcacg aagtaactcc agcagatatt
ggccgtagcc agttttcgct 720aatgaactgg cagcacgctt cacaccctcg tcatcgagcc
agccgttacg ccaggcaatc 780tcttccaggc aggcaatctt aaagccctgg cgtttttcca
ccgtctgtac aaaggtgctg 840gcttcaatca ggctgtcgtg agtgccggta tccagccagg
caaatccgcg cccgagcagt 900tcaacggtca ggttgcccgc ctcgaggtac atctggttga
tggaggtaat ctccagttca 960ccacgctccg acggcttcac ctgctttgcg tactccacga
ctttactgtc gtagaaataa 1020agcccggtca ccgcccagtt tgacttcg
10481261045DNAArtificial Sequencegene editing
homology arm pool_5_b0438_left 126gtgactgaaa aagaaaccac tttcaacgag
ctgatgaacc agcaggcgta atttacgcag 60cataacgcgc taaattcgca caaaggcccg
tcaccgccag gtggtgggct tttttttgtc 120atgaattttg catggaaccg tgcgaaaagc
ctctttcggt gttagcgtaa caacaaaaga 180ttgttatgct tgaaatatgg tgatgccgta
cccataacac agggactagc tgataatccg 240tccataaggt tacaatcggt acagcaggtt
ttttcaattt tatccaggag acggaaatgt 300catacagcgg cgaacgagat aactttgcac
cccatatggc gctggtgccg atggtcattg 360aacagacctc acgcggtgag cgctcttttg
atatctattc tcgtctactt aaggaacgcg 420tcatttttct gactggccag gttgaagacc
acatggctaa cctgattgtg gcgcagatgc 480tgttcctgga agcggaaaac ccagaaaaag
atatctatct gtacattaac tccccaggcg 540gggtgatcac tgccgggatg tctatctatg
acaccatgca gtttatcaag cctgatgtca 600gcaccatctg tatgggccag gcggcctcga
tgggcgcttt cttgctgacc gcaggggcaa 660aaggtaaacg tttttgcctg ccgaattcgc
gcgtgatgat tcaccaaccg ttgggcggct 720accagggcca ggcgaccgat atcgaaattc
atgcccgtga aattctgaaa gttaaagggc 780gcatgaatga acttatggcg cttcatacgg
gtcaatcatt agaacagatt gaacgtgata 840ccgagcgcga tcgcttcctt tccgcccctg
aagcggtgga atacggtctg gtcgattcga 900ttctgaccca tcgtaattga tgccagaggc
gcaactgtgc cgctatactt atccagggcg 960gcacaacgct gtaagcggct tgcgcctgag
aatggcattt gcgtcgtcgt gtgcggcaca 1020aagaacaaag aagaggtttt gaccc
10451271050DNAArtificial Sequencegene
editing homology arm pool_5_b0438_right 127atgacagata aacgcaaaga
tggctcaggc aaattgctgt attgctcttt ttgcggcaaa 60agccagcatg aagtgcgcaa
gctgattgcc ggtccatccg tgtatatctg cgacgaatgt 120gttgatttat gtaacgacat
cattcgcgaa gagattaaag aagttgcacc gcatcgtgaa 180cgcagtgcgc taccgacgcc
gcatgaaatt cgcaaccacc tggacgatta cgttatcggc 240caggaacagg cgaaaaaagt
gctggcggtc gcggtataca accattacaa acgtctgcgc 300aacggcgata ccagcaatgg
cgtcgagttg ggcaaaagta acattctgct gatcggtccg 360accggttccg gtaaaacgct
gctggctgaa acgctggcgc gcctgctgga tgttccgttc 420accatggccg acgcgactac
actgaccgaa gccggttatg tgggtgaaga cgttgaaaac 480atcattcaga agctgttgca
gaaatgcgac tacgatgtcc agaaagcaca gcgtggtatt 540gtctacatcg atgaaatcga
caagatttct cgtaagtcag acaacccgtc cattacccga 600gacgtttccg gtgaaggcgt
acagcaggca ctgttgaaac tgatcgaagg tacggtagct 660gctgttccac cgcaaggtgg
gcgtaaacat ccgcagcagg aattcttgca ggttgatacc 720tctaagatcc tgtttatttg
tggcggtgcg tttgccggtc tggataaagt gatttcccac 780cgtgtagaaa ccggctccgg
cattggtttt ggcgcgacgg taaaagcgaa gtccgacaaa 840gcaagcgaag gcgagctgct
ggcgcaggtt gaaccggaag atctgatcaa gtttggtctt 900atccctgagt ttattggtcg
tctgccggtt gtcgcaacgt tgaatgaact gagcgaagaa 960gctctgattc agatcctcaa
agagccgaaa aacgccctga ccaagcagta tcaggcgctg 1020tttaatctgg aaggcgtgga
tctggaattc 10501281050DNAArtificial
Sequencegene editing homology arm pool_5_b1981_left 128ggtgtcagga
gttattgcga tataagcatc ttttatgatt gctgctgaac gtttaatcga 60gggtggtaag
gataaacggt agacattatt ataacaatcc actaatgccc tggctttatc 120ttcacctttg
ggtccatgaa cgatcactat tggtatatct gtttcacttt aaatttttgc 180tattagattt
tctgcaatcg ataatgaaaa tgtacgttcc tgcgagctac cttctaaatt 240gaacgcaatg
taagatccta acgatcgcat ttcctcgcgc acctcatcga gtacatcctc 300acttagtggc
aattcatata ttggcctgac tgctggaaaa cccgcctcac gcatcataaa 360tgcccatgtc
ataggtacgg gagcccggag tttctgatcc atactggacg cgttcttgca 420caaaggggag
aagcaattca tggttatacc aacaacctga aaattcgttt ttgctttcaa 480ctgactgata
aataacatcg ttttcaggtt ctttttacgc atcccctcaa tgcaaagatc 540cggcgtaccg
tattgctgtg ttatgttctt tgctaaatct tttatttctt ttaatgttgc 600gtgatcctgc
atagtcattg tgactaatgt taatttagtc tgttcaagtt taagcgcatt 660aaagacttct
aaattaattg tcgacgttac aattaaaaga tgcttaattt tatgcaattc 720aagcgcccga
ataacaggaa agatggccat agcatcgcca atctgatcgg gaatatggat 780gacaacaaag
tctgtttttt caatattgaa attataagct ttataatcgt agtaactaaa 840tgcaatacgt
ctcaacaatg atgctaaaaa catacctaac ctcgcctccc tactggttat 900aatgcaatgc
agtctatcag actcatcagg gtgccatttt gtgcatatgc ggacttttat 960gtttcatatc
tctaacctgt gggtcctctg cttaatcctt aaacaacacc agcaactcct 1020gcgctttcat
cttccatcga atttttcatg
10501291036DNAArtificial Sequencegene editing homology arm
pool_5_b1981_right 129atggactcca cgctcatctc cactcgtccc gatgaaggga
cgctttcgtt aagtcgcgcc 60cgacgagctg cgttaggcag cttcgctggt gccgtcgtcg
actggtatga ttttttactc 120tatggcatca ccgccgcact ggtgtttaat cgcgagtttt
tcccgcaagt aagcccggcg 180atgggaacgc tcgccgcatt tgctaccttt ggcgtcggat
ttcttttccg tccgctcggc 240ggtgtcattt tcggtcactt tggcgaccga ctgggacgta
agcgcatgtt aatgctgacc 300gtctggatga tgggcatcgc gacagccttg attggtattc
ttccttcatt ctcgaccatt 360gggtggtggg cacctatttt gctggtgaca ctgcgtgcca
ttcagggatt tgcagtcggc 420ggcgaatggg gaggcgcggc gttgctttcc gttgaaagtg
caccgaaaaa taaaaaagcc 480ttttacagta gcggtgtaca agttggctac ggtgtaggtt
tactgctttc aaccggactg 540gtttcattga tcagtatgat gacgactgac gaacagtttt
taagctgggg ctggcgcatt 600cctttcctgt ttagcatcgt actggtactg ggagcattgt
gggtgcgcaa tggcatggag 660gagtccgcgg aatttgaaca acagcaacat tatcaagctg
ccgcgaaaaa acgcatcccg 720gttatcgaag cgctgttacg acatcccggt gctttcctga
agattattgc gctacgactg 780tgcgaattgc tgacgatgta catcgttact gcctttgcac
ttaattattc aacccagaat 840atggggctac cgcgcgaact tttccttaat attggtttgc
tggtaggtgg attaagctgc 900ctgacaattc cctgttttgc ctggcttgcc gatcgttttg
gtcgccgtag ggtttatatc 960acaggtacgt taatcggaac gttgagcgca tttcctttct
ttatggcgct tgaagcacaa 1020tctattttct ggatag
10361301050DNAArtificial Sequencegene editing
homology arm pool_5_b1709_left 130cgaaacgatc gcgcatcgtc acaaccacaa
atcccgaaca gtccacaccg cgccgcgtca 60tgccaccata acgatacggc gtgccatgcc
agctttgtag ctggtcgttc aaaccggcaa 120taacggtaat cgaatcagaa agtctggcat
ttggcggcgg tgctttatgg tggctacacc 180cggccagaag cagtgctgtg atcaaaataa
ggcagaaacg cattccgtac ggttcctctg 240ttttttattc ttgcattaat ttagcgtcgt
aattacccga ttttcaagat actaatgaaa 300tcagatggtc gaaatcagca ttctgtgacc
ttcgatatcc agacggcgaa aattcatccc 360ataggcctgc gccagatttg gcggcgtgag
cacctcttcc ctgcgtccac tggccagcat 420ttttccacct tttagcaacc acgcccgatg
cgcatgacgc aatgtgtggt tgagatcgtg 480actgctcatc acaatcgcca gtccttgctg
acacagcgcg ctcagaattt tgtctaacgc 540actttgttgc gcaacatcaa gactgttcat
cggctcatca agaagcagca attggcctgc 600gggattggct tgtggtgtga tttgcaacac
caccgcagca agacgtacgc gttgccattc 660accgccggaa agttgattgg tgctacgtcc
gagtttgtca tcaagagcca gcgcccctgc 720gacatcattc agtagttcgg tacgcgtttt
atcgtgctga tgcagtgtca ggtagtgcca 780gaccggcgtt gcaaacggcg gcgtctgctg
ttgtgaaaga taggcgcgat gcagcgcgag 840ttttgttgcg gaccatgctt ccagtggttg
ccccgcgaac tgaatgcttc ccttaccgct 900ggtcattccg gccattcgcg ccagtaaggt
actcttaccc gcgccattcg gccccaccag 960gtgcaggatc tccccagccc gaacctcgcc
agaaagcggc cccaggcggg tagattccgc 1020aacatcttgt aactgcatca caatagacat
10501311050DNAArtificial Sequencegene
editing homology arm pool_5_b1709_right 131tattttgcca acgccagttt
aatgctttcc atcacaatgg gatcttccgg cgtcatatcc 60ggggaaaaac gctggatgac
ttttccgtcc ctgccaacca ggaatttttc aaaattccat 120aaaatatcat ccgggtacag
cggtgcacgg cctttgctga ccatacgggc atagaatccg 180ctctcttccg gcgcgactgc
ggtcggcgct gcggcaatca atttttgata cagcggatgg 240cgtccttcgc cattaacttc
aatcttactg aacatcggga acgtcacccc ccatgtggtg 300gtacagtaag ttttaatctc
ttcatcgctg cccggttctt gttccagaaa ctggttgcac 360gggaatccca gcaccataaa
acctcgatcg acccaggctt tctgaatatt ctccaactgc 420tcatattgcg gcgttaagcc
acactttgag gcgacattga caatcaacag cacattaccg 480gcgaacttct ccagcgtggt
cacttcaccg tcgatatctt tcactacggt cgtcagaatg 540gaatcttgca tcgtttctcc
tgggtgtggt cagtaaaaat cttagctttt aatcatagac 600cgtctttttg cggctaacgt
cctgctttta acaataacca gataaacacc ggcgcaccta 660acgttgcggt gaccacgcca
ataggcagct ctgcggcagc taatgccagg cgcgctacaa 720tatcggccag cagcaatgcg
ctcgcccctg ccagcgcgca gccgggaagt aatacgcgat 780gatcggttaa accacacaac
cggagaatat gggggatcac cagaccaata aagccgatag 840cacccgccag cgccacactg
acgccaacca tccagccggt cgctgccacc agcacattgc 900gccagaacca caggggtaaa
cccagttgcc gcgccgagat ctcgccaagt gctaacatat 960tcatcggcct ggactgacaa
cagatccaca acaacacggg gatcaatgcc agcatcagcc 1020agctttgccg ccagtctacg
ccgccaaaac 10501321050DNAArtificial
Sequencegene editing homology arm pool_5_b2176_left 132atggattgca
tgcgtttcac tcaattgtac tttaattgac caaccccgct tattaacttt 60ctgtatcact
ttttcttata aaaaatcatg taaaaccgct cgccaagacc gcaccaatcg 120ggtaatctcg
aactcgtttt gcctcggcgg tagattatcc tcacagcata taattttgtg 180cgttagtcca
cagatttggc cttaaggaat tgtttcaaca tgcccaggta attagtctcg 240tgtcgcttgg
cattttttta taacgatatt tgtcgttaag gacttcaagg gaaaacaaac 300aacatggtca
aatctcaacc gattttgaga tatatcttgc gcgggattcc cgcgattgca 360gtagcggttc
tgctttctgc atgtagtgca aataacaccg caaagaatat gcatcctgag 420acacgtgcag
tgggtagtga aacatcatca ctgcaagctt ctcaggatga atttgaaaac 480ctggttcgta
atgtcgacgt aaaatcgcga attatggatc agtatgctga ctggaaaggc 540gtacgttatc
gtctgggcgg cagcactaaa aaaggtatcg attgttctgg tttcgtacag 600cgtacattcc
gtgagcaatt tggcttagaa cttccgcgtt cgacttacga acagcaggaa 660atgggtaaat
ctgtttcccg cagtaatttg cgtacgggtg atttagttct gttccgtgcc 720ggttcaacgg
gacgccatgt cggtatttat atcggcaaca atcagtttgt ccatgcttcc 780accagcagtg
gtgttattat ttccagcatg aatgaaccgt actggaagaa gcgttacaac 840gaagcacgcc
gggttctcag ccgcagctaa taaaccgttt ggatgcaatc ccttggctat 900cctgacgagt
taactgaaag cactgcttag gcagtgcttt tttgttttca ttcatcagag 960aaaatgatgt
ttccgcgtct tgatccaggc tatagtccgg tcattgttat cttttaaatg 1020ttgtcgtaat
ttcaggaaat taacggaatc
10501331050DNAArtificial Sequencegene editing homology arm
pool_5_b2176_right 133atgttcatac gcgctcccaa ttttggacgt aagctcctgc
ttacctgcat tgttgcaggc 60gtaatgattg cgatactggt gagttgcctt cagtttttag
tggcctggca taagcacgaa 120gtcaaatacg acacactgat taccgacgta caaaagtatc
tcgataccta ttttgccgac 180ctgaaatcca ctactgaccg gctccagccg ctgaccttag
atacctgcca gcaagctaac 240cccgaactga ccgcccgcgc agcgtttagc atgaatgtcc
gaacgtttgt gctggtgaaa 300gataaaaaaa cattctgttc atctgcgacc ggtgagatgg
acattccact caatgaattg 360attccggcgc tcgacattaa taaaaacgtc gatatggcga
tcttacccgg cacgccgatg 420gtgccgaaca aacccgcaat cgtcatctgg tatcgcaacc
ctttgctgaa aaatagcggc 480gtctttgccg ctctgaatct caacctgacg ccttcactct
tttatagttc acggcaggaa 540gattacgatg gcgtcgccct cattattggc aatactgcgc
tatctacctt ttcttcacgt 600ttgatgaacg ttaacgaatt aaccgacatg ccagtccgtg
aaactaaaat tgcgggcatt 660cctctgaccg ttcggcttta tgcagatgac tggacatgga
acgatgtgtg gtacgcattt 720ttactgggcg gcatgagtgg aactgtcgtt ggcctgctct
gctattacct gatgagcgta 780cgtatgcgcc ccggcagaga aatcatgacc gccatcaagc
gcgaacaatt ttacgtggcg 840tatcaaccgg tggtggatac acaagctttg cgagtaacgg
gcctggaagt actgctacgc 900tggcggcatc ctgtcgcggg agaaattccc ccggatgcct
tcattaactt tgccgaatcg 960caaaagatga ttgtgccgct gactcagcac ctgtttgagt
taattgcccg cgatgccgca 1020gaattagaaa aagtgctgcc ggtaggcgtc
10501341050DNAArtificial Sequencegene editing
homology arm pool_5_b2168_left 134agttttgctt ttcgcgccgc cgcgcccagc
agggtcttcg ccatataggc gcgtgcctga 60ccgagattag cgtcaataat cagcagcgtt
ttcattatgc ctctcctgct gtcagttaaa 120aggttgtaag tcgacgcgcg ccatcattgc
ggccaactgc ggacgatcgg taatacccac 180attgctttga cttaccgcca gggctgcaac
agctgtcgcc agacgcagtg tgtgttcact 240ggattcacgc atcagcaagc cataaatcag
gccaccaacc atagaatccc ctgcgccaac 300ggtgcttacg acatcgactg acggtggttt
ggcgatccat tcgccggagg cattaaccca 360aagcgcgcct tcggcaccca gtgaaataac
aacatgcgcg atgccttgtt cacgtagcgc 420atgtgcagct tcaatcacat ctttcatttc
aggcagttta cggcctgccc agatttccag 480ctcgcggcgg ttaggtttca ccagccacgg
tgccgctttc aaacctgcta ctaacgcttc 540acggctacta tcaaagataa tgcaaggaca
ctgactacgc aggcgagtca tccagtcggt 600gaacgcttcc gggctgacgc ctgacggtaa
gcttccgctg acacagacca tatcgaactg 660accgagccag ctcagagaat cagtcacaaa
gcgttcccag tcggcggggg tgacttcaaa 720acccgagaag ttgaagtcgg tcacttcgcc
gtctttttcc gtcagcttaa cgttaattcg 780ggtgcgcccc tgtacaacct ggaaacggtt
ggcaatgccc agctcgctga acagttgctg 840aaaaccatcc tgattgtctt tacccaggaa
gccgccaacg gtgacatcaa ttcccaggtc 900ttttaatact ttggccacgt tgatgccttt
acccgccgca tgcagaccgg tggttttcac 960caggttcact tcgccgcgtt caatttccgg
gcagaaacca acaaggtcat aagccggatt 1020aagggtgata gtagcaacac gtctgctcat
10501351046DNAArtificial Sequencegene
editing homology arm pool_5_b2168_right 135tatgcgccct ccccaagacc
agcagcgata gcgtcgccga ttgctttcag cgcctgttca 60gcatctgcac cctgggcggt
aaagcgtagg cgatgacctt tcttaacgcc aagtgccaca 120actttcatca gactacgtcc
gtttgccggt ttgccggtac catcaaggtt tgtcacggta 180atatcactgt taaattgttt
aatggtattg accagcatgg tacctggacg agcatgcagg 240ccgtgttcat tgcgcaccac
aaactccgcg cttaacacgt cgtcggtcgg cgcatcatcg 300ctggtcagca gcgccagcaa
cgttgccgca tccgctttca gcaagcggtc agctttattg 360tcgagcaata aatcagcgag
acgcttaaga accgcgatgg gctgatcgtc attcatcgcc 420acactcacca gcatggctgc
cgtttcgccg tccacatcaa aagcatttgc cgcacggctt 480accgcaatcg cgctacgcag
attgccttcg gcgctatcgc tcagccagat accctgtccg 540agattcagcg gttgttcatt
gatggctttg gtgacgaaag tggcgtcaac tgcccccgcc 600tctttcagac gcgcagcgtt
cagcgcctga agagtcagca gatcgctggc gacgatatcc 660agtgtcagca tttcgttgtc
gagcttcagc tgctcactct gcttttcgcc catcagtaat 720gcgcgaagtt cttctgctgt
tgttgctgac ttcagttgtt cagcaacgga atcatcgctc 780agtacgtggg tcagctggcg
tagcaggccc agatgttcat ccgagctggc agcaataccg 840attgccacgt acgctacctg
accgtcaccc caggtgacgc cttccgggaa ctgaaatacc 900tgaacgccgg ttttcagcac
ctgatcgcgg gtgtcggtag tgccgtgtgg aatagcaata 960ccattgccga ggaacgttga
ggtttgctgt tcgcgcgcca gcatgccatt gacgtagcct 1020tctgctacat taccggcctg
caccag 10461361050DNAArtificial
Sequencegene editing homology arm pool_5_b1872_left 136catcagcatt
gtgcgattcg ccgccatcag ttcagcgagt ttgactatct gcgcttccgg 60tacgccagta
atttccgccg cccagaccgc gctcttcggc gtattatcgc tcttacctgt 120cagatactct
tcaaactgcg gatacccggt agtgtatttt tcaaggaaca ctttatcgtg 180tttgccttgt
gtcatcaggg tatgcgcaat ccctaacatc agtgccacgt cggtgcccat 240attcggcgcg
atccaggtgg cattatcgtc aaagaattcg atggtttcgg agcggatagg 300atcaatggca
atcactggtt tgccagattt tttcagctga tggaagtatt ccagcccttg 360ctcatcggta
ctgctccagg caatttttaa ggtattcagc gggttcattc cccacagcac 420cacaacctgg
ctgttttcca gaatcagcgg ccaggaggtc tgctgttcat acacctctac 480agaaccgacc
acatgcggca tgatcacctg tgctgccccg gttgaataat cgccgctatg 540cccggaataa
ccgcccgcca ggttcatata acgttgcagt aaggtttgcg ctttatgcaa 600cacgccagaa
gagcgccagc cgtaagatcc ggcaaaaatg gccgatggtc cgttagcttt 660acgaatacga
tcatgttgct catgaatcag ttttaatgcc tgttcccagc tcacctgtac 720ataggtatct
tcgccacgac ctttcgccgg ttgcagtgga ttatcgagat agctttttct 780caccatcgga
tgctgaatac gcgccgtggt gtgtacctga tccgccgccg tagactgtaa 840ggaattcggt
atggttttcg ccagcgcgcc tgttgaagaa acaatcttgc cgtccttcac 900ttctacgttc
atcgctcccc aacgtcccgc ggtgaggatt ttaccgccct tctcttctgc 960ccatgcgggg
agcggtgctg ccgatgtcac caccagcgct ccagcggcaa taccgctgtg 1020tttaataaat
tcacgtcttg ttaatgtcat
10501371050DNAArtificial Sequencegene editing homology arm
pool_5_b1872_right 137aacttcctcc ctgatcaacg aggatcactg tttctcggta
atatctttgg cgttgtactg 60gaaataccgc gttaaaatgt ccagttcgtt ttcgctcatg
ctggttcgtg cccccattcc 120tttggcaatg gacggccacg cattgacggt gtaatggtcg
gcggcaatag gggcatgaca 180accagcgcaa taggtatcgt caagtttttc agcgtattgc
catagcggtt tacggtccgc 240taatgcggga tcggtaagcg caccctgtaa agacgcctga
cgccattgat tgccgtattc 300gtcagcctgc cattccccgt ttacagtgag cgccttgata
ccttcttcac ttaatgtggc 360tagcgccagc cgttgacctg ccgccaggta gagcgtgttt
tcactgccct gcatttgata 420accctgcaac agaacgatcg gctgtttgcc actggcatca
acgacggtga gatcggttcc 480aggattcacg gtagccagct cgcctatgtg agaagttttg
aaaggataaa tatgtgcgcc 540attagtaact gaagtagcgg cctgactttc cagctcatgc
gccgcgttgt catccatttt 600tatttctggc ggaaaatggg caatgccttt atgacaatcg
atacaggttt cgctgtcctt 660ttgtgctttg ttatgcattt tctgcgcaga ttcactttgc
gaggcaatat ccatggcatc 720aaaagaatgg caactacggc acgttgcaga gtcagtggct
tttaattctt tccatactgt 780ttcggccatt tcctggcgat gagcttcgaa cttatcgtca
ctgtctattt tgccgctaac 840aaattcatga taaatatctt tagatgcctt taatttagca
aataaataat ccatccctga 900ctttggaata tggcaatcgg cacattccgc acgtatccct
ttctggttcg aaaagtggac 960agttccctga tattcctcaa aaggtttact catcgagtgg
caggaaaggc aaaatgctgt 1020atccgacgtt ttatgtaaga ctttttgcgc
10501381050DNAArtificial Sequencegene editing
homology arm pool_5_b1203_left 138gatgtaatct ttaccttctg cacgcatttt
gcctgcttct ttcgcgcctt gttcaccttt 60gtaagtgatg aaatcttcaa acgagatggt
ttgtgcacgg ataaagcctt tttcaaaatc 120agtatggatt ttgcccgctg cctgcggcgc
ggttgctcca accggaatgg tccatgcacg 180cacttctttc accccagcgg tgaagtaagt
ttgcaggttc agcagtttat aaccggcacg 240gatcacacgg ttcaggcccg gctcttccag
cccaagctcc tgcataaact cgtcacgttc 300ttcgtcgtcc agttcggcaa tgtctgcttc
aacagcagca caaaccggaa ccacaacaga 360accttctttc gccgcgattt cacgcacctg
gtcaagatat gggttgtttt caaaaccgtc 420ttcgttgacg ttggcgatgt acattgttgg
tttcagcgtc aggaagctca ggtaacgaat 480agccgctttc tcttcagcgc ttaaatccag
cgcgcgcagc atacctgcat tttccaactg 540gggcaggcat ttttccagga ccgccagctc
agctttcgcg tctttatcgc cacctttggc 600tttcttctgt acgcgatgaa tcgcacgttc
gcaggtgtcg aggtctgcca gcgccagttc 660ggtgttgata acttcaatat cgtcagccgg
gttaactttg ccggaaacgt gaatgatgtt 720gtcattttca aagcagcgaa caacgtgacc
gatcgcttcg gtttcacgga tgttggtcag 780gaactggtta cccagacctt cgcctttcga
tgcgcctttt accagaccgg cgatatcgac 840aaattccatg gtcgtgggaa gcgtacgctg
cggttttacg atttcagcca gttgatccag 900gcgaggatca ggcattggta cgacgcctgt
gttcggctca atggtgcaga atggaaagtt 960ggccgcttca ataccggctt tggtcagcgc
gttgaacagg gtagatttcc cgacgttggg 1020caaaccgacg ataccgcatt tgaatcccat
10501391036DNAArtificial Sequencegene
editing homology arm pool_5_b1203_right 139ttattgcgct ttaaaggcgt
gcaatcggtt cgttgctttg gtcaagccat ctgtaaacca 60catttcagta caacgcgccg
cttcgtcaat ggcttcatca attaacttct gttcactaac 120aggcggtttg cctaacacaa
aaccgacaac tttattttta tcgcccggat gaccgattcc 180gatgcgtaaa cggtgaaagt
tagggttatt acccaattta ctgatgatgt ctttcagtcc 240attgtgacca ccatggccac
cgcccaattt aaatttggcg acgccaggag gcagatccag 300ttcgtcgtgg gccaccagaa
tttcgtccgg attaatgcgg aaaaaactgg ccatcgccgc 360aacggctttg ccgctgagat
tcataaatgt agtcgggact aacaggcgga catcttcgcc 420tccaagagtg actcgcgaag
tataaccaaa gaatttagcc tcttcgcgca gcggagcgcg 480caaacgctct gccagtaagt
caacgaacca ggcaccagca ttatgtcgcg ttgcggcgta 540ttcagcaccg gggttcgcca
ggccgacaat caatttaatc gtcacgtttt tttgtcctga 600gtgtgtacat aactggcgcg
tagtttactg gttgcggccc cgcttgacaa aaaactgcgt 660atcaaatgca gataacgtaa
taattgcctg agtggactat tagaaagtca aggtgttcag 720gcgtttattt gtaaagtttt
gttgaaataa gggttgtaat tgtgatcacg cccgcacata 780acccactggg tgttgtctat
actttacaca taaggaagag gggtattccc tgttacaacc 840cagaaagttc cggaggtgac
atatgaaacg caaaaacgct tcgttactcg gtaacgtgct 900catggggttg ggtctggtgg
taatggtggt cggcgtgggg tattcaatcc tcaaccagtt 960accacagttt aatatgcccc
agtatttcgc acatggtgca gtgctaagta ttttcgtcgg 1020tgccattctc tggctg
10361401050DNAArtificial
Sequencegene editing homology arm pool_5_b2231_left 140gatgtctttc
aggttcatga tcttcggctg accatggtgc aatgccacca tgttgatacc 60gaaagaaacc
tgcaactggg tctgggagta gaggttgttg agcacaactt caccgaccgc 120atcgcgtttc
acttcaatca cgatgcgcat accgtctttg tcagactcgt cacgcagcgc 180gctgatgcct
tccacgcgtt tttcttttac cagttccgca atcttctcga tcaggcgcgc 240tttgtttacc
tgatacggaa tttcgtggac gataatggtt tcacgaccgg ttttggcgtc 300aacttccact
tctgcgcgag cgcggatata caccttgccg cgaccggtac ggtaagcttc 360ttcaataccg
cgacgaccgt taatgattgc cgccgtcggg aagtccggcc ccgggatgtg 420ttccatcagc
ccttcaatgc tgatgtcttc atcatcaata tacgccagac aaccgttgat 480gacttccgtc
aggttgtgcg gcgggatgtt ggttgccata cctacggcga taccggaaga 540accgttcacc
agcaggttag gaattttggt tggcatgacg tccggaattt tttccgtgcc 600gtcatagtta
tcaacgaaat cgaccgtctc tttttcgaga tcggccatca gttcatgggc 660aattttcgcc
agacggattt ccgtataacg cattgccgcc gcagagtcgc cgtcgataga 720accgaagtta
ccctgaccgt ctaccagcat ataacgcagc gagaatggct gcgccatgcg 780gacgatcgtg
tcatagaccg ccgagtcacc atggggatgg tatttaccga ttacgtcacc 840aacgacacgg
gcagattttt tataggcttt gttccagtca ttgcctagta cgttcatggc 900gtaaagtacg
cgacggtgta ccggcttcag gccatctcgg acatctggca gcgcacggcc 960aacaatgacc
gacatcgcat aatccagata ggagctcttc agctcttcct caatgttgac 1020cggtgtaatt
tctctcgcaa ggtcgctcat
10501411048DNAArtificial Sequencegene editing homology arm
pool_5_b2231_right 141aatgcctgat atactcgttt gtcttgccaa ttacggagta
gaagtgccaa tgaatgccga 60aaaatcgccg gtaaaccata acgtagacca cgaagagatc
gctaaatttg aagccgtcgc 120ctcccgctgg tgggatctgg aaggtgagtt caaaccgctg
caccgcatta acccgctgcg 180tctgggctat attgccgagc gtgctggcgg tttatttggc
aaaaaggtgc tcgatgtcgg 240ttgtggcggc ggcattctgg ccgagagtat ggcgcgcgaa
ggcgcgacgg tgaccggtct 300ggatatgggc tttgagccat tgcaggtggc aaaactgcac
gcactggaaa gcggcattca 360ggtggattac gtgcaggaaa ccgtggaaga gcacgcggca
aaacatgccg ggcagtatga 420tgtggtgacc tgcatggaga tgctggagca cgtccccgat
ccgcagtcag tggtcagagc 480ctgtgcgcaa ctggtgaaac caggcggcga tgtctttttc
tcgacactta accgcaacgg 540caagtcatgg ctgatggcgg tggttggtgc ggaatatatt
ttgcgcatgg tgcccaaagg 600cacgcatgat gtgaagaagt ttattaaacc ggcagaattg
ctgggctggg tggatcagac 660cagtttaaaa gagcggcata tcactgggct gcattacaac
ccgatcacta atacttttaa 720actcggcccc ggcgtggatg tgaactatat gctgcacacg
cagaataagt gaggttgatg 780tttggccgcg ccaatgcctg atgcgacgct tgccgcgtct
tatcaggcct acaaatgctc 840cccgtaggcc ggataaggcg tttacgccgc atccggcaac
cgtgccgact agacagtgat 900taccatttca ccgtcatcga caaaaaccct gccgtctggg
caaaatcatc actccctttc 960tgccacgcca cgctgccgcg cagggacact cgctgactga
tattgcccgt gactcccact 1020tttatttcac cccgttgctt caccgcat
10481421050DNAArtificial Sequencegene editing
homology arm pool_5_b1622_left 142attatctcct cgctggtgat gggccttgtc
ggcctggtga ttccattagt ctggccgatt 60ttcgccatgg gtattagcgg cttgggccat
atgataaaca gcgcgggtga tttcggaccg 120atgctgtttg gtaccggtga acgtctgctg
ttgccgtttg gtctgcatca cattctggtg 180gcattaattc gctttaccga cgcaggcggc
acgcaggaag tctgcggtca aaccgtcagc 240ggcgcactga ccatcttcca ggcgcaattg
agttgcccga ccactcacgg tttttctgaa 300agcgccacgc gtttcctttc gcaaggtaaa
atgcctgcgt ttctcggcgg tctgccaggt 360gcagcgttag ctatgtatca ctgcgcgcgc
ccggaaaatc gccataaaat taaaggtctg 420ctgatttctg gcctgatcgc ctgcgtcgtt
ggcggcacta ccgaaccgct ggaattcctg 480ttcctgttcg tagcgccagt tctgtatgtc
atccacgcgc tgttaaccgg cctcggcttc 540accgtcatgt ctgtgctcgg cgtcaccatc
ggtaataccg acggcaatat catcgacttc 600gtggtgttcg gtattttgca tggtctgtca
accaagtggt acatggtgcc agtggtggcg 660gcaatctggt ttgtcgttta ctacgtcatc
ttccgtttcg ctatcacccg cttcaatctg 720aaaaccccgg ggcgcgatag cgaagttgcc
agctcaatcg aaaaagccgt tgccggtgcg 780ccgggtaaat caggttacaa cgttcctgca
atcctcgaag cattaggcgg tgccgacaat 840attgtcagcc tcgataactg cattacccgt
ctgcgtttgt ctgtgaaaga tatgtcgctt 900gttaatgtgc aggcactgaa ggacaatcgg
gcaattggcg tagtacaact taatcaacat 960aacctgcagg ttgttatcgg gccacaagtt
cagtcagtaa aagatgaaat ggccggtctg 1020atgcatactg tccaggcata aggataagat
10501431050DNAArtificial Sequencegene
editing homology arm pool_5_b1622_right 143atgttcgatt tttcaaaggt
cgtggatcgt catggcacat ggtgtacaca gtgggattat 60gtcgctgacc gtttcggcac
tgctgacctg ttaccgttca cgatttcaga catggatttt 120gccactgccc cctgcattat
cgaggcgctg aatcagcgcc tgatgcacgg cgtatttggc 180tacagccgct ggaaaaacga
tgagtttctc gcggctattg cccactggtt ttccacccag 240cattacaccg ccatcgattc
tcagacggtg gtgtatggcc cttctgtcat ctatatggtt 300tcagaactga ttcgtcagtg
gtctgaaaca ggtgaaggcg tggtgatcca cacacccgcc 360tatgacgcat tttacaaggc
cattgaaggt aaccagcgca cagtaatgcc cgttgcttta 420gagaagcagg ctgatggttg
gttttgcgat atgggcaagt tggaagccgt gttggcgaaa 480ccagaatgta aaattatgct
cctgtgtagc ccacagaatc ctaccgggaa agtgtggacg 540tgcgatgagc tggagatcat
ggctgacctg tgcgagcgtc atggtgtgcg ggttatttcc 600gatgaaatcc atatggatat
ggtttggggc gagcagccgc atattccctg gagtaatgtg 660gctcgcggag actgggcgtt
gctaacgtcg ggctcgaaaa gtttcaatat tcccgccctg 720accggtgctt acgggattat
agaaaatagc agtagccgcg atgcctattt atcggcactg 780aaaggccgtg atgggctttc
ttccccttcg gtactggcgt taactgccca tatcgccgcc 840tatcagcaag gcgcgccgtg
gctggatgcc ttacgcatct atctgaaaga taacctgacg 900tatatcgcag ataaaatgaa
cgccgcgttt cctgaactca actggcagat cccacaatcc 960acttatctgg catggcttga
tttacgtccg ttgaatattg acgacaacgc gttgcaaaaa 1020gcacttatcg aacaagaaaa
agtcgcgatc 10501441050DNAArtificial
Sequencegene editing homology arm pool_6_b1857_left 144agcgacggcc
agagtaagaa cggtgagcga ccccaacata acgcggtgtg gtcgcggtaa 60attattaaac
gccagggcga cagagcgggc tatctgttgc acgtaatcac ttcctcatta 120atctcctttc
aggcagctcg catactggtt ggctaattga ctcaggaatt ctgaatagct 180tgttttaccc
agtttgatat tcgtccccag gggatccaac gttcccatac gaacggatgt 240ccctcgtgcg
acgctctcaa cgaccgctgg cctgaactgt ggctcagcaa aaacgcaggt 300tgctttttgc
tcaaccaact gtgttcttat ttcatgtaaa cgctgcgcgc caggttgaat 360ctcagggtta
acggtaaaat gaccaagcgg tgtcagtcca aactgttttt cgaaatagcc 420gtaagcatcg
tgaaaaacga aataaccttt ccccttgagc ggcgcgagct cgttaccaac 480ctgcgtttcg
gttgaggcta attgtgcctc aaaatccttc aggttggcgt caagtttggc 540tcgactttgc
ggcataagtt ccactaattt tccatggatt gcaaccgctg tagcccgcgc 600tatctctggg
gaaagccaaa gatgcatgtt gaaatcgccg tgatggtgat cttcgtcact 660tttttccgcg
tggtcgtgat catcatcatc gccgtgaata cttttcatca gtagcggttt 720cacatcttca
agctgcgcaa tcgttacctg cttcgctcct ggtaatttgc ttaccggttt 780ttgcataaac
gcttccatct ccgggccaac ccaaacgact aagtccgcgt tctgtaagcg 840ttttacatcc
gatgggcgca gtgaataatc atgttctgaa gcgccgtcag gaagtaaaac 900ctctgtttct
gttaccccat cagcaatggc agaagcgatg aacccaacgg gtttaagcga 960agcgacaacg
gcggcatctg cggcctgtgt tgcacctccc cagagagcgg cggataatgc 1020tgcgaaaaga
agcgtttttt tatgtaacat
10501451031DNAArtificial Sequencegene editing homology arm
pool_6_b1857_right 145aatgcgacca ataatcgtaa tgaatatgag aagtgtgata
ttataacatt tcatgactac 60tgcaagacta aaattaacat gacaagtctg gtttccctgg
aaaatgtctc ggtttctttt 120ggccaacgcc gcgtcctctc tgatgtgtcg ctggaactta
aacctggaaa aattttgact 180ttacttgggc caaatggcgc aggtaagtcg acactggtac
gggtagtgct cgggctggta 240acacccgatg aaggggttat caagcgcaac ggaaaactgc
gcatcggcta tgtaccgcag 300aagctgtatc tcgacaccac gttgccactg accgtaaacc
gttttttacg cttacgccct 360ggtacacata aagaagatat tttgcctgca ctgaaacgtg
tccaggccgg gcacctgatt 420aacgcaccga tgcaaaagct ctctggtggc gaaacgcagc
gtgtactatt agcgcgagca 480ttgttaaatc gcccgcaatt attagtgctg gatgaaccca
ctcaaggcgt ggatgtaaat 540ggccaggtgg cgttatatga ccttattgac caactgcgtc
gtgaactgga ttgtggcgtt 600ttaatggttt ctcacgatct gcatctggtg atggcaaaaa
ccgatgaagt gctgtgcctg 660aatcaccaca tttgttgttc cggcacaccg gaagttgttt
ccctgcatcc ggagtttatt 720tcaatgtttg gtcctcgtgg tgctgaacaa ctgggtatct
atcgccatca tcataaccat 780cgtcacgatt tacagggacg aattgttttg cgtcggggaa
atgatcgctc atgattgaat 840tattatttcc cggttggtta gccgggatca tgctcgcctg
tgccgcgggt ccgctgggtt 900cgtttgtagt ctggcgtcgt atgtcttatt tcggtgatac
gctggctcat gcctcattac 960ttggcgtcgc gtttggtttg ttgctggacg tgaatccatt
ctatgcggtg attgccgtta 1020cgctgctgct g
10311461050DNAArtificial Sequencegene editing
homology arm pool_6_b4024_left 146tgccacgctc acttctgacg tggtgattaa
gtctaccgaa atattatgcc gcgcgaggat 60gccgaaaact tccgcgagga aaccgcgaga
atgcagcata ttcaggctgt gcaaagtgag 120cagagtctga ttgcgacgaa gcgccagagc
gcggaacagc ggcggatttt cagttttatt 180gcacaccagc gtaccacctg cgcgtgggtc
tttgctggag ccgacaaaga ccgggatatc 240gctgcgtact gcgggtagca acgttgccgg
atgcagtact tttgcaccaa aagttgccat 300ctctgccgct tcggcaaacg cgatttcatc
aatgcgtttt gctgcggaaa ctacgcgtgg 360atcggtggtg tagatgcccg ggacgtcggt
ccagatatca acacgagatg cgtgtaaagc 420ctccgccagc aaggctgccg tataatcgct
gcctccacgg ccaagcgtcg ttgtacgacc 480tttattttcg ctaccgataa atccctgggt
gatcactaag ccttcattga gacgtgggag 540cagctgcagc gcggccagtt ccgccagcgc
ggctatatct ggctctgcac gaccaaatcg 600gtcgttggta cgcatcactt tacgtacatc
aaaccactgt gcctgaacat cgcgttcgcg 660caggatctca acaaacagca gggtcgacat
cagctcgccg tggctgacca gctcatctgt 720cagcgccgga gacgttgcca gcgccgccgc
ttctgccaga acagtaatgt tctccagcag 780acgttcaatc tcttcacgga taacgttcgg
gtaacgcaga cgttccagaa tggcaaactg 840gatgttgcgg atagcgtcga gtttttcgaa
tcgctcgcca ggttccagtc cttcagctaa 900agcgaccagc agattagtga taccagcaga
agccgagagg acaactaaac gcacgttggc 960atcagaaagc acaatatcag cgctgcggtt
catggcgtca aaatcagcta cgctggtacc 1020gccaaatttg gagacaacaa tttcagacat
10501471050DNAArtificial Sequencegene
editing homology arm pool_6_b4024_right 147aactacctcg tgtcagggga
tccattttca gccttggcac aagggaagag cggaagacgg 60gtgggcgcag agcgatactt
cgctactatt ttcacccaga agtgctccac cacttgcgaa 120acgcccgact gcgaacgctt
ctggtgacaa cccaggggat tcagcccctg tagccgatga 180tgaacgtggc cagccgttca
atcacctcgg cgatgcaccc cctcaggtgt tatcacagga 240ctggctcctc caacaccgtt
acttgggcaa cgcgcctctt ctggcctgcg ctagcgcagg 300tagtacattt ataaataaag
ggtgagcggg gcggttgtca acgatggggt catgcggatt 360tttcatccac tcctggcggt
cagtagttca gctaataaat gcttcactgc gctaagggtt 420tacactcaac attacgctaa
cggcactaaa accatcacat ttttctgtga ctggcgctac 480aatcttccaa agtcacaatt
ctcaaaatca gaagagtatt gctaatgaaa aacatcaatc 540caacgcagac cgctgcctgg
caggcactac agaaacactt cgatgaaatg aaagacgtta 600cgatcgccga tctttttgct
aaagacggcg atcgtttttc taagttctcc gcaaccttcg 660acgatcagat gctggtggat
tactccaaaa accgcatcac tgaagagacg ctggcgaaat 720tacaggatct ggcgaaagag
tgcgatctgg cgggcgcgat taagtcgatg ttctctggcg 780agaagatcaa ccgcactgaa
aaccgcgccg tgctgcacgt agcgctgcgt aaccgtagca 840ataccccgat tttggttgat
ggcaaagacg taatgccgga agtcaacgcg gtgctggaga 900agatgaaaac cttctcagaa
gcgattattt ccggtgagtg gaaaggttat accggcaaag 960caatcactga cgtagtgaac
atcgggatcg gcggttctga cctcggccca tacatggtga 1020ccgaagctct gcgtccgtac
aaaaaccacc 10501481029DNAArtificial
Sequencegene editing homology arm pool_6_b3942_left 148gctgcgggtc
tgtacccact catacttgaa caggttctcg aagaaatagt tgctccactg 60ggtcggcgtc
tgggtccaga ctacttccag accagaggta atggcatctg cgccaacgcc 120gctgccgtaa
gtgctcgccc aacctaaacc ttgttcttca atcggtgcag cttctggatc 180aggacctaca
tttgatgtcg gaccggcacc gtgggtttta cccagcgtat gaccacccgc 240aatcagcgcc
acggtttctt cgtcgttcat gcccatgttg ccgaaggtcg cgcggatagc 300tgctgccgca
gaaagcggtt cgccgctgtg atccgggcct tccgggttaa cgtaaatcag 360acccatctcg
gttgcaccca gcggtgcttt cgccagcgct tccggatgac ggtgagtcag 420ccaggctttt
tcatcacccc agttaacatc cagatccggt tcccagacgt cttcacgacc 480ggcaccaaaa
ccgaaggtac ggaagccgga gttttctagc gccacgttac ccgcgaggat 540aaacaggtcg
gcccaggaga ttttctgacc atatttctgt ttgattggcc acaacaggcg 600acgcgcttta
tcgaggctta cgttatccgg ccaggagttc agcggtgcaa aacgttgctg 660accacgaccc
gcgccaccgc gtccatcgat tgaacggtaa gtccccgcgc cgtgccaggc 720catacgaata
aacagaccgg cgtaactgcc ccagtcggct ggccaccacg gttgagattc 780tgtcaacagg
gctttcagat cttttttcag gccgtagtaa tctaatttgc tgaattcttt 840gcggtagtca
aagtcctcac ccagtgggtt agaacgatta gaatgttggt ttaacaggtc 900aacacgaagt
tgatttggcc accagtcgcg agtggttgtg cccgcccccg cactctggtc 960gtgaccgccc
tgatggaacg ggcatttgcc agtggctgtg gtgttatgga tatcgtctga 1020cgtgctcat
10291491045DNAArtificial Sequencegene editing homology arm
pool_6_b3942_right 149agttgagatc ttacatattg ttggttaaag agatgtagat
caaattgatc ttaattaagc 60ttttttgatt atgtccgaat tcggacaaat gctttataaa
aaagggtttt agcttcatat 120ccttcaagta atgagggaga aaacaaggct gataacctta
ttttcccgcc ctcatttcga 180ggcagcattt tgtgctctgt ttaaaatttg tgatcactgt
gtgattttca caaaagccac 240actatttata aaccaggtcg aacccccagc gtatggcaaa
tcgcgtaact catttcagca 300cggttaagcg tatagaagtg gaaatctttc actccttcac
ggcttaaaat cttcaccata 360tccatggcaa tattcgcgcc aaccagtttg cgggtttcgg
catcatcatc cagaccgtcg 420aacatttgcg ccatccacgc cggaatacgc acgttggtca
tatcggcaaa tttcttcgcc 480tgtttaaagt tagataccgg caaaattccc ggaataattt
ccacatcaat gcccgccgat 540acacagcggt cacgaaaacg caggtagctt tcgacatcga
agaagaactg agtaatcgcg 600cggttggctc cggcatccac tttgcgtttc agattaagca
aatccgcctg agcgcttttt 660gcttccgggt gaacttccgg atacgccgcc acggagatat
cgaaatctgc cacttctttt 720aacagcgtca ccaggtcaga agcatacatt tctggcttac
cacttcccgg cggcagatcg 780ccacgcagcg ccacgatatg acgaataccg ttattccagt
agtcgcgtgc aatggtgcgc 840agctcgtcgg gcgtcgcatc aatgcaagta agatgcggtg
ccgcttccag accagtgcga 900tctttaatgc ctttaataat gctgtgcgta cggtcgcgct
cgccggagtt cgcgccatag 960gtcaccgata caaacttcgg tttcaggctg ctaaggcgat
cgatggagtt ccacagggtc 1020tgctccattt cactggtacg cggcg
10451501050DNAArtificial Sequencegene editing
homology arm pool_6_b0592_left 150ttgattatcg gcgtgttgtt attgctggtg
ctggtggagt tgcgacattt tcgccagacg 60ccgccgcagg tgacagcgtc cgacagttaa
tgcttaaaac agcgccttaa gcctatccag 120cacttgcatg gcgctgtagt aatccagacg
gaacgtctcg gttcccagcg cataaacctg 180cttgttttgt actgcaggca ggtgcgcgag
cagcggatta gcataaatag catcggcatc 240tttctgatca ccggcgaaca ggaatagtga
ctcgccattt aaccctgcag ccagattttc 300cccaccaagc tgaatgatgt catggcgttt
accctgactt tggctggcat ttaaccctgc 360gggtaacttc gccagcgtaa agccgagttg
ttccagcatc tgcccttgtg ctgattctgg 420cgtccagaga ttggcactgt gtgcagcggc
agtatagaca atggcagtga ccggctgcgg 480cggtaatttg atttgctctt tcgccgccgc
cagttgctta tcaaactgcg caatccgctc 540tgccgcttgt ttctcatgcc cggtaatttc
gccaagttgc gttaacagcg actgccagct 600tttgtcgtcg taattgatga ttaatgtcgg
ggcgatggtg gaaagctgat catacagtgc 660cagcgccgaa tccccgccgg ttgcgctaat
taaaatcaga tccggcattt gcgcggcaac 720ggcttcggcg ctcggttcgc cgatatagag
ccgttgcagt ttgcgttctt tcgccacctt 780gctccactgg cgtaaaaagc cctggtcatc
cgcgacgcgg ttattcggcg tggtcgcgcc 840gctggcgatc accggagcat caatcgccag
cagtgagccg gtcagggtga cgctggtgga 900aacaatacgc tgcggctggc tttccagtgt
atgtgtgcca cggctgtcag taatctgacg 960cggccagtca gcggcctgaa ctgcggctat
tcctgaaagc aaaagtcctg ttaatagaag 1020ggcgttgcgg tagagcgggg cgagtctcac
10501511050DNAArtificial Sequencegene
editing homology arm pool_6_b0592_right 151aaatcagctt cctgttatta
ataaggttaa gggcgtaatg acaaattcga caaagcgcac 60aatccgtccc ctcgcccctt
tggggagagg gttagggtga ggggaacagc cagcactggt 120gcgaacatta accctcaccc
cagccctcac cctggaaggg agagggggca gaacggcgca 180ggacatcaca ttgcgcttat
gcgaatccat caataatgct tctcattttc attgtaacca 240caaccagatg caaccccgag
ttgcagattg cgttacctca agagttgaca tagtgcgcgt 300ttgcttttag gttagcgacc
gaaaatataa atgataatca ttattaaagc ctttatcatt 360ttgtggagga tgatatggat
acgtcactgg ctgaggaagt acagcagacc atggcaacac 420ttgcgcccaa tcgctttttc
tttatgtcgc cgtaccgcag ttttacgacg tcaggatgtt 480tcgcccgctt cgatgaaccg
gctgtgaacg gggattcgcc cgacagtccc ttccagcaaa 540aactcgccgc gctgtttgcc
gatgccaaag cgcagggcat caaaaatccg gtgatggtcg 600gggcgattcc cttcgatcca
cgtcagcctt cgtcgctgta tattcctgaa tcctggcagt 660cgttctcccg tcaggaaaaa
caagcttccg cacgccgttt cacccgcagc cagtcgctga 720atgtggtgga acgccaggca
attccggagc aaaccacgtt tgaacagatg gttgcccgcg 780ccgccgcact taccgccacg
ccgcaggtcg acaaagtggt gttgtcacgg ttgattgata 840tcaccactga cgccgccatt
gatagtggcg tattgctgga acggttgatt gcgcaaaacc 900cggttagtta caacttccat
gttccgctgg ctgatggtgg cgtcctgctg ggggccagcc 960cggaactgct gctacgtaaa
gacggcgagc gttttagctc cattccgtta gccggttccg 1020cgcgtcgtca gccggatgaa
gtgctcgatc 10501521024DNAArtificial
Sequencegene editing homology arm pool_6_b1415_left 152gcatttcctt
aaaagatatg tcaggcttgc ggagtggcgg ttaaggacat acgatttcct 60cctttcagag
tgctccgctt ctcactatta tctcacgcag tattcttaag ggaacgataa 120ggaggaacca
tgaacattac cccgtttccg acgctttcgc cggcaactat agatgccata 180aatgttatcg
gacagtggct ggcgcaggat gatttctccg gtgaggtgcc gtatcaggcc 240gattgcgtga
tccttgcagg caatgcggtt atgccgacta tcgatgcggc atgtaagatt 300gcccgcgatc
agcaaattcc tttactgatt agtggtggta tcggtcactc gacaactttt 360ttgtatagcg
ccatcgcaca gcatccgcac tacaacacta tccgcaccac tggcagagca 420gaagcgacca
tcctggcgga tatcgctcat cagttctggc acattccgca tgaaaaaatc 480tggattgaag
accagtcaac aaactgcggt gaaaacgcac gctttagcat cgcgctattg 540aatcaggccg
tagaacgagt tcatacggct atcgttgttc aggaccccac catgcagcgg 600cgcacgatgg
cgacgttccg ccgtatgact ggggacaatc ccgatgcacc acgctggtta 660agttatcccg
gattcgttcc tcagttagga aataacgcag acagtgtaat ctttattaat 720cagttacaag
gattatggcc agttgagcgt tatctctcac tactcactgg cgagctgccg 780cgtttacgcg
atgatagcga tggctacggt ccccgcgggc gagattttat cgttcacgtt 840gattttccgg
cagaagtcat ccatgcatgg caaacgctga aacatgatgc ggtgctcatc 900gaggcgatgg
aaagtcgctc gttacgttaa aaattgcccg tttgtgaacc acttgtttgc 960aaacgggcat
gactcctgac ttttatttct gccttttatt ccttttacac ttgtttttat 1020gaag
10241531050DNAArtificial Sequencegene editing homology arm
pool_6_b1415_right 153atgtcagtac ccgttcaaca tcctatgtat atcgatggac
agtttgttac ctggcgtgga 60gacgcatgga ttgatgtggt aaaccctgct acagaggctg
tcatttcccg catacccgat 120ggtcaggccg aggatgcccg taaggcaatc gatgcagcag
aacgtgcaca accagaatgg 180gaagcgttgc ctgctattga acgcgccagt tggttgcgca
aaatctccgc cgggatccgc 240gaacgcgcca gtgaaatcag tgcgctgatt gttgaagaag
ggggcaagat ccagcagctg 300gctgaagtcg aagtggcttt tactgccgac tatatcgatt
acatggcgga gtgggcacgg 360cgttacgagg gcgagattat tcaaagcgat cgtccaggag
aaaatattct tttgtttaaa 420cgtgcgcttg gtgtgactac cggcattctg ccgtggaact
tcccgttctt cctcattgcc 480cgcaaaatgg ctcccgctct tttgaccggt aataccatcg
tcattaaacc tagtgaattt 540acgccaaaca atgcgattgc attcgccaaa atcgtcgatg
aaataggcct tccgcgcggc 600gtgtttaacc ttgtactggg gcgtggtgaa accgttgggc
aagaactggc gggtaaccca 660aaggtcgcaa tggtcagtat gacaggcagc gtctctgcag
gtgagaagat catggcgact 720gcggcgaaaa acatcaccaa agtgtgtctg gaattggggg
gtaaagcacc agctatcgta 780atggacgatg ccgatcttga actggcagtc aaagccatcg
ttgattcacg cgtcattaat 840agtgggcaag tgtgtaactg tgcagaacgt gtttatgtac
agaaaggcat ttatgatcag 900ttcgtcaatc ggctgggtga agcgatgcag gcggttcaat
ttggtaaccc cgctgaacgc 960aacgacattg cgatggggcc gttgattaac gccgcggcgc
tggaaagggt cgagcaaaaa 1020gtggcgcgcg cagtagaaga aggggcgaga
10501541048DNAArtificial Sequencegene editing
homology arm pool_6_b1762_left 154caggcgttta tgagtattta acggatgatg
ctccccacgg aacatttctt atgggccaac 60ggcatttctt actgtagtgc tcccaaaact
gcttgtcgta acgataacac gcttcaagtt 120cagcatccgt taactttctg cgatagcagc
agatatgcca gtaaagaaat cccatttgac 180tatttttttg ataatcttct tcgctttcga
acaactcgtg cgcctttcga gaagcaagca 240ttatataatg ccaggccagt tcttcttcaa
ttgtcccgtt ttgaaaagct gtgcttgata 300tcgagatcat ccatgataat tccgccgccc
atattagctt cgccgaggat ttaccggagc 360tatgattagc gcaatcagag atatagtctg
agggaaaaac agcaaattta ttcaacaagg 420cgataacctg ctctggggct tcctccatgt
ttgctttaaa ggtattggct ccatggtcgc 480cagaaagaaa atgctccatt aaggcacaat
aactttcgct atcttcgata ccccattgat 540cctctaaaga ctcgcgtctt ttacttatga
tatcgatcga gtcaaaagga agcacatgat 600attggaaggt atctttgcca ggttcaggct
ttcgcggcca gaactccagc gtttcagacc 660attgcttatg atagaatcga taaggtgcga
tcaattgtag cgcctgtaac ttctcgatac 720tgagcggctc aataccttta gcctgataat
aatgcagttg ttcttttttt gctttaaaac 780cggcccgaac aataagcccc atcataatta
atagataaag aaaagagcat cccgcggtaa 840tcaggcctct ttcattcaaa ccgttggatg
ttatcgctgc gaacacaaac attacagcga 900caacacatgt taaataaaac ccccacttac
aaagcagcat ggccttattt tctttaatca 960tccgttcaaa attactatta aatatttccc
agccattaaa agaatacttc tcgctcccag 1020gatggttttg taataaaact tttttcat
10481551050DNAArtificial Sequencegene
editing homology arm pool_6_b1762_right 155tcagaatatg actcgaatag
cacgaaagat tcactcgctt acgctatcgc cccgcttccg 60acttcatctg ctggcggact
ttttttcgca ctacgtttac gcggtgcagc ctttttctta 120tcagcactgc caccactgcc
cggagccaca atgccgcgaa actgccgcac cggcgtacgt 180ttggcttgat caataagctg
atatagcgtc cccaccagcg gctgcataaa gtcctgatag 240cgacactgct tttcgctgat
ttgcgtcagc accgattccc agtgcgcggt catgtccggt 300cgcgtcgcca tctccggcag
cgaatggaat agcgcttttc cggcgtcggt ggagtggata 360tagcgccctt ttttggtcag
gaaaccacgc ttgaacaaca gttcaataat cccggcacgc 420gttgcctctg tccccagacc
atcggtcgca cggaggatct ttttcagatc tttatcctgc 480acaaagcgcg cgatcccggt
catcgccgaa agcagtgttg catcggtaaa atggcgcggc 540ggctgggttt gccgctctac
cacttcacct ttttcacaca gcaactcatc gcctttcgcc 600accacaggca gtggcgtgcc
gtcgttttct tcatcgcgct ctttgctgcc taacagcgtg 660cgccagcctg cttcagcaag
aaaacgcgct ttagcgacaa atttgccttt ggcaatgtcc 720agttcgataa cacacttgcg
gaacaccgca tccgggcaga attgcatcag atactgacgg 780gcaatcaggt tatagacctt
cgcttcgttc tccgtcaggt tgatcgcaga actccgtgcg 840gtcggaatga tggcgtggtg
cgcatcgacc tttttgtcat cccaacagcg gttgcgtata 900tctggatcta ccactggctg
cggcaacaga tccggtgcat gaacactgat ggcattcatc 960accgcgtggc gtccggcaaa
atgttcttct ggcaaatagc gacaatcaga acgcggataa 1020gtgattagct tgtgcgtttc
gtacagtttc 10501561050DNAArtificial
Sequencegene editing homology arm pool_6_b3414_left 156gcattaatcc
aacagcaagc gccacgagga cgagagcgat gaagccgttc attttgaagc 60ggatcatcag
gagcaacaac aagattacac cgatagcaac aatgactaat ggcatgattt 120acctggcctt
tcatttgtta tgggtaacgt caattttctg acgacaaact ctaattatcc 180caatcgggaa
cagagatatt gcggcaccac gactgatacc cactaaaact aattattgta 240gtcagatgtc
aggagtatgt ttggtaccca tgtgaatgat acgggtaaca tctggcgttt 300gagaatcacc
agagcggggt aaatttaaat tatgagaggt tggtcatatt atcgcgggga 360aacgaaccga
ggatttgaca aagcaatgct gcgccaacgt ctggcacatg ttcaacgtag 420gcccgaaatg
acgctttagc gtcgcatcgg gcaatctaca aaagagggga taacttagta 480gtaggagtgt
tcgccgcgct ggtgttcggt gagatcgcgc acacctttca gctccgggaa 540ttcgttcagc
agctgcttct cgatcccttc tttcagcgtc acatcgacca tggaacaacc 600gttacagccg
ccgccaaatt gcagaatggc gtaaccgtct tcggtgattt ccatcagcga 660aacgcgacca
ccgtgaccag caagctgtgg gttgatctgc gactgcagca tatactccac 720gcgctccatc
agcggtgcat cgtctgccac tttacgcatt ttggcgttcg gggctttcag 780cgttaactgg
gaacccaact ggtcggtaac aaaatcgatc tctgcatctt ccaggtatgg 840tgcgcttaac
tcatcaacat acgcggtcag caggtcaaat ttcagggctg tgtcggtggc 900ttccacagcg
tccggcggac aataagaaac gccacattca gcgttaggcg tgccagggtt 960aatcacaaat
acgcggattt gtgtcccttc ttcctgattt gccagcagtt tggcaaagtg 1020cgcttgtgca
gcatcggaaa tacggatcat
10501571032DNAArtificial Sequencegene editing homology arm
pool_6_b3414_right 157agtaatggcc taatagttga ctattttagt tggttataat
acgcccatca tcgaggctct 60acaaggttcg acaaaggcac cagacctgga cagccgccgc
accattgcgt aaaagcaact 120gcgcaatctc tgcgacggta cttccggtgg taacgacatc
atccacaatc accatatggc 180gaccttgcac gggcaattca agacgaaagg catttttcag
gttgcgcttg cgcagccggg 240cactgagaaa atgctgggtc gcagtggccc gtgtacgtgt
gacggcttcg ctatcccatt 300ggcagtgcaa ccagcgtgat aacggctgac acagcaaatc
gctctgatta aatccccgac 360gccagtgacg ccgctgccat aacggaacgc tgacgatgcg
atccggcaat tgcaacccgg 420tggtgcgacg agcgtgtaag acttccaata gtaacagacg
tgacagggcg ctggcgattt 480cactgcgccg ggaaaattta agctggtgga taagcggact
taacggcggc gcatagtcgg 540caaccgtgac cagtctttgc cagggcggcg gtttttgcag
gcagcgaccg cagggaagat 600gggagtgtgt ggcgggtaat ccacattgtg ggcataacgt
tttatctgtg cgggtggcgc 660gtgaacagac cgaacaaatc ccccaatgac ctaacgccag
tggcattcgg catagccagc 720ataatcccgg tactgttagc atatgttcat ccttgtaagt
caaaagagaa caatagcgga 780tgaataacat ctggtggcag accaaaggtc aggggaatgt
tcatcttgtg ctgctgcacg 840gatggggact gaatgccgaa gtgtggcgtt gcattgacga
ggaacttagc tcgcatttta 900cgctgcacct tgttgacctg cccggcttcg ggcgtagccg
gggatttggt gcgctgtcac 960ttgctgatat ggccgaagcc gtgctgcaac aggcacctga
taaagccatt tggttaggct 1020ggagtctggg cg
10321581049DNAArtificial Sequencegene editing
homology arm pool_6_b4374_left 158gcgggcaggt catcctgtaa gtctccggca
aacagaatac ggctttgttc gaaatcatca 60ctgtgacgca gcaagacttc acttgccggg
gtaaatgcag acatggaatg ctcctcaatt 120gatactggcg gcgattatag ccatatgttg
gcgcggtatc gacgaatttg ctatatttgc 180gcccctgaca acaggagcga ttcgctatga
catcccgacg agactggcag ttacagcaac 240tgggcattac ccagtggtcg ctgcgtcgcc
ctggcgcgtt gcagggggag attgccattg 300cgatcccggc acacgtccgt ctggtgatgg
tggcaaacga tcttcccgcc ctgactgatc 360ctttagtgag cgatgttctg cgcgcattaa
ccgtcagccc cgaccaggtg ctgcaactga 420cgccagaaaa aatcgcgatg ctgccgcaag
gcagtcactg caacagttgg cggttgggta 480ctgacgaacc gctatcactg gaaggcgctc
aggtggcatc accggcgctc accgatttac 540gggcaaaccc aacggcacgc gccgcgttat
ggcaacaaat ttgcacatat gaacacgatt 600tcttccctcg aaacgactga tttaccggcg
gcttaccaca ttgaacaacg cgcccacgcc 660tttccgtgga gtgaaaaaac gtttgccagc
aaccagggcg agcgttatct caactttcag 720ttaacgcaaa acggcaaaat ggcggcgttt
gcgattacgc aagtggtgct ggatgaagct 780acattgttca atattgcggt cgatcctgac
tatcagcgtc agggattggg aagggcgctg 840ctggaacatc tgatcgacga actggaaaaa
cgcggcgtgg cgacactatg gctggaagtc 900cgtgcttcaa acgctgccgc cattgccctg
tacgaaagtt taggctttaa cgaggcgacg 960attcgccgca attactaccc caccacggac
ggtcgcgaag acgccatcat catggcgttg 1020ccaatcagta tgtaatacaa ggtggaata
10491591037DNAArtificial Sequencegene
editing homology arm pool_6_b4374_right 159atgaagtggg actggatttt
ctttgatgcc gatgaaacgc tgtttacctt tgactcattc 60accggcctgc agcggatgtt
tcttgattac agcgtcacct ttaccgctga agattttcag 120gactatcagg ccgttaacaa
gccactgtgg gtggattatc aaaacggcgc gatcacttca 180ttacagcttc agcacgggcg
gtttgagagc tgggccgaac ggctgaacgt cgagccaggt 240aaactcaacg aagcctttat
taatgcgatg gcggaaatct gcacgccgct gccgggcgcg 300gtttctctgc ttaacgccat
tcgtggcaac gccaaaatcg gcatcatcac caacggcttt 360agtgccttgc aacaggtgcg
tctggaacgc acgggcctgc gtgattactt cgatttgctg 420gtgatttccg aagaagttgg
cgttgccaaa ccgaataaga aaattttcga ttatgcgctg 480gaacaggcgg gcaatcctga
ccgttcacgc gtgctgatgg ttggcgacac tgccgagtcc 540gatattctcg gtggcatcaa
cgccgggctt gcgacctgct ggctgaatgc acaccatcgc 600gagcaaccag aaggcatcgc
gcccacctgg accgtttctt cgttgcacga actggagcag 660ctcctgtgta aacactgatt
gcctcccccc cgttgatggg taaaatagcc gcaatttttc 720gttttcaaca agcgcggcgc
gatgccgctt actcaagaag aaagaattat gacgttgtct 780ccttatttgc aagaggtggc
gaagcgccgc acttttgcca ttatttctca cccggacgcc 840ggtaagacta ccatcaccga
gaaggtgctg ctgttcggac aggccattca gaccgccggt 900acagtaaaag gccgtggttc
caaccagcac gctaagtcgg actggatgga gatggaaaag 960cagcgtggga tctccattac
tacgtctgtg atgcagtttc cgtatcacga ttgcctggtt 1020aacctgctcg acacccc
10371601050DNAArtificial
Sequencegene editing homology arm pool_6_b2917_left 160acgtgaacga
ggatttgagc gcgcggcaca aaagctgtgc attacacaat cagccgtctc 60acagcgcatt
aagcaactgg aaaatatgtt cgggcagccg ctgttggtgc gtaccgtacc 120gccgcgcccg
acggaacaag ggcaaaaact gctggcactg ctgcgccagg tggagttgct 180ggaagaagag
tggctgggcg atgaacaaac cggttcgact ccgctgctgc tttcactggc 240ggtcaacgcc
gacagtctgg cgacgtggtt gcttcctgca ctggctcctg tgttggctga 300ttcgcctatc
cgcctcaact tgcaggtaga agatgaaacc cgcactcagg aacgtctgcg 360ccgcggcgaa
gtggtcggcg cggtgagtat tcaacatcag gcgctgccga gttgtcttgt 420cgataaactt
ggtgcgctcg actatctgtt cgtcagctca aaaccctttg ccgaaaaata 480tttccctaac
ggcgtaacgc gttcggcatt actgaaagcg ccagtggtcg cgtttgacca 540tcttgacgat
atgcaccagg cctttttgca gcaaaacttc gatctgcctc caggcagcgt 600gccctgccat
atcgttaatt cttcagaagc gttcgtacaa cttgctcgcc agggcaccac 660ctgctgtatg
atcccgcacc tgcaaatcga gaaagagctg gccagcggtg aactgattga 720cttaacgcct
gggctatttc aacgacggat gctctactgg caccgctttg ctcctgaaag 780ccgcatgatg
cgtaaagtca ctgatgcgtt actcgattat ggtcacaaag tccttcgtca 840ggattaatcc
atcaaataat gcctgatagc acatatcagg cgttgtcctc acttcttttt 900gtattccttg
aatcacatca caaaatagac aaatctcagg cggcaaaaaa cgacgtctga 960atgcattttt
tttgctggcg acaaacccac gtaaaaagct caccgtaggc gcaaataccc 1020tcattttgat
tgcgttttac ggagcaaata
10501611033DNAArtificial Sequencegene editing homology arm
pool_6_b2917_right 161atgtctaacg tgcaggagtg gcaacagctt gccaacaagg
aattgagccg tcgggagaaa 60actgtcgact cgctggttca tcaaaccgcg gaagggatcg
ccatcaagcc gctgtatacc 120gaagccgatc tcgataatct ggaggtgaca ggtacccttc
ctggtttgcc gccctacgtt 180cgtggcccgc gtgccactat gtataccgcc caaccgtgga
ccatccgtca gtatgctggt 240ttttcaacag caaaagagtc caacgctttt tatcgccgta
acctggccgc cgggcaaaaa 300ggtctttccg ttgcgtttga ccttgccacc caccgtggct
acgactccga taacccgcgc 360gtggcgggcg acgtcggcaa agcgggcgtc gctatcgaca
ccgtggaaga tatgaaagtc 420ctgttcgacc agatcccgct ggataaaatg tcggtttcga
tgaccatgaa tggcgcagtg 480ctaccagtac tggcgtttta tatcgtcgcc gcagaagagc
aaggtgttac acctgataaa 540ctgaccggca ccattcaaaa cgatattctc aaagagtacc
tctgccgcaa cacctatatt 600tacccaccaa aaccgtcaat gcgcattatc gccgacatca
tcgcctggtg ttccggcaac 660atgccgcgat ttaataccat cagtatcagc ggttaccaca
tgggtgaagc gggtgccaac 720tgcgtgcagc aggtagcatt tacgctcgct gatgggattg
agtacatcaa agcagcaatc 780tctgccggac tgaaaattga tgacttcgct cctcgcctgt
cgttcttctt cggcatcggc 840atggatctgt ttatgaacgt cgccatgttg cgtgcggcac
gttatttatg gagcgaagcg 900gtcagtggat ttggcgcaca ggacccgaaa tcactggcgc
tgcgtaccca ctgccagacc 960tcaggctgga gcctgactga acaggatccg tataacaacg
ttatccgcac caccattgaa 1020gcgctggctg cga
10331621047DNAArtificial Sequencegene editing
homology arm pool_6_b0346_left 162tcgtataacg ttactggttt cacattcacc
accctgaatt gactctcttc cgggcgctat 60catgccatac cgcgaaaggt tttgcgccat
tcgatggtgt caacgtaaat gcatgccgct 120tcgccttccg gccaccagaa tagcctgcga
ttcaacccct tcttcgatct gttttgctac 180ccgttgtagc gccggaagat gcttttccgc
tgcctgttca atggtcattg cgctcgccat 240atacaccaga ttcagacagc caatcacccg
ttgttcactg cgcagcggta cggcgataga 300ggcgatcttc tcctcctgat cccagccgcg
gtagttctgt ccgtaaccct ctttgcgcgc 360gcgcgccaga atggcttcca gctttaacgg
ttcccgtgcc agttgatagt catcaccggg 420gcgggaggct aacatttcga ttaattcctt
gcggtcttgt tccgggcaaa aggccagcca 480ggtcaggccc gaggcggttt tcagaagcgg
caaacgtcgc ccgaccattg cccggtgaaa 540ggataagcgg ctgaaacggt gagtggtttc
gcgtaccacc attgcatcaa catccagcgt 600ggacacatct gtcggccata ccacttcgcg
caacagatcg cccagcagtg gggccgccag 660tgcagaaatc cactgttcgt cacgaaatcc
ttcgcttaat tgccgcactt tgatggtcag 720tcgaaaacta tcatcggagg ggctacggcg
gacatatccc tcttcctgca gcgtctccag 780cagtcgccgc acagtggtgc gatgcaggcc
gctgagttcc gccagcagcc cgacgctggc 840accgccatca agtttattta acatatttaa
taacattaga ccgcgggtta agccgcgcac 900ggttttgtat tccgtctgct cattgttctg
catattaatt gacatttcta tagttaaaac 960aacgtggtgc acctggtgca cattcgggca
tgttttgatt gtagccgaaa acacccttcc 1020tatactgagc gcacaataaa aaatcat
10471631049DNAArtificial Sequencegene
editing homology arm pool_6_b0346_right 163cgtgaggtac tgaaatggca
atacaacacc ctgacatcca gcctgctgtt aaccatagcg 60ttcaggtggc gatcgctggt
gccggcccgg ttgggctgat gatggcgaac tatctcggcc 120agatgggcat tgacgtgctg
gtggtggaga aactcgataa gttgatcgac tacccgcgtg 180cgattggtat tgatgacgag
gcgctgcgca ccatgcagtc ggtcggcctg gtcgatgatg 240ttctgccgca cactacgccg
tggcacgcga tgcgttttct caccccgaaa ggccgctgtt 300ttgctgatat tcagccaatg
accgatgaat ttggctggcc gcgccgtaac gcctttattc 360agccgcaggt cgatgcggtg
atgctggaag gggtgtcgcg ttttccgaat gtgcgctgct 420tgttttcccg cgagctggag
gccttcagtc agcaagatga cgaagtgacc ttgcacctga 480aaacggcaga agggcagcgg
gaaatagtca aagcccagtg gctggtagcc tgtgacggtg 540gagcaagttt tgtccgtcgc
actctgaatg tgccgtttga aggtaaaact gcgccaaatc 600agtggattgt ggtagatatc
gccaacgatc cgttaagtac gccgcatatc tatttgtgtt 660gcgatccggt gcgcccgtat
gtttctgccg cgctgcctca tgcggtacgt cgctttgaat 720ttatggtgat gccgggagaa
accgaagagc agctgcgtga gccgcaaaat atgcgcaagc 780tgttaagcaa agtgctgcct
aatccggaca atgttgaatt gattcgccag cgtgtctaca 840cccacaacgc gcgactggcg
caacgtttcc gtattgatcg cgtactgctg gcgggcgatg 900ccgcgcacat catgccggta
tggcaggggc agggctataa cagtggtatg cgcgacgcct 960ttaacctcgc atggaaactg
gcgttggtta tccaggggaa agcccgcgat gcgctgctcg 1020atacctatca acaagaacgt
cgcgatcac 10491641050DNAArtificial
Sequencegene editing homology arm pool_6_b3966_left 164ctcaacataa
cctgtacccg gcgtcgtagt ctgtttctgc cagtcgacac ccgcaccaat 60actaccgtga
ccaacgatga cattgtttgc ccactggacg gtgtattgct tcatctcatc 120gagcgtcgcc
gacgaatcat aacgaccata atggggatcg tagttgtaat ctttgctatg 180gctatagctg
gtaatgagtt gtgatttaat cagttcgccg ttatagcgca gcccggcgtc 240ccaactttgg
ctatagagtt tacgggtatc gagcaacggt gaaccgggag aataatacgc 300gtcataattg
gtacggttat catagccata gccgcgcaca aagccgctcc aggcatcagt 360aaagttatgc
tccagcgcgc cataaagcgt tttacttaaa aaaccatcgt tatctgtctg 420cgcttgcgtt
ccggtattac cataggcaac aacatcataa ccatgagtat gggcataatc 480gcccaacagc
gttacccgtg tcttatcccc cagttgttgc tgcgtagaga catcatagtt 540ctgataacta
ttgcttcccc accctgctga aatttccgtt ccgggttcat cgcgcgtcgt 600gatgatattc
accaccccgc ctattgcatc ggaaccataa acagcggagc gcggcccacg 660gatatattca
acacgctgga caagcgcaat agggaactgg ctaaggtcgg cagaaccact 720cacccccgcc
agattcaggc gtacgccatc aattaacacc aacacatgac tggcatttgt 780accgcgaata
aaaatagatg agagctgacc tgaaccgccg ttttgggtga tatcgacgcc 840cggaagacgg
cgcagcacat cattgaccga ggtcgactgc cagcggtcga tatcctgacg 900ggtcacaacg
gtggttggtg caagcacagt gctgcgcggc tgttcaaaac ggttagcagt 960aacgacgaga
gtatccgggc tggtatcctg tgcccaagcg gaaaatgccg tgacggaaca 1020cgccgtcagc
agcgaagctt ttttaatcat
10501651050DNAArtificial Sequencegene editing homology arm
pool_6_b3966_right 165tgtaaagcat ccacaataga agaaggatgc cgcaggtttc
atcaatatta cgcgatgatg 60agaaccagat gcgacgttgg ccggcaggtc ttcgggcttg
gaggggtatc taagatacta 120agagatgatg acttcccacc gaatggcagt gtccgcataa
cgcaatcatc gcacctttcc 180ttaccgctgc gcgtcagctc cagattcgca ctggattccc
tattaactca caggaccggc 240aagtggatgc tacaggttgt aacaagttac tgtccagacg
tagctcacaa ataggaattc 300atcaagatct ggacatctga tgagcaatcc ctacaatcgc
cgcgtacttt aatttttcag 360gatacatcat gacccccgaa caccttccaa cagaacagta
tgaagcgcag ttagccgaaa 420aagtggtacg tttgcaaagt atgatggcac cgttttctga
cctggttccg gaagtgtttc 480gctcgccggt cagtcattac cggatgcgcg cggagttccg
catctggcac gatggcgatg 540acctgtatca catcattttc gatcaacaaa ccaaaagccg
catccgcgtg gatagcttcc 600ccgccgccag tgaacttatc aaccagttga tgacggcgat
gattgcgggt gtgcgtaata 660atcccgttct gcgccacaag ttgttccaga ttgattacct
cactacactg agtaatcagg 720cggtggtttc cctgctatac cataagaagc tggatgatga
gtggcgtcag gaagcggagg 780ccctgcgcga tgcactgcgc gcgcagaatc tgaatgtgca
tctgattggt cgggcaacga 840aaaccaaaat cgagctggat caggattaca tcgatgaacg
tctgccggtc gcagggaaag 900agatgatcta ccgtcaggta gaaaacagct ttacccagcc
gaacgcggcg atgaatattc 960agatgctgga atgggcgctg gacgtaacca aaggctcaaa
aggcgattta ctggagctgt 1020actgcggcaa cggtaacttt tcattagcgc
10501661050DNAArtificial Sequencegene editing
homology arm pool_6_b0406_left 166gtcgtttact gtcgctggac gggccgacgg
gcgcgctgac gcacggtact ttcaccgatt 60tacttgataa gctcaacccc ggcgatcttc
tggtttttaa taatacccgc gtgatcccgg 120cgcgcctgtt tgggcgtaaa gccagcggcg
gcaagattga agtgctggtt gaacggatgc 180tcgacgacaa acgcattctt gcgcatattc
gcgcctcgaa agcgccaaaa cctggcgcag 240aactgctgct gggcgatgac gaaagtatta
acgcaacaat gaccgcgcgc cacggcgcac 300tgtttgaagt cgaatttaat gatgaacgct
cggtgctgga tattctcaac agcatcggcc 360atatgccgct gccgccgtat atcgaccgtc
cggacgaaga cgctgaccgc gaactttatc 420aaaccgttta tagcgaaaaa ccgggcgcgg
ttgcagcccc gaccgcaggt ctgcattttg 480acgagccttt gctggaaaaa ttgcgcgcca
aaggcgtgga gatggcgttt gtgacgttgc 540acgttggtgc gggcaccttc cagccggtgc
gcgtcgacac cattgaagat cacatcatgc 600actcggaata cgctgaagta ccgcaggatg
tggtagacgc ggtactggcg gcgaaagcgc 660gcggtaaccg ggtgattgcg gttggcacca
cttcagtacg ttcgctggaa agcgcggctc 720aggcagcgaa aaacgatctc attgaaccgt
tcttcgacga tacccaaatc tttatctatc 780cgggcttcca gtacaaagtg gtcgatgcgc
tggtgacgaa cttccacttg ccagagtcga 840cgctgattat gctggtttcg gcctttgccg
gttatcaaca caccatgaac gcctataaag 900cagcggtaga agagaaatat cgctttttta
gttacggtga tgcgatgttt atcacgtaca 960atccgcaggc aattaatgag cgcgtcgggg
agtaattccg cggcgctggt ttaaaacgtt 1020ggactgtttt tctgacgtag tggagaaaaa
10501671050DNAArtificial Sequencegene
editing homology arm pool_6_b0406_right 167atgaaatttg aactggacac
caccgacggt cgcgcacgcc gtggccgcct ggtctttgat 60cgtggcgtag tggaaacgcc
ttgttttatg cctgttggca cctacggcac cgtaaaaggg 120atgacgccgg aagaagttga
agccactggc gcgcaaatta tcctcggcaa caccttccac 180ctgtggctgc gcccgggcca
ggaaatcatg aaactgcacg gcgatctgca cgattttatg 240cagtggaagg ggccgatcct
caccgactcc ggcggcttcc aggtcttcag ccttggcgat 300attcgtaaaa tcaccgaaca
gggcgtgcac ttccgtaacc cgatcaacgg cgatccgatt 360ttcctcgatc ctgaaaaatc
aatggagatt cagtacgatc ttggttcgga tatcgtcatg 420atctttgatg agtgtacgcc
gtatcctgct gactgggatt acgcaaaacg ctccatggag 480atgtctctgc gttgggcgaa
gcgtagccgt gagcgttttg acagtctcgg aaacaaaaat 540gcgctgtttg gtatcatcca
gggcagcgtt tacgaagatt tacgtgatat ttctgttaaa 600ggtctggtag atatcggttt
tgatggctac gctgtcggcg gtctggctgt gggtgagccg 660aaagcagata tgcaccgcat
tctggagcat gtatgcccgc aaattccggc agacaaaccg 720cgttacctga tgggcgttgg
taaaccagaa gacctggttg aaggcgtacg tcgtggtatc 780gatatgtttg actgcgtaat
gccaacccgc aacgcccgaa atggtcattt gttcgtgacc 840gatggcgtgg tgaaaatccg
caatgcgaag tataagagcg atactggccc actcgatcct 900gagtgtgatt gctacacctg
tcgcaattat tcacgcgctt acttgcatca tcttgaccgt 960tgcaacgaaa tattaggcgc
gcgactcaac accattcata accttcgtta ctaccagcgt 1020ttgatggcgg gtttacgcaa
ggctattgaa 10501681040DNAArtificial
Sequencegene editing homology arm pool_6_b0652_left 168ccgcctgcta
ccggaatatc ggtgcgatta agcaaggtca gcatacgcag aacattgcgt 60aaggtttttt
ctggtgtctg gtttccggcg gaagacgtaa ttgctttgac atcaagctct 120ggtgaggcga
gggcgagaac tattgcgata gcgtcgtcat gacctgggtc gcaatctaac 180agaattggca
gtgccattgt tgctccttgt tgtgtgcttc tttgcgacaa gggtaacgcc 240aggatgtaac
agatacgagg ggcgaaacga taaagcgtga gatggcgcgc aattgggtat 300gcgcgccaga
gtgattaatg caggattttc gcgaggaagt cttttgcgcg gtccgatttc 360ggatcatcga
agaaagcgtc tttcggcgag tcttcgacaa ttttaccctc gtccataaag 420atcacccgat
tcgccacttt acgggcaaag cccatttcgt gggtcaccac catcatggtc 480attccttcgt
tcgccagttc caccatcacg tccagtactt cgttgatcat ctccggatcc 540agcgccgatg
tcggttcgtc aaacagcatc gcaataggat ccatacacaa cgcgcgagcg 600attgccacac
gctgctgctg accgccggaa agctgcgccg gaaacttatt ggcgtgagca 660gaaagcccga
cacgctccag cagtttcagg gctttttcac gagccggcgc tttatcgcgt 720ttaagcactt
tcacctgcgc cagggtcagg ttttcgataa tcgacagatg agggaacagc 780tcgaaatgct
ggaataccat cccgacgcgg gaacgcagct ttgccagatc ggttttcttg 840tcgttaacca
cgataccatc gacggtgatt tcaccttgct gcaccggttc gaggccgttg 900acggttttaa
tcagcgttga tttgccggaa ccagacgggc cgcaaaccac caccacttcg 960ccttttttca
cttcggttga gcagtcggtc agcacctgaa agtgaccata ccattttgaa 1020acatttttca
gggtaatcat
10401691041DNAArtificial Sequencegene editing homology arm
pool_6_b0652_right 169tatgctgtcc ttcttttcaa gtagctgacc aacaacgacg
cgctaagact aataacgaaa 60taaacaaatc cggcaaacag gatcatctca acctgcgtac
catcacgctc accaatggtt 120gaggcggtac ggaagaaatc ggccagggat aacacataca
ccagtgaggt atcctggaac 180agtacgatgc cctgagtgag cagcagcggc accatcgcgc
ggaacgcctg cggcagaata 240atcagtttca tcgactgcca gtgagtcatt cccaacgcca
gcgcggcgct cgattgacca 300cgagaaatac tttgaatacc agcacggata atctctgaat
agtaggccgc ttcaaacatc 360gaaaacgcca ccatcgccga aattaaacgg atatcatttt
ttggcgataa tcccagcacg 420ttttgcagaa aacccggcac gatcaggtaa aaccacagca
aaaccataac taaaggaatc 480gagcggaata cgttaacgta ggctttggca aaccacgcca
cgggcgcaaa gctggataaa 540cgcatcaccg ccagcatcgt gccccacaaa ataccaatca
ctaccgccgt gacggtgatt 600ttcagggtga tcaccagccc gtcgagcaga tatggcaggg
aagggacaat ggaactccag 660tcaaactcgt acattatttg ccccccatgt tgccaggcag
gcgaacttta cgttcaacca 720gcgtcatcac cagcatgata aaagcgttaa tcaacacata
cgccagcgta atggcggtaa 780acgactccca ggcatgggct gagtaatcga gcaatttacc
cgcctgcgcc gccatatcca 840ccagaccgat agtcgaggcg atggcggagt ttttcaccag
gttcatcatc tctgaggtca 900tcggcgggac gataacgcga taagcattag gcagcagtac
gtatcgataa gcctgcggta 960gcgtcaggcc catcgccagc gcggcatttt tttgccctcg
cggcagcgac tgaatcgcgg 1020cgcgtacctg ttcgcaaaca c
10411701050DNAArtificial Sequencegene editing
homology arm pool_6_b1493_left 170ggcaacctgg taagaggcgt tctgtacttt
ggtatagcct tcacgaccga ggcgcaggaa 60ttcatagtac tgtgcaatta cctgacccgc
cgggcgggag aagttgatgg caaaagtacc 120aatttgacca cccaggtagt caacgttgaa
caccagttcc tgcggcagcg cttcttcgtc 180acgccagata acccagccgc agcccagcgg
agccagaccg aatttatggc ctgaagcact 240gatcgatttc acacgcggca ggcggaagtc
ccagacgata tccggggcga cgaacggtgc 300caggaagcca ccgctggcag cgtcgatgtg
catgtcgatg tcgataccgg tatcggcctg 360gaatttatcc agcgcatcgt gcagcggttg
tgggaactca tagttaccag tgtaggtcac 420gccgaaagtc ggcaccacgc cgatggtgtt
ttcgtcacag gcttcaatca tgcgtttcgg 480gtccataaac aactgaccgg ggcgcatagg
gatctcacgc agctccacat cccagtagcg 540ggcgaattta tgccagcaga tttgtaccgg
accgcacacc aggtttggtt tatccgttgg 600tttgcctgca gcttccatac gcttgcgcca
acgccatttc atcgccatcc cgccgagcat 660acaggcctcg gaagaaccaa tggtgttggt
gccaacggcc tgaccatttt tcggcgcagg 720cgcatgccac agatcggcaa ccatatttac
gcaacgcagg tcgatggctg cggattgcgg 780atattcttct ttgtcgatcc agtttttgtt
aatggataaa tccatcaatt tgtggacatt 840ttcgtcgtcc caggtctggc agaaagtggc
caggttctga cgagcgttgc catcaagata 900taattcgtca ttgataatct ggaatgcgac
atcgtcgcgc atttcgtgca gcggaaaacg 960ttttgattct gcgatagtgg aaatagactt
cgcaccaaaa cgtgaatcga gtagttccga 1020ccttaaatcc gttacttgct tcttatccat
10501711046DNAArtificial Sequencegene
editing homology arm pool_6_b1493_right 171acaaaatcct aatgttattt
atcgtgagat attacgcgaa taatattttt tcattgaaaa 60acaatacaat atgaaattct
tgggtggtgg taaggtgttt tatgctgtta tttttatgcg 120cattctgtgt ctcctgaatt
atcacgtaaa aatcagacct taaaatatca ctattagtac 180ttgattatta ttttgaacgc
atttataaaa ttattacata aaaatagcga atattgctaa 240aatccccgcc aacgatgtgt
tgacggggct gttattattt tggcaataat actccggtat 300aagtatttac cggatgagaa
agatattgtt taacggcagt gttaacattc tctaccgtca 360tttgtttcaa caattgctcc
tgctcagtcc atgctgcagg atcgtcatat tgaataagac 420tatttacaat agtgttcgct
aattgttgaa cgctacgctg ttggatatcg aggctgcgct 480gaacgttttg ctggtattca
ttcagttctt gctcactgat ccctttagcc agacgcttaa 540ccatcacttc attcgctaac
gttaacagtt catcatgtcg ttctggttga caagtaaaag 600ccagcaaatg actgatatct
ttggcctgag gatcaaccga gaggcgagaa gaaacgctgt 660atgctccaga tgcctgttca
cgaatattaa cacgtagatc ttttgccagt gcgacgttaa 720aagcatcgag cgccatacgc
gtcggcagat taacaggtgt ccgggaatca taacgcttcc 780actgtgaaac ctgtgccaca
ggttcatttt gttcttttac agtaaccgat gcgttgtccg 840tcgcgcgagt taatggttta
cctgcggcta atggcgaatc agagtgtttg attgatccta 900agtaacgcgt aattaacgcc
acgagtttgt cttctgcgac attaccgaca atgacaaacg 960tgatatccgc tggagatgaa
aacaattggc gatcggcagc cagcgcatct gcggcagtaa 1020actgtgcaat ctgattttct
tgcagt 10461721050DNAArtificial
Sequencegene editing homology arm pool_6_b4159_left 172ataacgtaac
cgttgcacac gcaactgcgc catttcggta tcaagctgtt gtggtttcgg 60catttccggc
agccgtgcca cctgcgcccg cagcgcttcg ccgagcagat tggacgatcc 120cagccattgc
gactgttcac gcagcgtatt caacgcctgc cggacctgta acgtctggct 180ggcagcctga
cgctgttgcg aggcaacgag atccatccgc tgcgcctgtt gattcaaagc 240cgccgatagt
tcgcggttaa ttttgaattg cgcgacgata tctttcggca aatcggcgct 300gttttctgcc
agcaattcgg tactttccag cgcccgctcc gcctcaagct gacgttggct 360gtttaattga
ttacgcaagg cctgcaaata cgcatccagt tgctggctct ctttttccgc 420cagctctgag
cgtaagcgcg ctaattcctg gcggttattg gcagacagct gcgccagctc 480cagttcatca
acgagcgcct taagacgtgc agagtcagac tgcaacgcga aattttgtgc 540ctgattgagc
ggagtattgc cggtaagcgt tcccaggcgg cgctcgatct catttaactg 600acggcgggcg
tcggtttgct gttgcggcag ttgattcagc gaatcggcaa tctcgcgggc 660gcgctcctgc
tcttgctggg cctgacggct tttatccagc aactggctgc tgacctggag 720aatttcctga
ttcagcgcgt cggtagacat tcccggcgac acgctgcgcg gctcgtcacg 780catgttgttt
aattgtgcgc gcagagtagc ggagagtttc ggataattat cgataacttg 840ctgatattgt
ttgatgcgct caagggaacc ttttcgttcc tcaagcgcat ttaaggcaga 900ctggagcgcc
tctacgactt ccggctgtgc gggtttcgcc gcttttgcct gctccagttc 960ctgagtgatt
tgtttgctat cgggggccgt cgcggcgtac gccccccaac tgaggcacca 1020ggccatcaga
aaagtgataa tcaggcgcac
10501731050DNAArtificial Sequencegene editing homology arm
pool_6_b4159_right 173gtcagcgttt cctttgatgg attagacctg gtcttttttg
tcgtcaacca atgggctggc 60gtcgtgttct gcttcgatct cttcagcagg aagcggggca
ggttcagcgt ctggcgtaac 120aaaggtttcg gtagatactg ccagcggctg gccaattttc
gtgacagaca ggctttccag 180ttgctcaacc agattcactt tacccggtgc aaacaggttg
ataacggtgg aaccgagttt 240aaagcgaccc atttcctggc ctttcagcag tgccacagaa
ccgtcgtttt ccccggcagg 300ccaggtccag cgcttgatga taccttcgcg cggcggcgta
atggtgcccg cccagaccgt 360ctcaatgctg ccaacaatcg tcgctccgac cagaatctgc
gccattgggc caaattcggt 420atcgaaaagg caaatcacgc gttcgttacg ggcaaacaga
ttcggcacgt tctgagccgt 480gagatggtta acggagaaga gatcgcccgg cacgtagatc
atctcacgca gaataccgtt 540gcacggcatg tgtacgcggt ggtagtcacg cggggagagg
taagtggtca caaacgtacc 600gttgcggaac aggtccgcca tcagatagtt gcctgccagc
agggcttcga ggctgtagtt 660gtggcctttg gcttgcagga ttttatcttc ttcgatttta
cccaactggc tgataacgcc 720atcggcaggc atgaccagta cattcggatc ggtatcgatt
gggcgtactt cgtcacgcag 780cggacggaca aagaattcgt taaaggtgcg gtagctggcg
gtgtccggct tttgcgcctc 840tttcatgtcg accttgtagt atttaacgaa cagatcgata
accagttttg tcagccatcc 900tgcccgcttg cttgcgcccc aacccgccag gcgagtaagc
catagtttcg gcagaatgta 960ctgtagcgaa agtttaaatg aatttaacaa ggtagcctcc
aggccattgt tttgtcgttc 1020ctgatccggc ctacatgccg gatcctgaaa
10501741050DNAArtificial Sequencegene editing
homology arm pool_6_b3795_left 174gttcaggcat gagccaatta gcagaatagc
aatagatact gccacacccg caaccggtac 60gccgtgacgg gaaactttcg ccattgccgc
cggtaactga cggtttttcg ccagtgcgta 120gagcatacgt ccgcaactgt acatgccgct
gttacagcca gagagcgcag ccgtcagcac 180cacaaagttg ataatgcccg ccgctgcggt
aataccgatt ttggcaaaag tcagtacgaa 240cgggctgccg ttgctgccta tttcattcca
cgggaagatg gtgacgataa cgaaaatcgc 300gcctacgtag aaaatcagga tccgccacag
caccttgcct acggcactgc gcagcgtcac 360ctgcggattc ttcgcttcac cggcagtaat
gccaatcagc tccacgccct ggtaggacgc 420caccacaata cacagagcgg tcaggaaccc
tttccagcca cccgcaaaga aaccgccatg 480ctctgtgaga ttgctaaaac caatcgactg
cccgccattg ccaaagccaa agaaaatcac 540gcccaggcca atgacaatca tcacgataat
cgtggtgact ttgatcatcg cgaaccagaa 600ctcgatttcg ccgtacaacc gcaccgccgc
cagattcgcc aacgccacca gcgccactgc 660gatcaatgcg ggtatccact gcgccatctc
cgggaaccag aactggacat aaacgccaat 720ggcggtgatt tcagagatcc ccaccgccat
ccacataaac cagtaagacc aggcggtgag 780atagccaaag aacgggctca tataacgatg
cgcataaacg gcgaacgaac cggtaaccgg 840ttcgaggaac aacatttcgc ccattgaacg
catgatgaaa aagacgaaca gcccggcgat 900gatataggcc aacaatacgg atggcccggc
ccatttcagg gtactggcgg cccccataaa 960caggccgacg ccaatggtgc cccccagggc
gatgagttcg atatgtcgag cttccagccc 1020acgctgtagc tctggtttgt tatctgccat
10501751050DNAArtificial Sequencegene
editing homology arm pool_6_b3795_right 175aaatcctcgt gttgtgtttg
catgctttcc ggtgttaccg gttatcgtta tgggtacatc 60gagtgttgca aatgttttcg
taattcagga gaaatggcaa ataaagcatt aaaaatttga 120atgctttgtg taataaaaaa
gcagacaggc gacggagtga ccactccgtc gctttacaaa 180gagaggaaaa tcataggttg
ccggtgtagt gccagcgtaa ataacgcagc aaacgaagct 240gacgcttaat gcggctcggc
tgcgaaagca ggcggtagag ccactccagc cccagcgttt 300gccagatttt cggtgcgcgt
tttacgtgac cggtgaaaac atcgtaagtc ccgccaacgc 360ccatatacag cgcatctgga
tgtaccagac ggcagtcgcg catgatgatc tcctgctttg 420gcgatcccat cgcaacggtg
acgatttgcg caccgctggc atgaatgcgt tcaaacagcg 480cctgacgctg ctcgggttta
aaataaccat cctgactgcc aacgatattc acattccact 540ggttgcgcag tttagcttca
gtttgcgcca gcacttcagg tttaccgccc acaagaaata 600ccggcgtccc ttctttgcct
gcgcgcgcca tcagctcttc ccagagatcg gcaccggcaa 660cgcgggaaac ctgcgcctgc
gggtactttt tacgtactga acgtacaacg ctgatgccat 720ccgcatattt aaattcggca
gcgttaatta actccctgac ctcggcgtta tcttcaatag 780tcagcatttt ttcagcatta
atggcaacca gcgttccctg cttaagctgc ccgtcagcaa 840acagataatc gagggcgtgc
tgcatatcac gccaaccaat caactgtaag ccacgcagcg 900tataggttgg tgccgtggtg
ttgttattca ttgttatcct tcaacctgcg tccggagcga 960tgattttgta cgtttatgaa
tgagtccggc gctttcaaaa agccagtaca acagttttgc 1020gatcatcaga catgcgccga
agaccacgat 10501761050DNAArtificial
Sequencegene editing homology arm pool_6_b4246_left 176aaccagtgcc
agtaacgcct ggcgagcgaa aatcccgttg cctgcctgct ggaagtacca 60ggcgtgtggc
gttttatcaa catccgtcgc aatctcatca acacgcggca gcggatgcag 120cactttcata
ttggctttgg cgttgtggag atcgctggcg cgaagaacaa actgcgcttt 180cacgttggcg
tactcggacg ggtccagacg ctctttttgc acgcgggtca tgtacaggat 240gtctacttcc
gccatcactt cttcaataga gctgtgcaga ctccatgcga tccctttttc 300atcgagcata
tccagaatgt attgcggcat tgccagcgcg tccggcgcga tgaagtaaaa 360acggttgccg
tcgaacttcg ctaacgcctg agtcagggag tgaacggtgc ggccatattt 420caggtcacca
accattgcga cgtggagatt gtccagacgc ccctgggttt cctgaatagt 480gaataagtcc
agcaaggttt gcgtcggatg ttggttggag ccatcaccgg cattcagtac 540cggtacattg
ccggaaaact cggtggccag gcgcgccgca ccttcctgcg gatgacgcat 600cactatcgca
tcgacgtaag tgctgataac cgaaatggta tcggccagcg tttcgccctt 660tttacccagt
gatgtattgg cgctgtcgga gaagcccacc acgctggccc ccaggcggtg 720catagatgtt
tcgaaagaga ggcgggtacg ggtagaggct tcgaagaaac agctggcaat 780gactttgtgc
ttcaacagct ctggttgcgg gtttgctttc agtttcgccg ctgtcgccag 840caccagatta
aggtcatcgc gactaaggtc gtttatggaa atgatatgtt tctgatatag 900cggattagcc
atcttttatc tcctgacgcc tgggcaaaaa aaagcccctc gattgagggg 960ctgggaatgg
gtgatcaacg ggaagaaaaa cggcaggcca gcgtcttttt tcagacgcgg 1020taagacaaaa
tgtcgaacac actgaaccat
10501771050DNAArtificial Sequencegene editing homology arm
pool_6_b4246_right 177actattgctt actgctcagg gatgcgcgct atcactttaa
tttcaaaatc aaagcctgcc 60agccatgtaa cacccaccgc cgtccagttt ggataaggtg
gggcgctaaa tatttcattt 120ttcaccgtca tgatgtcttc aaattggttt tctggatcgg
tatggaagct cgtaacatca 180atgatatcgt caaaagtgca tcccgcagct gccagggtcg
catgcaaatt atcaaatgcc 240agtctgactt gttgctgaaa atcgggttct ggtgttccgt
cctctcgact tcctacttgc 300ccggaaacaa acagcaaatc gccggaacga atagccgcag
aataacgatg ctcagcatat 360agtgaatgtc ggccagcagg gaaaacagcg gttctttcta
ccatttggtt atcctcaaga 420tttacgacat gaacagaaga tttctcttta ccgggagccg
cttttagcgg acgacgtgag 480taaacaaaac ccagacatca tggataatgg ctgggcttaa
ttgagcgtag tcggttatgc 540gccaaacgcg ccatcaatgg tatgcatcgc gccggtaaca
aaactggctt ctggccctgc 600taaccatgcg accataccag cgacctcttc cggttgccca
tgtcttttga tagccatcaa 660actatgcaac atatcgcgca ttggcccgtt ggcgggatta
gcgtcggtat caattggccc 720tggctggacg acgttaatgg tgatcccacg cggtccaaaa
tcacgggcca gcccgcgcgc 780catgccttgc agggcagatt tgctggcggc ataagcagcc
atgcctgcaa caggcatacg 840atcgccattc acggagccga tgattaagat gcgcccgcct
tcgggcatct gccgggcggc 900ttcaacagag gcatgataag gagcatgaat attgattttg
aaaaggcgat caatatcgtc 960ggcatttaat tccagggcct cgccaaagac gccaatacct
gcatttacca ccaggatatc 1020caatgcgccg ctcttacgaa cgacatcaat
10501781050DNAArtificial Sequencegene editing
homology arm pool_6_b4440_left 178atggtggatg ctgcgcacct ttggtgtaga
gaaagtgtcg attctggggg gtggacttgc 60aggctggcag cgcgatgatc tgctgttaga
agaaggtgca gtagagctgc cggaaggaga 120gtttaacgcc gcgtttaatc ctgaagccgt
ggtgaaagta accgatgtat tattggcaag 180ccatgaaaat acggcgcaaa ttattgatgc
ccgcccggct gcacgtttta acgcagaagt 240tgatgaacct cgcccaggtt tacgtcgcgg
acatattccc ggagcactga atgttccgtg 300gacggaactg gtgcgcgaag gcgaactaaa
aacgaccgat gaactggatg cgatattttt 360tggtcgcggc gtcagctacg acaaaccaat
tatcgtcagc tgcggctctg gtgtaacggc 420agccgtggtt ttgttagcac tcgcgacgct
ggatgtgcca aacgtgaaac tgtacgacgg 480cgcatggagt gaatggggcg cgcgggcaga
tttaccggtt gagccagtga aataagtatt 540ttacaggcaa taaaaaaccg ccgaatttgg
cggtttttta ttgctagtct ggttcgcggc 600ctttccagca ggttgacttg tgttacatga
gcaacgcagg tgcttcacag caaaacaata 660ctcaccagta actctctttt tgtcaagcaa
aagagagtaa ttattgttta tttagcgtat 720tatcgacacc ggccctttcc gccgtgttcg
gtaataaaat aacctggctt attagtccga 780attcagacaa atataaataa atcctgctca
aaattaaaaa ttctaaccgg taaaagatat 840tacttaaaca tgtaaattca ctttccttta
aaaaacaaaa aaccgccaaa atcaggcggt 900tttttgttgc tggtccggtt cgcggccttt
ccagcaggtt gtattaccgt agtaatgcaa 960gcgcgtctca gcggagacaa tactcgccag
taactctctt tttgtcaagc aaaagagagt 1020tattattgtt ctgttagtgt attatccact
10501791050DNAArtificial Sequencegene
editing homology arm pool_6_b4440_right 179gcggcccttt ccgccgtctc
gcaaacgggc gctggcttta ggaaaggatg ttccgtggcc 60gtaaatgcag gtgtttcaca
gcgcttgcta tcgcggcaat atcgccagtg gtgctgtcgt 120gatgcggtct tcgcatggac
cgcacaatga agatacggtg cttttgtatc gtacttattg 180tttctggtgc gctgttaacc
gaggtaaata ataaccggag tctctccggc gacaatttac 240tggtggttaa caaccttcag
agcagcaagt aagcccgaat gccgcccttt gggcggcata 300ttttagatta tccgattctg
tttaaagtca cgcaaaaaac caccccagcg acgttcatag 360aatggcgcaa tatgttcggt
aataaagtgg ctaattcctt tttccccttt tttcacctga 420caaatatcga ttggttcatc
gccaggtaat gtatcggtcg ctacacttcc cgtcgcctga 480ataatttctt cgatatcacc
atcggcttca atgccaataa gtaaattagg ctgtgcctct 540tcgttctctt taattgaaca
aataaaagca cgcttcaccg gcttaatggt tttaaataag 600gtggtgagtg aatcaatcat
ttgtgctggc ggctctgcga cttccgataa tatcagcgat 660tcaccgcctt ccaggatttc
ctggctgctc agcggatttc cctcttcacc aatcaacaaa 720ctgatttcac gcggcataaa
ttctttaccg gttggcagtt tggcattaag gaagagcgtt 780tcgccaagtg tcatctcaaa
cagcgtgcga acgggcatta cgacaaatgc ctgttcgtct 840tcaaccgcct gttgaagtgc
ttctaacgag gtgaaaaaag gaatgacgct ggtgccgtct 900tctttttccc agtgctgtaa
atcaagcgcg ctatcttcaa ccacagcctc gccctgcgcc 960gccgtaccag gcacccagac
ggtggattcc agtagagtac ggaaaaaggc cgggcggtgc 1020gccggttcag ttgctgcttt
ttccagcagg 1050
User Contributions:
Comment about this patent or add new information about this topic: