Patent application title: MODIFIED GUIDE RNAS, CRISPR-RIBONUCLEOPROTEIN COMPLEXES AND METHODS OF USE

Inventors:
IPC8 Class: AC12N1511FI
USPC Class: 1 1
Publication date: 2021-05-13
Patent application number: 20210139891

Abstract:

Described herein are modified guide RNAs such as a single guide RNA including, from 5' to 3', a single-stranded protospacer sequence, a first complementary strand of a binding region for the Cas9 polypeptide, an aptamer that binds a biotin-binding molecule, and a second complementary strand of the binding region for the Cas9 polypeptide. Also described is an RNP complex including the modified guide RNA and a Cas9 polypeptide or active fragment thereof. Also included are methods of modifying target genes in cells using the modified guide RNAs.

Claims:

1. A ribonucleoprotein (RNP) complex, comprising a modified guide RNA comprising, a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for a Cas9 polypeptide, a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises a nucleic acid aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide; the biotin-binding molecule; and the Cas9 polypeptide or an active fragment thereof, wherein the Cas9 polypeptide or active fragment thereof is active for guide RNA binding, and has an active, inactive or partially inactive nuclease domain.

2. The RNP complex of claim 1, wherein the biotin-binding molecule has one, two or four biotin binding sites, and wherein the biotin-binding molecule optionally comprises a fluorescent label.

3. The RNP complex of claim 1, further comprising a biotinylated molecule.

4. The RNP complex of claim 3, wherein the biotinylated molecule is a biotinylated donor polynucleotide.

5. The RNP complex of claim 4, wherein the biotinylated donor polynucleotide comprises single-stranded DNA, double-stranded DNA, RNA, or a duplex of RNA and DNA.

6. The RNP complex of claim 4, wherein the donor polynucleotide includes a mutation, deletion, alteration, integration, gene correction, gene replacement, transgene insertion, nucleotide deletion, gene disruption, and/or gene mutation in a target nucleic acid.

7. The RNP complex of claim 3, wherein the biotinylated molecule comprises a biotinylated nanoparticle, dye, contrast agent, or peptide.

8. The RNP complex of claim 7, wherein the nanoparticle is a quantum dot, a gold particle, a magnetic particle, or a polymeric nanoparticle.

9. The RNP complex of claim 4, wherein the biotin-binding molecule is covalently linked to the donor polynucleotide, either directly or via a linker molecule.

10. The RNP complex of claim 9, wherein the donor polynucleotide comprises single-stranded DNA, double-stranded DNA, RNA, or a duplex of RNA and DNA.

11. The RNP complex of claim 10, wherein the donor polynucleotide includes a mutation, deletion, alteration, integration, gene correction, gene replacement, transgene insertion, nucleotide deletion, gene disruption, and/or gene mutation in a target nucleic acid.

12. A method of modifying a target nucleic acid in a cell, comprising delivering to the cell an RNP complex, the RNP complex comprising a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for a Cas9 polypeptide, a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises a nucleic acid aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide; the biotin-binding molecule; and the Cas9 polypeptide or an active fragment thereof, wherein the Cas9 polypeptide or active fragment thereof is active for guide RNA binding, and has an active, inactive or partially inactive nuclease domain.

13. The method of claim 12, wherein modifying the target nucleic acid increases or decreases expression of a gene product of the target nucleic acid.

14. The method of claim 12, further comprising delivering a donor polynucleotide to the cell, and wherein modifying the target nucleic acid comprises homology-directed repair (HDR).

15. The method of claim 12, further comprising delivering a donor polynucleotide to the cell, and wherein modifying the target nucleic acid comprises addition of a genetically encoded functionality, or correction of a mutation in the target nucleic acid.

16. The method of claim 12, wherein modifying the target nucleic acid creates a double strand break (DSB) which is repaired by a non-homologous end joining (NHEJ) cell repair mechanism generating indels thereby modifying the polynucleotide sequence of the target nucleic acid.

17. The method of claim 12, further comprising delivering a donor polynucleotide to the cell, and wherein modifying the target nucleic acid creates a DSB which is repaired by a HDR cell repair mechanism incorporating a donor DNA sequence thereby modifying the polynucleotide sequence of the target nucleic acid.

18. The method of claim 12, further comprising delivering a biotinylated molecule, wherein the biotinylated molecule targets the RNP complex to a specific cell type, organ or tissue.

19. A method of modifying a target nucleic acid in a cell, comprising delivering to the cell two RNP complexes, wherein each RNP complex comprises a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for a Cas9 polypeptide, a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises a nucleic acid aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide; the biotin-binding molecule; and the Cas9 polypeptide or an active fragment thereof, wherein the Cas9 polypeptide or active fragment thereof is active for guide RNA binding, and has an active, inactive or partially inactive nuclease domain.

20. The method of claim 19, further comprising delivering a donor polynucleotide to the cell, wherein the donor polynucleotide comprises a gene correction relative to the sequence of the target nucleic acid, thereby providing multiple allelic correction of the target nucleic acid, or excision of target DNA from the target nucleic acid.

21. The method of claim 19, further comprising delivering a donor polynucleotide to the cell, wherein the donor polynucleotide comprises a gene correction relative to the sequence of the target nucleic acid, thereby providing multiple allelic correction of the target nucleic acid.

22. The method of claim 19, wherein modifying the target nucleic acid provides excision of genomic DNA.

23. The method of claim 19, wherein each RNP complex further comprises a biotinylated molecule.

24. A method of modifying a target nucleic acid in a cell, the cell comprising a Cas9 polypeptide, wherein the Cas9 polypeptide is active for guide RNA binding, and has an active, inactive or partially inactive nuclease domain, the method comprising delivering to the cell a modified guide RNA, the modified guide RNA comprising, a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for a Cas9 polypeptide, a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises a nucleic acid aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide; wherein the modified guide RNA is associated with the biotin-binding molecule; and wherein the single-stranded protospacer sequence of the modified guide RNA hybridizes to a sequence in the target nucleic acid to be modified.

25. The method of claim 24, wherein two modified guide RNAs are delivered to the cell, and wherein each of the modified guide RNAs hybridizes to a different nucleic acid sequence.

26. The method of claim 24, further comprising delivering a donor polynucleotide to the cell, wherein the donor polynucleotide comprises a gene correction relative to the sequence of the target nucleic acid, thereby providing multiple allelic correction of the target nucleic acid, or excision of target DNA from the target nucleic acid.

27. A kit comprising a modified guide RNA, the modified guide RNA comprising, a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for a Cas9 polypeptide, a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises a nucleic acid aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide; a biotin-binding molecule; a Cas9 polypeptide, and a biotinylated molecule.

28. A method of modifying a target nucleic acid in a cell, comprising delivering to the cell a vector expressing a modified guide RNA, a vector expressing a Cas9 polypeptide, an avidin, and a biotinylated donor DNA template, the modified guide RNA comprising, a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for a Cas9 polypeptide, a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises a nucleic acid aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide wherein the single-stranded protospacer sequence of the modified guide RNA hybridizes to a sequence in the target nucleic acid to be modified.

29. The method of claim 28, wherein the cell is a human cell.

30. The method of claim 29, wherein the human cell is a human pluripotent stem cell line, or a primary blood cell.

Description:

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation of U.S. application Ser. No. 16/008,376, which claims priority to U.S. Provisional Application 62/519,317 filed on Jun. 14, 2017, which is incorporated herein by reference in its entirety.

FIELD OF THE DISCLOSURE

[0002] The present disclosure is related to modified guide RNAs and CRISPR-ribonucleoprotein complexes containing the modified guide RNAs and their use in genome editing methods.

BACKGROUND

[0003] Precise editing of DNA sequences in the human genome can be used to correct mutations or introduce novel genetic functionality for many biomedical purposes. Specifically, nonviral delivery of pre-formed CRISPR ribonucleoproteins (RNPs) is currently being developed for somatic gene editing applications. RNPs combining Streptococcus pyogenes Cas9 nuclease (Sp. Cas9, a high-affinity nuclease isolated from a type II CRISPR-associated system) and a single-guide RNA (sgRNA), for example, generate on-target DNA double strand breaks (DSBs) with little to no off-target DNA cleavage. This break can be repaired through error prone non-homologous end joining (NHEJ) or precise homology directed repair (HDR), in which a template is used. Co-delivery of a nucleic acid donor template with the Sp.Cas9 RNP (Sp.Cas9+sgRNA) is capable of producing precise edits at target loci through HDR of the DSB. However, variable delivery of the CRISPR system along with the donor templates generates a spectrum of edits, where a majority of cells include imprecise insertions and deletions (indels) of DNA bases from NHEJ repair of the DSB. Even when precise HDR of the DSB occurs on one allele, there is a chance that both alleles in diploid cells are not identically edited, resulting in imprecise edits on the other allele. Faithful writing of DNA, or scarless gene editing, within human cells remains an outstanding challenge.

[0004] Strategies to promote precise editing include addition of small molecules to block NHEJ and restricting Sp. Cas9 activity to particular phases of the cell cycle, but variability and toxicity have been observed across human cell lines when applying small molecules to promote HDR. Also, selection strategies through viral integration and excision of drug or cell-surface selection cassettes, flow cytometry for co-expressed fluorescent protein, or through transient drug selection can assist in the isolation of cells with one or two precisely-edited alleles. For all of these strategies, imprecise editing through NHEJ typically outnumbers precise HDR outcomes. None of the current strategies precisely control the delivery of the RNP with the donor template, and many resort to `flooding` the cell with high Cas9 expression and/or the donor template.

[0005] What is needed are new strategies for genome editing that have improved editing fidelity.

BRIEF SUMMARY

[0006] In one aspect, a modified guide RNA, comprises

[0007] a crRNA comprising a single-stranded protospacer sequence, and a first complementary strand of a binding region for the Cas9 polypeptide, and

[0008] a tracrRNA comprising, a second complementary strand of the binding region for the Cas9 polypeptide,

[0009] wherein the crRNA or the tracrRNA comprises an aptamer that binds a biotin-binding molecule,

[0010] wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide.

[0011] In another aspect, a modified sgRNA comprises, from 5' to 3',

[0012] a single-stranded protospacer sequence,

[0013] a first complementary strand of a binding region for the Cas9 polypeptide,

[0014] an aptamer that binds a biotin-binding molecule, and a second complementary strand of the binding region for the Cas9 polypeptide.
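
The 5'-to-3' ordering in paragraphs [0011] to [0014] can be sketched as a simple concatenation of four segments. The following is a minimal illustration only: the function name and every sequence in it are hypothetical placeholders chosen for readability, not the protospacer, Cas9-binding strands, or aptamer sequences of this disclosure.

```python
# Illustrative sketch only. All sequences are arbitrary placeholders,
# not sequences taken from this disclosure.

RNA_BASES = set("ACGU")
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq):
    """Return the RNA reverse complement of seq."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(seq))


def assemble_modified_sgrna(protospacer, first_strand, aptamer, second_strand):
    """Concatenate the four segments in the 5'-to-3' order described above."""
    segments = [protospacer, first_strand, aptamer, second_strand]
    for seg in segments:
        if not seg or set(seg) - RNA_BASES:
            raise ValueError(f"not a non-empty RNA sequence: {seg!r}")
    return "".join(segments)


example = assemble_modified_sgrna(
    protospacer="GACUGACUGACUGACUGACU",  # hypothetical 20-nt targeting sequence
    first_strand="GUUUUAGA",             # hypothetical first complementary strand
    aptamer="CCGGAAACCGG",               # hypothetical stand-in for the aptamer
    second_strand="UCUAAAAC",            # hypothetical second complementary strand
)

# In this toy example the two binding-region strands are exact reverse
# complements, so they could pair and leave the aptamer as the loop between them.
strands_pair = reverse_complement_rna("GUUUUAGA") == "UCUAAAAC"
```

Placing the aptamer between the two complementary strands mirrors the ordering in the text; real guide RNA scaffolds contain additional structure that this sketch does not model.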

[0015] In another aspect, an RNP complex comprises the modified guide RNA such as the sgRNA and a Cas9 polypeptide or active fragment thereof.

[0016] In another aspect, a method of modifying a target gene in a cell comprises delivering to the cell the RNP complex described above, wherein the single-stranded protospacer sequence of the modified guide RNA such as the sgRNA hybridizes to a sequence in the target gene to be modified.

[0017] In another aspect, a method of modifying a target gene in a cell comprises delivering to the cell the modified guide RNA described above, wherein the modified guide RNA is associated with a biotin-binding molecule, and wherein the single-stranded protospacer sequence of the modified guide RNA hybridizes to a sequence in the target gene to be modified.

BRIEF DESCRIPTION OF THE DRAWINGS

[0018] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.

[0019] FIG. 1 is a schematic showing assembled ssODN-S1mplexes which are complexes of Sp. Cas9 protein, sgRNA with S1m aptamer, streptavidin, and a single-stranded oligodeoxynucleotide (ssODN) donor template. S1m-sgRNAs add an RNA aptamer at the first stem loop of the sgRNA that is capable of binding streptavidin protein. A biotin-ssODN is then added to this tertiary complex. ssODN-S1mplex particles are designed to promote homology directed repair (HDR).

[0020] FIG. 2 shows the predicted secondary structure of S1m-sgRNA. Protospacer designates the region that defines the sequence to target in the human genome. S1m stem loop (coral) binds streptavidin.

[0021] FIG. 3 shows the predicted secondary structure of S1m-sgRNA variants.

[0022] FIG. 4 shows in vitro transcription of S1m-sgRNAs compared to standard sgRNAs. S1m-sgRNAs are larger than sgRNAs due to the insertion of the S1m stem loop.

[0023] FIG. 5 shows in vitro complexes of sgRNAs and streptavidin. Lane 1: S1m-sgRNA. Lane 2: streptavidin. Lanes 3-5: Progressive ratios of S1m-sgRNA:streptavidin. As streptavidin concentration was increased, the electrophoretic front of S1m-sgRNAs was slowed. The presence of several bands may be due to multiple S1m-sgRNAs binding to a single streptavidin. Lanes 6-7: Addition of streptavidin to standard sgRNAs does not shift the electrophoretic front.

[0024] FIG. 6 shows dynamic light scattering of ssODN-S1mplex (S1mplex=tertiary complexes of Sp.Cas9, S1m-sgRNA, and streptavidin) particle assembly. Cas9 (orange) and streptavidin (blue) proteins fail to interact when in solution together and have a hydrodynamic radius consistent with published data. The addition of sgRNA to Sp. Cas9 protein increases the radius of the particle to 10 nm (yellow). This radius does not change with the addition of streptavidin (red). When S1m-sgRNAs are added to Sp. Cas9 (purple), the radius is increased by a larger amount than sgRNAs, potentially due to the larger size of the S1m-sgRNA. When streptavidin is added to this complex (green), a shift in size of about 3 nm occurs, the size of streptavidin. A second peak at 35 nm may be associated with multiple Cas9-S1m-sgRNA complexes connected to a single streptavidin.

[0025] FIG. 7 shows two representative single cell multispectral flow cytometric images of S1m-sgRNA and sgRNA transfected cells with Cas9 immunohistochemistry and fluorescent streptavidin (scale bar: 10 µm). Arrowheads indicate presence of overlapping colors. Numbers in yellow are measured log Pearson correlation coefficient as determined by IDEAS software.

[0026] FIG. 8 shows the correlation coefficient of Cas9 immunocytochemistry fluorescent signal and streptavidin fluorescence, as measured by multispectral image cytometry within hPSCs. Use of S1m-sgRNA significantly increased the correlation between the two signals (***p<10^-5, Student's two-tailed t-test).
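
As a rough illustration of the kind of colocalization measure referred to above, a plain Pearson correlation coefficient between two per-pixel fluorescence channels can be sketched as follows. This is an assumption-laden sketch: it is not the calculation performed by the IDEAS software, and the function name and intensity values are hypothetical.

```python
import math


def pearson(xs, ys):
    """Plain Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant channel")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-pixel intensities for one cell (invented for illustration).
cas9_channel = [10.0, 20.0, 30.0, 40.0]
streptavidin_channel = [12.0, 22.0, 32.0, 42.0]   # tracks the Cas9 signal
unrelated_channel = [40.0, 30.0, 20.0, 10.0]      # runs opposite to it

r_colocalized = pearson(cas9_channel, streptavidin_channel)
r_opposed = pearson(cas9_channel, unrelated_channel)
```

Values near 1 indicate that the two signals rise and fall together across pixels, which is the behavior expected when Cas9 and streptavidin are held in the same complex.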

[0027] FIG. 9 shows representative confocal images of S1m-sgRNA and sgRNA transfected cells with Cas9 immunohistochemistry and fluorescent streptavidin (scale bar: 5 µm). Arrowheads indicate presence of overlapping colors.

[0028] FIG. 10 shows the correlation coefficient of Cas9 immunocytochemistry and streptavidin fluorescence inside the nuclei of transfected cells. Introduction of S1m-sgRNAs significantly increased the correlation between the two molecules (*p<0.05, Student's two-tailed t-test).

[0029] FIG. 11 shows in vitro tertiary complexes of S1m-sgRNA, streptavidin, and ssODN. Lanes 1-4: Components of S1m particles run individually. Lanes 5-7: complexes of S1m-sgRNAs, streptavidin, and biotin-ssODNs. Three concentrations of ssODN were used while the amounts of S1m-sgRNA and streptavidin were held constant. Major bands showing the complexation of all three components can be seen. Elongated bands may be due to different stoichiometry of biotin-ssODN and S1m-sgRNA connected to streptavidin.

[0030] FIG. 12 shows in vitro tertiary complexes of S1m-sgRNA, streptavidin, and ssODN. Lanes 1-4: Components of S1m particles run individually. Lanes 5-7: complexes of S1m-sgRNAs, streptavidin, and biotinylated ssODNs. Numbers represent relative stoichiometry between components run on the gel. Major bands showing the complexation of all three components can be seen. Elongated bands may be due to different stoichiometry of biotin-ssODN and S1m-sgRNA connected to streptavidin. Lanes 8-10: complexes of S1m-sgRNAs, streptavidin, and ssODNs. ssODNs do not interfere with the binary complex. Lane 11: complexes of streptavidin and biotin-ssODNs, with free sgRNAs. None of the typical S1m-sgRNA-streptavidin complexes can be seen in this lane.

[0031] FIG. 13 shows gene editing via NHEJ using S1m-sgRNA RNPs. Knockout of integrated H2B-mCherry fluorescence in human embryonic kidney (HEK) cells. When transfected together with a plasmid encoding Sp. Cas9, S1m-sgRNAs induced ~50% of the level of NHEJ induced by sgRNA, as measured by the loss of fluorescence (44.9% vs. 83.1%) five days post transfection.

[0032] FIG. 14 shows the ratio of precise to imprecise editing using S1mplexes formed with different S1m-sgRNA variants in hPSCs. Each S1m-sgRNA increased the ratio of precise to imprecise editing when compared to sgRNAs. S1mplexes with S1m-sgRNA-1, and S1m-sgRNA-2 had the highest ratios of precise editing.

[0033] FIG. 15 shows the ratio of precise to imprecise editing at BFP locus. ssODN-S1mplexes had an 18.4-fold higher ratio than sgRNAs and contained four precise edits to every one indel as analyzed by deep sequencing 8 days post lipofection of HEKs.

[0034] FIG. 16 shows the ratio of precise to imprecise editing at EMX1 locus. ssODN-S1mplexes had a 2.7-fold higher ratio than sgRNAs.

[0035] FIG. 17 shows the ratio of precise insertions to imprecise indels at BFP locus in hPSCs as analyzed by deep sequencing. ssODN-S1mplexes had a 9.7-fold increase in comparison to standard sgRNAs and a 7.4-fold increase when compared with untethered ssODNs.

[0036] FIG. 18 shows the ratio of precise insertions to imprecise indels at EMX1 locus. Addition of streptavidin to S1mplex resulted in a 15-fold increase in the ratio of precise insertions to imprecise indels.

[0037] FIGS. 19 and 20: ssODN design. Genomic sequence is denoted with black bars. sgRNA targeting site and PAM is denoted by `PAM` inside genomic locus, while red triangles are the sgRNA cut site. ssODN length is measured around cut site either upstream (-) or downstream (+) as read by the reading frame. Biotin (blue hexagon) was attached to either the 5' or 3' end of the ssODN. ssODNs were identical in sequence to either the PAM or Non-PAM sequence as read in a 5'-3' direction. RNP controls were standard sgRNAs plus corresponding ssODN.
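
The way ssODN length is measured upstream (-) and downstream (+) of the cut site can be sketched as a window around that site. The sketch below assumes the commonly described Sp. Cas9 cut position of 3 bp upstream of the PAM; the function names, the toy genomic sequence, and the arm lengths are hypothetical and are not the designs used in FIGS. 19 and 20.

```python
DNA_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the DNA reverse complement, e.g. for the opposite-strand design."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(seq))


def ssodn_window(genome, pam_start, upstream, downstream):
    """Return the PAM-strand sequence spanning the cut site.

    pam_start is the 0-based index of the first PAM base on the given strand.
    The cut is assumed to fall 3 bp upstream of the PAM (an assumption of
    this sketch); upstream/downstream are the arm lengths in nucleotides.
    """
    cut = pam_start - 3
    start, end = cut - upstream, cut + downstream
    if start < 0 or end > len(genome):
        raise ValueError("requested arms extend past the available sequence")
    return genome[start:end]


# Toy sequence invented for illustration; "TGG" plays the role of the PAM.
toy_genome = "ACGTACGTAC" + "GATTACAGATTACAGAT" + "CAT" + "TGG" + "ACGTACGTAC"
pam_index = toy_genome.find("TGG")

pam_strand_window = ssodn_window(toy_genome, pam_index, upstream=5, downstream=4)
non_pam_strand_window = reverse_complement(pam_strand_window)
```

A real donor would also carry the intended edit within this window; the sketch only shows how asymmetric arm lengths and strand choice map onto sequence coordinates.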

[0038] FIG. 19 shows absolute NHEJ (orange diamonds) and HDR percentages (purple diamonds) as a function of total reads at two different loci in hPSCs using different ssODN designs. Each symbol represents a single replicate analyzed by deep sequencing 4 days after nucleofection into hPSCs. HDR levels were generally higher in each replicate than NHEJ levels.

[0039] FIG. 20 shows the ratio of HDR:indel reads in deep sequencing using each ssODN combined with S1mplexes. Blue circles represent individual biological replicates. With each ssODN, S1mplexes increased the ratio of HDR:indel when compared to sgRNA controls but no significant trends as to symmetry, sidedness, or biotin location were observed.
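
The ratio and fold-change comparisons reported in FIGS. 14 to 20 follow from simple arithmetic on sequencing read counts. A minimal sketch is given below; the function names and all read counts are invented for illustration and are not data from this disclosure.

```python
def precise_to_imprecise_ratio(hdr_reads, indel_reads):
    """Ratio of precisely edited (HDR) reads to imprecisely edited (indel) reads."""
    if hdr_reads < 0 or indel_reads <= 0:
        raise ValueError("need non-negative HDR reads and at least one indel read")
    return hdr_reads / indel_reads


def fold_change(test_ratio, control_ratio):
    """How many times higher the test ratio is than the control ratio."""
    if control_ratio <= 0:
        raise ValueError("control ratio must be positive")
    return test_ratio / control_ratio


# Hypothetical read counts, invented for this example.
test_ratio = precise_to_imprecise_ratio(hdr_reads=400, indel_reads=100)
control_ratio = precise_to_imprecise_ratio(hdr_reads=100, indel_reads=400)
improvement = fold_change(test_ratio, control_ratio)
```

With these made-up counts the test condition has four precise edits per indel and the control has one precise edit per four indels, so the fold change is 16.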

[0040] FIG. 21 is a schematic of S1mplexes with quantum dot cargoes. Qdots can be complexed with the S1mplex by a disulfide linker (Qdot-SS-S1mplex, top) or by using streptavidin covalently attached directly to the quantum dot (QdotSA-S1mplex, bottom). The quantum dot has a mean diameter of 20 nm.

[0041] FIG. 22 shows a gene editing comparison of different Qdot S1mplexes. Gene editing of HEK H2B-mCherry reporter cells five days post sorting as assayed by flow cytometry. QdotSA interferes with RNP activity, while Qdot-SS has equivalent gene editing activity as the free RNP (n=3 technical replicates).

[0042] FIG. 23 shows gene-editing using various combinations of components with QdotSA. Conjugation of S1mplexes to QdotSA significantly lowers gene editing efficiency. Editing efficiency is lower even if QdotSA is transfected separately from the S1mplexes without complexation. S1m-sgRNA|QdotSA indicates complexation of S1m-sgRNA RNP with transfection agent in a separate tube from QdotSA complexation with transfection agent, and subsequent addition of the contents of the S1m-sgRNA tube followed immediately by addition of the QdotSA tube. 5 hr. gap indicates a 5 hour culture time between transfections. Immediate application of the QdotSA can moderately interfere with the activity of the RNP, but these interference effects are abrogated if QdotSA is added 5 hours later. All RNP activity is abrogated by complexation with the QdotSA (last column) (n=3 technical replicates).

[0043] FIG. 24 shows representative epifluorescence images of untransfected and Qdot-SS-S1mplex transfected cells 24 hours post transfection (Scale bar: 10 µm). Arrowheads indicate Qdot fluorescence in the cytoplasm.

[0044] FIG. 25 shows increased fluorescence of Qdot-S1mplex allows sorting out of quantum dot positive fractions compared to untransfected cells 24 hours post transfection.

[0045] FIG. 26 shows quantum dot conjugation to S1mplex via a cleavable disulfide linker allows fluorescent enrichment of gene-edited human cells. Increased fluorescence of Qdot-S1mplex after cleavage of the disulfide linker allows sorting out of quantum dot positive fractions compared to untransfected cells 24 hours post transfection (n=3 biological replicates).

[0046] FIG. 27 shows a schematic of the strategy for simultaneous editing at two loci. HEK cells were transfected simultaneously with two S1m particles, labeled with distinct fluorophores. Editing at the BFP locus was associated with Red-ssODN-S1mplexes (AlexaFluor®-594 fluorophore), while editing at the EMX1 locus was associated with Green-ssODN-S1mplexes (AlexaFluor®-488 fluorophore).

[0047] FIG. 28 shows single cells sorting for enrichment of editing at BFP locus. In enriched S1mplex clonal populations, indels (brown) and HDR (blue) events occurred in a 1:1 ratio. In sgRNA clones, all isolated clones either had indel or wildtype genotypes. Genotypes were assayed by Sanger sequencing. No mosaic genotypes were observed.

[0048] FIG. 29 shows fluorescent S1mplexes inside the cell using confocal microscopy. Arrows denote Green-S1mplex both inside the nucleus and outside the cell (Scale bar: 10 µm).

[0049] FIG. 30 shows deep sequencing analysis of sorted populations. Twenty-four hours post transfection, cells were sorted into populations that were positive for either fluorophore, both, or neither. Analysis via deep sequencing was done 6 days post sorting. Top: ratio of precise (perfect sequence match to ssODN) to imprecise editing (indels) in sorted populations. Populations enriched for BFP targeted S1mplexes (Red+ and double positive) had elevated ratios, with up to 40 times as many insertions as indels. Bottom: ratio of precise to imprecise editing in sorted populations. Populations enriched for EMX1 targeted S1mplexes (Green+ and double positive) had elevated ratios of precise insertions to indels.

[0050] FIG. 31 shows off-target analysis of double positive populations using TIDE at the top 5 off-target locations for each sgRNA. No modifications were detected above the TIDE limit of detection (dotted line).

[0051] FIG. 32 shows an off-target analysis of sorted S1mplex populations. Off-target analysis using TIDE software at the top 5 predicted off-target sites within the human genome at the BFP and EMX1 loci. Y axis indicates the percentage of cells with 0 mismatches from the parental sequence (perfect matches in sequencing reads). None of the sorted S1mplex populations showed off-target effects above the limit of detection. The unsorted sgRNA RNP population had a small proportion of cells that may have been edited at OT-2 of the EMX1 off-target sites.

[0052] FIG. 33 shows release of a biotin-ssODN through a photocleavable linkage had no significant effect on HDR editing. FIG. 33a shows a biotin-ssODN that contained a UV-cleavable linker was attached to streptavidin and S1mplex particles in order to study the potential of releasing the ssODN inside the cell to promote HDR. Lane 1: DNA standard. Lane 2: Photo-cleavable biotin-ssODN. Lane 3: standard ssODN. Lane 4: Binary complexes of streptavidin and photo-cleavable biotin-ssODNs. Lane 5-6: Binary complexes cleaved by either exposure to light through a DAPI filter cube (lane 5) or exposure to a UV transilluminator (lane 6). DAPI filter cube cleaved nearly all ssODN after 10 minutes whereas transilluminator had complete cleavage. Cleaved DNA product was the same length as control standard ssODN. FIG. 33b shows release of biotin-ssODN by 15 minutes of light exposure through a DAPI filter cube every hour post transfection. Levels of HDR were not significantly affected by the release of the ssODN within the cell at any time point (n=3 biological replicates).

[0053] FIG. 34 is a schematic of the structure and sequence of S1m-sgRNA-V3. This sequence removes 6 nt from the beginning of the S1m aptamer. Removal of these nucleotides simplified the secondary structure of the RNA. This modification may potentially decrease the number of incorrectly folded and therefore inactive S1m-sgRNAs.

[0054] FIG. 35 shows the binding capability of S1m-sgRNA-1 and S1m-sgRNA-V3 with streptavidin using an electrophoretic mobility shift assay (EMSA). S1m-sgRNAs or standard sgRNAs were mixed with native streptavidin protein at the indicated ratios (w/w) and allowed to complex prior to being loaded on an agarose gel. Lane 1: S1m-sgRNA-1. Lane 2: S1m-sgRNA-V3. Lane 3: Streptavidin. Lane 4: 10:1 S1m-sgRNA-1:Streptavidin. Lane 5: 1:1 S1m-sgRNA-1:Streptavidin. Lane 6: 1:10 S1m-sgRNA-1:Streptavidin. Lane 7: 10:1 S1m-sgRNA-V3:Streptavidin. Lane 8: 1:1 S1m-sgRNA-V3:Streptavidin. Lane 9: 1:10 S1m-sgRNA-V3:Streptavidin. Lane 10: sgRNA. Lane 11: 1:10 sgRNA:Streptavidin.

[0055] FIG. 36 shows the induction of NHEJ using various sgRNAs. Cas9 RNPs were formed with standard sgRNA, S1m-sgRNA-1, or S1m-sgRNA-V3 targeting the same locus and transfected into H2b-mCherry expressing HEK cells. % NHEJ was measured by loss of fluorescence 7 days post transfection. Both S1m-sgRNA versions were less effective at creating double strand breaks repaired by NHEJ than standard sgRNA. S1m-sgRNA-V3 induced more NHEJ events than V1 (~3-fold higher), potentially due to its simplified secondary structure. Both S1m-sgRNA variants were still capable of creating genetic modifications (n=3 technical replicates. Error bars represent ±1 S.D.).

[0056] FIG. 37 shows the induction of HDR using various sgRNAs. Cas9 RNPs were formed with standard sgRNA, S1m-sgRNA-1, or S1m-sgRNA-V3 targeting the same locus. S1m-sgRNA-1 and V3 were also used to create S1mplexes containing an ssODN to induce HDR at the target site. S1m-sgRNAs again formed fewer DSBs and S1m-sgRNA-V3 was more efficient at inducing NHEJ than V1. Similarly, when S1mplexes were formed using S1m-sgRNAs, V3 induced higher levels of HDR than V1. However, in this replicate, ratios of HDR:NHEJ differed from what was seen in previous experiments (n=3 technical replicates. Error bars represent ±1 S.D.).

[0057] FIG. 38 shows identification of corrected Pompe iPSCs using the ArrayEdit platform following transfection with fluorescent S1mplexes. ArrayEdit enables tracking of phenotypic characteristics.

[0058] FIG. 39 shows the phenotypic difference between wildtype and Pompe disease iPSCs. Cell lines were cocultured together at the indicated ratio and evaluated for the presence of mCherry (wildtype) or DAPI (disease). Lysosome acidity was measured using LysoSensor™ Green and quantified on a per-cell basis.

[0059] FIG. 40 shows identification of corrected Pompe iPSCs. Pompe iPSCs and H9-H2b-mCherry cells were mock transfected and plated on the ArrayEdit platform. Over seven days, the number of cells per feature was tracked and used to calculate the average growth rate (bottom right). On day seven, wells were stained with LysoSensor™ Green and per-cell intensity was measured (top left). Data were plotted as a per-feature average. Pompe iPSCs were transfected with S1mplex-ssODNs targeting diseased loci and analyzed in the same manner as described above but with the addition of S1mplex presence on day 1. Clones to be selected (bottom left) were determined by gating out the lowest average growth rate of mock transfected cells as well as the upper intensity limit of mock transfected Pompe iPSCs. Microfeatures with cells meeting both of these criteria as well as displaying S1mplex presence were selected and expanded.

[0060] FIG. 41 shows selection of gene-corrected disease iPSCs. Sanger sequencing traces of corrected cell lines. Heterozygous mutations within the PAM sequence show that the ssODN was used as the HDR template in all lines.

[0061] FIG. 42 shows dual S1mplexes for the precise excision of genomic DNA. a) Two sgRNAs designed in the LAMA5 locus for excision of a 238 bp stretch of genomic DNA. b) Mixed S1m sgRNAs (1,2) with streptavidin added to HEK 293s, with ratio sgRNA:streptavidin 2:1 at 50 ng/well per guide. Gel shows LAMA5 locus PCR amplicon spanning both guides. Average excision efficiency of 22% with dual S1mplexes.

[0062] The above-described and other features will be appreciated and understood by those skilled in the art from the following detailed description, drawings, and appended claims.

DETAILED DESCRIPTION

[0063] Described herein are modified guide RNAs such as sgRNAs and their RNP complexes with Cas9. Without being held to theory, the inventors hypothesized that some of the errors in gene editing outcomes could be reduced by preassembling RNPs with donor template or other moieties that enable the isolation of precisely-edited cells (FIG. 1). The inventors designed a strategy inspired by CRISPR display that leverages structural studies of the RNP to identify locations in the guide RNA sequence where RNA aptamers could be tolerated.

[0064] The S1mplex tool described here exploits high affinity interactions between a short RNA aptamer and streptavidin to promote more faithful writing of the human genome. In an aspect, these RNP-containing complexes can be assembled outside the cell to a desired stoichiometry and delivered as an all-in-one gene-editing nanoparticle together with a donor nucleic acid template. In addition, the complexes can be easily decorated with additional moieties such as fluorophores or Qdots to enrich for edited cells. Use of these particles with a biotinylated ssODN reduced heterogeneity in delivery among RNPs and nucleic acids within human cells and enriched the ratio of precisely-edited to imprecisely-edited alleles up to 18-fold over standard RNP methods, approaching a ratio of four precise edits to every one imprecise edit. Further functionalization with a unique fluorophore enables multiplexed editing and enrichment of precisely edited populations through cell sorting. Taken together, advances with the S1mplex tool generate new, chemically-defined reagents to promote precise editing of the human genome.

[0065] The inventors devised a strategy inspired by CRISPR display that leverages structural studies of the RNP to identify locations in the sgRNA sequence where RNA aptamers could be tolerated (FIG. 1). Three sgRNAs with a modification either in a stem loop of the sgRNA or at the 3' end were designed (FIG. 2), as these locations have previously been shown to tolerate additions with a minimal loss in Cas9 binding activity. Separately, at each location, a perfectly complementary 10 nucleotide block was added which was previously shown to aid aptamer addition to sgRNAs and a 60 nucleotide S1m aptamer, which has a strong non-covalent interaction with streptavidin. The added sequence extends the sgRNA stem loop and contains two distinct bulges used for binding. We termed these new sgRNAs S1m-sgRNA-1, S1m-sgRNA-2, and S1m-sgRNA-3 in reference to their position in the sgRNA from 5' to 3' (FIG. 2).

[0066] CRISPR refers to the Clustered Regularly Interspaced Short Palindromic Repeats type II system used by bacteria and archaea for adaptive defense. This system enables bacteria and archaea to detect and silence foreign nucleic acids, e.g., from viruses or plasmids, in a sequence-specific manner. In type II systems, guide RNA interacts with Cas9 and directs the nuclease activity of Cas9 to target DNA sequences complementary to those present in the guide RNA. Guide RNA base pairs with complementary sequences in target DNA. Cas9 nuclease activity then generates a double-stranded break in the target DNA.

[0067] CRISPR/Cas9 is an RNP complex. CRISPR RNA (crRNA) includes a 20 base protospacer element that is complementary to a genomic DNA sequence as well as additional elements that are complementary to the transactivating RNA (tracrRNA). The tracrRNA hybridizes to the crRNA and binds to the Cas9 protein, to provide an active RNP complex. Thus, in nature, the CRISPR/Cas9 complex contains two RNA species.

[0068] sgRNA refers to a single RNA species which combines the tracrRNA and the crRNA and is capable of directing Cas9-mediated cleavage of target DNA. An sgRNA thus contains the sequences necessary for Cas9 binding and nuclease activity and a target sequence complementary to a target DNA of interest (protospacer sequence). In general, in an sgRNA, the tracrRNA and the crRNA are connected by a linker loop sequence. sgRNAs are well-known in the art. While sgRNA is generally used throughout this disclosure, two-part guide RNAs containing a crRNA and a tracrRNA can also be employed.

[0069] As used herein, a guide RNA protospacer sequence refers to the nucleotide sequence of a guide RNA that binds to a target DNA sequence and directs Cas9 nuclease activity to the target DNA locus. In some embodiments, the guide RNA protospacer sequence is complementary to the target DNA sequence. As described herein, the protospacer sequence of a single guide RNA may be customized, allowing the targeting of Cas9 activity to a target DNA of interest.

[0070] Any desired target DNA sequence of interest may be targeted by a guide RNA target sequence. Any length of target sequence that permits CRISPR-Cas9 specific nuclease activity may be used in a guide RNA. In some embodiments, a guide RNA contains a 20 nucleotide protospacer sequence.

[0071] In addition to the protospacer sequence, the targeted sequence includes a protospacer adjacent motif (PAM) adjacent to the protospacer region, which is a sequence recognized by the CRISPR RNP as a cutting site. Without wishing to be bound to theory, it is thought that the only requirement for a target DNA sequence is the presence of a PAM adjacent to the sequence complementary to the guide RNA target sequence. Different Cas9 complexes are known to have different PAM motifs. For example, Cas9 from Streptococcus pyogenes has an NGG trinucleotide PAM motif; the PAM motif of N. meningitidis Cas9 is NNNNGATT; the PAM motif of S. thermophilus Cas9 is NNAGAAW; and the PAM motif of T. denticola Cas9 is NAAAAC.
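The PAM requirement described above can be illustrated with a short Python sketch that scans one strand of a DNA sequence for a protospacer immediately followed by a PAM. This is an illustration only, not part of the disclosed methods: the function names, the dictionary labels, and the choice to scan only the strand as written are assumptions made for the example. N and W are interpreted with their standard IUPAC meanings (any base; A or T).

```python
import re

# IUPAC codes needed for the PAM motifs quoted in the text.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]", "W": "[AT]"}

# PAM motifs as listed in the paragraph above (keys are illustrative labels).
PAMS = {
    "S. pyogenes": "NGG",
    "N. meningitidis": "NNNNGATT",
    "S. thermophilus": "NNAGAAW",
    "T. denticola": "NAAAAC",
}


def pam_regex(pam):
    """Translate a PAM written with IUPAC codes into a regex fragment."""
    return "".join(IUPAC[base] for base in pam)


def find_targets(dna, pam="NGG", protospacer_len=20):
    """Return (start, protospacer, pam) for each site on the given strand.

    Hypothetical helper: only the strand as written is scanned, and the
    PAM is assumed to lie directly 3' of the protospacer.
    """
    dna = dna.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(
        r"(?=([ACGT]{%d})(%s))" % (protospacer_len, pam_regex(pam))
    )
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(dna)]
```

A complete search would also scan the reverse complement, since a target site may lie on either strand.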

[0072] A modified guide RNA is a one-part or two-part RNA capable of directing Cas9-mediated cleavage of target DNA. A modified sgRNA is a single RNA species capable of directing Cas9-mediated cleavage of target DNA. A modified sgRNA, for example, comprises sequences that provide Cas9 nuclease activity, a protospacer sequence complementary to a target DNA of interest, and an aptamer that binds a biotin-binding molecule. The inventors of the present application unexpectedly found that the linker loop that connects the tracrRNA and the crRNA in an sgRNA can be replaced with an aptamer that binds a biotin-binding molecule such as a streptavidin-binding aptamer. Unexpectedly, the modified sgRNAs can bind both Cas9 protein and streptavidin, and form active RNP complexes which induce error-prone DNA repair less frequently than standard CRISPR-Cas9 RNP complexes.

[0073] In an aspect, a modified guide RNA comprises a crRNA comprising a single-stranded protospacer sequence and a first complementary strand of a binding region for the Cas9 polypeptide, and a tracrRNA comprising a second complementary strand of the binding region for the Cas9 polypeptide, wherein the crRNA or the tracrRNA comprises an aptamer that binds a biotin-binding molecule, wherein the crRNA and the tracrRNA hybridize through the first and second complementary strands of the binding region for the Cas9 polypeptide.

[0074] In another aspect, the crRNA and the tracrRNA form an sgRNA, the sgRNA comprising, from 5' to 3',

[0075] the single-stranded protospacer sequence,

[0076] the first complementary strand of a binding region for the Cas9 polypeptide,

[0077] the aptamer that binds a biotin-binding molecule, and

[0078] the second complementary strand of the binding region for the Cas9 polypeptide.

[0079] More specifically, a modified sgRNA comprises, from 5' to 3', a single-stranded protospacer sequence, a first complementary strand of a binding region for the Cas9 polypeptide, an aptamer that binds a biotin-binding molecule, and a second complementary strand of the binding region of the Cas9 protein. In an embodiment, in the secondary structure of the modified sgRNA, the stem forms a stem-loop structure with the aptamer that binds the biotin-binding molecule. Specific modified sgRNAs are provided in FIG. 2.

[0080] The single-stranded protospacer region can comprise 17 to 20 nucleotides. Exemplary binding regions for Cas9 polypeptides comprise 10 to 35 base pairs.

[0081] In an aspect, the aptamer that binds a biotin-binding molecule forms a stem-loop structure. The stem portion of the stem-loop structure optionally forms a contiguous double strand with the double-stranded binding region for the Cas9 polypeptide. The stem portion of the aptamer can comprise 9 to 15 base pairs, while the loop comprises 30 nucleotides. As shown in FIG. 2, the aptamer may contain more than one stem-loop structure. As shown in Example 9, the length of the stem portion of the aptamer is not critical and can be adjusted depending on the application of the modified guide RNA.

[0082] Also included herein is an RNP complex comprising the modified guide RNA, e.g., sgRNA, and a Cas9 polypeptide or active fragment thereof. Exemplary modified sgRNAs include:

TABLE-US-00001
(SEQ ID NO: 1)
NNNNNNNNNNNNNNNNNNNNGUUUAAGAGCUAUGCUGCGAAUACGAGAUG
CGGCCGCCGACCAGAAUCAUGCAAGUGCGUAAGAUAGUCGCGGGUCGGCG
GCCGCAUCUCGUAUUCGCAGCAUAGCAAGUUUAAAUAAGGCUAGUCCGUU
AUCAACUUGAAAAAGUGGCACCGAGUCGGUGCUUUU;
(SEQ ID NO: 2)
NNNNNNNNNNNNNNNNNNNNGUUUAAGAGCUAUGCUGGAAACAGCAUAGC
AAGUUUAAAUAAGGCUAGUCCGUUAUCAACUUCGAAUACGAGAUGCGGCC
GCCGACCAGAAUCAUGCAAGUGCGUAAGAUAGUCGCGGGUCGGCGGCCGC
AUCUCGUAUUCGGAAAAAGUGGCACCGUGACGGUGCUUUU;
(SEQ ID NO: 3)
NNNNNNNNNNNNNNNNNNNNGUUUAAGAGCUAUGCUGGAAACAGCAUAGC
AAGUUUAAAUAAGGCUAGUCCGUUAUCAACUUGAAAAAGUGGCACCGUGA
CGGUGCCGAAUACGAGAUGCGGCCGCCGACCAGAAUCAUGCAAGUGCGUA
AGAUAGUCGCGGGUCGGCGGCCGCAUCUCGUAUUCGUUUU; or
(SEQ ID NO: 70)
NNNNNNNNNNNNNNNNNNNNGUUUAAGAGCUAUGCUGCGAAUACGAGCCG
CCGACCAGAAUCAUGCCAAGUGCGUAAGAUAGUCGCGGGUCGGCGGCUCG
UAUUCGCAGCAUAGCAAGUUUAAAUAAGGCUAGUCCGUUAUCAACUUGAA
AAAGUGGCACCGUGAAGUCGGUGCUUUU
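In the sequences above, the leading run of N residues stands for the customizable protospacer. As a minimal sketch of how a target-specific guide could be written out from such a template, the hypothetical Python helper below replaces the leading N placeholders with a protospacer supplied as DNA or RNA. The function name and the validation rules are assumptions for illustration. Only the first 37 nucleotides shared by the listed sequences are used as the example template; the remainder of each scaffold is omitted for brevity.

```python
def fill_protospacer(template, protospacer):
    """Replace the leading run of N placeholders with a protospacer.

    Hypothetical helper: the protospacer may be given as DNA (T) or RNA (U)
    and must be exactly as long as the run of N placeholders.
    """
    n_len = len(template) - len(template.lstrip("N"))
    if n_len == 0:
        raise ValueError("template has no leading N placeholders")
    rna = protospacer.upper().replace("T", "U")
    if len(rna) != n_len:
        raise ValueError(f"protospacer must be {n_len} nt, got {len(rna)}")
    if set(rna) - set("ACGU"):
        raise ValueError("protospacer contains non-nucleotide characters")
    return rna + template[n_len:]


# Prefix shared by the sequences listed above: 20 N placeholders followed by
# the first 17 nt of the scaffold. The rest of the scaffold is omitted here.
TEMPLATE_PREFIX = "N" * 20 + "GUUUAAGAGCUAUGCUG"
```

Because the helper requires an exact length match, a 17 to 19 nucleotide protospacer would need a template with a correspondingly shorter placeholder run.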

[0083] A "Cas9" polypeptide is a polypeptide that functions as a nuclease when complexed to a guide RNA, e.g., an sgRNA or modified sgRNA. The Cas9 (CRISPR-associated 9, also known as Csn1) family of polypeptides, for example, when bound to a crRNA:tracrRNA guide or single guide RNA, are able to cleave target DNA at a sequence complementary to the sgRNA target sequence and adjacent to a PAM motif as described above. Cas9 polypeptides are characteristic of type II CRISPR-Cas systems. The broad term "Cas9 polypeptide" includes natural sequences as well as engineered polypeptides with Cas9 function. The term "Cas9 polypeptide" also includes the analogous Clustered Regularly Interspaced Short Palindromic Repeats from Prevotella and Francisella 1 or CRISPR/Cpf1, which is a DNA-editing technology analogous to the CRISPR/Cas9 system. Cpf1 is an RNA-guided endonuclease of a Class 2 CRISPR/Cas system. This acquired immune mechanism is found in Prevotella and Francisella bacteria. Additional Class 1 Cas proteins include Cas3, Cas8a, Cas5, Cas8b, Cas8c, Cas 10d, Cse 1, Cse 2, Csy 1, Csy 2, Csy 3, GSU0054, Cas 10, Csm 2, Cmr 5, Cas10, Csx11, Csx10, and Csf 1. Additional Class 2 Cas9 polypeptides include Csn 2, Cas4, C2c1, C2c3 and Cas13a.

[0084] Exemplary Cas9 polypeptides include Cas9 polypeptide derived from Streptococcus pyogenes, e.g., a polypeptide having the sequence of the Swiss-Prot accession Q99ZW2 (SEQ ID NO: 5); Cas9 polypeptide derived from Streptococcus thermophilus, e.g., a polypeptide having the sequence of the Swiss-Prot accession G3ECR1 (SEQ ID NO: 6); a Cas9 polypeptide derived from a bacterial species within the genus Streptococcus; a Cas9 polypeptide derived from a bacterial species in the genus Neisseria (e.g., GenBank accession number YP_003082577; WP_015815286.1 (SEQ ID NO: 7)); a Cas9 polypeptide derived from a bacterial species within the genus Treponema (e.g., GenBank accession number EMB41078 (SEQ ID NO: 8)); and a polypeptide with Cas9 activity derived from a bacterial or archaeal species. Methods of identifying a Cas9 protein are known in the art. For example, a putative Cas9 protein may be complexed with crRNA and tracrRNA or sgRNA and incubated with DNA bearing a target DNA sequence and a PAM motif.

[0085] The term "Cas9" or "Cas9 nuclease" refers to an RNA-guided nuclease comprising a Cas9 protein, or a fragment thereof (e.g., a protein comprising an active, inactive, or partially active DNA cleavage domain of Cas9, and/or the gRNA binding domain of Cas9). In some embodiments, a Cas9 nuclease has an inactive (e.g., an inactivated) DNA cleavage domain, that is, the Cas9 is a nickase. In other embodiments, both DNA cleavage domains are inactivated. This is referred to as catalytically-inactive Cas9, dead Cas9, or dCas9.

[0086] Functional Cas9 mutants are described, for example, in US20170081650 and US20170152508, incorporated herein by reference for their disclosure of Cas9 mutants.

[0087] In addition to the modified sgRNA and the Cas9 polypeptide or active fragment thereof, an RNP complex may further comprise a biotin-binding molecule such as avidin, streptavidin, or NeutrAvidin™, which binds with high affinity to the aptamer that binds the biotin-binding molecule in the modified sgRNA. Avidin, streptavidin and NeutrAvidin™ are tetramers, and each subunit can bind biotin with equal affinity. Avidin, streptavidin and NeutrAvidin™ variants that contain one, two or three biotin binding sites are also available and may be employed in the complex.

[0088] When the RNP complex comprises a biotin-binding molecule, the complex can further comprise a biotinylated molecule which associates with the complex via the biotin-binding molecule. The biotinylated molecule can target the RNP complex to a specific cell type, organ or tissue. For example, PEG-coated gold nanoparticles exhibit size-dependent in vivo toxicity; the renal clearance of quantum dots can be controlled; and the accumulation of PEGylated silane-coated magnetic iron oxide nanoparticles has been shown to be size dependent.

[0089] In one embodiment, the biotinylated molecule is a biotinylated oligodeoxynucleotide, such as a biotinylated donor DNA template. Homologous recombination can insert an exogenous polynucleotide sequence into the target nucleic acid cleavage site. An exogenous polynucleotide sequence can be called a donor polynucleotide or a donor sequence. In some embodiments, a donor polynucleotide, a portion of a donor polynucleotide, a copy of a donor polynucleotide, or a portion of a copy of a donor polynucleotide can be inserted into a target nucleic acid cleavage site. A donor polynucleotide can be single-stranded DNA, double-stranded DNA, RNA, or a duplex of RNA and DNA. A donor polynucleotide can be a sequence that does not naturally occur at a target nucleic acid cleavage site. In some embodiments, modifications of a target nucleic acid due to NHEJ and/or HDR can lead to, for example, mutations, deletions, alterations, integrations, gene correction, gene replacement, transgene insertion, nucleotide deletion, gene disruption, and/or gene mutation. The process of integrating non-native nucleic acid(s) into genomic DNA can be referred to as "genome engineering".

[0090] In an embodiment, the biotinylated molecule is a nanoparticle, such as a quantum dot, a gold particle, a magnetic particle, or a polymeric nanoparticle. In another embodiment, the biotinylated molecule is a biotinylated fluorescent dye such as Atto 425-Biotin, Atto 488-Biotin, Atto 520-Biotin, Atto 550-Biotin, Atto 565-Biotin, Atto 590-Biotin, Atto 610-Biotin, Atto 620-Biotin, Atto 655-Biotin, Atto 680-Biotin, Atto 700-Biotin, Atto 725-Biotin, Atto 740-Biotin, fluorescein biotin, biotin-4-fluorescein, biotin-(5-fluorescein) conjugate, and biotin-B-phycoerythrin, Alexa Fluor® 488 biocytin, Alexa Fluor® 546, Alexa Fluor® 549, lucifer yellow cadaverine biotin-X, Lucifer yellow biocytin, Oregon green 488 biocytin, biotin-rhodamine and tetramethylrhodamine biocytin. The biotinylated molecule may also be a peptide, protein, or protein domain, specifically an antibody or Fab domain.

[0091] In another aspect, the biotin-binding molecule can be covalently linked to a donor polynucleotide, a nanoparticle, or a dye molecule either directly or via a linker molecule, using, for example, a disulfide linker. The bound biotin-binding molecule can then bind the aptamer of the modified sgRNA. Additional biotinylated donor polynucleotides, nanoparticles, contrast agents, or dye molecules can then be associated with the bound biotin-binding molecule. Alternatively, the biotin-binding molecule can be associated with the biotinylated molecule prior to adding it to the modified sgRNA.

[0092] Further included herein are methods of modifying a target gene, such as a target gene in a cell, by contacting the cell with the RNP complexes and modified guide RNAs described herein. The cell can be from any organism (e.g., a bacterial cell, an archaeal cell, a cell of a single-cell eukaryotic organism, a plant cell, an algal cell, a fungal cell (e.g., a yeast cell), a cell from an invertebrate animal, a cell from a vertebrate animal, or a cell from a mammal, including a cell from a human).

[0093] Also included herein is a method of modifying a target gene in a cell, comprising delivering to the cell the modified guide RNA, wherein the modified guide RNA is associated with a biotin-binding molecule, and wherein the single-stranded protospacer sequence of the modified guide RNA hybridizes to a sequence in the target gene to be modified.

[0094] In some embodiments, the present disclosure provides for methods of modifying a target gene in a plant. As used herein, the term "plant" refers to whole plants, plant organs, plant tissues, seeds, plant cells, and progeny of the same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen and microspores. Plant parts include differentiated and undifferentiated tissues including, but not limited to, roots, stems, shoots, leaves, pollens, seeds, tumor tissue and various forms of cells and culture (e.g., single cells, protoplasts, embryos, and callus tissue).

[0095] In an embodiment, modifying the target gene increases or decreases the expression of a gene product of the target gene.

[0096] In another embodiment, modifying the target gene comprises high-fidelity homology-directed repair (HDR).

[0097] In another embodiment, modifying the target gene comprises the addition of a genetic functionality, or the correction of a mutation.

[0098] In yet another embodiment, modifying the target gene creates a double strand break (DSB) which is repaired by a non-homologous end joining (NHEJ) cell repair mechanism generating indels thereby modifying the polynucleotide sequence of the target gene.

[0099] In a further embodiment, modifying the target gene creates a DSB which is repaired by a homology-directed repair (HDR) cell repair mechanism incorporating a donor DNA sequence, thereby modifying the polynucleotide sequence of the target gene.

[0100] In an aspect, the S1m-sgRNAs described herein can be used for biallelic correction. Infantile-onset Pompe disease contains two distinct deleterious mutations at different points within a single gene. In an aspect, two S1m-sgRNAs can be employed simultaneously, one for correction of each disease locus. As shown in Example 11, clones containing edits at both alleles were identified.

[0101] In another aspect, the S1m-sgRNAs described herein can be used for the excision of genomic DNA. In an aspect, two S1m-sgRNAs can be employed simultaneously, wherein each S1m-sgRNA targets an end of the region to be excised. As shown in Example 12, human cells contain the properly excised region of genomic DNA.
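The outcome of a two-guide excision can be sketched in Python as a simple string operation. This is an idealized illustration under stated assumptions: each guide is assumed to produce a single blunt cut at a known position, and the two flanks are assumed to rejoin without insertions or deletions. The function name and the coordinate convention are hypothetical.

```python
def excise(sequence, cut_a, cut_b):
    """Join the sequence flanking two cut positions.

    Positions are 0-based and refer to the boundary before the base at that
    index. Returns the joined sequence and the number of bases removed.
    Idealized: blunt cuts and scarless rejoining are assumed.
    """
    left, right = sorted((cut_a, cut_b))
    if not 0 <= left <= right <= len(sequence):
        raise ValueError("cut positions fall outside the sequence")
    return sequence[:left] + sequence[right:], right - left
```

For example, cuts placed 238 positions apart remove a 238 bp stretch, so a PCR amplicon spanning both guides would be expected to be 238 bp shorter for an excised allele than for an unedited one.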

[0102] Delivery of polynucleotides and RNPs of the present disclosure to cells, in vitro, or in vivo, may be achieved by a number of methods known to one of skill in the art. These methods include lipofection, electroporation, nucleofection, microinjection, biolistics, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates. Lipofection is well known and lipofection reagents are sold commercially. Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides are described in the art.

[0103] Lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, and the preparation of such complexes is well known to one of skill in the art.

[0104] Electroporation can be used to deliver the polynucleotides and RNPs of the present disclosure. In these methods, the polynucleotides or RNPs are mixed in an electroporation buffer with the target cells to form a suspension. This suspension is then subjected to an electrical pulse at an optimized voltage, which creates temporary pores in the phospholipid bilayer of the cell membrane, permitting charged molecules like DNA and proteins to be driven through the pores and into the cell. Reagents and equipment to perform electroporation are sold commercially.

[0105] Biolistic, or microprojectile delivery, can be used to deliver the polynucleotides and RNPs of the present disclosure. In these methods, microprojectiles, such as gold or tungsten, are coated with the polynucleotide by precipitation with calcium chloride, spermidine or polyethylene glycol. The microprojectile particles are accelerated at high speed into a cell using a device such as the BIOLISTIC.RTM. PDS-1000/He Particle Delivery System (Bio-Rad; Hercules, Calif.).

[0106] In another embodiment, a viral vector expressing the modified guide RNA of the present disclosure, a viral vector expressing a Cas9 polypeptide, and a biotinylated donor DNA template can be transfected into a cell, such as a human cell. Human cells include human pluripotent stem cell lines and primary blood cells such as hematopoietic stem and progenitor cells and T-cells. Once editing has occurred in the cell line, the cells can be differentiated and transplanted into a subject, or used for drug development.

[0107] In some embodiments, the polynucleotides of the present disclosure may also comprise modifications that, for example, increase stability of the polynucleotide. Such modifications may include phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates such as 3'-alkylene phosphonates, 5'-alkylene phosphonates, chiral phosphonates, phosphinates, phosphoramidates including 3'-amino phosphoramidate and amino alkylphosphoramidates, phosphorodiamidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates, and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs, and those having inverted polarity wherein one or more internucleotide linkages is a 3' to 3', a 5' to 5' or a 2' to 2' linkage. Exemplary nucleic acid-targeting polynucleotides having inverted polarity can comprise a single 3' to 3' linkage at the 3'-most internucleotide linkage (i.e., a single inverted nucleoside residue in which the nucleobase is missing or has a hydroxyl group in place thereof). Various salts (e.g., potassium chloride or sodium chloride), mixed salts, and free acid forms can also be included.

[0108] In some embodiments, the polynucleotides of the present disclosure may also contain other nucleic acids, or nucleic acid analogues. An example of a nucleic acid analogue is peptide nucleic acid (PNA).

[0109] The invention is further illustrated by the following non-limiting examples.

EXAMPLES

Methods

[0110] Cell Culture:

[0111] WA09 hESCs (WiCell, Madison, Wis.) were maintained in E8 medium on Matrigel® (WiCell) coated tissue culture polystyrene plates (BD Falcon). Cells were passaged every 3-4 days at a 1:6 ratio using Versene® solution (Life Technologies). WA09-BFP hESCs were generated through lentiviral transduction of BFP dest clone (Addgene #71825) and sorted to ensure clonal populations. After expansion, lines were sorted monthly on a BD FACS Aria to maintain expression levels.

[0112] Human embryonic kidney cells (293T) were obtained from ATCC and were maintained between passages 15 and 60 in growth medium containing DMEM (Life Technologies), 10% v/v FBS (WiCell), 2 mM L-Glutamine (Life Technologies), and 50 U/mL Penicillin-Streptomycin (Life Technologies). Cells were passaged 1:40 with Trypsin-EDTA (Life Technologies) onto Gelatin-A (Sigma) coated plates. HEK-H2B-mCherry lines were generated through CRISPR-mediated insertion of a modified AAV-CAGGS-EGFP plasmid (Addgene #22212) at the AAVS1 safe harbor locus using gRNA AAVS1-T2 (Addgene #41818). HEK-BFP lines were generated and maintained as mentioned above. All cells were maintained at 37 °C and 5% CO2.

[0113] One Pot Transcription of S1m-sgRNA:

[0114] S1m-sgRNAs were synthesized by first creating a double stranded DNA block that encoded the sgRNA scaffold as well as the S1m aptamer. This scaffold was formed by overlap PCR using Phusion® High-Fidelity Polymerase (New England Biolabs) according to the manufacturer's protocols and was placed in the thermocycler for 30 cycles of 98 °C for 10 s and 72 °C for 15 s with a final extension period of 72 °C for 10 min. A second primer consisting of a truncated T7 promoter, the sgRNA target, and homology to the S1m scaffold was then added to the scaffold and PCR was performed again using Phusion® and placed in a thermocycler at 98 °C for 30 s followed by 35 cycles of 98 °C for 5 s, 60 °C for 10 s, and 72 °C for 15 s, with a final extension period of 72 °C for 10 min. S1m PCR products were then incubated overnight at 37 °C in a HiScribe™ T7 IVT reaction (New England Biolabs) according to manufacturer's protocol. The resulting RNA was purified using MEGAclear™ Transcription Clean-Up Kit (Thermo Fisher) and quantified on a Nanodrop™ 2000.
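The two thermocycler programs described above can be written down as data, which makes the programmed hold time of each reaction easy to check. The Python sketch below is illustrative only; the function and variable names are hypothetical, and ramp times between temperatures, which depend on the instrument, are not included.

```python
def hold_seconds(initial=0, cycles=0, steps=(), final=0):
    """Total programmed hold time in seconds, excluding ramp times."""
    return initial + cycles * sum(steps) + final


# Overlap PCR for the scaffold: 30 cycles of 10 s and 15 s, then 10 min.
overlap_pcr = hold_seconds(cycles=30, steps=(10, 15), final=600)

# Second PCR adding the T7 promoter and target: 30 s, then 35 cycles of
# 5 s, 10 s and 15 s, then 10 min.
t7_pcr = hold_seconds(initial=30, cycles=35, steps=(5, 10, 15), final=600)
```

Under these assumptions the overlap PCR holds for 1350 s (22.5 min) and the second PCR for 1680 s (28 min).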

[0115] S1m RNP Formation:

[0116] NLS-Cas9-NLS protein (Aldevron, Madison, Wis.) was combined with S1m-sgRNAs and allowed to complex for 5 minutes with gentle mixing. To this complex, streptavidin (Life Technologies) was added and the mixture was allowed to complex for an additional 5 minutes. Finally, biotin-ssODNs (Integrated DNA Technologies) were added to the tertiary complex and subsequently vortexed at low speed. This final mixture was then allowed to sit for 10 minutes to ensure complete complexation.

[0117] S1m-sgRNA and Streptavidin Binding Gel Shift Assays:

[0118] S1m-sgRNAs were heated at 75 °C for 5 min and cooled to room temperature for 15 min. 20 pmol S1m-sgRNA was combined with streptavidin at 10:1, 1:1, and 1:10 molar ratios in a final volume of 5 µl and the mixture was allowed to complex for 10 min. The S1m-sgRNA-streptavidin complexes were run on a 1% agarose gel. Tertiary complexes were assembled by first mixing 15 pmol each of S1m-sgRNA and streptavidin. To this mixture, 6, 15, or 30 pmol of ssODN was added prior to running the complexes through a 1% agarose gel. All gels were run using Kb+ Ladder (Invitrogen) as a molecular weight marker to allow for inter-gel size comparisons even when running RNA samples.
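The streptavidin amounts implied by the molar ratios above follow from simple arithmetic, sketched here in Python. The sketch assumes the ratios are written as sgRNA:streptavidin, which the text does not state explicitly; the function names are hypothetical.

```python
from fractions import Fraction


def streptavidin_pmol(sgrna_pmol, ratio):
    """Streptavidin amount for a ratio given as (sgRNA parts, streptavidin parts).

    Assumption: ratios in the text are read as sgRNA:streptavidin.
    """
    sgrna_parts, sa_parts = ratio
    return Fraction(sgrna_pmol) * sa_parts / sgrna_parts


def micromolar(pmol, volume_ul):
    """Concentration in micromolar; 1 pmol per microliter equals 1 micromolar."""
    return Fraction(pmol) / Fraction(volume_ul)


amounts = {r: streptavidin_pmol(20, r) for r in [(10, 1), (1, 1), (1, 10)]}
```

With 20 pmol of S1m-sgRNA, this reading gives 2, 20, and 200 pmol of streptavidin, and the sgRNA concentration in the 5 µl reaction is 4 µM.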

[0119] Biotin Competition Assay:

[0120] S1m-sgRNA was heated to 75 °C for 5 min and cooled to room temperature. 20 pmol each S1m-sgRNA and streptavidin were complexed for 10 min. 80 pmol biotin was added at 30, 20, 10, 5, and 0 min intervals prior to running the complexes through a 1% agarose gel.

[0121] Dynamic Light Scattering:

[0122] DLS was performed using a DynaPro® NanoStar® (Wyatt Technology) using small volume (4 µL) disposable cuvettes. 10 µg of each component was added into the cuvette and diluted as necessary with dH2O to reach 4 µL solution volume. In mixed component conditions, components were allowed to mix for 5 minutes while taking readings. Acquisitions were performed for 20 seconds with a minimum of 4 acquisitions per measurement. 5 measurements were performed per sample and were conducted at room temperature. Data was graphed as a function of percent intensity.

[0123] Quantum Dot Biotin Conjugation:

[0124] To make Qdot-SS-S1mplexes, amine-PEG green fluorescent quantum dots (Qdot® ITK™ 525, ThermoFisher) were reacted with a degradable dithiol biotin linker (EZ-Link™ Sulfo-NHS-Biotin, ThermoFisher) as follows: First, 25 µl of an 8 µM quantum dot solution in 50 mM borate buffer was desalted into PBS using Zeba™ desalting columns (40K MWCO, ThermoFisher) and then reacted with excess sulfoNHS-dithiol-biotin linker for 2 hours at 4 °C with shaking. The conjugate was purified from excess linker through buffer exchange in the desalting columns. Quantum dots retained their fluorescence and were stored at 4 °C until use.

[0125] RNP Delivery:

[0126] HEK transfections were performed using the TransIT-X2® delivery system (Mirus Bio, Madison, Wis.) according to manufacturer's protocol. 2.5×10⁵ cells/cm² were seeded in a 24-well plate 24 hours prior to transfection. RNP complexes were formed as described in 25 µL of Opti-MEM™ (Life Technologies). 1 µg of Cas9 protein, 500 ng sgRNA, 500 ng streptavidin, and 500 ng ssODN were used. In a separate tube, 25 µL of Opti-MEM™ was combined with 0.75 µL of TransIT-X2® reagent and allowed to mix for 5 minutes. TransIT-X2® and RNP solutions were then mixed by gentle pipetting and placed aside for 15 minutes. After this incubation, 50 µL of solution was added dropwise into the well. Media was changed 24 hours post transfection.

[0127] For HEK transfections involving quantum dots, Lipofectamine™ 2000 (Life Technologies) was used for delivery. Qdot-RNP complexes were formed according to the following amounts (for 24 wells: 500 ng of Cas9 protein, 187.5 ng sgRNA, 187.5 ng streptavidin, 3.125 pmol of quantum dots and 3 µL Lipofectamine™ per well; a quarter of these amounts were used when transfecting 5000 cells in 96 well plates).
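The quarter-scale amounts for 96 well plates mentioned above can be derived from the per-well amounts with a one-line scaling step. The following Python sketch is illustrative; the dictionary keys and function name are hypothetical labels, and the per-well values are those stated in the text.

```python
# Per-well amounts for the 24-well format, as stated in the text.
RECIPE_24_WELL = {
    "Cas9 protein (ng)": 500.0,
    "sgRNA (ng)": 187.5,
    "streptavidin (ng)": 187.5,
    "quantum dots (pmol)": 3.125,
    "Lipofectamine 2000 (uL)": 3.0,
}


def scale_recipe(recipe, factor):
    """Return a new recipe with every amount multiplied by factor."""
    return {name: amount * factor for name, amount in recipe.items()}


# A quarter of the 24-well amounts, as described for 96 well plates.
recipe_96_well = scale_recipe(RECIPE_24_WELL, 0.25)
```

Scaling by 0.25 gives 125 ng Cas9 protein, 46.875 ng each of sgRNA and streptavidin, 0.78125 pmol quantum dots, and 0.75 µL Lipofectamine per well.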

[0128] All hPSC transfections were performed using the 4D-Nucleofector™ System (Lonza) in P3 solution using protocol CB150. Cells were pretreated with Rho-kinase (ROCK) inhibitor (Y-27632 Selleck Chemicals) 24 hours prior to transfection. 8 µg Cas9, 3.5 µg sgRNA, 3.5 µg streptavidin, and 1 µg ssODN were used to form particles as described above. Cells were then harvested using TrypLE™ (Life Technologies) and counted. 2×10⁵ cells per transfection were then centrifuged at 100×g for 3 minutes. Excess media was aspirated and cells were resuspended using 20 µL of RNP solution per condition. After nucleofection, samples were incubated in nucleocuvettes at room temperature for 15 minutes prior to plating into one well of a 6-well plate containing E8 media+10 µM ROCK inhibitor. Media was changed 24 hours post transfection and replaced with E8 medium.

[0129] Immunocytochemistry:

[0130] To measure correlation, hPSCs were transfected with Cas9 protein and streptavidin-AF-647. 24 hours post transfection, cells were fixed using 4% PFA and incubated at room temperature for 10 minutes. Cells were then permeabilized using 0.05% Triton X-100 and incubated for 10 minutes. Following two washes with 5% goat serum, Cas9 antibody (Clontech #632607, 1:150) was added to cells and incubated overnight at 4 °C. The next day, cells were rinsed twice with 5% goat serum and then incubated with a goat anti-rabbit secondary antibody (Santa Cruz Biotech #sc-362262, 1:500) for one hour at room temperature. Cells were then washed twice with PBS and mounted for imaging.

[0131] To visualize S1mplexes in the nucleus, human embryonic kidney cells (HEK293T) were plated at 16,000 cells per well in an 8-well chamber slide at day 0. On day 1, 20 .mu.L of transfection media was added to cells in 200 .mu.L of maintenance media. Transfection media contained 20 .mu.L Opti-MEM.RTM. (Life Technologies), 10 pmol Streptavidin Alexa Fluor.RTM. 488 conjugate (Thermo Fisher), and 0.6 .mu.L TransIT.RTM. transfection reagent (Mirus). On day 3, cells were incubated with 1.times. CellMask.TM. Plasma Membrane Stain (ThermoFisher) and 1.times. Hoechst for 10 min. Following incubation at 37.degree. C., cells were immediately washed with PBS and fixed in 4% paraformaldehyde (IBI Scientific) at room temperature for 15 min. Cells were analyzed using a Nikon Eclipse TI epifluorescent microscope and a Nikon AR1 confocal microscope.

[0132] Multispectral Imaging Flow Cytometry:

[0133] hPSCs were transfected and stained as described above. After staining, cells were centrifuged and resuspended in 50 .mu.L PBS. Fluorescence was detected on an ImageStream.RTM. X Mark II (EMD Millipore) according to the manufacturer's instructions. Cellular colocalization was measured with the IDEAS software package (Amnis) using the predefined colocalization wizard.

[0134] Flow Cytometry:

[0135] Flow cytometry of BFP expression and conversion to GFP was measured using a BD FACS Aria using the DAPI and FITC filters and analyzed using FlowJo. Voltages were established by running wild type WA09 hPSCs as well as WA09-BFP hPSCs. Sorting was performed on a BD FACSAria.TM. II with a nozzle size of 100 .mu.m at room temperature and sorted into culture media.

[0136] Genomic Analysis:

[0137] DNA was isolated from cells using DNA QuickExtract.TM. (Epicentre, Madison, Wis.) following treatment by 0.05% trypsin-EDTA and centrifugation. QuickExtract.TM. solution was incubated at 65.degree. C. for 15 minutes, 68.degree. C. for 15 minutes, and finally 98.degree. C. for 10 minutes. Genomic PCR was performed following manufacturer's instructions using AccuPrime.TM. HiFi Taq (Life Technologies) and 500 ng of genomic DNA. Products were then purified using AMPure.RTM. XP magnetic bead purification kit (Beckman Coulter) and quantified using a Nanodrop.TM. 2000. For deep sequencing, samples were pooled and run on an Illumina HiSeq.TM. 2500 High Throughput at a run length of 2.times.125 bp or an Illumina MiSeq.RTM. at 2.times.150 bp.

[0138] Deep Sequencing Data Analysis:

[0139] A custom Python script was developed to perform sequence analysis. The pipeline starts with preprocessing, which consists of filtering out low quality sequences and finding the defined ends of the reads. For each sample, sequences with a frequency of less than 100 were filtered from the data. Sequences in which the reads matched the primer and reverse complement subsequences were classified as "target sequences". Target sequences were aligned with the corresponding wildtype sequence using global pairwise sequence alignment. Sequences that were misaligned around the expected cut site were classified as NHEJ events, while sequences that had insertions larger than 15 bp were classified as HDR events. The frequency, length, and position of matches, insertions, deletions, and mismatches were all tracked in the resulting aligned sequences.
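The steps described above can be illustrated with a minimal Python sketch. It is not the custom script itself, which is not reproduced in this document. All function names, the `window` parameter, and the precedence of the HDR rule over the NHEJ rule are assumptions made for illustration. In place of global pairwise alignment, the sketch locates the edited interval by trimming the prefix and suffix shared with the wildtype sequence, a deliberate simplification.

```python
from collections import Counter

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def filter_by_frequency(reads, min_count=100):
    """Keep unique sequences observed at least min_count times."""
    counts = Counter(reads)
    return {seq: n for seq, n in counts.items() if n >= min_count}


def extract_target(read, fwd_primer, rev_primer):
    """Trim a read to the span bounded by the forward primer and the
    reverse complement of the reverse primer; None if either is missing."""
    start = read.find(fwd_primer)
    if start == -1:
        return None
    rc = reverse_complement(rev_primer)
    end = read.find(rc, start + len(fwd_primer))
    if end == -1:
        return None
    return read[start:end + len(rc)]


def changed_span(read, wildtype):
    """Return the wildtype interval [left, right) that differs from the read,
    found by trimming the shared prefix and suffix (stand-in for alignment)."""
    limit = min(len(read), len(wildtype))
    left = 0
    while left < limit and read[left] == wildtype[left]:
        left += 1
    tail = 0
    while tail < limit - left and read[-1 - tail] == wildtype[-1 - tail]:
        tail += 1
    return left, len(wildtype) - tail


def classify(read, wildtype, cut_site, window=3, hdr_min_insert=15):
    """Label a target sequence as WT, HDR, NHEJ, or other."""
    if read == wildtype:
        return "WT"
    # Net insertions larger than 15 bp are treated as HDR events.
    if len(read) - len(wildtype) > hdr_min_insert:
        return "HDR"
    # Any other change overlapping the cut site (within `window`) is NHEJ.
    left, right = changed_span(read, wildtype)
    if left - window <= cut_site <= right + window:
        return "NHEJ"
    return "other"
```

A production pipeline would replace `changed_span` with a true global alignment so that matches, insertions, deletions, and mismatches can be tracked position by position, as the paragraph above describes.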

[0140] Cell Membrane Staining:

[0141] Human embryonic kidney cells (HEK293) were plated at 16,000 cells/well in an 8-well chamber slide at day 0. On day 1, 20 .mu.L of transfection media was added to cells in 200 .mu.L of maintenance media. Transfection media contained 20 .mu.L Opti-MEM.RTM. (Life Technologies), 400 ng Streptavidin Alexa Fluor.RTM. 488 conjugate (Thermo Fisher), and 0.6 .mu.L TransIT.RTM. transfection reagent (Mirus). On day 3, cells were incubated with 1.times. CellMask.TM. Plasma Membrane Stain (ThermoFisher) and 1.times. Hoechst for 10 min. Following incubation at 37.degree. C., cells were immediately washed with PBS and fixed in 4% paraformaldehyde (IBI Scientific) at room temperature for 15 min. Cells were analyzed using a Nikon Eclipse TI epifluorescent microscope and a Nikon AR1 confocal microscope.

[0142] Statistics:

[0143] All error bars are shown as .+-.1 standard deviation. p values were computed using a Student's two-tailed t-test and deemed significant at p<0.05.

[0144] Nucleic Acid Sequences:

[0145] The relevant nucleic acid sequences are provided in the following tables:

TABLE-US-00002 TABLE 1 Primers used to create sgRNA and S1m-sgRNAs.
S1m Construct Name   Sequence (5' to 3')   SEQ ID NO:
S1m-sgRNA-1_F   GTTTAAGAGCTATGCTGCGAATACGAGATGCGGCCGCCGACCAGAATCATGCAAGTGCGTAAGATAGTCGCGGGTCGGCGGCCGCATCTCGTATTC   8
S1m-sgRNA-1_R   AAAAGCACCGACTCGGTGCCACTTTTTCAAGTTGATAACGGACTAGCCTTATTTAAACTTGCTATGCTGCGAATACGAGATGCGGCCGCCGACCCG   9
S1m Forward   TTAATACGACTCACTATAGGNNNNNNNNNNNNNNNNNNNNGTTTAAGAGCTATGCTGCGA   10
RNATracR   AAAAGCACCGACTCGGTGCC   11

TABLE-US-00003 TABLE 2 Protospacer and respective PAMs used for genomic targeting.
sgRNA Name   Sequence (5' to 3')   PAM   SEQ ID NO:
BFP (BFP .fwdarw. GFP)   GCTGAAGCACTGCACGCCAT   GGG   12
EMX1 (EMX1_21)   GTCACCTCCAATGACTAGGG   TGG   13
mCherry (mCherry_15)   GGAGCCGTACATGAACTGAG   GGG   14
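As an illustration of how the "S1m Forward" primer template of Table 1 (SEQ ID NO: 10) relates to the protospacers of Table 2, the following Python sketch fills the 20-nucleotide N stretch with a protospacer. That the N positions correspond to the protospacer is an assumption made here for illustration, based on the matching lengths. The function and constant names are hypothetical.

```python
# Flanking segments of the "S1m Forward" template (SEQ ID NO: 10) in Table 1.
# The 5' flank contains the T7 promoter consensus TAATACGACTCACTATAG.
FIVE_PRIME_FLANK = "TTAATACGACTCACTATAGG"
THREE_PRIME_FLANK = "GTTTAAGAGCTATGCTGCGA"
PROTOSPACER_LENGTH = 20  # number of N positions in the template


def s1m_forward_primer(protospacer):
    """Place a 20-nt protospacer between the two fixed flanks (assumed usage)."""
    protospacer = protospacer.upper()
    if len(protospacer) != PROTOSPACER_LENGTH:
        raise ValueError("protospacer must be 20 nucleotides long")
    if set(protospacer) - set("ACGT"):
        raise ValueError("protospacer must contain only A, C, G, T")
    return FIVE_PRIME_FLANK + protospacer + THREE_PRIME_FLANK


# BFP protospacer from Table 2 (SEQ ID NO: 12).
bfp_primer = s1m_forward_primer("GCTGAAGCACTGCACGCCAT")
print(bfp_primer)
```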

TABLE-US-00004 TABLE 3 Forward and reverse primers for genomic loci.
Genomic Primer   Forward (5' to 3')   SEQ ID NO:   Reverse (5' to 3')   SEQ ID NO:
EMX1   CCATCCCCTTCTGTGAATGT   15   GGAGATTGGAGACACGGAGA   16
EMX1 Symmetric   TCCACCTTGGCTTGGCTTTG   17   CCCTCCACCAGTACCCCAC   18
mCherry Interior   AAGGGCGAGGAGGATAACATGG   19   TTGTACAGCTCGTCCATGCCG   20
EMX1 Insertion   CCAATGACAAGCTTGCTAGC   21

TABLE-US-00005 TABLE 4 ssODNs used to direct HDR after DSB formation.
ssODN Donor   Sequence (5' to 3')   SEQ ID NO:
BFP .fwdarw. GFP NT   TCATGTGGTCGGGGTAGCGGCTGAAGCACTGCACGCCATGGGTCAGGGTGGTCACGAGGGTGGGCCAGGGCACCGGCAGCTTGCCGGTGGTGCAGATGAA   22
BFP .fwdarw. GFP 5PCBio NT   5Biotin/TCATGTGGTCGGGGTAGCGGCTGAAGCACTGCACGCCATGGGTCAGGGTGGTCACGAGGGTGGGCCAGGGCACCGGCAGCTTGCCGGTGGTGCAGATGAA   23
EMX1 NT   AAGCAGCACTCTGCCCTCGTGGGTTTGTGGTTGCCCACCGCTAGCAAGCTTGTCATTGGAGGTGACATCGATGTCCTCCCCATTGGCCTG   24
EMX1 5PCBio NT   5Biotin/AAGCAGCACTCTGCCCTCGTGGGTTTGTGGTTGCCCACCGCTAGCAAGCTTGTCATTGGAGGTGACATCGATGTCCTCCCCATTGGCCTG   25

TABLE-US-00006 TABLE 5 Off-target sequences and corresponding genomic locus for each sgRNA used. Mismatches from protospacer are labelled in red.
sgRNA Target Sequence: BFP .fwdarw. GFP   GCTGAAGCACTGCACGCCAT (SEQ ID NO: 26)
Off-Target   Sequence   SEQ ID NO:   PAM   Locus
OT1   GCAGAAGCACTGCAAGCCAT   27   CAG   chr17: +39786906
OT2   TCTGAAGTGCTGCACGCCAT   28   CAG   chr2: -238397265
OT3   GTGGAAGCACTGCAAGCCAT   29   TGG   chr7: -11228464
OT4   GGTGGAGCAGGGCACGCCAT   30   CAG   chr9: +109114765
OT5   GAAGAAGCACTGCACCCCAT   31   CAG   chr13: -75660548
sgRNA Target Sequence: EMX1   GTCACCTCCAATGACTAGGG (SEQ ID NO: 32)
Off-Target   Sequence   SEQ ID NO:   PAM   Locus
OT1   AGGACCACCAATGACTAGGG   33   CAG   chr3: -64303990
OT2   ACCACCTGTAATGACTAGGG   34   TAG   chr4: -149749778
OT3   GGAGCCTCCAGTGACTAGGG   35   GAG   chr17: -38423030
OT4   GTGAACTACAGTGACTAGGG   36   TGG   chr8: +112210096
OT5   CTGGCCTCCAAAGACTAGGG   37   GAG   chr15: -75011931
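Because the color highlighting of mismatches does not survive in plain text, the mismatched positions in Table 5 can be recomputed from the sequences themselves. The short Python sketch below does this; the function names are hypothetical, and marking mismatches in lowercase is simply one convenient text-only convention.

```python
def mismatch_positions(protospacer, off_target):
    """Return 1-based positions where the off-target differs from the protospacer."""
    if len(protospacer) != len(off_target):
        raise ValueError("sequences must have equal length")
    return [i + 1 for i, (a, b) in enumerate(zip(protospacer, off_target)) if a != b]


def mark_mismatches(protospacer, off_target):
    """Return the off-target sequence with mismatched bases in lowercase."""
    if len(protospacer) != len(off_target):
        raise ValueError("sequences must have equal length")
    return "".join(b if a == b else b.lower() for a, b in zip(protospacer, off_target))


# Sequences copied from Table 5.
BFP_PROTOSPACER = "GCTGAAGCACTGCACGCCAT"   # SEQ ID NO: 26
BFP_OT1 = "GCAGAAGCACTGCAAGCCAT"           # SEQ ID NO: 27
EMX1_PROTOSPACER = "GTCACCTCCAATGACTAGGG"  # SEQ ID NO: 32
EMX1_OT1 = "AGGACCACCAATGACTAGGG"          # SEQ ID NO: 33

print(mark_mismatches(BFP_PROTOSPACER, BFP_OT1), mismatch_positions(BFP_PROTOSPACER, BFP_OT1))
print(mark_mismatches(EMX1_PROTOSPACER, EMX1_OT1), mismatch_positions(EMX1_PROTOSPACER, EMX1_OT1))
```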

TABLE-US-00007 TABLE 6 Forward and reverse primers used to amplify off-target genomic loci.
Off-Target Primer   Forward (5' to 3')   SEQ ID NO:   Reverse (5' to 3')   SEQ ID NO:
BFP OT1   TTTCCTAGCAAGCAGACTCAGA   38   AGCTGTCCTTTGTCCCATTGA   39
BFP OT2   TCTCCATGCCCTCCTTTCCAT   40   GGATGTAGTCCATGATCTTCCCC   41
BFP OT3   TCCCAGAATGTGAAAGTGGAGG   42   CTGTGGGCTTTCCTCAGCTC   43
BFP OT4   GCTGACTAACGTCCACTGCT   44   TGGACCTATGTTTTTCTTCGTCAC   45
BFP OT5   AAAGTCTGTGGCCTTGTGAGA   46   AACCCTACCCCCTACCTGAA   47
EMX1 OT1   TTCCCCAGGTAGTTGCTGTTC   48   TCTGCACATGTCCCAACTGTC   49
EMX1 OT2   ATCCGTACCTAACCATGACCC   50   GCACAGATCTTGGTGGCTTT   51
EMX1 OT3   GGCTGGGTTTCCCAAACGTA   52   CAAACTGCTGTGTTGGGTGG   53
EMX1 OT4   ACTTGGAAGGGTCCACACAA   54   CCTTGAATAGAGCATTTTTCCCCA   55
EMX1 OT5   TCCTACCCTTGGATGGGGTT   56   GGGCTACACGGTCCCTAAAG   57

Example 1: Design of Modified sgRNA

[0146] A novel sgRNA with a modification at the stem loop closest to the 5' end of the sgRNA was designed (FIG. 3). This location was chosen because it has previously been shown to tolerate additions with a minimal loss in activity. An S1m aptamer was added, which has a strong non-covalent interaction with streptavidin. The added S1m aptamer extends the sgRNA stem loop closest to the 5' end and contains two distinct bulges used for binding. These modifications do not otherwise disrupt the predicted sgRNA secondary structure (FIG. 3). We confirmed that S1m-sgRNAs can be made rapidly in vitro via one-pot transcription and are larger than standard sgRNAs when analyzed by agarose gel electrophoresis (FIG. 3).

[0147] Similar experiments were performed with sgRNAs S1m-sgRNA-1, S1m-sgRNA-2, S1m-sgRNA-3, and S1m-sgRNA-V3.

Example 2: Formation of Streptavidin and Cas9 Complexes with Modified sgRNA

[0148] Next, we verified the ability of S1m-sgRNAs to complex with streptavidin in vitro by combining a constant amount of S1m-sgRNA with increasing amounts of streptavidin. The electrophoretic front of the S1m-sgRNA slowed as streptavidin levels increased (FIG. 5). At the maximum amount of streptavidin, 40% of the front had slowed, demonstrating the binding of the S1m-sgRNA to streptavidin. In contrast, when the same amount of standard (non-S1m) sgRNA was run with streptavidin, the electrophoretic front remained constant.

[0149] To demonstrate the ability of S1m-sgRNA-1 to complex with streptavidin and Cas9 protein simultaneously, we performed dynamic light scattering (DLS). When streptavidin and Cas9 were combined in solution, two peaks were distinct at 3.0 nm and 7.8 nm (FIG. 6), both of which match closely the radii previously reported for each protein. We next formed Cas9 RNPs with excess standard sgRNAs and observed that the species formed were larger than Cas9 alone and did not increase in radius with the addition of streptavidin. Excess sgRNA was not detected by DLS and was included in the DLS studies to ensure all key components were able to assemble together (data not shown). Additionally, these samples had a discernible peak corresponding to the presence of streptavidin alone. RNPs containing S1m-sgRNAs and Sp. Cas9 protein increased in radius by a larger amount than RNPs containing standard sgRNAs and Sp. Cas9 protein, likely due to the increased length of S1m-sgRNAs. When streptavidin was added to S1m-sgRNA RNPs, the average radius of the complex was increased by .about.3 nm, the radius of streptavidin protein. These ternary complexes of Sp.Cas9, S1m-sgRNA-1, and streptavidin are termed "S1mplexes". The second, larger peak in the S1mplex DLS trace is attributed to the tetrameric nature of streptavidin, which can harbor up to four RNPs.

[0150] While assembly of S1mplexes in vitro is important, the maintenance of complexes post-delivery is imperative to gene editing function. To demonstrate this capability, we delivered Cas9 protein and streptavidin in combination with either sgRNAs or S1m-sgRNAs into human pluripotent stem cells (hPSCs) via nucleofection and conducted immunocytochemistry for the two protein components. Multispectral imaging flow cytometric analysis of single fixed cells confirmed the co-localization of the two protein components within hPSCs (FIG. 7). Significantly higher correlation in the fluorescent signals from the two protein components was seen when S1m-sgRNA-1 was included (p<10.sup.-5, Student's two-tailed t-test, FIG. 8). To gain further subcellular resolution of these components after S1mplex delivery, images obtained using confocal microscopy on fixed, intact hPSC cultures were analyzed using CellProfiler for overlap between the two components within the nuclei. At 24 hours after delivery, the correlation between the fluorescent signals arising from Cas9 and streptavidin within the nucleus was significantly higher when using S1m-sgRNAs than sgRNAs (p<0.05, Student's two-tailed t-test, FIGS. 9, 10). Together, these results indicate that complexes between Cas9 and streptavidin are preserved specifically through the S1m aptamer during transfection and subsequent subcellular trafficking such as nuclear transport.

Example 3: Formation of a Quaternary Complex with Donor DNA Template

[0151] After demonstrating the ability to form S1mplexes, we searched for a method to combine donor DNA template with S1mplexes and form a quaternary complex. Given the strong interaction between streptavidin and biotin (K.sub.D=10.sup.-15 M), we selected biotinylated single-stranded oligodeoxynucleotide (ssODN) donor templates. All components (S1m-sgRNA, streptavidin, biotin-ssODN) were run individually on a gel and compared side-by-side with standard reagents (sgRNA, ssODN) to establish baseline migration characteristics. The biotin-ssODN ran slightly higher than the standard ssODN, presumably due to the biotin modification (FIG. 11, 12). Ternary complexes were formed using varying levels of biotin-ssODNs. The primary band displayed a higher electrophoretic shift than either the sgRNA or ssODN alone, indicating complex formation (FIG. 11, lanes 5-7). To demonstrate that all components combined successfully, unmodified ssODNs were run in the place of biotin-ssODNs. The unmodified ssODN displayed the expected electrophoretic shift despite the presence of the S1m-streptavidin complex (FIG. 12, lanes 8-10). Finally, standard sgRNA was run with streptavidin and biotin-ssODN. In this condition, the smeared band from S1m-streptavidin binding was not observed and instead solid bands representing sgRNA and ssODN-streptavidin were present (FIG. 12, lane 11).

[0152] Due to the strong interaction of biotin and streptavidin, we needed to ensure that biotin did not displace S1m-sgRNA-1 already bound to streptavidin when added in solution. To do so, we combined S1m-sgRNA-1 with streptavidin at a 1:1 molar ratio. We then added a 4-fold molar excess of biotin to occupy every binding site on each streptavidin molecule and incubated the complex for 0, 5, 10, 20, or 30 minutes. After incubation, the gel shift following electrophoresis was not different from that of bound S1m-sgRNA:streptavidin combinations, suggesting that biotin did not interfere with the S1m-streptavidin interaction at four times the concentrations used in this study (data not shown).

Example 4: Gene Editing Activity of S1m-sgRNAs in Human Cells

[0153] Next, we examined the ability of S1m-sgRNAs to edit genes within human cells. We created a human embryonic kidney (HEK) cell line that constitutively expressed blue fluorescent protein (BFP) from an integrated transgene. DSBs produced by sgRNAs that target the fluorophore in combination with Cas9 expressed from a transfected plasmid are repaired predominantly through NHEJ, with indel formation at the DSB. NHEJ-mediated gene edits are expected to result in a loss of BFP fluorescence within this HEK line. After delivery of S1m-sgRNAs and a plasmid encoding Cas9 to this HEK line, BFP expression was analyzed via flow cytometry. All S1m-sgRNAs (1, 2, and 3) created indels at approximately half the frequency of standard sgRNAs (data not shown). While the .about.2-fold decrease in generating indel edits is significant, such decreases in indel formation have been linked to a concomitant decrease in off-target effects.

[0154] We also created a human embryonic kidney (HEK) cell line that constitutively expressed a histone 2B (H2B)-mCherry fusion protein generated by integrating a transgene into one chromosome at the safe harbor AAVS1 locus. DSBs produced by sgRNAs that target the mCherry fluorophore in combination with Sp. Cas9 expressed from a transfected plasmid will be repaired predominantly through NHEJ, with indel formation at the DSB. NHEJ-mediated gene edits are expected to create a loss of mCherry fluorescence assayed via flow cytometry. When transfected into cells, S1m-sgRNAs created NHEJ gene edits at approximately half the frequency of standard sgRNAs, knocking out fluorescence in 45% of cells compared to 83% loss by standard sgRNAs (FIG. 13). While the .about.2-fold decrease in generating NHEJ edits is significant, such decreases in NHEJ activity have been linked to a concomitant decrease in off-target effects.

Example 5: Increased HDR to Indel Ratios in Human Cells

[0155] We tested the ability of all three ssODN-S1mplexes to induce HDR in a hPSC line containing a BFP-expressing transgene that can be switched to express GFP through a 3 nucleotide switch (data not shown). S1mplexes with biotin-ssODNs (ssODN-S1mplexes) were assembled using one of the three S1m-sgRNAs and compared to standard sgRNAs and ssODN combinations. After delivery of ssODN-S1mplexes and subsequent deep sequencing of genomic DNA, we found that all three ssODN-S1mplexes had a higher ratio of HDR:indel editing than standard RNPs. ssODN-S1mplexes with S1m-sgRNA-1 and S1m-sgRNA-2 induced similar ratios of HDR:indel editing while ssODN-S1mplexes with S1m-sgRNA-3 had a slightly depressed HDR:indel ratio (FIG. 14). The decreased HDR:indel ratio found using S1m-sgRNA-3 may have been due to the lower binding affinity of this sgRNA with streptavidin, as seen in the EMSA (data not shown). In order to minimize the frequency of indel mutations while maximizing HDR, we decided to use S1m-sgRNA-1 for all remaining experiments and will refer to it henceforth simply as S1m-sgRNA.

[0156] With this knowledge, we then evaluated S1mplexes in multiple human cell lines for their ability to generate a variety of precise nucleotide changes. We assembled ssODN-S1mplexes to again switch BFP to GFP. After delivery to HEK cells, deep sequencing revealed that the ssODN-S1mplexes enriched the ratio of precise insertions to imprecise editing 18.4-fold over standard RNPs and approached a ratio of four precise edits to every one indel (FIG. 15). When the same experiments were conducted in hPSCs, results from flow cytometry assays were consistent with these conclusions from deep sequencing (data not shown). Additionally, when introducing a 12 nucleotide insertion into the EMX1 locus.sup.29 of HEKs with ssODN-S1mplexes, the ratio of precise insertions to imprecise editing increased 2.7-fold over standard sgRNA RNPs (FIG. 16 and data not shown). Taken together, this shows that ssODN-S1mplexes are able to shift the balance of editing to enrich for small, precise edits within the genome.
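The fold-enrichment figures quoted above are ratios of ratios. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the value computed for standard RNPs is only what the two reported numbers imply (a ratio approaching 4:1 and an 18.4-fold enrichment); it is not a separately reported measurement.

```python
def precise_to_imprecise(precise_reads, imprecise_reads):
    """Ratio of precisely edited to imprecisely edited (indel) alleles."""
    if imprecise_reads == 0:
        raise ValueError("no imprecise reads; ratio is undefined")
    return precise_reads / imprecise_reads


def fold_enrichment(test_ratio, reference_ratio):
    """How many times larger the test ratio is than the reference ratio."""
    return test_ratio / reference_ratio


# Values stated in the text for the BFP-to-GFP switch in HEK cells.
S1MPLEX_RATIO = 4.0    # "four precise edits to every one indel" (approximate)
REPORTED_FOLD = 18.4   # enrichment over standard RNPs

# Ratio for standard RNPs implied by the two numbers above.
implied_standard_ratio = S1MPLEX_RATIO / REPORTED_FOLD
print(round(implied_standard_ratio, 3))
```

Read this way, the reported numbers correspond to roughly one precise edit for every four to five indels with standard RNPs.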

[0157] We tested the ability of this strategy to create even larger sequence changes in hPSCs by designing an ssODN that carried a variable 18 nucleotide insertion. We deep sequenced the cell population after delivery of ssODN-S1mplexes, again targeting the BFP and EMX1 loci. When standard sgRNA RNPs were transfected with streptavidin-ssODN complexes, minimal insertion was seen with a subsequently low ratio of precise HDR to imprecise indel alleles (FIG. 17). Equivalent precise:imprecise ratios were seen when standard sgRNA RNPs and ssODNs were transfected as when S1m-sgRNA RNPs were transfected with biotin-ssODN (without streptavidin) (FIG. 17 and data not shown). However, levels of indels were increased in the sgRNA RNP-free ssODN condition (data not shown). When the full ssODN-S1mplexes were transfected into hPSCs, HDR insertion levels greatly increased (data not shown) as did the ratio of precisely-edited to imprecisely-edited alleles to 9.7-fold over standard RNP methods (FIG. 17). Again, we observed four precise edits to every one indel with ssODN-S1mplexes at this locus. At the endogenous EMX1 locus, we delivered the S1m-sgRNA RNPs with biotin-ssODNs either with or without streptavidin. When streptavidin was added to generate the full ssODN-S1mplex, rates of insertion increased 51-fold (data not shown), and the ratio of precise to imprecise gene-editing increased 15-fold (FIG. 18). Taken together, each component of the ssODN-S1mplex is necessary to drive higher HDR: indel ratios within human cells.

Example 6: Design Constraints on the ssODN-S1mplex

[0158] Recent studies have reported that the design of the ssODN has a significant effect on the rate of HDR. Accordingly, we explored various ssODN designs with ssODN-S1mplexes. Designs were limited to a 100 nucleotide length for ease of synthesis, but varied as follows: asymmetrical around the cut site, extending 30 bp upstream and 67 bp downstream or vice-versa; either identical to the sequence containing the PAM or its reverse complement (non-PAM); and biotinylated on either the 5' or 3' end of the ssODN (FIGS. 19, 20, left). S1mplexes containing each unique ssODN were assembled and transfected separately into BFP-expressing hPSCs. Four days after delivery, genomic DNA from each condition was collected and analyzed using deep sequencing. Under these conditions, 2.8.+-.2.2% of alleles in all samples were edited via HDR and NHEJ (FIG. 19, top and data not shown). We observed that neither the asymmetry, the sidedness, nor the biotin location on the ssODN had a significant effect on the HDR or indel outcomes using ssODN-S1mplexes (FIG. 19, top and data not shown). Precise editing ranged from 2-10 times greater than imprecise editing (FIG. 20, top and data not shown).

[0159] We next sought to test these ssODN designs at an endogenous GAA locus using a patient-derived hPSC line that contains a pathogenic 1 bp deletion in exon 10 on one allele. We designed sgRNAs that target only the mutant allele as well as ssODNs to correct the mutation to wildtype and modify the PAM site. These ssODNs were again asymmetrical, 34 bp upstream and 66 bp downstream from the cut site, complementary to the PAM or non-PAM strand, and biotinylated at either the 5' or 3' end of the ssODN (FIG. 19, 20, bottom). At this locus ssODN-S1mplexes again had higher levels of precise to imprecise editing than RNPs consisting of sgRNAs, with 3-8 precise edits occurring for every imprecise edit (FIG. 20, bottom and data not shown). Consistent with the sequencing results at the BFP locus, absolute levels of HDR and NHEJ editing were 2.0.+-.1.1% (FIG. 19, 20, bottom and data not shown). There was still no significant difference between any of the ssODNs tested when complexed to the S1mplex.

Example 7: Imaging of S1mplex-Transfected Cells

[0160] To facilitate isolation of the precisely-edited cells, we pursued a strategy to label the cells that received the S1mplexes by including additional biotinylated fluorescent cargoes. We preassembled standard streptavidin-conjugated quantum dots (QdotSA, 20 nm diameter) with S1mplexes (QdotSA-S1mplexes, FIG. 21, bottom). After transfection of QdotSA-S1mplexes, a subpopulation of cells contained Qdots within the cytoplasm. High-intensity green fluorescence dots were distributed variably across the transfected cell population, indicating that standard transfection methods likely generate significant heterogeneity in the number of RNPs delivered to each cell. Despite the presence of Qdots in the cytoplasm, no gene editing was observed upon further culture and analysis within the HEK H2B-mCherry reporter cell line (FIG. 22, FIG. 23). When the biotin linkage of the S1mplex to the Qdot was mediated through a pH-sensitive disulfide linker (Qdot-SS-S1mplex, FIG. 21, top), we observed a gain in gene editing activity (FIG. 22), while the Qdots remained largely within the cytoplasm (FIG. 24), suggesting separation and nuclear transport of the RNP. The fluorescence from the Qdot at 24 hours post transfection was utilized for fluorescence activated cell sorting (FACS). There was a shift in fluorescence for the whole cell population, indicating uptake of Qdot-S1mplexes in most cells, although to differing extents (FIG. 25). Sorted cells with positive fluorescent signal were gene edited at 3.7-fold higher rates than cells transfected using standard methods (FIG. 26).

Example 8: Multiplexed Gene Editing with S1mplexes

[0161] To obtain further control and refine the mutagenic spectrum of S1mplexes, we attached a fluorescent label directly to streptavidin that could be used for identification during flow cytometry. We preassembled an S1m-sgRNA and biotin-ssODN targeting BFP with a streptavidin labeled with a red fluorophore (AlexaFluor.RTM.-594) (FIG. 27) and then performed single-cell FACS to isolate clones that had high fluorescence after delivery. Upon further cell culture, clones were analyzed by Sanger sequencing for editing at the BFP locus. Of the 34 isolated clones in the S1mplex-positive population, eight underwent HDR, eight harbored indels, and the rest remained unedited (FIG. 28). In comparison, when using sgRNAs, seven of the 41 isolated clones harbored indels and none were positive for HDR. Cell populations did not contain mosaic gene editing, indicating that defined gene editing outcomes could be enriched by FACS on the S1mplex fluorescence. Using this capability, we tested whether it was possible to multiplex edits using differently colored S1mplexes. We thus assembled the same ssODN-S1mplex targeting BFP, termed red-ssODN-S1mplex, and separately complexed an S1m-sgRNA and biotin-ssODN targeting EMX1 with a streptavidin labeled with a green fluorophore (AlexaFluor.RTM.-488), termed green-ssODN-S1mplex (FIG. 27). The two ssODN-S1mplexes were mixed and transfected simultaneously into HEKs (FIG. 29).
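The clone counts reported above for the single-color sort can be tabulated as outcome fractions with a few lines of Python. The counts are taken from the text; the unedited counts are derived by subtraction, and the dictionary and function names are illustrative only.

```python
def outcome_fractions(counts):
    """Convert per-outcome clone counts to fractions of all isolated clones."""
    total = sum(counts.values())
    return {outcome: n / total for outcome, n in counts.items()}


# 34 clones from the S1mplex-positive population: 8 HDR, 8 indel, rest unedited.
s1mplex_clones = {"HDR": 8, "indel": 8, "unedited": 34 - 8 - 8}
# 41 clones obtained with standard sgRNAs: 7 indel, no HDR.
standard_clones = {"HDR": 0, "indel": 7, "unedited": 41 - 7}

s1mplex_fractions = outcome_fractions(s1mplex_clones)
standard_fractions = outcome_fractions(standard_clones)

for label, fractions in (("S1mplex", s1mplex_fractions), ("standard", standard_fractions)):
    print(label, {k: f"{v:.1%}" for k, v in fractions.items()})
```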

[0162] Twenty-four hours post transfection, we sorted cells using FACS into one of four populations: positive for either fluorophore, both, or neither (FIG. 30). Only the top 2% of each population was taken, as we observed some association of the fluorescent S1mplex with the cell membrane in addition to robust fluorescent signal within the nucleus of some of the cells (FIG. 29). One week post sort, each of the four populations was analyzed for editing via deep sequencing as well as by flow cytometry for BFP editing or insert-based PCR for EMX1. Deep sequencing revealed that editing at the EMX1 locus was increased in the presence of green-ssODN-S1mplexes (Green+ and double positive fractions) (FIG. 30, and data not shown). In these populations the ratio of precise to imprecise edits increased and approached one and was 2-fold greater than that of the double negative fraction (data not shown). Similarly, editing at the BFP locus was increased in the Red+ and double positive fractions. As was seen in previous deep sequencing experiments, the ratio of precise to imprecise edits was elevated in the presence of S1mplexes. With the addition and sorting of fluorescent S1mplexes, the ratio was greater than 10 insertions per indel (FIG. 30 and data not shown). Interestingly, the level of indels was highest in the double negative fraction (data not shown); this may be due to the presence of unlabeled RNPs that did not complex with streptavidin. Results with conventional flow cytometry and PCR assays followed the same trends, consistent with these conclusions from deep sequencing (data not shown). We analyzed the top 5 off-target sites for both the BFP and EMX1 sgRNAs using TIDE.sup.31 in the sorted fractions as well as previous samples used for deep sequencing. None of the sorted populations using ssODN-S1mplexes had modifications above the TIDE limit of detection (FIG. 31, data not shown). However, using standard sgRNA RNPs, notable off-target mutagenesis occurred at EMX1 off-target site 2 (data not shown). Taken together, the assembly of S1mplex particles with a fluorescent tag can be used to create multiple, precise edits with increased efficiency without needing multiple transfections or extended culture.

[0163] We analyzed the top 5 off-target sites for both the BFP and EMX1 sgRNAs using TIDE in the sorted fractions as well as previous samples used for deep sequencing. None of the sorted populations using ssODN-S1mplexes had modification above the limit of detection (FIG. 32). However, using standard sgRNA RNPs, notable off-target mutagenesis occurred at EMX1 off-target site 2 (FIG. 32). Taken together, the pairing of S1mplex particles with a fluorescent tag can be used to create multiple, precise edits with increased efficiency without needing multiple transfections or extended culture.

[0164] FIG. 33 shows that release of a biotin-ssODN through a photocleavable linkage had no significant effect on HDR editing. FIG. 33a shows that a biotin-ssODN containing a UV-cleavable linker was attached to streptavidin and S1mplex particles in order to study the potential of releasing the ssODN inside the cell to promote HDR. Lane 1: DNA standard. Lane 2: Photo-cleavable biotin-ssODN. Lane 3: standard ssODN. Lane 4: Binary complexes of streptavidin and photo-cleavable biotin-ssODNs. Lanes 5-6: Binary complexes cleaved by either exposure to light through a DAPI filter cube (lane 5) or exposure to a UV transilluminator (lane 6). The DAPI filter cube cleaved nearly all ssODN after 10 minutes, whereas the transilluminator gave complete cleavage. The cleaved DNA product was the same length as the control standard ssODN. FIG. 33b shows release of biotin-ssODN by 15 minutes of light exposure through a DAPI filter cube every hour post transfection. Levels of HDR were not significantly affected by the release of the ssODN within the cell at any time point (n=3 biological replicates).

Conclusions from Examples 1-8

[0165] The S1mplex strategy provides a straightforward, robust and modular method to regulate the gene editing activity of Sp.Cas9 RNPs. RNA modification of the sgRNA with S1m can be performed readily through short nucleic acid synthesis methods, whereas other methods that engineer the Cas9 protein can add challenges in protein expression, purification and stability. Our strategy could complement and add functionality to generate engineered variants (e.g., high fidelity, switchable, and optogenetic nucleases). Pre-assembled S1mplexes could also be readily manufactured to be off-the-shelf reagents with well-defined critical quality attributes appropriate for clinical use: avidin has previously been tolerated in clinical trials and clinical grade Sp.Cas9 is available from several vendors.

[0166] Gene editing in human cells could be controlled by the linkages within the S1mplex. For the Qdot-S1mplexes, a gain of RNP activity occurred after switching to a labile disulfide bond. Without being held to theory, it is believed that large cargoes such as Qdots (20 nm diameter) complexed with the RNP inhibit Cas9 nuclease activity. The smaller ssODN-S1mplexes without labile bonds with mean diameters of 16 nm could generate edits at target loci. The Qdot-S1mplex results demonstrate that the biotin-streptavidin linkage is strong enough to associate biotinylated cargoes with the RNP, while disulfide bonds, which are enzymatically labile at low pH, likely dissociate the S1mplex in low pH endocytotic trafficking compartments and release the RNP from the cargo to fully recover activity. Regulating CRISPR gene editing tightly through the release of large cargoes could be explored with other chemistries that generate labile cargoes upon excitation by light or heat. Such strategies could advance targeted therapy to specific areas and cell types within the body.

[0167] The site-specific complexation of the HDR donor template with the RNP through a biotin-streptavidin noncovalent interaction and an S1m RNA aptamer-streptavidin interaction favored precise gene editing outcomes at a ratio of .about.1-10 precise edits to each indel. Absolute levels of precise editing decreased as the length of insertion increased, which has been shown previously, and we anticipate that even higher ratios of precise to imprecise editing could be generated for single nucleotide changes. 44,750 disease-associated single nucleotide or indel mutations in the ClinVar database can be corrected, in principle, by HDR via donor templates of 1-50 nucleotides in length. While dissociation of the RNP from its complexed quantum dot cargo was required for Cas9 activity, release of the biotin-ssODN through a photocleavable linkage had no significant effect on HDR editing (FIG. 34). Using a different chemistry in mouse cells, biotin-ssODNs could be recruited to RNPs within the cell produced by translation of injected Cas9-avidin mRNA. Increased local concentration of biotinylated donor template at the DSB through the streptavidin bridge of the S1mplex could be one mechanism that increases precise editing. Other potential mechanisms include differential modification of the ssODN ends to promote strand invasion or enhance stability within the cells, and a more defined stoichiometry of the RNP to the ssODN within each cell. Further modifications to the ssODN template and linkers could be used to dissect these gene editing mechanisms. The S1mplex strategy coupled with the variety of conjugatable biotinylated reagents enables the formation of a versatile toolkit centered around precise gene editing to advance gene editing scientific development and gene therapy.

Additional Materials and Methods

[0168] S1m-sgRNA-V3 was generated in a similar fashion but scaffold PCR was performed under different conditions. Phusion.RTM. PCR was performed using the following thermocycling protocol: 30 cycles of 98.degree. C. for 10 s and 72.degree. C. for 15 s with a final extension period of 72.degree. C. for 10 min. These scaffolds were then combined with the same second primer as in S1m-sgRNA-1 but cycled for 30 cycles of 98.degree. C. for 10 s and 60.degree. C. for 10 s and 72.degree. C. for 15 s with a final extension period of 72.degree. C. for 10 min.

[0169] LysoSensor.TM. Quantification.

[0170] H9 hESCs and Pompe iPSCs were harvested and counted to establish correct cell number ratios prior to being plated on glass-bottom well slides (Ibidi.TM.). Cells were allowed to attach for 24 hours prior to analysis. Cocultures were stained with LysoSensor Green (1:1000) and Hoechst 33342 (1:2000) for 5 minutes followed by 2.times. washes with PBS. Images were obtained using confocal microscopy (Nikon AR-1) and analyzed using CellProfiler.

[0171] Creation of ArrayEdit Platform.

[0172] .mu.CP was performed using previously described methods. The surface modification involved printing of an alkanethiol initiator to nucleate the polymerization of hydrophilic poly(ethylene glycol) (PEG) chains. Briefly, double-sided adhesive was attached to the bottom of a standard tissue culture plate, after which a laser cutter was used to cut out the well bottoms. Glass sheets were purchased at a size slightly smaller than a well plate. A metal evaporator was then used to deposit a thin layer of titanium, followed by a layer of gold, onto one side of the glass sheet. Using previously described chemistry, patterns were transferred to gold-coated glass via a polydimethylsiloxane stamp, after which the glass was submerged in a PEG solution overnight to build hydrophilic PEG chains surrounding .mu.Features. After submersion, sheets were washed with deionized water to remove residual copper deposited by the reaction and with 70% ethanol to sterilize. Standard tissue culture plates with well bottoms cut out were then fastened to processed sheets using a custom-made alignment device.

[0173] Biallelic Correction of Pompe iPSC.

[0174] All hPSC transfections were performed using the 4D-Nucleofector.TM. System (Lonza) in P3 solution using protocol CA-137. Cells were pretreated with Rho-kinase (ROCK) inhibitor (Y-27632, Selleck Chemicals) 24 hours prior to transfection. 50 pmol Cas9, 60 pmol sgRNA, 50 pmol streptavidin, and 60 pmol ssODN were used to form particles per ssODN-S1mplex as described above. Cells were then harvested using TrypLE.TM. (Life Technologies) and counted. 2.times.10.sup.5 cells per transfection were then centrifuged at 100.times.g for 3 minutes. Excess media was aspirated and cells were resuspended using 20 .mu.L of RNP solution per condition. After nucleofection, samples were incubated in nucleocuvettes at room temperature for 15 minutes prior to plating at 3.times.10.sup.4 cells per well in an ArrayEdit plate containing mTeSR1+10 .mu.M ROCK inhibitor. Media was changed 24 hours post transfection and replaced with mTeSR1 medium.
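The component amounts stated above can be turned into molar ratios with a few lines of arithmetic. The following is a minimal sketch, not part of the described method: the function names are hypothetical, and the per-cell figure is only the nominal input per cell (total amount divided by cell number), not an estimate of what is actually delivered by nucleofection.

```python
# Amounts per transfection in pmol, as stated in paragraph [0174].
AMOUNTS_PMOL = {"Cas9": 50, "sgRNA": 60, "streptavidin": 50, "ssODN": 60}
CELLS_PER_TRANSFECTION = 2e5
AVOGADRO = 6.02214076e23  # molecules per mole


def molar_ratios(amounts, reference="Cas9"):
    """Return each component's amount relative to the reference component."""
    ref = amounts[reference]
    return {name: value / ref for name, value in amounts.items()}


def nominal_molecules_per_cell(pmol, n_cells):
    """Nominal input per cell (hypothetical helper): total molecules / cell count."""
    return pmol * 1e-12 * AVOGADRO / n_cells


ratios = molar_ratios(AMOUNTS_PMOL)
cas9_per_cell = nominal_molecules_per_cell(AMOUNTS_PMOL["Cas9"], CELLS_PER_TRANSFECTION)

if __name__ == "__main__":
    for name, ratio in ratios.items():
        print(f"{name}: {ratio:.2f} relative to Cas9")
    print(f"Nominal Cas9 molecules supplied per cell: {cas9_per_cell:.3e}")
```

Under these stated amounts, the sgRNA and the ssODN are each supplied at a 1.2-fold molar excess over Cas9 and streptavidin.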

[0175] High-Content Image Acquisition and Analysis.

[0176] Automated microscopy was performed using a Nikon Eclipse TI epifluorescent microscope and NIS Elements Advanced Research (V4.30) software. The ND acquisition 6D module was used to establish a 20.times.20 grid pattern such that one 10.times. image was taken at each .mu.feature and combined in a single file. Nikon Perfect Focus was used to ensure that all images were in the same Z-plane and in focus. Each image was then corrected for illumination defects using CellProfiler and the number of nuclei was determined as well as LysoSensor.TM. intensity and S1mplex presence within the cell.

[0177] Dual S1mplexes for the excision of genomic DNA. Two different S1m-sgRNA-1 sequences, cutting .about.238 bp apart in the LAMAS locus, were designed (target sequences+PAM: GTAGCCGGGGAAGCGAAGCA-GGG (SEQ ID NO: 58) and GCTCACGGACGGCTCCTACC-TGG (SEQ ID NO: 59)) and sgRNAs for these sequences were made through in vitro transcription. One day prior to transfection, HEK 293 cells were seeded at 5,000 cells/well in a 96-well plate. Prior to transfection, RNPs were first formed by mixing each S1m-sgRNA separately with Cas9 protein at a 1:1 molar ratio. Dual S1mplexes were then formed by mixing the two different RNPs with streptavidin at a 1:1:1 molar ratio. S1mplexes were then mixed with Lipofectamine.TM. (100 ng Dual S1mplexes mixed with 0.75 .mu.L Lipofectamine.TM. 2000 per well) and used to transfect the HEK 293 cells. Three days post transfection, cells were harvested and genomic DNA was extracted as described previously. A 744 bp portion of the LAMAS locus spanning both targets was amplified using PCR (with primers CCCCATCGTTCCATCTCCTCT (SEQ ID NO: 60) and CGCGGGTTCTTTTGGTATCTTG (SEQ ID NO: 61)) and band intensities of unaffected and excised portions were used to quantify excision efficiency.
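The band-intensity quantification described above can be sketched as follows. This is an illustration under stated assumptions: the text does not give the formula, so the efficiency is assumed here to be the excised-band intensity divided by the summed intensity of both bands, with no correction for fragment length. The function names and the example intensities are hypothetical.

```python
AMPLICON_BP = 744      # full-length PCR product, from the text
EXCISED_SPAN_BP = 238  # approximate distance between the two cut sites, from the text


def expected_excised_band_bp(amplicon_bp=AMPLICON_BP, span_bp=EXCISED_SPAN_BP):
    """Approximate size of the PCR product after the intervening region is excised."""
    return amplicon_bp - span_bp


def excision_efficiency(unaffected_intensity, excised_intensity):
    """Assumed formula: fraction of total band signal found in the excised band."""
    total = unaffected_intensity + excised_intensity
    if total <= 0:
        raise ValueError("total band intensity must be positive")
    return excised_intensity / total


if __name__ == "__main__":
    print(f"Expected excised band: ~{expected_excised_band_bp()} bp")
    # Hypothetical densitometry values, for illustration only.
    print(f"Excision efficiency: {excision_efficiency(78.0, 22.0):.0%}")
```

Because shorter fragments bind less stain per molecule, an uncorrected intensity ratio of this kind may understate the molar fraction of excised alleles.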

TABLE-US-00008 TABLE 7 primers
S1m Construct Name   SEQ ID NO:   Sequence (5' to 3')
S1m_V3_F             62           GTTTAAGAGCTATGCTGCGAATACGAGCCGCCGACCAGAATCATGCAAGTGCGTAAGATAGTCGCGGGTCGGCGGCTCGTATTC
S1m_V3_R             63           AAAAGCACCGACTCGGTGCCACTTTTTCAAGTTGATAACGGACTAGCCTTATTTAAACTTGCTATGCTGCGAATACGAGCCGCCGACCCG
S1m1 Forward         64           TTAATACGACTCACTATAGGNNNNNNNNNNNNNNNNNNNNGTTTAAGAGCTATGCTGCGA
S1m-SL2_F            65           GTTTAAGAGCTATGCTGGAAACAGCATAGCAAGTTTAAATAAGGCTAGTCCGTTATCAACTTCGAATACGAGATGCGGCCGCCGACCAGA
S1m-SL2_R            66           AAAAAAAGCACCGACTCGGTGCCACTTTTTCCGAATACGAGATGCGGCCGCCGACCCGCGACTATCTTACGCACTTGCATGATTCTGGTCGGCGGC
S1m-SL3_F            67           GTTTAAGAGCATGCTGGAAACAGCATAGCAAGTTTAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCCGAA
S1m-SL3_R            68           AAAAAAACGAATACGAGATGCGGCCGCCGACCCGCGACTATCTTACGCACTTGCATGATTCTGGTCGGCGGCCGCATCTCGTATTCGGCACCGACT
RNATracR             69           AAAAGCACCGACTCGGTGCC

TABLE-US-00009 TABLE 8 protospacers and respective PAMs used for genomic targeting
sgRNA Name               SEQ ID NO:   Sequence (5' to 3')    PAM
BFP (BFP .fwdarw. GFP)   71           GCTGAAGCACTGCACGCCAT   GGG
mCherry (mCherry_15)     72           GGAGCCGTACATGAACTGAG   GGG
GAA .DELTA.T             73           CTCGTTGTCCAGGTAGGCCC   GG
GAA X746                 74           TGGACCACCAGCTCCTGTGG   GGG
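The S1m1 Forward primer in Table 7 (SEQ ID NO: 64) carries a 20-nucleotide placeholder (N) flanked by fixed sequence, and Table 8 lists 20-nucleotide protospacers. As a minimal sketch of how a target-specific forward primer could be assembled from the two tables, the snippet below substitutes a protospacer for the placeholder. The function name is hypothetical; the 5' flank contains the T7 promoter consensus (TAATACGACTCACTATAG), which is consistent with the in vitro transcription described above, but its role here is inferred rather than stated in the tables.

```python
# Fixed flanks copied from S1m1 Forward (SEQ ID NO: 64) in Table 7.
FIVE_PRIME_FLANK = "TTAATACGACTCACTATAGG"   # contains the T7 promoter consensus
THREE_PRIME_FLANK = "GTTTAAGAGCTATGCTGCGA"  # start of the sgRNA scaffold
PROTOSPACER_LENGTH = 20


def build_s1m1_forward_primer(protospacer):
    """Replace the N20 placeholder of S1m1 Forward with a specific protospacer (DNA)."""
    seq = protospacer.upper()
    if len(seq) != PROTOSPACER_LENGTH:
        raise ValueError(f"protospacer must be {PROTOSPACER_LENGTH} nt, got {len(seq)}")
    if set(seq) - set("ACGT"):
        raise ValueError("protospacer must contain only A, C, G, T")
    return FIVE_PRIME_FLANK + seq + THREE_PRIME_FLANK


# BFP protospacer (SEQ ID NO: 71) from Table 8.
bfp_primer = build_s1m1_forward_primer("GCTGAAGCACTGCACGCCAT")

if __name__ == "__main__":
    print(bfp_primer, len(bfp_primer))
```

The PAM is part of the genomic target, not of the sgRNA, so it is deliberately left out of the primer.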

Example 9: Variants of S1m-sgRNA: Variable Length of S1m Linker

[0178] We have created two different S1m-sgRNA versions that may serve different functions for downstream applications. Importantly, we have shown that the exact sequence of the construct is malleable and can be fine-tuned as desired. S1m-sgRNA-1 has a longer stem loop and may demonstrate more degrees of freedom in solution or when bound to Cas9 to form an RNP. This structure may have advantages when attaching larger cargoes, such as additional proteins, that may cause steric interference with Cas9 protein. In contrast, S1m-sgRNA-V3 (FIG. 34) contains a shorter stem loop linking the sgRNA and S1m aptamer. This structure may be easier to fold into the correct secondary structure due to the decreased complexity of the sequence and fewer binding partners for each nucleotide in the sequence. This sequence may also be amenable to synthetic construction methodologies that are length limited to preserve fidelity of the final product.

[0179] We next tested the capability of both sgRNAs to bind to streptavidin through an electrophoretic mobility shift assay (EMSA) (FIG. 35). Both sgRNAs showed a similar shift on the gel, suggesting the same binding capability of both aptamer constructions. This is as we expected, as the core sequence and therefore secondary structure of the streptavidin binding region is unchanged. However, with this assay we are unable to distinguish the portion of S1m-sgRNAs that are folded correctly. Both S1m-sgRNA-1 and V3 showed similar upward mobility following EMSA, suggesting the presence of larger complexes within the solution. In comparison, no shift was observed when mixing sgRNAs with streptavidin.

[0180] A core capability of the CRISPR/Cas9 system is the ability to create double strand breaks that are subsequently repaired by cellular mechanisms. To test this capability with S1m-sgRNAs, we transfected Cas9 RNPs containing an sgRNA targeting the fluorophore (Table 8) into H2b-mCherry expressing HEK cells and tested for the loss of fluorescence after 7 days. Both S1m-sgRNA variants induced fewer NHEJ events than a standard sgRNA (FIG. 36). While this loss of function is significant, it may lend greater utility to S1m-sgRNAs in applications relating to precise editing. In clinical settings, a high level of uncontrolled NHEJ products is undesirable. Between the two S1m-sgRNA variants, V3 induced .about.3-fold more NHEJ events than S1m-sgRNA-1. This may be due to a higher number of active sgRNAs within the transfected pool and may also suggest that V3 is more suitable for targeted deletion strategies.

[0181] We next tested the capabilities of both S1m-sgRNAs to induce HDR when formed into an ssODN-S1mplex. S1m-sgRNA-V3 again induced a higher level of HDR when compared to S1m-sgRNA-1 (FIG. 37). However, the ratio of precise to imprecise mutations was decreased in this condition, as the level of NHEJ was significantly higher than with S1m-sgRNA-1. This suggests that S1m-sgRNA-1 may be a better choice when only precise mutations are desired within the target cell population.

[0182] Both S1m-sgRNA-1 and S1m-sgRNA-V3 have potential to be used in the field of clinical gene editing and may span different applications. S1m-sgRNA-V3 is easier to create and induces higher levels of overall editing, a feature that may be useful in ex vivo therapies. Due to the higher cutting efficiency of S1m-sgRNA-V3, one could also envision a strategy of large deletions by tethering together two RNPs at a defined length. S1m-sgRNA-1, in comparison, is a longer aptamer and may feature more utility for attachment of larger cargoes such as Qdots or growth factors. It generally has a lower level of overall editing efficiency for both HDR and NHEJ applications but may be more useful for in vivo editing where precise mutations are desired.

Example 10: Isolation of Biallelic Corrected iPSCs

[0183] We obtained an iPSC line derived from a patient afflicted with infantile-onset Pompe disease. This cell line contains two distinct deleterious mutations at different points within a single gene. We created two fluorescent S1mplex-ssODNs containing sgRNA (Table 8) and ssODNs specific to each diseased locus and transfected them into cells prior to plating on our ArrayEdit platform (FIG. 38). ArrayEdit functions by looking for phenotypic differences between cell colonies to enrich the proportion of selected clones that are edited. We identified lysosome acidity as a potential difference between healthy and diseased cell lines that can be analyzed using image cytometry. To test this hypothesis, we co-cultured WA09-H2b-mCherry expressing cells with diseased Pompe iPSCs and stained the lysosomes with LysoSensor.TM. Green. LysoSensor.TM. Green is a dye that is preferentially trafficked to acidic organelles and fluoresces at higher intensity at lower pH. We then analyzed the green intensity of each cell within the coculture using CellProfiler and found that there was a significant difference between the two populations, even when growing within the same colony (FIG. 39).

[0184] With this knowledge we mock-transfected WA09 and Pompe PSCs and plated them on ArrayEdit to obtain baseline phenotypic data. We simultaneously transfected Pompe iPSCs with both fluorescent S1mplex-ssODNs. Across all conditions we tracked the growth rate of colonies and, seven days post-transfection, the LysoSensor.TM. intensity. We also measured the presence of each S1mplex in the corresponding condition. We again found that the WA09 cell colonies had a significantly higher LysoSensor.TM. intensity than Pompe iPSCs. Importantly, we also observed Pompe iPSC colonies that displayed intensities similar to that of the control WA09 line, suggesting editing events (FIG. 40). In previous experiments we observed that edited cell colonies may suffer a decrease in fitness while editing events occur. Accordingly, we tracked the cell number of each colony from day 1 to day 7 of the experiment and plotted the average change in cell number over this time course. We again observed cell colonies that grew slower than mock-transfected Pompe iPSCs. Importantly, there were numerous cell colonies that fit all of the criteria for selection for downstream analysis. These were: low growth rate, high LysoSensor.TM. intensity, and presence of at least one S1mplex type. After selection and Sanger sequencing we observed that we had obtained clones that were positive for correction at both loci individually and, most importantly, one clone that contained edits at both alleles simultaneously, including mutations to the PAM site (SEQ ID NOs. 76-79; 81-84), showing the ssODN that was used as the donor DNA (SEQ ID NOs. 75 and 80) (FIG. 41).

TABLE-US-00010
SEQ ID NO: 75   TGCAGCCTCTCGTTGTCCAGGTATGGCCCGGGTCCACTGCC
SEQ ID NO: 76   TNCAGCCTCTCGTTGTCCAGGTATGGCCNGGNTCAATTGCT
SEQ ID NO: 77   TNCAGCCTCTCGTTGTCCAGGTATGGCCCGGATCCACTGCC
SEQ ID NO: 78   CTCAGACCTNTTNTTNTCAAGGTAAGGCCCGGGTCCACTGCC
SEQ ID NO: 79   TNCAGCCTCTCGTTGTCCAGGTATGGCCCGGATCCACTGCC
SEQ ID NO: 80   CCTGGACTGTGGACCACCAGCTCCTGTGGGGGAGGCCCT
SEQ ID NO: 81   CCTGGACTGTGGACCACCAGCTCCTGTNGGGGAGGCCCT
SEQ ID NO: 82   CCTGGACTGTGGACCACCAGCTCCTGTGGGGAGAGGCCCT
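The three selection criteria named in paragraph [0184] (low growth rate, high LysoSensor.TM. intensity, and presence of at least one S1mplex type) can be expressed as a simple filter over per-colony measurements. The following is a minimal sketch only: the record fields, the threshold parameters, and the example values are all hypothetical, and in practice the thresholds would presumably be set from the mock-transfected Pompe iPSC and WA09 baselines.

```python
from dataclasses import dataclass


@dataclass
class Colony:
    colony_id: str
    growth_rate: float           # average change in cell number per day, day 1 to day 7
    lysosensor_intensity: float  # mean LysoSensor signal, arbitrary units
    s1mplex_types_present: int   # number of distinct S1mplex types detected (0, 1 or 2)


def select_colonies(colonies, max_growth_rate, min_intensity):
    """Keep colonies meeting all three criteria; thresholds are caller-supplied."""
    return [
        c
        for c in colonies
        if c.growth_rate < max_growth_rate
        and c.lysosensor_intensity > min_intensity
        and c.s1mplex_types_present >= 1
    ]


# Hypothetical measurements, for illustration only.
demo_colonies = [
    Colony("A", growth_rate=0.5, lysosensor_intensity=900.0, s1mplex_types_present=1),
    Colony("B", growth_rate=2.0, lysosensor_intensity=950.0, s1mplex_types_present=2),
    Colony("C", growth_rate=0.4, lysosensor_intensity=300.0, s1mplex_types_present=1),
    Colony("D", growth_rate=0.3, lysosensor_intensity=920.0, s1mplex_types_present=0),
]
selected = select_colonies(demo_colonies, max_growth_rate=1.0, min_intensity=800.0)

if __name__ == "__main__":
    print([c.colony_id for c in selected])
```

In this illustration only colony A passes: B grows too fast, C has low LysoSensor intensity, and D shows no S1mplex signal.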

Example 11: Dual S1mplexes for the Excision of Genomic DNA

[0185] Dual S1mplexes containing S1m-sgRNAs targeted to two different sites in the LAMAS locus were formed (FIG. 42) to test whether RNPs targeting two positions, packaged into S1mplexes and transfected into HEK 293 cells, were able to excise the intervening genomic sequence. After genomic DNA isolation and PCR amplification of the LAMAS locus, analysis (FIG. 42) showed an average excision efficiency of .about.22% of the region spanned by the two sgRNAs in HEK 293 cells, demonstrating the utility of dual-guided S1mplexes for excision purposes.

[0186] To isolate the specific S1mplexes containing only one RNP targeting each site, we will use HPLC (high performance liquid chromatography) to separate the various S1mplex species formed by random mixing of streptavidin and the various RNPs. We expect to be able to isolate the specific fraction containing one RNP for each of the two sites bound to a single streptavidin. We will compare the excision efficiency of the isolated dual S1mplexes with that of standard double sgRNAs, with and without a donor template for precise excision. For S1mplexes, the donor will be biotinylated and attached to the streptavidin as part of the S1mplex. We expect the simultaneous delivery in a nanoparticle of both RNPs as well as a donor to increase both the efficiency and the precision of excision.

[0187] The use of the terms "a" and "an" and "the" and similar referents (especially in the context of the following claims) is to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The terms first, second etc. as used herein are not meant to denote any particular ordering, but simply for convenience to denote a plurality of, for example, layers. The terms "comprising", "having", "including", and "containing" are to be construed as open-ended terms (i.e., meaning "including, but not limited to") unless otherwise noted. Recitation of ranges of values is merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. The endpoints of all ranges are included within the range and independently combinable. All methods described herein can be performed in a suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., "such as"), is intended merely to better illustrate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention as used herein.

[0188] While the invention has been described with reference to an exemplary embodiment, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed as the best mode contemplated for carrying out this invention, but that the invention will include all embodiments falling within the scope of the appended claims. Any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context.

Sequence CWU 1

1

821180RNAartificialsgRNAmisc_feature(1)..(20)n is a, c, g, or u 1nnnnnnnnnn nnnnnnnnnn guuuaagagc uaugcugcga auacgagaug cggccgccga 60ccagaaucau gcaagugcgu aagauagucg cgggucggcg gcucguauuc gcagcauagc 120aaguuuaaau aaggcuaguc cguuaucaac uugaaaaagu ggcaccgagu cggugcuuuu 1802190RNAartificialsgRNAmisc_feature(1)..(20)n is a, c, g, or u 2nnnnnnnnnn nnnnnnnnnn guuuaagagc uaugcuggaa acagcauagc aaguuuaaau 60aaggcuaguc cguuaucaac uucgaauacg agaugcggcc gccgaccaga aucaugcaag 120ugcguaagau agucgcgggu cggcggccgc aucucguauu cggaaaaagu ggcaccguga 180cggugcuuuu 1903190RNAartificialsgRNAmisc_feature(1)..(20)n is a, c, g, or u 3nnnnnnnnnn nnnnnnnnnn guuuaagagc uaugcuggaa acagcauagc aaguuuaaau 60aaggcuaguc cguuaucaac uugaaaaagu ggcaccguga cggugccgaa uacgagaugc 120ggccgccgac cagaaucaug caagugcgua agauagucgc gggucggcgg ccgcaucucg 180uauucguuuu 19041368PRTStreptococcus pyogenes 4Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val1 5 10 15Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 20 25 30Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 35 40 45Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 50 55 60Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys65 70 75 80Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 85 90 95Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 100 105 110His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 115 120 125His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 130 135 140Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His145 150 155 160Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 165 170 175Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 180 185 190Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 195 200 205Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 210 215 220Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn225 230 235 240Leu Ile Ala Leu 
Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 245 250 255Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 260 265 270Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 275 280 285Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 290 295 300Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser305 310 315 320Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 325 330 335Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 340 345 350Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 355 360 365Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 370 375 380Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg385 390 395 400Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 405 410 415Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 420 425 430Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 435 440 445Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 450 455 460Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu465 470 475 480Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 485 490 495Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 500 505 510Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 515 520 525Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 530 535 540Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr545 550 555 560Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 565 570 575Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 580 585 590Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 595 600 605Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 610 615 620Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala625 630 635 640His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 645 650 655Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn 
Gly Ile Arg Asp 660 665 670Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 675 680 685Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 690 695 700Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu705 710 715 720His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 725 730 735Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 740 745 750Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 755 760 765Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 770 775 780Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro785 790 795 800Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 805 810 815Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 820 825 830Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys 835 840 845Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 850 855 860Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys865 870 875 880Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 885 890 895Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 900 905 910Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 915 920 925Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 930 935 940Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser945 950 955 960Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 965 970 975Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 980 985 990Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe 995 1000 1005Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala 1010 1015 1020Lys Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe 1025 1030 1035Tyr Ser Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala 1040 1045 1050Asn Gly Glu Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu 1055 1060 1065Thr Gly Glu Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val 1070 1075 1080Arg Lys Val 
Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr 1085 1090 1095Glu Val Gln Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys 1100 1105 1110Arg Asn Ser Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro 1115 1120 1125Lys Lys Tyr Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val 1130 1135 1140Leu Val Val Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys 1145 1150 1155Ser Val Lys Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser 1160 1165 1170Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys 1175 1180 1185Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu 1190 1195 1200Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly 1205 1210 1215Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val 1220 1225 1230Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser 1235 1240 1245Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys 1250 1255 1260His Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys 1265 1270 1275Arg Val Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala 1280 1285 1290Tyr Asn Lys His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn 1295 1300 1305Ile Ile His Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala 1310 1315 1320Phe Lys Tyr Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser 1325 1330 1335Thr Lys Glu Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr 1340 1345 1350Gly Leu Tyr Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp 1355 1360 136551409PRTStreptococcus thermophilus 5Met Leu Phe Asn Lys Cys Ile Ile Ile Ser Ile Asn Leu Asp Phe Ser1 5 10 15Asn Lys Glu Lys Cys Met Thr Lys Pro Tyr Ser Ile Gly Leu Asp Ile 20 25 30Gly Thr Asn Ser Val Gly Trp Ala Val Ile Thr Asp Asn Tyr Lys Val 35 40 45Pro Ser Lys Lys Met Lys Val Leu Gly Asn Thr Ser Lys Lys Tyr Ile 50 55 60Lys Lys Asn Leu Leu Gly Val Leu Leu Phe Asp Ser Gly Ile Thr Ala65 70 75 80Glu Gly Arg Arg Leu Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg 85 90 95Arg Asn Arg Ile Leu Tyr Leu Gln Glu Ile Phe Ser Thr Glu Met Ala 100 105 110Thr Leu Asp Asp Ala Phe Phe Gln Arg Leu Asp Asp Ser Phe Leu 
Val 115 120 125Pro Asp Asp Lys Arg Asp Ser Lys Tyr Pro Ile Phe Gly Asn Leu Val 130 135 140Glu Glu Lys Val Tyr His Asp Glu Phe Pro Thr Ile Tyr His Leu Arg145 150 155 160Lys Tyr Leu Ala Asp Ser Thr Lys Lys Ala Asp Leu Arg Leu Val Tyr 165 170 175Leu Ala Leu Ala His Met Ile Lys Tyr Arg Gly His Phe Leu Ile Glu 180 185 190Gly Glu Phe Asn Ser Lys Asn Asn Asp Ile Gln Lys Asn Phe Gln Asp 195 200 205Phe Leu Asp Thr Tyr Asn Ala Ile Phe Glu Ser Asp Leu Ser Leu Glu 210 215 220Asn Ser Lys Gln Leu Glu Glu Ile Val Lys Asp Lys Ile Ser Lys Leu225 230 235 240Glu Lys Lys Asp Arg Ile Leu Lys Leu Phe Pro Gly Glu Lys Asn Ser 245 250 255Gly Ile Phe Ser Glu Phe Leu Lys Leu Ile Val Gly Asn Gln Ala Asp 260 265 270Phe Arg Lys Cys Phe Asn Leu Asp Glu Lys Ala Ser Leu His Phe Ser 275 280 285Lys Glu Ser Tyr Asp Glu Asp Leu Glu Thr Leu Leu Gly Tyr Ile Gly 290 295 300Asp Asp Tyr Ser Asp Val Phe Leu Lys Ala Lys Lys Leu Tyr Asp Ala305 310 315 320Ile Leu Leu Ser Gly Phe Leu Thr Val Thr Asp Asn Glu Thr Glu Ala 325 330 335Pro Leu Ser Ser Ala Met Ile Lys Arg Tyr Asn Glu His Lys Glu Asp 340 345 350Leu Ala Leu Leu Lys Glu Tyr Ile Arg Asn Ile Ser Leu Lys Thr Tyr 355 360 365Asn Glu Val Phe Lys Asp Asp Thr Lys Asn Gly Tyr Ala Gly Tyr Ile 370 375 380Asp Gly Lys Thr Asn Gln Glu Asp Phe Tyr Val Tyr Leu Lys Asn Leu385 390 395 400Leu Ala Glu Phe Glu Gly Ala Asp Tyr Phe Leu Glu Lys Ile Asp Arg 405 410 415Glu Asp Phe Leu Arg Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro 420 425 430Tyr Gln Ile His Leu Gln Glu Met Arg Ala Ile Leu Asp Lys Gln Ala 435 440 445Lys Phe Tyr Pro Phe Leu Ala Lys Asn Lys Glu Arg Ile Glu Lys Ile 450 455 460Leu Thr Phe Arg Ile Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn465 470 475 480Ser Asp Phe Ala Trp Ser Ile Arg Lys Arg Asn Glu Lys Ile Thr Pro 485 490 495Trp Asn Phe Glu Asp Val Ile Asp Lys Glu Ser Ser Ala Glu Ala Phe 500 505 510Ile Asn Arg Met Thr Ser Phe Asp Leu Tyr Leu Pro Glu Glu Lys Val 515 520 525Leu Pro Lys His Ser Leu Leu Tyr Glu Thr Phe Asn Val Tyr Asn Glu 530 535 540Leu Thr Lys Val Arg 
Phe Ile Ala Glu Ser Met Arg Asp Tyr Gln Phe545 550 555 560Leu Asp Ser Lys Gln Lys Lys Asp Ile Val Arg Leu Tyr Phe Lys Asp 565 570 575Lys Arg Lys Val Thr Asp Lys Asp Ile Ile Glu Tyr Leu His Ala Ile 580 585 590Tyr Gly Tyr Asp Gly Ile Glu Leu Lys Gly Ile Glu Lys Gln Phe Asn 595 600 605Ser Ser Leu Ser Thr Tyr His Asp Leu Leu Asn Ile Ile Asn Asp Lys 610 615 620Glu Phe Leu Asp Asp Ser Ser Asn Glu Ala Ile Ile Glu Glu Ile Ile625 630 635 640His Thr Leu Thr Ile Phe Glu Asp Arg Glu Met Ile Lys Gln Arg Leu 645 650 655Ser Lys Phe Glu Asn Ile Phe Asp Lys Ser Val Leu Lys Lys Leu Ser 660 665 670Arg Arg His Tyr Thr Gly Trp Gly Lys Leu Ser Ala Lys Leu Ile Asn 675 680 685Gly Ile Arg Asp Glu Lys Ser Gly Asn Thr Ile Leu Asp Tyr Leu Ile 690 695 700Asp Asp Gly Ile Ser Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp705 710 715 720Ala Leu Ser Phe Lys Lys Lys Ile Gln Lys Ala Gln Ile Ile Gly Asp 725 730 735Glu Asp Lys Gly Asn Ile Lys Glu Val Val Lys Ser Leu Pro Gly Ser 740 745 750Pro Ala Ile Lys Lys Gly Ile Leu Gln Ser Ile Lys Ile Val Asp Glu 755 760 765Leu Val Lys Val Met Gly Gly Arg Lys Pro Glu Ser Ile Val Val Glu 770 775 780Met Ala Arg Glu Asn Gln Tyr Thr Asn Gln Gly Lys Ser Asn Ser Gln785 790 795 800Gln Arg Leu Lys Arg Leu Glu Lys Ser Leu Lys Glu Leu Gly Ser Lys 805 810 815Ile Leu Lys Glu Asn Ile Pro Ala Lys Leu Ser Lys Ile Asp Asn Asn 820 825 830Ala Leu Gln Asn Asp Arg Leu Tyr Leu Tyr Tyr Leu Gln Asn Gly Lys 835 840 845Asp Met Tyr Thr Gly Asp Asp Leu Asp Ile Asp Arg Leu Ser Asn Tyr 850 855 860Asp Ile Asp His Ile Ile Pro Gln Ala Phe Leu Lys Asp Asn Ser Ile865 870 875 880Asp Asn Lys Val Leu Val Ser Ser Ala Ser Asn Arg Gly Lys Ser Asp 885 890 895Asp Phe Pro Ser Leu Glu Val Val Lys Lys Arg Lys Thr Phe Trp Tyr 900 905 910Gln Leu Leu Lys Ser Lys Leu Ile Ser Gln Arg Lys Phe Asp Asn Leu 915 920 925Thr Lys Ala Glu Arg Gly Gly Leu Leu Pro Glu Asp Lys Ala Gly Phe 930 935 940Ile Gln Arg Gln Leu Val Glu Thr Arg Gln Ile Thr Lys His Val

Ala945 950 955 960Arg Leu Leu Asp Glu Lys Phe Asn Asn Lys Lys Asp Glu Asn Asn Arg 965 970 975Ala Val Arg Thr Val Lys Ile Ile Thr Leu Lys Ser Thr Leu Val Ser 980 985 990Gln Phe Arg Lys Asp Phe Glu Leu Tyr Lys Val Arg Glu Ile Asn Asp 995 1000 1005Phe His His Ala His Asp Ala Tyr Leu Asn Ala Val Ile Ala Ser 1010 1015 1020Ala Leu Leu Lys Lys Tyr Pro Lys Leu Glu Pro Glu Phe Val Tyr 1025 1030 1035Gly Asp Tyr Pro Lys Tyr Asn Ser Phe Arg Glu Arg Lys Ser Ala 1040 1045 1050Thr Glu Lys Val Tyr Phe Tyr Ser Asn Ile Met Asn Ile Phe Lys 1055 1060 1065Lys Ser Ile Ser Leu Ala Asp Gly Arg Val Ile Glu Arg Pro Leu 1070 1075 1080Ile Glu Val Asn Glu Glu Thr Gly Glu Ser Val Trp Asn Lys Glu 1085 1090 1095Ser Asp Leu Ala Thr Val Arg Arg Val Leu Ser Tyr Pro Gln Val 1100 1105 1110Asn Val Val Lys Lys Val Glu Glu Gln Asn His Gly Leu Asp Arg 1115 1120 1125Gly Lys Pro Lys Gly Leu Phe Asn Ala Asn Leu Ser Ser Lys Pro 1130 1135 1140Lys Pro Asn Ser Asn Glu Asn Leu Val Gly Ala Lys Glu Tyr Leu 1145 1150 1155Asp Pro Lys Lys Tyr Gly Gly Tyr Ala Gly Ile Ser Asn Ser Phe 1160 1165 1170Ala Val Leu Val Lys Gly Thr Ile Glu Lys Gly Ala Lys Lys Lys 1175 1180 1185Ile Thr Asn Val Leu Glu Phe Gln Gly Ile Ser Ile Leu Asp Arg 1190 1195 1200Ile Asn Tyr Arg Lys Asp Lys Leu Asn Phe Leu Leu Glu Lys Gly 1205 1210 1215Tyr Lys Asp Ile Glu Leu Ile Ile Glu Leu Pro Lys Tyr Ser Leu 1220 1225 1230Phe Glu Leu Ser Asp Gly Ser Arg Arg Met Leu Ala Ser Ile Leu 1235 1240 1245Ser Thr Asn Asn Lys Arg Gly Glu Ile His Lys Gly Asn Gln Ile 1250 1255 1260Phe Leu Ser Gln Lys Phe Val Lys Leu Leu Tyr His Ala Lys Arg 1265 1270 1275Ile Ser Asn Thr Ile Asn Glu Asn His Arg Lys Tyr Val Glu Asn 1280 1285 1290His Lys Lys Glu Phe Glu Glu Leu Phe Tyr Tyr Ile Leu Glu Phe 1295 1300 1305Asn Glu Asn Tyr Val Gly Ala Lys Lys Asn Gly Lys Leu Leu Asn 1310 1315 1320Ser Ala Phe Gln Ser Trp Gln Asn His Ser Ile Asp Glu Leu Cys 1325 1330 1335Ser Ser Phe Ile Gly Pro Thr Gly Ser Glu Arg Lys Gly Leu Phe 1340 1345 1350Glu Leu Thr Ser Arg Gly Ser Ala Ala Asp Phe Glu Phe 
Leu Gly 1355 1360 1365Val Lys Ile Pro Arg Tyr Arg Asp Tyr Thr Pro Ser Ser Leu Leu 1370 1375 1380Lys Asp Ala Thr Leu Ile His Gln Ser Val Thr Gly Leu Tyr Glu 1385 1390 1395Thr Arg Ile Asp Leu Ala Lys Leu Gly Glu Gly 1400 140561082PRTNeisseria 6Met Ala Ala Phe Lys Pro Asn Pro Ile Asn Tyr Ile Leu Gly Leu Asp1 5 10 15Ile Gly Ile Ala Ser Val Gly Trp Ala Met Val Glu Ile Asp Glu Glu 20 25 30Glu Asn Pro Ile Arg Leu Ile Asp Leu Gly Val Arg Val Phe Glu Arg 35 40 45Ala Glu Val Pro Lys Thr Gly Asp Ser Leu Ala Met Val Arg Arg Leu 50 55 60Ala Arg Ser Val Arg Arg Leu Thr Arg Arg Arg Ala His Arg Leu Leu65 70 75 80Arg Ala Arg Arg Leu Leu Lys Arg Glu Gly Val Leu Gln Ala Ala Asp 85 90 95Phe Asp Glu Asn Gly Leu Ile Lys Ser Leu Pro Asn Thr Pro Trp Gln 100 105 110Leu Arg Ala Ala Ala Leu Asp Arg Lys Leu Thr Pro Leu Glu Trp Ser 115 120 125Ala Val Leu Leu His Leu Ile Lys His Arg Gly Tyr Leu Ser Gln Arg 130 135 140Lys Asn Glu Gly Glu Thr Ala Asp Lys Glu Leu Gly Ala Leu Leu Lys145 150 155 160Gly Val Ala Asp Asn Ala His Ala Leu Gln Thr Gly Asp Phe Arg Thr 165 170 175Pro Ala Glu Leu Ala Leu Asn Lys Phe Glu Lys Glu Ser Gly His Ile 180 185 190Arg Asn Gln Arg Gly Asp Tyr Ser His Thr Phe Ser Arg Lys Asp Leu 195 200 205Gln Ala Glu Leu Ile Leu Leu Phe Glu Lys Gln Lys Glu Phe Gly Asn 210 215 220Pro His Ile Ser Gly Gly Leu Lys Glu Gly Ile Glu Thr Leu Leu Met225 230 235 240Thr Gln Arg Pro Ala Leu Ser Gly Asp Ala Val Gln Lys Met Leu Gly 245 250 255His Cys Thr Phe Glu Pro Ala Glu Pro Lys Ala Ala Lys Asn Thr Tyr 260 265 270Thr Ala Glu Arg Phe Ile Trp Leu Thr Lys Leu Asn Asn Leu Arg Ile 275 280 285Leu Glu Gln Gly Ser Glu Arg Pro Leu Thr Asp Thr Glu Arg Ala Thr 290 295 300Leu Met Asp Glu Pro Tyr Arg Lys Ser Lys Leu Thr Tyr Ala Gln Ala305 310 315 320Arg Lys Leu Leu Gly Leu Glu Asp Thr Ala Phe Phe Lys Gly Leu Arg 325 330 335Tyr Gly Lys Asp Asn Ala Glu Ala Ser Thr Leu Met Glu Met Lys Ala 340 345 350Tyr His Ala Ile Ser Arg Ala Leu Glu Lys Glu Gly Leu Lys Asp Lys 355 360 365Lys Ser Pro Leu Asn Leu Ser Pro Glu Leu 
Gln Asp Glu Ile Gly Thr 370 375 380Ala Phe Ser Leu Phe Lys Thr Asp Glu Asp Ile Thr Gly Arg Leu Lys385 390 395 400Asp Arg Ile Gln Pro Glu Ile Leu Glu Ala Leu Leu Lys His Ile Ser 405 410 415Phe Asp Lys Phe Val Gln Ile Ser Leu Lys Ala Leu Arg Arg Ile Val 420 425 430Pro Leu Met Glu Gln Gly Lys Arg Tyr Asp Glu Ala Cys Ala Glu Ile 435 440 445Tyr Gly Asp His Tyr Gly Lys Lys Asn Thr Glu Glu Lys Ile Tyr Leu 450 455 460Pro Pro Ile Pro Ala Asp Glu Ile Arg Asn Pro Val Val Leu Arg Ala465 470 475 480Leu Ser Gln Ala Arg Lys Val Ile Asn Gly Val Val Arg Arg Tyr Gly 485 490 495Ser Pro Ala Arg Ile His Ile Glu Thr Ala Arg Glu Val Gly Lys Ser 500 505 510Phe Lys Asp Arg Lys Glu Ile Glu Lys Arg Gln Glu Glu Asn Arg Lys 515 520 525Asp Arg Glu Lys Ala Ala Ala Lys Phe Arg Glu Tyr Phe Pro Asn Phe 530 535 540Val Gly Glu Pro Lys Ser Lys Asp Ile Leu Lys Leu Arg Leu Tyr Glu545 550 555 560Gln Gln His Gly Lys Cys Leu Tyr Ser Gly Lys Glu Ile Asn Leu Gly 565 570 575Arg Leu Asn Glu Lys Gly Tyr Val Glu Ile Asp His Ala Leu Pro Phe 580 585 590Ser Arg Thr Trp Asp Asp Ser Phe Asn Asn Lys Val Leu Val Leu Gly 595 600 605Ser Glu Asn Gln Asn Lys Gly Asn Gln Thr Pro Tyr Glu Tyr Phe Asn 610 615 620Gly Lys Asp Asn Ser Arg Glu Trp Gln Glu Phe Lys Ala Arg Val Glu625 630 635 640Thr Ser Arg Phe Pro Arg Ser Lys Lys Gln Arg Ile Leu Leu Gln Lys 645 650 655Phe Asp Glu Asp Gly Phe Lys Glu Arg Asn Leu Asn Asp Thr Arg Tyr 660 665 670Val Asn Arg Phe Leu Cys Gln Phe Val Ala Asp Arg Met Arg Leu Thr 675 680 685Gly Lys Gly Lys Lys Arg Val Phe Ala Ser Asn Gly Gln Ile Thr Asn 690 695 700Leu Leu Arg Gly Phe Trp Gly Leu Arg Lys Val Arg Ala Glu Asn Asp705 710 715 720Arg His His Ala Leu Asp Ala Val Val Val Ala Cys Ser Thr Val Ala 725 730 735Met Gln Gln Lys Ile Thr Arg Phe Val Arg Tyr Lys Glu Met Asn Ala 740 745 750Phe Asp Gly Lys Thr Ile Asp Lys Glu Thr Gly Glu Val Leu His Gln 755 760 765Lys Thr His Phe Pro Gln Pro Trp Glu Phe Phe Ala Gln Glu Val Met 770 775 780Ile Arg Val Phe Gly Lys Pro Asp Gly Lys Pro Glu Phe Glu Glu Ala785 790 795 
800Asp Thr Pro Glu Lys Leu Arg Thr Leu Leu Ala Glu Lys Leu Ser Ser 805 810 815Arg Pro Glu Ala Val His Glu Tyr Val Thr Pro Leu Phe Val Ser Arg 820 825 830Ala Pro Asn Arg Lys Met Ser Gly Gln Gly His Met Glu Thr Val Lys 835 840 845Ser Ala Lys Arg Leu Asp Glu Gly Val Ser Val Leu Arg Val Pro Leu 850 855 860Thr Gln Leu Lys Leu Lys Asp Leu Glu Lys Met Val Asn Arg Glu Arg865 870 875 880Glu Pro Lys Leu Tyr Glu Ala Leu Lys Ala Arg Leu Glu Ala His Lys 885 890 895Asp Asp Pro Ala Lys Ala Phe Ala Glu Pro Phe Tyr Lys Tyr Asp Lys 900 905 910Ala Gly Asn Arg Thr Gln Gln Val Lys Ala Val Arg Val Glu Gln Val 915 920 925Gln Lys Thr Gly Val Trp Val Arg Asn His Asn Gly Ile Ala Asp Asn 930 935 940Ala Thr Met Val Arg Val Asp Val Phe Glu Lys Gly Asp Lys Tyr Tyr945 950 955 960Leu Val Pro Ile Tyr Ser Trp Gln Val Ala Lys Gly Ile Leu Pro Asp 965 970 975Arg Ala Val Val Gln Gly Lys Asp Glu Glu Asp Trp Gln Leu Ile Asp 980 985 990Asp Ser Phe Asn Phe Lys Phe Ser Leu His Pro Asn Asp Leu Val Glu 995 1000 1005Val Ile Thr Lys Lys Ala Arg Met Phe Gly Tyr Phe Ala Ser Cys 1010 1015 1020His Arg Gly Thr Gly Asn Ile Asn Ile Arg Ile His Asp Leu Asp 1025 1030 1035His Lys Ile Gly Lys Asn Gly Ile Leu Glu Gly Ile Gly Val Lys 1040 1045 1050Thr Ala Leu Ser Phe Gln Lys Tyr Gln Ile Asp Glu Leu Gly Lys 1055 1060 1065Glu Ile Arg Pro Cys Arg Leu Lys Lys Arg Pro Pro Val Arg 1070 1075 108071395PRTTreponema 7Met Lys Lys Glu Ile Lys Asp Tyr Phe Leu Gly Leu Asp Val Gly Thr1 5 10 15Gly Ser Val Gly Trp Ala Val Thr Asp Thr Asp Tyr Lys Leu Leu Lys 20 25 30Ala Asn Arg Lys Asp Leu Trp Gly Met Arg Cys Phe Glu Thr Ala Glu 35 40 45Thr Ala Glu Val Arg Arg Leu His Arg Gly Ala Arg Arg Arg Ile Glu 50 55 60Arg Arg Lys Lys Arg Ile Lys Leu Leu Gln Glu Leu Phe Ser Gln Glu65 70 75 80Ile Ala Lys Thr Asp Glu Gly Phe Phe Gln Arg Met Lys Glu Ser Pro 85 90 95Phe Tyr Ala Glu Asp Lys Thr Ile Leu Gln Glu Asn Thr Leu Phe Asn 100 105 110Asp Lys Asp Phe Ala Asp Lys Thr Tyr His Lys Ala Tyr Pro Thr Ile 115 120 125Asn His Leu Ile Lys Ala Trp Ile Glu Asn Lys 
Val Lys Pro Asp Pro 130 135 140Arg Leu Leu Tyr Leu Ala Cys His Asn Ile Ile Lys Lys Arg Gly His145 150 155 160Phe Leu Phe Glu Gly Asp Phe Asp Ser Glu Asn Gln Phe Asp Thr Ser 165 170 175Ile Gln Ala Leu Phe Glu Tyr Leu Arg Glu Asp Met Glu Val Asp Ile 180 185 190Asp Ala Asp Ser Gln Lys Val Lys Glu Ile Leu Lys Asp Ser Ser Leu 195 200 205Lys Asn Ser Glu Lys Gln Ser Arg Leu Asn Lys Ile Leu Gly Leu Lys 210 215 220Pro Ser Asp Lys Gln Lys Lys Ala Ile Thr Asn Leu Ile Ser Gly Asn225 230 235 240Lys Ile Asn Phe Ala Asp Leu Tyr Asp Asn Pro Asp Leu Lys Asp Ala 245 250 255Glu Lys Asn Ser Ile Ser Phe Ser Lys Asp Asp Phe Asp Ala Leu Ser 260 265 270Asp Asp Leu Ala Ser Ile Leu Gly Asp Ser Phe Glu Leu Leu Leu Lys 275 280 285Ala Lys Ala Val Tyr Asn Cys Ser Val Leu Ser Lys Val Ile Gly Asp 290 295 300Glu Gln Tyr Leu Ser Phe Ala Lys Val Lys Ile Tyr Glu Lys His Lys305 310 315 320Thr Asp Leu Thr Lys Leu Lys Asn Val Ile Lys Lys His Phe Pro Lys 325 330 335Asp Tyr Lys Lys Val Phe Gly Tyr Asn Lys Asn Glu Lys Asn Asn Asn 340 345 350Asn Tyr Ser Gly Tyr Val Gly Val Cys Lys Thr Lys Ser Lys Lys Leu 355 360 365Ile Ile Asn Asn Ser Val Asn Gln Glu Asp Phe Tyr Lys Phe Leu Lys 370 375 380Thr Ile Leu Ser Ala Lys Ser Glu Ile Lys Glu Val Asn Asp Ile Leu385 390 395 400Thr Glu Ile Glu Thr Gly Thr Phe Leu Pro Lys Gln Ile Ser Lys Ser 405 410 415Asn Ala Glu Ile Pro Tyr Gln Leu Arg Lys Met Glu Leu Glu Lys Ile 420 425 430Leu Ser Asn Ala Glu Lys His Phe Ser Phe Leu Lys Gln Lys Asp Glu 435 440 445Lys Gly Leu Ser His Ser Glu Lys Ile Ile Met Leu Leu Thr Phe Lys 450 455 460Ile Pro Tyr Tyr Ile Gly Pro Ile Asn Asp Asn His Lys Lys Phe Phe465 470 475 480Pro Asp Arg Cys Trp Val Val Lys Lys Glu Lys Ser Pro Ser Gly Lys 485 490 495Thr Thr Pro Trp Asn Phe Phe Asp His Ile Asp Lys Glu Lys Thr Ala 500 505 510Glu Ala Phe Ile Thr Ser Arg Thr Asn Phe Cys Thr Tyr Leu Val Gly 515 520 525Glu Ser Val Leu Pro Lys Ser Ser Leu Leu Tyr Ser Glu Tyr Thr Val 530 535 540Leu Asn Glu Ile Asn Asn Leu Gln Ile Ile Ile Asp Gly Lys Asn Ile545 550 555 
560Cys Asp Ile Lys Leu Lys Gln Lys Ile Tyr Glu Asp Leu Phe Lys Lys 565 570 575Tyr Lys Lys Ile Thr Gln Lys Gln Ile Ser Thr Phe Ile Lys His Glu 580 585 590Gly Ile Cys Asn Lys Thr Asp Glu Val Ile Ile Leu Gly Ile Asp Lys 595 600 605Glu Cys Thr Ser Ser Leu Lys Ser Tyr Ile Glu Leu Lys Asn Ile Phe 610 615 620Gly Lys Gln Val Asp Glu Ile Ser Thr Lys Asn Met Leu Glu Glu Ile625 630 635 640Ile Arg Trp Ala Thr Ile Tyr Asp Glu Gly Glu Gly Lys Thr Ile Leu 645 650 655Lys Thr Lys Ile Lys Ala Glu Tyr Gly Lys Tyr Cys Ser Asp Glu Gln 660 665 670Ile Lys Lys Ile Leu Asn Leu Lys Phe Ser Gly Trp Gly Arg Leu Ser 675 680 685Arg Lys Phe Leu Glu Thr Val Thr Ser Glu Met Pro Gly Phe Ser Glu 690 695 700Pro Val Asn Ile Ile Thr Ala Met Arg Glu Thr Gln Asn Asn Leu Met705 710 715 720Glu Leu Leu Ser Ser Glu Phe Thr Phe Thr Glu Asn Ile Lys Lys Ile 725 730 735Asn Ser Gly Phe Glu Asp Ala Glu Lys Gln Phe Ser Tyr Asp Gly Leu 740 745 750Val Lys Pro Leu Phe Leu Ser Pro Ser Val Lys Lys Met Leu Trp Gln 755 760 765Thr Leu Lys Leu Val Lys Glu Ile Ser His Ile Thr Gln Ala Pro Pro 770 775 780Lys Lys Ile Phe Ile Glu Met Ala Lys Gly Ala Glu Leu Glu Pro Ala785 790 795 800Arg Thr Lys Thr Arg Leu Lys Ile Leu Gln Asp Leu Tyr Asn Asn Cys 805 810 815Lys Asn Asp Ala Asp Ala Phe Ser Ser Glu Ile Lys Asp Leu Ser Gly 820 825 830Lys Ile Glu Asn Glu Asp Asn Leu Arg Leu Arg Ser Asp Lys Leu Tyr 835 840 845Leu Tyr Tyr Thr Gln Leu Gly Lys Cys Met Tyr Cys Gly Lys Pro Ile 850 855 860Glu Ile Gly His Val Phe Asp Thr Ser Asn Tyr Asp Ile Asp His Ile865 870 875 880Tyr Pro Gln Ser Lys Ile Lys Asp Asp Ser Ile Ser Asn Arg Val Leu 885 890 895Val Cys Ser Ser Cys Asn Lys Asn Lys Glu Asp Lys Tyr Pro Leu Lys 900 905 910Ser Glu Ile Gln Ser Lys Gln Arg Gly Phe Trp Asn Phe Leu Gln Arg

915 920 925Asn Asn Phe Ile Ser Leu Glu Lys Leu Asn Arg Leu Thr Arg Ala Thr 930 935 940Pro Ile Ser Asp Asp Glu Thr Ala Lys Phe Ile Ala Arg Gln Leu Val945 950 955 960Glu Thr Arg Gln Ala Thr Lys Val Ala Ala Lys Val Leu Glu Lys Met 965 970 975Phe Pro Glu Thr Lys Ile Val Tyr Ser Lys Ala Glu Thr Val Ser Met 980 985 990Phe Arg Asn Lys Phe Asp Ile Val Lys Cys Arg Glu Ile Asn Asp Phe 995 1000 1005His His Ala His Asp Ala Tyr Leu Asn Ile Val Val Gly Asn Val 1010 1015 1020Tyr Asn Thr Lys Phe Thr Asn Asn Pro Trp Asn Phe Ile Lys Glu 1025 1030 1035Lys Arg Asp Asn Pro Lys Ile Ala Asp Thr Tyr Asn Tyr Tyr Lys 1040 1045 1050Val Phe Asp Tyr Asp Val Lys Arg Asn Asn Ile Thr Ala Trp Glu 1055 1060 1065Lys Gly Lys Thr Ile Ile Thr Val Lys Asp Met Leu Lys Arg Asn 1070 1075 1080Thr Pro Ile Tyr Thr Arg Gln Ala Ala Cys Lys Lys Gly Glu Leu 1085 1090 1095Phe Asn Gln Thr Ile Met Lys Lys Gly Leu Gly Gln His Pro Leu 1100 1105 1110Lys Lys Glu Gly Pro Phe Ser Asn Ile Ser Lys Tyr Gly Gly Tyr 1115 1120 1125Asn Lys Val Ser Ala Ala Tyr Tyr Thr Leu Ile Glu Tyr Glu Glu 1130 1135 1140Lys Gly Asn Lys Ile Arg Ser Leu Glu Thr Ile Pro Leu Tyr Leu 1145 1150 1155Val Lys Asp Ile Gln Lys Asp Gln Asp Val Leu Lys Ser Tyr Leu 1160 1165 1170Thr Asp Leu Leu Gly Lys Lys Glu Phe Lys Ile Leu Val Pro Lys 1175 1180 1185Ile Lys Ile Asn Ser Leu Leu Lys Ile Asn Gly Phe Pro Cys His 1190 1195 1200Ile Thr Gly Lys Thr Asn Asp Ser Phe Leu Leu Arg Pro Ala Val 1205 1210 1215Gln Phe Cys Cys Ser Asn Asn Glu Val Leu Tyr Phe Lys Lys Ile 1220 1225 1230Ile Arg Phe Ser Glu Ile Arg Ser Gln Arg Glu Lys Ile Gly Lys 1235 1240 1245Thr Ile Ser Pro Tyr Glu Asp Leu Ser Phe Arg Ser Tyr Ile Lys 1250 1255 1260Glu Asn Leu Trp Lys Lys Thr Lys Asn Asp Glu Ile Gly Glu Lys 1265 1270 1275Glu Phe Tyr Asp Leu Leu Gln Lys Lys Asn Leu Glu Ile Tyr Asp 1280 1285 1290Met Leu Leu Thr Lys His Lys Asp Thr Ile Tyr Lys Lys Arg Pro 1295 1300 1305Asn Ser Ala Thr Ile Asp Ile Leu Val Lys Gly Lys Glu Lys Phe 1310 1315 1320Lys Ser Leu Ile Ile Glu Asn Gln Phe Glu Val Ile Leu 
Glu Ile 1325 1330 1335Leu Lys Leu Phe Ser Ala Thr Arg Asn Val Ser Asp Leu Gln His 1340 1345 1350Ile Gly Gly Ser Lys Tyr Ser Gly Val Ala Lys Ile Gly Asn Lys 1355 1360 1365Ile Ser Ser Leu Asp Asn Cys Ile Leu Ile Tyr Gln Ser Ile Thr 1370 1375 1380Gly Ile Phe Glu Lys Arg Ile Asp Leu Leu Lys Val 1385 1390 1395

SEQ ID NO 8: 90 nt, DNA, artificial, primer
gtttaagagc tatgctgcga atacgagatg cggccgccga ccagaatcat gcaagtgcgt 60
aagatagtcg cgggtcggcg gctcgtattc 90

SEQ ID NO 9: 90 nt, DNA, artificial, primer
aaaagcaccg actcggtgcc actttttcaa gttgataacg gactagcctt atttaaactt 60
gctatgctgc gaatacgagc cgccgacccg 90

SEQ ID NO 10: 60 nt, DNA, artificial, primer
misc_feature (21)..(40): n is a, c, g, or t
ttaatacgac tcactatagg nnnnnnnnnn nnnnnnnnnn gtttaagagc tatgctgcga 60

SEQ ID NO 11: 20 nt, DNA, artificial, primer
aaaagcaccg actcggtgcc 20

SEQ ID NO 12: 20 nt, DNA, artificial, protospacer
gctgaagcac tgcacgccat 20

SEQ ID NO 13: 20 nt, DNA, artificial, protospacer
gtcacctcca atgactaggg 20

SEQ ID NO 14: 20 nt, DNA, artificial, protospacer
ggagccgtac atgaactgag 20

SEQ ID NO 15: 20 nt, DNA, artificial, primer
ccatcccctt ctgtgaatgt 20

SEQ ID NO 16: 20 nt, DNA, artificial, primer
ggagattgga gacacggaga 20

SEQ ID NO 17: 20 nt, DNA, artificial, primer
tccaccttgg cttggctttg 20

SEQ ID NO 18: 19 nt, DNA, artificial, primer
ccctccacca gtaccccac 19

SEQ ID NO 19: 22 nt, DNA, artificial, primer
aagggcgagg aggataacat gg 22

SEQ ID NO 20: 21 nt, DNA, artificial, primer
ttgtacagct cgtccatgcc g 21

SEQ ID NO 21: 20 nt, DNA, artificial, primer
ccaatgacaa gcttgctagc 20

SEQ ID NO 22: 100 nt, DNA, artificial, ssODN
tcatgtggtc ggggtagcgg ctgaagcact gcacgccatg ggtcagggtg gtcacgaggg 60
tgggccaggg caccggcagc ttgccggtgg tgcagatgaa 100

SEQ ID NO 23: 100 nt, DNA, artificial, ssODN
tcatgtggtc ggggtagcgg ctgaagcact gcacgccatg ggtcagggtg gtcacgaggg 60
tgggccaggg caccggcagc ttgccggtgg tgcagatgaa 100

SEQ ID NO 24: 90 nt, DNA, artificial, ssODN
aagcagcact ctgccctcgt gggtttgtgg ttgcccaccg ctagcaagct tgtcattgga 60
ggtgacatcg atgtcctccc cattggcctg 90

SEQ ID NO 25: 90 nt, DNA, Artificial Sequence, ssODN
aagcagcact ctgccctcgt gggtttgtgg ttgcccaccg ctagcaagct tgtcattgga 60
ggtgacatcg atgtcctccc cattggcctg 90

SEQ ID NO 26: 20 nt, DNA, artificial, Off-target sequence
gctgaagcac tgcacgccat 20

SEQ ID NO 27: 20 nt, DNA, artificial, Off-target sequence
gcagaagcac tgcaagccat 20

SEQ ID NO 28: 20 nt, DNA, artificial, Off-target sequence
tctgaagtgc tgcacgccat 20

SEQ ID NO 29: 20 nt, DNA, artificial, Off-target sequence
gtggaagcac tgcaagccat 20

SEQ ID NO 30: 20 nt, DNA, artificial, Off-target sequence
ggtggagcag ggcacgccat 20

SEQ ID NO 31: 20 nt, DNA, artificial, Off-target sequence
gaagaagcac tgcaccccat 20

SEQ ID NO 32: 20 nt, DNA, artificial, Off-target sequence
gtcacctcca atgactaggg 20

SEQ ID NO 33: 20 nt, DNA, artificial, Off-target sequence
aggaccacca atgactaggg 20

SEQ ID NO 34: 20 nt, DNA, artificial, Off-target sequence
accacctgta atgactaggg 20

SEQ ID NO 35: 20 nt, DNA, artificial, Off-target sequence
ggagcctcca gtgactaggg 20

SEQ ID NO 36: 20 nt, DNA, artificial, Off-target sequence
gtgaactaca gtgactaggg 20

SEQ ID NO 37: 20 nt, DNA, artificial, Off-target sequence
ctggcctcca aagactaggg 20

SEQ ID NO 38: 22 nt, DNA, artificial, primer
tttcctagca agcagactca ga 22

SEQ ID NO 39: 21 nt, DNA, artificial, primer
agctgtcctt tgtcccattg a 21

SEQ ID NO 40: 21 nt, DNA, artificial, primer
tctccatgcc ctcctttcca t 21

SEQ ID NO 41: 23 nt, DNA, artificial, primer
ggatgtagtc catgatcttc ccc 23

SEQ ID NO 42: 22 nt, DNA, artificial, primer
tcccagaatg tgaaagtgga gg 22

SEQ ID NO 43: 20 nt, DNA, artificial, primer
ctgtgggctt tcctcagctc 20

SEQ ID NO 44: 20 nt, DNA, artificial, primer
gctgactaac gtccactgct 20

SEQ ID NO 45: 24 nt, DNA, artificial, primer
tggacctatg tttttcttcg tcac 24

SEQ ID NO 46: 21 nt, DNA, artificial, primer
aaagtctgtg gccttgtgag a 21

SEQ ID NO 47: 20 nt, DNA, artificial, primer
aaccctaccc cctacctgaa 20

SEQ ID NO 48: 21 nt, DNA, artificial, primer
ttccccaggt agttgctgtt c 21

SEQ ID NO 49: 21 nt, DNA, artificial, primer
tctgcacatg tcccaactgt c 21

SEQ ID NO 50: 21 nt, DNA, artificial, primer
atccgtacct aaccatgacc c 21

SEQ ID NO 51: 20 nt, DNA, artificial, primer
gcacagatct tggtggcttt 20

SEQ ID NO 52: 20 nt, DNA, artificial, primer
ggctgggttt cccaaacgta 20

SEQ ID NO 53: 20 nt, DNA, artificial, primer
caaactgctg tgttgggtgg 20

SEQ ID NO 54: 20 nt, DNA, artificial, primer
acttggaagg gtccacacaa 20

SEQ ID NO 55: 24 nt, DNA, artificial, primer
ccttgaatag agcatttttc ccca 24

SEQ ID NO 56: 20 nt, DNA, artificial, primer
tcctaccctt ggatggggtt 20

SEQ ID NO 57: 20 nt, DNA, artificial, primer
gggctacacg gtccctaaag 20

SEQ ID NO 58: 23 nt, DNA, artificial, target sequence
gtagccgggg aagcgaagca ggg 23

SEQ ID NO 59: 23 nt, DNA, artificial, target sequence
gctcacggac ggctcctacc tgg 23

SEQ ID NO 60: 21 nt, DNA, artificial, primer
ccccatcgtt ccatctcctc t 21

SEQ ID NO 61: 22 nt, DNA, artificial, primer
cgcgggttct tttggtatct tg 22

SEQ ID NO 62: 84 nt, DNA, artificial, primer
gtttaagagc tatgctgcga atacgagccg ccgaccagaa tcatgcaagt gcgtaagata 60
gtcgcgggtc ggcggctcgt attc 84

SEQ ID NO 63: 90 nt, DNA, artificial, primer
aaaagcaccg actcggtgcc actttttcaa gttgataacg gactagcctt atttaaactt 60
gctatgctgc gaatacgagc cgccgacccg 90

SEQ ID NO 64: 60 nt, DNA, artificial, primer
misc_feature (21)..(40): n is a, c, g, or t
ttaatacgac tcactatagg nnnnnnnnnn nnnnnnnnnn gtttaagagc tatgctgcga 60

SEQ ID NO 65: 90 nt, DNA, artificial, primer
gtttaagagc tatgctggaa acagcatagc aagtttaaat aaggctagtc cgttatcaac 60
ttcgaatacg agatgcggcc gccgaccaga 90

SEQ ID NO 66: 96 nt, DNA, artificial, primer
aaaaaaagca ccgactcggt gccacttttt ccgaatacga gatgcggccg ccgacccgcg 60
actatcttac gcacttgcat gattctggtc ggcggc 96

SEQ ID NO 67: 90 nt, DNA, artificial, primer
gtttaagagc tatgctggaa acagcatagc aagtttaaat aaggctagtc cgttatcaac 60
ttgaaaaagt ggcaccgagt cggtgccgaa 90

SEQ ID NO 68: 96 nt, DNA, artificial, protospacer
aaaaaaacga atacgagatg cggccgccga cccgcgacta tcttacgcac ttgcatgatt 60
ctggtcggcg gccgcatctc gtattcggca ccgact 96

SEQ ID NO 69: 20 nt, DNA, artificial, protospacer
aaaagcaccg actcggtgcc 20

SEQ ID NO 70: 175 nt, RNA, artificial, sgRNA
misc_feature (1)..(20): n is a, c, g, or u
nnnnnnnnnn nnnnnnnnnn guuuaagagc uaugcugcga auacgagccg ccgaccagaa 60
ucaugccaag ugcguaagau agucgcgggu cggcggcucg uauucgcagc auagcaaguu 120
uaaauaaggc uaguccguua ucaacuugaa aaaguggcac cgugacggug cuuuu 175

SEQ ID NO 71: 20 nt, DNA, artificial, protospacer
gctgaagcac tgcacgccat 20

SEQ ID NO 72: 20 nt, DNA, artificial, protospacer
ggagccgtac atgaactgag 20

SEQ ID NO 73: 20 nt, DNA, artificial, protospacer
ctcgttgtcc aggtaggccc 20

SEQ ID NO 74: 20 nt, DNA, artificial, protospacer
tggaccacca gctcctgtgg 20

SEQ ID NO 75: 41 nt, DNA, artificial, donor DNA
tgcagcctct cgttgtccag gtatggcccg ggtccactgc c 41

SEQ ID NO 76: 41 nt, DNA, artificial, Pompe gene edits
misc_feature (2)..(2): n is a, c, g, or t
misc_feature (29)..(29): n is a, c, g, or t
misc_feature (32)..(32): n is a, c, g, or t
tncagcctct cgttgtccag gtatggccng gntcaattgc t 41

SEQ ID NO 77: 41 nt, DNA, artificial, Pompe gene edits
misc_feature (2)..(2): n is a, c, g, or t
tncagcctct cgttgtccag gtatggcccg gatccactgc c 41

SEQ ID NO 78: 42 nt, DNA, artificial, Pompe gene edits
misc_feature (10)..(10): n is a, c, g, or t
misc_feature (13)..(13): n is a, c, g, or t
misc_feature (16)..(16): n is a, c, g, or t
ctcagacctn ttnttntcaa ggtaaggccc gggtccactg cc 42

SEQ ID NO 79: 41 nt, DNA, artificial, Pompe gene edits
misc_feature (2)..(2): n is a, c, g, or t
tncagcctct cgttgtccag gtatggcccg gatccactgc c 41

SEQ ID NO 80: 39 nt, DNA, artificial, donor DNA
cctggactgt ggaccaccag ctcctgtggg ggaggccct 39

SEQ ID NO 81: 39 nt, DNA, artificial, Pompe gene edits
misc_feature (28)..(28): n is a, c, g, or t
cctggactgt ggaccaccag ctcctgtngg ggaggccct 39

SEQ ID NO 82: 40 nt, DNA, artificial, Pompe gene edits
cctggactgt ggaccaccag ctcctgtggg gagaggccct 40
