Patent application title: METHODS AND PRODUCTS FOR PRODUCING ENGINEERED MAMMALIAN CELL LINES WITH AMPLIFIED TRANSGENES
Inventors:
IPC8 Class: AC12N1590FI
USPC Class:
1 1
Class name:
Publication date: 2019-08-15
Patent application number: 20190249199
Abstract:
Methods of inserting genes into defined locations in the chromosomal DNA
of cultured mammalian cell lines which are subject to gene amplification
are disclosed. In particular, sequences of interest (e.g., genes encoding
biotherapeutic proteins) are inserted proximal to selectable genes in
amplifiable loci, and the transformed cells are subjected to selection to
induce co-amplification of the selectable gene and the sequence of
interest. The invention also relates to meganucleases, vectors and
engineered cell lines necessary for performing the methods, to cell lines
resulting from the application of the methods, and use of the cell lines
to produce protein products of interest.Claims:
1-19. (canceled)
20. A method for inserting an exogenous sequence into an amplifiable locus of a mammalian cell comprising: (a) providing a mammalian cell having an endogenous target site proximal to a selectable gene within the amplifiable locus, wherein the endogenous target site comprises: (i) a recognition sequence for an engineered meganuclease; (ii) a 5' flanking region 5' to the recognition sequence; and (iii) a 3' flanking region 3' to the recognition sequence; and (b) introducing a double-stranded break between the 5' and 3' flanking regions of the endogenous target site; (c) contacting the cell with a donor vector comprising from 5' to 3': (i) a donor 5' flanking region homologous to the 5' flanking region of the endogenous target site; (ii) an exogenous sequence; and (iii) a donor 3' flanking region homologous to the 3' flanking region of the endogenous target site; whereby the donor 5' flanking region, the exogenous sequence and the donor 3' flanking region are inserted between the 5' and 3' flanking regions of the endogenous target site by homologous recombination to provide a modified cell.
21. The method of claim 20, further comprising growing the modified cell in the presence of a compound that inhibits the function of the selectable gene to amplify the copy number of the selectable gene.
22. The method of claim 20, wherein the exogenous sequence comprises a gene of interest.
23. The method of claim 20, wherein the endogenous target site is downstream from the 3' regulatory region of the selectable gene.
24. The method of claim 23, wherein the endogenous target site is 0 to 100,000 base pairs downstream from the 3' regulatory region of the selectable gene.
25. The method of claim 20, wherein the endogenous target site is upstream from the 5' regulatory region of the selectable gene.
26. The method of claim 25, wherein the endogenous target site is 0 to 100,000 base pairs upstream from the 5' regulatory region of the selectable gene.
27. The method of claim 20, wherein the selectable gene is glutamine synthetase (GS) and the locus is methionine sulphoximine (MSX) amplifiable.
28. The method of claim 20, wherein the selectable gene is dihydrofolate reductase (DHFR) and the locus is Methotrexate (MTX) amplifiable.
29. The method of claim 20, wherein the selectable gene is selected from the group consisting of Dihydrofolate Reductase, Glutamine Synthetase, Hypoxanthine Phosphoribosyltransferase, Threonyl tRNA Synthetase, Na,K-ATPase, Asparagine Synthetase, Ornithine Decarboxylase, Inosine-5'-monophosphate dehydrogenase, Adenosine Deaminase, Thymidylate Synthetase, Aspartate Transcarbamylase, Metallothionein, Adenylate Deaminase (1,2), UMP-Synthetase and Ribonucleotide Reductase.
30. The method of claim 29, wherein the selectable gene is amplifiable by selection with a selection agent selected from the group consisting of Methotrexate (MTX), Methionine sulphoximine (MSX), Aminopterin, hypoxanthine, thymidine, Borrelidin, Ouabain, Albizziin, Beta-aspartyl hydroxamate, alpha-difluoromethylornithine (DFMO), Mycophenolic Acid, Adenosine, Alanosine, 2'deoxycoformycin, Fluorouracil, N-Phosphonacetyl-L-Aspartate (PALA), Cadmium, Adenine, Azaserine, Coformycin, 6-azauridine, pyrazofuran, hydroxyurea, motexafin gadolinium, fludarabine, cladribine, gemcitabine, tezacitabine and triapine.
31-54. (canceled)
55. A recombinant meganuclease comprising a polypeptide having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 9.
56. The recombinant meganuclease of claim 55, having the sequence of the meganuclease of SEQ ID NO: 9.
57. A recombinant meganuclease which recognizes and cleaves a recognition site having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 7.
58. The recombinant meganuclease of claim 57, wherein the meganuclease recognizes and cleaves a recognition site of SEQ ID NO: 7.
59-70. (canceled)
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation of U.S. patent application Ser. No. 15/783,243, filed Oct. 13, 2017, which is a continuation of U.S. patent application Ser. No. 14/806,175, filed Jul. 22, 2015, which is a continuation of U.S. patent application Ser. No. 14/091,572, filed Nov. 27, 2013, which is a continuation of International Application No. PCT/US2012/040599, filed Jun. 1, 2012, which claims priority to U.S. Provisional application No. 61/492,174 filed Jun. 1, 2011, the disclosures of all of which are hereby incorporated by reference in their entireties for all purposes.
FIELD OF THE INVENTION
[0002] The invention relates to the field of molecular biology and recombinant nucleic acid technology. In particular, the invention relates to methods of inserting genes into defined locations in the chromosomal DNA of cultured mammalian cell lines which are subject to gene amplification. The invention also relates to meganucleases, vectors and engineered cell lines necessary for performing the methods, cell lines resulting from the application of the methods, and use of the cell lines to produce protein products of interest.
BACKGROUND OF THE INVENTION
[0003] Therapeutic proteins are the primary growth driver in the global pharmaceutical market (Kresse, Eur J Pharm Biopharm 72, 479 (2009)). In 2001, biopharmaceuticals accounted for $24.3 billion in sales. By 2007, this number had more than doubled to $54.5 billion. The market is currently estimated to reach $78 billion by 2012 (Pickering, Spectrum Pharmaceutical Industry Dynamics Report, Decision Resources, Inc., 5 (2008)). This includes sales of "blockbuster" drugs such as erythropoietin, tissue plasminogen activator, and interferon, as well as numerous "niche" drugs such as enzyme replacement therapies for lysosomal storage disorders. The unparalleled growth in market size, however, is driven primarily by skyrocketing demand for fully human and humanized monoclonal antibodies (Reichert, Curr Pharm Biotechnol 9, 423 (2008)). Because they have the ability to confer a virtually unlimited spectrum of biological activities, monoclonal antibodies are quickly becoming the most powerful class of therapeutics available to physicians. Not surprisingly, more than 25% of the molecules currently undergoing clinical trials in the United States and Europe are monoclonal antibodies (Reichert, Curr Pharm Biotechnol 9, 423 (2008)).
[0004] Unlike more traditional pharmaceuticals, therapeutic proteins are produced in living cells. This greatly complicates the manufacturing process and introduces significant heterogeneity into product formulations (Field, Recombinant Human IgG Production from Myeloma and Chinese Hamster Ovary Cells, in Cell Culture and Upstream Processing, Butler, ed., (Taylor and Francis Group, New York, 2007)). In addition, protein drugs are typically required at unusually high doses, which necessitates highly scalable manufacturing processes and makes manufacturing input costs a major price determinant. For these reasons, treatment with a typical therapeutic antibody (e.g., the anti-HER2-neu monoclonal Herceptin.RTM.) costs $60,000-$80,000 for a full course of treatment (Fleck, Hastings Center Report 36, 12 (2006)). Further complicating the economics of biopharmaceutical production is the fact that many of the early blockbuster biopharmaceuticals are off-patent (or will be off-patent soon) and the US and EU governments are expected to greatly streamline the regulatory approval process for "biogeneric" and "biosimilar" therapeutics (Kresse, Eur J Pharm Biopharm 72, 479 (2009)). These factors should lead to a significant increase in competition for sales of many prominent biopharmaceuticals (Pickering, Spectrum Pharmaceutical Industry Dynamics Report, Decision Resources, Inc., 5 (2008)). Therefore, there is enormous interest in technologies which reduce manufacturing costs of protein therapeutics (Seth et al., Curr Opin Biotechnol 18, 557 (2007)).
[0005] Many of the protein pharmaceuticals on the market are glycoproteins that cannot readily be produced in easy-to-manipulate biological systems such as bacteria or yeast. For this reason, recombinant therapeutic proteins are produced almost exclusively in mammalian cell lines, primarily Chinese hamster ovary (e.g., CHO-K1), mouse myeloma (e.g., NS0), baby hamster kidney (BHK), murine C127, human embryonic kidney (e.g., HEK-293), or human retina-derived (e.g., PER-C6) cells (Andersen and Krummen, Curr Opin Biotechnol 13, 117 (2002)). Of these, CHO cells are, by far, the most common platform for bioproduction because they offer the best combination of high protein expression levels, short doubling time, tolerance to a wide range of media conditions, established transfection and amplification protocols, an inability to propagate most human pathogens, a paucity of blocking intellectual property, and the longest track record of FDA approval (Field, Recombinant Human IgG Production from Myeloma and Chinese Hamster Ovary Cells, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)).
[0006] Large-market biopharmaceuticals are typically produced in enormous stirred-tank bioreactors containing hundreds of liters of CHO cells stably expressing the protein product of interest (Chu and Robinson, Curr Opin Biotechnol 12, 180 (2001), Coco-Martin and Harmsen, Bioprocess International 6, 28 (2008)). Under optimized industrial conditions, such manufacturing processes can yield in excess of 5 g of protein per liter of cells per day (Coco-Martin and Harmsen, Bioprocess International 6, 28 (2008)). Because of the large number of cells involved (.about.50 billion cells per liter), the level of protein expression per cell has a very dramatic effect on yield. For this reason, all of the cells involved in the production of a particular biopharmaceutical must be derived from a single "high-producer" clone, the production of which constitutes one of the most time- and resource-intensive steps in the manufacturing process (Clarke and Compton, Bioprocess International 6, 24 (2008)).
[0007] The first step in the large-scale manufacture of a biopharmaceutical is the transfection of mammalian cells with plasmid DNA encoding the protein product of interest under the control of a strong constitutive promoter. Stable transfectants are selected by using a selectable marker gene also carried on the plasmid. Most frequently, this marker is a dihydrofolate reductase (DHFR) gene which, when transfected into a DHFR deficient cell line such as DG44, allows for the selection of stable transfectants using media deficient in hypoxanthine. The primary reason for using DHFR as a selectable marker is that it enables a process called "gene amplification". By growing stable transfectants in gradually increasing concentrations of methotrexate (MTX), a DHFR inhibitor, it is possible to amplify the number of copies of the DHFR gene present in the genome. Because the gene encoding the protein product of interest is physically coupled to the DHFR gene, this results in amplification of both genes with a concomitant increase in the expression level of the therapeutic protein (Butler, Cell Line Development for Culture Strategies: Future Prospects to Improve Yields, in Cell Culture and Upstream Processing, Butler, ed., (Taylor and Francis Group, New York, 2007)). Related systems for the creation of stable bioproduction lines use the glutamine synthetase (GS) or hypoxanthine phosphoribosyltransferase (HPRT) genes as selectable markers and require the use of GS- or HPRT-deficient cell lines as hosts for transfection (Clarke and Compton, Bioprocess International 6, 24 (2008)). In the case of the GS system, gene amplification is accomplished by growing cells in the presence of methionine sulphoximine (MSX) (Clarke and Compton, Bioprocess International 6, 24 (2008)). In the case of the HPRT system, gene amplification is accomplished by growing cells in HAT medium, which contains aminopterin, hypoxanthine, and thymidine (Kellems, ed. Gene amplification in mammalian cells: a comprehensive guide, Marcel Dekker, New York, 1993).
[0008] In all of these systems, the initial plasmid DNA comprising a biotherapeutic gene expression cassette and a selectable marker integrates into a random location in the genome, resulting in extreme variability in therapeutic protein expression from one stable transfectant to another (Collingwood and Urnov, Targeted Gene Insertion to Enhance Protein Production from Cell Lines, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). For this reason, it is necessary to screen hundreds to thousands of initial transfectants to identify cells which express acceptably high levels of gene product both before and after gene amplification (Butler, Cell Line Development for Culture Strategies: Future Prospects to Improve Yields, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). A second and more problematic consequence of random gene integration is the phenomenon of transgene silencing, in which recombinant protein expression slows or ceases entirely over time (Collingwood and Urnov, Targeted Gene Insertion to Enhance Protein Production from Cell Lines, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). Because these effects often do not manifest themselves for weeks to months following the initial transfection and screening process, it is generally necessary to carry and expand dozens of independent clonal lines to identify one that expresses the protein of interest consistently over time (Butler, Cell Line Development for Culture Strategies: Future Prospects to Improve Yields, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)).
[0009] This large number of screening and expansion steps results in a very lengthy and expensive process to simply generate the cell line that will, ultimately, produce the therapeutic of interest. Indeed, using conventional methods, a minimum of 10 months (with an average of 18 months) and an upfront investment of tens of millions of dollars in labor and material is required to produce an initial pool of protein-expressing cells suitable for industrial manufacturing (Butler, Cell Line Development for Culture Strategies: Future Prospects to Improve Yields, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). If one takes into account lost time on market for a blockbuster protein therapeutic, inefficiencies in cell line production can cost biopharmaceutical manufacturers hundreds of millions of dollars (Seth et al., Curr Opin Biotechnol 18, 557 (2007)).
[0010] Much of the time and expense of bioproduction cell line creation can be attributed to random genomic integration of the bioproduct gene resulting in clone-to-clone variability in genotype and, hence, variability in gene expression. One way to overcome this is to target gene integration to a defined location that is known to support a high level of gene expression. To this end, a number of systems have been described which use the Cre, Flp, or .PHI.C31 recombinases to target the insertion of a bioproduct gene (reviewed in Collingwood and Urnov, Targeted Gene Insertion to Enhance Protein Production from Cell Lines, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). Recent embodiments of these systems, most notably the Flp-In.RTM. system marketed by Invitrogen Corp. (Carlsbad, Calif.), couple bioproduct gene integration with the reconstitution of a split selectable marker so that cells with correctly targeted genes can be selected. As expected, these systems result in greatly reduced heterogeneity in gene expression and, in some cases, individual stable transfectants can be pooled, obviating the time and expense associated with expanding a single clone.
[0011] The principal drawback to recombinase-based gene targeting systems is that the recombinase recognition sites (loxP, FRT, or attB/attP sites) do not naturally occur in mammalian genomes. Therefore, cells must be pre-engineered to incorporate a recognition site for the recombinase before that site can be subsequently targeted for gene insertion. Because the recombinase site itself integrates randomly into the genome, it is still necessary to undertake extensive screening and evaluation to identify clones which carry the site at a location that is suitable for high level, long-term gene expression (Collingwood and Urnov, Targeted Gene Insertion to Enhance Protein Production from Cell Lines, in Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). In addition, the biomanufacturing industry is notoriously hesitant to adopt "new" cell lines, such as those that have been engineered to carry a recombinase site, that do not have a track record of FDA approval. For these reasons, recombinase-based cell engineering systems may not readily be adopted by the industry and an approach that allows biomanufacturers to utilize their existing cell lines is preferable.
SUMMARY OF THE INVENTION
[0012] The present invention depends, in part, upon the development of mammalian cell lines in which sequences of interest (e.g., exogenous, actively transcribed transgenes) are inserted proximal to an endogenous selectable gene in an amplifiable locus, and the discovery that (a) the insertion of such exogenous sequences of interest does not inhibit amplification of the endogenous selectable gene, (b) the exogenous sequence of interest can be co-amplified with the endogenous selectable gene, and (c) the resultant cell lines, with an amplified region comprising multiple copies of the endogenous selectable gene and the exogenous sequence of interest, are stable for extended periods even in the absence of the selection regime which was employed to induce amplification. Thus, in one aspect, the invention provides a method for producing cell lines which can be used for biomanufacturing of a protein product of interest by specifically targeting the insertion of an exogenous sequence of interest capable of actively expressing the protein product of interest proximal to an endogenous selectable gene. In another aspect, the invention provides engineered cell lines that can be used to produce protein products of interest (e.g., therapeutic proteins such as monoclonal antibodies) at high levels.
[0013] It is understood that any of the embodiments described below can be combined in any desired way, and any embodiment or combination of embodiments can be applied to each of the aspects described below, unless the context indicates otherwise.
[0014] In one aspect, the invention provides a recombinant mammalian cell comprising an engineered target site stably integrated within selectable gene within an amplifiable locus, wherein the engineered target site disrupts the function of the selectable gene and wherein the engineered target site comprises a recognition sequence for a site specific endonuclease.
[0015] In some embodiments, the selectable gene is glutamine synthetase (GS) and the locus is methionine sulphoximine (MSX) amplifiable. In some embodiments, the selectable gene is dihydrofolate reductase (DHFR) and the locus is Methotrexate (MTX) amplifiable.
[0016] In some embodiments, the selectable gene is selected from the group consisting of Dihydrofolate Reductase, Glutamine Synthetase, Hypoxanthine Phosphoribosyltransferase, Threonyl tRNA Synthetase, Na,K-ATPase, Asparagine Synthetase, Ornithine Decarboxylase, Inosine-5'-monophosphate dehydrogenase, Adenosine Deaminase, Thymidylate Synthetase, Aspartate Transcarbamylase, Metallothionein, Adenylate Deaminase (1,2), UMP-Synthetase and Ribonucleotide Reductase.
[0017] In some embodiments, the selectable gene is amplifiable by selection with a selection agent selected from the group consisting of Methotrexate (MTX), Methionine sulphoximine (MSX), Aminopterin, hypoxanthine, thymidine, Borrelidin, Ouabain, Albizziin, Beta-aspartyl hydroxamate, alpha-difluoromethylornithine (DFMO), Mycophenolic Acid, Adenosine, Alanosine, 2'deoxycoformycin, Fluorouracil, N-Phosphonacetyl-L-Aspartate (PALA), Cadmium, Adenine, Azaserine, Coformycin, 6-azauridine, pyrazofuran, hydroxyurea, motexafin gadolinium, fludarabine, cladribine, gemcitabine, tezacitabine and triapine.
[0018] In some embodiments, the engineered target site is inserted into an exon of the selectable gene. In some embodiments, the site specific endonuclease is a meganuclease, a zinc finger nuclease or TAL effector nuclease. In some embodiment, the recombinant cell further comprises the site specific endonuclease.
[0019] In one aspect, the invention provides a recombinant mammalian cell comprising an engineered target site stably integrated proximal to a selectable gene within an amplifiable locus, wherein the engineered target site comprises a recognition sequence for a site specific endonuclease.
[0020] In some embodiments, the engineered target site is downstream from the 3' regulatory region of the selectable gene. In some embodiments, the engineered target site is 0 to 100,000 base pairs downstream from the 3' regulatory region of the selectable gene. In other embodiments, the engineered target site is upstream from the 5' regulatory region of the selectable gene. In some embodiments, the engineered target site is 0 to 100,000 base pairs upstream from the 5' regulatory region of the selectable gene.
[0021] In another aspect, the invention provides a method for inserting an exogenous sequence into an amplifiable locus of a mammalian cell comprising: (a) providing a mammalian cell having an endogenous target site proximal to a selectable gene within the amplifiable locus, wherein the endogenous target site comprises: (i) a recognition sequence for an engineered meganuclease; (ii) a 5' flanking region 5' to the recognition sequence; and (iii) a 3' flanking region 3' to the recognition sequence; and (b) introducing a double-stranded break between the 5' and 3' flanking regions of the endogenous target site; (c) contacting the cell with a donor vector comprising from 5' to 3': (i) a donor 5' flanking region homologous to the 5' flanking region of the endogenous target site; (ii) an exogenous sequence; and (iii) a donor 3' flanking region homologous to the 3' flanking region of the endogenous target site; whereby the donor 5' flanking region, the exogenous sequence and the donor 3' flanking region are inserted between the 5' and 3' flanking regions of the endogenous target site by homologous recombination to provide a modified cell.
[0022] In some embodiments, the method further comprises growing the modified cell in the presence of a compound that inhibits the function of the selectable gene to amplify the copy number of the selectable gene. In some embodiments, the exogenous sequence comprises a gene of interest.
[0023] In some embodiments endogenous target site is downstream from the 3' regulatory region of the selectable gene. In some embodiments, the endogenous target site is 0 to 100,000 base pairs downstream from the 3' regulatory region of the selectable gene. In other embodiments, the endogenous target site is upstream from the 5' regulatory region of the selectable gene. In some embodiments, the endogenous target site is 0 to 100,000 base pairs upstream from the 5' regulatory region of the selectable gene.
[0024] In one aspect, the invention provides a method for inserting an exogenous sequence into an amplifiable locus of a mammalian cell comprising: (a) providing a mammalian cell having an endogenous target site proximal to a selectable gene within the amplifiable locus, wherein the endogenous target site comprises: (i) a recognition sequence for an engineered meganuclease; (ii) a 5' flanking region 5' to the recognition sequence; and (iii) a 3' flanking region 3' to the recognition sequence; and (b) introducing a double-stranded break between the 5' and 3' flanking regions of the endogenous target site; (c) contacting the cell with an engineered target site donor vector comprising from 5' to 3': (i) a donor 5' flanking region homologous to the 5' flanking region of the endogenous target site; (ii) an exogenous sequence comprising an engineered target site; and (iii) a donor 3' flanking region homologous to the 3' flanking region of the endogenous target site; whereby the donor 5' flanking region, the exogenous sequence and the donor 3' flanking region are inserted between the 5' and 3' flanking regions of the endogenous target site by homologous recombination to provide a mammalian cell comprising the engineered target site; (d) introducing a double-stranded break between the 5' and 3' flanking regions of the engineered target site; (e) contacting the cell comprising the engineered target site with a sequence of interest donor vector comprising from 5' to 3': (i) a donor 5' flanking region homologous to the 5' flanking region of the engineered target site; (ii) an exogenous sequence comprising a sequence of interest; and (iii) a donor 3' flanking region homologous to the 3' flanking region of the engineered target site; whereby the donor 5' flanking region, the exogenous sequence comprising the sequence of interest and the donor 3' flanking region are inserted between the 5' and 3' flanking regions of the engineered target site by homologous recombination to provide an engineered mammalian cell comprising the sequence of interest.
[0025] In some embodiments, the method further comprises growing the engineered mammalian cell in the presence of a compound that inhibits the function of the selectable gene to amplify the copy number of the selectable gene. In some embodiments, the sequence of interest comprises a gene.
[0026] In another aspect, the invention provides a method for inserting an exogenous sequence into an amplifiable locus of a mammalian cell comprising: (a) providing a mammalian cell having an endogenous target site within a selectable gene within the amplifiable locus, wherein the endogenous target site comprises: (i) a recognition sequence for an engineered meganuclease; (ii) a 5' flanking region 5' to the recognition sequence; and (iii) a 3' flanking region 3' to the recognition sequence; and (b) introducing a double-stranded break between the 5' and 3' flanking regions of the endogenous target site; (c) contacting the cell with an engineered target site donor vector comprising from 5' to 3': (i) a donor 5' flanking region homologous to the 5' flanking region of the endogenous target site; (ii) an exogenous sequence comprising an engineered target site; and (iii) a donor 3' flanking region homologous to the 3' flanking region of the endogenous target site; whereby the donor 5' flanking region, the exogenous sequence and the donor 3' flanking region are inserted between the 5' and 3' flanking regions of the endogenous target site by homologous recombination to provide a mammalian cell comprising the engineered target site; (d) introducing a double-stranded break between the 5' and 3' flanking regions of the engineered target site; (e) contacting the cell comprising the engineered target site with a sequence of interest donor vector comprising from 5' to 3': (i) a donor 5' flanking region homologous to the 5' flanking region of the engineered target site; (ii) an exogenous sequence comprising a sequence of interest; and (iii) a donor 3' flanking region homologous to the 3' flanking region of the engineered target site; whereby the donor 5' flanking region, the exogenous sequence comprising the sequence of interest and the donor 3' flanking region are inserted between the 5' and 3' flanking regions of the engineered target site by homologous recombination to provide a engineered mammalian cell comprising the sequence of interest.
[0027] In some embodiments, the method further comprises growing the engineered mammalian cell in the presence of a compound that inhibits the function of the selectable gene to amplify the copy number of the selectable gene.
[0028] In some embodiments, the sequence of interest comprises a gene.
[0029] In some embodiments, the endogenous target site is within an intron of the selectable gene. In other embodiments, the endogenous target site is within an exon of the selectable gene.
[0030] In one aspect, the invention provides a recombinant meganuclease comprising a polypeptide having at least 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 15.
[0031] In another aspect, the invention provides a recombinant meganuclease comprising the amino acid sequence of SEQ ID NO: 15.
[0032] In another aspect, the invention provides a recombinant meganuclease which recognizes and cleaves a recognition site having 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 14. In one embodiment, the meganuclease recognizes and cleaves a recognition site of SEQ ID NO: 14.
[0033] In another aspect, the invention provides a recombinant meganuclease comprising a polypeptide having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 9. In one embodiment, the recombinant meganuclease has the sequence of the meganuclease of SEQ ID NO: 9.
[0034] In another aspect, the invention provides a recombinant meganuclease which recognizes and cleaves a recognition site having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 7. In one embodiment, the meganuclease recognizes and cleaves a recognition site of SEQ ID NO: 7.
[0035] In another aspect, the invention provides a recombinant meganuclease comprising a polypeptide having at least 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 10. In one embodiment, the recombinant meganuclease comprises the polypeptide of SEQ ID NO: 10.
[0036] In another aspect, the invention provides a recombinant meganuclease which recognizes and cleaves a recognition site having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 8. In one embodiment, the meganuclease recognizes and cleaves a recognition site of SEQ ID NO: 8.
[0037] In another aspect, the invention provides a recombinant meganuclease comprising a polypeptide having at least 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 13. In one embodiment, rhe recombinant meganuclease comprises the polypeptide of SEQ ID NO: 13.
[0038] In another aspect, the invention provides a recombinant meganuclease which recognizes and cleaves a recognition site having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 12. In one embodiment, the meganuclease recognizes and cleaves a recognition site of SEQ ID NO: 12.
[0039] In another aspect, the invention provides a recombinant meganuclease comprising a polypeptide having at least 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 29. In one embodiment, the recombinant meganuclease comprises the polypeptide of SEQ ID NO: 29.
[0040] In another aspect, the invention provides a recombinant meganuclease which recognizes and cleaves a recognition site having at least 75%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity to SEQ ID NO: 30. In one embodiment, the meganuclease recognizes and cleaves a recognition site of SEQ ID NO: 30.
[0041] In another aspect, the invention provides recombinant mammalian cell lines which continue to express a protein product of interest from an exogenous sequence of interest present in an amplified region of the genome (i.e., present in 2-1,000 copies, co-amplified with a selectable gene in an amplifiable locus) for a period of at least 8, 9, 10, 11, 12, 13, or 14 weeks after removal of the amplification selection agent, and with a reduction of expression levels and/or copy number of less than 20, 25, 30, 35 or 40%.
[0042] In another aspect, the invention provides methods of producing recombinant cells with amplified regions including a sequence of interest and a selectable gene by subjecting the above-described recombinant cells to selection with a selection agent which causes co-amplification of the sequence of interest and the selectable gene.
[0043] In another aspect, the invention provides methods of producing a protein product of interest by culturing the above-described recombinant cells, or the above-described recombinant cells with amplified regions, and obtaining the protein product of interest from the culture medium or a cell lysate.
BRIEF DESCRIPTION OF THE FIGURES
[0044] FIG. 1. A general strategy for targeting a sequence of interest to an amplifiable locus.
[0045] FIGS. 2A and 2B. (A) Schematic of the CHO DHFR locus showing a preferred region for targeting a sequence of interest 5,000-60,000 base pairs downstream of the DHFR gene. (B) Schematic of the CHO GS locus showing a preferred region for targeting a sequence of interest 5,000-55,000 base pairs downstream of the GS gene.
[0046] FIG. 3. Strategy for inserting a sequence of interest into an amplifiable locus in a two-step process involving a pre-integrated engineered target sequence.
[0047] FIG. 4. Strategy for inserting an engineered target sequence into an amplifiable locus with concomitant removal of a portion of the selectable gene, followed by insertion of a sequence of interest and reconstitution of the selectable gene.
[0048] FIG. 5. Strategy for inserting an engineered target sequence into an amplifiable locus with concomitant disruption of the coding sequence of a selectable gene, followed by insertion of a sequence of interest and reconstitution of the selectable gene.
[0049] FIG. 6. Strategy for inserting an engineered target sequence into an amplifiable locus with concomitant disruption of the mRNA processing, followed by insertion of a sequence of interest and reconstitution of the selectable gene.
[0050] FIGS. 7A through 7D. (A) A direct-repeat recombination assay for site-specific endonuclease activity. (B) Results of the assay in (A) applied to the CHO-23/24 and CHO-51/52 meganucleases. (C) Alignment of sequences obtained from CHO cells transfected with mRNA encoding the CHO-23/24 meganuclease (SEQ ID NOS 37-39, 38, 40, 38, and 38, respectively, in order of appearance). (D) Alignment of sequences obtained from CHO cells transfected with mRNA encoding the CHO-51/52 meganuclease (SEQ ID NOS 41-51, respectively, in order of appearance).
[0051] FIGS. 8A and 8B. (A) Strategy for inserting an exogenous DNA sequence into the CHO DHFR locus using the CHO-51/52 meganuclease. (B) PCR products demonstrating insertion of an engineered target sequence.
[0052] FIGS. 9A through 9C. (A) Strategy for inserting an engineered target sequence into the CHO DHFR locus using the CHO-23/24 meganuclease, followed by Flp recombinase-mediated insertion of a sequence of interest. (B) PCR products from hygromycin-resistant clones produced in (A). (C) GFP expression by the 24 clones produced in (B).
[0053] FIGS. 10A through 10C. Results of experiments with a GFP-expressing CHO line produced by integrating a GFP gene expression cassette into the DHFR locus using a target sequence strategy as shown in FIG. 9.
[0054] FIGS. 11A through 11C. (A) A direct-repeat recombination assay, as in FIG. 5A. (B) The assay in (A) applied to the CHO-13/14 and CGS-5/6 meganucleases. (C) Alignment of sequences obtained from CHO cells transfected with mRNA encoding the CGS-5/6 meganuclease (SEQ ID NOS 52-56, 56, 56-63, 63, 63, and 63-64, respectively, in order of appearance).
DETAILED DESCRIPTION OF THE INVENTION
1.1 Introduction
[0055] The present invention depends, in part, upon the development of mammalian cell lines in which exogenous actively transcribed transgenes have been inserted proximal to an endogenous amplifiable locus, and the discovery that (a) the insertion of such exogenous actively transcribed transgenes does not prevent or substantially inhibit amplification of the endogenous amplifiable locus, (b) the exogenous actively transcribed transgene can be co-amplified with the endogenous amplifiable locus, and (c) the resultant cell line, with an amplified region comprising multiple copies of the endogenous amplifiable locus and the exogenous actively transcribed transgene is stable for extended periods even in the absence of the selection regime which was employed to induce amplification. Thus, in one aspect, the invention provides a method for producing cell lines which can be used for biomanufacturing of a protein product of interest by specifically targeting the insertion of an exogenous gene capable of actively expressing the protein product of interest proximal to an endogenous amplifiable locus. In another aspect, the invention provides engineered cell lines that can be used to produce protein products of interest (e.g., therapeutic proteins such as monoclonal antibodies) at high levels.
1.2 References and Definitions
[0056] The patent and scientific literature referred to herein establishes knowledge that is available to those of skill in the art. The entire disclosures of the issued U.S. patents, pending applications, published foreign applications, and scientific and technical references cited herein, including protein and nucleic acid database sequences, are hereby incorporated by reference to the same extent as if each was specifically and individually indicated to be incorporated by reference.
[0057] As used herein, the term "meganuclease" refers to naturally-occurring homing endonucleases (also referred to as Group I intron encoded endonucleases) or non-naturally-occurring (e.g., rationally designed or engineered) endonucleases based upon the amino acid sequence of a naturally-occurring homing endonuclease. Examples of naturally-occurring meganucleases include I-SceI, I-CreI, I-CeuI, I-DmoI, I-MsoI, I-AniI, etc. Rationally designed meganucleases are disclosed in, for example, WO 2007/047859 and WO 2009/059195, and can be engineered to have modified DNA-binding specificity, DNA cleavage activity, DNA-binding affinity, or dimerization properties relative to a naturally occurring meganuclease. A meganuclease may bind to double-stranded DNA as a homodimer (e.g., wild-type I-CreI), or it may bind to DNA as a heterodimer (e.g., engineered meganucleases disclosed in WO 2007/047859). An engineered meganuclease may also be a "single-chain meganuclease" in which a pair of DNA-binding domains derived from a natural meganuclease are joined into a single polypeptide using a peptide linker (e.g., single-chain meganucleases disclosed in WO 2009/059195).
[0058] As used herein, the term "single-chain meganuclease" refers to a polypeptide comprising a pair of meganuclease subunits joined by a linker. A single-chain meganuclease has the organization: N-terminal subunit-Linker-C-terminal subunit. The two meganuclease subunits will generally be non-identical in amino acid sequence and will recognize non-identical DNA sequences. Thus, single-chain meganucleases typically cleave pseudo-palindromic or non-palindromic recognition sequences. Methods of producing single-chain meganucleases are disclosed in WO 2009/059195.
[0059] As used herein, the term "site specific endonuclease" means a meganuclease, zinc-finger nuclease or TAL effector nuclease.
[0060] As used herein, with respect to a protein, the term "recombinant" means having an altered amino acid sequence as a result of the application of genetic engineering techniques to nucleic acids which encode the protein, and cells or organisms which express the protein. With respect to a nucleic acid, the term "recombinant" means having an altered nucleic acid sequence as a result of the application of genetic engineering techniques. Genetic engineering techniques include, but are not limited to, PCR and DNA cloning technologies; transfection, transformation and other gene transfer technologies; homologous recombination; site-directed mutagenesis; and gene fusion. In accordance with this definition, a protein having an amino acid sequence identical to a naturally-occurring protein, but produced by cloning and expression in a heterologous host, is not considered recombinant. As used herein, the term "engineered" is synonymous with the term "recombinant."
[0061] As used herein, with respect to a meganuclease, the term "wild-type" refers to any naturally-occurring form of a meganuclease. The term "wild-type" is not intended to mean the most common allelic variant of the enzyme in nature but, rather, any allelic variant found in nature. Wild-type homing endonucleases are distinguished from recombinant or non-naturally-occurring meganucleases.
[0062] As used herein, the term "recognition sequence" refers to a DNA sequence that is bound and cleaved by a meganuclease. A recognition sequence comprises a pair of inverted, 9 base pair "half sites" which are separated by four base pairs. In the case of a homo- or heterodimeric meganucleases, each of the two monomers makes base-specific contacts with one half-site. In the case of a single-chain heterodimer meganuclease, the N-terminal domain of the protein contacts a first half-site and the C-terminal domain of the protein contacts a second half-site. In the case if I-CreI, for example, the recognition sequence is 22 base pairs and comprises a pair of inverted, 9 base pair "half sites" which are separated by four base pairs.
[0063] As used herein, the term "target site" refers to a region of the chromosomal DNA of a cell comprising a target sequence into which a sequence of interest can be inserted. As used herein, the term "engineered target site" refers to an exogenous sequence of DNA integrated into the chromosomal DNA of a cell comprising an engineered target sequence into which a sequence of interest can be inserted.
[0064] As used herein, the term "target sequence" means a DNA sequence within a target site which includes one or more recognition sequences for a nuclease, integrase, transposase, and/or recombinase. For example, a target sequence can include a recognition sequence for a meganuclease. As used herein, an "engineered target sequence" means an exogenous target sequence which is introduced into a chromosome to serve as the insertion point for another sequence.
[0065] As used herein, the term "flanking region" or "flanking sequence" refers to a sequence of >3 or, preferably, >50 or, more preferably, >200 or, most preferably, >400 base pairs of DNA which is immediately 5' or 3' to a reference sequence (e.g., a target sequence or sequence of interest).
[0066] As used herein, the terms "amplifiable locus" refers to a region of the chromosomal DNA of a cell which can be amplified by selection with one or more compounds (e.g., drugs) in the growth media. An amplifiable locus will typically comprise a gene encoding a protein which, under the appropriate conditions, is necessary for cell survival. By inhibiting the function of such an essential protein, for example with a small molecule drug, the amplifiable locus is duplicated many times over as a means of increasing the copy number of the essential gene. A gene of interest, if integrated into an amplifiable locus, will also become duplicated with the essential gene. Examples of amplifiable loci include the chromosomal regions comprising the DHFR, GS, and HPRT genes.
[0067] As used herein, the term "amplified locus" or "amplified gene" or "amplified sequence" refers to a locus, gene or sequence which is present in 2-1,000 copies as a result of gene amplification in response to selection of a selectable gene. An amplified gene or sequence can be a gene or sequence which is co-amplified due to selection of a selectable gene in the same amplifiable locus. In preferred embodiments, a sequence of interest is amplified to at least 3, 4, 5, 6, 7, 8, 9 or 10 copies.
[0068] As used herein, the term "selectable gene" refers to an endogenous gene that is essential for cell survival under some specific culture conditions (e.g., presence or absence of a nutrient, toxin or drug). Selectable genes are endogenous to the cell and are distinguished from exogenous "selectable markers" such as antibiotic resistance genes. Selectable genes exist in their natural context in the chromosomal DNA of the cell. For example, DHFR is a selectable gene which is necessary for cell survival in the presence of MTX in the culture medium. The gene is essential for growth in the absence of hypoxanthine and thymidine. If the endogenous DHFR selectable gene is eliminated, cells are able to grow in the absence of hypoxanthine and thymidine if they are given an exogenous copy of the DHFR gene. This exogenous copy of the DHFR gene is a selectable marker but is not a selectable gene. An amplifiable locus comprises a selectable gene and a target site. A target site is found outside of a selectable gene such that a selectable gene does not comprise a target site. Examples of selectable genes are given in Table 1.
[0069] As used herein, when used in connection with the position of a target site, recognition sequence, or inserted sequence of interest relative to the position of a selectable gene, the term "proximal" means that the target site, recognition sequence, or inserted sequence of interest is within the same amplifiable locus as the selectable gene, either upstream (5') or downstream (3') of the selectable gene, and preferably between the selectable gene and the next gene in the region (whether upstream (5') or downstream (3')). Typically, a "proximal" target site, recognition sequence, or inserted sequence of interest will be within <100,000 base pairs of the selectable gene, as measured from the first or last nucleotide of the first or last regulatory element of the selectable gene.
[0070] As used herein, the term "homologous recombination" refers to the natural, cellular process in which a double-stranded DNA-break is repaired using a homologous DNA sequence as the repair template (see, e.g. Cahill et al. (2006), Front. Biosci. 11:1958-1976). The homologous DNA sequence may be an endogenous chromosomal sequence or an exogenous nucleic acid that was delivered to the cell. Thus, for some applications of engineered meganucleases, a meganuclease is used to cleave a recognition sequence within a target sequence in a genome and an exogenous nucleic acid with homology to or substantial sequence similarity with the target sequence is delivered into the cell and used as a template for repair by homologous recombination. The DNA sequence of the exogenous nucleic acid, which may differ significantly from the target sequence, is thereby inserted or incorporated into the chromosomal sequence. The process of homologous recombination occurs primarily in eukaryotic organisms. The term "homology" is used herein as equivalent to "sequence similarity" and is not intended to require identity by descent or phylogenetic relatedness.
[0071] As used herein, the term "stably integrated" means that an exogenous or heterologous DNA sequence has been covalently inserted into a chromosome (e.g., by homologous recombination, non-homologous end joining, transposition, etc.) and has remained in the chromosome for a period of at least 8 weeks.&&
[0072] As used herein, the term "non-homologous end-joining" or "NHEJ" refers to the natural, cellular process in which a double-stranded DNA-break is repaired by the direct joining of two non-homologous DNA segments (see, e.g. Cahill et al. (2006), Front. Biosci. 11:1958-1976). DNA repair by non-homologous end-joining is error-prone and frequently results in the untemplated addition or deletion of DNA sequences at the site of repair. Thus, for certain applications, an engineered meganuclease can be used to produce a double-stranded break at a meganuclease recognition sequence within an amplifiable locus and an exogenous nucleic acid molecule, such as a PCR product, can be captured at the site of the DNA break by NHEJ (see, e.g. Salomon et al. (1998), EMBO J. 17:6086-6095). In such cases, the exogenous nucleic acid may or may not have homology to the target sequence. The process of non-homologous end-joining occurs in both eukaryotes and prokaryotes such as bacteria.
[0073] As used herein, the term "sequence of interest" means any nucleic acid sequence, whether it codes for a protein, RNA, or regulatory element (e.g., an enhancer, silencer, or promoter sequence), that can be inserted into a genome or used to replace a genomic DNA sequence. Sequences of interest can have heterologous DNA sequences that allow for tagging a protein or RNA that is expressed from the sequence of interest. For instance, a protein can be tagged with tags including, but not limited to, an epitope (e.g., c-myc, FLAG) or other ligand (e.g., poly-His). Furthermore, a sequence of interest can encode a fusion protein, according to techniques known in the art (see, e.g., Ausubel et al., Current Protocols in Molecular Biology, Wiley 1999). In preferred embodiments, a sequence of interest comprises a promoter operably linked to a gene encoding a protein of medicinal value such as an antibody, antibody fragment, cytokine, growth factor, hormone, or enzyme. For some applications, the sequence of interest is flanked by a DNA sequence that is recognized by the engineered meganuclease for cleavage. Thus, the flanking sequences are cleaved allowing for proper insertion of the sequence of interest into genomic recognition sequences cleaved by an engineered meganuclease. For some applications, the sequence of interest is flanked by DNA sequences with homology to or substantial sequence similarity with the target site such that homologous recombination inserts the sequence of interest within the genome at the locus of the target sequence.
[0074] As used herein, the term "donor DNA" refers to a DNA molecule comprising a sequence of interest flanked by DNA sequences homologous to a target site. Donor DNA can serve as a template for DNA repair by homologous recombination if it is delivered to a cell with a site-specific nuclease such as a meganuclease, zinc-finger nuclease, or TAL-effector nuclease. The result of such DNA repair is the insertion of the sequence of interest into the chromosomal DNA of the cell. Donor DNA can be linear, such as a PCR product, or circular, such as a plasmid. In cases where a donor DNA is a circular plasmid, it may be referred to as a "donor plasmid."
[0075] As used herein, unless specifically indicated otherwise, the word "or" is used in the inclusive sense of "and/or" and not the exclusive sense of "either/or."
2.1 Transgene Targeting to Amplifiable Loci
[0076] The present invention provides methods for generating transgenic mammalian cell lines expressing a desired protein product of interest, including "high-producer" cell lines, by targeting the insertion of a gene encoding the protein product of interest (e.g., a therapeutic protein gene expression cassette) to regions of the genome that are amplifiable. Such regions in mammalian cells include the DHFR, GS, and HPRT genes, as well as others shown in Table 1.
[0077] The precise mechanism of gene amplification is not known. Indeed, it is very likely that there is no single mechanism by which gene amplification occurs but that a variety of different random chromosomal aberrations, in combination with strong selection for amplification, results in increased gene copy number (reviewed in Omasa (2002), J. Biosci. Bioeng. 94:600-605). It is clear that chromosomal location plays a major role in amplification and the stable maintenance of amplified genes (Brinton and Heintz (1995), Chromosoma 104:143-51). It has been found that transgenes integrated into chromosomal locations adjacent to telomeres are more easily amplified and, once amplified, tend to be stable at high copy numbers after the selection agent is removed (Yoshikawa et al. (2000), Cytotechnology 33:37-46; Yoshikawa et al. (2000), Biotechnol Prog. 16:710-715). This is significant because selection agents such as MTX and MSX are toxic and cannot be included in the growth media in a commercial biomanufacturing process. In contrast, transgenes integrated into regions in the CHO genome that are not adjacent to telomeres amplify inefficiently and rapidly lose copy number following the removal of selection agents from the media. For example, Yoshikawa et al. found that randomly-integrated transgenes linked to a DHFR selectable marker amplified to greater than 10-fold higher copy numbers when the integration site was adjacent to a telomere (Yoshikawa et al. (2000), Biotechnol Prog. 16:710-715). These researchers also found that an amplified transgene integrated into a non-telomeric region will lose >50% of its copies in only 20 days following the removal of MTX from the growth media. None of the selectable genes identified in Table 1 is adjacent to a telomere in the mouse genome (www.ensembl.com) and the similarity in genome organization between mouse and CHO makes it likely that these genes are in non-telomeric regions in CHO as well (Xu et al. (2011), Nat. Biotechnol. 29:735-741). Thus, the prior art instructs that the loci identified in Table 1, including the DHFR and GS loci, are not preferred locations to target transgene insertion if the goal is efficient and stable gene amplification.
[0078] In addition, in the case of endogenous gene amplification, it is clear that chromosomal sequences outside of the selectable gene sequence play an important role in facilitating amplification and in defining the length of DNA sequence that is co-amplified with the gene under selection (Looney and Hamlin (1987), Mol. and Cell. Biol. 7:569-577). In particular, it has been shown that the sequence and location of the DNA replication origin in relation to the selectable gene plays a major role in amplification. For example, it has been shown that amplification of the endogenous CHO DHFR locus is dependent upon a pair of replication origins found in the region 5,000-60,000 base pairs downstream of the DHFR gene coding sequence (Anachkova and Hamlin (1989), Mol. and Cell. Biol. 9:532-540; Milbrandt et al. (1981), Proc. Natl. Acad. Sci. USA 78:6042-6047). Further, Brinton and Heintz have shown that these same replication origins fail to promote gene amplification when incorporated randomly into the genome with a transgenic DHFR sequence (Brinton and Heintz (1995), Chromosoma. 104:143-51). This clearly demonstrates the importance of maintaining both the sequence and proper chromosomal context of these replication origins to promote DHFR gene amplification. Thus the art instructs that the region downstream of DHFR is critical to gene amplification and should not be disrupted by, for example, inserting a transgenic gene expression cassette as described in the present invention.
[0079] Surprisingly, we have discovered that DNA sequences, including exogenous transcriptionally active sequences, which are inserted proximal to (e.g., within <100,000 base pairs) selectable genes in mammalian cell lines (e.g., CHO-K1) will co-amplify in the presence of appropriate compounds which select for amplification. Thus, the present invention provides methods for reliably and reproducibly producing isogenic cell lines in which transgenes encoding protein products of interest (e.g., biotherapeutic gene expression cassettes) can be amplified but in which it is not necessary to screen a large number of randomly generated cell lines to identify those which express high levels of the protein product of interest and are resistant to gene silencing.
[0080] In addition, we have surprisingly found that the mammalian cell lines of the invention, in which a sequence of interest is co-amplified with a selectable gene in an amplifiable locus, are stable with respect to expression of the sequence of interest and/or copy number of the sequence of interest even in the absence of continued selection. That is, whereas the art teaches that amplified sequences will be reduced in copy number over time if selection is not maintained (see, e.g., Yoshikawa et al. (2000), Biotechnol Prog. 16:710-715), we have found that cell lines produced according to the methods of the invention continue to produce the protein products of interest (encoded by the sequences of interest) at levels within 20%-25% of the initial levels, even 14 weeks after removal of the selection agent. This is significant, as noted above, because selection agents such as MTX and MSX are toxic, and it would be highly desirable to produce biotherapeutic proteins in cell lines which do not require continued exposure to such selection agents. Therefore, in some embodiments, the invention provides recombinant mammalian cell lines which continue to express a protein product of interest from an exogenous sequence of interest present in an amplified region of the genome (i.e., present in 2-1,000 copies, co-amplified with a selectable gene in an amplifiable locus) for a period of at least 8, 9, 10, 11, 12, 13, or 14 weeks after removal of the amplification selection agent, and with a reduction of expression levels and/or copy number of less than 20, 25, 30, 35 or 40%.
[0081] The present invention also provides the products necessary to practice the methods, and to target insertion of sequences of interest into amplifiable loci in mammalian cell lines. A common method for inserting or modifying a DNA sequence involves introducing a transgenic DNA sequence flanked by sequences homologous to the genomic target and selecting or screening for a successful homologous recombination event. Recombination with the transgenic DNA occurs rarely but can be stimulated by a double-stranded break in the genomic DNA at the target site (Porteus et al. (2005), Nat. Biotechnol. 23: 967-73; Tzfira et al. (2005), Trends Biotechnol. 23: 567-9; McDaniel et al. (2005), Curr. Opin. Biotechnol. 16: 476-83). Numerous methods have been employed to create DNA double-stranded breaks, including irradiation and chemical treatments. Although these methods efficiently stimulate recombination, the double-stranded breaks are randomly dispersed in the genome, which can be highly mutagenic and toxic. At present, the inability to target gene modifications to unique sites within a chromosomal background is a major impediment to routine genome engineering.
[0082] One approach to achieving this goal is stimulating homologous recombination at a double-stranded break in a target locus using a nuclease with specificity for a sequence that is sufficiently large to be present at only a single site within the genome (see, e.g., Porteus et al. (2005), Nat. Biotechnol. 23: 967-73). The effectiveness of this strategy has been demonstrated in a variety of organisms using ZFNs (Porteus (2006), Mol Ther 13: 438-46; Wright et al. (2005), Plant J. 44: 693-705; Urnov et al. (2005), Nature 435: 646-51). Homing endonucleases are a group of naturally-occurring nucleases which recognize 15-40 base-pair cleavage sites commonly found in the genomes of plants and fungi. They are frequently associated with parasitic DNA elements, such as Group I self-splicing introns and inteins. They naturally promote homologous recombination or gene insertion at specific locations in the host genome by producing a double-stranded break in the chromosome, which recruits the cellular DNA-repair machinery (Stoddard (2006), Q. Rev. Biophys. 38: 49-95). Homing endonucleases are commonly grouped into four families: the LAGLIDADG (SEQ ID NO: 65) family, the GIY-YIG family, the His-Cys box family and the HNH family. These families are characterized by structural motifs, which affect catalytic activity and recognition sequence. For instance, members of the LAGLIDADG (SEQ ID NO: 65) family are characterized by having either one or two copies of the conserved LAGLIDADG (SEQ ID NO: 65) motif (see Chevalier et al. (2001), Nucleic Acids Res. 29(18): 3757-3774). The LAGLIDADG (SEQ ID NO: 65) homing endonucleases with a single copy of the LAGLIDADG (SEQ ID NO: 65) motif form homodimers, whereas members with two copies of the LAGLIDADG (SEQ ID NO: 65) motif are found as monomers.
[0083] Natural homing endonucleases, primarily from the LAGLIDADG (SEQ ID NO: 65) family, have been used to effectively promote site-specific genome modification in plants, yeast, Drosophila, mammalian cells and mice, but this approach has been limited to the modification of either homologous genes that conserve the endonuclease recognition sequence (Monnat et al. (1999), Biochem. Biophys. Res. Commun. 255: 88-93) or to pre-engineered genomes into which a recognition sequence has been introduced (Rouet et al. (1994), Mol. Cell. Biol. 14: 8096-106; Chilton et al. (2003), Plant Physiol. 133: 956-65; Puchta et al. (1996), Proc. Natl. Acad. Sci. USA 93: 5055-60; Rong et al. (2002), Genes Dev. 16: 1568-81; Gouble et al. (2006), J. Gene Med. 8(5):616-622).
[0084] Systematic implementation of nuclease-stimulated gene modification requires the use of engineered enzymes with customized specificities to target DNA breaks to existing sites in a genome and, therefore, there has been great interest in adapting homing endonucleases to promote gene modifications at medically or biotechnologically relevant sites (Porteus et al. (2005), Nat. Biotechnol. 23: 967-73; Sussman et al. (2004), J. Mol. Biol. 342: 31-41; Epinat et al. (2003), Nucleic Acids Res. 31: 2952-62).
[0085] I-CreI (SEQ ID NO: 1) is a member of the LAGLIDADG (SEQ ID NO: 65) family of homing endonucleases which recognizes and cuts a 22 base pair recognition sequence in the chloroplast chromosome of the algae Chlamydomonas reinhardtii. Genetic selection techniques have been used to modify the wild-type I-CreI cleavage site preference (Sussman et al. (2004), J. Mol. Biol. 342: 31-41; Chames et al. (2005), Nucleic Acids Res. 33: e178; Seligman et al. (2002), Nucleic Acids Res. 30: 3870-9, Arnould et al. (2006), J. Mol. Biol. 355: 443-58). More recently, a method of rationally-designing mono-LAGLIDADG (SEQ ID NO: 65) homing endonucleases was described which is capable of comprehensively redesigning I-CreI and other homing endonucleases to target widely-divergent DNA sites, including sites in mammalian, yeast, plant, bacterial, and viral genomes (WO 2007/047859).
[0086] Thus, in one embodiment, the invention provides engineered meganucleases derived from the amino acid sequence of I-CreI that recognize and cut DNA sites in amplifiable regions of mammalian genomes. These engineered meganucleases can be used in accordance with the invention to target the insertion of gene expression cassettes into defined locations in the chromosomal DNA of cell lines such as CHO cells. This invention will greatly streamline the production of desired cell lines by reducing the number of lines that must be screened to identify a "high-producer" clone suitable for commercial-scale production of a therapeutic glycoprotein.
[0087] The present invention involves targeting transgenic DNA "sequences of interest" to amplifiable loci. The amplifiable loci are regions of the chromosomal DNA that contain selectable genes that become amplified in the presence of selection agents (e.g., drugs). For example, the Chinese Hamster Ovary (CHO) cell DHFR locus can be amplified to .about.1,000 copies by growing the cells in the presence of methotrexate (MTX), a DHFR inhibitor. Table 1 lists additional examples of selectable genes that can be amplified using small molecule drugs (Kellems, ed. Gene amplification in mammalian cells: a comprehensive guide. Marcel Dekker, New York, 1993; Omasa (2002), J. Biosci. Bioeng. 94:6 600-605).
TABLE-US-00001 TABLE 1 Amplifiable Genes Selectable Gene Name Amplified With Dihydrofolate Reductase Methotrexate (MTX) Glutamine Synthetase Methionine sulphoximine (MSX) Hypoxanthine Phosphoribosyl- Aminopterin, hypoxanthine, and thymidine transferase Threonyl tRNA Synthetase Borrelidin Na,K-ATPase Ouabain Asparagine Synthetase Albizziin or Beta-aspartyl hydroxamate Ornithine Decarboxylase alpha-difluoromethylornithine (DFMO) Inosine-5'-monophosphate Mycophenolic Acid dehydrogenase Adenosine Deaminase Adenosine, Alanosine, 2'deoxycoformycin Thymidylate Synthetase Fluorouracil Aspartate Transcarbamylase N-Phosphonacetyl-L-Aspartate (PALA) Metallothionein Cadmium Adenylate Deaminase (1, 2) Adenine, Azaserine, Coformycin UMP-Synthetase 6-azauridine, pyrazofuran Ribonucleotide Reductase hydroxyurea, motexafin gadolinium, fludarabine, cladribine, gemcitabine, tezacitabine, triapine.
[0088] Several considerations must be taken into account when selecting a specific target site for the insertion of a sequence of interest within an amplifiable locus. First, the selected insertion site must be co-amplified with the gene under selection. In many cases, experimental data already exists in the art which delimits the amount of flanking chromosomal sequence that co-amplifies with a selectable gene of interest. This data, which precisely defines the extent of the amplifiable locus, exists for CHO DHFR (Ma et al. (1988), Mol Cell Biol. 8(6):2316-27), human DHFR (Morales et al. (2009), Mol Cancer Ther. 8(2):424-432), and CHO GS (Sanders et al. (1987), Dev Biol Stand. 66:55-63). Where such data does not already exist in the art, we predict that chromosomal DNA sequences <100,000 base pairs upstream or downstream of the selectable gene coding sequence are likely to co-amplify. Hence, these regions could be suitable sites for targeting the insertion of a sequence of interest.
[0089] Second, target sites should be selected which will not greatly impact the function of the selectable gene (e.g., the endogenous DHFR, GS, or HPRT gene). Because amplification requires a functional copy of the selectable gene, insertion sites within the promoter, exons, introns, polyadenylation signals, or other regulatory sequences that, if disrupted, would greatly impact transcription or translation of the selectable gene, should be avoided. For example, WO 2008/059317 discloses meganucleases which cleave DNA target sites within the HPRT gene. To the extent WO 2008/059317 discloses the insertion of genes into the HPRT locus, it teaches that the HPRT gene coding sequence should be disrupted in the process of transgene insertion to facilitate selection for proper targeting using 6-thioguanine. 6-thioguanine is a toxic nucleotide analog that kills cells having functional HPRT activity. Because cells produced in accordance with WO 2008/059317 will not have HPRT activity, they will not amplify an inserted transgene in response to treatment with an HPRT inhibitor and, so, cannot be used in the present invention. For the present invention, unless the precise limits of all regulatory sequences are already known for a particular selectable gene, insertion sites >1,000 base pairs, >2,000 base pairs, >3,000 base pairs, >4,000 base pairs, or, preferably, >5,000 base pairs, upstream or downstream of the gene coding sequence should be selected. However, if the location of the regulatory sequences are known, the sequence of interest can be inserted immediately adjacent to the either the most 5' or 3' regulatory sequence (e.g., immediately 3' to the polyadenylation signal).
[0090] Lastly, target sites should be selected which do not disrupt other chromosomal genes which may be important for normal cell physiology. In general, gene insertion sites should be >1,000 base pairs, >2,000 base pairs, >3,000 base pairs, >4,000 base pairs, or, preferably, >5,000 base pairs, away from any gene coding sequence.
[0091] Various methods of the invention are described schematically in the figures as follows:
[0092] FIG. 1 depicts a general strategy for targeting a sequence of interest to an amplifiable locus. In the first step, a site-specific endonuclease introduces a double-stranded break in the chromosomal DNA of a cell at a site that is proximal to an endogenous selectable gene. The cleaved chromosomal DNA then undergoes homologous recombination with a donor DNA molecule comprising a sequence of interest flanked by DNA sequences homologous to sequences flanking the endonuclease recognition sequence in the target site. As a result, the sequence of interest is inserted into the chromosomal DNA of the cell adjacent to the endogenous selectable gene. The modified cell is then grown in the presence of one or more compounds that inhibit the function of the selectable gene to induce an increase in the copy number (i.e., amplification) of the selectable gene. The sequence of interest, which is genetically linked to the selectable gene, will co-amplify with the selectable gene. The result is a stable transgenic cell line comprising multiple copies of the sequence of interest.
[0093] FIG. 2(A) depicts a schematic of the CHO DHFR locus showing a preferred region for targeting a sequence of interest 5,000-60,000 base pairs downstream of the DHFR gene. FIG. 2(B) depicts a schematic of the CHO GS locus showing a preferred region for targeting a sequence of interest 5,000-55,000 base pairs downstream of the GS gene. Promoters are shown as arrows. Exons are shown as rectangles, with non-coding exons in white and protein coding exons in gray.
[0094] FIG. 3 depicts a strategy for inserting a sequence of interest into an amplifiable locus in a two-step process involving a pre-integrated target sequence. In the first step, the chromosomal DNA of a cell is cleaved by a site-specific endonuclease at a site that is proximal to a selectable gene. The cleaved chromosomal DNA then undergoes homologous recombination with a donor DNA molecule comprising an exogenous target sequence flanked by DNA sequences homologous to the sequences flanking the endogenous target site. This results in the insertion of the new engineered target sequence into the chromosomal DNA of the cell proximal to the selectable gene. A sequence of interest can subsequently be targeted proximal to the same selectable gene using a nuclease, integrase, transposase, or recombinase that specifically recognizes the pre-integrated engineered target sequence. The modified cell is then grown in the presence of one or more compounds that co-amplify the selectable gene and the sequence of interest.
[0095] FIG. 4 depicts a strategy for inserting an engineered target sequence into a selectable gene (e.g., DHFR) with concomitant removal of a portion of the selectable gene. A site-specific endonuclease is first used to cleave the chromosomal DNA of the cell proximal to or within the selectable gene sequence. As shown in the figure, the endogenous target site is between exons 2 and 3 of the CHO DHFR gene (although the target site could be within any intron or exon, and the selectable gene could be any gene subject to amplification). The chromosomal DNA then undergoes homologous recombination with a first donor DNA ("donor DNA #1") such that the sequence of the first donor DNA is inserted into the chromosomal DNA of the cell. As shown in the figure, this results in the replacement of the promoter and first two exons of DHFR by the new engineered target sequence (although the first donor DNA could replace more or less of the chromosomal DNA, such as only a portion of one exon). If such a replacement is made to all DHFR alleles in a cell, the resultant cell line is DHFR (-/-). A sequence of interest can subsequently be targeted proximal to the selectable gene in the cell line using an endonuclease, integrase, transposase, or recombinase that recognizes the engineered target sequence. As shown in the figure, the second donor DNA ("donor DNA #2") comprises a sequence of interest as well as a promoter and the first two exons of DHFR. Proper targeting of this second donor DNA molecule results in the insertion of the sequence of interest at the engineered target sequence while simultaneously reconstituting a functional DHFR gene. Thus, properly targeted cell lines will be DHFR+ and can be selected using media deficient in hypoxanthine/thymidine. In addition, the sequence of interest can be co-amplified with the DHFR gene using MTX selection. The strategy diagrammed here for DHFR can be applied to any selectable gene in an amplifiable locus.
[0096] FIG. 5 depicts a strategy for inserting an engineered target sequence into an amplifiable locus with concomitant disruption of the coding sequence of a selectable gene. A site-specific endonuclease is first used to cleave the chromosomal DNA of the cell within the selectable gene coding sequence. As shown in the figure, the endogenous target site is in the third exon of the CHO GS gene. The chromosomal DNA then undergoes homologous recombination with a first donor DNA ("donor DNA #1") such that the sequence of the first donor DNA is inserted into the chromosomal DNA of the cell. This results in the insertion of a new engineered target sequence into the GS coding sequence. If such an insertion occurs in both alleles of the GS gene and results in a frameshift mutation or otherwise disrupts the function of the GS gene, the resultant cell line will be GS (-/-). A sequence of interest can subsequently be targeted proximal to the amplifiable locus in the cell line using an endonuclease, integrase, transposase, or recombinase that recognizes the engineered target sequence. As shown in the figure, a second donor DNA ("donor DNA #2") comprises a sequence of interest operably linked to a promoter as well as the 3' portion of the GS coding sequence comprising exons 3, 4, 5, and 6. (The figure shows exons 3, 4, 5, and 6 joined into a single nucleotide sequence (i.e., with introns removed), but a sequence including either the naturally-occurring introns or one or more artificial introns could also be employed). Proper targeting of the second donor DNA molecule results in the insertion of the sequence of interest at the engineered target sequence while simultaneously reconstituting a functional GS gene. Thus, properly targeted cell lines will be GS+ and can be selected using media deficient in L-glutamine. In addition, the sequence of interest can be co-amplified with the GS gene using MSX selection. The strategy diagrammed here for GS can be applied to any selectable gene in an amplifiable locus.
[0097] FIG. 6 depicts a strategy for inserting an engineered target sequence into an amplifiable locus with concomitant disruption of the mRNA processing of a selectable gene. A site-specific endonuclease is first used to cleave the chromosomal DNA of the cell within an intron in the selectable gene. As drawn, the endogenous target site is in the intron between the third and fourth coding exons of the CHO GS gene. The chromosomal DNA then undergoes homologous recombination with a donor DNA #1 such that the sequence of the donor DNA is inserted in the chromosomal DNA of the cell. This results in the insertion of a new engineered target sequence into the GS coding sequence with an additional sequence that causes the GS mRNA to be processed incorrectly. As drawn, this additional sequence comprises a strong splice acceptor. If such an insertion occurs in both alleles of the GS gene, the artificial splice acceptor will cause the GS mRNA to splice incorrectly, resulting in a loss of GS expression and a requirement for growth in media containing L-glutamine. A sequence of interest can subsequently be targeted to the amplifiable locus in the cell line using an endonuclease, integrase, transposase, or recombinase that recognizes the engineered target sequence. As diagrammed, donor DNA #2 comprises a sequence of interest operably linked to a promoter as well as the 3' portion of the GS coding sequence comprising exons 4, 5, and 6 joined into a single nucleotide sequence. (The figure shows exons 4, 5, and 6 joined into a single nucleotide sequence (i.e., with introns removed), but a sequence including either the naturally-occurring introns or one or more artificial introns could also be employed). Proper targeting of this donor DNA #2 molecule results in the insertion of the sequence of interest at the engineered target sequence while simultaneously reconstituting a functional GS gene. Thus, properly targeted cell lines will be GS+ and can be selected using media deficient in L-glutamine and the sequence of interest can be co-amplified with the GS gene using MSX selection. The strategy diagrammed here for GS can be applied to any selectable gene in an amplifiable locus.
[0098] FIG. 7(A) depicts a direct-repeat recombination assay for site-specific endonuclease activity. A reporter plasmid is produced comprising the 5' two-thirds of the GFP gene ("GF"), followed by an endonuclease recognition sequence, followed by the 3' two-thirds of the GFP gene ("FP"). Mammalian cells are transfected with this reporter plasmid as well as a gene encoding an endonuclease. Cleavage of the recognition sequence by the endonuclease stimulates homologous recombination between direct repeats of the GFP gene to restore GFP function. GFP+ cells can then be counted and/or sorted on a flow cytometer.
[0099] FIG. 7(B) depicts the results of the assay of FIG. 7(A) as applied to the CHO-23/24 and CHO-51/52 meganucleases. Light bars indicate the percentage of GFP+ cells when cells are transfected with the reporter plasmid alone (-endonuclease). Dark bars indicate the percentage of GFP+ cells when cells are co-transfected with a reporter plasmid and the corresponding meganuclease gene (+endonuclease). The assay was performed in triplicate and the standard deviation is shown.
[0100] FIG. 7(C) depicts alignment of sequences obtained from CHO cells transfected with mRNA encoding the CHO-23/24 meganuclease. The top sequence is from a wild-type (WT) CHO cell with the recognition sequence for CHO-23/24 underlined.
[0101] FIG. 7(D) depicts alignment of sequences obtained from CHO cells transfected with mRNA encoding the CHO-51/52 meganuclease. The top sequence is from a wild-type (WT) CHO cell with the recognition sequence for CHO-51/52 underlined.
[0102] FIG. 8(A) depicts a strategy for inserting an exogenous DNA sequence into the CHO DHFR locus using the CHO-51/52 meganuclease. CHO cells were co-transfected with mRNA encoding CHO-51/52 and a donor plasmid comprising an EcoRI site flanked by 543 base pairs of DNA sequence homologous to the region upstream of the CHO-51/52 recognition site and 461 base pairs of DNA sequence homologous to the region downstream of the CHO-51/52 recognition site. 48 hours post-transfection, genomic DNA was isolated and subjected to PCR using primers specific for the downstream region of the DHFR locus (dashed arrows).
[0103] FIG. 8(B) depicts PCR products that were cloned into pUC-19 and 48 individual plasmid clones and were digested with EcoRI and visualized on an agarose gel. 10 plasmids (numbered lanes) yielded a 647 base pair restriction fragment, consistent with cleavage of a first EcoRI site within the pUC-19 vector and a second EcoRI site in the cloned PCR fragment. These 10 plasmids were sequenced to confirm that they harbor a PCR fragment comprising a portion of the downstream DHFR locus with an EcoRI restriction site inserted into the CHO-51/52 recognition sequence. This restriction pattern was not observed when CHO cells were transfected with the donor plasmid alone.
[0104] FIG. 9(A) depicts a strategy for inserting an engineered target sequence into the CHO DHFR locus using the CHO-23/24 meganuclease. CHO cells were co-transfected with mRNA encoding CHO-23/24 and a donor plasmid comprising, in 5' to 3' orientation, an SV40 promoter, an ATG start codon, an FRT site, and a Zeocin-resistance (Zeo) gene. Zeocin-resistant cells were cloned by limiting dilution and screened by PCR to identify a clonal cell line in which the donor plasmid sequence integrated into the CHO-23/24 recognition site. After expansion, this cell line was co-transfected with a first plasmid encoding Flp recombinase operably linked to a promoter and second plasmid (donor plasmid #2) comprising a GFP gene under the control of a CMV promoter, an FRT site, and a hygromycin-resistance (Hyg) gene lacking a start codon. Flp-mediated recombination between FRT sites resulted in the integration of the donor plasmid #2 sequence into the engineered target sequence (i.e., the FRT site) such that a functional Hyg gene expression cassette was produced. FIG. 9(B) depicts PCR products from hygromycin-resistant clones produced as in (A) that were cloned by limiting dilution. Genomic DNA was extracted from 24 individual clones and PCR amplified using a first primer in the DHFR locus and a second primer in the Hyg gene (dashed lines). All 24 clones yielded a PCR product consistent with Hyg gene insertion into the engineered target sequence. FIG. 9(C) depicts GFP expression by the 24 clones produced in (B) using flow cytometry. All clones were found to express high levels of GFP with relatively little clone-to-clone variability.
[0105] FIG. 10. A GFP-expressing CHO line was produced by integrating a GFP gene expression cassette into the DHFR locus using an engineered target sequence strategy as shown in FIG. 9. This cell line was then grown in MTX as described in Example 2 to amplify the integrated GFP gene. (A) Flow cytometry plots showing GFP intensity on the Y-axis. In the pre-MTX cell line, GFP intensity averages approximately 2.times.10.sup.3 whereas in the cell line grown in 250 nM MTX, a distinct sub-population is visible (circled) in which GFP intensity approaches 10.sup.4. (B) MTX treated cell lines were sorted by FACS to identify individual cells expressing higher amounts of GFP. Five such high-expression cells were expanded and GFP intensity was determined by flow cytometry. All five clones were found to have significantly increased GFP expression relative to the pre-MTX cell line. (C) Genomic DNA was isolated from the five clonal cell lines produced in (B) and subjected to quantitative PCR using a primer pair specific for the GFP gene. It was found that the five high-expression clones had significantly more copies of the GFP gene than the pre-MTX cell line. These results demonstrate that the copy number and expression level a transgene integrated downstream of CHO DHFR can amplify in response to MTX treatment.
[0106] FIG. 11. (A) A direct-repeat recombination assay, as in FIG. 5A. (B) The assay in (A) applied to the CHO-13/14 and CGS-5/6 meganucleases. Light bars indicate the percentage of GFP+ cells when cells are transfected with the reporter plasmid alone (-endonuclease). Dark bars indicate the percentage of GFP+ cells when cells are co-transfected with a reporter plasmid and the corresponding meganuclease gene (+endonuclease). The assay was performed in triplicate and standard deviation is shown. (C) Alignment of sequences obtained from CHO cells transfected with mRNA encoding the CGS-5/6 meganuclease. The top sequence is from a wild-type (WT) CHO cell with the recognition sequence for CGS-5/6 underlined. Dashes indicate deleted bases. Bases that are italicized and in bold are point mutations or insertions relative to the wild-type sequence. Note that the mutations observed in at least clones 6d4, 6g5, 3b7, 3d11, 3e5, 6f10, 6hH8, 6d10, 6d7, 3g8, and 3a9 are expected to knockout GS gene function.
2.1.1 Gene Targeting to the CHO DHFR Locus
[0107] The CHO DHFR locus is diagrammed in FIG. 2A. The locus comprises the DHFR gene coding sequence in 6 exons spanning .about.24,500 base pairs. The Msh3 gene is located immediately upstream of DHFR and is transcribed divergently from the same promoter as DHFR. A hypothetical gene, 2BE2121, can be found .about.65,000 base pairs downstream of the DHFR coding sequence. Thus, there is a .about.65,000 base pair region downstream of the DHFR gene that does not harbor any known genes and is a suitable location for targeting the insertion of sequences of interest. Target sites for insertion of a sequence of interest generally should not be selected which are <1,000 base pairs, and preferably not <5,000 base pairs from either the DHFR or 2BE2121 genes. This limits the window of preferred target sites to the region 1,000-60,000 base pairs, or 5,000-60,000 base pairs downstream of the DHFR coding sequence. The sequence of this region is provided as SEQ ID NO: 2.
[0108] The human and mouse DHFR loci have an organization similar to CHO locus. In both cases, the Msh3 gene is immediately upstream of DHFR but there is a large area devoid of coding sequences downstream of DHFR. In humans, the ANKRD34B gene is .about.55,000 base pairs downstream of DHFR while the ANKRD34B gene is .about.37,000 base pairs downstream of DHFR in mouse. Therefore, the genomic region downstream of DHFR is an appropriate location to insert genes of interest in CHO, human, and mouse cells and cell lines. Further, gene expression cassettes inserted into this region will be expressed at a high level, resistant to gene silencing, and capable of being amplified by treatment with MTX. Methods for amplifying the CHO cell DHFR locus are known in the art (see, e.g., Kellems, ed., Gene amplification in mammalian cells: a comprehensive guide. Marcel Dekker, New York, 1993) and typically involve gradually increasing the concentration of MTX in the growth media from 0 to as high as 0.8 mM over a period of several weeks.
2.1.2 Gene Targeting to the GS Locus
[0109] The CHO, human, and mouse glutamine synthetase (also known as "glutamate-ammonia ligase" or "GluL") loci share a common organization (FIG. 2B). The TEDDM1 gene is immediately upstream of GS in all species (.about.5,000 bp upstream in the case of human, .about.7,000 bp upstream in the case of mouse and CHO). The closest downstream gene, however, is .about.46,000 away in the case of human and .about.117,000 bp away in the case of mouse and CHO. Therefore, we predict that the chromosomal region 1,000-41,000 bp, or 5,000-41,000 bp downstream of GS in human cells and 1,000-100,000 bp, or 5,000-100,000 bp downstream of GS in mouse and CHO cells are appropriate locations to target the insertion of sequences of interest. Because DNA sites distal to the GS coding sequence are more likely to be susceptible to gene silencing, the chromosomal region 5,000-60,000 bp downstream of GS is a preferred location to target the insertion of a sequence of interest even in mouse or CHO cells. The sequence of this region from the CHO genome is provided as SEQ ID NO: 3. Gene expression cassettes inserted into this region will be expressed at a high level, resistant to gene silencing, and capable of being amplified by treatment with MSX. Less-preferred regions include the chromosomal region between the TEDDM1 and GS genes or the region <10,000 bp downstream of TEDDM1 (see FIG. 2B). Methods for amplifying the GS locus are known in the art (Bebbington et al. (1992), Biotechnology (N Y). 10(2):169-75).
2.2 Engineered Endonucleases for Gene Targeting
[0110] A sequence of interest may be inserted into an amplifiable locus using an engineered site-specific endonuclease. Methods for generating site-specific endonucleases which can target DNA breaks to pre-determined loci in a genome are known in the art. These include zinc-finger nucleases (Le Provost et al. (2010), Trends Biotechnol. 28(3):134-41), TAL-effector nucleases (Li et al. (2011), Nucleic Acids Res. 39(1):359-72), and engineered meganucleases (WO 2007/047859; WO 2007/049156; WO 2009/059195). In one embodiment, the invention provides engineered meganucleases derived from I-CreI that can be used to target the insertion of a gene of interest to an amplifiable locus. Methods to produce such engineered meganucleases are known in the art (see, e.g., WO 2007/047859; WO 2007/049156; WO 2009/059195). In preferred embodiments, a "single-chain" meganuclease is used to target gene insertion to an amplifiable region of the genome. Methods for producing such "single-chain" meganucleases are known in the art (see, e.g., WO 2009/059195 and WO 2009/095742). In some embodiments, the engineered nuclease is fused to a nuclear localization signal (NLS) to facilitate nuclear uptake. Examples of nuclear localization signals include the SV40 NLS (amino acid sequence MAPKKKRKV (SEQ ID NO: 36)) which can be fused to the C- or, preferably, the N-terminus of the protein. In addition, an engineered nuclease may be tagged with a peptide epitope (e.g., an HA, FLAG, or Myc epitope) to monitor expression levels or localization or to facilitate purification.
2.3 Engineered Cell Lines with Sequences of Interest Targeted to Amplifiable Loci
[0111] In some embodiments, the invention provides methods for using engineered nucleases to target the insertion of transgenes into amplifiable loci in cultured mammalian cells. This method has two primary components: (1) an engineered nuclease; and (2) a donor DNA molecule comprising a sequence of interest. The method comprises contacting the DNA of the cell with the engineered nuclease to create a double strand DNA break in an endogenous recognition sequence in an amplifiable locus followed by the insertion of the donor DNA molecule at the site of the DNA break. Such insertion of the donor DNA is facilitated by the cellular DNA-repair machinery and can occur by either the non-homologous end-joining pathway or by homologous recombination (FIG. 1).
[0112] The engineered nuclease can be delivered to the cell in the form protein or, preferably, as a nucleic acid encoding the engineered nuclease. Such nucleic acid can be DNA (e.g., circular or linearized plasmid DNA or PCR products) or RNA. For embodiments in which the engineered nuclease coding sequence is delivered in DNA form, it should be operably linked to a promoter to facilitate transcription of the engineered nuclease gene. Mammalian promoters suitable for the invention include constitutive promoters such as the cytomegalovirus early (CMV) promoter (Thomsen et al. (1984), Proc Natl Acad Sci USA. 81(3):659-63) or the SV40 early promoter (Benoist and Chambon (1981), Nature. 290(5804):304-10) as well as inducible promoters such as the tetracycline-inducible promoter (Dingermann et al. (1992), Mol Cell Biol. 12(9):4038-45).
[0113] In some embodiments, mRNA encoding the engineered nuclease is delivered to the cell because this reduces the likelihood that the gene encoding the engineered nuclease will integrate into the genome of the cell. Such mRNA encoding an engineered nuclease can be produced using methods known in the art such as in vitro transcription. In some embodiments, the mRNA is capped using 7-methyl-guanosine. In some embodiments, the mRNA may be polyadenylated.
[0114] Purified engineered nuclease proteins can be delivered into cells to cleave genomic DNA, which allows for homologous recombination or non-homologous end-joining at the cleavage site with a sequence of interest, by a variety of different mechanisms known in the art. For example, the recombinant nuclease protein can be introduced into a cell by techniques including, but not limited to, microinjection or liposome transfections (see, e.g., Lipofectamine.TM., Invitrogen Corp., Carlsbad, Calif.). The liposome formulation can be used to facilitate lipid bilayer fusion with a target cell, thereby allowing the contents of the liposome or proteins associated with its surface to be brought into the cell. Alternatively, the enzyme can be fused to an appropriate uptake peptide such as that from the HIV TAT protein to direct cellular uptake (see, e.g., Hudecz et al. (2005), Med. Res. Rev. 25: 679-736).
[0115] Alternatively, gene sequences encoding the engineered nuclease protein are inserted into a vector and transfected into a eukaryotic cell using techniques known in the art (see, e.g., Ausubel et al., Current Protocols in Molecular Biology, Wiley 1999). The sequence of interest can be introduced in the same vector, a different vector, or by other means known in the art. Non-limiting examples of vectors for DNA transfection include virus vectors, plasmids, cosmids, and YAC vectors. Transfection of DNA sequences can be accomplished by a variety of methods known to those of skill in the art. For instance, liposomes and immunoliposomes are used to deliver DNA sequences to cells (see, e.g., Lasic et al. (1995), Science 267: 1275-76). In addition, viruses can be utilized to introduce vectors into cells (see, e.g., U.S. Pat. No. 7,037,492). Alternatively, transfection strategies can be utilized such that the vectors are introduced as naked DNA (see, e.g., Rui et al. (2002), Life Sci. 71(15): 1771-8).
[0116] General methods for delivering nucleic acids into cells include: (1) chemical methods (Graham et al. (1973), Virology 54(2):536-539; Zatloukal et al. (1992), Ann. N.Y. Acad. Sci., 660:136-153; (2) physical methods such as microinjection (Capecchi (1980), Cell 22(2):479-488, electroporation (Wong et al. (1982), Biochim. Biophys. Res. Commun. 107(2):584-587; Fromm et al. (1985), Proc. Nat'l Acad. Sci. USA 82(17):5824-5828; U.S. Pat. No. 5,384,253) and ballistic injection (Johnston et al. (1994), Methods Cell. Biol. 43(A): 353-365; Fynan et al. (1993), Proc. Nat'l Acad. Sci. USA 90(24): 11478-11482); (3) viral vectors (Clapp (1993), Clin. Perinatol. 20(1): 155-168; Lu et al. (1993), J. Exp. Med. 178(6):2089-2096; Eglitis et al. (1988), Avd. Exp. Med. Biol. 241:19-27; Eglitis et al. (1988), Biotechniques 6(7):608-614); and (4) receptor-mediated mechanisms (Curiel et al. (1991), Proc. Nat'l Acad. Sci. USA 88(19):8850-8854; Curiel et al. (1992), Hum. Gen. Ther. 3(2):147-154; Wagner et al. (1992), Proc. Nat'l Acad. Sci. USA 89 (13):6099-6103). In some preferred embodiments, 7-methyl-guanosine capped mRNA encoding the engineered nuclease is delivered to cells using electroporation.
[0117] The donor DNA molecule comprises a gene of interest operably linked to a promoter. In many cases, a donor molecule may comprise multiple genes operably linked to the same or different promoters. For example, donor molecules comprising monoclonal antibody expression cassettes may comprise a gene encoding the antibody heavy chain and a second gene encoding the antibody light chain. Both genes may be under the control of different promoters or they may be under the control of the same promoter by using, for example, an internal-ribosome entry site (IRES). Donor molecules may also comprise a selectable marker gene operably linked to a promoter to facilitate the identification of transgenic cells. Such selectable markers are known in the art and include neomycin phosphotransferase (NEO), hypoxanthine phosphoribosyltransferase (HPRT), glutamine synthetase (GS), dihydrofolate reductase (DHFR), and hygromycin phosphotransferase (HYG) genes.
[0118] In some embodiments, donor DNA molecules will additionally comprise flanking sequences homologous to the target sequences in the DNA of the cell. Such homologous flanking sequences comprise >3 or, preferably, >50 or, more preferably, >200 or, most preferably, >400 base pairs of DNA that are identical or nearly identical in sequence to the chromosomal locus recognized by the engineered nuclease (FIG. 1). Such homologous DNA sequences facilitate the integration of the donor DNA sequence into the amplifiable locus by homologous recombination.
[0119] The "donor" DNA molecule can be circular (e.g., plasmid DNA) or linear (e.g., linearized plasmid or PCR products). Methods for delivering DNA molecules are known in the art, as discussed above.
[0120] In some embodiments, the engineered nuclease gene and donor DNA are carried on separate nucleic acid molecules which are co-transfected into cells or cell lines. For example, the engineered nuclease gene operably linked to a promoter can be transfected in plasmid form simultaneously with a separate donor DNA molecule in plasmid or PCR product form. In an alternative embodiment, the engineered nuclease can be delivered in mRNA form with a separate donor DNA molecule in plasmid or PCR product form. In a third embodiment, the engineered nuclease gene and donor DNA are carried on the same DNA molecule, such as a plasmid. In a fourth embodiment, cells are co-transfected with purified engineered nuclease protein and a donor DNA molecule in plasmid or PCR product form.
[0121] Following transfection with the engineered nuclease and donor DNA, cells are typically allowed to recover from transfection (24-72 hours) before being cloned using methods known in the art. Common methods for cloning a genetically engineered cell line include "limiting dilution" in which transfected cells are transferred to tissue culture plates (e.g., 48 well, 96 well plates) at a concentration of <1 cell per well and expanded into clonal populations. Other cloning strategies include robotic clone identification/isolation systems such as ClonePix.TM. (Genetix, Molecular Devices, Inc., Sunnyvale, Calif.). Clonal cell lines can then be screened to identify cell lines in which the sequence of interest is integrated into the intended target site. Cell lines can easily be screened using molecular analyses known in the art such as PCR or Southern Blot. For example, genomic DNA can be isolated from a clonal cell line and subjected to PCR amplification using a first (sense-strand) primer that anneals to a DNA sequence in the sequence of interest and a second (anti-sense strand) primer that anneals to a sequence in the amplifiable locus. If the donor DNA molecule comprises a DNA sequence homologous to the target site, it is important that the second primer is designed to anneal to a sequence in the amplifiable locus that is beyond the limits of homology carried on the donor molecule to avoid false positive results. Alternatively, cell lines can be screened for expression of the sequence of interest. For example, if the sequence of interest encodes a secreted protein such as an antibody, the growth media can be sampled from isolated clonal cell lines and assayed for the presence of antibody protein using methods known in the art such as Western Blot or Enzyme-Linked Immunosorbant Assay (ELISA). This type of functional screen can be used to identify clonal cell lines which carry at least one copy of the sequence of interest integrated into the genome. Additional molecular analyses such as PCR or Southern blot can then be used to determine which of these transgenic cell lines carry the sequence of interest targeted to the amplifiable locus of interest, as described above.
[0122] The method of the invention can be used on any culturable and transfectable cell type such as immortalized cell lines and stem cells. In preferred embodiments, the method of the invention is used to genetically modify immortalized cell lines that are commonly used for biomanufacturing. This includes:
[0123] 1. Hamster cell lines such as baby hamster kidney (BHK) cells and all variants of Chinese Hamster Ovary (CHO) cells, e.g., CHO-K1, CHO-S (Invitrogen Corp., Carlsbad, Calif.), DG44, or Potelligent.TM. (Lonza Group Ltd., Basel, Switzerland). Because the genome sequences of different hamster cell lines are very nearly identical, an engineered meganuclease which can be used to practice the invention in one hamster cell type (e.g., BHK cells) can generally be used to practice the invention in another hamster cell type (e.g., CHO-K1).
[0124] 2. Mouse cell lines such as mouse hybridoma or mouse myeloma (e.g., NS0) cells. Because the genome sequences of different mouse cell lines are very nearly identical, an engineered meganuclease which can be used to practice the invention in one mouse cell type (e.g., mouse hybridoma cells) can generally be used to practice the invention in another mouse cell type (e.g., NS0).
[0125] 3. Human cell lines such as human embryonic kidney cells (e.g., HEK-293 or 293S) and human retinal cells (e.g., PER.C6). Because the genome sequences of different human cell lines are very nearly identical, an engineered meganuclease which can be used to practice the invention in one human cell type (e.g., HEK-293 cells) can generally be used to practice the invention in another human cell type (e.g., PER.C6).
2.6 Pre-Engineered Cell Lines with Engineered Target Sequences in Amplifiable Loci
[0126] In one embodiment, the invention provides cell lines which are pre-engineered to comprise a targetable "engineered target sequence" for gene insertion in an amplifiable locus in a mammalian cell line (FIG. 3). An engineered target sequence comprises a recognition sequence for an enzyme which is useful for inserting transgenic nucleic acids into chromosomal DNA sequences. Such engineered target sequences can include recognition sequences for engineered meganucleases derived from I-CreI (e.g., SEQ ID NO 37-87 from WO 2009/076292), recognition sequences for zinc-finger nucleases, recognition sequences for TAL effector nucleases (TALENs), the LoxP site (SEQ ID NO 4) which is recognized by Cre recombinase, the FRT site (SEQ ID NO: 5) which is recognized by FLP recombinase, the attB site (SEQ ID NO: 6) which is recognized by lambda recombinase, or any other DNA sequence known in the art that is recognized by a site specific endonuclease, recombinase, integrase, or transpose that is useful for targeting the insertion of nucleic acids into a genome. Thus, the invention allows one skilled in the art to use an engineered nuclease (e.g., a meganuclease, zinc-finger nuclease, or TAL effector nuclease) to insert an engineered target sequence into an amplifiable locus in a mammalian cell line. The resulting cell line comprising such an engineered target sequence at an amplifiable locus can then be contacted with the appropriate enzyme (e.g., a second engineered meganuclease, a second zinc-finger nuclease, a second TAL effector nuclease, a recombinase, an integrase, or a transposase) to target the insertion of a gene of interest into the amplifiable locus at the engineered target sequence. This two-step approach can be advantageous because the efficiency of gene insertion that can be achieved using an optimal meganuclease, zinc-finger nuclease, recombinase, integrase, or transposase might be higher than what can be achieved using the initial endonuclease (e.g., meganuclease or zinc-finger nuclease) that cleaves the endogenous target site to promote insertion of the engineered target sequence.
[0127] In an alternative embodiment, a cell line is produced by inserting an engineered target sequence into an amplifiable locus with the concomitant removal of all or a portion of the adjacent endogenous marker gene (FIG. 4). For example, an engineered meganuclease, zinc-finger nuclease, or TAL-effector nuclease can be used to remove the first two exons of both alleles of the CHO DHFR gene and replace them with an engineered target sequence for a different engineered meganuclease, ZFN, TALEN, recombinase, integrase, or transposase. The resulting cell line will be DHFR deficient and unable to grow in the absence of hypoxanthine/thymidine. Alternatively, for example, an engineered meganuclease, ZFN or TALEN can be used to remove the first exon of both alleles of the CHO GS gene and replace it with an engineered target sequence for a different engineered meganuclease, ZFN, TALEN, recombinase, integrase, or transposase (FIG. 4). The resulting cell line will be GS deficient and unable to grow in the absence of L-glutamine. Such a cell line is useful because a gene of interest can be inserted into the engineered target sequence in the pre-engineered cell line while simultaneously reconstituting the selectable gene (e.g., DHFR or GS). Thus, it is possible to select for transfectants harboring the gene of interest at the amplifiable locus using media conditions that select for DHFR+ or GS+ cells.
[0128] In an alternative embodiment, a cell line is produced in which an engineered target sequence is inserted into an amplifiable locus with disruption of the selectable gene (FIGS. 5, 6). This can be accomplished, for example, using a meganuclease which recognizes a DNA site in the coding sequence of the selectable gene. Such a meganuclease can be used to target the insertion of an engineered target sequence into the selectable gene coding sequence resulting in disruption of gene function by, for example, introducing a frameshift (FIG. 5). Alternatively, for example, an engineered target sequence can be inserted into an intron in the selectable gene sequence with an additional sequence that promotes improper processing of the selectable gene transcript (FIG. 6). Such sequences that promote improper processing include, for example, artificial splice acceptors or polyadenylation signals. Splice acceptor sequences are known in the art (Clancy (2008), "RNA Splicing: Introns, Exons and Spliceosome," Nature Education 1:1) and typically comprise a 20-50 base pair pyrimidine-rich sequence followed by a sequence (C/T)AG(A/G). SEQ ID NO: 33 is an example of a splice acceptor sequence. Likewise, polyadenylation signals are known in the art and include, for example, the SV40 polyadenylation signal (SEQ ID NO: 34) and the BGH polyadenylation signal (SEQ ID NO: 35). In some embodiments, the resulting cell line harboring the new engineered target sequence in all alleles of the selectable gene will be deficient in the function of the gene due to mis-transcription or mis-translation and will be able to grow only under permissive conditions. For example, an engineered target sequence can be inserted into the GS gene sequence using a meganuclease resulting in a cell line that is GS-/- that can grow only in the presence of L-glutamine in the growth media. In a subsequent step, a gene of interest can be inserted into the engineered target sequence while simultaneously reconstituting the selectable gene (e.g., DHFR or GS). Thus, it is possible to select for transfectants harboring the gene of interest at the amplifiable locus using media conditions that select for DHFR+ or GS+ cells.
2.5 Transgenic Cell Lines for Biomanufacturing
[0129] In some embodiments, the invention provides transgenic cell lines suitable for the production of protein pharmaceuticals. Such transgenic cell lines comprise a population of cells in which a gene of interest, operably linked to a promoter, is inserted into the genome of the cell at an amplifiable locus wherein the gene of interest encodes a protein therapeutic. Examples of protein therapeutics include: monoclonal antibodies, antibody fragments, erythropoietin, tissue-type plasminogen activator, Factor VIII, Factor IX, insulin, colony stimulating factors, interferons (e.g., interferon-.alpha., interferon-.beta., and interferon-.gamma.), interleukins (e.g., interleukin-2), vaccines, tumor necrosis factor, and glucocerebrosidase. Protein therapeutics are also referred to as "biologics" or "biopharmaceuticals."
[0130] To be used for biomanufacturing, a transgenic cell line of the invention should undergo: (1) adaptation to serum-free growth in suspension; and (2) amplification of the gene of interest. In some embodiments, the invention is practiced on adherent cell lines which can be adapted to growth in suspension to facilitate their maintenance in shaker-flasks or stirred-tank bioreactors as is typical of industrial biomanufacturing. Methods for adapting adherent cells to growth in suspension are known in the art (Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). For regulatory reasons, it is generally necessary to further adapt biomanufacturing cell lines to chemically-defined media lacking animal-derived components (i.e., "serum-free" media). Methods for preparing such media and adapting cell lines to it are known in the art (Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). Such media can also be purchased commercially (e.g., CD-3 media for maintenance of CHO cells, available from Sigma-Aldrich, St. Louis, Mo.) and cells can be adapted to it by following the manufacturers' instructions. In some embodiments, the cell line is adapted to growth in suspension and/or serum-free media prior to being transfected with the engineered nuclease.
[0131] Lastly, methods for gene amplification are known in the art (Cell Culture and Upstream Processing, Butler, ed. (Taylor and Francis Group, New York, 2007)). In general, the process involves adding an inhibitor of a selectable gene product to the growth media to select for cells that express abnormally high amounts of the gene product due to gene-duplication events. In general, the concentration of inhibitor added to the growth media is increased slowly over a period of weeks until the desired level of gene amplification is achieved. Inhibitor is then generally removed from the media prior to initiating a bioproduction run to avoid the possibility of the inhibitor contaminating the protein therapeutic formulation. For example, the CHO DHFR locus can be amplified by slowly increasing the concentration of MTX in the growth media from 0 mM to as high as 0.8 mM over a period of several weeks. The GS locus can, likewise, be amplified by slowly increasing the concentration of MSX in the media from 0 .mu.M to as high as 100 .mu.M over a period of several weeks. Methods for evaluating gene amplification are known in the art and include Southern Blot and quantitative real-time PCR (rtPCR). In addition, or as an alternative, expression levels of the sequence of interest, which are generally correlated to gene copy number, can be evaluated by determining the concentration of protein therapeutic in the growth media using conventional methods such as Western Blot or ELISA.
[0132] Following cell line production, adaptation, and amplification, protein therapeutics can be produced and purified using methods that are standard in the biopharmaceutical industry.
EXAMPLES
[0133] This invention is further illustrated by the following examples, which should not be construed as limiting. Those skilled in the art will recognize, or be able to ascertain, using no more than routine experimentation, numerous equivalents to the specific substances and procedures described herein. Such equivalents are intended to be encompassed in the scope of the claims that follow the examples below. Example 1 refers to engineered meganucleases that can be used to target the insertion of a gene of interest downstream of the DHFR gene in CHO cells. Example 2 refers to engineered meganucleases that can be used to target the insertion of an engineered target sequence into the CHO DHFR gene with concomitant removal of DHFR exons 1 and 2. Example 2 also refers to engineered meganucleases that can be used to target the insertion of an engineered target sequence into the CHO GS gene. Example 3 refers to meganucleases that can be used to target the insertion of a gene of interest downstream of the GS gene in CHO cells.
Example 1
[0134] Targeted Gene Insertion into the CHO DHFR Locus Using Engineered Meganucleases
[0135] The CHO genomic DNA sequence 10,000-55,000 base pairs downstream of the DHFR gene was searched to identify DNA sites amenable to targeting with engineered meganucleases. Two sites (SEQ ID NO: 7 and SEQ ID NO: 8) were selected which are, respectively, 35,699 and 15,898 base pairs downstream of the DHFR coding sequence (Table 2).
TABLE-US-00002 TABLE 2 Example Recognition Sites For Engineered Meganucleases in the CHO DHFR Locus. SEQ ID Location Relative to CHO NO: Target Site Sequences DHFR Coding Sequence 7 5'-TAAGGCCTCATATGAAAATATA-3' 35,699 bp downstream 8 5'-ATAGATGTCTTGCATACTCTAG-3' 15,898 bp downstream
1. Meganucleases that Recognize SEQ ID NO: 7 and SEQ ID NO: 8
[0136] An engineered meganuclease (SEQ ID NO: 9) was produced which recognizes and cleaves SEQ ID NO: 7. This meganuclease is called "CHO-23/24". A second engineered meganuclease (SEQ ID NO: 10) was produced which recognizes and cleaves SEQ ID NO: 8. This meganuclease is called "CHO-51/52." Each meganuclease comprises an N-terminal nuclease-localization signal derived from SV40, a first meganuclease subunit, a linker sequence, and a second meganuclease subunit.
2. Site-Specific Cleavage of Plasmid DNA by Meganucleases CHO-23/24 and CHO-51/52
[0137] CHO-23/24 and CHO-51/52 were evaluated using a direct-repeat recombination assay as described previously (Gao et al. (2010), Plant J. 61(1):176-87, FIG. 7A). A defective GFP reporter cassette was generated by first cloning a 5' 480 bp fragment of the GFP gene into NheI/HindIII-digested pcDNA5/FRT (Invitrogen Corp., Carlsbad, Calif.) resulting in the plasmid pGF. Next, a 3' 480 bp fragment of the GFP gene (including a 240 bp sequence duplicated in the 5' 480 bp fragment) was cloned into BamHI/XhoI-digested pGF. The resulting plasmid, pGFFP, consists of the 5' two-thirds of the GFP gene followed by the 3' two-thirds of the GFP gene, interrupted by 24 bp of the pcDNA5/FRT polylinker. To insert the meganuclease recognition sites, complementary oligonucleotides comprising the sense and anti-sense sequence of each recognition site were annealed and ligated into HindIII/BamHI-digested pGFFP.
[0138] The coding sequences of the engineered meganucleases were inserted into the mammalian expression vector pCP under the control of a constitutive (CMV) promoter. Chinese hamster ovary (CHO) cells at approximately 90% confluence were transfected in 96-well plates with 150 ng pGFFP reporter plasmid and 50 ng of meganuclease expression vector or, to determine background, 50 ng of empty pCP, using Lipofectamine 2000 according to the manufacturer's instructions (Invitrogen Corp., Carlsbad, Calif.). To determine transfection efficiency, CHO cells were transfected with 200 ng pCP GFP. Cells were washed in PBS 24 h post-transfection, trypsinized and resuspended in PBS supplemented with 3% fetal bovine serum. Cells were assayed for GFP activity using a Cell Lab Quanta SC MPL flow cytometer and the accompanying Cell Lab Quanta analysis software (Beckman Coulter, Brea, Calif.).
[0139] Results are shown in FIG. 7B. It was found that both of the engineered meganucleases were able to cleave their intended recognition sites significantly above background within the context of a plasmid-based reporter assay.
3. Site-Specific Cleavage of CHO DHFR Locus by Meganucleases CHO-23/24 and CHO-51/52
[0140] To determine whether or not CHO-23/24 and CHO-51/52 are capable of cleaving their intended target sites in the CHO DHFR locus, we screened genomic DNA from CHO cells expressing either CHO-23/24 or CHO-51/52 to identify evidence of chromosome cleavage at the intended target site. This assay relies on the fact that chromosomal DNA breaks are frequently repaired by NHEJ in a manner that introduces mutations at the site of the DNA break. These mutations, typically small deletions or insertions (collectively known as "indels") leave a telltale scar that can be detected by DNA sequencing (Gao et al. (2010), Plant J. 61(1):176-87).
[0141] CHO cells were transfected with mRNA encoding CHO-23/24 or CHO-51/52. mRNA was prepared by first producing a PCR template for an in vitro transcription reaction (SEQ ID NO: 20 and SEQ ID NO: 21). Each PCR product included a T7 promoter and 609 bp of vector sequence downstream of the meganuclease gene. The PCR product was gel purified to ensure a single template. Capped (m7G) RNA was generated using the RiboMAX T7 kit (Promega Corp., Fitchburg, Wis.) according to the manufacturer's instructions and. Ribo m7G cap analog (Promega Corp., Fitchburg, Wis.) was included in the reaction and 0.5 .mu.g of the purified meganuclease PCR product served as the DNA template. Capped RNA was purified using the SV Total RNA Isolation System (Promega Corp., Fitchburg, Wis.) according to the manufacturer's instructions.
[0142] 1.5.times.10.sup.6CHO-K1 cells were nucleofected with 3.times.10.sup.12 copies of CHO-23/24 or CHO-51/52 mRNA (2.times.10.sup.6 copies/cell) using an Amaxa Nucleofector II device (Lonza Group Ltd., Basel, Switzerland) and the U-23 program according to the manufacturer's instructions. 48 hours post-transfection, genomic DNA was isolated from the cells using a FlexiGene kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. The genomic DNA was then subjected to PCR to amplify the corresponding target site. In the case of cells transfected with mRNA encoding CHO-23/24, the forward and reverse PCR primers were SEQ ID NO: 16 and SEQ ID NO: 17. In the case of cells transfected with mRNA encoding CHO-51/52, the forward and reverse PCR primers were SEQ ID NO: 18 and SEQ ID NO: 19. PCR products were gel purified and cloned into pUC-19. 40 plasmids harboring PCR products derived from cells transfected with CHO-23/24 mRNA were sequenced, 13 of which were found to have mutations in the CHO-23/24 target site (FIG. 7C). 44 plasmids harboring PCR products derived from cells transfected with CHO-51/52 mRNA were sequenced, 10 of which were found to have mutations in the CHO-51/52 target site (FIG. 7D). These results indicate that CHO-23/24 and CHO-51/52 are able to cut their intended target sites downstream of the CHO DHFR gene.
4. Site-Specific Integration into the CHO DHFR Locus Using an Engineered Meganuclease
[0143] To evaluate the efficiency of DNA insertion into the CHO DHFR locus using an engineered meganuclease, we prepared a donor plasmid (SEQ ID NO: 11) comprising an EcoRI restriction enzyme site flanked by DNA sequence homologous to the CHO-51/52 recognition site (FIG. 8A). Specifically, the donor plasmid of SEQ ID NO: 11 comprises a pUC-19 vector harboring a homologous recombination cassette inserted between the KpnI and HindIII restriction sites. The homologous recombination cassette comprises, in 5'- to 3'-order: (i) 543 base pairs of DNA identical to the sequence immediately upstream of the CHO-51/52 cut site, including the upstream half-site of the CHO-51/52 recognition sequence and the four base pair "center sequence" separating the two half-sites comprising the CHO-51/52 recognition sequence; (ii) an EcoRI restriction enzyme site (5'-GAATTC-3'); and iii) 461 base pairs of DNA identical to the sequence immediately downstream of the CHO-51/52 cut site, including the downstream half-site of the CHO-51/52 recognition sequence and the four base pair "center sequence" separating the two half-sites comprising the CHO-51/52 recognition sequence. Note that this results in a duplication of the four base pair "center sequence" (5'-TTGC-3') to maximize the likelihood of strand invasion by the 3' overhangs generated by CHO-51/52 cleavage. We have discovered that donor plasmids comprising such a duplication of the center sequence are optimal substrates for gene targeting by homologous recombination.
[0144] mRNA encoding CHO-51/52 was prepared as described above. 1.5.times.10.sup.6 CHO-K1 cells were nucleofected with 3.times.10.sup.12 copies of CHO 51-52 mRNA (2.times.10.sup.6 copies/cell) and 1.5 .mu.g of the donor plasmid (SEQ ID NO: 11). Nucleofection was performed using an Amaxa Nucleofector II device (Lonza Group Ltd., Basel, Switzerland) and the U-23 program according to the manufacturer's instructions. 48 hours post-transfection, genomic DNA was isolated from the cells using a FlexiGene kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. The DNA was subjected to PCR using primers flanking the CHO-51/52 recognition site (SEQ ID NO: 18 and SEQ ID NO: 19). Importantly, these primers are beyond the limits of homologous sequence carried in the donor plasmid and, therefore, will amplify only the chromosomal DNA sequence and not the donor plasmid. PCR products were cloned into a pUC-19 plasmid and 48 clones were purified and digested with EcoRI (FIG. 8B). 10 plasmids yielded a restriction pattern consistent with the insertion of an EcoRI site into the CHO-51/52 recognition sequence. These data demonstrate that it is possible to use CHO-51/52 to precisely insert DNA downstream of the CHO DHFR gene at SEQ ID NO: 8.
5. Site-Specific Integration of an Engineered Target Sequence into the CHO DHFR Locus
[0145] A donor plasmid (SEQ ID NO: 25) was produced comprising an FRT sequence (SEQ ID NO: 5) adjacent to a zeocin resistance gene under the control of an SV40 early promoter (FIG. 9A). This cassette was flanked by DNA sequence homologous to the CHO DHFR locus immediately upstream or downstream of the CHO-23/24 recognition sequence. CHO cells were co-transfected with this donor plasmid and mRNA encoding CHO-23/24 as described above. 72 hours post-transfection, zeocin-resistant cells were cloned by limiting dilution and expanded for approximately 3 weeks. Clonal populations were then screened by PCR using a first primer in the SV40 promoter (SEQ ID NO: 26) and a second primer in the DHFR locus (SEQ ID NO: 16) to identify cell lines carrying the FRT/Zeocin sequence downstream of the DHFR gene. One such cell line carrying the integrated FRT Insertion target sequence was subsequently co-transfected with a second donor plasmid (SEQ ID NO: 27) and a plasmid encoding Flp recombinase. SEQ ID NO: 27 comprises a GFP gene under the control of a CMV promoter, a FRT sequence, and a non-functional hygromycin resistance gene lacking an ATG start codon. Flp-stimulated recombination between FRT sites in the genome and the plasmid resulted in the incorporation of the entire plasmid sequence into the CHO genome at the site of the engineered target sequence. Such recombination restored function to the hygromycin-resistance gene by orientating it downstream of an ATG start codon integrated as part of the engineered target sequence. As such, successful integrations could be selected using hygromycin.
[0146] Hygromycin-resistant cells were cloned by limiting dilution and 24 individual clonal lines were assayed by PCR using a first primer in the hygromycin-resistance gene (SEQ ID NO: 28). All 24 clones yielded the expected PCR product (FIG. 9B), indicating that the GFP gene expression cassette was successfully inserted into the DHFR engineered target sequence in all cases. The 24 cell lines were then evaluated by flow cytometry and were found to express consistent levels of GFP (FIG. 9C).
6. Transgene Amplification
[0147] A GFP-expressing CHO line produced as described above was seeded at a density of 3.times.10.sup.5 cells/mL in 30 mL of media containing 50 nM MTX. Cells were cultured for 14 days before being re-seeded at the same density in media containing 100 nM MTX. Cells were cultured for another 14 days before being re-seeded in media containing 250 nM MTX. Following 14 days in culture, GFP expression in the treated cells was evaluated by flow cytometry and compared to GFP expression in the parental (pre-MTX) cell population (FIG. 10A). It was found that the MTX-treated cells had a distinct sub-population in which GFP expression was significantly increased. Individual high-expression cells from the MTX-treated population were then isolated using a cell sorter and 5 clones were expanded for 14 days in the absence of MTX. GFP expression in the 5 clonal cell populations was then evaluated by flow cytometry and compared with the parental (pre-MTX) cell population. It was found that the MTX-treated clones had approximately 4-6 times the GFP intensity as the pre-MTX cells. Quantitative PCR was then performed using a primer set specific for the GFP gene and it was found that the MTX-treated clones all had approximately 5-9 times as many copies of the GFP gene as the pre-MTX population. These data provide conclusive evidence that a transgene inserted downstream of the CHO DHFR gene can be amplified by treatment with MTX.
7. Stability of Gene Amplification
[0148] The five clonal cell lines expressing high levels of GFP that were produced in (6) above were then passaged for a period of 14 weeks in media with or without 250 nM MTX to evaluate the stability of gene amplification. GFP intensity was determined on a weekly basis and the quantitative PCR assay used to determine GFP gene copy number described above was repeated at the end of the 14 week evaluation period. As expected, the clones passaged in media with MTX maintained a high level of GFP expression with no clone deviating more than 20% from the GFP intensity determined in week 1. Quantitative PCR revealed that gene copy number likewise deviated by less than 20% for all clones. Surprisingly, gene amplification was equally stable in cell lines grown in media lacking MTX. Contrary to what would have been predicted based on the existing art, GFP gene expression was not reduced by more than 18% in any of the five cell lines over the 14 week evaluation period. Gene copy number determined by quantitative PCR was also stable with less than 24% deviation over time for all of the cell lines. These results indicate that a transgene amplified in the CHO DHFR locus is stable for an extended period of time, obviating the need to grow the cells in toxic selection agents that that could contaminate bioproduct formulations.
Example 2
Insertion of an Engineered Target Sequence into the CHO DHFR or GS Gene Coding Regions
[0149] As diagrammed in FIG. 4, an alternative method for targeting a sequence of interest to an amplifiable locus involves the production of a cell line in which a portion of a selectable gene is replaced by an engineered target sequence. The advantage of this approach is that the subsequent insertion of a sequence of interest can be coupled with reconstitution of the selectable gene so that cell lines harboring the properly targeted sequence of interest can be selected using the appropriate media conditions. A cell line harboring such an engineered target sequence can be produced using nuclease-induced homologous recombination. In this case, a site-specific endonuclease which cuts a recognition sequence near or within the selectable gene sequence is preferred.
1. Engineered Meganucleases that Cut within the DHFR or GS Genes.
[0150] A meganuclease called "CHO-13/14" (SEQ ID NO: 12) was produced which cuts a recognition sequence in the CHO DHFR gene (SEQ ID NO: 13). The recognition sequence is in an intron between Exon 2 and Exon 3 of CHO DHFR. A meganuclease called "CGS-5/6" (SEQ ID NO: 14) was produced which cuts a recognition sequence in the CHO GS gene (SEQ ID NO: 15). Each meganuclease comprises an N-terminal nuclease-localization signal derived from SV40, a first meganuclease subunit, a linker sequence, and a second meganuclease subunit.
2. Site-Specific Cleavage of Plasmid DNA by Meganucleases CHO-13/14 and CGS-5/6
[0151] CHO-13/14 and CGS-5/6 were evaluated using a direct-repeat recombination assay as described in Example 1 (FIG. 7A). Both meganucleases were found to efficiently cleave their intended recognition sequences within the context of a plasmid-based reporter assay (FIG. 7B).
3. Site-Specific Cleavage of the CHO GS Gene by CGS-5/6
[0152] CHO cells were transfected with mRNA encoding CGS-5/6. mRNA was prepared by first producing a PCR template for an in vitro transcription reaction (SEQ ID NO: 22). Each PCR product included a T7 promoter and 609 bp of vector sequence downstream of the meganuclease gene. The PCR product was gel purified to ensure a single template. Capped (m7G) RNA was generated using the RiboMAX T7 kit (Promega Corp., Fitchburg, Wis.) according to the manufacturer's instructions and. Ribo m7G cap analog (Promega Corp., Fitchburg, Wis.) was included in the reaction and 0.5 .mu.g of the purified meganuclease PCR product served as the DNA template. Capped RNA was purified using the SV Total RNA Isolation System (Promega Corp., Fitchburg, Wis.) according to the manufacturer's instructions.
[0153] 1.5.times.10.sup.6 CHO-K1 cells were nucleofected with 3.times.10.sup.12 copies of CGS-5/6 using an Amaxa Nucleofector II device (Lonza Group Ltd., Basel, Switzerland) and the U-23 program according to the manufacturer's instructions. 48 hours post-transfection, genomic DNA was isolated from the cells using a FlexiGene kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. The genomic DNA was then subjected to PCR to amplify the CGS-5/6 target site using the primers of SEQ ID NO: 23 and SEQ ID NO: 24. The PCR products were cloned into a pUC-19 plasmid and 94 plasmids harboring PCR products were digested with the BssSI restriction enzyme, which recognized and cuts the sequence 5'-CTCGTG-3' found within the CGS-5/6 recognition sequence. 17 plasmids were found to be resistant to BssSI, suggesting that the CGS-5/6 recognition site was mutated. These 17 plasmids were sequenced to confirm the existence of indels or point mutations within the CGS-5/6 recognition sequence (FIG. 7C). These results indicate that CGS-5/6 is able to cut its intended target site within the CHO GS gene. Because the CGS-5/6 recognition sequence is within an exon in the GS coding sequence, many of the mutations introduced by CGS-5/6 are expected to frameshift the GS gene. Therefore, CGS-5/6 is useful for knocking-out CHO GS to produce GS (-/-) cell lines. Such cell lines are useful because they are amenable to GS selection and amplification for producing biomanufacturing cell lines.
Example 3
Meganucleases for Targeting Gene Insertion to the CHO GS Locus
[0154] 1. Engineered Meganucleases that Cut Downstream of the CHO GS Gene.
[0155] An engineered meganuclease called "CHOX-45/46" (SEQ ID NO: 29) was produced which recognizes a DNA sequence (SEQ ID NO: 30) approximately 7700 base pairs downstream of the CHO GS coding sequence. CHO cells were transfected with mRNA encoding CHOX-45/46 as described in Example 2. 72 hours post transfection, genomic DNA was extracted from the transfected cell pool and the region downstream of the CHO GS gene was PCR amplified using a pair of primers (SEQ ID NO: 31 and SEQ ID NO: 32) flanking the CHOX-45/46 recognition sequence. PCR products were then cloned and 24 cloned products were sequenced. It was found that 14 of the 24 clones PCR products (58.3%) had large mutations in the sequence consistent with meganuclease-induced genome cleavage followed by mutagenic repair by non-homologous end-joining. From these data, we conclude that the CHOX-45/46 meganuclease is able to specifically cleave a DNA site downstream of the CHO GS gene coding sequence and will likely be able to target the insertion of transgenes to this amplifiable locus in the genome.
TABLE-US-00003 SEQUENCE LISTING SEQ ID NO: 1 (wild-typeI-CreI, Genbank Accession #P05725) 1 MNTKYNKEFL LYLAGFVDGD GSIIAQIKPN QSYKFKHQLS LTFQVTQKTQ RRWFLDKLVD 61 EIGVGYVRDR GSVSDYILSE IKPLHNFLTQ LQPFLKLKQK QANLVLKIIE QLPSAKESPD 121 KFLEVCTWVD QIAALNDSKT RKTTSETVRA VLDSLSEKKK SSP SEQ ID NO: 2 (Chromosomal region 5,000-55,000 base pairs downstream of CHO DHFR gene coding sequence) 1 taaaactcaa gatgccagct ttgtagctag cttaggaaac aaagtagtaa aaaataataa 61 tgggtgggtg aaggtctgaa gcatttacag agttctctca agacaaagca cagaggctgg 121 tggccacata acttggcaac tgatttgggg gaacagaata caagaaagga aatttaaata 181 ctgtttttct caatgttgaa ctatatgggc atagtcacag ctgcctaacc tatagagact 241 ggaagctgga acctcggcta tctaagatag aataatcaag aaatgtcaat tatttgagaa 301 aaacatcagg aataaatagc tgctaagtta caagttggtg ctttagacat ttggagagga 361 taggatgggg gctcccagac ctggggctcc ctaataaagc tgtgctggcc tacaagttcc 421 agggatcctc cagtccatgc ctcccactgt tgggactgcg ggcgatggtt tctgacgtgg 481 gtactgaggg cctgaactgt ccacacactt aagccacacg ccttttactg agtcatctcc 541 tcatctcaga acattttcct ttaatctttc ttaatgaaaa ggtcgcattt cttccgaggg 601 ctagcctcct gttactctct atacatgtca cataaaacta catgaaaact ttgaaggcac 661 tatatgtcca tactcagatg aaaagccatt agctgtggtc atacaaaacc ccacagacca 721 actgttggga aacatcagac ttttttcctg cagcgcctgc cctgatcttc cacagagaat 781 tcagtctcac tttttccagg atgacttctg aactatcacc gtaagatgag aatttgaaac 841 aaagatgtaa gtaatgaact tcatgtgttc tgaacacaca gcttagtgca ttgaaattac 901 gtaacacccg cttccttata agccatttct caaaatgttc ccattacacc tgcatcgggg 961 atgggtccca gaatcttcct tttaaataaa caccccagag gattctgaag ctagaacacc 1021 aaggactgac agagagaagc atgcctgtgg gcgactccag acacctggga gctgcctgct 1081 ttcttgctac tgatttagaa ggcatttgcc cccgaatggg gctgggggac tgtcactatt 1141 tctcattctc gggactttga aaggaagcaa aacagaaaac catgcaaagt ataagccacc 1201 atggaataat ggcagacgat ccggttgtgc agattagatt ttacatattg ctgattttga 1261 agctaaagac ctttcacttc ttaaatatat aataaaattc atacaagagt attttgtgta 1321 ggtaactcag tcagatacaa ggtaagcaaa gtaaatgata ggtgcccctt aacaaaatgc 1381 attctcatag ttcatttatc aattatagaa atggtggact ggagggaagg cttgaggtca 1441 ggagaatgtg ctgctcttcc agacagcccg ggttcttttc cccagcaatc tgggactcac 1501 gtctgcctgt agctccaggc ccaggggatc tggcaccttc ttctggcctc tgcaggcacc 1561 catacacaca tggcatacac acacatacac aaattctaaa attaaatagt aggttgtagg 1621 cctacacaaa aacatgcata cattaactaa ataattaata gttaataaat aaaaatcaac 1681 caaacacata cactgattaa gtaacatgac tctgtaaggt caaaggcggc tgaccagctg 1741 tgggaagggt taaataataa caatcacctt tgaaagactg gacctggtga ttaaggatgt 1801 tccagctgtg tcgtggatga gaaatcaaat gcataattga atgagtgcca ggaatagaac 1861 tggagacttt ctggtgagaa tgcttttact ggcagtagag tccctgtcta aacaggagag 1921 agacctgcag tagccctgtg gcggccctgc agtggccctg tgatggctct gcagttgtac 1981 tcttcctgag ataggagaca cactagagag tgtttctaat gagcagctcc tgtactttct 2041 gttcccctgg agaccgcacg tgtttctccg ataatacatt gacatttctg ttaaaccatt 2101 ttcttcttgg aacaaaaatg gagaacaaat cagattggtg tgtggtcttt taaataactt 2161 ggtacttaat aacacaaaac aaaattatca gaggctggat tttaggtgct ctcagcatct 2221 gccacccctg agccatcagt caggtcttgg aggaacaatc tccaaggaga aaacagttct 2281 gtcctcagaa aagctggagg aatatgagat tttctacagc actcatagca aaatcattta 2341 cggaagggat cctgagtaag atggcctctt cttcatcaca tggtcatagt ctgcttcaat 2401 ggggagaata gttcaatcta gcatcgagaa atcgaaggtt cccttttgac tggcaatgcc 2461 ccatagatag atagatatag attatgtata tattgtgtaa aacacacgta tgtatatata 2521 atacacatac atgtatgtgt atacatacat acatacatac atacatacat acatacatac 2581 atacatagat acgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 2641 ttgagactga gtttctctac tatgtagctc tggctgtcct gaaagttgct aagtagacca 2701 gactggccag accagatcca ccctcctctg cctcctaagt gctgagatta aaggcctgca 2761 cccaccccca cccagcccat cttatatttt gcttcatttc aaagtaagct ctatgcatca 2821 tttattcctg catattatta gccatggttc agtcttgttt gtgttttgga atatttactt 2881 aacaaaactt gaaaaacatt tttcaagatt tgtttgtttt taagatttat ttatttatta 2941 tgtataataa taaatattat tatgaaaaac ggtgttctgc ctgcagggca gaagagggca 3001 ccagattgaa ttacagatgg ttgtgagcca ccatgtggtt gctgggactt gaactcagga 3061 cctctggaag agcagccagt acttttaact gctgagccat ctccccaggc ccaaaataca 3121 catcttaagt gtattgccac aagcatacat cttcatggcc caatcttctg tccatcactt 3181 cagacagctc tccttctttc cctggccagt cacaacaccc tcagctatca ggaaaggccc 3241 tatgggggtt gttttgtttt cccactccag ttcccttgcc tgctctgacc tcatgagtag 3301 actcatacag gatgtgctca cttcacttgg gatgatttct ttttcaccca ttgttgctct 3361 gcccagaatt tgttcctttt tattgtctta gtgttaatca actatcaaag ccagcaacaa 3421 aaaatagtag ggaaactttt ttgatagggt aaacctgatt gattgcaggc tttggttgcc 3481 ttgtttggtc tatccccttg agagtccctt acaatgtgag ttagttagtg gctgctaact 3541 agttgaatct caacttcctt tttctttaat gtgggtattt gtaaggaata gcccccttaa 3601 atctagattc tgttctcaaa tcaagcaagc tcaaggctgt aagcatggat tcaccaactt 3661 tcctgctcaa ggaatttaaa tgtctggtct ccatcatatt actttaatag taatagttta 3721 ttatacacat gtgccagctg tatatccctt ttcttcttga tggacctatg aactctgttg 3781 aggtgagatt tgaacccctt agaaggtgct agagaagagg tacctgatgg tcaaggcaag 3841 gctgatactt attcatgggt cccacatctg ctaatgtaag caataacaga taatatgctt 3901 tgtgtttaga cccacagtgg ttgcatgtac actaagtatg tatcatcatt gtcttatcgt 3961 tcctttagaa tacagctaat aattatgacc gctattctca tagcatttat attatatgag 4021 cattgtaaat tattttgaaa tgctttaaga tatacttgag aactatgcat atcatgcgta 4081 tgttgttcta ccagctggga ccttgaaatg agatcccttg aggccagcat aaagagaaag 4141 ttttcatctc aaacaaacaa aagatacact tgataataga tgagggataa atgtcatact 4201 ttttatatag tgattgagaa tctacagatt tgggtatcct ggtcacttag gagaccaagg 4261 gaggactatt agctctagag ctatgaactt tatctccaga ttccaaagcc aatacaaact 4321 ctagccaagt tggggtgctg ttacctgtat ccctctgtca aattccaagt gttttcacca 4381 cctttactgt atctttccaa ctgttctctt ttataaccac acatagttca tggtctttcc 4441 ttctctcact tgactgtgga gtaacctaac ttgcgtgttt ccagttttcg atctcttcct 4501 taaatctaca ctagttaacc acaaagaccc tcttttctga gctgtgtcta ttctatcact 4561 gtcaccattc cttaatgctc tcccagatgc agccaaactt cactttgggc ttgagagtct 4621 tctccaggtg acagtgacta atgtctccag attgagcatc taccatctac cctgtgtatt 4681 acacatgaat agccttagct tttcagcaat agacagatag atccatagtt agccatgtca 4741 acacccttct tcatgctgtt ctcacagtaa taagtcctaa ttcctgtttt ctcccatcta 4801 aactcaaccc tgtcctaaat accttactca aatcctaatt gtatctcttc cacaaacatt 4861 tcccccttct ctccattaca aggtggaaac tcagagatcc aggtgtcttg catgttgttg 4921 attctgtcct caacaaggaa ttccccaggt tcctgcacga aggaaagcat ggaggaccat 4981 acttgaggct actggtgtag tgggaagaca ggcccaaacc atgtcacaga aacccatcac 5041 cagaaagttg ggggaggcag cccagttgtg gagcaggaga aggagaaaac aggcttgggg 5101 aactgctagc tatgctttgt cacagtcaca agaaaaaagg gccctagcct ggcctacata 5161 ttctacaact tcctgaatct ttgctctgaa atgaagaggt ttggatggct gtctgggaat 5221 tcatcttgct tgcagtgaag ctccttgggg tatttgaaac caggaagttt gaaggagttg 5281 atgctaattg ttttctaaag tgtgtgagga gtactggcag agttcaggcc ttgtgaggaa 5341 agaatcctat atctagtctg cactcctggg cacatgagac attcagctat ctcccttata 5401 aagcatagaa agtactcttg tacttgacac agaaataatt tcagtatgta gagcattaaa 5461 aaaaagtatg aatgacttag agagatggct catcagttaa aagcacatac tgctcttcca 5521 gaggtcctga gttcaattcc caacaaccac aaaaactcac acatatgcat gtgattaaaa 5581 ataaaatctc tctctctctc tctctctgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgag 5641 tgtgtgtgtg tgtgtgtgag tgtgtgagtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 5701 tgtgtgtgtg tgtgtgtgtg tgtgtgatgg tgggcttgtg tttgcaagcc cagcactagg 5761 gagttaaggc ctcactcaca gtgccaggcc agtctaggtt acagtgagtt ctagacagcc 5821 caagctacag agtaaggtac tgacaaagaa agaaagaaag aaaaaaagaa agaaagaaag 5881 aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag aaagaaagga gagaggtgag 5941 agggagggaa ggaactggaa gggggaagga gggaaagaaa agaaaaagaa acaaccaaag 6001 gaacaaacca ctgtatgcca ttatacatta gctttgggct ttacaggtta tacactctat 6061 attgtcatag ccaatgtctc aatattccat aagaggtgtc tagttgtggg tatgttcttt 6121 cttagtcctt ttatttagac tacatgacct gtttttgcct aataggccat tagtaatact 6181 gacttctcca catgctgccc tcaaaactta ctcctggaag atctttattt aagctatgaa 6241 cgaaaatctt aaccctgtga cctgccaccc agaatgcctc tgggaacaac ctcaggcaac 6301 ctatcaagcc gcttttccaa catttggggc aacagggatt aaaattatga ttgttgtctg 6361 cctgctgagt tcaaactcac agagggacca gaagctgact cactgatatc aagcagttct 6421 aaattttcag tttaaaactc taattattaa acaggggatg tcctcagacc agcactcaag 6481 agaaggagat aggcagagct ctatgagttg agttataggc cagcctggtt ttcatagtga 6541 gttttagctc tccagagagt taccagcaag accctgtcac aaacaaataa aaacaaacaa 6601 acaattaggg gatatacata taactaaatg ataaagcctt acctagcaca ttcaagtccc 6661 caggttcaat tgctagccct gggtggggat ttggacaaat ttaaaaagac cttttttgta 6721 tcacacataa atatgactgc actggttgtt gttttccatg gaaacagaat caatgtggca 6781 tgtattttac ggcattagct catatagttg tgcaggctgg caagtgtgga atgtataggg 6841 caggccagga atcagaaatt gatacaaaat tcaggaaaga cctctgggtg caatggtgca 6901 cacctttaat tcaagcactt gaaaggcaga ggcaggtgat ctttgtgagt tccaggccag 6961 cctggtctac atagtgaatt ccgggacagc cagggcttca tagaaagaac ctgtctcaaa 7021 acacacaaac aatcagaggg aagggcttat tttgtttttg agacagggtc ttctatgtag
7081 cccaggctgg cctcaaactc atgctcttga tatgcccacc tcacaagtgc atgttaagat 7141 tacaggtgcc tgacacacac cacttttgtg aagtgctgaa gagtaagccc agggcttcat 7201 ggacgctggg caagcactgt gccagctgag ccacactccc cagtgtgcac gatactttgc 7261 aaagatagat ccatatggat gctgtgcttc tatctaaaca gaatgacaac cacactctgg 7321 caggttctgg ttcataactg agtcttattg gtcacctcct tctccatttt tcgctggtat 7381 ttctcaagga gagaccacaa atgagaagtg aagcctaact tttaatgcgg tctctcctat 7441 gtcacctaaa ttctagctca aacagggttt ctggctctta ccttttcctc gggtttctgg 7501 atacttgaag tgttaacggg catttctctt aaagaccaaa tctggccaga ttcaaatggc 7561 tggccttcaa ctcggcaaac taggaacaat aatgtccgct gcatgtggct tgtagcactc 7621 tgtttctatt catggacttg tgagtgattt ctgggaaaca cgaattataa gataagtcct 7681 tttcagtgga cttcacaagt tcaccctcag gtagtatact gtcaggtaga aacgtctttc 7741 agagaagcga gaggtgacaa gccctctggg ctggccattg tccctgctgg cattgaacag 7801 cctgttcagc acatgaaagc atcgcctgat gctcccaaag ctggagcact ggcagccccc 7861 tgcagtcagg tgtgtagggt gggttagcag gggtgcttag gcgggttttg tagttacctt 7921 ttcaacacaa atgcaaaagc cagagagaga gagagagaga gagagagaga gagagagaga 7981 gagagggaga gagagagaga gagagagaga gagagagaga gagagagaga gcaggaaagc 8041 atccaggctt tgaagcaagc cagccttcag ctctgtcctt gagccattct gagtggaatg 8101 gagtaattgt ctgcttggag aactgaagaa tagcacatgg caaagaacaa tttgtacctg 8161 gaatatattc attagcttgc atgtcaaaag gccacatgca gatagaaacc attatcttgg 8221 cattctttaa aaccttgcag ccttgagact tgaggtgcag aaacccacat gcccatgtga 8281 ctgactacct gtcgatctct ccagccctgc ctggctaaca gggacaatat agggggatgg 8341 tgggagggga cagcttagac tcctgtggac ttggattgaa agaagaacag ggaagacagg 8401 ggactgtgca aataagcact ctattaggac ctatttttgg tgtcttggga ccctcctact 8461 ggtttagctt aaattgagag gggatttggt ttgcctcact agctgtttct tcccactcaa 8521 ttcacaatta cagctttctt cattgtcatt aaaatacatt aaatgtgtac ttgttggggt 8581 aaggctttct gttgaaatct gcataaagac aatgtccaca gcccccagtc agtggaaaga 8641 gcagtaggac cagaaggcat gtgtttccat cccgagtcta tattggaatg tttgttaaaa 8701 cctgcacttg taagagacaa acactagaac catcagcttg caggtctaca ggccagtgtt 8761 gccagtgcag ataatgccca aactggaacc taaagatgaa ggcctttggg agctgaggtg 8821 gaagagtcag ctgtgatctc ccagatgtcc tcctcatgcc ccattgccac tctagcctcc 8881 cacctccaag cacatttggg atccaactgc taacccctgg tgttcttttc ttagttgaaa 8941 ttctcaggga ataacctaag agtctctgtc actcagtcta tggcatccta tgataacagc 9001 caaggctaaa tagccatcat tgttcttttt ccagatgctc agcaatgagg atgcagaggt 9061 gaacaaaggt ggttcagggc tgccctgatg atgaatttga caagccagaa tctaacaaga 9121 tcagtcggta aacagaatcc tccttcctat ccagagatgt tggcttgttc tgtcactgga 9181 tgggcatcat ttactataag tcatacaggc accagacact cagagataaa taacatgaag 9241 tttccagtct tatgcagtcc tgtctagttg acttgccagt attctcaagg aagttccacc 9301 ccagcccctg gcatccatag accaaggact ctggaatgtt ctgggaaagc tccacctgag 9361 ctcctagcac ccatatatcc aaagagtctg gaacgttatg gtggaagccc cacctctctc 9421 tccccagacc tcgccccctc aaaaagtcca ccaaagactc cccacccccc acacaccccc 9481 agatgctcaa gaccacttcc atagagtatt taaactgcct cccagaaaac agaattcatt 9541 ttttcagtct ctcttcccca tgtcctctca gggtgggggg caggggtatt agtattcaag 9601 cacctatact ggcctgtcct tggggttctg acaagatatg acctcagcta cagccactaa 9661 gatcaccacc tgtgtatatc cactatgctc ccttttaaaa gggccctgtc cacctcccat 9721 tctctctgtc tctctctctg tctctgtctc tgtgtgtgtg tgtctctgtc tctctctctc 9781 tttctctctc tctctgtctc tctctctctc tccttctctg cctgactctc cctccctccc 9841 ctgctctctt ctttcctgct gcttttgtcc ctagaggcta gtctcctctc tccccttccc 9901 ccttttccca ttcactttcc cccaataaaa aactctccac ccaagctcta tcacatggca 9961 tcattctctt gctccatgat tttaaaatca caatgaggag gggagcatgg aaaaattatc 10021 caggaagact ttatccatta aacctgggtg ctttttcttt cttccttcct tcctttcttt 10081 ccttctttct ttcttccttt cttttttcct ttcttccttt cttttttcct tttttccttt 10141 ctttttgttt tgttttgttt tgagacagcg tttctctgta gctttggaga ctgccctgaa 10201 actcaatctg tagagcaggc tggccttgag ctcacagaga tccacctgcc tctgcctccc 10261 atgtgcttga attaaaggtg tgcaccacca ctgcctggct taaaactggg ctttttctaa 10321 gtcagtttga tttggattgc tgcattggca gagaggttta ttggggtgca gaaacctttc 10381 aaccagcttt tgagctaatg atagagagaa gctcaaggaa ttggagcaat gcttgactag 10441 ggatgtcaga gggaggctat ccagaggagc ttacaactga ggtaaactta aaagttaggg 10501 agtttgtcaa cttcaaccca cagaatagag cagagccagg aggagctgag gcttctgagt 10561 gttatggtgg aagcatcacc ccaacccttg acatccatat gcctgaagag tctggaatgt 10621 tatggtggaa gttccaccca agcctccctt cccggtcgcc ctccaaaccc tgctacatct 10681 cagaaatccc accaaatgat gactccctcc cccagagata ttcaagacca ctcccacagg 10741 gtatttaaac tgccccccaa cccccagaaa atagatgtgt ggttttccaa tctctctttc 10801 ctatcacgtc tctggggagc tggcaggcca tttgggagca ttgtatccat taaacgactt 10861 ctcagtggag actctgaaag ccagaagagc ctagacagat agatgtcttg catactctag 10921 agactacaga tgccggccca gactattata tccagcaaaa gtttcaaaca ccatacaaag 10981 tcaaatttaa acagtatcta tctacaaatc caatattaca gaaggtgcta gtaggaaaac 11041 tccaaactaa gattaactat acctgtgaag acacaggaaa taatctcaca ctggcaaaag 11101 aagaaaaacc tctctctctc tctcctctct ctctctctct ctctctctct ctctctctct 11161 ctctctctct ctctctcaca cacacacaca cacacacaca cacacaccaa caccaatacc 11221 atgaacaaca aaataacagg aattaacaat aattgatgtg tgtgtatgtc cctgtgtgtg 11281 tgtccttgtg tgtgtctgtt tgtgtgtctg tgtatatgtt tgtcacctga ggggtggctc 11341 ttccttggtt tgtgaggttt ctacccaatc tataactccc ttttcttcat tcacttcctc 11401 atgtccttac tagtctctat tgtggattaa ggaaactgtg tggagaacag ttttcttcta 11461 gaaaagaaca ctagccatct catgtaatca aattggtgac tatcctaatt attatgagag 11521 agcttccgtc cagtaagtgc tagaagtaga tgcagagatc cacagacaag cactgagcca 11581 agctccagga gtcctgttga aaagagagag gaaggattgt aggagccaaa gagtcaagag 11641 catgacaggg aaacccacag agacagctga cctgggcttg tgggtgggag ctcatggact 11701 cttgaccaac aattagggaa cctgcatgag gccaacctag gaactctgca tgtgtgtgac 11761 agttgtatag catggtctgt ttgtgaggct tctagcagtg ggatcagggc ctgtccttgg 11821 cgcttgagct ggcttttggg aacctgttcc gcatgctgga ttaccacacc cagccttgat 11881 gctgggggaa gcacttggtc ctgcctcaac ttgatgcgcc ttgcattgtt ggattctcat 11941 gggaggactg cccctttctg aaaaagaaca aggagaagtg aataggggag gggattggga 12001 ggagaggaag gagaggaaac tgtgataggg atgtaaaata aattaaaaaa ttaattaatt 12061 aaaaaagaac acttgtactg gtagattggc taaaatgaaa caaagataaa agtacacagg 12121 aaaaagagag gagaaacctg gggagggggg ctccaaagag aggtgagggg gggatgggaa 12181 tggcagctta gtggaggaag gaagacatga cctacacgaa tcgagctgta gtttttatct 12241 ggagcatagg gtaaagatgt ttgaggagaa ggaggaacac atgcttgtaa aacatggtct 12301 tcagaaccag caacaatcat acagagtgtc cagggtccat gggcacatga aggacagacc 12361 aacacatatt taacagtaaa gtgtccatat ttggtatgaa agtgatgggt aaattgtcct 12421 gggactgtaa tttagttgta aaggacttgt ctggcatgtg ggtattcttg ggttccctcc 12481 ttagcactga aaaaaaaaaa aaacacacac acacacacac atatattcta gtgttttgta 12541 gaaaaggatt caaagaaagc catgatttct cttttgataa atccagaata atgtaataag 12601 aacacacagt ggtgtgattt cagcaatcaa gtacaggttg cttgtctgtt tgttgtatgg 12661 gatggttggg tggttgtttg cttggtttgt aagatgggtg ggtgggttgg tgggtggttg 12721 cttggttggg tagttggttg ggtgattggg tgggtgggta tttggttggg tgggtggtgg 12781 gttggttggt cgtttggttg ggtggggtgg gttttgtttt gagacaggga tttactctat 12841 atctcagttt gtctcaaact cactatgtgc acatgagtat gtgatgagat tatctaagac 12901 catagtgtct gtgttcatgg aatgtctctc tagcttagag aatttaaaaa atggccatgt 12961 agggaaaccc ctcagaaaag gagtttctat ggcctccaag aataagaatg gatcctccta 13021 gctcggagtc agcaaggaac tgaagccctt aattttatag acacaaagga atccattgtg 13081 tggctccttc ccagccaagt ctcagatgag tcacagacct gcatggcacc ttatgcagtc 13141 ttttgaggtc ccaagaatag gatgcagata agccatgcca gaatcccaac acacaaagcc 13201 ttagtgatat agtaaatatg tattgtgtct aggctgctgc atttctggtt atgctactgt 13261 gcagtaatac acaactaata cagatgtgat ggttaatatt atgtgacaac ttgagtgggg 13321 cacagaggta cagacacttg gtaaaccatt ctgggtgcac gtaaggatag ttttggatga 13381 cataaacatt tagattagta tgctgggtaa aatacattgt ccatcccaat gggcatgggc 13441 tttgtccaac tagatgacag ctggaataga aaagtctgcc tctctcatag ttctcaggcc 13501 tttgagctca gactagacag aactcacagg ttctctgagc tttccagctt gatgaatgtc 13561 catggcagtc ttcacactta acacctgaca gacttaatga tcatatgaac caattcaaat 13621 ctgaccatca ctcgggtcat tcttttgatt ctgtcacttt ggagaactaa taccgaggac 13681 ataaaatgcc atcacatcgt tattttcttc ctgtctgtga atatttttct tttttttctt 13741 ggtttttttt tttttttttt tttttttttt tttgtttttc tctgtgtagc tttggagcct 13801 atcctggcac ttgctctgga gaccaggctg accttgaact ctcagagatc cgcctgcctc 13861 tgcctcccga gtgctgggat taaaggcgtg taccaccaac gctcggcctg tctgtgaata 13921 tttaaaatga aaactttgga aatgttctga aaccagctgg tgtcagatag tcagagaact 13981 ttcgtaaggt aggtgtgggt tatagcataa tcccacacaa gaggctgaag caggaggatt 14041 ttgtgtttga gggcagctag agccacatgg tgagtccctg cctcaaaaca caaaagcaag 14101 acaaaaacaa gctccaaata agattcactg ggccctttct ttccttcctt ctcagtgagt 14161 ccacttgctt taaaatcagg tcttaaagac gcactagatg ctgaacttaa cagtaataat 14221 aaatatcttc tcttacagta cagattatgc tctataaaca ctgcactgat aaagttcagc 14281 cttaaccttt gttctgtaaa tgtttcctag tttttctact gccgtattat aagacaaatg 14341 tcagcatgaa ggcaggtttt tcagaaaaca cagcagctcc acagatggcc tctaatccat 14401 aatcattaaa gacaagactg caactttttc aactggaaat cattcaagat gtttttctga 14461 agtccctacc aggacacaag ccaccctggt tgctgtgtga catcagttag gtagactctg 14521 aactggcttc ccaagaaatt atacaaaagc aaggtgtcac ctagtattag cataacttct 14581 gataactact gtcttagctg gggtttctat tgctgtgaag agacaccatg accacagaaa
14641 ctcttataaa ggaaagcaat tattgggtcc agcttacagt tcagaggttt aatccattgt 14701 catgattgca ggaagtatgg tggcccacag gcagacatgg tgctggagaa gtagatgaga 14761 gttctatatc agattgacac acttcttcca acaaggccac acctccactc actctgagcc 14821 tatggggcca ttttcattca aaccaccaaa gctacaaggt agcttatacc ccagcttgct 14881 atttctgatg agacttagta aatagtctta aaagcccata aaatgactca aaactagttt 14941 ttttattatt attattagtt caaattagga agaagcttgc tttacatgtc aatcccttct 15001 ccctctccct catcaaaact agttttttgt tttttaggtt ttttttcaag acagggtttc 15061 tctgtgtagc tttggagcct atcctggcac tcgctctgga gaccaggctg gcctcgaact 15121 cacagagatc tgcctgcctt tgcctcccga gtgctgggat taaaggcatg caccaccaac 15181 acctggccaa aattagtttt aagtccagtt ctaggagctc caatgccctc ttttggcttc 15241 catgggaacc aggaacacta tatatatata tatatatata tatatatata tatatatata 15301 tatatattca ggcaaatatt tatgcatata aaaataaaat aaatcttttt tccttttttt 15361 tttaaagaag tgacattgtc ttggaatttt tgtggctgct ctgcccttat gtgtaactgg 15421 acactaccag catctaaaca ctggcctgaa accagccaaa gaaaaccttt gtgccaggtc 15481 ctgtgtcaaa gtattatgtt ccttttagga tatcctatat cctaaaggat ttattttact 15541 gatagcatct taacttcctt tgaaaggttg gtcttctcaa gcagtcctcg tggagctggc 15601 tcctcagcta atgccagggg acaataatga tcccctccca aaaccaaaca gaaaaccatg 15661 gcaactctgg tttccttggg cagcacctgc tttaagaatg agcaaatgac caatcagctc 15721 atgaaactaa atactctatt attactaaaa tatttttttg agacagggca tggaattcat 15781 cacatagttc aggttggcct tgaactcaga gagactcact tacctttgcc tcccacgtgc 15841 tggaattaaa ggcatgaacc accacaccaa acataacact tgaattttgg aagagtcctt 15901 cttccaatag atttgaggtt ttgaaaatgt ggcacagaaa atatgaattc aaatataatg 15961 aaaacaagag ataactttca actaagtttc tataggttct tgctaggaat cctaagcttg 16021 tctgaaactc tagagcttct gtttctagct tctgagtgtt agtattgtag gtatgtgccc 16081 tgcctcagtg tgatgttttt gataatctta aagaaatcaa agaaatttta taaaagacta 16141 gactgtgcta cacaaaaaga atattcagat gccaagaaag agttcttaga aattaagaaa 16201 tatgctacta gtataaatcc tttataaagt ggaatgacaa atctgatgaa atcttactaa 16261 aagtagaaaa acataaacat caaagacatg aataataaga aaatcatatt gtgcatatga 16321 ttaacctaaa acattaactt gcaaaaatag aatagtccca aaaagtaaac aaaataaata 16381 aatcaccaag aacatgatac aaggacaatt cctaggatga taaaacaaga atattcatta 16441 taaaaggccc tatcactaaa gcacaacaga aacagactca aaagataaat cttcattgtc 16501 actggagaga agtccatact atcatagcac tcagaaggaa ataaaaatca aaatgtcaaa 16561 aaggacctca gcctctgaaa cacaaataca aaatatgtcc cgccttcttg acacgcatta 16621 ctcttcaatt aacattttaa gaaaactata aactgttaaa gagagcttag tattttaaga 16681 aatctgtagc tatttctttt ataagcatga caactaagtt tccctgattt aaacagacct 16741 aaaaaaccgg tgaagtgagt ggagaaaggg gatacgaaga cagcatccca catgactgct 16801 cccagtaaag gcaaggtctt catccatttt atcctgaact ctgggaaatt tataaagaac 16861 agaaatgtat ttctctcagt tctggagcct cagtccagga cactaagtct aggtactaca 16921 ctctcacatg gtggaaagta gaaagcaagc tcacttgtca ctcactacct gatgcctctt 16981 tcatcaatcc cattgataag gaagagacct ggcatctcag tttcctaagg actcagctct 17041 tactaacatt agctgtcatt tctgggtcac tgtaacagaa agcctgacag aagcaaccca 17101 ggggaagaag gatgtatttt ggctcactgt ctctgaggat ttcaacttat cccagcaata 17161 aagggataaa ggcattgcag caggaatatg tgtggcagaa gctgtttatg tcacaataaa 17221 caaataaaca cacgctagcg cgcgcgcaca cacacacaca cacacacaca cacacacaca 17281 cacagagaga gagagagaga gagagagaga gagagagaga gagagggggg ggggcagaca 17341 gacagacaga gagggagaga ggcagagagg gagagagaga gagagagaga gagagagaga 17401 gagagagaga gagagagaga gagagaaatc aaaggcccac ctccatcaga ctggtcccat 17461 atcccaaatt tctagaacct cctaaaacaa caccatcaac tgagggagac atttttggat 17521 tgaaagcata atgccattac ccaggcagaa tctgcctgtc tgggggagtc acatttaagc 17581 catggtatca attgacctca tgtaatttca gaatactaca taaaactatc agatattttt 17641 catgatgaat ttctaaagct tgaaattccc tttgaataaa ggaccaacta cagaattttg 17701 ctgagtctac aattacatac atgaaaatgt aactacgaag tggccagcca caatgaaaat 17761 taaagtgttt gggtggtctg tctctattga tgctcttctt tgccctgttt ttttttaata 17821 ttgttgatgg tttgtttttc ttttaagata cttggcccca agaaaaaaaa tgacagcctt 17881 aattaatttt gtttactctc ctgacatttt aaaagacaaa tttatgaaga cctgactgtt 17941 ccatgtagta ttagaaagat gtaaaattaa gggttgctta agctgtgtag aattgaagag 18001 cacagcattt gagtgacagg gtacaattag agatcatcag ggatgtggca caaagtgtac 18061 tcaacctcac cttttcctgc ttagcagaga acagggtgcc tcggtgagat aggaaattaa 18121 tcaaatagaa gaagaaatag taattttaga aggatcaaat tttcctggtt agaatgatca 18181 aaactacaag acttgtaact aaaatatagt caaacccatt tcaactggaa tctgtgctat 18241 tcatgtatag attaactaga atctaatttt taaattttca tcttacttcc aaaaatattt 18301 gtccaaatac tctgtgaatg cattagtttc ttatgggaaa acatcatatc ttttgtacaa 18361 tgtgtttctt agcttgaggt tctctccaaa caggaccaag acgaggccag gaccatgtga 18421 tacaacccat agtcctcaag aaatagttgt cattttctta ttccaattgc atcccaaggt 18481 ctcatctcat tttgcgtgtg cctttgacac cccataccca cataaactaa ggtggtgtta 18541 ttttttgagg ccctgaaggt atcttcagga atccataagt gagccttaag ctgcatctgg 18601 atataggaat ctgaaagtgt cccttctctg catgatctct tctttcagtt tttcaagtca 18661 gtgtgccaca ggaatcagga acgataaatg gagaggggaa gtgcagttgc ttggtataga 18721 caccccagag ggctatttgc atcctgtcct tcaaaatctc tctgagcctt cctgcctaag 18781 ctgttttgag ttgggtttgt ggtaccagaa cccctgcccc cgccccattc tgactaatga 18841 gagagagaga gagagagaga gagagagaga gagagagaga gcagcagagc atagaatgaa 18901 agtaggttag aagggcaggt aaaagcactt tagacaagag caggtataag ggccttggac 18961 tccctcccca gaacacacac atgaaggtaa acgatggtta aaggatacag ataggatgtc 19021 gaagctggac gatcacttgc ttttgtgtgc ttgaagtgac aggctgtggc tttcgggttc 19081 atggggtctg ttgttgagtt cacagtctca ccatgttagc aagcatgtca ctattaagct 19141 ctatccccgc cccccttttt tgagacatgg tcttgctaac atacccagac cggcctagga 19201 agcactttgc agtctcagct cccctgagtg ctatgatcac tcgtgtgagc tacagtaccc 19261 aaaccagaat atgtgtgttg ggtgttatga gagtttacac attgctgcct tgaatgctgc 19321 tctgcttgag ttcctgtagg aagctgagct gggaacctaa gcttcctcct cccagatagc 19381 agtaaccctg cagagacctc ccaccaagac tagctaaccc ctccttcttg tgctgtactt 19441 agcaagaacc ccaaggttct gggtccttgt gctacagttc cagaagagta tgaacaatct 19501 tagcttttct gtatatgtgt ctgtgtctgt cctgtcagat caagtcccag cctcactgta 19561 tgcaacatga aaggctgtga aaactgtgca ttttgagaat gaacatcatt agtctccagt 19621 aagttcaaaa acaaatgaag gcagccactc ataagggtct ttaatgaggc aagggggcaa 19681 aagggtggtt tctgtttgtt caaagaagcc tgtcatacat tttcagaaaa tttagaaaca 19741 cgtatcatgt catttcacgt tagtatgaag tccttataat tcatttcata ttaaatgatt 19801 tcctttggtt agaagcaaaa ttatgcataa aatgtgttcc tttgtgtttg gagcaaaatt 19861 acaagttaca ttattagtta atattctagt tcttattttt cccaatctcc aagaagcaaa 19921 atattcccct aaaccctaaa gcatcaaatt atcctatcac acagtgacca gtcatcgtaa 19981 cctaaatatt aaagcatcag attatcctgt ctatggtgac cagtcattgt aacctaaata 20041 ttattgtaat gtggattaga gttaactata ccttttcatc acactataat gtaaacactc 20101 tccaaatctt tcaaagtctt gaaaacacaa tttataaata ctgtgttctg tttgttttga 20161 gacctgatcc ggttaggaat ttcaggctgt cctcaaactc atcatcttcc tgcctcactc 20221 aggtcctaag tgctgagatt aaaggtctat gctaccacag ccatacgaat gccatgtctc 20281 catcagctta tcacttctta acttttttct tttcttcttc tacatactgc tgagtaggag 20341 catcgatgac ctcagcctag taggaatggt tcccatgtga acccttaatc tgtaggaaga 20401 tgctggactt cttccattaa gactgatctc catttgaact tgacttgtct ctctcttgtg 20461 tggagctacc atcccatata taatcttctg gtttataaac agattgcttt accctcaaga 20521 tcctttgcta gcgcagcaat gtaagtttta atacaaacag taaggtctct gattggagtg 20581 tcatggtttg gttaagtgcc ctttccaagg gcccatatag ttaagggctc aaccaccaag 20641 tgatgcttgt ggataggagg cagggcctag tggacagtct ttaggtcatg gagctatgct 20701 gttgaggggg actgtggggt cctggtcttt ttcccactcc tttttaggtc ctagctatga 20761 ggtgagtggt tttgtcctat caagcacctc tgtcctgcca tggtgtaatt gattataact 20821 acaacctctg aaactaagcc agtataacct atttatctca agatgtaact tacaggtaat 20881 ggtaagataa agctaacaaa agacaaattg ttataatcca ggcaagcctg gccccatccc 20941 ttgggggcat ggcacagagt gtgtcaccca tctgtgcatg gcaagcagta ccctgactct 21001 gtatgctgat tcaaaggtcc cttaaagcaa actcctccca cttcctctct ttttctgcca 21061 tttctctgag gagggaggcc actgtctctc tgtctctctc tgtgtctctt tttctatctt 21121 cctctccctc tcttcccttt ccccaataaa ctttccacat taagttttgt ctgaaggtat 21181 ctgtttgtct ctcacccgcc ttttaggccc cacctaccat gggatctgcc aaaggtctca 21241 cctcgagctg tattcataac acaaatgaca gacaaagatc aaccctgaag actagtagga 21301 tgtagaaggc ctggagctga cctgaagaac actgctgact tcaacattgc ccatccgtca 21361 gttatgtagc attaaagtta tagtggttcc tcagaaagca gtctcctttg aaaacttctc 21421 gttttgtgtc taaatggaat taaatacctt gttcccgaat aattgtttta gttctcttga 21481 aagatcccgt atacttacta ttaagatgta tataaacctc aagctgaaag aatgacttcc 21541 cctatggcca gatcacaaga ctctccactg atgtgcccgt tgcaacctga ttagaggaag 21601 agggtcaaag ttccccaaga ttcagctgag ttcatgcaag ttttagaaaa aaaacaagat 21661 gttcctccac agttagaaag gagtggggct ggagggatga ctcactgaga aaggttattg 21721 tcgtacaagc atgaagacct gagctcgaag cctggcaccc atgtaaaaag aaaccatgca 21781 tggtagtgtg catcttcaat cccagcattg gggagacaga gaaagagaaa gggacatccc 21841 tagagcttcc tggtcagcca gccttggcaa gccagtgaac tccaggttca gtgagagacc 21901 tgtctgggga ggaaaaaggg agggagggag ggagagagag agagacacac acacacacac 21961 acacacacag agagagagag agagagagag agagagagag agagagagag agagattgag 22021 gaagatacct gatatcaacc tcacacactc atgtacccat gtatgtaggt accttcacac 22081 acacacacac acacacacac acacacacac acacacacac acacacacac acacacacac
22141 acggatggtg ttgaattcta aggctcttat ccacacatat atggagacaa atagaagaat 22201 tacagtcgtc cctgcctttg acgctactct gtttctccaa ccctgcttcc cagatatttt 22261 tcaacatcta ctcagccttg agtggttgca ctctgacccc aggacctctt tctgtgactt 22321 ccttggcctc ctgttttgtt tttctgatgc taaaaactga atctggggcc tcatgcacac 22381 aggaagatgc tataccaatg agctacaatt ttgttgccct ttttaatttt tgagatggtc 22441 tcactaaatt gttcaggatg gcccacttgt aattctcctg ccttagcttc ccaagtagct 22501 gggcttttat acagatctgt gcttccacac ctggctgagc agacactcat gatttcattt 22561 ctgctaatca ggtagttttc ttgcccctcg ctgccatttc ctacctgcct ttccttgcca 22621 actaaactgg ttcccacaag cgacaggcta tcatttctca gctcttccac aggttagctg 22681 tgcaatttgg tatgaatcat ttagcaagcc cagttctcct ctttgtaaaa cagatgattt 22741 agatgaaatt ttttcaaagt tctctttgaa ttaaaactat cactgccttg cttgctctct 22801 gactcttgga gaccatggcc tatccctgat tagtccttgg tccacagaag gatgggtggc 22861 attggatgtg ctgaacaatc aggtactttc atgtcacttg gagtcttaca gtaactgcat 22921 gtttcaaatg aatcctttct ggctctatta gtttcttttt tgtcactgtg aaaaaaacac 22981 ctgaaagaaa caaggcacgg tttgttctga ctctcggttc agaggatata gttcaccatg 23041 gaggcaggag cttctcacag ctgtaacagc catggagtca ggtggctagt tacagtcagc 23101 tggccttagc agtcagagag ccaagagagc tcagttgagg agagtccagc caggctgtag 23161 cccttaggac ctgctcccca gagatccact ttctacagta tcttctaaac agtgtcacta 23221 gatggtgacc aggtagtcaa gcacatgagc ctgagggata atatcattca aaccatagga 23281 ttagtctaga actgaaccag atcaagaacc aggttttctt ctcacataat agataccaca 23341 catcatgttc tcatatagag tgtgatctag gtattgtttc tccaaatgga gaagccaaca 23401 ctggatgact tacatagaaa gaaagagagg gaggaaacaa gcaagggagg gggaagagtg 23461 agaattattg gaacagtacc agtgcctcaa aatccttggt ggactagaga attagcctca 23521 ggaagaagcg actaggcttc ttacagcata gacatacagt tcttaccaga ggcacagcca 23581 tcatgggtgc catggggagc atgaagttca gctccatcca gccattccta gcgatttctg 23641 gcaacctctg tcctttgaga cacttcctga agatataaga gtccagggag agacatctga 23701 ttgctttgat cccaggatct tgggatggaa ttggtgttgt ctctgctcca gctccagggt 23761 caggaaggtg aaactggaaa cacaagctag cttttcttac ttagcaaaaa cccacaggtg 23821 acataaaaga cagattgaca cgagaacagc atggcagatt tatttagtca aagttttacc 23881 agacacaagc accttcagaa aggtaaagtc agagacctta ggggaatttt cttgccagaa 23941 tttttccaga agaatcaaca gccgtgtaac aataggacta gataaacaag taagactgga 24001 cctgcagcac aaatgtgaca ataggagttg gaatccccag gactcacata aagccatggg 24061 agccgaatgt aatggtcact tgtagtttca gcctcagatg ggggtgggga ttctccagaa 24121 taagcaggct agcaagacta gccatgttgc caagctctgg gttatattga gacactctgc 24181 ctcaatgagt aagtggaaga atgatggagg ccaacttcaa ccttggactt ccacatgaac 24241 acacatacac aatgcaacca tgcatccaca gtgtatgtac acacacacac acacacacac 24301 acacacacac acacacacac acgcaaatgg acaaagaaag aggtaaaacc tacaaggaat 24361 caactgaaca gaagccaact ggtctgcctg ttcagatcct ttttggcctc tctgtgtgct 24421 tccctttctc ctgggcatgg ggcaggcagg atctgtatgg ggtgagggtc ttcagagaag 24481 cgaacagcct tcctaggttt tatggctcag tttggtggag aggggatcta gtttctctta 24541 atcatctttt taaaaattta ttaatttatt ttttatattc caatcccagt tttccctccc 24601 tcctctcttc ccctccccca cctcccatct gttccttaga gagggtaaga cctcctctag 24661 gaagtctact aagtctgccc catcatctca ttgaggcagg accaaggcac ctctccaccc 24721 ctacactctg gtgtctaggc agaacaaggt atctctccat atagaatggg ctccactaag 24781 tcagtttgtg cattagtgtt agatcttgga cccacttcca gtggcctcat atattgtccc 24841 agtcacatcg ttgtcaccta tattaaggga gtctagttcg gtcttatgca ggttccccat 24901 ttgtcagact ggagtcagtg atctctcact agctctggtc agctgattct gtggtttccc 24961 catcatgatc ttgactcctt tgttcatatt gtcactcttg cctcacttca attgtactcc 25021 aggagcttgc ccattggtta gttgtggatt tctgcatctg cttccatcta tttctggaag 25081 agggttctat cttctctggg gttgtgaatt gtagactggg tatcttttgc tttatgtctg 25141 gtatatgctt atgagtgagt acatacaaca tttgtccttc tgggtctggg ttaccccact 25201 caggatgttt tttttctagt tctgtccatt tgcctgcaaa ttttagaatg tcattgtttc 25261 ttactgctga gtagtactgc attgtgtaaa tgtaccacat tttctttatc cattcttcag 25321 ttgaggggca tctaggttgt ttccaagttc tggttattac aaataatgtt cctatgaata 25381 tagttgagca aatgtccttg tggtatgaat gtgcctcctt tgggtatatg cacaaaagtg 25441 atatttcagg gtcttgaggt aggttgattc ctaattttct gagaaatcga catactaatt 25501 tccatggagg ctgtacaagt ttgcactccc accagcaatg gaggagtgtt ctctttactc 25561 cacatcctct ccaccataag ctgtcatcag tgtttttgat cttagccttt ctgatcagct 25621 taaaatggta tctcagggtt gttttgttaa tcatcttgag aaaaaggaat tctattttct 25681 gtgactggct ctgagagaga gagaagaggg aaaggtggga ggaatgtgtg ctttcaagac 25741 cttgtgttct cccttagctc aaagtactca ccatgaaaaa ccaccagcct ttggaggagc 25801 atgctcttgc agaggcaaga tcctggcttc ctcccatctt gaatttgcca aaatagcaaa 25861 gatgtttggg tgctggacag ccaaaaatga cagctgctca cttcacagct tcctcacgta 25921 tgattacaac tccactcatc atcaagcttt aattacatca tgagcaggct tatggctgag 25981 ccgttatcct cgcatccctt cgtctcatca ctgattcaca caaatcacta ggtgctccgg 26041 ttaatgaaaa catattcatc agtacagtga ctaattcatc aggccaacat ttacatggct 26101 cctctgcatg acaaaaatga atgtttagaa tgaataatga gtcaccagag gtgggggaca 26161 tcttctgagc acaggttgcc cttgtctttc ctggtactca atcccggctg aagagctgaa 26221 caaagctgag gttatttttc ccatgacagt gcattgtggt ttagagatct gtaagcggct 26281 tatcttgatt ggcagtttga ttggttctgg gatgtactaa gagacgtgcc tcatgggcat 26341 ttccagaaag aattaactga gggggaagct cctcgccccg agaatgggta ggagcatctg 26401 gtggggtaca gatgtaaagt ggtccaaggg agaagccgca tggcctgcct gccttcactc 26461 cttgctgctg agtgtgttta tcccatctat cccgttgttg cttctgttgc agttgcaatc 26521 ctgcttctcc aggccccagc gtagactgaa cagtggctgc ccagaaattc ccaattgaag 26581 cagccgaatg gtggactgag cacctctcag tcttcagtct ctctagtttg taggcaacca 26641 ttgttggacc caactcttag tagtaagcca atctactaaa tacagaaagg ccagtgagat 26701 ggctcagtat aggtgcttac caccaagctt ggtgacccga gttcaatccc caagactcat 26761 aaggaaagaa ctaactaccg agagttgttc tctgagctcc acacatgctg aaacatgggc 26821 ctccacatgt catgaacatg ttcacacaat acatatttat ctctatatat tcatttctta 26881 taatttttag aaaatttcat tttatgtata tgagtgtttt atctgtttgt atgtctgtgt 26941 accacatgca tgcctggtgc ctgaagaagt cataagaacg tatcagattc cctctaactg 27001 gagctaaaag aagattgaga ggtacctacc atctgagtgc taggaaccaa acctgtgtct 27061 tctggaagat cagtaagcat gcttaaccac tgagccatca tgccacttat ttgtaacaca 27121 tatccatcct attggttaca gtcctgactc atacagttag atagctgagg aacctagaat 27181 tcttctgctt ttttattaca aaacaaagaa ttttatctga cttacagttc tggccttagt 27241 cagggagctg cattgggaga tggcttctct actgtcagag tccagaggtg gccgtaaagt 27301 atcatatgac atgaggcaga aagtctaact tacttgagag ttaacttgga aatgtccaaa 27361 gagacagggg gctaagtccc tcttattgaa gagaccttcc atagaagtta gcctgacaga 27421 tggccttgcc tgaactgcat tgacagtctt acttggaagg cctgttttgg ttcctaagaa 27481 attcaaggat ccaccagaga agtgtgcagc cagcaagctg gactccctat cccaagcccc 27541 agctcctcct cagggacctc agcagtcctg tgtctagctt acctcagcga tggggggaaa 27601 gatgctgttt tcctgctaag agcacactat tttatattat tgttgacaca ggttggactg 27661 catgtaacag actctccaac aacacagtga agatacaagt gtgttttgct gcatttaaat 27721 gtctccccat ctgtccctgc taagacacct actgtccttc acatgtcact gaaaactcca 27781 ccccttatga gaagtcttcc ctgatgccat ctagacaagc taagagtgct ctgctctgca 27841 ctgagcagct tctcaactct ggggttatca ttgctctgca tcacaattag cacacgtggt 27901 agtggctgtg tttgtgtttt tccacaccat gagtccagac agcatccctc tcaccagcac 27961 gccataggca caagtgctca agagtagcag gacttgaaca tgtgtggttt atcatacaga 28021 cagctgctgc tcagagacca gatcaaattc aaagcaaaat agagagatga tggttcctgc 28081 catgagcgta ctgaacaagg acaaacatca ccatcataag gaactcagct gacagggagc 28141 ggtcaccaaa cttttttttc tgtaaagtga caaaaatagt taagtatttt gccctagaca 28201 tagtgggtgg tacacatgta atctcagcat ttgtcagagt gaggcagaga gttgaatgct 28261 gggctacgta gatagtctca aaaaataaat aaataagtaa ataaataaat aaataaataa 28321 aaggaagaaa taaaaaaaag aatttgttac tcaactctgc acaatggtgc aaaagaaaca 28381 ataagcatta tgtaacctag tgggtattgg ctgtttcact ttactaacag gcattgaaat 28441 ttcaattttg caaaattttc atgttccata ttacccttat ttttattctc ccctataaat 28501 ggtgactcac caatacgcaa ctggataaga ttagggtatt tttattaggg aatatgcctt 28561 acttacagag cacctaacca gccagcagga aacatagtaa agtagcgcat gccgatgaaa 28621 caaggaaaaa gaagaactac catgtgtgac ccctaaccct taaaacctct cccacatcac 28681 cctgaccatg cccattaggc gtggtcacct agccagcccc taggaggcat ggttacggtg 28741 tccccctaca ctcccctaat catttaaaga tgcaaatgca tgcttggtga tgggctaacc 28801 ttggctcatg ggctaatctt ggctcatggg ctaaccttag ctcatgggct aataatcaag 28861 gtttactaat ctctgtcaga cagccatttt ttttttgcag agaagaatcc ccatctttgg 28921 atcatttatt tattcctttt gtatatttga tgcaatttat aaccacaaga acctactatg 28981 tgactgcact gtgccagatg gcagagaaag ctaagccccg attcttgtgg catggactca 29041 cacaactcca gtacaggact gttagtgaca atctccttaa ggcataagca tactgcagtg 29101 gcagcctctg ggttaggaga caaggataca gtttatgaca cctggtatct ggaaggcatg 29161 aaacatgtca aatgctggct acacctaaga atcagcaaca tctagtctgg ccatagccta 29221 ggatgaatgt cacagggtct taggccagaa atgtatggcc gagctgtagc agggtcctct 29281 ctagggccag aattaattcc agtgtgatgg acagccaaga ccacagggat aacaaatgag 29341 cagtgccaat gacacgtgct tctccttatt attgctgcac agtgtttgtt acacatagca 29401 ttttcgcaca gtaatataat gtgcttgggt catcttgctt catatcccat cactccctcc 29461 atctccctag tgcctcccct gttacctttg cttctcagtt ttgtttctgc tttgatgtca 29521 acagcacata caagatttta tgcaatacat cacttcctga atggctctat ttggaaatca 29581 ctaaaaggta atttatggaa catttggggt ctttttgatt ttctaattta ccaaaaaatc 29641 cacctgggga aagacaatgg agttcaagga cttctaagag gggaatgtac catggtatgc
29701 tccagccagg ggaaccagtg cttcccagga gctatggctt acaaagtggg ttatcacatg 29761 aaagcaagac taaaataatc atctcaaata ttcattagat gtgggactcc taaccatctc 29821 acaatgcctc cctcggtcta cattaaataa gaaacctcca ttttgtgctt tgcgagaaaa 29881 tgactgaaga ttatacattt ggccttgaag tggaagtatt tttgaaaatc atgaatagga 29941 aaataataaa tctctcattt caacataaaa tataagggac aaggacatct actcatgctc 30001 caaggacgga cactgaattt tccatcaggt agttgcagaa cgctgtgtcg ctcaatcaaa 30061 aattcaggat gcattgctca gagtgcatta tattaaaaga tagcatcttg gaacacagga 30121 tgctcaggaa atgggaggga cattaatctg catgcagtga tcatctcctg caaagcgggc 30181 atgagagcct gatgggagac aagccatcca gatgcccata cccaggggag ctgtactggg 30241 ctgcagccct gcgccattca gccatgcacc aggctactcc ctcctcttcc agctttctcc 30301 ttctgatggc cataggatta gaagataagg gactctagtg caggtcaact gctgaccagt 30361 gtgaaaatgc acagactaca tgctggtaga tcagcacttc aaactactgt tcaccatcat 30421 ctctggaata agcactacat ttacagggtt caaacctcaa tgaatataaa caaacaaaac 30481 acacctccct tccttcactg tctcccattt ctttggttcc catctccaca tagaatttat 30541 aattaaaatt tctaagtatc tttccagaaa tacttcacac atgttataag caaatgtgct 30601 tttaaagata ctattttaaa ttatgaaaat ggttatatta gttgagataa aagaatagaa 30661 tgggaagttc cagaatttaa ggcctcatat gaaaatataa agcgctttct cttttaagtc 30721 tagggtaggt gtactagatc agcgctcagc tccataccat gaagccatcc aggagtcaga 30781 cctctctgac agccctgcca ttgtcacaga gaagtttctg tcaccagtgc tcatgctgtc 30841 agaggagcga aggagaaaag atgtgagacc tcccaagtca aagtcatcta tggataaaac 30901 cttagttgca tggcacacca gtgttaggga gtcggggaaa cacagccata gcccagcttc 30961 ctctctgttc ttgctcttat taccaccaga aagaggttgc ttagacaacc caaaccaaga 31021 cacagggctc tgtgggaggg aatcagtccc aggcttctgg cacatgctat gtcaccggaa 31081 agccccagcc ctactccgaa tccccacaag tacagcaaat atcagattat agcatttaaa 31141 ggggcactct tgccaaagag aagcaccatt ggaatagcca tgcttgagaa ctggtcctac 31201 ttactgcaga accatggata caggctccct tttgtagatg ggcttaataa atacttctat 31261 aagtgatact ctgctttgtg aaaatgacct cgtcaatatt caaagtaatc ctctggttta 31321 ggactactat gaacctgtgg ggttcattgt tcatgtggtt aaacagcaaa gagtagttag 31381 acagttgtcc tacgtcacag agggggacat atgctatgct tggttaaata gctgtcctgg 31441 tcagagggga ggcatgctat tctgcccttt ctgacagacc ctgattgcat agacatttca 31501 gtgagataaa ggaaggaagg gaagaaggag gaaagacaac attttttgct tctgttaagg 31561 tagagactat ctgtgatcca gttcagcaca gtgcctgtga gtagaagcta caggtcaggc 31621 aggagccaag gaaatgtatt gcttttctaa ttgaacaaag gacacacagc tgccatttat 31681 tttcttcatt ttgacccttc agccctgcac tgtggatatg acatcaagaa actaagcagc 31741 cattttgtga aaatgagatc taagttagta aatgtggctg aaaaagaagc cagctgcatc 31801 ctccctggat ttacgagggg gaaatgtagg catactaaat taaaacacta aaattgaccc 31861 aaagctattt tgactgatat ttaaatatag attctgctcc tggacattcc agagttcata 31921 ggacagttgc ttctgttcag aggattcctc ttcggggttg cctctccttc cttaggcctg 31981 cttgtcctgc ccaaagctgc ccaagtgcat caggccccaa accaacttct ccatcctgac 32041 gcacagcaga ctaaatatgc aactttgtgt ctcttcatcc caggacaaaa ctttcaccca 32101 gcccctgaca tctgagactc tactacaggt tatctattaa atcttttata aagaccaaga 32161 aacaaagtgt tggcatccaa actttggtaa atcatagcct tttaataaag tcaaatggac 32221 caatgtactc taacaaaaaa atatgggtct ctcatttctg aatggcagat ttcaagccct 32281 aagaaccaca atgctcacct actgggcaac actgagttac agagacccag ctcccccacc 32341 cctcaccaag ccagagaaac actctatctg aacaatcctt ggtccatgga gcaagaatta 32401 gacatagaat ttgtatctca ttgtttttta ggaaaacccc aaaggctatt atgaagtcag 32461 tttttctggg caccttttct ttcccatgac aacgagttgt gggcagtctc agcagaatac 32521 tgaagctgtg gcttggggag acagagcata tactggattg gagttcatgg gtgggtgcat 32581 ggaatcaatg ccgggcatgg gattcaagac cttatgcatg tgggtagatg ctttgttact 32641 gggataaatc ccccacctgg gatctgactt caagcacaat ctttggaagg cggcattggc 32701 tctctgctaa tttttctagc acttttattc cacttatttt ctgcttgttt gctttgggag 32761 ttttgttcgt tataagacag tcttgctgtg tatcctaggc tgatcacaaa cctgtggcag 32821 tccttttgtc agcaggccaa aattcccact ttatctctga agacagaaag tagattgagg 32881 aatatatgat aaagacactc atcaaagcca ggcatctatc tttacttttc ttaaagcatg 32941 tttttgaatg gcataaaacc atgtagacaa ggagtcttat gttgtacatg gtcctacttt 33001 gtcacttaca atataggata ctttcaataa gcttggtagc ccttgcccta ttctacttat 33061 tctgttctct cttcctcggg tcttggggag ccttcttacc aggtggggtg gcataaaggg 33121 aaaagtcaca aagctcttcc tattcctggt tcccctccta agtgtacctt gctggtggcc 33181 ttgctagcaa atgtagtata acatctgact tatctcctct cagatatggt tgttgtactt 33241 agataaattt aatctagaaa ctcaagctgt atgtctttgg ggaccagcat tacagagctc 33301 ttcccttcct gtccttacct caccttggct actgtagtaa gttaatcctg atgattcctc 33361 catgagtcct gaaactgatt agttccaaga gctggaggat gagaagggat atagcctggt 33421 gcagggacac tttccaatga ccacaagacc ttgcacaagg tacacatgga atgtgttaga 33481 ctgtctcctt tctgtcccta gcctcagttg ccccagtgtt tatcaatgtt tattaacatt 33541 gccctagcaa aaatactaca gactaggaag cttgggtaca attgaaaaga gcttctcagg 33601 gttctggata ccgggaagtg caaaggttca gcatctggac agggctgcta ttgtagtttc 33661 aaatggttct gctgcaacac ccctttgaga gaatgaacac tgcttttcac atggtggaga 33721 gtgcacagac accaacccaa ctcctgaagg ccctttctcg agggctctaa tccatcatga 33781 gggccatact ctcaggactc attacctccc caacatcccc tctctaaata gtaccacact 33841 gcatttgcat ttcaatatat cactggagat atataaatct ccagaccaca gcataccata 33901 aatcagataa ggcaggcctg ccttctatag cctttcactc agcaaaggtg tttctagccc 33961 aaagcagtct ggactctcac tctgaaacct cttgggagtg gtggccagaa atgacttccc 34021 atcatccctc tctcctgacc tggtccagca ccaggtcacc aggaaatcct ccaagtttca 34081 ttatccccac ccccaattgt ctcttgtctc tagcaaacct cttccaatac ttccttcctt 34141 ggtgggtgta gcaagccaga tgatagcctg ccaaagaagt tcacagcctc atttctggag 34201 cctatgaata tgttacattg tgtggtaaaa ggaactttgt aggtgtgatt aaattatgaa 34261 tcttgaagtg ggcagattat ccaagtgagt ccagtgaaat tgcaaaggta catcaccaac 34321 agtgaggcag gaaggccaga gggggagaag gaagcagaga ggcagaggga ggaaaagaca 34381 agccagggga ggggagtggg gggaaagaaa ggagagagag agagagagag agagagagag 34441 agagagagag agagagagag aaatatcaca cacacacaca cacacacaca cacacacaca 34501 cacacacaca cacacctgaa cctgattgtg gaggaagaaa ccactaacca aggcattcga 34561 ggcagccttt gaaagtcaca agagacaggg aaaacagatt ctctccctcg gcccttcaga 34621 atcaacacag ccccacaact gctgatttta gtcatgttaa agccaagttg gacttctgac 34681 tgccaaaact ttagacgagc aaataaatct gcactatttt aagataccaa tgtgatttgt 34741 tcatgaaaac aatcaataag gaactaataa agtagaagtg aaaattggat cacttctgaa 34801 gtttggtaat atccacagaa actggacaca tgctgacttt gtgagccata gctccacacc 34861 caggtatgcc ccctacagaa atgtgtatat aggtgggcag gagatgtcac ctgctgtgtt 34921 catagtcgca cctttagact ttcccaagcc tgagaatagc ccaaacacct accaggagca 34981 aaataaattg agatatacag acgcagtggg atactacact tctaaaagaa tgagaaaacc 35041 acgctataca ctgtatatcg tcggaacagt aacacagggg tgacaatcag gcaataggac 35101 atattctcta tggctttaga aaacataaaa atagcataac agttctgtta gtggcaatgt 35161 gttctgtttt gtgatctgta tgatgcttcg gtttgtgcaa aagctctgga cttacctttt 35221 aaatgtatgg tggtctatac cttttaaatg tatgctagat atacatgagt aaaaatgatt 35281 aaaagagatg gaggggagga gactcatgcc ttcataaaag tttgttctgt cctttctggc 35341 actgtccaag tgaatgtgtg taaacaaaga gtgacccacc ccaggtagtc caccttctta 35401 gaacctactt ctgctacaac atgtcctgtg aatgtgcacc aaatgtttac taagggatca 35461 tgccacaggg ttttgtttaa ataaagtatg tctacctagg ggtatattga ttgtctttcc 35521 ttttgagggg gggtctcaaa actacaaact agtttgtttt gagacaagta tgtagcccag 35581 gatggccttg aactcacacc ttctgtcctg cctctttccc agcactagga tggcaggtga 35641 gactatcagc ctggccccag gaaactatct ttgattgaca ttatctggtc agaaaagatc 35701 taccttttcc tccaccaggt cctccaaata catgaagagc tgaaacagtt ctgtctaccg 35761 aatttccttt tttcttgatg tttctgtgga atttaataca taaattttaa tttgcatttt 35821 tagcttttct attaagcctt aattagagta taatgaagtt atgaatttat aaaaataaaa 35881 acaaaacggt tgctcccaca atcactcagt cttgaagtga ggttctgact ttacctgaag 35941 tgggggaaga gagtgaggaa agggacctgc ggaagctgaa tctcagaccc acaagatgga 36001 tctgagatcc atccaagcga acgtggacgc agacccggag tagggacatc caggggtcat 36061 cttcatctgt cctcgctgtg cttctgcccc tttgctcctc taccagtctc agctgtcaaa 36121 gctcagtggc ctggagggga gatggggcgg ggcttaggat cgaaggcgga gcctcggaga 36181 gcatcttctg gcccccgggg cctggactgg cccgccgccc ccacctgcag cgcggcggag 36241 cgcgggcgcg tcactcccag cggaagcgcc agcctcgcgt ctggcgaggt gcgcgcttcg 36301 cggctcccgc tccagagctt cgtggcccgc ctgtgtctgc agagcagggg cgggggcccg 36361 gcggcaccga ctgggcactg agatccaagt agccactgaa tcgtagacag tcacccagct 36421 cggacagcgc gtcggggcgg gagcagatcg ggaaggtgaa ggaccactgc ggatccgaca 36481 gcgcgtccca ggtcagtcct cccgctgcac ttggggaaac tttgggatgc ggtgacggct 36541 gcgagatgag gacactgagg gtcgcgaggc cgcgtggccc ctgtgaaccc cgcgaacccg 36601 tacctgccgc gcacctgaca ccgcagctgc cagggcgggg accgaNaccc tgctgccgcg 36661 gaccactgcg ggccaccaag ggctagcggg cttcaggggc ctctcgggag cctccggctt 36721 gcccgcgccc agccgcgcgc ctccggtcct cgcgggtccc cagctccttt tggcggctcg 36781 cgcccggacc ccgcggggct gcggattccg ccgtcttcgg gcctcgtggc gctggaggag 36841 cggcccgggg gcccatggct gcagggtggc ggccccgcgg cgggagcggc gcgtgctcgg 36901 ccggtggagc gcgcgggtcg cggggttcgg ctggagcgcg tggccgcagg tgcctgtggc 36961 cgctgggcag cggaggtgag agcgcgggct ggggacgcgg agcggattgc aacctctggc 37021 tgcaggaacc agggtcgctg ggtgagcagt cctgtccccg cggcttccgg gcgtgcacat 37081 ccctggcacc cggcatccag accccatcag ctggaggcgg gctgcagagc ggcgcctgcc 37141 cgggccgagg accagtgcct cctgctctga cacgccatct caccaacgag ggcggggtgc
37201 tagattggcg ggctgcgcgg ggaccactgg ccagggcctt ctggcacaag cccttttcgt 37261 ggacagctgc ctgctctggc ttggagtgga ggagacgaaa tgagtacccc gcccccatca 37321 gcgccccaac actgtcgccc cagtcacctt cctttgccct tctccgacag caccttggac 37381 ttgctccctc ccgaattggg gaaaatctga ggaaaccagg cagggacctt ggagataccg 37441 cagcctgcat actcaacagc ctggaaatcc agtcaccttg gtacctcgct gcttcccaga 37501 cactttggag gagcaggttt gccatttcta ccccacatcc gtaccccatc ccccgtccgt 37561 ctctgctgag gaagggactc ttatgagaga agttgggatc taggtacccc ttaaggtagc 37621 cccagagtct gtggtaacta ggctcatagg taactaaaag gcatcctagc tctgtagctt 37681 tgtgagggaa acaaacctta ccaactaatt ccttcccttt ctgaatattt cttagaagac 37741 tggagaccaa cggaagccga ctgttctggc cagtctttgc accctttgct tggctctgac 37801 tctccttcct aggcagagaa acattttgct tatgacctct ggctggcctc cttccaatcg 37861 ctgcctggcc ttggactgcc catcaggact gtgatttttt ttttttttta agacctgatt 37921 aggaaaggct gcaagcctcc ggttctagaa ggctcaaact caggggtata ctcttctctg 37981 atacccatgt gctccctaat tccactgtgg caacacctct gcccttcact cccacaagaa 38041 aattggttgt caaacctctt ggggaagatg atggaggcat ccctgtggga gcagatgcag 38101 gatttggaag caaccaggaa acaaccagga gtgaggaatc ttttttaaag gctcacatga 38161 ttctggaact aagaaaagat ggagatgcca ccagtgtatg aagcttggcc tctcctcggc 38221 ccatcccacc caactcaggg aactggcata tgcaggacct gtattgggtg atgcatattt 38281 ggaacctagt acttattgaa ttcctaagca gtaaacacat tccgaatttg aaattcctca 38341 caatcatcta ctgNaatgta gatattaaac ccccaactta tgaatgatag ccccaaaatt 38401 gttaacattg agagagccca ggttccctgc cacctcttcc acaacaggac aggaactagg 38461 acaatgaata ggaccatttg agctttaggg tcatgtgccc actttacagc tccatagcca 38521 gacaactgtt ttataagaga gggcacaaag gaaaatcact gtcctgtcca aatgaataga 38581 aagctgggga tggtggcagg acaaaggcaa caggaaaaat catctccaac aaggctttcc 38641 aagcatatca gtcttatact actgccatgt tgggtaccac acaaatcagg tatctcaaac 38701 tggacgctgc ctagggaggt ctgtcatcta aaaaggcagg gagatattga gataaaatac 38761 acagaagcta gtatttaact ccaggctggc agataatagg aatgaccttg ggagggtgtg 38821 cttacctttc cttctctctt gaacaaaatg tggactggac cagatgagca ccaaggctcc 38881 accaactcta acagaccttg tgtggtgggc ttgcctgcaa acagacttga gctaggttgc 38941 tgtgcgtggg atccattcca gactcattta caaactcgta gtcagtgaaa tgtgataaac 39001 cgaacactgt agggatttct aaacaaggaa ttaaaaaact cgactccaaa tgggagagat 39061 gcaggcaaca aatcgacagt gtttatgtgc ctctgaatag ctttgatttc cttcggtagg 39121 agctgacagc tggctgacag aaagctcacc cagggagaga agagagaaaa atcaagtatg 39181 agattaggaa taatgttttc aggtaacttt ctattcccat tcggagtggg tgtctggaag 39241 ggcgagtgta gttatggctt gaattgctcc atttatccac agatattttc ttcccaaggg 39301 ctcctgattc taagatgctg ggctttgctt ctgtctccta gtttcctggt agcagggtag 39361 agagctgggg gtcccagcat tcagcctgca tattcttcct ctatcctcac tatctgctgc 39421 ctccattatt tgtggtcttt tggatctatt tggtcagaga gtcagtcttt ggtttcttgc 39481 cctggaaact gcttgttgct acttgtggtg ggggcagcat ttggaagtcc aggtgctctg 39541 cccacaaact ttcaacccat catttgtttt tcatcccttt ctcattgcca ctttgtgtgg 39601 tgcctgggac ttctgggacc tatagttcaa gggtcatata taccaatggc tcacatgaca 39661 gcactgatca ctctgccagc tctcctctct ttgcaaaact tatttcagat ttttcatttg 39721 acaatacctt tcctccagtt gtctttattc ttggcagcat atgccttgta acctttaaaa 39781 aggaaggtaa ataatttgag aaaaaatgta ccaagtcctc agtgatacat tcttactaaa 39841 gactcccagt tttaacaagg agttgggctg gagccatggc tcaacagtta agagcactac 39901 ctgctcttcc aaaggacaca aattccattc ccagaaccca catggcccct tccaaacatt 39961 gataactctc gttccagggc acctcatgcc ctttcctggc atctgagaga accagcataa 40021 acatacatgc aggtgaacat tcatacacat aaaatgaaca ttaaaaaaga aatgaaatag 40081 agaaagggtt tacataacta tttaataact aagactgcct aataatgtag ggacccataa 40141 agaaaatcta gtaagttttt acaagattcc actcaatcag accaaacatt actgttactg 40201 acagagtaaa aagtcacttc caatagtcca agaacaactt tgtttcattt ctcaggcact 40261 gtctgttttg tggcatatgt gcatggtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 40321 tgtgtacagg tgaatgctgc tcgtgtatga gcacatgcag gtgtgtgttt gcatggtgtg 40381 tagacagagt ttctgacctg cctggtccca cagctgtttg gccacaaata aacatacaga 40441 ggcttatatt aattagaaac tgtttggcct atggcttagg cttctcactg gctatctctg 40501 tcttaattat taacccataa ctactaatct atgtatttct acgtggcgtt atcttaccgg 40561 agaatacttg gtgtcctatc ttctcagcaa ctacatggcg tcttctctct gcgtcttctc 40621 cccagaattc tcctcgtctg gttgccccgc ctatactttc tacctggcta ctggccaatc 40681 agtgttttat tcatcagcca ataagagaaa catatgtgaa gaaggacatt tccctatcaa 40741 tggtgtgtgt gtgtgtgttt gtgtgtgtgt gtgtgtgtgt gtgtgtatgt gtgtacatgg 40801 gtatgtgagc acatgtgggt atatgggtgc atgtgcacct gtgtgtgtgc atggtggcta 40861 gagttgaggt tagatgtctt ccttggctgc tctccacctt ttttttattg aagctctcac 40921 tgaacttaga gctcactgat tcagctagtc tagctacccg gcctgctctg ggggtcccct 40981 gccttcactt tccatgtggc taccatatct actttacatt tatgtgggta atggggatct 41041 gaactatggg gtcctcatgc ttgcatggca agtgctttat ggactaagac atctttctag 41101 cctttacctt tttttttttt gaaagagttt ttttttgcta actgggaact caacaccaga 41161 tagctagtct actggtcact gaggcccagg gatctactat ttctgcttct cttcccaagt 41221 gctgggacta cagactgtac caccatatcc atatttcttt tagcatgagc tctggaagtc 41281 aaactcaggt cctcacgctc acaaagtaag tgttttatct accaagccat cttcccatct 41341 ctgttgtttt aaaaggcttt gaatatggga tgtgatgaag ggaggtgaaa ttctgagata 41401 aatttcttga aaagaagaat gaatcaagta ggagaacctc ctcctggtgc tgtctttcag 41461 ttccatgtcc acacagcata aacattatga ttatcattcc acagattgta attagtcttt 41521 ctctgttttg ccagtctgct cccaaaaaat gacacagaga gacttcttat taatgatgaa 41581 agctttgcct tagcttaggc ttgtttctaa ctaactcttg taacttaaat taacccattt 41641 ctattcatct acctgctgcc acgtgattca tgacttttac ctctctctca ttctgcatat 41701 cctgcttcct ctgcttctgg ctcatgatcc cgcttttctt cctctccgag tgctctgtcc 41761 ccagaagtcc cgcctaacct cttcctgcct agcaattgcc catttggctc tttactaaac 41821 caatcacagt gacacatctt cacgcagtgt aaaggagtat tctgcaacaa caggtgatga 41881 agccaacatt ccaagaggcc agggcttgcc tagggcacat agctaactta agaaaattag 41941 gatcgcattc tacatctgtc tgactctgaa ttggatctga actgtgactt gcatggaaga 42001 cccaaagacc ctgagaaagt acaatgacaa aggggctgac tctgtccaca tggtgttagc 42061 ccaggtttcc cacaggagga aaacccatcc taggcaagag aagtggtctt catcaaacac 42121 tctatgaaaa gcaaatcaga ctcaaatgtc aggatttgtg ctttacagat cgatccggta 42181 agatgaaaga acttcctgaa agtgtgtgaa ggcctaaagt cagggctgtt catggaaggc 42241 actgactaca gaatgaggtg ccagaagcct agtcagagcc tctagggaat aaagtgtcag 42301 atgatcttct aaaaaagttg aagtttcacc agtaacagaa tggccccact attaaaatgt 42361 gagcaaactc agaagtcatt gtagcatata gaagcacaga cctatggatt gctggatgga 42421 gcccaggtat tcactccatc ctgaatagcc agctggggag ctagctcagt cagttaagta 42481 tttgctatgc aaatctgagg accagacttt ggtctcctgc atccacagaa atggtgcaca 42541 cttgtaatct cagcactggg gaagcagtca gccagatcca acagctgcct agccagcgga 42601 aacagcctta tcagaaactc atgggtcctg gtgaaagata ttatctcaaa taacaaggtg 42661 ggaagctcct gaaggacact ggaggttaac ttctggataa acataggctc gccccaccac 42721 cagtgagcat gtgcctaaat ccgtacataa caatgatgta aagatggaat tcattccagt 42781 gaaaagtaag cctcctggac tctttttttt tttttgttgc tagatattct cgagacctca 42841 ggagagaagg tttgccatca tctatataac atggtactca acttccctgt agtccacaac 42901 attcctattt ctatatgatg gagaagaggc cactgcccct cccagacatc tcagtctcaa 42961 atttgttacc agttccctct cctaataagt gcttagggtt agtgttgtag agaagggctt 43021 tacatgaagt gtgtgtgtgt gtgtgtggtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 43081 tgtgtgtgtg tgtgtgtaac ctaaaggctt tccatgtttc cacactgaaa ggttcttaag 43141 actgagaaca accagataag agtccaaatt ctagaaacca tgggaaagtg taatattgaa 43201 agtcagaaca aggcatggtg gtgctcacct tgaaacccac cacttggggc agaggcagtc 43261 agatctctgt gagttcaagg cccagcctgg tctacagact gtacatagtg agttccaggg 43321 ccagaactac atagtgagat cttgtctggc caaaaatata taagtaaata aaataaatca 43381 gtacatggta acttgttctt atttcagtgt ctgtttctca agcatgactt tggcttaagg 43441 atttttccca acttgttttt gtgattgcca ctgtatcatt tctttgtgtg aagttactaa 43501 gtggtttctg tatttgatat tatgttctga cctagtttct tttcatatta aacccatttg 43561 tatatgaaaa ctgcaaagaa gtgggttttt tgttttttgg gttttttttt gtttgtttgt 43621 ttgtttgttt tttcttggtg ttctcatgtg acctttccaa tgtttgcttc cagaatagac 43681 ctgcaagttg ggatccacac tgccatctga agtcctgcac cccaagtttc aggtatgttt 43741 tgatggcaga atagcttttc tagactgtga caataggggc ataaagccac aaagcattcg 43801 ctttcctaca ggttatgcac ccactctctg agtgattggc tgtgcatcat gaatattatc 43861 aaaatggagg cagttcagtt tggagtgctg tcttttatgc gcttattcat ggcaatgcca 43921 atggaacatt cggcaacata tactactaat catgcatggt aactgaactg tgttgtgcaa 43981 ggaagacctc atatgaccta cctttgcata tgctgacctt ttctgtgaca gactcctata 44041 atactgagag tggtactgta tggaagagtg tgtgaaaatg tattgtttaa ataacagaca 44101 gatgcctcta aatacaacac ccaagcagag aaatggagca tcactggcac tttggaggcc 44161 tctgggtaac ctttccagat cacactgttt tccttcctcc accaataacc actttccctt 44221 tggatgctac tcatagttaa catctttact tttgttgttg tcccactgat gctaagaaaa 44281 ataacttcaa ctagcaagca caacactaga tgaattaaga gtgatattga ctgtgtgtgg 44341 tgagtctcag aagactagct gcctcaggat tcatgaatgc ttacaggaac cctttagcaa 44401 ggtcaggaat gagtcttagg atccatgtgg ctcatagtct ccagcctgga catggagtag 44461 cacagtgtct gagtgcccca agggaatggg cttgttcagg ctcccctccc cgtccccagt 44521 tccaacaggt ctcagatcca ggacatcaga gctgagtgaa gagcagagct aaaaggagca 44581 ccatcggagc cctagaagca gaataggggg ggacacagca cacagagaca agaactgagg 44641 ccaggctgct gtgtgctttg ggcctaagtt gacagatgaa acatggtagg gtgaccacat 44701 ggaggatgtc tgtgcacatc catcaaactg gcaggtcccc ccagcatttt ctgggagctt
44761 ggggtcctct tttccatgat cttcagcttc tgtattctat gtgcgctgtt accatttcat 44821 cttggtagag tctatccttc tgttatttct tgagagtatg tcccaattct tgcctggagg 44881 tttggctaaa tatagaattc taagcagagg gtcatttctc cttcagatat ttaaagacac 44941 tttctgtatt gtgcctcatt gccattgttg atatacctga atctaaattg atcccttggt 45001 gcgtgactta tccccacagc caagggcccc ttcccttctg gtctgtgctc tggaagtctg 45061 caggcacatg gtatgggtag ccactgtttc attcatagtt caatgctccg ataggccctt 45121 ttgatttgat aactctatcc ctttccccca ttcccgttga tgatttcttc ttttgttccc 45181 cttttgatat agtttccttg ctgatgctgt gctaaaatat tcctaccaaa aacaacctgg 45241 ggaggagagg cttcatttgg cttacaattc cagctcacag tcattgaggg aagtcagggc 45301 aggaactcaa ggcagggagc atggaggaat tgcctgctgg cttcctctct gacttactca 45361 caggttcttg taggctagct ttctgataac atctcaggac cacctgctta gcaatagtgt 45421 ggtccacagc aggtttgaac cttctgcatc agttactaat caagacattt gcccaaagac 45481 atgcccacag gccagattga tgtaggcagt tcttaaatca agtctttttt gtcaagtgac 45541 tctagactgt caagtcgaca gttgatgcta actaggacac tattctacca cttttcttgg 45601 tagaaatatt attcggatat tggagttctt ggactagttt ttctggttct ccttttcttt 45661 cttttcctgt tatttatatt tgttttatga gatagggtct ctctgtgaag ttgtcctaga 45721 ccttctggcc ctcctgctta taattcctaa gaactgatat tacaggcagg tgccatgagc 45781 ccaacgtttt ttcttttctt ttcactgcac tctgtttgag agtctcatcg tcacagtcat 45841 tcacatcttc tattgtcttg tttttctttt taaatgtgca ttggtgtttt gcctgtatgt 45901 atgtctgtgt gagggtgtca gatcttggaa ttacagttcc aaataatatt tctaccaaga 45961 aaaagtggta gttgtatcct agttggcatc aaatgtcacc ttgacagcct tgagtcacct 46021 gagaagaaag acttgattta ggagctacca tgtggttgct ggtaattgaa cccaggacct 46081 ctggaagagc acccagtgct cttaactgct gagccatctc tctggcttcc ttctattgac 46141 ttttgcaggc ttctttcttg ttcttttgca atttcatggt ctctgactgt tcttcacaga 46201 ctcttacctc atgcttaaga tgtctcttac tccttcaagg atactgagtt tttgaagttt 46261 taattctcct gactactgtc ttttccctcc tgtttgtcat tctctgtttg ccctggcctc 46321 tgtctttcat gcaggaagac ttttcatttg cttttaggtt tttattttaa ctattggttc 46381 atgactaaag ggctagatga aaaggccagt gagaaggctg gagcatatgg gtgatacttg 46441 tcaaccggga gcctcactgt ggaatgcttc agtggcatgt gaaatcctgt ggtatttgct 46501 caggcaagtg cagctgttga atgcagacca gagcagcttc cttcgaagga gtcagatgtt 46561 gctgactgtc tttctgcagc tggtcaggaa ggtgggatag acttcagctc ttttcaaaca 46621 gtggtcacca aacaaccact tgcccagaga ctttgtgctt taccattctc agagaacaga 46681 cctctggatg gccccatggt ggaagcagcg cacctgtcta tcacaggtgc tctgaaggag 46741 ttggaagaac tacccattgt ccacatttcc cacattttca catgccagct tcactctggg 46801 atctgggtga cagtggggct gacataatgg caggggttgc agtttcagac tcagagtatg 46861 tggtaggaat gctgctgtct gagggaagac tcatctgagc agtggaggct ttgcctgttc 46921 cctggcatca tttgacctgc ccctccttag aactgggaac cccagttcta aagctccctg 46981 ctttaaagat tctgtgttgg ggtaagttct tagctttctc aggctaggtc ctctgctctt 47041 gggtttccac ggcactgttg ttttccctct ggctttgtga gtggttgtct tttgaaaaac 47101 tagttagttt ggaaaatttt gggagggagt caaataagat gtatgcattt tgccatgtaa 47161 gtcctaacca agccatctgc tgtggtattt tcctgagttt ggttctgccc ctataggcag 47221 agtctgtcat cacagataat tgcattttga acttgagcat ctcccttcct tctttgtctg 47281 cctgaaaaag tctctttata aaaaaatgta atgttaattt aaaaagtatt cattattctt 47341 gtgttgtgat acatgagtat atatatgcta tgatgcatat gtgcaggttg gaggacaact 47401 ttctgtagtt ggttctctct ttctcccttc atgtaggttc tggggatcga acccaagtca 47461 tcaagcttgc acaacagcac ctttaccttc taagccttct catcagccct ttttttattg 47521 attgattggt tgattgattg attgattgat gctagggata gagcctaggg tcttttacat 47581 gctaagaaaa tgctctacca ctgaactgca ctcctagccc aacctgctaa attcttacac 47641 tgtcttcaaa aagaagctct gatgctggat tctgcaaagt ccatttttat ccctaaattc 47701 ctaaagctgt ttaaatctcg tgagtcttac tgtacagacc agctctgtgc accatcttcc 47761 acaatctcca tgacctcctc aggatgggct ggtatctctg cagctctgcc cagtgcctac 47821 caggaactta caggtgtcac caatgaattt attggtgcat gctcacttca tcttgtccct 47881 atccactttc tgctttgact ccttctggta agagacaagt gtgttaacta cttgtgctat 47941 caccacacag aaatccatat cccataatct tagtcctttt tatttactta tttttgagac 48001 agggtcacac tctgtagctc ccacactggc cttaaacact gacctcgaac tcatggtgat 48061 tctcctgcct aaacttctca aataccatga ttacaagagt gacacaccat gctgggagtc 48121 ataatcttaa gtttaaaagt gagggactgg tcagtttact gtgctaggtt gacattgtat 48181 agaaatgaac agccatgttg gtctggaaat gttcctagtt ttcatttgta caaggatatg 48241 cagtgtgtga aatagggaga gtcttaccta tgtgggtttg atcacagcaa ttaataaaat 48301 atgctctaaa taatgaaaaa agccagtaac tagtagtgtt tctgaatcct cactaaagct 48361 ttaatacatc ataaataata tatcactgca gattatgtct acatgttata catatcacat 48421 ttatagtaca atctgatctt tgtcacctac tgtaagcaca actgaaaaac aaattttctc 48481 atagctcaat attaagtcat tattatcccc ataataagta attattatcc ccataatgaa 48541 actatctatt gagggagtca gaatctgaga tagttaaata aatttaagca tgtattttta 48601 gtgtcaatgg taaaaattaa atgttcataa agcctgtatg actcctttta aagtagtttt 48661 aattttatgt gtatacatat atgcatgttt tgccttcttg tatgtctgag taccacttgt 48721 atgtctggtg cctgaggagg ccagaacgta tcagatcccc tgaaactggt attacagttt 48781 tgagctacta tgtggctgtt gggaattgaa cctggatgct ctgaaagagc agccagtgct 48841 cttaatgact aggccatctc tccattttct taaaaaaaaa tttaaaacat ttactctaag 48901 atttactttt atgtaggtgc gtgtgtgaat gtgtatggtt tatgcattgg ggtggggagg 48961 atggattagc acagtcacag aagactagag gagggtctct actattgctt tctgtcttct 49021 acccttgaga cagggtctct cactaaacct gaaactcacc tttgcagctg gggtagctgg 49081 tcagaaagat cctggaatct gtctttctcc ctggccctaa tgcttgagtt acaggcccat 49141 gtgaccatac ctgtcgtttt actggggttc tacagagtca aacccaagtc ctcacgcttg 49201 catagccagc gattttaccg actgagacat ttatctgccc caattcataa ttcttctctg 49261 cttccattaa taatcccatc tatgtcccct tcatacatat ttctgaaata gacaaaatga 49321 atacaagtta gacatcgagt ctgattaatc ttcaacttct ttgataacca ggtattgatt 49381 tctgactttt gaagatggat gaaggcacag aagtctccac tgatggaaat tccctgatca 49441 aagctgtcca tcagagccgg cttcgcctca caagactttt gctcgaaggt ggtgcttaca 49501 tcaacgagag caatgaccgt ggcgaaacac ctttaatgat tgcttgtaag accaaacaca 49561 ttgaccagca gagcgttggt agagccaaga tggttaaata ccttctagag aacagtgctg 49621 accccaacat ccaggacaaa tctgggaaaa gcgctctgat gcacgcatgc ttggaaagag 49681 cgggcccgga agtggtttcc ttgctgctca agagtggggc tgacctcagc ttgcaggacc 49741 attctggcta ctcagctctg gtgtatgcta taaatgcaga agacagagat accctcaaag 49801 tcctccttag tgcttgccag gcgaaaggaa aagaggtcat tatcataacc acagcaaagt 49861 caccctctgg gaggcatacc acccagcagt acctcaacat gcctcccgca gacatggatg 49921 agagccatcc gccagccacg ccttcagaaa ttgacatcaa gacagcctcc ttgccactct 49981 catgttcttc agagacggac c SEQ ID NO: 3 (Chromosomal region 5,000-55,000 basepairs downstream of CHO GS gene coding sequence) 1 GGGCTCAGGC ATTTATCGTT CAGAGATTGA CTGAGCTGTA AAGATGGAAA GACAAACTTT 61 TTTTTTTTTT GATTGAGTCG GGGTTTCTCT ATGTAACAGC CCTGGCTGTC CAGGAACTCA 121 CTCTGTAGAC CAGGCTGGCC TTGAACTCAC AGAGATCTGC CTGCCCCTGC CTGTCGAATG 181 TTGGGATTAA AGGTGTGAGC CACCACCGCC CCGCTGACAA ACTAGACTTT TAGAATGTAT 241 TATGAGATAA GGTTTTGTTA TGTTGCCCAG GCTGGACTCA GATCTGTAGC AATCTATCTG 301 CTCCAGACTC CTGAGTGCTG GGATATACAG ACCTGAGTTA CCTGTACAGC TTTCTAATCA 361 TCCCCCGCTC CCCCAGAGAC AGGGTTTCTC TTTATTGTTT TGGAGCCTGT CCTGGCACTG 421 GCACTCACTC TGTAGACCAG GTTGGCCTCG AACTCACAGA GATCCACCTG TCTCTGCCTC 481 CTGAGTGCCG AGATTAAAGG TGTGCACCAC CAACACCCTA CTTTCTAATT CTTAAAGCAA 541 GGCTCCCAAC TCCTCCCTTG TGTGTAATCA ACAAGGTTCT TAGACCCTGT CTGCAGTGTG 601 GATTCCCACT AATAAGACAG TGGCGGCACA GTGCTGTGTG GCAGAGCAAG CGTCCATCTA 661 GTTCCTATTG TCATTCTATG ATTTGCTCTT CTGGGAGCCT TGTCATTCAG CAAGTTCCTG 721 GGCTTGTCTT GGGATTGCAA TGTGCCTCAG CTTGGCTAGT TCCTCTGCGG CAGAAGCAGT 781 GTTTGAACTC AGTGGGCACT CAGTCACTAC ATCTAACTTG TTTGAGGGCT CTCTGCATTT 841 GCTTTCCAAT TAAGGTTTAG GATGACTCCT CCCTGTGACT CTTATCATCC TGCCTATTAA 901 TGCTAAATTA GAGAGGCATT CAAGATAACT GCCGAAGATC TAATAAATAA ATGGGGTGGG 961 TGGGTAGGAC TATAAACCAG TTTATAGCAT GCAAGAAAGC TCTGAGCACC ACATTCAAAA 1021 ATAAAGTGCT GTGAGCCTGG TGGTGGTGGC TCACACCCTG ATCCCAGAAC TCAAGAAGTA 1081 GACAGAAGGC TCAGATTCAA GATTCAAGTT CTTCCACTAT ACAGCCAATT TGAAGTCAGC 1141 CCAGACTACA TGAGACCCTG TCTCAACTAA GCAAATGAAA GCAAACTGGG GTCCAAATAG 1201 GCACTATTCG ATGTTTTGAT GCAAGTTTGT GACTGAGGAG TGGAGGTGGC AAATGAAGAC 1261 TTTTTTCTTC CTCTTCTTCT TCCTCCTGGG TCCCGTTTTT TTTAGGGTGT TCTTAGGATA 1321 TGTATGTCTC ATTGGCACTA CTAAGAAGTG TGGGGTCTAG GGAACTTCCT GTTATGTATA 1381 CAAGCTAATC TTCAAACAAT TGTGTGGGCT GTTTTGGTAA CTACTCAAAT AATGCTATAG 1441 AAAATTGTAC AATATATTGG GGAAGGAAGG GAGTTTTACA CAGGAGTCAA CATGACTCTT 1501 GTCTCTGGAA AGCAACTTGT GATCCAATGA GGAGCTAAAT TTAGAGACAC AATTCAGGAA 1561 GAGAATCCAA TCAGAGCTTC CTTGTAAAAC AACTCACCTT CACAAACAAG TTCATTCCTA 1621 ATCGAATTTA AGGTCTAGAA ACTGCCAACC TATTAATGTT TCTATAAATA CACTTGGGGT 1681 CAACTACGTA GCCAAGGAAA TCTTTAATAA ATTGAACACA AATTGTCAGG GGAAGGTTAT 1741 TGCTGGGACT CCTGGAAGCA TGTATAAGCA GGGTAGGGGT GACATAGGGG TGGGGGGCAG 1801 TTAACTCACA GATATTAGTC TCAGATATTA ATGGCTTGTG TGTGAGCTGT CTGCCACACT 1861 TAATGTCAGT CACCTTGCCC GGAACTATTT TTCTCTCTGA TTCCAAATGT AGCTATTGGT 1921 CTATTAAATG ATTAACTTCC ACAGAAACTG ATAATATCCT TATGGAATCT GACTGTGGTA 1981 AGCCTGTACA CCCCCGCCCC AATTTCCTTC TAGATTTAGA ATTCCATTCC ATGAGCCATC 2041 ACACCCACGC TGAAAAAAGA AAACCTGTTG AATCAAATTT GTGTTTTGGA GGGTAAGAGC 2101 CACCCTTCCA ATTTATAAGG CTGTCTATTT CTTTGGGGGG GGGGAAATGA ACCAGTATCT
2161 TCTATTAGTA AAAGGAGTGT TTGAGCATGG GCACTACAAC CCACTTCTTT CAGGGAGATT 2221 CATTTTTCTC TGAGAACTCA GCCTCTCTGT GCTGGTGCCA CAGGAATTCT TAAACTCTTT 2281 CAACTCTCCA ATTAACCAGA GAGCAAACCC AGCACTTTCC ATCTATGAGA AATCTACACC 2341 ACTCATGGAA TCATTGTGTG CCCTCTCTCA CTGCCTAACA GGGGTACCCT TGCCAAAGAA 2401 AAGCAACTTA ATGCCAAAAA GGTGCATCAC CTGGCACTGC TTCCGAGGAT GGGCAATGTG 2461 CAAGCACTTT GTTCAGTGGC TCTGCCTTGG GGTCTCTTGA GGGGCGGCAG GTTACCTGGG 2521 GTGGGGGCGC ACACTCTCTG AAGGTGGGCT GCGTTCAGTT TCCTGCTTCA GGGGCTCCTT 2581 CATAGTACCG CCCCCTGATG AGTTTCTGCT CAGACTGGAA GGTGTCAGGT CCCAAAGAAA 2641 CCTGGGACAA GGCTCACTCA GTACCTGTCG CTTCTCCCAG CACGTCTCAC CCCACCCCTA 2701 CCCTAAACTT CTCTAGCCCA GAGGCTGGGC TCCCCCTTTC TCTTTCCTAC ATAACCCTGC 2761 CATTTTAGCT GTGAGCTCTC TCCGTCTTTA GCTCCTCTAC TGTTCTTTTA TCCTCTCTTT 2821 TCTCTCTCCT CTTCTTCTCT CACCCCCACC CCCACCCCCA TCTCTCCCCC CATGGTCTGG 2881 TTCAGTCTGG ACCCTTTCAG ATGCCTCTGT CTGAACTCTC CCTCATATCT CAATAAAACC 2941 CTTCTCTTCA GCCACGCCTT GGAGAGGTCA TAGGCTCATT TTCGTTCAGA AGGCCTATCA 3001 AAGAATCTGT GGGCTTATCT TTACATTCAC AATAGGCAGC TTGGCCCTGA GACCACAGTC 3061 CAGGTTAAAG TGTTACCTTG GAAAGAAAGT CTTTTATTCA AGGTGTCTGG TTTCTTTTCT 3121 TGTTTTTGTT TTTGTTTTTG GAGACAGGGT TTCTCTGTAT TATTTTGGAG GCTGTCCTGG 3181 AACTCGCTCT GTAGACCAGG CTGGCCTTGA ACTCACAGAG ATCCGCCTGC CTCTACCTCC 3241 TGAGTGCTGG GATTAAAGGC GTGAGCCACC AACGCCCGGC TCAAGTGTCT GGTTTCTTTT 3301 GATGTCTTTA GTTTCTTTAA TCCCATAATT CCTTTAATTA TACCCTCTTG TCTGTCGGAG 3361 AATGACATCA AGGATATCCA GTTCAAGGTT TCCTATGTAG TTCAGTCATA GAGTGCTTGC 3421 CCAGCTGCCA GACTCTGTCA GATGCCCAGC ACCACACACA TACAAAGCAT TTCCAGCTCT 3481 GTGTCTGTGT CAATTACTCC TGTCTGCTTC TCCATCCCCA GACACCAGGA GGGCCCACAA 3541 GAAGCTTGGA GCAGGGAAGA ATAAAGAGAC AATATCCATA GACACACAAA ACCTCCAAAG 3601 TACTTATGCA TTGAGGAATT ACAGCTTACA AATCCAGTCA CAGTATCTAT ATTCATGTTA 3661 GCCTGATTTC AATCCCCCAG CTACATATTC TTCCATGAGC TAGCTCCTTT CCTATTCAAG 3721 ACTCCCTTGA TAATAGTTGT TATCAGACTT TACCCCTATT AAAATATTTG GACCGTTTGA 3781 GAGCAATAGC TCACCTCTAT AATCTAGAAC CCAGGAAGTT AAAACAAGAT GTTTGCTGCA 3841 AGTTTGATGC CAGCCTGGGC TACATAGCAA TTTCCAGAAC ATCCTGAGCT ACAGGGCAAA 3901 ATTCTATCTT AAAAAACAAA AAGTAGACAG ATCAGGTGTT TCACCTTGTT TCAAAAAATG 3961 CAAAAAATAT TTTTTAATTG TAGAAATATA TACGCTAATT CCTTTGGTAC CCTAGGCCAA 4021 GTGACTAGAT GGGTTAGTCT TCCTTCTGGT CCTCACAGAA GAAAGTTAAG TTCTCAGCAG 4081 GAATAATAAA AAATATTAAA AAAAAAAACA AGCTGCAAAA TTCTGTTGTG GTTCTGCCAA 4141 AGTGTTCTCA GGAGTGAGGG CATACTGGGA TTTAGTCAAG CAGATATTTC TGTTTGAATA 4201 ACTAGGATCT GGGAGCCATG GGACACCACC CCCACCCATA AGGGCTACTG AAAACCACCC 4261 CTGGAAATCT GTAAATATTG CTAAGGCTCT ACCCTTTTGC TCAGAGAACA ACCACCCACA 4321 AGGATAGGGG ATAAGTTAGT TCTGTAGTAG AGTGCTTGCT TAGCACACAG AAAGTCTTTC 4381 TCTCTCTGTC TTTCTCTCTG TCTCTGTCTC TGTCTCTCTC TCTCTCTCTC TCTCACACAC 4441 ACACACACAC ACAAACAAAC ACATGAGTGC ACAAGAAACT TCTAGGTGCT ACTAAACTAA 4501 TGTAAAATCA TGCAAAGTTC ATAGAGAATT CAACAGCTAG TGACAGGATG ACCCGAACAC 4561 AAGATTCTGC CCTAGTCCTT GTATTCTGTA GTCCCCAGTT TCTCTTTACT GCCACAGTCT 4621 CCTATCTCTG ACAGCCTCCC TCTTTGCAGA TCTGGCAGTT TCTGGGCCTG GAACTGCTTT 4681 GGTAGAATGT CTGTACAGCA TGCACTAGGC ACTGGGTTTG ATCCCCAGCA CTGCATAAAT 4741 CAACTTTGAT GTCACACCTA TAATTTCAGC ACTTGGCAGG GATCGAAGCA GGAGGATCAG 4801 AGGTGAATCA AGGCCAGCCT GGGCTACTTG AAACCCTGGG GAGAGGGATA GAAGAAGGGG 4861 GAGGGGGGAG GGAGAAGAAA GGAAGGAGGG GGAGGGAAGA GGAGAGGAAG AGAGGAGGGA 4921 GAGGGAGGGA AACAGGGAGG GAGGAAGAGA AGGAGGGAGA GAGGGAGGAG GGAGGGAGAG 4981 ACTAGTGTAA GCAGAACCTG TAAGTTCTCT CCTCAGCCTC AACACACCCC AGCTCCCTGC 5041 TGTCTCCCGG TCCAGGGCTT CAGGGCCTGG CAGGACAGGC AGCAGGTTGT TTTGCTCTCA 5101 TAAAGCCATG TTACATAACT AACTAATGTT TTGAGCAGTG GAGCTGAGCC AATCTAGGTC 5161 ACATCAAGAG GGAATGGGGA AAGAGGATGA TCACGGAAGT GGTGAGAGGA AGGGAAACAA 5221 GAAGGGAGGA ATAAAAAAAA GAGGCGAGAG TGGAAATGGG GTGCGATTAT TTAATATCTG 5281 CTGCCTGTTC ATAGTTCCTG GTCCTTAGGG ACAGCATATA TTATCCTGAA AAGTCCTCTC 5341 TCTATTTTAT CTAGGCATTC TGTCATCCTA TAGCCCCCAC TCTGGATGGC TGAACTCTGT 5401 GCCAGCAGCC TGCAGGTATC ACCCCTTATT GGAGTGAGGT CTATTCCTTA TTGGAAGCAG 5461 TGGCAGGCTG GTAGGAAACA AACAGGCCTG GTGTTGTGGA ATGCTGTCCT CCCAGCATGA 5521 CCATCATTAG ACCTTATGGA AGCAGAGCGA GGGGGGCATT GTCCTCCTCC CCAGGCTCCT 5581 GCAAGCCTAC TCAGCTCAAC TGGTTCCCCG GGCCAGACTT AGGTGCAAGA GTTGCTTTGG 5641 TTTGTTATTG GTGGCCTGTG TAGCTGAGTA GACACATGCT CACCTACATG ATATATGATG 5701 GCTTGCAACC TTCTAAAAGT TCAGTTTCAG GAGATCCAGA ACCCTCTTTT GCCCTCCAAG 5761 GACACCAGAC ACCCATGTGG TACCCATACG TACATGCGGG CAAAACACTT GTGCATATAA 5821 AATAAAAAGA GATGGCTCCG TGGCTAAGAA TGCTCCCTAC CTCCAGCTCA CCCACATCTT 5881 CACAACTGAC TGTGAATCCA TCCATGGTTC TCTTCTGACC TCGGAGGGCA CCTGTGCCCA 5941 TGGGGCATAC ACATACACAT ACACAAAACA AGTATGTAAA TAAATAAATA TTTAAAATTG 6001 GGGCTGGAGA TGGCTTAGTG GTTGAGAGCA CTGGCTGATC CTCCAGAGGT CCAGAGTTCA 6061 ATTCCCAGCA CCTACATGGT GGCTCCCAAT CACCTAAAGT GGGACCTGAT GTCCTCTTCT 6121 GACATAAGGT CATACATGCA GATAGAGGAC TCAAATGCAT AAAATAAATA AATAAATCTT 6181 TAGAAAATAA GTACATAATA AATAAATATT TAAAATGACC CAAATTAAGA AAAAAATGAA 6241 GCCAGGCAGT GGTGGTACAC TCAGAAGGCA GAGGCAGGCA GATCTCTGAG TTTGAGACCA 6301 GCAGTTCCAG GACAGCCAGA GTTACACAGA GAAACTCTGT CTCAAAAAAA AAAAAGAAAA 6361 AAAAACAGAG AAAGAAGAGA GGAGAAAAAC AAGAACAAAA AATAACAAAA CAAAAACATG 6421 GCTTTCCCTT CATGGCATCT GCTTCATCTG CCTATTTGGT AATGATCAGG GCACTACACA 6481 CCCAGTGCTT CATACCCTGG CCATGTTTCT GTTCTTGGTG TCACCACCAA GTTTACTAAA 6541 GATGGTTCCA GAGTGACATT AGCAGCCCCA CACCCCAATT GCAGCTAGCA GTTGAGGAGA 6601 TTTCTGGCTT TTTGTCTAAG AGGAAGGTTC TTTGGCTAGG AGATATACTG AGAAGGACTA 6661 GGAAAAGGGG TGTCTAAGAA ACTTGGAGAG CACATTTTTC AAGTCAGAAA GAACATAGAC 6721 ATATTCTGGG GGTGGGGGTA GTAAGATAAT GGACCCTCCT AAGGGAAGGA TTGTGGGGTT 6781 TGCCTGAAGG GGCTGAAGCA GACCACTGAG CAGGCCAGAC CACCAGCAGC TTTTGAGAGG 6841 TGGGAACACT GCAGCTGAAG TCACTTGTCA CCTTCCCAGG TAGTTCTTAC TTCCAGCTCT 6901 GGCAGGGCTA GATAGCCTAG GAACTCCCAG ATAGGAGTTC TAGTTCTTCT TCTCCCAAGC 6961 TGACAGAACG TGAGCTCAGA GTCTAGGGAC ACTCCAGGTT AAGGACGGGG CCATTCTTGA 7021 TTGTCAGCAC AGATAGATTT TAATTAGAGA GCAATGACAT GACAGATAAA CAGCCCCTTA 7081 TCTAAAGGGG TACATCCCAA GACCCTGGAG GACTCTTGAA AACCCAGATA GGAGCCAGCC 7141 ACGGAAGCAT ATACCTTTAA TCCTAAGATT TGGGAGGCTG AGGTAGGAGG ATCTCTGTGA 7201 GTTTGAGGCC AGTCTTGTCT ACAAAGTGAA TTTTGGGACA GCTACACAGA GAAACCCTGT 7261 AAGAAAAAAA AAAAAAAGAA AGAAAGGAAG GAAGGAAGGA AGGAAGGAAG GAAGGAAGGG 7321 AAAGGAAGAA AAAGATAAAG GAAGAAAATC CAAATAGGAA AGAATCCCAT ATATACCATA 7381 TTTTTCTTAA ACATACATAG GTTTATTCAT TCTCTCTGTG TCTGTGTGTC TGTGTGTCTG 7441 TGTGTCTGTG TGTCTGTGTC TGTCTGTCTG TCTGTCTGTC TGTCTCTCTC TCTCTCTCTC 7501 TCTTTCTCTC CCTCTCTCTC TCTTTCTTGT CTCATAAATC TCAACACTCA GGGACCCAGA 7561 AGATATCCCA GTGGTTAAGA ATACACACTG CTCTTGCAGA CCTAAACTCA GTTCCTTGTC 7621 CCTACTTGGG GCAGCTCACA ACCACACCTG TAAGTCTAGC TCCAGGGAAT CCACACCTTC 7681 TGGCCTGTGC AGGCACCTGT GTGAAGGAGC ACATATCCTT CCCCATAATT AAAAAACAAT 7741 CATTGAAAAA TAAAACTCAA CCCCCTCCCC CGGGACTCAA ACCAGAGGTA GTCTCCCTGC 7801 CGTAGGCGCT CAAAAACTGG ACTTTCAGGT GTGAGCCTCT AGGCCAGGCT GCTTTTCTTA 7861 ACTGGCTACC GTGCTCTTGC CTGAAACTTC CAGCTTGAGA CCTCATAGTA AAAAGAACAT 7921 ACACGTCTTC TGTCTGTACT ATTTTACAGA CGGCTGACAT GTTCATACCA CGTATTTTAG 7981 CAATTTCAGC ACTTGGTATA TTTTCTGTCA TTCTCAAATA ACTTTCACCT TGCCACTTAG 8041 GGCAGTCCAA GGCTCCTCTT AGATATATCC AAATTATCAG CCACCACTTC TGCCTTTACT 8101 AAGTAAGACA GGGTACTTAA CATGGAGTAC TTAACACAAG CACTGTGATC TGAAGGTGGA 8161 GACTGCTTGC TACTCAGTCA CAGCTTAGCA TTGCTAGAAC AAATCCTGAA CAAAGGGTAA 8221 TTCATGACCC AGGCAGGGCA GAGGCGGATG GCTGTTCTTG CTCCTCAGAA ACCCCTGTGT 8281 ATAATTTCAA GCTTAGGAGT TGTTTGTCTT TGGATGGAGA GGGTCAGACC TAGGGCTTCA 8341 CTCACACTAG GCAAGCACCG CAGGTCTACC TTCGAAGAGA AGAATTTTCA CTTAGCGTTT 8401 TCAGATATAG GTCAACCTCA GCTGGCTGAA ACTTTGACTA AGTGAGCAAC TGTGAGGGTG 8461 GGGAACACAT GCATGCATTT CTTCATGTTA TAACATCTAT TTATACATAA ACATATCATA 8521 TAAATATATT CTATTGCATA TAAATATACA TAAATGCACA CTCATGTATA GATATCAATC 8581 ACATAATTTA TGCTTTTATT CATAGATTAT CTCTGGGAGG TGTACAATTA CTGACAATAC 8641 CTGCACATGA TAGTACACGT TGTTCTAGTT AGGTTTCTTT TGCTGTGACA AACACCACAA 8701 CCAAAAGCAA CTTGCAGAGG GAAGGGTTTA TTTCAGCTTA CAGTTGTATT CATTATGAAG 8761 AGTTGGGAAG TCAGGACAGG AACCTGGAGG CAGGAACTGA AGCAGAAACC ATGGAATAAT 8821 GCTGCTTACT GGTTTACCCA CCATGACTCA ACCTGCTTTC TTATATCACC AGGACTGCTT 8881 GCCCAGGGAT AGAACCACAC ATGGGGACTG TACCTCCCAC AACAATCATT GATCAAGAAA 8941 TGCCCTAGAG TCAGGGATGG TGGCAAATGC TTTTAATCCC AGCACTCGGG AGGCAGAACC 9001 AGGCCTTGAC TGTGAGGTCA AGGCCAGGCT GGTCTACAGA TTGAGTTCCA GGACAGCCAG 9061 GGCTACTCAG AGAAACCATG TCTCATGGAA AAGAAAAGGA GGAGGAGGAG AAAGGAGAAG 9121 GAAAAAGAGG AGGAGGAGGA GGAGGAGGAG GAGGAGGAGG AGGAGGAAAG AAGAAGAAGA 9181 AGAAGAAGAA GAAGAAGAAG AAGAAGAAGA AGAAGTAGAA GAAGAAGTGT CCACTGGACA 9241 ATCTGATGGT GGCGTTTCCC AATTGAAGTT CCCCTTCCAA GATAACTCCA GGATGTGTCA 9301 AGCAGACAAA AACAAGAACC AAGACACATG TTTATAATCC CAACACTGGG GAAGTGGAAT 9361 AAGAGGTTTG GCAGTTTAAG GCCATTTTCA GCTACATAGG GAGTTCCAGA CTATCCTGGC 9421 TACATGAGAC CCTGTCTCAA AACACCAAAA TGCAAGGGAA AAACAAAAAG CAAAATAATG 9481 AGTACAAATA GCAGTGACAT TCTGGGGAGA CAGCCTGGAG GGGGGGATTG CTTATTATCT 9541 CTCCCTACCG TTTGGAGTTT TTAAAATCAT GAATCTAACC CCAGAAAAAA AAGCATTGAG 9601 ATTCTGGGAC ACTCGGGTGG TAGAGAAGAT CATCTGATCC TGTCACCTTT CGGGTACGTC
9661 ACTTTATTAA TCTCTCTGAG ATTCAGTTTC ATCACCTCTG AAGTGGTTTG TGTCGACGTA 9721 CAGTCCTCAG GACTAAGTAA GGCCACTTGG TGGCTGTGCC AAAGCACTGT GTCAGGGACA 9781 CGGCAGATGT CTGACACATC TTGTTAGATT CCTTTTCTGT CCTCCGCTCC CCTACCCCAG 9841 AGGTGGGTAC AGCCCCATGG CACCTCATCT TTAATGGCTT GGGTTTCTTT TCTCCAGCCA 9901 GGAAAGTTGT CGCTTTGGTG ACAGCTATTT TAAGTCAACT GACCTTTCCT GCAAATGATC 9961 CAGATGCCTC TATCTTAGGC TGGTGATGAC GAAGATGGCC TATGACGGGG TTCCTGGGGG 10021 TGTGTTGGGA GGTGGGGCAG GGGTGGGGCC CGGCATTTGT CAGACCCATA TGATCTTCTG 10081 GCTCCCGGGC TCTGCAGATT TCTCCTGCTG GAGATGCCTA CCTGCCAGCA ATCTTGGAGA 10141 AGACAGAAAT AGCAGCTTTG GGTTCCAGGT CCCCTCCTCC CTTTGGCCCA ATGTAGCTAG 10201 AGCTTTGGTT TCCTGCTGCT GTCTTGGTGC CTGGAGCCCT CTCTGGATGG TCATGGAGTC 10261 TTGTCAGAGA AGCAACTTTG GGCTGGCAGA CAGTCATTCC AGAAGACATG ATCTGGAAAA 10321 ACTGCTTCAT CGTTTCCTTC AGAGGCACTG TCCCGAGCCC ATTTCCTTGT CTGGTTCCTG 10381 AAATCTCAGG GATGCCATCA GAAGAAGGTG TTCTTGTGTT TACTTTGGAC ATGGTTTTCT 10441 GTAGTGCAGA CTGCCCTTAA ACTCTACGTA GCTGAAAATG ACCTTGGTCT CCAGACCTCT 10501 TGATCTGTCA GCATCCCTGG GAAATCCAGG GTTCTGTAAT CCTCCCCTCT CACCTTGACT 10561 TACTGTACCA GCATCAAACA TCCTAAACAA ATCCAGTGTT TAGCCAAATA CAGCGGTGCA 10621 TGTCTGTAAT CCCAGCCACC TGGGAAGCCG AGGCAGAAGG ATTAAGGGAG CTGGAGGCCA 10681 GTCTGTGCAA TTTAGCAGGA CTGTCTCAAA ACAAAATTTA ATGGTTAGGG GTGGGCATGT 10741 CATTTATTTG ACTCTTATCA CATGAACACA CCTGTAATCT CATCACGAAA CGACAAGGCA 10801 GGAAAATCAA AAGTTCAAAG TCATCTTTGG CTACATAGCA AGTTCTAACC TGACCTAGGG 10861 TATGTAAGAC CTTGTCTCAA AAGCAAACAA ACAAACCCCA AATAACAACA ACAACAAAAC 10921 AAAAAGCAAA CAAGGAGAGG GTGTGCAGCT AGGGATATAA TTCAATGGGT GAGGGCTTAC 10981 CTCACATGCA CGAGGCCTTG GTTTCAACTT CCAGTTGAAA TGAAGTTTAG TGGTAGAGTT 11041 CTGTGCAAGG CTGTAGTTTC AGCTCTCCAT ACTGCAAACT GGAAAGAACA ACAGTGACAA 11101 ACAGAAACAA AAAACCCCCA CAAACAATGT GCTTTCTCAC TCAATAAAAC CACCTCTTTA 11161 CATACAACTA CAACTGCTAA GAAAGTTCTT CAGTGTTCTA GAGCCTGAGC ACCTCAAATG 11221 GTTTCCATAA AGCTGTATGC AAACACTGAT AAGCCACGAG AAGCAACTGT ACAAAGCACC 11281 CTTTGATTTT CATAGTTTAT CTACACAAGG ATTCTAGGAA AGTGTGCTAG GAAAATTTTA 11341 TGTATCAGCC TTGCGGGTTT GTCCAATAGT TTTAGATTTT GCCAGTGAAG ATTTTCCTTT 11401 CTTTATTTTT TACATGGGAA GGAAGTTTAA TTGGGGGAAG GGACGGGAGT GGGCTTTATT 11461 TTTATTTTTT AATGAGACTA GCATTTGCAT TGGTGGACAT TGAAGGAAAC AGTTTCCCCT 11521 CCCTAATGTG TGTGGGCCTC ACCTAACTCA TTGAAAGTCT TAGATAAAAC TAAGCTGAGT 11581 GAGTGAGTTG GCCCATACCT GTAGATGGAA GGAAAAGGGT CTTGAGTTTT GGTTTATCCT 11641 AGAGAGAACT TGATCCCCCA AACACCAAAC TTTCAAACCA AACCCCAGCC TCCTCAGTGT 11701 GAAGGGATGC TGTTACATGA CCACCTATGG ACTCAGACAA CCTCTCTTCC CTGAGTCTGC 11761 TGGCTTACTC ATCAGAGTCT GGGCTCACGA AGCCGCCACA CATATATGAG CCTCGTTCTC 11821 CCCACTCTTC TCTTGTGGCA CTGAGGTTCA AACCAAGGAC CTCGCACATG ATAGCAAATA 11881 CTGTACTGAA CCATAGAGCC AGCCCTTGTC AGTTTCTTAA CACAAACATA TAGATGTATA 11941 TGTATATGAA TATTTCCATG CTACCAATTC CATTTTCTCA GAGAACCAAA GAATACACCA 12001 AGTAGTCACA CTTGAAATTC TGTTCTGAGA TTGAATAAAA CCTGATCAAA TGTGAATTCG 12061 GTCCCTTCTC CCCCATCCCT GACGCCACCA CGTTGCTATA CAGACCAGGC ACAAACTCTT 12121 CTCCTTGTGA ATGTGTGTAA CACATGTTAC CACTGTGCTT GGCTTTTGTA GTTAGAAGGT 12181 TGGTTGATAT TTAAAAAAAA ACTTTAATAT TTAGTCATTA CTTTTTAGTA AAGATTTGCC 12241 TTGCTTTTAT TTTATTCATG TGCATGTGTG TGTATCTGTG TGAGTGTATG CCACGTGTGT 12301 TTGGGTGCCT CTGGAGATTG GAAAAGAATG TCAAAATCCC AGGACCTGGA GTTCCAGGCA 12361 GTTGTAAACT TCCCAATGTG GGTAATTATA ATGAACTTGG ATCCTCTAAA AGAGCAGAAC 12421 TCACTCTTAA CTGATGAGTT ATCCTTCTAC CCCCAAATTT ATTTGTTTTG TTTATTTGTT 12481 TATTTATTTG AGAGGGTCTC ACTGTGTAGC TCTGACAGTA TTAGAATTTA CTATGTAGAC 12541 CAGACTTGAT AAATGTCTAA CCCTAGAAAA AAATAGTTTT GTTTTGATTT TATGTCTGTG 12601 CCATCCACTC CTTGAACATA TATTTGGTAT CTGTGAAGCC AGTGAAGGCT GTTGGTTCCC 12661 TTAGGACTGG AGTTACAGAT GGCTCTGAGC TACCATGTGC ATGCTGGGAA ACAAACTCAG 12721 GTCCTTTGGA AGAGCAAAAA ATGTCCTTTG ATGGTGGTGG TTTGAATGAG AATTGCCCTA 12781 TCGAGCATAA AAACTTGGCA GCTTTGGCTA CATGGTTCTG GATTAAGAGT CAAGAAGGAT 12841 ACAAGAAAGC GGTTGTGGAA TCATCCCCCA TGGTTAAGGA AAACCACCAA AGCCAGGCTT 12901 GTGGCAGGGG AGTTCCTGCA TGGAGGCCAA GAGAAGCCAC TATGTCAAGC TGTGAAGGTG 12961 AAGCCTGGAT TGTGTTGGAG ACCCAAGCTA CTGGAGATGT AAGAGATGTG AGATAATGCC 13021 CAGGAGAGCT GCAGACAGGG CATGGAATCA GGCCAAGCGA GAGAAGTGTG TTGCAGTCAG 13081 CAGAACTGGG AGGGAAGAGT CATCTAAGTC CTTTGTCATC AGACATAGAG ATACAGGATC 13141 TGAAATTTGC TCTGCTGGGT TTTGGTCTTG ATTTGGCCCA GTACTTCCTA ACTATGTCCC 13201 CTTTTCTCCC TTTTAGAATA CTAATTTATA TTCTGTGCCA TTGCCGGTGG ATCAGGATGG 13261 TTCTCAGATA CTGTTTTAGT TCCATGCCTG TCTACTTCCC GTCATGACAG TCATGCACTA 13321 ACACTCTAAA ACTGTAAGCA AGCTCCCAAT GAAATGTTTT CATTTATAGA GGTGCCTTGA 13381 TCATGCTGTC TCTTCACAGC AATACAACAG TGATTAAGTC AGCTGCTGAG CAATCTCTCT 13441 GGCCCCAGAA GTATGCATGT GTGCAATTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT 13501 GTGTGTGNNN NNNNNNNNNN NNNNNNNNNN NAGGAAATGT CATTCTGTAA ATATGTTTAT 13561 CTTATTGGTT GATGAATAAA ACACTGTTGG CCAATAGGGC AACAAAATAG GTGGGGCCAG 13621 GATATAAGGA GGATTTTGGG AAGTGTAGGC AGAGGGGAAT TGTCATATGA TCCCAGGAAG 13681 AGACATAGAT GGGCAGAAAC TGCCTCTAGC TAACCATAGA GGTCTGGAGG TCTGTACAGA 13741 CAGGCAGGAA GTGATGTAGC TGGAAGAATC AGAATATAAG CAGGAACAAA CAGGAAATCG 13801 AGCTCTTCTT CTCTCTCCAC TTCAGAGATG CTGAACAGTT GAGATGCAGG ATGCCAGAAG 13861 AGTAAGAGGT CCCTGGACCT TTCTCCAGTA AGATAAGACC ATGTGGAAAT AGATTGATAG 13921 AAATGGGTTA GAGATTAAGT CAGAGCTAGC CAATAAGAAG CCGTAGATAT TGGCCAACCG 13981 TTTCATAATT AATATAGCAT CTGTGTATTT ATTTGGGGGA CCTGGTAGAC CAGAAAACTC 14041 GTGTTAGAGA CATCTTATCA AAGTTGAAAA AAGAAAAAAT GTGATAAAGT TAGGAAAAAA 14101 TATAGTAAAT GTTAAAAGCT AAATTCTAAA ACTACAACTT ATTTATCATT TCCTAAATGT 14161 TTAAAAATAT TATTTTATAA TGAAGATACT TAAAATTCAT TTCTCTGTCT TTTGAGACAG 14221 GGTCTCAGTG TCCTGGAACT CATTATATAC AGCAGGCTGG CTTGGAACTC ACAGAGATCC 14281 ACCTGCCTCT GTCTCCTAAA TGCTGGGATT AAAGGTGTGT GCCACCAAGC CTCAATTAAA 14341 ATGCGTTTCT TTTTCTTTCT TTCTTCCTGT CTTTCATTTT TTTGTTTGTT TAGATTTTTT 14401 TTTTTAGACA GGGTTTCTCT GTTAGCATTA GTTGTACTGG AACTCACTCT GTAGACCAGG 14461 CTGGCCATGA ACTGAGAGAT CTGCCTGCCT CTGCCTTCTG AGTGCTAGGA TTAAAGGCAT 14521 GCACCACCAC TGCCAGGCTT AAAATGTATT TCTTTTTTTA ATTTAGAAAT TTATTCTGTT 14581 TAATCCACAC GCTTTATATA GCTTTAGTTA AGAAATAAAA TAAAATGAAA CAGTGAAACC 14641 AAGAGACTAT GTCCAAGTCC AGGTCCTCCC AGCCTGCCAA TGCCAAGAGC TCTTTAGTTC 14701 TGTGTACCAA TTGGAAGAGT AAGAAAAAAA TATGGATGGG AACCACACAG TTTCATAAAA 14761 CAGATTTATG GAACTGAAGG GTCCTTGCTG AGTCTAGCAA ATTGCCTTTA CAAAAGAGAA 14821 AGAAAAAAGG GGGAGGTAGA AAAACAAAAC AAATCAACCC AAAGAGGACA AAATCCCAGA 14881 GTTCTAAATT GACTTAGGAA CCTGTCACAC TGGGACAGAA GCTTCAGCAT CCATGAGCTG 14941 TGCCTCCCCT GCTCTCTAGA GCTGGGATCT CGAGGTGTCA GCAGAGACCC CACAGGTAAC 15001 AGGAGCAAAA ACACTCACTC AGACCTTTGT GGTACTTCAA CAGTGGTCTC ACTTCTGGGC 15061 AAGCTTACAA ACCTATACAA AGTTGAAGGT GTACTTTACA TGAGTGCTAA ACTTCAAGAG 15121 GAAGGAAGAA AAAAAGGGAG GTGGAGGGGA CAGAGAGAGA GAGAAAAAAA CAAAACAAAA 15181 CAAAAACAAC CACCTCAGGA GAGGCAAGGG CATTTAAAGG AACCACAAGA ATGCCAACGA 15241 TATTAAAATG TATTTCTTAA TAGTAAATTT TATGGGAAAA GAGAGTCTCC TCTTCCTCCA 15301 AGTAGGCTAG GTAAGTACCT TGCCACTGAG CTCTATCTAT ACCCTTCAAA GTGGACAAAA 15361 TGACAAAGAT AGTTCATCTC CCCCAAAGGC CCTGTTGGGG TGCTGATTGT CACATCTGGT 15421 GAGATTTCTG TTTTTGTTTT TATTTCAAGA CAGGGCCTCT CTACATAGAT AGTCCTGGCT 15481 GCCCTGGAAC TCACTCTGTA GACCAGGCTG GCCTGGAACT CATAGACCCA CTTGCTTCTG 15541 TCTCCCAAGT GCTGGTGCTA AAGGTGTGCA CTGCCACTCT TTTTAAGTAA CTATGAGTTT 15601 CAAAACAAAT TAAAGAGCAC TGTTAAAGTG GCTTGTTGTG TAAGCCTAGC TTCAAGTCAA 15661 AGGCCCGAGG CTCCCCTACC AACCAGCTGC TATCACCTAG ACACTGTCTG TAGATCTTGC 15721 ACTGACTCAA AACTGTGGCC TAAGGTCAAA ATAATGGTCT TCCTGGATTC TGATGTGAGT 15781 GAGATTGTGT AGGAGGGCTG GCCGCTGGCC TGGCTTGAGT CACTCTCAGC TGGTTTCATC 15841 CCATTCCTGC AACTCTGTGT AAGAGGTGGA TGATCCTTGC TTAACTGATG AAGAAACCAA 15901 AGCTGTAGAA AGGATCATTT GCTTAACTCT TCACAGATGG CAAGAGGCAG AGTCAGGATT 15961 GGCAGAGTCA CTTCTGCCAA CTTCACCCTC CTGCTAACTC CACCCTCCTG CTAACTCCAC 16021 CCTCTTGCTT ATACTTGACA GTGGAGGAAA AGCCACTGAG GGAATTAAAA GTTGTTACTG 16081 GTAATGGTCA GGAAAAAAGC TGAACAAAGG AGATTAGATT CAGGGATCTT TTTCTGAAAA 16141 GAAAGAAAGA AAGGGGGACT ATAGTCTAGA AATGCTGAGA TAAAAGGGTG GATTATCATA 16201 TCTACTCTCA AACTAAAGAA GCAACTACTA GTCTCAAATA CTTTATATTG GTATGGATTT 16261 TTGTGTATTG GTACAAATTT AAGGTTATTT TTGTTATACT GTATATATGT TTTTCTTTCT 16321 TGTTTAAGGT ATTGTACCTG TATAGCTTAT TTAAAAATGC AATGTAAACA TATAGTCCTT 16381 GAAAACTATT TAAGATAATA AAGAAATACA GGTTAATAGT CATCTATAGC AATCAAACTT 16441 ATAGTCATGT TAGGTATGTT TTCAAGGGCA TACAGAAATA AATTTGAGAT AGATAGGTCA 16501 TCTTCAAACA CTCCAGAGAT CTACAGAAAA TGGCATTTAT AAAATGTTTT AATGACATAA 16561 GATTTTTCAT GATAGTGAGA AATGTCTACT CTTGGCAGCA CCAATTTACT TCAAAAATGG 16621 ACAATGGGCA TTGAAGAAAC TCCATGTGGA TTTTGCTTTC TTTGTGGCAA AAATCTAGCT 16681 ATCTGGGCAA GAAACTTCCC TTACCTTGAC TGCTGTCCTA ACTGGACAAG CAGGACATAA 16741 AAGAAATTGA CTGCTGAACT TTGCCAAGAT AGTATACATT AGTCTTTCAA AAATCCCTGC 16801 TTTACAAAAA AGTCTATCAG ATATTCTAAG CTTCTAGGCC AAAGATGGAT GCTTCAATGT 16861 TAACAGAGGA ATCTTCTGTG ACTGATGTTT CTGTCATTTC TATAGTTTTG AAAATTGCTT 16921 GCTCTGTTCT TCCCTGTTTG CTCAGGTAGT ATTATTTCCT TCTTGAGTGT CTAATGGAGT 16981 TAAAGACTAG ATAGTTATAG CTACAGTTTT CCTTGTAACC AAATTCAGAA AAGAAACTCC 17041 CAAAAGAGGT GTAAAAGTAT GAGGCTGAGA AATATAAAAA CTTAAATTTA TCTAAGAAAA 17101 TGTTTTGTTA TCTAAAAAAA AATAATTTTG GGTTAGTAAT ACAAGTTAGG ATAGAAAATG 17161 AATTAGGTAC AAAACTTTGG ACTCATCAAG AAAAAATAGA TAATGGAGTA TTTTCTCTGA
17221 ATTTGCCAAA TACAAATAGA CTGGGTATTG TAAATGTAAT TCTTACTTGA TAATTGTTCT 17281 TATTGTTTAT AGTTTATTAT GTTAGAGTCA AAACCTTTCT TTTTTATTTA GACAAAAAGG 17341 GGGAATGTAG AATATTTCTT TACACTGTGT GAAGATGTAT CACTGTGATT GGTTTAATAA 17401 AGAGCTGAAT AGCCAATAGT TAGGCAGGAA GAGGTTAGGT GAGACTTCTG GGAACAGAAG 17461 TCTCAGGGAA GGAAACAGGC TAGGTCACCA GCTAAATGAA GAGGAAATAG GACACTCAGG 17521 AGGAGAGGTA ACAGCCACAA GCCAAGTGGT GGAATATAGA TGAATGGAAA TGGGTTAATT 17581 TAAGTCATAG GAGCTAGTTA GAAACAAGCC TGAGCTAAAG CTGAGCTGTC ATAACTAAAA 17641 GTGGAGCTTT CATAATTAGT AAGTCTCTGT GTCATGATTT GGGGGCTGAC GGCCCAAAAA 17701 AGCCTGCTAC CCAAGTTCTT TTCAATTTTC AAGTTCTAGG ATTCTGGCCT TTTATTGGAA 17761 AACACTGTCA AGTTTCTATA GAGGTCTGAC TCCACAGTGT TGCCTGTGCA ATGAAATTTA 17821 TTTAATTTAT TCCGAGGCCT TGTGCACTCT GGATAATCAC TGTACCACTT AATCTATATT 17881 CCCATCCTTC ATTATAATTT AAAATGGTCT TATTAATCTG GTCACTTGGC TTTTTTTTTT 17941 TTTTTTTTCT GAGACAGGAT TTCTCTGTGT AGCCTTGGCC ATCCTAGAAC TTGCTCTGTA 18001 GACCAGCCTG GCCTGGAACT CACAGAGATC CACCTGCCTC CCCTCCAGAG TTCTGGGATT 18061 AAAGGCGTGT GCCACCACCT CCCAGTGAGT TTATGTCTTT GCAAATTATA CATGGTTTCA 18121 GTTTTTTTTT CTGTTTGTAA GTCACTTTAT TTCAAATGTA AAGTTTAAAA CAAGAAGCAA 18181 ATTACTATGA ATTTTTGTTA ACAGTCATTT TCCTTAACTA ATAAGTTTTA AATTTTCATT 18241 AATATGTTTT GATCATATTT TTTCCATGCC CCAACACCTC CAAAATCTCC CCACTCATTC 18301 AGTTCTTTCT CTATCTCAAA AAATGAAAAA TCCAAGCAAA CAACCATTAG ACAAAAAATA 18361 ACAAAACAAA ACAAAGCAAA GCAAAATAAA AGCACACGGG CTGGAGAGAT GGCTCAGAGG 18421 TTAAGAGCAC CGACTGCTCT TCCAGAGGTC CTGAGTTCAA TTCCCAGCAA CCACATGGTG 18481 GCTCACAACC ATCTGTAATG AGATCTGGTG CCCTCTTCTG GTGTACAGAT ATACATGGAA 18541 GCAGAATGTT GTATACATAA TAAATAAATA AAATCTAAAA AAAAAAAAGA AAAAAGCACA 18601 CAAAAAACCC AGAGAGTGTG TATTGAGTTG GTTAACCCCT ACTCCTCTGG AGTGTGATTG 18661 ATACAGCCAG TGCCGCTATT GGAGAACACT GATTGTCCCT GTCCTTACAG GTATCAATTG 18721 TGTGTAGCTC CTTGGTTAGG AATGGGGCTT TGTGTGCACT TCCCCTTTCA GCTTTGTAAA 18781 GGGTGTCCGA TTGAAGTTCG TATCTTCTGG GAGAGCATAA AATCAAAAAA AGATAAATGG 18841 ACTCCAGTGA AAAAGGAGCA AGCGGCACCT ATCTTTAAGG TAGAGAGGCA GAGGAGTGTG 18901 GTGTGGCCTG TCACAAACAC CCAATTCCCA ATCAGCTGGC GTCTACCAGG CTGCTTTCAC 18961 TTAGATGAAC CCTGACCTCC ATGTCTCCTT AACATTGCCA TTGTTTAACT GTTAGTGAGT 19021 CTGCCCTCTG TTCACTGAAA GACTTTCAGA AGGTGGTGTC GCCTGCCTTT AATCCTAGCA 19081 CTCGGGAGTC AGAAGCAGGT AGATAGAGCT CTGTGAGTTT GAGGCCAGGC TGGTCTGCAG 19141 AGTTCCAGGA CAGGCTACAG AGTGAAACCC AGTCTCACAA ACACCGCCTC CACCACAAAA 19201 AAAAAAGGAA ACAAGATAGA GTGAACAAAC CCAGCTACCT AGACATCTAT CTGGTAAACT 19261 GACTCATCCC AATCCTCCCT GCCCTCCCAA AGAGCTTGGC TGGCTCACTT CCCCAAATGC 19321 TCTTCCCCTT TAACATTTAA CTAGTTCTTG TCTCTTGTAT GGTTTCCTTT TAACTGTATC 19381 CACCACCCCT ACCTTGACTT TTGTCCTGGT TGGTTTTTAA TTGTAAACTT GACACACAAA 19441 GTCACCTGGG AAAAGGGAAC CTTAATTGAA GAATTGTCTT AGATTGGCCT GTGGGTGTAT 19501 TTATAGGGCA TTGTCTTGAT TGCCAATTGA TTCGGGGTGG GGAGTGGGAG GGTAGGGTGG 19561 GGGTGGGAGC AGCCCACTAT GGGACTCACT TTCCCTAGGC AGATGGCTAT ATTAGAAAGG 19621 TAGCTGAGCC TAAGCCAGCG GGTGAGCCGA GCCAGCAAGT AGCATTCTTC TATGGTTTCT 19681 TTCTTTCTTT TTCTTTTTCT TTTTCTTTTT CTTTTTCTTT TTCTTTTTCT CTTTCTTTTC 19741 TTTTCTTTTT TTTTTTTTCT TCCCGAGACA GGGTTTCTTT GTGTAGCTTT GGAGCCTATC 19801 CTGGCACTCG CTCTGGAGAC CAGGCTGGCC TCAAACTCAC AGAGATCCTC CTGCCTCTGC 19861 CTCCCGAGTG CTGGGATTAA AGGCATGCGT CACCAACGCC CAGCTCTTCT GTGGTTTCTG 19921 CTTCAGATTT CTGCTTTGAG TTCCTGTCTG ACTTCCCTCA ATAATTGTTT GTAACCTAGG 19981 AGTGTAAGAC AAATGAACCC TTTCATCCCC AAGTAGCTAT GGATTTAGAG TGGTTTATCA 20041 CAGCCACAGA GTGAAACCAG AACAACTTTC TAGTAGCCTC TTGTTCTACT CCAGCTGCTC 20101 CTCTGACTAT TCCTAAAAGG TAGTTGGGCT CAGGGAACCA CATCCCGAGA GATTCAGCCC 20161 ATATGAAAAT AGCTCCATTG TGTTGAAGAA ATGTGACCCT CCAGGATTTC AGGCATCAGG 20221 ATTCCATGTT GAAAATGAAA ACAATTATTT TCCTCTCTCT CAAGATTCCT TTAGTCACCT 20281 TCCCTTACCC CAGTTCCTGG CTTTCCTTCT AAACAAATGT TCAGGGAGGT TCAAACAAAC 20341 AGCTGTGAAG AGCAGCATCC CATACCCCCA CCTTCCGACC CAACACTTGC CAGTGCTATA 20401 AGTAGACTGG GATCATCCCT GGACACTGTG TTAAATTACC CATGACCAAC CTTCTAGCAA 20461 GCTCTCCTTT TCAGGATTTT GTTGTTTGTT TGGGTTTGTT TGTTTGTGAC TTGATCTCAT 20521 GTAAGCTGAC CTGGAATTTG CTTAATAGCC AAGGATAGAC TTACAACCTG TGATGCTCCA 20581 GCCTCTGACT CCTGAGTACC AGGGATTACA CATGTGTGGC ATCACAATGA AAGATTTTAG 20641 TTTGCTGAGA GAAAAAGTTT TTAAAGATTT TAGTTCACAG AGAGAATAAG TTTCCCACAG 20701 GCCTTGGTCC AGGACAAGGA AGTTGGTCCC AACCCGAGGG CAGACAAACA ATCCTTTTTG 20761 GGTCACACCT GGCTGGCCAA CAGACAATAA AGGACTTCTC AGGGTACATT CTATGGTTGA 20821 CCACTCTAAC ATGAGATCAT ACTTTGTAAT CAATCACTTT GTGCCCCTTG CCTGTATGCT 20881 GATCTGCGGT TTTTTACAGG CTCCTATATA AGGAGTCTGT AACCCTTGCT GGGGTGTGCA 20941 GCTTCCCCGA TATTGCTGAC ACCCGAATGA GCATTCGTTC AATAAACCCT CTTGCTTTTG 21001 CAGCTCTTGG TCTGGTTTCT GAGTCTTGGG GCCTCCTTGG GATCCTGAGA CCCTTAAGGG 21061 TCTGGGGGTC TTTCAACACT TAACTTTCCT GTTTTTAAGT AGGAAGATCT GAAATCCCAG 21121 ATTCCTGACT CCATTGCACA TTTTCTGTAT TAGAGGCTGT AGCTCTGTAT AGTGGGTTGT 21181 GTGGCTTACA CATGCTCTGA GCTGGAGATT CTAGGGACAC TTAGGGTAAA GTGGAGTGTC 21241 AGCCCCTTTC CCTGCTAGAC TGAGGCCTTT CTGTTCTTTC CTAACTGGGA GGCTGTATAG 21301 CACCCAATGT GTTCATTAAA CTCCATATGT TAGCACTGCA TGGAATCTGA CACACACACA 21361 CACACACACA CACCCTCTAC CACCACCATC ATCAGCACCA CCCCCATCAG CACCACCCTC 21421 ATCCCCCCAC CCCCCACCCT GCCCCNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNC 21481 AACTGGAGGG TAGCATTAGC ACCCAGATGC CATTAATGTG CCAAATATTT GCTTGCTTGC 21541 TTGCTTGTTT GTTCCAGCAT CCTTAGTGAA TGCTCCTGCC CTCCTGGTTA AAGATGGCTT 21601 TGGCATCTCT TGGCATCTTT CTTGTATTCT AGGCCTGAAA TAGGGATGAA TGGTGAAGGG 21661 CAAGGAGCTC AAGTGTCACT TACCACCTGC ACTTGTCCCT TTAAGGGGTT TCCCTAGAAG 21721 CAGTCTACAT TTCATTAGCC AGAGCTTTGT CACCTGGCTA CTTGTGAAGG AGGTGGTGAA 21781 GAAGCCTTAC CTTTGACTCT GCCACTTGGA GCCAAGTCAG GATTCTCTCC CTGGAAAGGA 21841 AATGGAAGAT TAATACCTTG TTGGTTGTTA GACCTAGCCC ATTATGCGCC ATGAGGAAAG 21901 AGAGACAACA GTGGGTCACT GATTGATCAG GGTTACAGGA CAAGGAGCCT TGTTTCTCCT 21961 AACAGCTCTG AGCGGAGACA GAAGTGGAGT ATATAGGCAT AAAATTCACA AACATTTGCT 22021 GCCACGTTAC AGGTACATTT TTTCACCAGT CAGAAATCAA AGATTAGGGA CTTTGCTTGT 22081 GTGTTCCATC ACTGTCAACT GACATACACG GCAAGCCTTT TAGTCCAACC AATCAGAATC 22141 ATTTGTTCCT TCTGTTGTTA GGAGCAGCCA TAATGATTCT AAAGAACTAA CAATGCATAA 22201 TGACTATTTT TGTAGTTTAG GGATGAGGTA TGTCAGCCAT TGGACAGTTC TCAGCTCCCC 22261 TAGGGCTTGG GAACTTGAAC TTTATTTCAT CCTGCATGTA ATGGAGTCTG AAGTCAAAAT 22321 GGCAGTACTT AGGTCAAGGT GCTCGTGCCT GCTGCCTTCA AGGTGGTTTC CCATTCCCAC 22381 CATACCAGAG ACTTCCTACT GCATCTCCAG TCAAGGACAC AAACACTTTT AAGTCCTGAC 22441 TGTTGATTCA ATCTATATAG TTACCAGCAT AGAGGCTAAG AGTCACACTG GCTTGCAGGG 22501 GACTTCTCTA GCATATGTGA AGCCCCGTTT GAATCCTAAA CACAAGAGTC TAAGCTTTGG 22561 AGTCAGAGAC AAGCATGTTC AAATCTGTAC GTCACCACCC TATAGACATA GACAAGTCCC 22621 TTGGGCTCAG TTTTTTCACT ACAGAGAGTA ATTGTTATTT CAGATTCCTA GGGTTGTGGT 22681 AATTAAATAG TTGAAAGATA TAGCCCATGG AACATAAAAA AAACTCAAAA CCAGGCACAG 22741 TGGCACATGT CTTTAATTTC AGCACTCAAG AGACAGAGGC AAGTGGATCT CTGTGAGTTT 22801 GAGGCCAGGC TGGTCTATAT AGAGAGTTCC AGGTCTACAC AGAGAAACAG GCTCAAAACC 22861 AAAGCAAAAG CAAAACCTCA ACTAATGTTC ATAAAATTAT GAAATTGCTG GTACCAGTGA 22921 CATGACTCAT TGGTAAAGAC ACTTGCTAGC AAGTTTAATG ATCTGAGTTT TATCTCCGGG 22981 ATCTACAATG TAGAAGAAGA AAAACAACTC TCAAGAGTTG TCCTCTGATT TCCACTTATG 23041 CAAAATAGCA TGGGAACACA CTTAAGCAGG TAGGTAGGTA GGTAGATAGA TAGATAGATA 23101 GATAGATAGA TAGATAGATA ATAGACATAA TTAAGAACGT TCAGTTGCAG CACAGTTCAT 23161 ACTGAACTGC ATTTGGACAC CTCTGTGAAA AGTCAGGAGC TCTCCTGTCC TCCTGGTGAC 23221 ATTTAAACAT TGAAGGCAAC TATTTTAACT GTCAGTTATA TACAAATCCA CTGGCCTTGT 23281 AAAATTTTAA AACATAACAG AGGAGGCTAA AGTCCTGTTT AACAACCCTC TCCTTTTACC 23341 ATCCCAGGAA GCCAAAATTG TTCACAATTT GTTCTCTTCC CTCAGGCCTT CCATATTTCA 23401 AATACCACAT AAAACACCTA TGGAAAAACA TGAGGTATTA AAAATGTCAC TTGGAAATCC 23461 TTCTTCAAAC AAGCTTGTTC TTTCTTTTTT CTTTTATGTA CAGTGAATGG AATCCAGGAC 23521 CTTTGCAGAT GCTAGGCGAG TCCTTTACCT CATTCCTCTT TCGATTTAAA ACTTTTTCTT 23581 GTTTTGTGGA GACAGGGTTT CTCTGTGTAG CCATAGATGT CCTAGAACTA GCTCTGTAGA 23641 CTAGGCTGGT CTCAAATTCA GAAGCCAGTC TGCCTCTGCC TCGGGAGCGC TAGGATTAAA 23701 GGTGTGGGCA GAGTGCTAGG ATGAAAGGTA TGCACACCAC CACTCCTGGT TGATTTTAAA 23761 AAGATGCTTT TTAAAAAAAA TGATGTGTAG GTAGTGGGGG GAGAGACGGT TTCATGCCTA 23821 AGAGCACTGA CAGCTCTTCT AGAGGACTCA GGTTCAATTC CCAGCACCCA CATGGCAGCT 23881 CATAACCATC TGTAACCCCG GTCCCAGGGA ATCCAACACC CTCTTCTGGT CTCTGTGAAT 23941 GACAGATATG CATGGGATAT ACAAACATAT ACGCAGACAA AACACTGTAT ACATTAAATA 24001 AGTACAAATT TAAAATATGT GTAGGCATGT ATGTCTGCAT GTGGGTATGT GTACACTGAA 24061 TGCAAGTTCA CTTGGAGGCC AGAGATATAT AGATCCCCTG GAGTTGCAGT TACAGATACT 24121 TGCGAGCTGC TGTGAGTGTG CTGGGAACCA AATCCTCTGG AACAGCAGCA AGTGCTCTCA 24181 CCTGCTGAGC CATTTCTTCA CCCGCTTCTT TCTACTTTTT ATTTTGAGAC AAGGTCTTAC 24241 TAAGTTATAT ATTCACTTGG GGCTTGAATT CATTTTGTCA GCAGGCAGAC CATAAACTTG 24301 CCTTCCTCTT GCCTCGGGCT CCTGAGTAGC TGAGACTTCA CCATGAGGTC TGGCTTTGAT 24361 TACATTTTTC TTTGTTTTCT TTTTGGGGGT GGGGCTGATC ATGAACTCTA AATAGCCAAG 24421 GATTGATAGT GAAGTCCAGA TTCCCCCACC TATCACCGGG TGGAATTACA GGTGTGCACT 24481 ACCACACCCA ATTTGGTTTG ATTTTTTTTT TTTTTTTCAG GACAAGCTCT CCTTTTATAG 24541 CTCTGACTGG GTTGGAATTT ACTATGTAGA CTAGGCTAGT GTCAAAATCA CAGAGATCTT 24601 CCTGTCCCTG CTTCCTGAGT ACTGGGATTA AAGGCATGTA CCACCACACC TTCGGGTGTG 24661 GTGATGCACA GCTTTAATCC CAGCACTCAG GCAGGCGAAT CTCTCTGAGT TTGAGGCTAG
24721 CCTAGTCTTC AGAGTGAGTT CCAGAACAGC CAAGGCTACA CAGAGACACT TTGTTTCGAA 24781 AAACAAACAA AAACAAAAGA GGCTAGCCTG AAACTCCTGA TTCTACCAGC ACCTCCCAAG 24841 GGCTGGGATG ACAGGTTGTG GCCCCATGCT CTCTGCCGGG GCCTCTCTTT TCTTTCTTCT 24901 GTTTGAGGTA GAGGCTTACT AGGTTGGCTG GGTGAGTTGT GAACTCACTC TGCAGCCCAC 24961 ACAGGAACTG ATCTTGTGAT CCTCCTGCCT CAGTCTCCCT AGCAGCTAGG ATTGCAGGCC 25021 TGCACCATCA GGCCCATCGT ACACTGTTTT CTGAGTTTGA AAATTGCCTC TGTTGTTGAC 25081 TATAAGGCAT GCTCTCCTCC TAACATTGTC CTTGGTGCCT CTGCCACCCT TTGGGACTAG 25141 AGAGAACAGA TCTTATTCCT ATTTCACATG CTGTGCCAAC CCAGTAACAA ACTCAGATTC 25201 CTGCTTCCGC CCCCACCACC CCCATCTAAT TGTTCAGTGT TTCTGTGAAG ATAAACACGA 25261 TCATCTTTGT GAAAGCCACT TAAGTTCCTT TCAAGGTTGG GATATAAGTT AGAGTGATAG 25321 CTTGTTCCCA GGGTGGGGAG AGCATGTGAA TTCCCCTCTC GCTCAAGTAG GCTATACTAA 25381 TTTTCATTTA GATATTTCTG AGGCAAAGTC TCATGCTGGC CATCCACCTG CCTTAGCTTC 25441 TCAAGTGCTT GGATTACAGG CATGAGCTAC AATATCTGGC TTAGTTTCAA GGTTGTGAAA 25501 ATTATACTGT GTTCTGATGA CCTGAGTTCA ATTCCCTGGA CCTGGGTGAT GGACGGAGAG 25561 GACAGACCCC TGCAGATTGT CCTTTGACCT CCCTGTCACT ATGTGAACAC TCGTGTACAC 25621 ACACACACAC ACACACACAC ACACACACTA AATGAATGTA ATAAAATATA AAAAGGTGTT 25681 CACTAGTTAA TAAGACATGA GAGAAAAAGC TTACCATCCC TAATCAATGG GGAAGCATTG 25741 AATATAAGTG ACTGTGGTCA TGGAAAGCAG TATAGAGGTT CCTCAATAAA CTGGAATATA 25801 GCAGCATATA CTTGTAAGCC TCCCACAACA GGAGAAAGGT AAAGAGGGGC GGCCACTCTG 25861 GAATATTATT AATATCCTGT TTCATAAACA AGTAAATAGA ACAAACCCCT CAACAACAAG 25921 AACCGGTGTG CTGGCACACA CCTGCAATCC CAGCATTTGG GACTTGGAGG CAGCACAATT 25981 GAAGTTCGTT CTTGGTCATC CTCAGCTATG TATGAAATCT GAAGCCTGCC TGGCCTACAG 26041 GAGACCCTGT CTCAAAAAAA TAAACTAAAT AGATTAAAAT GAAAATTAGA AGCAGGTAGT 26101 GTGGAAGTTG AATAAGAATA GCCGCCATGG GCTCATGTAT TTGAATGTTT AGTGGCACAA 26161 CTTGAGTGAG TTAGGAGGTG TGGCCTGTTG GAGTTGTGTG TCACTGGGAG TGAGCTTTGG 26221 GATTTTAGAA GCCCAAGCCA GGCCCAGGGA CTTGCTCTCT TCCTGCGATC TGAGGAACTG 26281 GATGTAGAAC GCTTAGCTAC TTCTTCAGCA CCATGTCTGC CTGCATGCTG CCATGTTCCC 26341 TGTCAAAATG ATAATGGACT GACCCTCTGA AACTTGGTCT CTTTTGGCTG AGGAGTTAGC 26401 AAGGTAAGAG GTGGCTGTGG CTTGCTCTTG TTTCTCTCTC TCTGATCTTT CATCATTTTC 26461 TCCCGTATCT GGCTGTGGGT TTTTATTATT AAGAGTAATT AGAACTCATG TTACAGTGGT 26521 ACATGCATGC CACAGACCCA GTGTGGATGC CAGAGGACAA CATGTGTAAA TTTTTTCTTT 26581 CCTTGTATGT GCGTCCAGGC TAGTTTCAGA CTTGTGGGCT TCTGCTTCAG CCTCCCAAAG 26641 GTGGGGACCA CAGGCTTATA TACCTACACT CACCTCTTTA TTCCCAGTGG ATGTGTGTGT 26701 GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTTTGTGT GTTTTACACA GACCTGTACC 26761 ACATTCATTT GGTTACTTTT TTTTCCTGCA TTTTGTTTTT AGGTAGGGTC TCACTATGTA 26821 ACCCTGACTG TCCTGGAACA TGCTATTTAG ATTAGACTGA CCTGCTGGTC CCTACCTTCC 26881 GAGTGCTGGG ATTAAAGGTG TGTACTACCA TACCTGGTGA TTAGTTTGTC TTTTGAGACT 26941 GGGTCTCTTG TAGCCCAGGT TGGTCTTGAA CTCCTGGTTT TCCAGACTCT ACCTTCCAAA 27001 TATTGATATT GCAGGTGGTC ACTACCATGT GTGGAATTTA TTTTTGAGCA GTGTTCTGTG 27061 GGTGGATGAT AAGGTCATGT CTATGGTAAA ATTGTTTCTA ATAATGATGA ATAGCTTCAT 27121 GTGTGTATGC ATCTATCAGG TTTGTTCAAC CTGAAGTGTA GGCCTAATAT TTGGATTTAT 27181 TTAGCCAGTG ATAGCTATGA ATTGAGCCCA GAAAAAATCA TAAACTTGAC TAAAACATCT 27241 TAAGAATTTT GTAACTTCTT TTGTAACTCA ACTGTATTGT TTCTGAGCAT GAATGTTGTA 27301 AATGACAATG TCAGCTGCCA TGTCAAAAGG TTGAACATTA CTTGGCAGTG GTGGCACACA 27361 CCTTTAACTC CAACACTCAG GAGGCAGAGG CAGGCAGATC TCTGAGTTAG AGGCCAGCCT 27421 GGTCCACATA GGGAGTTCCA CACCAGCTAA GGTGACAGAG TGAGACCTTG TCTAATTTTT 27481 TTTTAAGGTT GGACATGTAT AATTCCAGAG AATAATTTTT CACTAATCGG AAAAGAGGCA 27541 GTTTCAACTT GGAGTTCACA AGATTTAATC TTTCTTTGAA GATTTATTTA TTTTTAGTTA 27601 TGTGTGTGTA TATATGTATG TATGTATGTA TGTATTGGTG TGTTAAACCC CTGGGGCTGG 27661 AATTACAGGT GGTTGTGAAC CTGATGTTGT AATAAGCTCC CAGACCGTAG CACAAATGAC 27721 TCTATGAAGA AAGTACCATT CAGGCTGTAA AATCCACATA GACAGCACCA CCTGGAAAAA 27781 CTAAAACAAA AATCCAATCC ATCAAACTCC ACAGATCTGG GAAAGTATCT AAATGCACTA 27841 ACCTTGATTT TTGGCTTCTG TAGTTCTGCT TCTGGCTAAC TATTCTTGTT AACTGAAGTA 27901 TGTGAACCCA CAACATGGTT TTTGTGCTTA AAAGTTCTCT GTTCTACAGA ATGAATTCCA 27961 GGACAGCCAG AGCTGCATGG AGAAAATCTG CCTCAAAACA AAACAAACAA ATAAAAACCT 28021 TGAGAAAGGC TCAGGGCTAT ACTGGTATCC CATACACTCA GTGTAGTCGC CAACTGTCAA 28081 AGACTTTTTG TTGACTTAAA CCCATTTCTA AGCAGTATTC TCTTATGGAT ACCCCTTACA 28141 AGTGGGTGCT GGGACTTGAA CTCAGGTCCT CTGGAAAAGC AGAGGATTTC TCACCTGCTG 28201 AGCACCTCTC CAGGCCCATA AGATCTATCT TAAGACAAGA CCTGAGCAGC CTTATGGAGA 28261 TGGCAGTCTG GGGAACCACT GGTGCGCCTT TTCTTCTGCT GGTCACAAAC TGCTGTGGGA 28321 ATTTCCATCT GAAGTTCCTG CCTCTTCTCA CATTCCATGA TATGAGAAAG CTATCAATGT 28381 TCTAAATCTG TTTGCTTTCT GCTTTGCAAG ACCTTTCTCT TTCCTAGGTC ACCCTCCAAG 28441 AGTTCTTGAC CTCAGCCCCG ACTGGTGTCT TGGGATGGGT GACTGGGTTC TGGGGGCTTC 28501 CCTGTGCCTT GGAATATGGT AAAAGAGCAT CTCAGGTATT CACTCAGTAG ATGCTAGTAG 28561 CACTCCCTCC CTCCATTTCT GTCTACAGAT GTTGCTAGCT GGCCCCTATG AGGTAGTCTT 28621 TGCCCCTTTG TTATTGCTGC AGACTCAGAA AAAAGAGGAA ATATAGAACT CCTCGTGGTC 28681 TTCTACTCAA TATCCAAGCA AGGGGGAACA ACTGAGCATC CATACACTGC TGTTTTGGCT 28741 TCTCAATTGC TTGCTTGTAC ATCACCAAGA AGCTTTCATT GGTCAGTGTA AACAAGATCT 28801 GGGAGTTGAT GGTAGAGCAG TTGGATGAGT GACTCTGTCT TTCACCTTTG TTGAGTCATT 28861 TGGTGTGTGC ACATTGTGGG TCCCTGCCTC GCTTCCCATT AAATGTCAAG GTGAACTTTA 28921 TGAGGTTGAA ACTTTTATAT GTAGTGCAAC TGTACTCCTT CCTCTCTATC TCTTCCTTCA 28981 TTTTTCTTCC TTCACCTTCT CTTCCTTTAA AAAAAGAAAA ACTTTAAAAA ATGTGAATCT 29041 GATGTATCCC AGGATGGCCT CAAACTGTTT GCTTTCTCAG AAGATGACCT TGAACTTTCA 29101 ATCCTCCTGC CTCCACCTCC CAAATGCTGG GCTTACAGGA ATTCATCACC ATGCCTGGTT 29161 TTCCTCTCTC CTGGTGAGTG AATCCAGGGC TTCATGCTTG CCAGGCAAGT GTTCTGCTGA 29221 CTGAGTTACA TGCTTAGCCT GTATCCACAT CTTGACTGAG TAATTTCTGC ACCAAAACTT 29281 TAGGTTTCAT CTCAGTGACT CTGCCAATGT GTTTCCATTT TAGAGTGACG ACTGGCCTTA 29341 GAGGAGAGTG TAAGAGAAAT AGAGTCTCTT TCCTTGGTCT GCTTTTTAAA TTTTAATTTC 29401 TTTTTAGACA TCTTATATTT ATTCATGCAT GTGTGTGTAT AACTAGCAGA ACTCAGCTGT 29461 CTCTTTCTAC CACTCAGGTC ACCAGGCTTG GTGGCAGGGA CTCTTACCTG CCTTCGAGCA 29521 GGCTCTGCCC TCCTTTTGGA GAAACTGGTT TGCAGAAGGA AGAGACAGCA CAGCTCAGAA 29581 GACAGCCGTG CTTTCAGATG CCTGAGAATC CTGCCAAGGA CACTGCTGCA TTCTCCTATT 29641 CTTTTGTAAG GGTCCCATCT CTGCTGAGCT AAACTGGGCT TTCTCAGCCC TTCTCCTCTG 29701 ACAGTATTTT AAAACCCTAC CTAAAGGGGG ATGGAGAGAT GGCTCAGCAA TTAGGAGCAT 29761 ATCCTACTCT TCCGGAGACC CCTACTTCTG TTCCCAGCAC CAATGCTGGT CAATTTACAA 29821 CTGTAACTCT GCTCCAGGTC ATCGGATGCT GCTATCCTCC TCAGGCAACT TCACTCATGT 29881 GCACATACAC ATACTTAAAA ACAAAATAAG TCTTTAAAAA TCACCTAAGA AATATAAAGG 29941 CACATATCAT AATTCAGCCT GCTGTGACGT ATAGCTATAG TCCCAGAATT CTGAAGGCAG 30001 AGGCAAGAGG ATCACCTCAA GCTTGGGGCC AGCGTGGTCT ACAGTGAGAC CCTGGAGACT 30061 TTAATCTCAA AATATGTAAC AAAACAAATA TGTAAATAGA CATATATCAC AATTTATATT 30121 TAAGTAAAAT GGGGGGCATT GGAGAGATAG CTTTGTGGTT AAGAGCATGT ACTGTTCTTG 30181 TCAAGGACCC AAGTTTGATT CCCAGTGTCT ACACTGGTTG GTCTCCAACC CAATTCCAAG 30241 AGATCTGCTG CCTTCTTCTC CTCTCTACTG GAACTGCATT CATGTGCAAA TGTCCATATG 30301 CACACACATA CCCACATGCA TACACACAAA CACATACATA CTCATTTTGC CTGACATCGT 30361 GGTAAAGTGG GAAGACTTGT TGCCCTATTA CTTGGTCTTC ATTTGCCTAT GAGCACCATG 30421 TTGGCATGAA CTCATTCATT AATATCTTTC CTGTACAACT CCCCAATAAC CAAGATGACA 30481 CTTGGCACAC ATTAATTGCT AAGTATAATG AAAATTTAGT TTAAATTAGC TAAATAATTT 30541 AAAGTTCCCC CTCAAGCCTC ATGCCTGATT TAAAGTAGTA CTTATTAATG CTGGGCCTGG 30601 TGGCATACAT TTCTAATTCT AACACTTAGG AGGCTGAGGC AGGAGGATGG CCAATTCAAG 30661 GCCAGCTTAG CCAGCTTAGT AAGACCTTGT CTCCAAGCAA ATTACAGCAA AGTCTGAGAT 30721 ATAGTTCAGT AATTAGGGTG TTTGTCTACC ATGTGTGAAG ACCTGAGTTC AGTTTCTAAC 30781 AACAAAACAA AACTAAACAA ACCAGAACCT AGAGGTTATC ATTTATTTTT TTATTTTTAT 30841 TTTTTTTTGG AGTTTATGCC TTTGGATTAT CCATTCTATG TCCAGACATC AGTACTGCCA 30901 TGTTACAGTC AATAAAAGTC TTCCTTCATC ACCCTTAATC TTATCACCAC TAAAGTCTCT 30961 ACTTGACAGA CATGCCATAC ATAATTATAG CTGTTACCTT CTATCATAAA GTAGACATTT 31021 TATTTTATTT GTGTATTCAT TTTCATTTAT TTTGTTGTTG TTGTTGTTTT ATGAGACAGA 31081 GTTTCTCTGT GCAGCCCTGG TTATCCTGGA ACTCACTCTG CAGACCAGGC TGGCTTCAAA 31141 CACACAGAGA TCCACCTGCC TCTGCCTCCT GAGTGCTAAG ATTAAAGGAG TGTGCTGCCA 31201 TCTTCCCAGC AACATTCTAA ATTATTTTTT GTTTATGTTT TGAAATGGTC TAATGTAGCT 31261 GAGGTGGGCC TCAAGCTTGT TATATAGCTG GGGAACCTTG AACTTGTGTT CTTCCTACCT 31321 CTAGAACTCT GGAGTGCTGG AATTACAGGT ATGAACCATC ACATTCCAGT TTTAATCAAA 31381 TCCAGACTTC ATGGGTACTA GGAAAGCACT CTACAAATTA AACTTCACCC CTAGTTCATA 31441 TATATATATG TGTGTGTGTG TCCATGTATG TATGCCTACA TGATTTTATG TGTGCCACAT 31501 GTGTGCAGGT GCTCTTGGAG GTCAGAGGGT GTCAAATCCC CTGGCACCTG AGTTATAGGT 31561 GGTTGTGAGC CACCTGATGT GGATTCTGGG AACTGAACTT TGGTCCTCTG CAGGAGAAGT 31621 CACTGTTCCT CTGAGTGAAC GTTTCTACTT TTTAATATAC TTCCCATTCG AATTAGAAAG 31681 TAGAAGCTCT CGGAGGTTGA GACCTTACCT AAAGTCACCC AACTAGTAAG AAAACTAAAA 31741 TATCAACTTG GTTTTCTGAG TTTTAAATAT TTTTTCCCAA TGTGTAATTA CACAGGAGAA 31801 TTAATGGGGA CACTTCAAGG TAAAACAGAA GCTTTAGACA TAGCAAGGCA TGGTGGCACA 31861 CATCCCATTG AGAGGCAGGA GGATCAGGAG GCCAGCTTTG GCTGCATACT TAAGAGGCAT 31921 CCAGGGCTAC ATGAGGCGCT ACCTAAAAAA ATTAAATTAG GCAGGGCGTT GGTGGCGCAC 31981 GCCTTTAATC CCAGCACTCG GGAGGCAGAG GCAGGCGGAT CTCTGTGAGT TCAAGGCTAG 32041 CCTGGTCTTC AGAGCGAGTG CCAGGATAGG CTCCAAAGCT ACACAGAGAA ACCCTGTCTT 32101 GAAAAACCAA AAAAGCACTG GTCATTGTCA TTTTCTTTCC TAACAGGGCA CTGGAACCCT 32161 GATGTTGGTT GGCTCCTAGA TTTCTTCTCC ACAGCAGAGA GTTCTTGCCC TGTTAGAGCC 32221 AGAAGGATGC TCTGGAGAGT CAGTATATAG CAAAGCAGGG TCATCTGGAG TAGTAAAAAC
32281 CCTCTGGCAC AGTCAGACCT CATTTCCTCT TGTCCTGTGC TCGTGGCTCT AGCATTATGC 32341 AAGGAGAGGC GCAAACAGCA AACAATTTGG AAGGGCTAGC ACTTGAGCAA CTCTTTGTAG 32401 CTTCCTCTTC TCTACTCTTT TGCCCCTGGC TTCTACTGGA ACAGGTGACT TTCCATTGCA 32461 TTGCATTCTC CAAACTCAGA TGATTTTGAG AATGTGGCAC TACTAAAAGT CACATGGACA 32521 TACAAGGTAC AACTAGAACT ATCCCGGGAA ACAGTGATAC ACGATCTAGT TTGAGGCCTT 32581 GAGCCATAGC TTGTCAGAAG CTCAGAAATG ATTGAGTCTC TGGGAGCCCT CACCTCAGCA 32641 TCCCTGCTTG CAAAAGGCTT CTTGAAGTAG TAAAAACTGC TGGGACCTTG TCTAGGCTGG 32701 GTAACCTTGC ATAATTACTC AACCTTACTG AGCTCAGTCC CCTCCTCTAT AAAATAAGTG 32761 CAACAGTATT TACCTTAGTG GCCCACCTGA AAACATCACA GCTGCCATAG CTAGCTCTTG 32821 GCTTTTGTTC TATCTCCTCC TCCCCCTACT TTCTCTTCCC TCCCTCCCTC CCTCCCTCAT 32881 TTTTCTTTAT TCCTTTCTTT GTATTTTTTT CTTTTTTCTT CCTCACACCT CTCCTTATTC 32941 CCCACCCTCC TCTCTCTCTC TCCCTTCCCA CTTCTCTTTC TTTCATGGCA GGATATCATG 33001 TATCCTAGCT ATACTTGAAT TCACTATATA GCTGAAGAGG AGCTTCCAGC CCTTTTGCCT 33061 CTGCCTCCCA AGTGCTGAGA TTATAGGTGT CCACCTCCAC GTCTACTTAT GCTTTGCTAA 33121 GGATCAAACC AGGGCTTTGT ATGTGCATGC TAGGCAAGAG CCAACTACAT CGCCAGACCT 33181 ATATAATACC CCTTTCTCAG CGAAACTGGG GTTGCTGATG GCTGGTGTTG GGGGAAGGCA 33241 CTAAATATTT AGCAGAAGTA TAGGAAAACT CTAGAAGTCT AGAGATCCTC AAAGTAAGTT 33301 TGGAGAGCCT TGGCCTTTTC TTAGTTGAAA GTCATGGTGC CTACTCACTT TGACTGCTCA 33361 AGGAATATCC ATTCACCACC TGGAAATAAG AAAGGAGGGA GAACCAGCTA GGGATGTGAC 33421 TTAGTAGTAG AGCACTTGTC TAGCATGAGC GTGGTCCTGG GTTCAAGCTC CAGTACAAAG 33481 GCTGGGTGGG GGGGTGGAGA AAGGCTTCTT TCCCATGGCG TTCTAGAGAT GGCGGGGAGA 33541 AACCACCAAT CCACATCTAT CTACAACAGT TCAAGTAGAA CTAATCTTGG TGGTATGGCT 33601 ATAGTAGTCC TAATCCCATC TCAGGGATGC TTCTCTTTGC AATTGATACA AAACACATTA 33661 CAGAAAACCA CAGTGAATCA AAATGCAGAG TTGTGGTGCC TAGTTCCAAT GGATGCATCT 33721 ACAGTACAAC TCCCATGCCT AAGGCTCAGG GATCATTGTG GAAGACAAAG ATCCTCCCAG 33781 GAGATCAGGG AGTTTGCTGT CTCCTAGGAA TTTCAGAAAA TACATCTGTA AAGGCTCACC 33841 AACGTGAATT CCTAAACATG AGCTGAACAA GGATGACAAT AGACATGCTA ACAAGGATGG 33901 GAAAAAGCCC TTGAAGCCTC AGACCTACAC AAAGAGCCGC AGTTGATTAA GGAATGCTGA 33961 TTGTGGGAGA AACCATCTTC CCAAATTGTT ATCTAATACC ACATAGTCAG CCCTGAAAAC 34021 ACACATGCAA ATAAGATTAT ACAAAACAAG GGGGTTGTAC ATATGTATTT AGGAATATAT 34081 ATATATATAT ATATATATAT ATATATATAT GTAACAATAA TTAATAGAAA AAGAGACCAT 34141 GAATTTGAAA AAGAACAAGG AGGGGTACAT GGAAGGGTTT AGGATGCTTT GACCCTTTAA 34201 TATAGTTTCT TGTGTTGTGG TGACCCCAAT CATAAAATTA TTTTTGTTGC TAGTTCACAA 34261 CTGTAATTTT GCTGCTGTTA TGAATTGTAA AGTAAATACC TATGGTTTTT GATGATCTTA 34321 GGCAATCCCT GTTAAACTGT CATTCAGTCC CCAAAGGGGT CAAGACCCAC AGGTTGAGAA 34381 CTGCTGATTT AGAGAGAGGA AAGGGAAGGG GGGGTGAAAT GCTGTAATTA TAATTCCAAA 34441 AAAAAATTTT TAAAAATTTC TTAAAGGAAC TGAAGAAAAG AGCTGAACAT TCTAAGCTTA 34501 AGGGGGGAAA GGTTCTGGAA TGTTACATTT TTCTGGTTTC CTTAGTCTCA GCAACAGGCT 34561 CCCAGCCTTC TGTTTGGACA GTGGTTTACA GGCATGTGAG CTCAGGGAAC ACTCTTCCAA 34621 GTGAATCAGA CTTCAGGAGA AGACATTCAG TTCAGGGCCC TGGGGAAAGT AAGGACAGAA 34681 CTCCATTCCT GAGAATTACC AGGTTTGCTC AGAAGATAAA ACTGGTGAGC CCAATGGCTG 34741 TGTGCACAAC CCTGACCTCA GTGTCTAGGA TAGCTGGACT CTAGCTGCTA GAAGATAGTC 34801 AGAGGGCCAT CCTTTCCCTG AGGCTAATCT GTGAATCAAG TAAACTACAG TCAGGAAGGG 34861 AGCTGGAGAT GGGGGCCCAG CAAACAGGTC CCCCTTAAAG CCCAGCACAT AGGTGGGGAA 34921 CCCAACCTCC CATTTTGTCT TCACCCCACC ACCAGGCCTT TACCAAGGCC CGAGGTTGCC 34981 ACTATTTTCA GCTTGCCAGG CTCTTTGCAG TTTTAGGGGG ATGAGGAGGA GATGCTCTGA 35041 GGTGCTGGGA GGCACATGGC GGGTGCTATT TATGGCTTGG GCTGAACTCC GATGTCCTAG 35101 AAAGAGTGTT TCTGACACTT TCTGCCTTCT GGGAATCAGG AGACTCATGA CAAACACTGC 35161 CTGGCAGTGT TTCTTTCTTG TTCACAGCAA GAAGTGTGCA GTCCATGGCA CGAAAGAGGC 35221 CTGAGCAGGG CAAGATGGAC ACGATGACAT CACTGAAGGA GCTTCCCAGG GGCTGTCTTG 35281 ACTGCTTCAT TAACTCATTC ATGCAGTTTA TTCAGCAGCT ATGCCTGTCA GACCCCATTC 35341 TGTCTGCACA AGACACATGG CAACAAAGGA GACTTACTAT TCCCATCTTC ATGGGTTTTA 35401 TGTTCTGGCA AGAGGAAGAT AGTAATAATT TTTAAAAAGT AACCAGTCTT GAGAGCATGA 35461 TAAATATGGT TGATAACAAT ATGCTATATT TTAAAAGTTG TGAGATAGTA TACTTTAAGT 35521 GTTCTCAAAA CAAAATGATG AATATGGGTG ATATAACATG TTAATTGGTT TAATTTAGCC 35581 ATGCCTTTGT GAACATACTG TATCGTGTAT CATAATTGTG CATGACTTTA TTTATGAGCT 35641 AAATAAATGA ATGGAAAAAA AAGTAACCAG TCTTGATGCT TACCTGCCAT CCTGGAAGGA 35701 AATGGAAATA GGATCTGCCG CCGCAGCATT GCCCTATGCT CTTATTTCTT CTCTTGAAGA 35761 GGTAGGGGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTTACTAGA GACTGAGCTA 35821 CCGGCCTCAC ACATTCTAGG CAAATGCTCT ACTTTATATT AAACACTTTA TAAAACATTA 35881 AGCCTTTCAG GGTCAGCAAG GTAGCTCAGA GAGTCCGGGC ATTTGCTACC AAGCCTGACA 35941 ACCTGAGTTC GATTGATGAT CCCCCAGACT CACGTGATAG GAGGAAGCTG ACACCTGTGG 36001 GTTGTCCTCT GACTATAGGC ATGCACACAC ATCCCATGAA TAAATATTTA TACATTTTCA 36061 AATCATACTT ATTTTACAAT GATTTTTATT TGTTTGCCTG TCTTTCTGTC TGTGTAGAGA 36121 CAAGGTTTCA TGCAGCTCAG GTTGGCCTCA AACTCACTCT GTGGCAAGGA TGCCTTAACT 36181 TCAGGTCTTC CAGGTCCAGG TAACAAAATG TTCAGGAGGA ACCTGGTACC TCATCATAAC 36241 CGGTTCTAGA TGGTCTTCCC AGGGCTGCTG TAAGAAAGTG CTACACGACG AGTTATTTCA 36301 AACATTCTCA CAGTTCTGGG GATTAGAAGT TTGAAACTAA GGTGCTGAAG AGATTAGTTC 36361 CTTCTGGAAG CTCAGAAGAG CCATCTGGTC CATACTTTTC TCCAGGTTTC TCTTAGTTTT 36421 TGGCAATCCT TGGAATCCCT TGGTTTGTAG ATGCAGCTTC CAAAGCTCAA GATCTCTCTC 36481 CAATGCTGTG TGGCATTTCC CCGTGTTTAT GAGTGTCTAA ATGGCTTTTA AAAACATTTT 36541 TGAGATGTGA AATTCTGGCT GACCCAGAAT ATATAAACCA GGCTGACCTT TGTCTCCCAG 36601 AGATCTCCCT GCCTCTGCTT CCCAAACCTT TTGATTAAAG GTGTGTGTCA AGTGCCCAGA 36661 CCAAATGCCC TTCTTGTAAG GACAACGGTC ATATTGGATT TAGTGTCTAA GTGAGTCCCC 36721 TATGAACTCA TCTCGAACTC AGTTTGCATA GAACACTGTA CCATGCAAAA TAAATGACAC 36781 AGAGACTGAT ATTGGGGTTC ACACTTCAAG CTGAAGGTCA GAAAAGCAAA GCATTGGGCC 36841 ACTAGCTCTT ACCACTACCT CAGGCTGAAC GGGCTGATCC TGCTGCCTCT CCTCAGCATG 36901 GCTGGAGAAT ATCTTCATAT CCTCATTGTG GCTGGAAAAT GAATGCCTGA TATGGAGAAC 36961 TTGCTCCTGT TTTATATAAC TCCCTAATGC TGGGATTAAA GATGTGTGAT CCCAGGTGCT 37021 GAGATCATCT TTGTGTGAGC TGTTTCTCTT TAGGACTGGA TCAATTTTGT GTAGATCTGG 37081 ATGGCTTTGG GCTCACTGAG ATCTATCTAC CTCTTAATCC CTGGTCCTAG GATTAAAGGT 37141 ATGTACCACC ACATCCTAGC TTCTGGCTGC TGGGATTAAA GGTGTATGCC TGGCTTCGAT 37201 GGCTTGTGGC TGACTTTGCT TTCTGAATCC GCAGGCAAGC TTAAAAAAAT CATAAATAAT 37261 ATATCACCAT AGACCACACT TCCAAATAGG CTTCCATTTA GAGGCGCCAG TGGGTGATAA 37321 TGTAGGCGGT TTTACTCAGT TTTGTGCAGA TGGCTGGCGT CCTGTCTGGT GAGTTCAGAT 37381 TTTTTTTTTT TTTTTTTTTA AGTTCAGAAT CTTACCCAGC TCAGCTTTTC AGGCTGCATT 37441 CAGTGTCCGG CTTTTTTCTC ACCGTCTTGA CTTCCTGTCC TGCATCCCAT TTCTCAGCCT 37501 GGACCCTGCC AGTCTATCAG ATAGATAACA TAAACAAAAT TGTACTGGAT TAATGGGAGC 37561 TGTTTGGACA TTTCCTACTT TTGCCTTTTC ACCAATGATT TGCATACTTA AGCCTGCAAC 37621 TACAGCCCCG ATGCAGTAAG CTCAGTCTCT GGCAAGCAAA GGTCTCTCTG GGGTCTTGTT 37681 TAAGAACCAG CTCAGGCTGC TGGCTCTGTT GGCAGTGGAG GTATTTCCTA TAATGGGATG 37741 ATGGGATGGG TTATTCACAC ACATCTCAGT TACTGGGCTA CATGGATCCA AATCAGCCAC 37801 CCAAGGGTTT GCAGTCACAT GTGAGTCACT TAGCACAGAG AAAGAAGCCT GGAGGAGGAG 37861 GGGTCCTCCC AGCTTCAGGA GGGTTTTCCA GGATATAGGC TTCTAGTCTC GTTTTGGATC 37921 AATTTATCAG TTTTGGATTG GGTCTAATAA CTCTTTCCTG AGCCTGGACT GGGCTCAAAG 37981 GCATGAGTAT GTGAGGGGAA TTTACTAGAA TTCACCTGTA GTTTCTGTAT CATTCCTAGA 38041 GAAGGGGAAG TAGAGACACT GGTGATGGGA AATAAAAACA AAACAAAACC TAAATATTGG 38101 GAGCACAGAG GTCCTTGTTC CACAGCTCTT GATAGAAGTC AGGAATGTTA TGTATGTACA 38161 ATTGCCCTTG AAAAGGAAAG GATGTATGAC CTGTTTTTCT GTCCCGAAGG CTGGGAACTG 38221 GGGATGATTA ACAGCCTGTT GATCTGCATT ATCTGAAGGG CTAGGCCATA TCAAGCTCCC 38281 ACAGCTAGCA CTGAAGGAGA ATAGGGCCTT ACAAAGGGAA TTCCCTCTTT GGATCGAACC 38341 TAGGAACATC TTCTGTTTTA CCGCTCTCTC CTTGTTTCAT CTGCAAAGGG AGGAGCTTGG 38401 TAGTGATGTT GAGGCAGGCA CCACTTGTAT TTTTCTAAGC CACAGAGACT GTTTCCCTAC 38461 CTTACAAACA TCCCTGTGCA TCACTGCAGC TCTGTCTCTT ATGGCAGTGT CTCAGTTAGG 38521 GCTTCTATTG CTGCGACTAA ACACCATGAC CAAAAAAGCT CACACTTCCA TACTCCTGTT 38581 CATTATTGAA GAATGTCAGG ACTGGAGCGC AAACAGGGCA GGGTCCTGGA GGCAGGAGCT 38641 GATGCAGAGG TCATGGAGGA AGGCTGCTTA CTGGCTTGCT CTCCATGGCT TGCTCAGCCT 38701 GCTTTCTTAT AGAACCCAGG ACCACCTGCC CAGGGATGAC ACCACCTACA ATGGGCTGGG 38761 CGCTAATATG AGGGATCAAA GAGATGGAGT TGTGGGAGGG ACAGAGGGGG AGAGCAATGA 38821 AAGAGATAAT CTTGATAGAG GGAGCCGTTA TGGGGTTAGG GAGAAACCTG GTGCTAGAGA 38881 AATTCCCAGG AATCCACAAG GAAGACCCCA GCTAAGACTC CTAGCAATAA TGAAGAGGAT 38941 GTCTGAACGG GTCTTCCCCT TTAATCAGAT TAGTGACTAC CCTAATTGTC ATCACAGAAC 39001 CTACATCCAG TAACTGATGG AAGCAGATGC AGTGATCCAC AGCCAAGCAC TGGGCTGAGC 39061 TTCGGGAGTT CAGTTGAAGA GAGAAGGGAT CATGTGAGCA AGGGGGTGGG GGAAGTCAAG 39121 ATCATGATGG GGAAAACCAC AGAGACAGCT GACCCGAGCT AGTGGGAGCT CATGGACTAT 39181 GAAACGCCAG ACGTTGTAGA CTCCCTAAGG AAGGCCTTAC CCCCTCTGAA GAGTGGATGG 39241 GGGGTGGGAA GTGGGGACGC TGGGGGACAG GAGAAAGGGA GGGAGGGGGA ACTGGGTTGG 39301 TTTGTAAAAT GAAAAAATAG ATTTTTTTTA AATAAAAAAA GAAAGTGCTT TACATCTGGA 39361 TTTCATGGAG GCATTTTCTT AACTGAAGCT CCTTCCTCTC TGGCGACTCT AGTTTGTGTC 39421 AAGTTAACAC AGAACCAGCC AGTACAGGCA GCAGAAATAC CTTGCAGAAA TATCTTAGTT 39481 CAGGAGTCCA CGGTGGTCTC AGTCACTTCC TCATGTGCCA CCTGAGTTTA ACATTCCCCA 39541 AAACTTGGAA CACAGGCCAC CACATCATGG AGCCCTGGCT TAAAGCTCAA GTTTTATGGT 39601 ATTTTCTTTT ATCACTGTCT ATAATTCCTA AACATGCTAC AATGTTGTGA GCCCTCACCG 39661 TCTCCTAGGT CCATAGTGAC TTCCTGGCAT TAATAGACTG TGCCCCAAGA GCTCTATGGC 39721 CACGACCACC ACCTGCCATT CCCCTCCCCC TCCATGGTCC CAGCCTCACT TCTTCACTTC
39781 CTGGTCCTTC CGAGCCCAAT GTGCAAACCC ACAGAATCTG TCTGCTTATG TAAGTTTCCT 39841 GGTCACTGAG TGGGGTGACT CAGCACCAAG GTGGTGCCCT GCGATTTCCC AGCCCCAGGC 39901 AGCAGAACAA CTGAAATGGA AAACAAGTCC CGTTAATAGG GTCCAGCTGA GAGCCTCCCT 39961 TTCTCAGGGA GTCTGGCAAA TCTACTCCTC GGGGAACTGC CCTGGGCAGT GGAATTCTCC 40021 AGCTCCCTGC TCATTTCCTA GTTCCTCTTC CCTCTTCTCA CCTTTGGCTG AGGATCAGAA 40081 AGGTTCCCAC TGAGGTCTGC TTTGCCCTGG GCCTGCTCTT TTCAGAGTCC CATTTTTGGA 40141 ATGAATTTTT TTTGTCTCCT ACTTTCAAGT TCACATATTG AAGCCATTAT TGCCAAGGTG 40201 ATGGTATCAG AAGGAGGGAC CTTTGGGAGA TGAATGGATG GATTCCAAGA GGTTATGTGG 40261 GCAGAGCACC CATGATGGGG TTGGTGCCTT CATAGGAAGA AGACACAGTA GAAGGGAAAG 40321 AGATGCCGAC TGAAAAACAG GAAGTCTCCT GGAGTAGGCC ACTCAGCCTA TGACACGCCA 40381 GCACTCAGAT CTCGGACTTC CCATCTCCCA AATGGTGATA AACAAATGCT GTTGTCCAGG 40441 CTGCACAGTC TACGGCATTT TGTTGCAAGG GCCTGGACCA ACCAGGCTCA GGCAGGAAGT 40501 GAATCTAGTG TGGGAGGATG TACAGACTGC CACTCAGTCT GGACACAAAC TGTCCTCAGG 40561 GATCACCTGA GCCACATCTA CCTAAGAATG GCTATTCTTT CCATTTGTTA ACATCAAATG 40621 CCAAGCCCCT ACTGTATGTA GGCTCTTGCT AGCAGTGGAT ATGATGCTAT GTGAGATGGG 40681 AGCAATCCTC TCTGCACAGA ACTATACATA GAACTATGCA TAGAAGACCA ACAGGGAGAC 40741 ATCAGATAAC TATTAACTGT GATAGCTCTG TGGGAGACAA ACAGAATGAG GGAATGGACA 40801 ATGACTTTGA GGAAAAACTA TGATTGAAAA TACTCTATCT GGCTGGGCGG TGGTGGCGCA 40861 TGCCTTTAAT CCCAGCACTT GGGAGGCAGA GGCAGGTAGA TCTCTGTGAG TTCGAGACCA 40921 GCCTGGTCTA TAAGAGCTAG TTCCAGGACA GCCTCCAAAG CCACAGAGAA ACCCTGTCTC 40981 AAAAAAAACA AAACAAACAC ACAAAAAAGA AAATATTCTG TGAGGTAAAC AAGCATCTGG 41041 AAGGGTTGGG AGATAATGCA GGCAAAAATG CATTAGACAG CACACAGTAC AACACAGCAA 41101 TCAAACTTAA TATAAACACA GCAAATGTCA TCTTTGGGCT TTGCCCCATT TCCTGATCTG 41161 ACCATAACAG CCTAGTGTCT GGAAAGCACA CTAAAGCCAT TTACGTCACA CAGGAGTTCA 41221 ATGTTGAGTT CAGAGGGAGG GGGTGGAGGG CAGATTAGCG AGGTACAAGT TCTGGTCCCT 41281 TTGATGAAGT GTTGATGTAC CCATCGACAC CACACAAATA TACCATCATG CTCCATGTTA 41341 GGGTCAGTGA AGGATTGCAT ATGTGACGGT GGCCCACTGG GCTGAGAAAG CCCTATTGCT 41401 TAGTGACATC TGTGATAATG ACATGCGAGC CCTATTGCTT AGTGACATCA CTCTTCTCAT 41461 AGTGTGGGAT CCAATGTGTT TCTTGTACAC TTGTGATAAT GACATGCAAA CAAGTCTATT 41521 GTGCGGCCAG TCACACAAAA AATATATTAT GTGCAGTCAG GAACAGTCCA TAGTACTTGA 41581 TTGGGACAGC ACAAGTCTGT GTTGCTGGTT CACACATTAA TCATTACCAC TGTTTTAGTG 41641 TGCTCCTATA TATATATATT TAAAAATTAC TATAAAATGA TACACCGTGC TGAGCAATAG 41701 CACCTCTTAT ACCTTGTGTT TACTGGATGT ACTCAAGCTA TTTTCTCTTG TGCTTGATTT 41761 ATTTGTATTT GTATTTTTGA GAGAACCTCA TCTAGTCCAT GCTGGCTTCA AACTTGTTAT 41821 AAAGCTGAGG ATGGCTTCGA ACTCCTGATC CCCCAGCCTC TGCCTCCCAA ATGATGAGAT 41881 TACAGGCATA TGCTACCAAA CATGACTTTT ATTTATTTTT ATTACTTAGG TGGTATGGGT 41941 GGTTTGAATG AGACTGTCCC CTTTGGCTTA TATATTTGTA GGTGGACCTT TGGAAAGGTT 42001 TAACAGGTAT GACCATAGTG GAGGCAGTGT GTCAGTAGGG GAGGTCTTTG GGGAACCCAA 42061 TACTCAATCA ATTCCAAGTT AGGGCTGTCT GTCTGTCTGT CCCCTGATTG TGTCACAAGG 42121 CAGAAACTCT CAGCTACTGC TCTAGTTCTA TGCCTACCCA CCTGTTGCCA TGGTCCCTGC 42181 CATGATGGTC ATGTACTTCA ACCCTTTGGA TAGGTGGCCC CCAAATTAAA TGGTTTCTTT 42241 TATAAGTTGC CTTGGTCATG GTGTTTTGTC ATGGCGATAA GAAAGTGACT GAGACAGGTT 42301 TGTTGCTGTT GTTACAAGGT TTAGTCCAGG CATCTGGCAC CACCTCTGGC CTGTGCTTGA 42361 TTCAATCATG TTACCTTTAG AAATAGCAGG CTAAAGGACA TATACCTGTG TACGTATATG 42421 TGTACGTATA TATTAGCTGT ATAGTCTAAG TGTGCACCTG ACTCTAATAT CTAGGTTTGT 42481 GTAAGTAGAC TCCACCAAGC TCACTAAGCA ATGGTATCAC AGTTTTCAGA TAGTGTTCAG 42541 CGATGCTTGG CTGAGTGTTA GTTCTTTTTT TAATATTTTA TTTATTTATT ATGTATACAA 42601 CATTCTGCTT CCATGTATCT CTGCACACCA GAAGAGGACA CCAAATCTCA TAACGGATGG 42661 TTTTGAGCCA CCATGTGGTT GCTGGGAATT GAACTCAGGA CCTCTGGAAG AGCAGTCGGT 42721 GCTCTTAACC TCTGAGCCAT CTCTCCAGCC CCTGAGTGTT TTTAAATCAA GGAAAAAAGC 42781 CTGAGGGAAG GGAGCTCAGG CTGAAGGGGA GGAGTCAAGA CAGTCTGACC CCAAGGCATT 42841 GTGGGACGTA AAGAGTTCTG GGACAAGACT GAGGTCTCTT CCTTCTCAGA GACTGTGGGC 42901 TTCAGTTTCC TTGGTAGCCG GAAGCAAAGC TAATCCATGG CTTAAAATAT AATACTCAGT 42961 GTAACCTTGT GTTGTAGAAG TGACTTGCTT GTCTTCTTCC ATAATTCTAA AACATCTTTA 43021 AGAGCAGGAT CCAGGAAGGG AAAAGGAGAG ATTCTCATCT TCTTCAAAAG GCAGCTTTCC 43081 CTAAAGCATT TTCTGATGAA ATTTAAGTTC TAAAACCAGC AGTGGTATAA TCCCATCATG 43141 AATGGGGATC TCTGAGTTTA AGGCCAGCCT GGTCTACAGA GCAAGTTCCA GGACAGCCAC 43201 GGTTACACAA AGAAATCCTG TCTTAAAACA AAACAAAACC CAAAACAAAC ATAAACAAAA 43261 ACTATCCAAA ACCAACCAAC CCCCCCAACT CAGAAAGAAA GAAAGAAAGA AATCAAGAAA 43321 GAACTGCCCA CCGGGTGTTG GTGGTGCAAG CCTTTAATCC CAGCACTCGG GAGGCAGAGG 43381 CAGGCAGATC TCTGTGAGTT TGAGGCCAAC CTGTTCTCCA GAAAGAGTGC CAGGATAGGC 43441 TCCAAAGCTA CACAGAGAAA CCCTGTCTTG AAAAAAGAAA AGAAAGAACT ACCCATGACC 43501 AAACAGTTCC ATGGCCAGGT AGAGAATGAG GACGCTGAAA GTCACACCTT CTCAGAGTCT 43561 CAAACTGCAC ATCTGGCCTC AAAGTCCAGA AATGAGTGCA AGACCATTAA TGACAGTCTT 43621 TGGAAACAAA CCAGACCAAA GAACATTTGG CTCCTGATAC ATATTCTGAG GGTCACATAG 43681 AAAGAAAGAT CTGCCTTTGG CCACCTCCTT TTGAAGTGGG GAATTTTATT TTCTTCTGCA 43741 TGGAAACTTC ATGTAGGTAT TTGAGAATAC ATACAGACAT GCAGGTGCAC ATGCACGGAC 43801 ATGAACACAC ACATACACCC CGGGTAGGCA GGCAAGAAAG TGTGTGGAAT AACACTTGAA 43861 CTTCCCTTCC AGAACAGAAG CCCTCTGAAG TGTGACATTC ATGCTGGCTG CATGGGGTCT 43921 GATCAGTACT AGTGAGTGGA GGTGGAGGGG TAGGAAACAT GGGGATGATA ATAGGTTGTC 43981 AGGAAAGTGG TGCCCCAGGT AGCACAGAGT AGAAATTTGT CCCCCAAAAT CCTTTTGAAC 44041 CCAGTTGATT TGAATGCCGT GCCCCTGCCA CCCAGGCTTC AGAGCTAAGT GACTTATGTC 44101 TTCAGGTCAG TGATGATTAC CACGGTTGCA GTGCTAACAC AGATGCTTTA TCTACCAGGA 44161 CAGAAACAAG AAAGATGCTC CTTCCCAGGC CCCTTAGCAC TCTCTGGGTG GGGAGGATTG 44221 CCCCACCTTC CAAAAATAGA ATACTGTTTT GGTAAACAGC CACTTTGAGC CCATGAGGAT 44281 ATCTTCATTA GCTATGGAGA CAGGTTTTAG TAAGAAAGCA AGATGAGAGG CTAAAAAACC 44341 CTTGGGGAGC AGGAACTGGG AAGACTGTGG TACCTTGTTC CCAGATCCAC CAGAAACCTT 44401 GCCACCAGAC GATGTGTCCA GGCCCCACAT ATTTCACAAA AAGTTGGATC TGATAACAAT 44461 GAGGATGGAA TCCCGGTCTT AAGGTGGGTT TGGGGTGGGA AGAGGCGGGA TAATGGGTGA 44521 GAGGGTCGGT GGGGACAGGT GAGATGGGGT ATGGTGGGGA GAGGTGGAAT GGGGTGGGGT 44581 GGGTTGAGAT GGAGTATGGT ACAGCGGGGA GGGATAGAAT TGTCTTTTCC CTGTACCACA 44641 GAGAAGTTTG ACTGCTACCC TTGGCAATTA ATCAATTATA GAAAATGCAA CTTTGCTTTT 44701 AAAATGTGTC TATTTCCAAA GGCTTCTTCC CCTCCCCTAC CTAGGGAGAA GGAAAGAATG 44761 GATAATGCTA CTGTAGAGGA GGGTAGCATC ACTATAGAGG CCTCAGTATC TGCCCCAGGG 44821 AGCTGGGAGA GAGTTCTATC ACACAAACAC AGCCCGAGTC ACATACTCAA CAAACCCCAC 44881 AAAACAAAAC AACAATAATG AAGATACAAA ATCTCATTAT GTAGCCCAGG CTAGTCCTAG 44941 ATTTCTGTTT TCTTTTTTTG TTTTTCGAGA CAGGGTTTCT CTGTGTAGCT TTGGAGCCTA 45001 TCCTGGCACT TGCTCTGAAG CCCAGGCTGC CCTCACTCAC AGAGATCCGC CTGCCTCTGT 45061 CTCCAGAGTG CTGGGATTAA AGGCGTGCAC CACTAATGCC TGGCTAGTCC TAGATTTTTT 45121 TATCCTCCTG CCTCAGGCTC CCAACTGTTG GGTTTACTTT TGGGAGTCCA TTTTCTTCCA 45181 GCATGGATTC TTTGAATTGA AATTCAGATT ATCAGGTTTC TGTAGCAATC CCACCAGCCC 45241 ATTTTTTTGT CTGACACTGC TTGTTTTGAG ACACAGTCTC CCACTGCTGT AGCCCAGGCT 45301 GCCCTAGATT TTCTATGTAG CCCAGGCTGG CCTTGAACTC CCAGGAGTCC TCTGGCCTCT 45361 CCCTTTTGAT TACTGGAACT AGAAGAAGTC ACTATGCTTG ACTTGGAACT AATATTAGAA 45421 CAAAATATAT TTTTCATTGA GATTCAACTT TGAAATCCTG ATGCTCCTGC CTCACTCAGG 45481 TCATCAGGGT TGGCAGCAAG AGCCTTTATC CACTGAGTCA TATTGGGCCC TGACCTGCTT 45541 TTAAATTTTG CCTTTAGGGC TGGAGATGTA GCTCGGCTGG TTCAGTGCTT GCCTGGTACC 45601 CACGAAGCCC TGGGTTTGAT CTACAACACA GTATAAGCCA GGCCTGATGG CGTATACATG 45661 TAATCCTAAC ACTTGGGGAG CAAGAGGGAG GCCAAAGCCA TCCTCTGCTA CTTGGTGAGC 45721 TTGAGGCCAG CCTGGGATCC TTGAGACCCT GTTTCAAAAC AATAACAACA AACACAGACT 45781 ACTAAAAAAA ATTAATAAGG GCCAGACTGG GTGGTGTATT CCTTTAATCC AAGCAATGAG 45841 GAGGCAGAGG CAGGCAAATG TCTGTGAGTC TGGGGACAGC CTGGTCTACT GAGCAGCAGG 45901 CCAACTAAGG CTACATAGTG AGACTATCTC AAAAAAAGCA AAATAACAAT AAACAGACCA 45961 GTTCCCCATC TCCTATTTTG CCTTTACCTC CTATTCCCTG CTCAGCAGGT TATTTTTTGT 46021 TCCTGCATCT TGGTTCACTG ATCTGTAAAC TTGTCTGAAT AAGTAGGTAC AGGGTTGTTT 46081 TAAAATTAGA TAATATATTC AATGAGAAGG GCTACCAAGT GCTCAACCAA TGTATGCATA 46141 TGTATGTATG TATGTATGTA TGTATGTATT TATTTTTGTT TTGTTTTTCA AGATAAGGTT 46201 TCTCTGTGTA GTTTTAGAGT CTGTCCTAGA ACTTGCTCTG AAGAGCAGGC TGGTCTTGAA 46261 CTCACAAAGA TCCACCTGTC TCTGCCTCCC AAGTGCTGGG ATTAAAGGCA TGTGACACCA 46321 CCCCCAAAGC CAATGTTCTT ATAGGCATCT TTGATTTTTT TTCTCTTTCT TTGAGTGGAG 46381 TCTGACTAAG TAGCCCATAC TAGCTCTGCA TTTACAATCT GAACACATGG ATAAGAGTGG 46441 TGAAAATTAT CAAGATCATG TTATGCTATG CCTCCTGAGT CACCATGCCC TGCTTCAGAC 46501 TTCTTTGTAT TAAAGAACTG TGTAAAAAAA AAAAAAAAGA CATTTGAAGG CACATAATCA 46561 GAGGAATTTG TCAGTGATTT TTCACATACT GTCTTATTTG TGGCCAAGGT AAGCCTAGAG 46621 AGTATTTCTT AAAATTAAAA ATAGTGGGCA GATTTTGGAG GCGATCTGAT ATGAAAATCC 46681 CTTCCCACCC CAGGTAGTCA TGGGCTGACT ATCAAGGATA CATTCTGAGA CATATATCCT 46741 CAAGCAGTTT CTGCCTTACG CAAATATCAT AGGTCATAGC ACACTGAGAC TATGTGGCAG 46801 TCTATGTGTC TATATACACA TGGTGTGGCC TATTGTTCCC ATGGTCACAA AGAACAAAAC 46861 AACTTTTTCA CAAGGCTTTA CCCCTAGAGG AAGAGCTACA GGCAATCAAT GGTTGCTGAG 46921 AGGAGTATCA GTCTTCTCCA GGGACTTAGC CAATCCCAAG AGGTCAGCCA CGCATAGGAA 46981 CGCTTAGCCA CGCTTGTATA GAACATCTCA AACAACAACC ACCTCAGTGT AAAGCAAGCA 47041 CACAAGGAAC TGATGCAACT AAGAGACAAA GGGCCCGGTG TGTGTGGCCC GTAGCTGTCA 47101 TCCCAGCACT TGAGACTAAG GAAGGAAGGT TGAGAATTTG AGGCCAGCAT GGACTCCACA 47161 GAAAGACCGT TTTCTTTCTC AGAAAAAAGA AGCAAAAACC AAGAACAAGG TGTATGGGAA 47221 TGCTACTGTC TTGGCATATT GTTTATAGAA AACTTTTTTA TATATAAAAG GAATGCACTA 47281 CAAAAATTAT AAACTACTGT AATATTAACT GCATAGATCT ATAACATGGT CATTTATTAT
47341 TGAGTATGAT TATCTATCTA CCCACGCTGC AGGTTTAGAC AGTTGCACTA CAGTAGATCT 47401 GTTTGCAGTA GCATCATTAT TAGACATTTT GGACAAAGCC AAGTGGTAAT GGCACATGCC 47461 TTTAATCCCA GCACTTGGGA AGCAGAGGTA GGCGGATCTC TGTGAGTCAG AGACCAGCCT 47521 GGTCTACAAA GAACTAGTTC CAGGAGAGTC TCCAAGGCCA CAGAGAAACC CTGTCTCGAA 47581 AAACCAAAAG AAAAAAAGAA AACAAAAAAC TAAAAAATAA ATAAATTTGG GGCAATATCT 47641 TGTCCTATGA TGTTACTGGG TAATGGGATT TCCTCCTCTT GTATTATTTT TTCTTTGGGG 47701 GTTTTACTTA TTATTTACTT GAGACAGAGT CTCATTTATG ACAGGCTGGC CTCAAACAGG 47761 AAATGAAGCC AAGGAAGACC TTGAAGACCT AATCCTTCTG TTTCTTCCTC CTATATGGTG 47821 AGTTAAAGGC ATACAGTACC ATGCCCAGTC TATTCACTGC CCAGGGCTTC ATGCATGCTA 47881 GCAAAGCACC AACTGAGCTG CATCCCCACC CCTCCTCCTG GCTTCCATCT CCTTATGTAG 47941 CTAGAAATGA GCCTGTCTGT CTCAAATACT GGGATTATGG GTGTGTGCCA CCACACCTGG 48001 CTTCCTATTA TAGCCTTGTG GGATCACTGT TGTTTACTGA AGCATTGTGA CACACTGCAG 48061 ATTGCTGGAA CAGCGTCTGC CATCATCATG ACACAACTTC AGAGAAAGAG AGAGTTCCCA 48121 ACCAGCCACA CACTTAACTC AATGCCTGTA GCCCTTATTC TGTTAAGACG ATTTCCTGCC 48181 ATCTTACTCA AAGACCCTCT TTAACTCGGT AGGAACATCT GTTACACTGA AAGTCCTGCC 48241 TGTTGCTCCA CTGACCTCCT TCACAAATTA TTATATTTTG GAGCCAATTC TGAACCCAGG 48301 TTTTCTGAGT GACACATTTT AGTATTTTTT TTTTCTTTCT ATTTTCTTTC ATGGAAAGTC 48361 TCTTGTTACT GTTCACATGA CCAAGGATCA CTGCATCATC TTCCAAGGCC AATTTTGGAT 48421 GTTTCAGCAA GGGAGACTGA AGATCCTGAG TCTCAGTGTT GATCTCCTTT AGAATGTCCT 48481 CTGGAGAAGG TAGTGACAAC ACTGCAAGGA TAATAGGTGA ATAAAGGGAA GCCAGAGTGT 48541 CCTCTGGGAT GTGCGGCACT TACATGAAGG ATTCATTTAT AAATTTTAAG TTATGGAGTA 48601 TAATAATAAG ACTAAATATG TAGTGTCGTA ATTTTATAAC TATACATATG TATATAGTAA 48661 ATATAAATTT ATATGTAATG TATTTATAGT AAGTGTACAT AGAATTGAAC ATATGTTACA 48721 TAAATGGCAG AAAGGAATGA TTCTCAATTG CTTTTTTTCT AATTATAATT TCTATTGCTC 48781 TTTGTGGATT TCACACCATG CATTCTGATC CCACTTATCT CCTTGTCTCC TTGCATTTGC 48841 CCTCTGCCCT TGCAACCTCA CCCCCAAATC AAAGCCAAAT TTAAAAAAAA AACCAAAATC 48901 CAAACAAAAC AGAGACAAAA CAAAAATAAA AGCAACAACA AAAAAAGGAG AATCTTGTCA 48961 TGGTAGCTGT AGTGTGGCCT GTTGAATCAC ACAGTATACC CTTTAGTCCA TTCATCTTTT 49021 CTTCCAAGTG TTCATTGATA CAAGTCACGG TCTGGCTCGA GGATTCTGGT TTCTGCTATA 49081 TTACTAATAA TGGGCTCTCA CTGGGGCTCC CCTTGGATAT CCTATTGTCC TGTGTTATGG 49141 AGAGCCTGCT GTTTTGGATA TGTAGGTTTG TCCCCTTCAC ATGCTATAAC AATTCATAAA 49201 TTCAGTGAAT GTTGGGGTGG GCCAACTCAT AGCCCTGGTT CTGGGCTTGG GTGGTATTAT 49261 TAAACCCACT GATGGAGAAT AAGACCACTA CCATAATTTA AAAGCCAAAT TGAAGCAAGT 49321 TTTAATTCAA TACTGCCCAG GTGGACAGGC TCTGGCTAGG TCCATCTCTG AGTTTCCAGG 49381 AGGTGGCCCT GACTCACGGT TTACAGTGGC TTGAGTATTT TCCATAAGGT CCAATCAGGG 49441 GCAAGCATAC ATCCTGATGT ACCTCCAGTC TATATCCAAT CGGGGGCAAG TGTACATCTT 49501 GATGTATTTC CTGCCTGTGA ACCTACTGCC CACATGTGAT CAAGCACATC CGGTGCAGTT 49561 GGGTCAAACA GACTTGTTTA GGGCAATGAA AAACACATGG CTTTTTATCT CCCATAAACA 49621 ATAGCCTCCA GCGGTTCAGG GACTATTTGT CCTTGGGCAA GGAATTTACA GATCCTATAG 49681 GTGAGTCAGG GTCAGCATCC TGCTCTCATG CCCTCAGGGC TGGCTCACTT GTTACCTCCC 49741 CGACCCTCTC TCAACAGGGT CAGCTCTGAG GTGCTGCCCA GGTGGGGTGC AGGGCCTACT 49801 CTTCCGCATG TTGCAGCTGG TCAGGGTTAG TTCTCTCATA TGCCACAGGT GGCAATGGGT 49861 GAAGGGGGAG GGCATGTTTC CCTCATCAAC GCCATTACAT GGGGGGATGG GGTCAGCTCT 49921 CATGCCCTTA GGGTTGGCTC ACCTGCATCC TTGACCATAG GGTCAGCTCT AGTATGCTGC 49981 TCAAGTGAGG CGCACACCTA SEQ ID NO: 4 (LoxP sequence from bacteriaphage P1) 1 ATAACTTCGT ATAGCATACA TTATACGAAG TTAT SEQ ID NO: 5 (FRT sequence from the 2 .mu.m plasmid of the baker's yeast Saccharomyces cerevisiae) 1 GAAGTTCCTA TTCtctagaa aGTATAGGAA CTTC SEQ ID NO: 6 (attB sequence from E. coli) 1 cCTGCTTt t TtatAc tAA CTTGa SEQ ID NO: 7 (Recognition site for the CHO-23/24 meganuclease, 35,699 basepairs downstream of CHO DHFR) 1 TAAGGCCTCA TATGAAAATA TA SEQ ID NO: 8 (Recognition site for the CHO-51/52 meganuclease, 15,898 basepairs downstream of CHO DHFR) 1 ATAGATGTCT TGCATACTCT AG SEQ ID NO: 9 (CHO-23/24 meganuclease) 1 MAPKKKRKVH MNTKYNKEFL LYLAGFVDGD GSIKAQIFPN QCYKFKHQLR LRFQVTQKTQ 61 RRWFLDKLVD EIGVGYVTDR GSVSDYMLSQ IKPLHNFLTQ LQPFLKLKQK QANLVLKIIE 121 QLPSAKESPD KFLEVCTWVD QIAALNDSKT RKTTSETVRA VLDSLPGSVG GLSPSQASSA 181 ASSASSSPGS GISEALRAGA GSGTGYNKEF LLYLAGFVDG DGSIIAQIKP GQSYKFKHTL 241 QLVFQVTQKT QRRWFLDKLV DEIGVGYVID RGSASDYRLS EIKPLHNFLT QLQPFLKLKQ 301 KQANLVLKII EQLPSAKESP DKFLEVCTWV DQIAALNDSK TRKTTSETVR AVLDSLSEKK 361 KSSP SEQ ID NO: 10 (CHO-51/52 meganuclease) 1 MAPKKKRKVH MNTKYNKEFL LYLAGFVDGD GSIIAQIPPN QSCKFKHQLR LTFQVTQKTQ 61 RRWFLDKLVD EIGVGYVRDR GSVSDYILSE IKPLHNFLTQ LQPFLKLKQK QANLVLKIIE 121 QLPSAKESPD KFLEVCTWVD QIAALNDSKT RKTTSETVRA VLDSLPGSVG GLSPSQASSA 181 ASSASSSPGS GISEALRAGA GSGTGYNKEF LLYLAGFVDG DGSIYAGIAP NQSCKFKHQL 241 RLWFVVSQKT QRRWFLDKLV DEIGVGYVID NGSVSHYRLS EIKPLHNFLT QLQPFLKLKQ 301 KQANLVLKII EQLPSAKESP DKFLEVCTWV DQIAALNDSK TRKTTSETVR AVLDSLSEKK 361 KSSP SEQ ID NO: 11 (CHO-51/52 donor plasmid with EcoRI site) 1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 61 CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 121 TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA CTGAGAGTGC 181 ACCATATGCG GTGTGAAATA CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGGCGCC 241 ATTCGCCATT CAGGCTGCGC AACTGTTGGG AAGGGCGATC GGTGCGGGCC TCTTCGCTAT 301 TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGATT AAGTTGGGTA ACGCCAGGGT 361 TTTCCCAGTC ACGACGTTGT AAAACGACGG CCAGTGAATT CGAGCTCGGT ACCCAGAAAC 421 CTTTCAACCA GCTTTTGAGC TAATGATAGA GAGAAGCTCA AGGAATTGGA GCAATGCTTG 481 ACTAGGGATG TCAGAGGGAG GCTATCCAGA GGAGCTTACA ACTGAGGTAA ACTTAAAAGT 541 TAGGGAGTTT GTCAACTTCA ACCCACAGAA TAGAGCAGAG CCAGGAGGAG CTGAGGCTTC 601 TGAGTGTTAT GGTGGAAGCA TCACCCCAAC CCTTGACATC CATATGCCTG AAGAGTCTGG 661 AATGTTATGG TGGAAGTTCC ACCCAAGCCT CCCTTCCCGG TCGCCCTCCA AACCCTGCTA 721 CATCTCAGAA ATCCCACCAA ATGATGACTC CCTCCCCCAG AGATATTCAA GACCACTCCC 781 ACAGGGTATT TAAACTGCCC CCCAACCCCC AGAAAATAGA TGTGTGGTTT TCCAATCTCT 841 CTTTCCTATC ACGTCTCTGG GGAGCTGGCA GGCCATTTGG GAGCATTGTA TCCATTAAAC 901 GACTTCTCAG TGGAGACTCT GAAAGCCAGA AGAGCCTAGA CAGATAGATG TCTTGCGAAT 961 TCTTGCATAC TCTAGAGACT ACAGATGCCG GCCCAGACTA TTATATCCAG CAAAAGTTTC 1021 AAACACCATA CAAAGTCAAA TTTAAACAGT ATCTATCTAC AAATCCAATA TTACAGAAGG 1081 TGCTAGTAGG AAAACTCCAA ACTAAGATTA ACTATACCTG TGAAGACACA GGAAATAATC 1141 TCACACTGGC AAAAGAAGAA AAACCTCTCT CTCTCTCTCC TCTCTCTCTC TCTCTCTCTC 1201 TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCACACACAC ACACACACAC ACACACACAC 1261 ACCAACACCA ATACCATGAA CAACAAAATA ACAGGAATTA ACAATAATTG ATGTGTGTGT 1321 ATGTCCCTGT GTGTGTGTCC TTGTGTGTGT CTGTTTGTGT GTCTGTGTAT ATGTTTGTCA 1381 CCTGAGGGGT GGCTCTTCCT TGGTTTGTGA GGTTTCTACC CAAAAGCTTG GCGTAATCAT 1441 GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG 1501 CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG 1561 CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA 1621 TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA 1681 CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG 1741 TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC 1801 AGCAAAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC 1861 CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC 1921 TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC 1981 TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA 2041 GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 2101 ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA 2161 ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG 2221 CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA 2281 GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG 2341 GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 2401 AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT 2461 CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA 2521 GGATCTTCAC CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT 2581 ATGAGTAAAC TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA 2641 TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC 2701 GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG 2761 CTCCAGATTT ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG 2821 CAACTTTATC CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT 2881 CGCCAGTTAA TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT 2941 CGTCGTTTGG TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT 3001 CCCCCATGTT GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA 3061 AGTTGGCCGC AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA 3121 TGCCATCCGT AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT 3181 AGTGTATGCG GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC 3241 ATAGCAGAAC TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA 3301 GGATCTTACC GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT
3361 CAGCATCTTT TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG 3421 CAAAAAAGGG AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT 3481 ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT 3541 AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCT 3601 AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG AGGCCCTTTC 3661 GTC SEQ ID NO: 12 (Recognition site for the CHO-13/14 meganuclease, in Intron 2 of CHO DHFR) 1 TACATGTATG TACAAAATAT AT SEQ ID NO: 13 (CHO-13/14 meganuclease) 1 MAPKKKRKVH MNTKYNKEFL LYLAGFVDGD GSIFASITPR QCYKFKHELQ LTFVVTQKTQ 61 RRWFLDKLVD EIGVGYVIDQ GSVSHYRLSE IKPLHNFLTQ LQPFLKLKQK QANLVLKIIE 121 QLPSAKESPD KFLEVCTWVD QIAALNDSKT RKTTSETVRA VLDSLPGSVG GLSPSQASSA 181 ASSASSSPGS GISEALRAGA GSGTGYNKEF LLYLAGFVDG DGSIIAQIKP NQSCKFKHQL 241 MLTFTVAQKT QRRWFLDKLV DEIGVGYVID IGSVSEYRLS QIKPLHNFLT QLQPFLKLKQ 301 KQANLVLKII EQLPSAKESP DKFLEVCTWV DQIAALNDSK TRKTTSETVR AVLDSLSEKK 361 KSSP SEQ ID NO: 14 (Recognition site for the CGS-5/6 meganuclease, in Exon 4 of CHO GS) 1 AAGGCACTCG TGTAAACGGA TA SEQ ID NO: 15 (CGS-5/6 meganuclease) 1 MAPKKKRKVH MNTKYNKEFL LYLAGFVDGD GSIKAIIRPE QSYKFKHRLR LVFQVTQKTQ 61 RRWFLDKLVD EIGVGYVYDR GSVSDYYLSE IKPLHNFLTQ LQPFLKLKQK QANLVLKIIE 121 QLPSAKESPD KFLEVCTWVD QIAALNDSKT RKTTSETVRA VLDSLPGSVG GLSPSQASSA 181 ASSASSSPGS GISEALRAGA GSGTGYNKEF LLYLAGFVDG DGSIWARIKP GQSYKFKHTL 241 ELVFQVTQKT QRRWILDKLV DEIGVGYVTD AGSASVYRLS EIKPLHNFLT QLQPFLKLKQ 301 KQANLVLKII EQLPSAKESP DKFLEVCTWV DQIAALNDSK TRKTTSETVR AVLDSLSEKK 361 KSSP SEQ ID NO: 16 (Forward PCR primer for evaluating CHO-23/24 target site) 1 ggagggacat taatctgcat gcagtgatc SEQ ID NO: 17 (Reverse PCR primer for evaluating CHO-23/24 target site) 1 gtcttggttt gggttgtcta agcaacctc SEQ ID NO: 18 (Forward PCR primer for evaluating CHO-51/52 target site) 1 CACAGGTGTC CACTCCCAGT TCAATTACAG CTCTTAAGG SEQ ID NO: 19 (Reverse PCR primer for evaluating CHO-51/52 target site) 1 CGATGGCCCA CTACGTGAAC CATCACC SEQ ID NO: 20 (PCR template for mRNA encoding CHO-23/24) 1 CACAGGTGTC CACTCCCAGT TCAATTACAG CTCTTAAGGC TAGAGTACTT AATACGACTC 61 ACTATAGGCT AGCCTCGAGC CGCCACCATG GCACCGAAGA AGAAGCGCAA GGTGCATATG 121 GCACCGAAGA AGAAGCGCAA GGTGCATATG AACACCAAGT ACAACAAGGA GTTCCTGCTC 181 TACCTGGCGG GCTTCGTCGA CGGGGACGGC TCCATCAAGG CCCAGATCTT TCCGAACCAG 241 TGCTACAAGT TCAAGCATCA GCTGAGGCTC CGTTTCCAGG TCACCCAGAA GACACAGCGC 301 CGTTGGTTCC TCGACAAGCT GGTGGACGAG ATCGGGGTGG GCTACGTGAC TGACCGCGGC 361 AGCGTCTCCG ACTACATGCT GAGCCAGATC AAGCCTCTGC ACAACTTCCT GACCCAGCTC 421 CAGCCCTTCC TGAAGCTCAA GCAGAAGCAG GCCAACCTCG TGCTGAAGAT CATCGAGCAG 481 CTGCCCTCCG CCAAGGAATC CCCGGACAAG TTCCTGGAGG TGTGCACGTG GGTGGACCAG 541 ATCGCGGCCC TCAACGACAG CAAGACCCGC AAGACGACCT CGGAGACGGT GCGGGCGGTC 601 CTGGACTCCC TCCCAGGATC CGTGGGAGGT CTATCGCCAT CTCAGGCATC CAGCGCCGCA 661 TCCTCGGCTT CCTCAAGCCC GGGTTCAGGG ATCTCCGAAG CACTCAGAGC TGGAGCAGGT 721 TCCGGCACTG GATACAACAA GGAATTCCTG CTCTACCTGG CGGGCTTCGT GGACGGGGAC 781 GGCTCCATCA TCGCCCAGAT CAAGCCGGGT CAGTCCTACA AGTTCAAGCA TACCCTGCAG 841 CTCGTTTTCC AGGTCACGCA GAAGACACAG CGCCGTTGGA TCCTCGACAA GCTGGTGGAC 901 GAGATCGGGG TGGGCTATGT GATCGACCGC GGCAGCGCCT CCGACTACCG CCTGAGCGAG 961 ATCAAGCCTC TGCACAACTT CCTGACCCAG CTCCAGCCCT TCCTGAAGCT CAAGCAGAAG 1021 CAGGCCAACC TCGTGCTGAA GATCATCGAG CAGCTGCCCT CCGCCAAGGA ATCCCCGGAC 1081 AAGTTCCTGG AGGTGTGCAC CTGGGTGGAC CAGATCGCCG CTCTGAACGA CTCCAAGACC 1141 CGCAAGACCA CTTCCGAGAC CGTCCGCGCC GTTCTAGACA GTCTCTCCGA GAAGAAGAAG 1201 TCGTCCCCCT AGACAGTCTC TCCGAGAAGA AGAAGTCGTC CCCCTAGCGG CCGCTTCGAG 1261 CAGACATGAT AAGATACATT GATGAGTTTG GACAAACCAC AACTAGAATG CAGTGAAAAA 1321 AATGCTTTAT TTGTGAAATT TGTGATGCTA TTGCTTTATT TGTAACCATT ATAAGCTGCA 1381 ATAAACAAGT TAACAACAAC AATTGCATTC ATTTTATGTT TCAGGTTCAG GGGGAGATGT 1441 GGGAGGTTTT TTAAAGCAAG TAAAACCTCT ACAAATGTGG TAAAATCGAT AAGATCTTGA 1501 TCCGGGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 1561 CCTGAATGGC GAATGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG TGTGGTGGTT 1621 ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC 1681 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG GGGGCTCCCT 1741 TTAGGGTTCC GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA TTAGGGTGAT 1801 GGTTCACGTA GTGGGCCATC G SEQ ID NO: 21 (PCR template for mRNA encoding CHO-51/52) 1 CACAGGTGTC CACTCCCAGT TCAATTACAG CTCTTAAGGC TAGAGTACTT AATACGACTC 61 ACTATAGGCT AGCCTCGAGC CGCCACCATG GCACCGAAGA AGAAGCGCAA GGTGCATatg 121 gCACCGAAGA AGAAGCGCAA GGTGCATATG AACACCAAGT ACAACAAGGA GTTCCTGCTC 181 TACCTGGCGG GCTTCGTGGA CGGGGACGGC TCCATCATCG CCCAGATCCC GCCGAACCAG 241 TCCTGCAAGT TCAAGCATCA GCTGCGCCTC ACCTTCCAGG TCACGCAGAA GACACAGCGC 301 CGTTGGTTCC TCGACAAGCT GGTGGACGAG ATCGGGGTGG GCTACGTGCG CGACCGCGGC 361 AGCGTCTCCG ACTACATCCT GAGCGAGATC AAGCCTCTGC ACAACTTCCT GACCCAGCTC 421 CAGCCCTTCC TGAAGCTCAA GCAGAAGCAG GCCAACCTCG TGCTGAAGAT CATCGAGCAG 481 CTGCCCTCCG CCAAGGAATC CCCGGACAAG TTCCTGGAGG TGTGCACCTG GGTGGACCAG 541 ATCGCCGCTC TGAACGACTC CAAGACCCGC AAGACCACTT CCGAGACTGT CCGCGCCGTT 601 CTAGACAGTC TCCCAGGATC CGTGGGAGGT CTATCGCCAT CTCAGGCATC CAGCGCCGCA 661 TCCTCGGCTT CCTCAAGCCC GGGTTCAGGG ATCTCCGAAG CACTCAGAGC TGGAGCAGGT 721 TCCGGCACTG GATACAACAA GGAATTCCTG CTCTACCTGG CGGGCTTCGT GGACGGGGAC 781 GGCTCCATCT ACGCCGGGAT CGCGCCGAAC CAGTCCTGCA AGTTCAAGCA TCAGCTGCGC 841 CTCTGGTTCG TGGTCAGCCA GAAGACACAG CGCCGTTGGT TCCTCGACAA GCTGGTGGAC 901 GAGATCGGGG TGGGCTACGT GATTGACAAT GGCAGCGTCT CCCATTACCG CCTGAGCGAG 961 ATCAAGCCTC TGCACAACTT CCTGACCCAG CTCCAGCCCT TCCTGAAGCT CAAGCAGAAG 1021 CAGGCCAACC TCGTGCTGAA GATCATCGAG CAGCTGCCCT CCGCCAAGGA ATCCCCGGAC 1081 AAGTTCCTGG AGGTGTGCAC CTGGGTGGAC CAGATCGCCG CTTTGAACGA CTCCAAGACC 1141 CGCAAGACCA CTTCCGAGAC TGTCCGCGCC GTTCTAGACA GTCTCTCCGA GAAGAAGAAG 1201 TCGTCCCCCT AGACAGTCTC TCCGAGAAGA AGAAGTCGTC CCCCTAGCGG CCGCTTCGAG 1261 CAGACATGAT AAGATACATT GATGAGTTTG GACAAACCAC AACTAGAATG CAGTGAAAAA 1321 AATGCTTTAT TTGTGAAATT TGTGATGCTA TTGCTTTATT TGTAACCATT ATAAGCTGCA 1381 ATAAACAAGT TAACAACAAC AATTGCATTC ATTTTATGTT TCAGGTTCAG GGGGAGATGT 1441 GGGAGGTTTT TTAAAGCAAG TAAAACCTCT ACAAATGTGG TAAAATCGAT AAGATCTTGA 1501 TCCGGGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 1561 CCTGAATGGC GAATGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG TGTGGTGGTT 1621 ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC 1681 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG GGGGCTCCCT 1741 TTAGGGTTCC GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA TTAGGGTGAT 1801 GGTTCACGTA GTGGGCCATC G SEQ ID NO: 22 (PCR template for mRNA encoding CGS-5/6) 1 CACAGGTGTC CACTCCCAGT TCAATTACAG CTCTTAAGGC TAGAGTACTT AATACGACTC 61 ACTATAGGCT AGCCTCGAGC CGCCACCATG GCACCGAAGA AGAAGCGCAA GGTGCATATG 121 GCACCGAAGA AGAAGCGCAA GGTGCATATG AACACCAAGT ACAACAAGGA GTTCCTGCTC 181 TACCTGGCGG GCTTCGTCGA CGGGGACGGC TCCATCAAGG CCATTATCCG GCCAGAGCAG 241 TCCTACAAGT TCAAGCATCG CCTGCGGCTC GTTTTCCAGG TCACGCAGAA GACACAGCGC 301 CGTTGGTTCC TCGACAAGCT GGTGGACGAG ATCGGGGTGG GCTACGTGTA CGACCGCGGC 361 AGCGTCTCCG ACTACTATCT GAGCGAGATC AAGCCTCTGC ACAACTTCCT GACCCAGCTC 421 CAGCCCTTCC TGAAGCTCAA GCAGAAGCAG GCCAACCTCG TGCTGAAGAT CATCGAGCAG 481 CTGCCCTCCG CCAAGGAATC CCCGGACAAG TTCCTGGAGG TGTGCACGTG GGTGGACCAG 541 ATCGCGGCCC TCAACGACAG CAAGACCCGC AAGACGACCT CGGAGACGGT GCGAGCGGTC 601 CTGGACTCCC TCCCAGGATC CGTGGGAGGT CTATCGCCAT CTCAGGCATC CAGCGCCGCA 661 TCCTCGGCTT CCTCAAGCCC GGGTTCAGGG ATCTCCGAAG CACTCAGAGC TGGAGCAGGT 721 TCCGGCACTG GATACAACAA GGAATTCCTG CTCTACCTGG CGGGCTTCGT GGACGGGGAC 781 GGCTCCATCT GGGCCCGGAT CAAGCCGGGG CAGTCCTACA AGTTCAAGCA TACCCTGGAG 841 CTCGTGTTCC AGGTCACCCA GAAGACACAG CGCCGTTGGA TCCTCGACAA GCTGGTGGAC 901 GAGATCGGGG TGGGCTACGT GACCGACGCC GGCAGCGCCT CCGTCTACCG CCTGAGCGAG 961 ATCAAGCCTC TGCACAACTT CCTGACCCAG CTCCAGCCCT TCCTGAAGCT CAAGCAGAAG 1021 CAGGCCAACC TCGTGCTGAA GATCATCGAG CAGCTGCCCT CCGCCAAGGA ATCCCCGGAC 1081 AAGTTCCTGG AGGTGTGCAC CTGGGTGGAC CAGATCGCCG CTCTGAACGA CTCCAAGACC 1141 CGCAAGACCA CTTCCGAGAC CGTCCGCGCC GTTCTAGACA GTCTCTCCGA GAAGAAGAAG 1201 TCGTCCCCCT AGACAGTCTC TCCGAGAAGA AGAAGTCGTC CCCCTAGCGG CCGCTTCGAG 1261 CAGACATGAT AAGATACATT GATGAGTTTG GACAAACCAC AACTAGAATG CAGTGAAAAA 1321 AATGCTTTAT TTGTGAAATT TGTGATGCTA TTGCTTTATT TGTAACCATT ATAAGCTGCA 1381 ATAAACAAGT TAACAACAAC AATTGCATTC ATTTTATGTT TCAGGTTCAG GGGGAGATGT 1441 GGGAGGTTTT TTAAAGCAAG TAAAACCTCT ACAAATGTGG TAAAATCGAT AAGATCTTGA 1501 TCCGGGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 1561 CCTGAATGGC GAATGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG TGTGGTGGTT 1621 ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC 1681 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG GGGGCTCCCT 1741 TTAGGGTTCC GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA TTAGGGTGAT
1801 GGTTCACGTA GTGGGCCATC G SEQ ID NO: 23 (Forward PCR primer for evaluating CGS-5/6 target site) 1 tgacagctct ggccttaagt gcctacgaaa ctag SEQ ID NO: 24 (Reverse PCR primer for evaluating CGS-5/6 target site) 1 gtctttcctc tttgctgtag ccttggtaga actactgcc SEQ ID NO: 25 (CHO-23/24 Insertion target sequence donor plasmid) 1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG GAGACGGTCA 61 CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 121 TTGGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA CTGAGAGTGC 181 ACCATATGCG GTGTGAAATA CCGCACAGAT GCGTAAGGAG AAAATACCGC ATCAGGCGCC 241 ATTCGCCATT CAGGCTGCGC AACTGTTGGG AAGGGCGATC GGTGCGGGCC TCTTCGCTAT 301 TACGCCAGCT GGCGAAAGGG GGATGTGCTG CAAGGCGATT AAGTTGGGTA ACGCCAGGGT 361 TTTCCCAGTC ACGACGTTGT AAAACGACGG CCAGTGAATT CCATACCCAG GGGAGCTGTA 421 CTGGGCTGCA GCCCTGCGCC ATTCAGCCAT GCACCAGGCT ACTCCCTCCT CTTCCAGCTT 481 TCTCCTTCTG ATGGCCATAG GATTAGAAGA TAAGGGACTC TAGTGCAGGT CAACTGCTGA 541 CCAGTGTGAA AATGCACAGA CTACATGCTG GTAGATCAGC ACTTCAAACT ACTGTTCACC 601 ATCATCTCTG GAATAAGCAC TACATTTACA GGGTTCAAAC CTCAATGAAT ATAAACAAAC 661 AAAACACACC TCCCTTCCTT CACTGTCTCC CATTTCTTTG GTTCCCATCT CCACATAGAA 721 TTTATAATTA AAATTTCTAA GTATCTTTCC AGAAATACTT CACACATGTT ATAAGCAAAT 781 GTGCTTTTAA AGATACTATT TTAAATTATG AAAATGGTTA TATTAGTTGA GATAAAAGAA 841 TAGAATGGGA AGTTCCAGAA TTTAAGGCCT CATATGGATC CCAGCTGTGG AATGTGTGTC 901 AGTTAGGGTG TGGAAAGTCC CCAGGCTCCC CAGCAGGCAG AAGTATGCAA AGCATGCATC 961 TCAATTAGTC AGCAACCAGG TGTGGAAAGT CCCCAGGCTC CCCAGCAGGC AGAAGTATGC 1021 AAAGCATGCA TCTCAATTAG TCAGCAACCA TAGTCCCGCC CCTAACTCCG CCCATCCCGC 1081 CCCTAACTCC GCCCAGTTCC GCCCATTCTC CGCCCCATGG CTGACTAATT TTTTTTATTT 1141 ATGCAGAGGC CGAGGCCGCC TCGGCCTCTG AGCTATTCCA GAAGTAGTGA GGAGGCTTTT 1201 TTGGAGGCTA CCATGGAGAA GTTACTATTC CGAAGTTCCT ATTCTCTAGA AAGTATAGGA 1261 ACTTCAAGCT TGGCACTGGG TACCGCCAAG TTGACCAGTG CCGTTCCGGT GCTCACCGCG 1321 CGCGACGTCG CCGGAGCGGT CGAGTTCTGG ACCGACCGGC TCGGGTTCTC CCGGGACTTC 1381 GTGGAGGACG ACTTCGCCGG TGTGGTCCGG GACGACGTGA CCCTGTTCAT CAGCGCGGTC 1441 CAGGACCAGG TGGTGCCGGA CAACACCCTG GCCTGGGTGT GGGTGCGCGG CCTGGACGAG 1501 CTGTACGCCG AGTGGTCGGA GGTCGTGTCC ACGAACTTCC GGGACGCCTC CGGGCCGGCC 1561 ATGACCGAGA TCGGCGAGCA GCCGTGGGGG CGGGAGTTCG CCCTGCGCGA CCCGGCCGGC 1621 AACTGCGTGC ACTTCGTGGC CGAGGAGCAG GACTGACACC CGAGCGAAAA CGGTCTGCGC 1681 TGCGGGACGC GCGAATTGAA TTATGGCCCA CACCAGTGGC GCGGCGACTT CCAGTTCAAC 1741 ATCAGCCGCT ACAGTCAACA GCAACTGATG GAAACCAGCC ATCGCCATCT GCTGCACGCG 1801 GAAGAAGGCA CATGGCTGAA TATCGACGGT TTCCATATGG GGATTGGTGG CGACGACTCC 1861 TGGAGCCCGT CAGTATCGGC GGAATTCCAG CTGAGCGCCG GTCGCTACCA TTACCAGTTG 1921 GTCTGGTGTC AAAAATAATA ATAACCGGGC AGGGGGGATC TGCATGGATC TTTGTGAAGG 1981 AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA 2041 AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG 2101 TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG 2161 AGGAAAACCT GTTTTGCTCA GAAGAAATGC CATCTAGTGA TGATGAGGCT ACTGCTGACT 2221 CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGACCCCAAG GACTTTCCTT 2281 CAGAATTGCT AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 2341 CTATTTACAC CACAAAGGAA AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT 2401 CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA CATACTGTTT TTTCTTACTC 2461 CACACAGGCA TAGAGTGTCT GCTATTAATA ACTATGCTCA AAAATTGTGT ACCTTTAGCT 2521 TTTTAATTTG TAAAGGGGTT AATAAGGAAT ATTTGATGTA TAGTGCCTTG ACTAGAGATC 2581 ATAATCAGCC ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC 2641 CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT 2701 TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA 2761 CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCCCC 2821 AGGAAGCTCC TCTGTGTCCT CATAAACCCT AACCTCCTCT ACTTGAGAGG ACATTCCAAT 2881 CATAGGCTGC CCATCCACCC TACTAGTATA TGAAAATATA AAGCGCTTTC TCTTTTAAGT 2941 CTAGGGTAGG TGTACTAGAT CAGCGCTCAG CTCCATACCA TGAAGCCATC CAGGAGTCAG 3001 ACCTCTCTGA CAGCCCTGCC ATTGTCACAG AGAAGTTTCT GTCACCAGTG CTCATGCTGT 3061 CAGAGGAGCG AAGGAGAAAA GATGTGAGAC CTCCCAAGTC AAAGTCATCT ATGGATAAAA 3121 CCTTAGTTGC ATGGCACACC AGTGTTAGGG AGTCGGGGAA ACACAGCCAT AGCCCAGCTT 3181 CCTCTCTGTT CTTGCTCTTA TTACCACCAG AAAGAGGTTG CTTAGACAAC CCAAACCAAG 3241 ACACAGGGCT CTGTGGGAGG GAATCAGTCC CAGGCTTCTG GCACATGCTA TGTCACCGGA 3301 AAGCCCCAGC CCTACTCCGA ATCCCCACAA GTACAGCAAA TATCAGATTA TAGCATTTAA 3361 AGGGGCACTC TTGCCAAAGA GAAGCACCAT TGGAATAGCC ATGCTTGAGA ACTAAGCTTG 3421 GCGTAATCAT GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC 3481 AACATACGAG CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC 3541 ACATTAATTG CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG 3601 CATTAATGAA TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT 3661 TCCTCGCTCA CTGACTCGCT GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC 3721 TCAAAGGCGG TAATACGGTT ATCCACAGAA TCAGGGGATA ACGCAGGAAA GAACATGTGA 3781 GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT 3841 AGGCTCCGCC CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC 3901 CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT 3961 GTTCCGACCC TGCCGCTTAC CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG 4021 CTTTCTCATA GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG 4081 GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT 4141 CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG 4201 ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC 4261 GGCTACACTA GAAGAACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA 4321 AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT 4381 GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT 4441 TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA 4501 TTATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC 4561 TAAAGTATAT ATGAGTAAAC TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT 4621 ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA 4681 ACTACGATAC GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA 4741 CGCTCACCGG CTCCAGATTT ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA 4801 AGTGGTCCTG CAACTTTATC CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA 4861 GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG 4921 GTGTCACGCT CGTCGTTTGG TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA 4981 GTTACATGAT CCCCCATGTT GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT 5041 GTCAGAAGTA AGTTGGCCGC AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT 5101 CTTACTGTCA TGCCATCCGT AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA 5161 TTCTGAGAAT AGTGTATGCG GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT 5221 ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA 5281 AAACTCTCAA GGATCTTACC GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC 5341 AACTGATCTT CAGCATCTTT TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG 5401 CAAAATGCCG CAAAAAAGGG AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC 5461 CTTTTTCAAT ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT 5521 GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA 5581 CCTGACGTCT AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG 5641 AGGCCCTTTC GTC SEQ ID NO: 26 (reverse PCR primer in the SV40 early promoter) 1 AGATGCATGC TTTGCATACT TCTGCCTGC SEQ ID NO: 27 (donor plasmid for inserting GFP into FRT Insertion target sequence) 1 GACGGATCGG GAGATCTCCC GATCCCCTAT GGTGCACTCT CAGTACAATC TGCTCTGATG 61 CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 121 CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 181 TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 241 GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 301 TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 361 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 421 ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 481 ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 541 ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 601 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 661 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 721 AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 781 GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 841 CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTGGCTAGC 901 GTTTAAACTT AAGCTTAGCC ACCaTGGTGA GCAAGGGCGA GGAGCTGTTC ACCGGGGTGG 961 TGCCCATCCT GGTCGAGCTG GACGGCGACG TAAACGGCCA CAAGTTCAGC GTGTCCGGCG 1021 AGGGCGAGGG CGATGCCACC TACGGCAAGC TGACCCTGAA GTTCATCTGC ACCACCGGCA 1081 AGCTGCCCGT GCCCTGGCCC ACCCTCGTGA CCACCCTGAC CTACGGAGTG CAGTGCTTCA 1141 GCCGCTACCC CGACCACATG AAGCAGCACG ACTTCTTCAA GTCCGCCATG CCCGAAGGCT 1201 ACGTCCAGGA GCGCACCATC TTCTTCAAGG ACGACGGCAA CTACAAGACC CGCGCCGAGG 1261 TGAAGTTCGA GGGCGACACC CTGGTGAACC GCATCGAGCT GAAGGGCATC GACTTCAAGG 1321 AGGACGGCAA CATCCTGGGG CACAAGCTGG AGTACAACTA CAACAGCCAC AACGTCTATA
1381 TCATGGCCGA CAAGCAGAAG AACGGCATCA AGGTGAACTT CAAGATCCGC CACAACATCG 1441 AGGACGGCAG CGTGCAGCTC GCCGACCACT ACCAGCAGAA CACCCCCATC GGCGACGGCC 1501 CCGTGCTGCT GCCCGACAAC CACTACCTGA GCACCCAGTC CGCCCTGAGC AAAGACCCCA 1561 ACGAGAAGCG CGATCACATG GTCCTGCTGG AGTTCGTGAC CGCCGCCGGG ATCACTCTCG 1621 GCATGGACGA GCTGTACAAG TAAGGATCCA CTAGTCCAGT GTGGTGGAAT TCTGCAGATA 1681 TCCAGCACAG TGGCGGCCGC TCGAGTCTAG AGGGCCCGTT TAAACCCGCT GATCAGCCTC 1741 GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 1801 CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG 1861 TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 1921 TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGCTT CTGAGGCGGA 1981 AAGAACCAGC TGGGGCTCTA GGGGGTATCC CCACGCGCCC TGTAGCGGCG CATTAAGCGC 2041 GGCGGGTGTG GTGGTTACGC GCAGCGTGAC CGCTACACTT GCCAGCGCCC TAGCGCCCGC 2101 TCCTTTCGCT TTCTTCCCTT CCTTTCTCGC CACGTTCGCC GGCTTTCCCC GTCAAGCTCT 2161 AAATCGGGGG CTCCCTTTAG GGTTCCGATT TAGTGCTTTA CGGCACCTCG ACCCCAAAAA 2221 ACTTGATTAG GGTGATGGTT CACGTACCTA GAAGTTCCTA TTCCGAAGTT CCTATTCTCT 2281 AGAAAGTATA GGAACTTCCT TGGCCAAAAA GCCTGAACTC ACCGCGACGT CTGTCGAGAA 2341 GTTTCTGATC GAAAAGTTCG ACAGCGTCTC CGACCTGATG CAGCTCTCGG AGGGCGAAGA 2401 ATCTCGTGCT TTCAGCTTCG ATGTAGGAGG GCGTGGATAT GTCCTGCGGG TAAATAGCTG 2461 CGCCGATGGT TTCTACAAAG ATCGTTATGT TTATCGGCAC TTTGCATCGG CCGCGCTCCC 2521 GATTCCGGAA GTGCTTGACA TTGGGGAATT CAGCGAGAGC CTGACCTATT GCATCTCCCG 2581 CCGTGCACAG GGTGTCACGT TGCAAGACCT GCCTGAAACC GAACTGCCCG CTGTTCTGCA 2641 GCCGGTCGCG GAGGCCATGG ATGCGATCGC TGCGGCCGAT CTTAGCCAGA CGAGCGGGTT 2701 CGGCCCATTC GGACCGCAAG GAATCGGTCA ATACACTACA TGGCGTGATT TCATATGCGC 2761 GATTGCTGAT CCCCATGTGT ATCACTGGCA AACTGTGATG GACGACACCG TCAGTGCGTC 2821 CGTCGCGCAG GCTCTCGATG AGCTGATGCT TTGGGCCGAG GACTGCCCCG AAGTCCGGCA 2881 CCTCGTGCAC GCGGATTTCG GCTCCAACAA TGTCCTGACG GACAATGGCC GCATAACAGC 2941 GGTCATTGAC TGGAGCGAGG CGATGTTCGG GGATTCCCAA TACGAGGTCG CCAACATCTT 3001 CTTCTGGAGG CCGTGGTTGG CTTGTATGGA GCAGCAGACG CGCTACTTCG AGCGGAGGCA 3061 TCCGGAGCTT GCAGGATCGC CGCGGCTCCG GGCGTATATG CTCCGCATTG GTCTTGACCA 3121 ACTCTATCAG AGCTTGGTTG ACGGCAATTT CGATGATGCA GCTTGGGCGC AGGGTCGATG 3181 CGACGCAATC GTCCGATCCG GAGCCGGGAC TGTCGGGCGT ACACAAATCG CCCGCAGAAG 3241 CGCGGCCGTC TGGACCGATG GCTGTGTAGA AGTACTCGCC GATAGTGGAA ACCGACGCCC 3301 CAGCACTCGT CCGAGGGCAA AGGAATAGCA CGTACTACGA GATTTCGATT CCACCGCCGC 3361 CTTCTATGAA AGGTTGGGCT TCGGAATCGT TTTCCGGGAC GCCGGCTGGA TGATCCTCCA 3421 GCGCGGGGAT CTCATGCTGG AGTTCTTCGC CCACCCCAAC TTGTTTATTG CAGCTTATAA 3481 TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT AAAGCATTTT TTTCACTGCA 3541 TTCTAGTTGT GGTTTGTCCA AACTCATCAA TGTATCTTAT CATGTCTGTA TACCGTCGAC 3601 CTCTAGCTAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT CCTGTGTGAA ATTGTTATCC 3661 GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT GGGGTGCCTA 3721 ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 3781 CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG GTTTGCGTAT 3841 TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC GGCTGCGGCG 3901 AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC ACAGAATCAG GGGATAACGC 3961 AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA AGGCCGCGTT 4021 GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG 4081 TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC 4141 CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG CCTTTCTCCC 4201 TTCGGGAAGC GTGGCGCTTT CTCATAGCTC ACGCTGTAGG TATCTCAGTT CGGTGTAGGT 4261 CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC GCTGCGCCTT 4321 ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC 4381 AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA 4441 GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG CTCTGCTGAA 4501 GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA CCACCGCTGG 4561 TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG GATCTCAAGA 4621 AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG 4681 GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG 4741 AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT ACCAATGCTT 4801 AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG TTGCCTGACT 4861 CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA GTGCTGCAAT 4921 GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG 4981 AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG 5041 TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG TTGTTGCCAT 5101 TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA GCTCCGGTTC 5161 CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG TTAGCTCCTT 5221 CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC 5281 AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA 5341 GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT CTTGCCCGGC 5401 GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA TCATTGGAAA 5461 ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA GTTCGATGTA 5521 ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG 5581 AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG 5641 AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT 5701 GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC CGCGCACATT 5761 TCCCCGAAAA GTGCCACCTG ACGTC SEQ ID NO: 28 (reverse PCR primer in the hygromycin-resistance gene) 1 CAGAAACTTC TCGACAGACG TCGCGGTGAG SEQ ID NO: 29 (CHOX-45/46 amino acid sequence) 1 MAPKKKRKVH MNTKYNKEFL LYLAGFVDGD GSICASIRPE QERKFKHRLV LRFEVTQKTQ 61 RRWFLDKLVD EIGVGYVYDS GSVSRYYLSQ IKPLHNFLTQ LQPFLKLKQK QANLVLKIIE 121 QLPSAKESPD KFLEVCTWVD QIAALNDSKT RKTTSETVRA VLDSLPGSVG GLSPSQASSA 181 ASSASSSPGS GISEALRAGA GSGTGYNKEF LLYLAGFVDG DGSIFATICP RQQYKFKHQL 241 RLRFEVDQKT QRRWFLDKLV DEIGVGYVYD LGSVSRYGLS EIKPLHNFLT QLQPFLKLKQ 301 KQANLVLKII EQLPSAKESP DKFLEVCTWV DQIAALNDSK TRKTTSETVR AVLDSLSEKK 361 KSSP SEQ ID NO: 30 (CHOX-45/46 recognition site sequence) 1 CAGCACGTCT CACCCCACCC CT SEQ ID NO: 31 (CHOX-45/46 forward screening primer) 1 GGAATCTGAC TGTGGTAAGC CTGTACAC SEQ ID NO: 32 (CHOX-45/46 reverse screening primer) 1 CAGCACTCAG GAGGTAGAGG CAGG SEQ ID NO: 33 (artificial splice acceptor) 1 TCTTACTGAC ATCCACTTTG CCTTTCTCTC CACAGG SEQ ID NO: 34 (SV40 polyadenylation signal) 1 ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 61 ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT 121 ATCATGTCTG SEQ ID NO: 35 (BGH polyadenylation signal) 1 CTGTGCCTTC TAGTTGCCAG CCATCTGTTG TTTGCCCCTC CCCCGTGCCT TCCTTGACCC 61 TGGAAGGTGC CACTCCCACT GTCCTTTCCT AATAAAATGA GGAAATTGCA TCGCATTGTC 121 TGAGTAGGTG TCATTCTATT CTGGGGGGTG GGGTGGGGCA GGACAGCAAG GGGGAGGATT 181 GGGAAGACAA TAGCAGGCAT GCTGGGGATG CGGTGGGCTC TATGG
Sequence CWU
1
1
641163PRTChlamydomonas reinhardtii 1Met Asn Thr Lys Tyr Asn Lys Glu Phe
Leu Leu Tyr Leu Ala Gly Phe1 5 10
15Val Asp Gly Asp Gly Ser Ile Ile Ala Gln Ile Lys Pro Asn Gln
Ser 20 25 30Tyr Lys Phe Lys
His Gln Leu Ser Leu Thr Phe Gln Val Thr Gln Lys 35
40 45Thr Gln Arg Arg Trp Phe Leu Asp Lys Leu Val Asp
Glu Ile Gly Val 50 55 60Gly Tyr Val
Arg Asp Arg Gly Ser Val Ser Asp Tyr Ile Leu Ser Glu65 70
75 80Ile Lys Pro Leu His Asn Phe Leu
Thr Gln Leu Gln Pro Phe Leu Lys 85 90
95Leu Lys Gln Lys Gln Ala Asn Leu Val Leu Lys Ile Ile Glu
Gln Leu 100 105 110Pro Ser Ala
Lys Glu Ser Pro Asp Lys Phe Leu Glu Val Cys Thr Trp 115
120 125Val Asp Gln Ile Ala Ala Leu Asn Asp Ser Lys
Thr Arg Lys Thr Thr 130 135 140Ser Glu
Thr Val Arg Ala Val Leu Asp Ser Leu Ser Glu Lys Lys Lys145
150 155 160Ser Ser
Pro250001DNACricetulus griseusmodified_base(36646)..(36646)a, c, t, g,
unknown or othermisc_feature(36646)..(36646)n is a, c, g, or
tmodified_base(38354)..(38354)a, c, t, g, unknown or
othermisc_feature(38354)..(38354)n is a, c, g, or t 2taaaactcaa
gatgccagct ttgtagctag cttaggaaac aaagtagtaa aaaataataa 60tgggtgggtg
aaggtctgaa gcatttacag agttctctca agacaaagca cagaggctgg 120tggccacata
acttggcaac tgatttgggg gaacagaata caagaaagga aatttaaata 180ctgtttttct
caatgttgaa ctatatgggc atagtcacag ctgcctaacc tatagagact 240ggaagctgga
acctcggcta tctaagatag aataatcaag aaatgtcaat tatttgagaa 300aaacatcagg
aataaatagc tgctaagtta caagttggtg ctttagacat ttggagagga 360taggatgggg
gctcccagac ctggggctcc ctaataaagc tgtgctggcc tacaagttcc 420agggatcctc
cagtccatgc ctcccactgt tgggactgcg ggcgatggtt tctgacgtgg 480gtactgaggg
cctgaactgt ccacacactt aagccacacg ccttttactg agtcatctcc 540tcatctcaga
acattttcct ttaatctttc ttaatgaaaa ggtcgcattt cttccgaggg 600ctagcctcct
gttactctct atacatgtca cataaaacta catgaaaact ttgaaggcac 660tatatgtcca
tactcagatg aaaagccatt agctgtggtc atacaaaacc ccacagacca 720actgttggga
aacatcagac ttttttcctg cagcgcctgc cctgatcttc cacagagaat 780tcagtctcac
tttttccagg atgacttctg aactatcacc gtaagatgag aatttgaaac 840aaagatgtaa
gtaatgaact tcatgtgttc tgaacacaca gcttagtgca ttgaaattac 900gtaacacccg
cttccttata agccatttct caaaatgttc ccattacacc tgcatcgggg 960atgggtccca
gaatcttcct tttaaataaa caccccagag gattctgaag ctagaacacc 1020aaggactgac
agagagaagc atgcctgtgg gcgactccag acacctggga gctgcctgct 1080ttcttgctac
tgatttagaa ggcatttgcc cccgaatggg gctgggggac tgtcactatt 1140tctcattctc
gggactttga aaggaagcaa aacagaaaac catgcaaagt ataagccacc 1200atggaataat
ggcagacgat ccggttgtgc agattagatt ttacatattg ctgattttga 1260agctaaagac
ctttcacttc ttaaatatat aataaaattc atacaagagt attttgtgta 1320ggtaactcag
tcagatacaa ggtaagcaaa gtaaatgata ggtgcccctt aacaaaatgc 1380attctcatag
ttcatttatc aattatagaa atggtggact ggagggaagg cttgaggtca 1440ggagaatgtg
ctgctcttcc agacagcccg ggttcttttc cccagcaatc tgggactcac 1500gtctgcctgt
agctccaggc ccaggggatc tggcaccttc ttctggcctc tgcaggcacc 1560catacacaca
tggcatacac acacatacac aaattctaaa attaaatagt aggttgtagg 1620cctacacaaa
aacatgcata cattaactaa ataattaata gttaataaat aaaaatcaac 1680caaacacata
cactgattaa gtaacatgac tctgtaaggt caaaggcggc tgaccagctg 1740tgggaagggt
taaataataa caatcacctt tgaaagactg gacctggtga ttaaggatgt 1800tccagctgtg
tcgtggatga gaaatcaaat gcataattga atgagtgcca ggaatagaac 1860tggagacttt
ctggtgagaa tgcttttact ggcagtagag tccctgtcta aacaggagag 1920agacctgcag
tagccctgtg gcggccctgc agtggccctg tgatggctct gcagttgtac 1980tcttcctgag
ataggagaca cactagagag tgtttctaat gagcagctcc tgtactttct 2040gttcccctgg
agaccgcacg tgtttctccg ataatacatt gacatttctg ttaaaccatt 2100ttcttcttgg
aacaaaaatg gagaacaaat cagattggtg tgtggtcttt taaataactt 2160ggtacttaat
aacacaaaac aaaattatca gaggctggat tttaggtgct ctcagcatct 2220gccacccctg
agccatcagt caggtcttgg aggaacaatc tccaaggaga aaacagttct 2280gtcctcagaa
aagctggagg aatatgagat tttctacagc actcatagca aaatcattta 2340cggaagggat
cctgagtaag atggcctctt cttcatcaca tggtcatagt ctgcttcaat 2400ggggagaata
gttcaatcta gcatcgagaa atcgaaggtt cccttttgac tggcaatgcc 2460ccatagatag
atagatatag attatgtata tattgtgtaa aacacacgta tgtatatata 2520atacacatac
atgtatgtgt atacatacat acatacatac atacatacat acatacatac 2580atacatagat
acgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 2640ttgagactga
gtttctctac tatgtagctc tggctgtcct gaaagttgct aagtagacca 2700gactggccag
accagatcca ccctcctctg cctcctaagt gctgagatta aaggcctgca 2760cccaccccca
cccagcccat cttatatttt gcttcatttc aaagtaagct ctatgcatca 2820tttattcctg
catattatta gccatggttc agtcttgttt gtgttttgga atatttactt 2880aacaaaactt
gaaaaacatt tttcaagatt tgtttgtttt taagatttat ttatttatta 2940tgtataataa
taaatattat tatgaaaaac ggtgttctgc ctgcagggca gaagagggca 3000ccagattgaa
ttacagatgg ttgtgagcca ccatgtggtt gctgggactt gaactcagga 3060cctctggaag
agcagccagt acttttaact gctgagccat ctccccaggc ccaaaataca 3120catcttaagt
gtattgccac aagcatacat cttcatggcc caatcttctg tccatcactt 3180cagacagctc
tccttctttc cctggccagt cacaacaccc tcagctatca ggaaaggccc 3240tatgggggtt
gttttgtttt cccactccag ttcccttgcc tgctctgacc tcatgagtag 3300actcatacag
gatgtgctca cttcacttgg gatgatttct ttttcaccca ttgttgctct 3360gcccagaatt
tgttcctttt tattgtctta gtgttaatca actatcaaag ccagcaacaa 3420aaaatagtag
ggaaactttt ttgatagggt aaacctgatt gattgcaggc tttggttgcc 3480ttgtttggtc
tatccccttg agagtccctt acaatgtgag ttagttagtg gctgctaact 3540agttgaatct
caacttcctt tttctttaat gtgggtattt gtaaggaata gcccccttaa 3600atctagattc
tgttctcaaa tcaagcaagc tcaaggctgt aagcatggat tcaccaactt 3660tcctgctcaa
ggaatttaaa tgtctggtct ccatcatatt actttaatag taatagttta 3720ttatacacat
gtgccagctg tatatccctt ttcttcttga tggacctatg aactctgttg 3780aggtgagatt
tgaacccctt agaaggtgct agagaagagg tacctgatgg tcaaggcaag 3840gctgatactt
attcatgggt cccacatctg ctaatgtaag caataacaga taatatgctt 3900tgtgtttaga
cccacagtgg ttgcatgtac actaagtatg tatcatcatt gtcttatcgt 3960tcctttagaa
tacagctaat aattatgacc gctattctca tagcatttat attatatgag 4020cattgtaaat
tattttgaaa tgctttaaga tatacttgag aactatgcat atcatgcgta 4080tgttgttcta
ccagctggga ccttgaaatg agatcccttg aggccagcat aaagagaaag 4140ttttcatctc
aaacaaacaa aagatacact tgataataga tgagggataa atgtcatact 4200ttttatatag
tgattgagaa tctacagatt tgggtatcct ggtcacttag gagaccaagg 4260gaggactatt
agctctagag ctatgaactt tatctccaga ttccaaagcc aatacaaact 4320ctagccaagt
tggggtgctg ttacctgtat ccctctgtca aattccaagt gttttcacca 4380cctttactgt
atctttccaa ctgttctctt ttataaccac acatagttca tggtctttcc 4440ttctctcact
tgactgtgga gtaacctaac ttgcgtgttt ccagttttcg atctcttcct 4500taaatctaca
ctagttaacc acaaagaccc tcttttctga gctgtgtcta ttctatcact 4560gtcaccattc
cttaatgctc tcccagatgc agccaaactt cactttgggc ttgagagtct 4620tctccaggtg
acagtgacta atgtctccag attgagcatc taccatctac cctgtgtatt 4680acacatgaat
agccttagct tttcagcaat agacagatag atccatagtt agccatgtca 4740acacccttct
tcatgctgtt ctcacagtaa taagtcctaa ttcctgtttt ctcccatcta 4800aactcaaccc
tgtcctaaat accttactca aatcctaatt gtatctcttc cacaaacatt 4860tcccccttct
ctccattaca aggtggaaac tcagagatcc aggtgtcttg catgttgttg 4920attctgtcct
caacaaggaa ttccccaggt tcctgcacga aggaaagcat ggaggaccat 4980acttgaggct
actggtgtag tgggaagaca ggcccaaacc atgtcacaga aacccatcac 5040cagaaagttg
ggggaggcag cccagttgtg gagcaggaga aggagaaaac aggcttgggg 5100aactgctagc
tatgctttgt cacagtcaca agaaaaaagg gccctagcct ggcctacata 5160ttctacaact
tcctgaatct ttgctctgaa atgaagaggt ttggatggct gtctgggaat 5220tcatcttgct
tgcagtgaag ctccttgggg tatttgaaac caggaagttt gaaggagttg 5280atgctaattg
ttttctaaag tgtgtgagga gtactggcag agttcaggcc ttgtgaggaa 5340agaatcctat
atctagtctg cactcctggg cacatgagac attcagctat ctcccttata 5400aagcatagaa
agtactcttg tacttgacac agaaataatt tcagtatgta gagcattaaa 5460aaaaagtatg
aatgacttag agagatggct catcagttaa aagcacatac tgctcttcca 5520gaggtcctga
gttcaattcc caacaaccac aaaaactcac acatatgcat gtgattaaaa 5580ataaaatctc
tctctctctc tctctctgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgag 5640tgtgtgtgtg
tgtgtgtgag tgtgtgagtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 5700tgtgtgtgtg
tgtgtgtgtg tgtgtgatgg tgggcttgtg tttgcaagcc cagcactagg 5760gagttaaggc
ctcactcaca gtgccaggcc agtctaggtt acagtgagtt ctagacagcc 5820caagctacag
agtaaggtac tgacaaagaa agaaagaaag aaaaaaagaa agaaagaaag 5880aaagaaagaa
agaaagaaag aaagaaagaa agaaagaaag aaagaaagga gagaggtgag 5940agggagggaa
ggaactggaa gggggaagga gggaaagaaa agaaaaagaa acaaccaaag 6000gaacaaacca
ctgtatgcca ttatacatta gctttgggct ttacaggtta tacactctat 6060attgtcatag
ccaatgtctc aatattccat aagaggtgtc tagttgtggg tatgttcttt 6120cttagtcctt
ttatttagac tacatgacct gtttttgcct aataggccat tagtaatact 6180gacttctcca
catgctgccc tcaaaactta ctcctggaag atctttattt aagctatgaa 6240cgaaaatctt
aaccctgtga cctgccaccc agaatgcctc tgggaacaac ctcaggcaac 6300ctatcaagcc
gcttttccaa catttggggc aacagggatt aaaattatga ttgttgtctg 6360cctgctgagt
tcaaactcac agagggacca gaagctgact cactgatatc aagcagttct 6420aaattttcag
tttaaaactc taattattaa acaggggatg tcctcagacc agcactcaag 6480agaaggagat
aggcagagct ctatgagttg agttataggc cagcctggtt ttcatagtga 6540gttttagctc
tccagagagt taccagcaag accctgtcac aaacaaataa aaacaaacaa 6600acaattaggg
gatatacata taactaaatg ataaagcctt acctagcaca ttcaagtccc 6660caggttcaat
tgctagccct gggtggggat ttggacaaat ttaaaaagac cttttttgta 6720tcacacataa
atatgactgc actggttgtt gttttccatg gaaacagaat caatgtggca 6780tgtattttac
ggcattagct catatagttg tgcaggctgg caagtgtgga atgtataggg 6840caggccagga
atcagaaatt gatacaaaat tcaggaaaga cctctgggtg caatggtgca 6900cacctttaat
tcaagcactt gaaaggcaga ggcaggtgat ctttgtgagt tccaggccag 6960cctggtctac
atagtgaatt ccgggacagc cagggcttca tagaaagaac ctgtctcaaa 7020acacacaaac
aatcagaggg aagggcttat tttgtttttg agacagggtc ttctatgtag 7080cccaggctgg
cctcaaactc atgctcttga tatgcccacc tcacaagtgc atgttaagat 7140tacaggtgcc
tgacacacac cacttttgtg aagtgctgaa gagtaagccc agggcttcat 7200ggacgctggg
caagcactgt gccagctgag ccacactccc cagtgtgcac gatactttgc 7260aaagatagat
ccatatggat gctgtgcttc tatctaaaca gaatgacaac cacactctgg 7320caggttctgg
ttcataactg agtcttattg gtcacctcct tctccatttt tcgctggtat 7380ttctcaagga
gagaccacaa atgagaagtg aagcctaact tttaatgcgg tctctcctat 7440gtcacctaaa
ttctagctca aacagggttt ctggctctta ccttttcctc gggtttctgg 7500atacttgaag
tgttaacggg catttctctt aaagaccaaa tctggccaga ttcaaatggc 7560tggccttcaa
ctcggcaaac taggaacaat aatgtccgct gcatgtggct tgtagcactc 7620tgtttctatt
catggacttg tgagtgattt ctgggaaaca cgaattataa gataagtcct 7680tttcagtgga
cttcacaagt tcaccctcag gtagtatact gtcaggtaga aacgtctttc 7740agagaagcga
gaggtgacaa gccctctggg ctggccattg tccctgctgg cattgaacag 7800cctgttcagc
acatgaaagc atcgcctgat gctcccaaag ctggagcact ggcagccccc 7860tgcagtcagg
tgtgtagggt gggttagcag gggtgcttag gcgggttttg tagttacctt 7920ttcaacacaa
atgcaaaagc cagagagaga gagagagaga gagagagaga gagagagaga 7980gagagggaga
gagagagaga gagagagaga gagagagaga gagagagaga gcaggaaagc 8040atccaggctt
tgaagcaagc cagccttcag ctctgtcctt gagccattct gagtggaatg 8100gagtaattgt
ctgcttggag aactgaagaa tagcacatgg caaagaacaa tttgtacctg 8160gaatatattc
attagcttgc atgtcaaaag gccacatgca gatagaaacc attatcttgg 8220cattctttaa
aaccttgcag ccttgagact tgaggtgcag aaacccacat gcccatgtga 8280ctgactacct
gtcgatctct ccagccctgc ctggctaaca gggacaatat agggggatgg 8340tgggagggga
cagcttagac tcctgtggac ttggattgaa agaagaacag ggaagacagg 8400ggactgtgca
aataagcact ctattaggac ctatttttgg tgtcttggga ccctcctact 8460ggtttagctt
aaattgagag gggatttggt ttgcctcact agctgtttct tcccactcaa 8520ttcacaatta
cagctttctt cattgtcatt aaaatacatt aaatgtgtac ttgttggggt 8580aaggctttct
gttgaaatct gcataaagac aatgtccaca gcccccagtc agtggaaaga 8640gcagtaggac
cagaaggcat gtgtttccat cccgagtcta tattggaatg tttgttaaaa 8700cctgcacttg
taagagacaa acactagaac catcagcttg caggtctaca ggccagtgtt 8760gccagtgcag
ataatgccca aactggaacc taaagatgaa ggcctttggg agctgaggtg 8820gaagagtcag
ctgtgatctc ccagatgtcc tcctcatgcc ccattgccac tctagcctcc 8880cacctccaag
cacatttggg atccaactgc taacccctgg tgttcttttc ttagttgaaa 8940ttctcaggga
ataacctaag agtctctgtc actcagtcta tggcatccta tgataacagc 9000caaggctaaa
tagccatcat tgttcttttt ccagatgctc agcaatgagg atgcagaggt 9060gaacaaaggt
ggttcagggc tgccctgatg atgaatttga caagccagaa tctaacaaga 9120tcagtcggta
aacagaatcc tccttcctat ccagagatgt tggcttgttc tgtcactgga 9180tgggcatcat
ttactataag tcatacaggc accagacact cagagataaa taacatgaag 9240tttccagtct
tatgcagtcc tgtctagttg acttgccagt attctcaagg aagttccacc 9300ccagcccctg
gcatccatag accaaggact ctggaatgtt ctgggaaagc tccacctgag 9360ctcctagcac
ccatatatcc aaagagtctg gaacgttatg gtggaagccc cacctctctc 9420tccccagacc
tcgccccctc aaaaagtcca ccaaagactc cccacccccc acacaccccc 9480agatgctcaa
gaccacttcc atagagtatt taaactgcct cccagaaaac agaattcatt 9540ttttcagtct
ctcttcccca tgtcctctca gggtgggggg caggggtatt agtattcaag 9600cacctatact
ggcctgtcct tggggttctg acaagatatg acctcagcta cagccactaa 9660gatcaccacc
tgtgtatatc cactatgctc ccttttaaaa gggccctgtc cacctcccat 9720tctctctgtc
tctctctctg tctctgtctc tgtgtgtgtg tgtctctgtc tctctctctc 9780tttctctctc
tctctgtctc tctctctctc tccttctctg cctgactctc cctccctccc 9840ctgctctctt
ctttcctgct gcttttgtcc ctagaggcta gtctcctctc tccccttccc 9900ccttttccca
ttcactttcc cccaataaaa aactctccac ccaagctcta tcacatggca 9960tcattctctt
gctccatgat tttaaaatca caatgaggag gggagcatgg aaaaattatc 10020caggaagact
ttatccatta aacctgggtg ctttttcttt cttccttcct tcctttcttt 10080ccttctttct
ttcttccttt cttttttcct ttcttccttt cttttttcct tttttccttt 10140ctttttgttt
tgttttgttt tgagacagcg tttctctgta gctttggaga ctgccctgaa 10200actcaatctg
tagagcaggc tggccttgag ctcacagaga tccacctgcc tctgcctccc 10260atgtgcttga
attaaaggtg tgcaccacca ctgcctggct taaaactggg ctttttctaa 10320gtcagtttga
tttggattgc tgcattggca gagaggttta ttggggtgca gaaacctttc 10380aaccagcttt
tgagctaatg atagagagaa gctcaaggaa ttggagcaat gcttgactag 10440ggatgtcaga
gggaggctat ccagaggagc ttacaactga ggtaaactta aaagttaggg 10500agtttgtcaa
cttcaaccca cagaatagag cagagccagg aggagctgag gcttctgagt 10560gttatggtgg
aagcatcacc ccaacccttg acatccatat gcctgaagag tctggaatgt 10620tatggtggaa
gttccaccca agcctccctt cccggtcgcc ctccaaaccc tgctacatct 10680cagaaatccc
accaaatgat gactccctcc cccagagata ttcaagacca ctcccacagg 10740gtatttaaac
tgccccccaa cccccagaaa atagatgtgt ggttttccaa tctctctttc 10800ctatcacgtc
tctggggagc tggcaggcca tttgggagca ttgtatccat taaacgactt 10860ctcagtggag
actctgaaag ccagaagagc ctagacagat agatgtcttg catactctag 10920agactacaga
tgccggccca gactattata tccagcaaaa gtttcaaaca ccatacaaag 10980tcaaatttaa
acagtatcta tctacaaatc caatattaca gaaggtgcta gtaggaaaac 11040tccaaactaa
gattaactat acctgtgaag acacaggaaa taatctcaca ctggcaaaag 11100aagaaaaacc
tctctctctc tctcctctct ctctctctct ctctctctct ctctctctct 11160ctctctctct
ctctctcaca cacacacaca cacacacaca cacacaccaa caccaatacc 11220atgaacaaca
aaataacagg aattaacaat aattgatgtg tgtgtatgtc cctgtgtgtg 11280tgtccttgtg
tgtgtctgtt tgtgtgtctg tgtatatgtt tgtcacctga ggggtggctc 11340ttccttggtt
tgtgaggttt ctacccaatc tataactccc ttttcttcat tcacttcctc 11400atgtccttac
tagtctctat tgtggattaa ggaaactgtg tggagaacag ttttcttcta 11460gaaaagaaca
ctagccatct catgtaatca aattggtgac tatcctaatt attatgagag 11520agcttccgtc
cagtaagtgc tagaagtaga tgcagagatc cacagacaag cactgagcca 11580agctccagga
gtcctgttga aaagagagag gaaggattgt aggagccaaa gagtcaagag 11640catgacaggg
aaacccacag agacagctga cctgggcttg tgggtgggag ctcatggact 11700cttgaccaac
aattagggaa cctgcatgag gccaacctag gaactctgca tgtgtgtgac 11760agttgtatag
catggtctgt ttgtgaggct tctagcagtg ggatcagggc ctgtccttgg 11820cgcttgagct
ggcttttggg aacctgttcc gcatgctgga ttaccacacc cagccttgat 11880gctgggggaa
gcacttggtc ctgcctcaac ttgatgcgcc ttgcattgtt ggattctcat 11940gggaggactg
cccctttctg aaaaagaaca aggagaagtg aataggggag gggattggga 12000ggagaggaag
gagaggaaac tgtgataggg atgtaaaata aattaaaaaa ttaattaatt 12060aaaaaagaac
acttgtactg gtagattggc taaaatgaaa caaagataaa agtacacagg 12120aaaaagagag
gagaaacctg gggagggggg ctccaaagag aggtgagggg gggatgggaa 12180tggcagctta
gtggaggaag gaagacatga cctacacgaa tcgagctgta gtttttatct 12240ggagcatagg
gtaaagatgt ttgaggagaa ggaggaacac atgcttgtaa aacatggtct 12300tcagaaccag
caacaatcat acagagtgtc cagggtccat gggcacatga aggacagacc 12360aacacatatt
taacagtaaa gtgtccatat ttggtatgaa agtgatgggt aaattgtcct 12420gggactgtaa
tttagttgta aaggacttgt ctggcatgtg ggtattcttg ggttccctcc 12480ttagcactga
aaaaaaaaaa aaacacacac acacacacac atatattcta gtgttttgta 12540gaaaaggatt
caaagaaagc catgatttct cttttgataa atccagaata atgtaataag 12600aacacacagt
ggtgtgattt cagcaatcaa gtacaggttg cttgtctgtt tgttgtatgg 12660gatggttggg
tggttgtttg cttggtttgt aagatgggtg ggtgggttgg tgggtggttg 12720cttggttggg
tagttggttg ggtgattggg tgggtgggta tttggttggg tgggtggtgg 12780gttggttggt
cgtttggttg ggtggggtgg gttttgtttt gagacaggga tttactctat 12840atctcagttt
gtctcaaact cactatgtgc acatgagtat gtgatgagat tatctaagac 12900catagtgtct
gtgttcatgg aatgtctctc tagcttagag aatttaaaaa atggccatgt 12960agggaaaccc
ctcagaaaag gagtttctat ggcctccaag aataagaatg gatcctccta 13020gctcggagtc
agcaaggaac tgaagccctt aattttatag acacaaagga atccattgtg 13080tggctccttc
ccagccaagt ctcagatgag tcacagacct gcatggcacc ttatgcagtc 13140ttttgaggtc
ccaagaatag gatgcagata agccatgcca gaatcccaac acacaaagcc 13200ttagtgatat
agtaaatatg tattgtgtct aggctgctgc atttctggtt atgctactgt 13260gcagtaatac
acaactaata cagatgtgat ggttaatatt atgtgacaac ttgagtgggg 13320cacagaggta
cagacacttg gtaaaccatt ctgggtgcac gtaaggatag ttttggatga 13380cataaacatt
tagattagta tgctgggtaa aatacattgt ccatcccaat gggcatgggc 13440tttgtccaac
tagatgacag ctggaataga aaagtctgcc tctctcatag ttctcaggcc 13500tttgagctca
gactagacag aactcacagg ttctctgagc tttccagctt gatgaatgtc 13560catggcagtc
ttcacactta acacctgaca gacttaatga tcatatgaac caattcaaat 13620ctgaccatca
ctcgggtcat tcttttgatt ctgtcacttt ggagaactaa taccgaggac 13680ataaaatgcc
atcacatcgt tattttcttc ctgtctgtga atatttttct tttttttctt 13740ggtttttttt
tttttttttt tttttttttt tttgtttttc tctgtgtagc tttggagcct 13800atcctggcac
ttgctctgga gaccaggctg accttgaact ctcagagatc cgcctgcctc 13860tgcctcccga
gtgctgggat taaaggcgtg taccaccaac gctcggcctg tctgtgaata 13920tttaaaatga
aaactttgga aatgttctga aaccagctgg tgtcagatag tcagagaact 13980ttcgtaaggt
aggtgtgggt tatagcataa tcccacacaa gaggctgaag caggaggatt 14040ttgtgtttga
gggcagctag agccacatgg tgagtccctg cctcaaaaca caaaagcaag 14100acaaaaacaa
gctccaaata agattcactg ggccctttct ttccttcctt ctcagtgagt 14160ccacttgctt
taaaatcagg tcttaaagac gcactagatg ctgaacttaa cagtaataat 14220aaatatcttc
tcttacagta cagattatgc tctataaaca ctgcactgat aaagttcagc 14280cttaaccttt
gttctgtaaa tgtttcctag tttttctact gccgtattat aagacaaatg 14340tcagcatgaa
ggcaggtttt tcagaaaaca cagcagctcc acagatggcc tctaatccat 14400aatcattaaa
gacaagactg caactttttc aactggaaat cattcaagat gtttttctga 14460agtccctacc
aggacacaag ccaccctggt tgctgtgtga catcagttag gtagactctg 14520aactggcttc
ccaagaaatt atacaaaagc aaggtgtcac ctagtattag cataacttct 14580gataactact
gtcttagctg gggtttctat tgctgtgaag agacaccatg accacagaaa 14640ctcttataaa
ggaaagcaat tattgggtcc agcttacagt tcagaggttt aatccattgt 14700catgattgca
ggaagtatgg tggcccacag gcagacatgg tgctggagaa gtagatgaga 14760gttctatatc
agattgacac acttcttcca acaaggccac acctccactc actctgagcc 14820tatggggcca
ttttcattca aaccaccaaa gctacaaggt agcttatacc ccagcttgct 14880atttctgatg
agacttagta aatagtctta aaagcccata aaatgactca aaactagttt 14940ttttattatt
attattagtt caaattagga agaagcttgc tttacatgtc aatcccttct 15000ccctctccct
catcaaaact agttttttgt tttttaggtt ttttttcaag acagggtttc 15060tctgtgtagc
tttggagcct atcctggcac tcgctctgga gaccaggctg gcctcgaact 15120cacagagatc
tgcctgcctt tgcctcccga gtgctgggat taaaggcatg caccaccaac 15180acctggccaa
aattagtttt aagtccagtt ctaggagctc caatgccctc ttttggcttc 15240catgggaacc
aggaacacta tatatatata tatatatata tatatatata tatatatata 15300tatatattca
ggcaaatatt tatgcatata aaaataaaat aaatcttttt tccttttttt 15360tttaaagaag
tgacattgtc ttggaatttt tgtggctgct ctgcccttat gtgtaactgg 15420acactaccag
catctaaaca ctggcctgaa accagccaaa gaaaaccttt gtgccaggtc 15480ctgtgtcaaa
gtattatgtt ccttttagga tatcctatat cctaaaggat ttattttact 15540gatagcatct
taacttcctt tgaaaggttg gtcttctcaa gcagtcctcg tggagctggc 15600tcctcagcta
atgccagggg acaataatga tcccctccca aaaccaaaca gaaaaccatg 15660gcaactctgg
tttccttggg cagcacctgc tttaagaatg agcaaatgac caatcagctc 15720atgaaactaa
atactctatt attactaaaa tatttttttg agacagggca tggaattcat 15780cacatagttc
aggttggcct tgaactcaga gagactcact tacctttgcc tcccacgtgc 15840tggaattaaa
ggcatgaacc accacaccaa acataacact tgaattttgg aagagtcctt 15900cttccaatag
atttgaggtt ttgaaaatgt ggcacagaaa atatgaattc aaatataatg 15960aaaacaagag
ataactttca actaagtttc tataggttct tgctaggaat cctaagcttg 16020tctgaaactc
tagagcttct gtttctagct tctgagtgtt agtattgtag gtatgtgccc 16080tgcctcagtg
tgatgttttt gataatctta aagaaatcaa agaaatttta taaaagacta 16140gactgtgcta
cacaaaaaga atattcagat gccaagaaag agttcttaga aattaagaaa 16200tatgctacta
gtataaatcc tttataaagt ggaatgacaa atctgatgaa atcttactaa 16260aagtagaaaa
acataaacat caaagacatg aataataaga aaatcatatt gtgcatatga 16320ttaacctaaa
acattaactt gcaaaaatag aatagtccca aaaagtaaac aaaataaata 16380aatcaccaag
aacatgatac aaggacaatt cctaggatga taaaacaaga atattcatta 16440taaaaggccc
tatcactaaa gcacaacaga aacagactca aaagataaat cttcattgtc 16500actggagaga
agtccatact atcatagcac tcagaaggaa ataaaaatca aaatgtcaaa 16560aaggacctca
gcctctgaaa cacaaataca aaatatgtcc cgccttcttg acacgcatta 16620ctcttcaatt
aacattttaa gaaaactata aactgttaaa gagagcttag tattttaaga 16680aatctgtagc
tatttctttt ataagcatga caactaagtt tccctgattt aaacagacct 16740aaaaaaccgg
tgaagtgagt ggagaaaggg gatacgaaga cagcatccca catgactgct 16800cccagtaaag
gcaaggtctt catccatttt atcctgaact ctgggaaatt tataaagaac 16860agaaatgtat
ttctctcagt tctggagcct cagtccagga cactaagtct aggtactaca 16920ctctcacatg
gtggaaagta gaaagcaagc tcacttgtca ctcactacct gatgcctctt 16980tcatcaatcc
cattgataag gaagagacct ggcatctcag tttcctaagg actcagctct 17040tactaacatt
agctgtcatt tctgggtcac tgtaacagaa agcctgacag aagcaaccca 17100ggggaagaag
gatgtatttt ggctcactgt ctctgaggat ttcaacttat cccagcaata 17160aagggataaa
ggcattgcag caggaatatg tgtggcagaa gctgtttatg tcacaataaa 17220caaataaaca
cacgctagcg cgcgcgcaca cacacacaca cacacacaca cacacacaca 17280cacagagaga
gagagagaga gagagagaga gagagagaga gagagggggg ggggcagaca 17340gacagacaga
gagggagaga ggcagagagg gagagagaga gagagagaga gagagagaga 17400gagagagaga
gagagagaga gagagaaatc aaaggcccac ctccatcaga ctggtcccat 17460atcccaaatt
tctagaacct cctaaaacaa caccatcaac tgagggagac atttttggat 17520tgaaagcata
atgccattac ccaggcagaa tctgcctgtc tgggggagtc acatttaagc 17580catggtatca
attgacctca tgtaatttca gaatactaca taaaactatc agatattttt 17640catgatgaat
ttctaaagct tgaaattccc tttgaataaa ggaccaacta cagaattttg 17700ctgagtctac
aattacatac atgaaaatgt aactacgaag tggccagcca caatgaaaat 17760taaagtgttt
gggtggtctg tctctattga tgctcttctt tgccctgttt ttttttaata 17820ttgttgatgg
tttgtttttc ttttaagata cttggcccca agaaaaaaaa tgacagcctt 17880aattaatttt
gtttactctc ctgacatttt aaaagacaaa tttatgaaga cctgactgtt 17940ccatgtagta
ttagaaagat gtaaaattaa gggttgctta agctgtgtag aattgaagag 18000cacagcattt
gagtgacagg gtacaattag agatcatcag ggatgtggca caaagtgtac 18060tcaacctcac
cttttcctgc ttagcagaga acagggtgcc tcggtgagat aggaaattaa 18120tcaaatagaa
gaagaaatag taattttaga aggatcaaat tttcctggtt agaatgatca 18180aaactacaag
acttgtaact aaaatatagt caaacccatt tcaactggaa tctgtgctat 18240tcatgtatag
attaactaga atctaatttt taaattttca tcttacttcc aaaaatattt 18300gtccaaatac
tctgtgaatg cattagtttc ttatgggaaa acatcatatc ttttgtacaa 18360tgtgtttctt
agcttgaggt tctctccaaa caggaccaag acgaggccag gaccatgtga 18420tacaacccat
agtcctcaag aaatagttgt cattttctta ttccaattgc atcccaaggt 18480ctcatctcat
tttgcgtgtg cctttgacac cccataccca cataaactaa ggtggtgtta 18540ttttttgagg
ccctgaaggt atcttcagga atccataagt gagccttaag ctgcatctgg 18600atataggaat
ctgaaagtgt cccttctctg catgatctct tctttcagtt tttcaagtca 18660gtgtgccaca
ggaatcagga acgataaatg gagaggggaa gtgcagttgc ttggtataga 18720caccccagag
ggctatttgc atcctgtcct tcaaaatctc tctgagcctt cctgcctaag 18780ctgttttgag
ttgggtttgt ggtaccagaa cccctgcccc cgccccattc tgactaatga 18840gagagagaga
gagagagaga gagagagaga gagagagaga gcagcagagc atagaatgaa 18900agtaggttag
aagggcaggt aaaagcactt tagacaagag caggtataag ggccttggac 18960tccctcccca
gaacacacac atgaaggtaa acgatggtta aaggatacag ataggatgtc 19020gaagctggac
gatcacttgc ttttgtgtgc ttgaagtgac aggctgtggc tttcgggttc 19080atggggtctg
ttgttgagtt cacagtctca ccatgttagc aagcatgtca ctattaagct 19140ctatccccgc
cccccttttt tgagacatgg tcttgctaac atacccagac cggcctagga 19200agcactttgc
agtctcagct cccctgagtg ctatgatcac tcgtgtgagc tacagtaccc 19260aaaccagaat
atgtgtgttg ggtgttatga gagtttacac attgctgcct tgaatgctgc 19320tctgcttgag
ttcctgtagg aagctgagct gggaacctaa gcttcctcct cccagatagc 19380agtaaccctg
cagagacctc ccaccaagac tagctaaccc ctccttcttg tgctgtactt 19440agcaagaacc
ccaaggttct gggtccttgt gctacagttc cagaagagta tgaacaatct 19500tagcttttct
gtatatgtgt ctgtgtctgt cctgtcagat caagtcccag cctcactgta 19560tgcaacatga
aaggctgtga aaactgtgca ttttgagaat gaacatcatt agtctccagt 19620aagttcaaaa
acaaatgaag gcagccactc ataagggtct ttaatgaggc aagggggcaa 19680aagggtggtt
tctgtttgtt caaagaagcc tgtcatacat tttcagaaaa tttagaaaca 19740cgtatcatgt
catttcacgt tagtatgaag tccttataat tcatttcata ttaaatgatt 19800tcctttggtt
agaagcaaaa ttatgcataa aatgtgttcc tttgtgtttg gagcaaaatt 19860acaagttaca
ttattagtta atattctagt tcttattttt cccaatctcc aagaagcaaa 19920atattcccct
aaaccctaaa gcatcaaatt atcctatcac acagtgacca gtcatcgtaa 19980cctaaatatt
aaagcatcag attatcctgt ctatggtgac cagtcattgt aacctaaata 20040ttattgtaat
gtggattaga gttaactata ccttttcatc acactataat gtaaacactc 20100tccaaatctt
tcaaagtctt gaaaacacaa tttataaata ctgtgttctg tttgttttga 20160gacctgatcc
ggttaggaat ttcaggctgt cctcaaactc atcatcttcc tgcctcactc 20220aggtcctaag
tgctgagatt aaaggtctat gctaccacag ccatacgaat gccatgtctc 20280catcagctta
tcacttctta acttttttct tttcttcttc tacatactgc tgagtaggag 20340catcgatgac
ctcagcctag taggaatggt tcccatgtga acccttaatc tgtaggaaga 20400tgctggactt
cttccattaa gactgatctc catttgaact tgacttgtct ctctcttgtg 20460tggagctacc
atcccatata taatcttctg gtttataaac agattgcttt accctcaaga 20520tcctttgcta
gcgcagcaat gtaagtttta atacaaacag taaggtctct gattggagtg 20580tcatggtttg
gttaagtgcc ctttccaagg gcccatatag ttaagggctc aaccaccaag 20640tgatgcttgt
ggataggagg cagggcctag tggacagtct ttaggtcatg gagctatgct 20700gttgaggggg
actgtggggt cctggtcttt ttcccactcc tttttaggtc ctagctatga 20760ggtgagtggt
tttgtcctat caagcacctc tgtcctgcca tggtgtaatt gattataact 20820acaacctctg
aaactaagcc agtataacct atttatctca agatgtaact tacaggtaat 20880ggtaagataa
agctaacaaa agacaaattg ttataatcca ggcaagcctg gccccatccc 20940ttgggggcat
ggcacagagt gtgtcaccca tctgtgcatg gcaagcagta ccctgactct 21000gtatgctgat
tcaaaggtcc cttaaagcaa actcctccca cttcctctct ttttctgcca 21060tttctctgag
gagggaggcc actgtctctc tgtctctctc tgtgtctctt tttctatctt 21120cctctccctc
tcttcccttt ccccaataaa ctttccacat taagttttgt ctgaaggtat 21180ctgtttgtct
ctcacccgcc ttttaggccc cacctaccat gggatctgcc aaaggtctca 21240cctcgagctg
tattcataac acaaatgaca gacaaagatc aaccctgaag actagtagga 21300tgtagaaggc
ctggagctga cctgaagaac actgctgact tcaacattgc ccatccgtca 21360gttatgtagc
attaaagtta tagtggttcc tcagaaagca gtctcctttg aaaacttctc 21420gttttgtgtc
taaatggaat taaatacctt gttcccgaat aattgtttta gttctcttga 21480aagatcccgt
atacttacta ttaagatgta tataaacctc aagctgaaag aatgacttcc 21540cctatggcca
gatcacaaga ctctccactg atgtgcccgt tgcaacctga ttagaggaag 21600agggtcaaag
ttccccaaga ttcagctgag ttcatgcaag ttttagaaaa aaaacaagat 21660gttcctccac
agttagaaag gagtggggct ggagggatga ctcactgaga aaggttattg 21720tcgtacaagc
atgaagacct gagctcgaag cctggcaccc atgtaaaaag aaaccatgca 21780tggtagtgtg
catcttcaat cccagcattg gggagacaga gaaagagaaa gggacatccc 21840tagagcttcc
tggtcagcca gccttggcaa gccagtgaac tccaggttca gtgagagacc 21900tgtctgggga
ggaaaaaggg agggagggag ggagagagag agagacacac acacacacac 21960acacacacag
agagagagag agagagagag agagagagag agagagagag agagattgag 22020gaagatacct
gatatcaacc tcacacactc atgtacccat gtatgtaggt accttcacac 22080acacacacac
acacacacac acacacacac acacacacac acacacacac acacacacac 22140acggatggtg
ttgaattcta aggctcttat ccacacatat atggagacaa atagaagaat 22200tacagtcgtc
cctgcctttg acgctactct gtttctccaa ccctgcttcc cagatatttt 22260tcaacatcta
ctcagccttg agtggttgca ctctgacccc aggacctctt tctgtgactt 22320ccttggcctc
ctgttttgtt tttctgatgc taaaaactga atctggggcc tcatgcacac 22380aggaagatgc
tataccaatg agctacaatt ttgttgccct ttttaatttt tgagatggtc 22440tcactaaatt
gttcaggatg gcccacttgt aattctcctg ccttagcttc ccaagtagct 22500gggcttttat
acagatctgt gcttccacac ctggctgagc agacactcat gatttcattt 22560ctgctaatca
ggtagttttc ttgcccctcg ctgccatttc ctacctgcct ttccttgcca 22620actaaactgg
ttcccacaag cgacaggcta tcatttctca gctcttccac aggttagctg 22680tgcaatttgg
tatgaatcat ttagcaagcc cagttctcct ctttgtaaaa cagatgattt 22740agatgaaatt
ttttcaaagt tctctttgaa ttaaaactat cactgccttg cttgctctct 22800gactcttgga
gaccatggcc tatccctgat tagtccttgg tccacagaag gatgggtggc 22860attggatgtg
ctgaacaatc aggtactttc atgtcacttg gagtcttaca gtaactgcat 22920gtttcaaatg
aatcctttct ggctctatta gtttcttttt tgtcactgtg aaaaaaacac 22980ctgaaagaaa
caaggcacgg tttgttctga ctctcggttc agaggatata gttcaccatg 23040gaggcaggag
cttctcacag ctgtaacagc catggagtca ggtggctagt tacagtcagc 23100tggccttagc
agtcagagag ccaagagagc tcagttgagg agagtccagc caggctgtag 23160cccttaggac
ctgctcccca gagatccact ttctacagta tcttctaaac agtgtcacta 23220gatggtgacc
aggtagtcaa gcacatgagc ctgagggata atatcattca aaccatagga 23280ttagtctaga
actgaaccag atcaagaacc aggttttctt ctcacataat agataccaca 23340catcatgttc
tcatatagag tgtgatctag gtattgtttc tccaaatgga gaagccaaca 23400ctggatgact
tacatagaaa gaaagagagg gaggaaacaa gcaagggagg gggaagagtg 23460agaattattg
gaacagtacc agtgcctcaa aatccttggt ggactagaga attagcctca 23520ggaagaagcg
actaggcttc ttacagcata gacatacagt tcttaccaga ggcacagcca 23580tcatgggtgc
catggggagc atgaagttca gctccatcca gccattccta gcgatttctg 23640gcaacctctg
tcctttgaga cacttcctga agatataaga gtccagggag agacatctga 23700ttgctttgat
cccaggatct tgggatggaa ttggtgttgt ctctgctcca gctccagggt 23760caggaaggtg
aaactggaaa cacaagctag cttttcttac ttagcaaaaa cccacaggtg 23820acataaaaga
cagattgaca cgagaacagc atggcagatt tatttagtca aagttttacc 23880agacacaagc
accttcagaa aggtaaagtc agagacctta ggggaatttt cttgccagaa 23940tttttccaga
agaatcaaca gccgtgtaac aataggacta gataaacaag taagactgga 24000cctgcagcac
aaatgtgaca ataggagttg gaatccccag gactcacata aagccatggg 24060agccgaatgt
aatggtcact tgtagtttca gcctcagatg ggggtgggga ttctccagaa 24120taagcaggct
agcaagacta gccatgttgc caagctctgg gttatattga gacactctgc 24180ctcaatgagt
aagtggaaga atgatggagg ccaacttcaa ccttggactt ccacatgaac 24240acacatacac
aatgcaacca tgcatccaca gtgtatgtac acacacacac acacacacac 24300acacacacac
acacacacac acgcaaatgg acaaagaaag aggtaaaacc tacaaggaat 24360caactgaaca
gaagccaact ggtctgcctg ttcagatcct ttttggcctc tctgtgtgct 24420tccctttctc
ctgggcatgg ggcaggcagg atctgtatgg ggtgagggtc ttcagagaag 24480cgaacagcct
tcctaggttt tatggctcag tttggtggag aggggatcta gtttctctta 24540atcatctttt
taaaaattta ttaatttatt ttttatattc caatcccagt tttccctccc 24600tcctctcttc
ccctccccca cctcccatct gttccttaga gagggtaaga cctcctctag 24660gaagtctact
aagtctgccc catcatctca ttgaggcagg accaaggcac ctctccaccc 24720ctacactctg
gtgtctaggc agaacaaggt atctctccat atagaatggg ctccactaag 24780tcagtttgtg
cattagtgtt agatcttgga cccacttcca gtggcctcat atattgtccc 24840agtcacatcg
ttgtcaccta tattaaggga gtctagttcg gtcttatgca ggttccccat 24900ttgtcagact
ggagtcagtg atctctcact agctctggtc agctgattct gtggtttccc 24960catcatgatc
ttgactcctt tgttcatatt gtcactcttg cctcacttca attgtactcc 25020aggagcttgc
ccattggtta gttgtggatt tctgcatctg cttccatcta tttctggaag 25080agggttctat
cttctctggg gttgtgaatt gtagactggg tatcttttgc tttatgtctg 25140gtatatgctt
atgagtgagt acatacaaca tttgtccttc tgggtctggg ttaccccact 25200caggatgttt
tttttctagt tctgtccatt tgcctgcaaa ttttagaatg tcattgtttc 25260ttactgctga
gtagtactgc attgtgtaaa tgtaccacat tttctttatc cattcttcag 25320ttgaggggca
tctaggttgt ttccaagttc tggttattac aaataatgtt cctatgaata 25380tagttgagca
aatgtccttg tggtatgaat gtgcctcctt tgggtatatg cacaaaagtg 25440atatttcagg
gtcttgaggt aggttgattc ctaattttct gagaaatcga catactaatt 25500tccatggagg
ctgtacaagt ttgcactccc accagcaatg gaggagtgtt ctctttactc 25560cacatcctct
ccaccataag ctgtcatcag tgtttttgat cttagccttt ctgatcagct 25620taaaatggta
tctcagggtt gttttgttaa tcatcttgag aaaaaggaat tctattttct 25680gtgactggct
ctgagagaga gagaagaggg aaaggtggga ggaatgtgtg ctttcaagac 25740cttgtgttct
cccttagctc aaagtactca ccatgaaaaa ccaccagcct ttggaggagc 25800atgctcttgc
agaggcaaga tcctggcttc ctcccatctt gaatttgcca aaatagcaaa 25860gatgtttggg
tgctggacag ccaaaaatga cagctgctca cttcacagct tcctcacgta 25920tgattacaac
tccactcatc atcaagcttt aattacatca tgagcaggct tatggctgag 25980ccgttatcct
cgcatccctt cgtctcatca ctgattcaca caaatcacta ggtgctccgg 26040ttaatgaaaa
catattcatc agtacagtga ctaattcatc aggccaacat ttacatggct 26100cctctgcatg
acaaaaatga atgtttagaa tgaataatga gtcaccagag gtgggggaca 26160tcttctgagc
acaggttgcc cttgtctttc ctggtactca atcccggctg aagagctgaa 26220caaagctgag
gttatttttc ccatgacagt gcattgtggt ttagagatct gtaagcggct 26280tatcttgatt
ggcagtttga ttggttctgg gatgtactaa gagacgtgcc tcatgggcat 26340ttccagaaag
aattaactga gggggaagct cctcgccccg agaatgggta ggagcatctg 26400gtggggtaca
gatgtaaagt ggtccaaggg agaagccgca tggcctgcct gccttcactc 26460cttgctgctg
agtgtgttta tcccatctat cccgttgttg cttctgttgc agttgcaatc 26520ctgcttctcc
aggccccagc gtagactgaa cagtggctgc ccagaaattc ccaattgaag 26580cagccgaatg
gtggactgag cacctctcag tcttcagtct ctctagtttg taggcaacca 26640ttgttggacc
caactcttag tagtaagcca atctactaaa tacagaaagg ccagtgagat 26700ggctcagtat
aggtgcttac caccaagctt ggtgacccga gttcaatccc caagactcat 26760aaggaaagaa
ctaactaccg agagttgttc tctgagctcc acacatgctg aaacatgggc 26820ctccacatgt
catgaacatg ttcacacaat acatatttat ctctatatat tcatttctta 26880taatttttag
aaaatttcat tttatgtata tgagtgtttt atctgtttgt atgtctgtgt 26940accacatgca
tgcctggtgc ctgaagaagt cataagaacg tatcagattc cctctaactg 27000gagctaaaag
aagattgaga ggtacctacc atctgagtgc taggaaccaa acctgtgtct 27060tctggaagat
cagtaagcat gcttaaccac tgagccatca tgccacttat ttgtaacaca 27120tatccatcct
attggttaca gtcctgactc atacagttag atagctgagg aacctagaat 27180tcttctgctt
ttttattaca aaacaaagaa ttttatctga cttacagttc tggccttagt 27240cagggagctg
cattgggaga tggcttctct actgtcagag tccagaggtg gccgtaaagt 27300atcatatgac
atgaggcaga aagtctaact tacttgagag ttaacttgga aatgtccaaa 27360gagacagggg
gctaagtccc tcttattgaa gagaccttcc atagaagtta gcctgacaga 27420tggccttgcc
tgaactgcat tgacagtctt acttggaagg cctgttttgg ttcctaagaa 27480attcaaggat
ccaccagaga agtgtgcagc cagcaagctg gactccctat cccaagcccc 27540agctcctcct
cagggacctc agcagtcctg tgtctagctt acctcagcga tggggggaaa 27600gatgctgttt
tcctgctaag agcacactat tttatattat tgttgacaca ggttggactg 27660catgtaacag
actctccaac aacacagtga agatacaagt gtgttttgct gcatttaaat 27720gtctccccat
ctgtccctgc taagacacct actgtccttc acatgtcact gaaaactcca 27780ccccttatga
gaagtcttcc ctgatgccat ctagacaagc taagagtgct ctgctctgca 27840ctgagcagct
tctcaactct ggggttatca ttgctctgca tcacaattag cacacgtggt 27900agtggctgtg
tttgtgtttt tccacaccat gagtccagac agcatccctc tcaccagcac 27960gccataggca
caagtgctca agagtagcag gacttgaaca tgtgtggttt atcatacaga 28020cagctgctgc
tcagagacca gatcaaattc aaagcaaaat agagagatga tggttcctgc 28080catgagcgta
ctgaacaagg acaaacatca ccatcataag gaactcagct gacagggagc 28140ggtcaccaaa
cttttttttc tgtaaagtga caaaaatagt taagtatttt gccctagaca 28200tagtgggtgg
tacacatgta atctcagcat ttgtcagagt gaggcagaga gttgaatgct 28260gggctacgta
gatagtctca aaaaataaat aaataagtaa ataaataaat aaataaataa 28320aaggaagaaa
taaaaaaaag aatttgttac tcaactctgc acaatggtgc aaaagaaaca 28380ataagcatta
tgtaacctag tgggtattgg ctgtttcact ttactaacag gcattgaaat 28440ttcaattttg
caaaattttc atgttccata ttacccttat ttttattctc ccctataaat 28500ggtgactcac
caatacgcaa ctggataaga ttagggtatt tttattaggg aatatgcctt 28560acttacagag
cacctaacca gccagcagga aacatagtaa agtagcgcat gccgatgaaa 28620caaggaaaaa
gaagaactac catgtgtgac ccctaaccct taaaacctct cccacatcac 28680cctgaccatg
cccattaggc gtggtcacct agccagcccc taggaggcat ggttacggtg 28740tccccctaca
ctcccctaat catttaaaga tgcaaatgca tgcttggtga tgggctaacc 28800ttggctcatg
ggctaatctt ggctcatggg ctaaccttag ctcatgggct aataatcaag 28860gtttactaat
ctctgtcaga cagccatttt ttttttgcag agaagaatcc ccatctttgg 28920atcatttatt
tattcctttt gtatatttga tgcaatttat aaccacaaga acctactatg 28980tgactgcact
gtgccagatg gcagagaaag ctaagccccg attcttgtgg catggactca 29040cacaactcca
gtacaggact gttagtgaca atctccttaa ggcataagca tactgcagtg 29100gcagcctctg
ggttaggaga caaggataca gtttatgaca cctggtatct ggaaggcatg 29160aaacatgtca
aatgctggct acacctaaga atcagcaaca tctagtctgg ccatagccta 29220ggatgaatgt
cacagggtct taggccagaa atgtatggcc gagctgtagc agggtcctct 29280ctagggccag
aattaattcc agtgtgatgg acagccaaga ccacagggat aacaaatgag 29340cagtgccaat
gacacgtgct tctccttatt attgctgcac agtgtttgtt acacatagca 29400ttttcgcaca
gtaatataat gtgcttgggt catcttgctt catatcccat cactccctcc 29460atctccctag
tgcctcccct gttacctttg cttctcagtt ttgtttctgc tttgatgtca 29520acagcacata
caagatttta tgcaatacat cacttcctga atggctctat ttggaaatca 29580ctaaaaggta
atttatggaa catttggggt ctttttgatt ttctaattta ccaaaaaatc 29640cacctgggga
aagacaatgg agttcaagga cttctaagag gggaatgtac catggtatgc 29700tccagccagg
ggaaccagtg cttcccagga gctatggctt acaaagtggg ttatcacatg 29760aaagcaagac
taaaataatc atctcaaata ttcattagat gtgggactcc taaccatctc 29820acaatgcctc
cctcggtcta cattaaataa gaaacctcca ttttgtgctt tgcgagaaaa 29880tgactgaaga
ttatacattt ggccttgaag tggaagtatt tttgaaaatc atgaatagga 29940aaataataaa
tctctcattt caacataaaa tataagggac aaggacatct actcatgctc 30000caaggacgga
cactgaattt tccatcaggt agttgcagaa cgctgtgtcg ctcaatcaaa 30060aattcaggat
gcattgctca gagtgcatta tattaaaaga tagcatcttg gaacacagga 30120tgctcaggaa
atgggaggga cattaatctg catgcagtga tcatctcctg caaagcgggc 30180atgagagcct
gatgggagac aagccatcca gatgcccata cccaggggag ctgtactggg 30240ctgcagccct
gcgccattca gccatgcacc aggctactcc ctcctcttcc agctttctcc 30300ttctgatggc
cataggatta gaagataagg gactctagtg caggtcaact gctgaccagt 30360gtgaaaatgc
acagactaca tgctggtaga tcagcacttc aaactactgt tcaccatcat 30420ctctggaata
agcactacat ttacagggtt caaacctcaa tgaatataaa caaacaaaac 30480acacctccct
tccttcactg tctcccattt ctttggttcc catctccaca tagaatttat 30540aattaaaatt
tctaagtatc tttccagaaa tacttcacac atgttataag caaatgtgct 30600tttaaagata
ctattttaaa ttatgaaaat ggttatatta gttgagataa aagaatagaa 30660tgggaagttc
cagaatttaa ggcctcatat gaaaatataa agcgctttct cttttaagtc 30720tagggtaggt
gtactagatc agcgctcagc tccataccat gaagccatcc aggagtcaga 30780cctctctgac
agccctgcca ttgtcacaga gaagtttctg tcaccagtgc tcatgctgtc 30840agaggagcga
aggagaaaag atgtgagacc tcccaagtca aagtcatcta tggataaaac 30900cttagttgca
tggcacacca gtgttaggga gtcggggaaa cacagccata gcccagcttc 30960ctctctgttc
ttgctcttat taccaccaga aagaggttgc ttagacaacc caaaccaaga 31020cacagggctc
tgtgggaggg aatcagtccc aggcttctgg cacatgctat gtcaccggaa 31080agccccagcc
ctactccgaa tccccacaag tacagcaaat atcagattat agcatttaaa 31140ggggcactct
tgccaaagag aagcaccatt ggaatagcca tgcttgagaa ctggtcctac 31200ttactgcaga
accatggata caggctccct tttgtagatg ggcttaataa atacttctat 31260aagtgatact
ctgctttgtg aaaatgacct cgtcaatatt caaagtaatc ctctggttta 31320ggactactat
gaacctgtgg ggttcattgt tcatgtggtt aaacagcaaa gagtagttag 31380acagttgtcc
tacgtcacag agggggacat atgctatgct tggttaaata gctgtcctgg 31440tcagagggga
ggcatgctat tctgcccttt ctgacagacc ctgattgcat agacatttca 31500gtgagataaa
ggaaggaagg gaagaaggag gaaagacaac attttttgct tctgttaagg 31560tagagactat
ctgtgatcca gttcagcaca gtgcctgtga gtagaagcta caggtcaggc 31620aggagccaag
gaaatgtatt gcttttctaa ttgaacaaag gacacacagc tgccatttat 31680tttcttcatt
ttgacccttc agccctgcac tgtggatatg acatcaagaa actaagcagc 31740cattttgtga
aaatgagatc taagttagta aatgtggctg aaaaagaagc cagctgcatc 31800ctccctggat
ttacgagggg gaaatgtagg catactaaat taaaacacta aaattgaccc 31860aaagctattt
tgactgatat ttaaatatag attctgctcc tggacattcc agagttcata 31920ggacagttgc
ttctgttcag aggattcctc ttcggggttg cctctccttc cttaggcctg 31980cttgtcctgc
ccaaagctgc ccaagtgcat caggccccaa accaacttct ccatcctgac 32040gcacagcaga
ctaaatatgc aactttgtgt ctcttcatcc caggacaaaa ctttcaccca 32100gcccctgaca
tctgagactc tactacaggt tatctattaa atcttttata aagaccaaga 32160aacaaagtgt
tggcatccaa actttggtaa atcatagcct tttaataaag tcaaatggac 32220caatgtactc
taacaaaaaa atatgggtct ctcatttctg aatggcagat ttcaagccct 32280aagaaccaca
atgctcacct actgggcaac actgagttac agagacccag ctcccccacc 32340cctcaccaag
ccagagaaac actctatctg aacaatcctt ggtccatgga gcaagaatta 32400gacatagaat
ttgtatctca ttgtttttta ggaaaacccc aaaggctatt atgaagtcag 32460tttttctggg
caccttttct ttcccatgac aacgagttgt gggcagtctc agcagaatac 32520tgaagctgtg
gcttggggag acagagcata tactggattg gagttcatgg gtgggtgcat 32580ggaatcaatg
ccgggcatgg gattcaagac cttatgcatg tgggtagatg ctttgttact 32640gggataaatc
ccccacctgg gatctgactt caagcacaat ctttggaagg cggcattggc 32700tctctgctaa
tttttctagc acttttattc cacttatttt ctgcttgttt gctttgggag 32760ttttgttcgt
tataagacag tcttgctgtg tatcctaggc tgatcacaaa cctgtggcag 32820tccttttgtc
agcaggccaa aattcccact ttatctctga agacagaaag tagattgagg 32880aatatatgat
aaagacactc atcaaagcca ggcatctatc tttacttttc ttaaagcatg 32940tttttgaatg
gcataaaacc atgtagacaa ggagtcttat gttgtacatg gtcctacttt 33000gtcacttaca
atataggata ctttcaataa gcttggtagc ccttgcccta ttctacttat 33060tctgttctct
cttcctcggg tcttggggag ccttcttacc aggtggggtg gcataaaggg 33120aaaagtcaca
aagctcttcc tattcctggt tcccctccta agtgtacctt gctggtggcc 33180ttgctagcaa
atgtagtata acatctgact tatctcctct cagatatggt tgttgtactt 33240agataaattt
aatctagaaa ctcaagctgt atgtctttgg ggaccagcat tacagagctc 33300ttcccttcct
gtccttacct caccttggct actgtagtaa gttaatcctg atgattcctc 33360catgagtcct
gaaactgatt agttccaaga gctggaggat gagaagggat atagcctggt 33420gcagggacac
tttccaatga ccacaagacc ttgcacaagg tacacatgga atgtgttaga 33480ctgtctcctt
tctgtcccta gcctcagttg ccccagtgtt tatcaatgtt tattaacatt 33540gccctagcaa
aaatactaca gactaggaag cttgggtaca attgaaaaga gcttctcagg 33600gttctggata
ccgggaagtg caaaggttca gcatctggac agggctgcta ttgtagtttc 33660aaatggttct
gctgcaacac ccctttgaga gaatgaacac tgcttttcac atggtggaga 33720gtgcacagac
accaacccaa ctcctgaagg ccctttctcg agggctctaa tccatcatga 33780gggccatact
ctcaggactc attacctccc caacatcccc tctctaaata gtaccacact 33840gcatttgcat
ttcaatatat cactggagat atataaatct ccagaccaca gcataccata 33900aatcagataa
ggcaggcctg ccttctatag cctttcactc agcaaaggtg tttctagccc 33960aaagcagtct
ggactctcac tctgaaacct cttgggagtg gtggccagaa atgacttccc 34020atcatccctc
tctcctgacc tggtccagca ccaggtcacc aggaaatcct ccaagtttca 34080ttatccccac
ccccaattgt ctcttgtctc tagcaaacct cttccaatac ttccttcctt 34140ggtgggtgta
gcaagccaga tgatagcctg ccaaagaagt tcacagcctc atttctggag 34200cctatgaata
tgttacattg tgtggtaaaa ggaactttgt aggtgtgatt aaattatgaa 34260tcttgaagtg
ggcagattat ccaagtgagt ccagtgaaat tgcaaaggta catcaccaac 34320agtgaggcag
gaaggccaga gggggagaag gaagcagaga ggcagaggga ggaaaagaca 34380agccagggga
ggggagtggg gggaaagaaa ggagagagag agagagagag agagagagag 34440agagagagag
agagagagag aaatatcaca cacacacaca cacacacaca cacacacaca 34500cacacacaca
cacacctgaa cctgattgtg gaggaagaaa ccactaacca aggcattcga 34560ggcagccttt
gaaagtcaca agagacaggg aaaacagatt ctctccctcg gcccttcaga 34620atcaacacag
ccccacaact gctgatttta gtcatgttaa agccaagttg gacttctgac 34680tgccaaaact
ttagacgagc aaataaatct gcactatttt aagataccaa tgtgatttgt 34740tcatgaaaac
aatcaataag gaactaataa agtagaagtg aaaattggat cacttctgaa 34800gtttggtaat
atccacagaa actggacaca tgctgacttt gtgagccata gctccacacc 34860caggtatgcc
ccctacagaa atgtgtatat aggtgggcag gagatgtcac ctgctgtgtt 34920catagtcgca
cctttagact ttcccaagcc tgagaatagc ccaaacacct accaggagca 34980aaataaattg
agatatacag acgcagtggg atactacact tctaaaagaa tgagaaaacc 35040acgctataca
ctgtatatcg tcggaacagt aacacagggg tgacaatcag gcaataggac 35100atattctcta
tggctttaga aaacataaaa atagcataac agttctgtta gtggcaatgt 35160gttctgtttt
gtgatctgta tgatgcttcg gtttgtgcaa aagctctgga cttacctttt 35220aaatgtatgg
tggtctatac cttttaaatg tatgctagat atacatgagt aaaaatgatt 35280aaaagagatg
gaggggagga gactcatgcc ttcataaaag tttgttctgt cctttctggc 35340actgtccaag
tgaatgtgtg taaacaaaga gtgacccacc ccaggtagtc caccttctta 35400gaacctactt
ctgctacaac atgtcctgtg aatgtgcacc aaatgtttac taagggatca 35460tgccacaggg
ttttgtttaa ataaagtatg tctacctagg ggtatattga ttgtctttcc 35520ttttgagggg
gggtctcaaa actacaaact agtttgtttt gagacaagta tgtagcccag 35580gatggccttg
aactcacacc ttctgtcctg cctctttccc agcactagga tggcaggtga 35640gactatcagc
ctggccccag gaaactatct ttgattgaca ttatctggtc agaaaagatc 35700taccttttcc
tccaccaggt cctccaaata catgaagagc tgaaacagtt ctgtctaccg 35760aatttccttt
tttcttgatg tttctgtgga atttaataca taaattttaa tttgcatttt 35820tagcttttct
attaagcctt aattagagta taatgaagtt atgaatttat aaaaataaaa 35880acaaaacggt
tgctcccaca atcactcagt cttgaagtga ggttctgact ttacctgaag 35940tgggggaaga
gagtgaggaa agggacctgc ggaagctgaa tctcagaccc acaagatgga 36000tctgagatcc
atccaagcga acgtggacgc agacccggag tagggacatc caggggtcat 36060cttcatctgt
cctcgctgtg cttctgcccc tttgctcctc taccagtctc agctgtcaaa 36120gctcagtggc
ctggagggga gatggggcgg ggcttaggat cgaaggcgga gcctcggaga 36180gcatcttctg
gcccccgggg cctggactgg cccgccgccc ccacctgcag cgcggcggag 36240cgcgggcgcg
tcactcccag cggaagcgcc agcctcgcgt ctggcgaggt gcgcgcttcg 36300cggctcccgc
tccagagctt cgtggcccgc ctgtgtctgc agagcagggg cgggggcccg 36360gcggcaccga
ctgggcactg agatccaagt agccactgaa tcgtagacag tcacccagct 36420cggacagcgc
gtcggggcgg gagcagatcg ggaaggtgaa ggaccactgc ggatccgaca 36480gcgcgtccca
ggtcagtcct cccgctgcac ttggggaaac tttgggatgc ggtgacggct 36540gcgagatgag
gacactgagg gtcgcgaggc cgcgtggccc ctgtgaaccc cgcgaacccg 36600tacctgccgc
gcacctgaca ccgcagctgc cagggcgggg accganaccc tgctgccgcg 36660gaccactgcg
ggccaccaag ggctagcggg cttcaggggc ctctcgggag cctccggctt 36720gcccgcgccc
agccgcgcgc ctccggtcct cgcgggtccc cagctccttt tggcggctcg 36780cgcccggacc
ccgcggggct gcggattccg ccgtcttcgg gcctcgtggc gctggaggag 36840cggcccgggg
gcccatggct gcagggtggc ggccccgcgg cgggagcggc gcgtgctcgg 36900ccggtggagc
gcgcgggtcg cggggttcgg ctggagcgcg tggccgcagg tgcctgtggc 36960cgctgggcag
cggaggtgag agcgcgggct ggggacgcgg agcggattgc aacctctggc 37020tgcaggaacc
agggtcgctg ggtgagcagt cctgtccccg cggcttccgg gcgtgcacat 37080ccctggcacc
cggcatccag accccatcag ctggaggcgg gctgcagagc ggcgcctgcc 37140cgggccgagg
accagtgcct cctgctctga cacgccatct caccaacgag ggcggggtgc 37200tagattggcg
ggctgcgcgg ggaccactgg ccagggcctt ctggcacaag cccttttcgt 37260ggacagctgc
ctgctctggc ttggagtgga ggagacgaaa tgagtacccc gcccccatca 37320gcgccccaac
actgtcgccc cagtcacctt cctttgccct tctccgacag caccttggac 37380ttgctccctc
ccgaattggg gaaaatctga ggaaaccagg cagggacctt ggagataccg 37440cagcctgcat
actcaacagc ctggaaatcc agtcaccttg gtacctcgct gcttcccaga 37500cactttggag
gagcaggttt gccatttcta ccccacatcc gtaccccatc ccccgtccgt 37560ctctgctgag
gaagggactc ttatgagaga agttgggatc taggtacccc ttaaggtagc 37620cccagagtct
gtggtaacta ggctcatagg taactaaaag gcatcctagc tctgtagctt 37680tgtgagggaa
acaaacctta ccaactaatt ccttcccttt ctgaatattt cttagaagac 37740tggagaccaa
cggaagccga ctgttctggc cagtctttgc accctttgct tggctctgac 37800tctccttcct
aggcagagaa acattttgct tatgacctct ggctggcctc cttccaatcg 37860ctgcctggcc
ttggactgcc catcaggact gtgatttttt ttttttttta agacctgatt 37920aggaaaggct
gcaagcctcc ggttctagaa ggctcaaact caggggtata ctcttctctg 37980atacccatgt
gctccctaat tccactgtgg caacacctct gcccttcact cccacaagaa 38040aattggttgt
caaacctctt ggggaagatg atggaggcat ccctgtggga gcagatgcag 38100gatttggaag
caaccaggaa acaaccagga gtgaggaatc ttttttaaag gctcacatga 38160ttctggaact
aagaaaagat ggagatgcca ccagtgtatg aagcttggcc tctcctcggc 38220ccatcccacc
caactcaggg aactggcata tgcaggacct gtattgggtg atgcatattt 38280ggaacctagt
acttattgaa ttcctaagca gtaaacacat tccgaatttg aaattcctca 38340caatcatcta
ctgnaatgta gatattaaac ccccaactta tgaatgatag ccccaaaatt 38400gttaacattg
agagagccca ggttccctgc cacctcttcc acaacaggac aggaactagg 38460acaatgaata
ggaccatttg agctttaggg tcatgtgccc actttacagc tccatagcca 38520gacaactgtt
ttataagaga gggcacaaag gaaaatcact gtcctgtcca aatgaataga 38580aagctgggga
tggtggcagg acaaaggcaa caggaaaaat catctccaac aaggctttcc 38640aagcatatca
gtcttatact actgccatgt tgggtaccac acaaatcagg tatctcaaac 38700tggacgctgc
ctagggaggt ctgtcatcta aaaaggcagg gagatattga gataaaatac 38760acagaagcta
gtatttaact ccaggctggc agataatagg aatgaccttg ggagggtgtg 38820cttacctttc
cttctctctt gaacaaaatg tggactggac cagatgagca ccaaggctcc 38880accaactcta
acagaccttg tgtggtgggc ttgcctgcaa acagacttga gctaggttgc 38940tgtgcgtggg
atccattcca gactcattta caaactcgta gtcagtgaaa tgtgataaac 39000cgaacactgt
agggatttct aaacaaggaa ttaaaaaact cgactccaaa tgggagagat 39060gcaggcaaca
aatcgacagt gtttatgtgc ctctgaatag ctttgatttc cttcggtagg 39120agctgacagc
tggctgacag aaagctcacc cagggagaga agagagaaaa atcaagtatg 39180agattaggaa
taatgttttc aggtaacttt ctattcccat tcggagtggg tgtctggaag 39240ggcgagtgta
gttatggctt gaattgctcc atttatccac agatattttc ttcccaaggg 39300ctcctgattc
taagatgctg ggctttgctt ctgtctccta gtttcctggt agcagggtag 39360agagctgggg
gtcccagcat tcagcctgca tattcttcct ctatcctcac tatctgctgc 39420ctccattatt
tgtggtcttt tggatctatt tggtcagaga gtcagtcttt ggtttcttgc 39480cctggaaact
gcttgttgct acttgtggtg ggggcagcat ttggaagtcc aggtgctctg 39540cccacaaact
ttcaacccat catttgtttt tcatcccttt ctcattgcca ctttgtgtgg 39600tgcctgggac
ttctgggacc tatagttcaa gggtcatata taccaatggc tcacatgaca 39660gcactgatca
ctctgccagc tctcctctct ttgcaaaact tatttcagat ttttcatttg 39720acaatacctt
tcctccagtt gtctttattc ttggcagcat atgccttgta acctttaaaa 39780aggaaggtaa
ataatttgag aaaaaatgta ccaagtcctc agtgatacat tcttactaaa 39840gactcccagt
tttaacaagg agttgggctg gagccatggc tcaacagtta agagcactac 39900ctgctcttcc
aaaggacaca aattccattc ccagaaccca catggcccct tccaaacatt 39960gataactctc
gttccagggc acctcatgcc ctttcctggc atctgagaga accagcataa 40020acatacatgc
aggtgaacat tcatacacat aaaatgaaca ttaaaaaaga aatgaaatag 40080agaaagggtt
tacataacta tttaataact aagactgcct aataatgtag ggacccataa 40140agaaaatcta
gtaagttttt acaagattcc actcaatcag accaaacatt actgttactg 40200acagagtaaa
aagtcacttc caatagtcca agaacaactt tgtttcattt ctcaggcact 40260gtctgttttg
tggcatatgt gcatggtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 40320tgtgtacagg
tgaatgctgc tcgtgtatga gcacatgcag gtgtgtgttt gcatggtgtg 40380tagacagagt
ttctgacctg cctggtccca cagctgtttg gccacaaata aacatacaga 40440ggcttatatt
aattagaaac tgtttggcct atggcttagg cttctcactg gctatctctg 40500tcttaattat
taacccataa ctactaatct atgtatttct acgtggcgtt atcttaccgg 40560agaatacttg
gtgtcctatc ttctcagcaa ctacatggcg tcttctctct gcgtcttctc 40620cccagaattc
tcctcgtctg gttgccccgc ctatactttc tacctggcta ctggccaatc 40680agtgttttat
tcatcagcca ataagagaaa catatgtgaa gaaggacatt tccctatcaa 40740tggtgtgtgt
gtgtgtgttt gtgtgtgtgt gtgtgtgtgt gtgtgtatgt gtgtacatgg 40800gtatgtgagc
acatgtgggt atatgggtgc atgtgcacct gtgtgtgtgc atggtggcta 40860gagttgaggt
tagatgtctt ccttggctgc tctccacctt ttttttattg aagctctcac 40920tgaacttaga
gctcactgat tcagctagtc tagctacccg gcctgctctg ggggtcccct 40980gccttcactt
tccatgtggc taccatatct actttacatt tatgtgggta atggggatct 41040gaactatggg
gtcctcatgc ttgcatggca agtgctttat ggactaagac atctttctag 41100cctttacctt
tttttttttt gaaagagttt ttttttgcta actgggaact caacaccaga 41160tagctagtct
actggtcact gaggcccagg gatctactat ttctgcttct cttcccaagt 41220gctgggacta
cagactgtac caccatatcc atatttcttt tagcatgagc tctggaagtc 41280aaactcaggt
cctcacgctc acaaagtaag tgttttatct accaagccat cttcccatct 41340ctgttgtttt
aaaaggcttt gaatatggga tgtgatgaag ggaggtgaaa ttctgagata 41400aatttcttga
aaagaagaat gaatcaagta ggagaacctc ctcctggtgc tgtctttcag 41460ttccatgtcc
acacagcata aacattatga ttatcattcc acagattgta attagtcttt 41520ctctgttttg
ccagtctgct cccaaaaaat gacacagaga gacttcttat taatgatgaa 41580agctttgcct
tagcttaggc ttgtttctaa ctaactcttg taacttaaat taacccattt 41640ctattcatct
acctgctgcc acgtgattca tgacttttac ctctctctca ttctgcatat 41700cctgcttcct
ctgcttctgg ctcatgatcc cgcttttctt cctctccgag tgctctgtcc 41760ccagaagtcc
cgcctaacct cttcctgcct agcaattgcc catttggctc tttactaaac 41820caatcacagt
gacacatctt cacgcagtgt aaaggagtat tctgcaacaa caggtgatga 41880agccaacatt
ccaagaggcc agggcttgcc tagggcacat agctaactta agaaaattag 41940gatcgcattc
tacatctgtc tgactctgaa ttggatctga actgtgactt gcatggaaga 42000cccaaagacc
ctgagaaagt acaatgacaa aggggctgac tctgtccaca tggtgttagc 42060ccaggtttcc
cacaggagga aaacccatcc taggcaagag aagtggtctt catcaaacac 42120tctatgaaaa
gcaaatcaga ctcaaatgtc aggatttgtg ctttacagat cgatccggta 42180agatgaaaga
acttcctgaa agtgtgtgaa ggcctaaagt cagggctgtt catggaaggc 42240actgactaca
gaatgaggtg ccagaagcct agtcagagcc tctagggaat aaagtgtcag 42300atgatcttct
aaaaaagttg aagtttcacc agtaacagaa tggccccact attaaaatgt 42360gagcaaactc
agaagtcatt gtagcatata gaagcacaga cctatggatt gctggatgga 42420gcccaggtat
tcactccatc ctgaatagcc agctggggag ctagctcagt cagttaagta 42480tttgctatgc
aaatctgagg accagacttt ggtctcctgc atccacagaa atggtgcaca 42540cttgtaatct
cagcactggg gaagcagtca gccagatcca acagctgcct agccagcgga 42600aacagcctta
tcagaaactc atgggtcctg gtgaaagata ttatctcaaa taacaaggtg 42660ggaagctcct
gaaggacact ggaggttaac ttctggataa acataggctc gccccaccac 42720cagtgagcat
gtgcctaaat ccgtacataa caatgatgta aagatggaat tcattccagt 42780gaaaagtaag
cctcctggac tctttttttt tttttgttgc tagatattct cgagacctca 42840ggagagaagg
tttgccatca tctatataac atggtactca acttccctgt agtccacaac 42900attcctattt
ctatatgatg gagaagaggc cactgcccct cccagacatc tcagtctcaa 42960atttgttacc
agttccctct cctaataagt gcttagggtt agtgttgtag agaagggctt 43020tacatgaagt
gtgtgtgtgt gtgtgtggtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 43080tgtgtgtgtg
tgtgtgtaac ctaaaggctt tccatgtttc cacactgaaa ggttcttaag 43140actgagaaca
accagataag agtccaaatt ctagaaacca tgggaaagtg taatattgaa 43200agtcagaaca
aggcatggtg gtgctcacct tgaaacccac cacttggggc agaggcagtc 43260agatctctgt
gagttcaagg cccagcctgg tctacagact gtacatagtg agttccaggg 43320ccagaactac
atagtgagat cttgtctggc caaaaatata taagtaaata aaataaatca 43380gtacatggta
acttgttctt atttcagtgt ctgtttctca agcatgactt tggcttaagg 43440atttttccca
acttgttttt gtgattgcca ctgtatcatt tctttgtgtg aagttactaa 43500gtggtttctg
tatttgatat tatgttctga cctagtttct tttcatatta aacccatttg 43560tatatgaaaa
ctgcaaagaa gtgggttttt tgttttttgg gttttttttt gtttgtttgt 43620ttgtttgttt
tttcttggtg ttctcatgtg acctttccaa tgtttgcttc cagaatagac 43680ctgcaagttg
ggatccacac tgccatctga agtcctgcac cccaagtttc aggtatgttt 43740tgatggcaga
atagcttttc tagactgtga caataggggc ataaagccac aaagcattcg 43800ctttcctaca
ggttatgcac ccactctctg agtgattggc tgtgcatcat gaatattatc 43860aaaatggagg
cagttcagtt tggagtgctg tcttttatgc gcttattcat ggcaatgcca 43920atggaacatt
cggcaacata tactactaat catgcatggt aactgaactg tgttgtgcaa 43980ggaagacctc
atatgaccta cctttgcata tgctgacctt ttctgtgaca gactcctata 44040atactgagag
tggtactgta tggaagagtg tgtgaaaatg tattgtttaa ataacagaca 44100gatgcctcta
aatacaacac ccaagcagag aaatggagca tcactggcac tttggaggcc 44160tctgggtaac
ctttccagat cacactgttt tccttcctcc accaataacc actttccctt 44220tggatgctac
tcatagttaa catctttact tttgttgttg tcccactgat gctaagaaaa 44280ataacttcaa
ctagcaagca caacactaga tgaattaaga gtgatattga ctgtgtgtgg 44340tgagtctcag
aagactagct gcctcaggat tcatgaatgc ttacaggaac cctttagcaa 44400ggtcaggaat
gagtcttagg atccatgtgg ctcatagtct ccagcctgga catggagtag 44460cacagtgtct
gagtgcccca agggaatggg cttgttcagg ctcccctccc cgtccccagt 44520tccaacaggt
ctcagatcca ggacatcaga gctgagtgaa gagcagagct aaaaggagca 44580ccatcggagc
cctagaagca gaataggggg ggacacagca cacagagaca agaactgagg 44640ccaggctgct
gtgtgctttg ggcctaagtt gacagatgaa acatggtagg gtgaccacat 44700ggaggatgtc
tgtgcacatc catcaaactg gcaggtcccc ccagcatttt ctgggagctt 44760ggggtcctct
tttccatgat cttcagcttc tgtattctat gtgcgctgtt accatttcat 44820cttggtagag
tctatccttc tgttatttct tgagagtatg tcccaattct tgcctggagg 44880tttggctaaa
tatagaattc taagcagagg gtcatttctc cttcagatat ttaaagacac 44940tttctgtatt
gtgcctcatt gccattgttg atatacctga atctaaattg atcccttggt 45000gcgtgactta
tccccacagc caagggcccc ttcccttctg gtctgtgctc tggaagtctg 45060caggcacatg
gtatgggtag ccactgtttc attcatagtt caatgctccg ataggccctt 45120ttgatttgat
aactctatcc ctttccccca ttcccgttga tgatttcttc ttttgttccc 45180cttttgatat
agtttccttg ctgatgctgt gctaaaatat tcctaccaaa aacaacctgg 45240ggaggagagg
cttcatttgg cttacaattc cagctcacag tcattgaggg aagtcagggc 45300aggaactcaa
ggcagggagc atggaggaat tgcctgctgg cttcctctct gacttactca 45360caggttcttg
taggctagct ttctgataac atctcaggac cacctgctta gcaatagtgt 45420ggtccacagc
aggtttgaac cttctgcatc agttactaat caagacattt gcccaaagac 45480atgcccacag
gccagattga tgtaggcagt tcttaaatca agtctttttt gtcaagtgac 45540tctagactgt
caagtcgaca gttgatgcta actaggacac tattctacca cttttcttgg 45600tagaaatatt
attcggatat tggagttctt ggactagttt ttctggttct ccttttcttt 45660cttttcctgt
tatttatatt tgttttatga gatagggtct ctctgtgaag ttgtcctaga 45720ccttctggcc
ctcctgctta taattcctaa gaactgatat tacaggcagg tgccatgagc 45780ccaacgtttt
ttcttttctt ttcactgcac tctgtttgag agtctcatcg tcacagtcat 45840tcacatcttc
tattgtcttg tttttctttt taaatgtgca ttggtgtttt gcctgtatgt 45900atgtctgtgt
gagggtgtca gatcttggaa ttacagttcc aaataatatt tctaccaaga 45960aaaagtggta
gttgtatcct agttggcatc aaatgtcacc ttgacagcct tgagtcacct 46020gagaagaaag
acttgattta ggagctacca tgtggttgct ggtaattgaa cccaggacct 46080ctggaagagc
acccagtgct cttaactgct gagccatctc tctggcttcc ttctattgac 46140ttttgcaggc
ttctttcttg ttcttttgca atttcatggt ctctgactgt tcttcacaga 46200ctcttacctc
atgcttaaga tgtctcttac tccttcaagg atactgagtt tttgaagttt 46260taattctcct
gactactgtc ttttccctcc tgtttgtcat tctctgtttg ccctggcctc 46320tgtctttcat
gcaggaagac ttttcatttg cttttaggtt tttattttaa ctattggttc 46380atgactaaag
ggctagatga aaaggccagt gagaaggctg gagcatatgg gtgatacttg 46440tcaaccggga
gcctcactgt ggaatgcttc agtggcatgt gaaatcctgt ggtatttgct 46500caggcaagtg
cagctgttga atgcagacca gagcagcttc cttcgaagga gtcagatgtt 46560gctgactgtc
tttctgcagc tggtcaggaa ggtgggatag acttcagctc ttttcaaaca 46620gtggtcacca
aacaaccact tgcccagaga ctttgtgctt taccattctc agagaacaga 46680cctctggatg
gccccatggt ggaagcagcg cacctgtcta tcacaggtgc tctgaaggag 46740ttggaagaac
tacccattgt ccacatttcc cacattttca catgccagct tcactctggg 46800atctgggtga
cagtggggct gacataatgg caggggttgc agtttcagac tcagagtatg 46860tggtaggaat
gctgctgtct gagggaagac tcatctgagc agtggaggct ttgcctgttc 46920cctggcatca
tttgacctgc ccctccttag aactgggaac cccagttcta aagctccctg 46980ctttaaagat
tctgtgttgg ggtaagttct tagctttctc aggctaggtc ctctgctctt 47040gggtttccac
ggcactgttg ttttccctct ggctttgtga gtggttgtct tttgaaaaac 47100tagttagttt
ggaaaatttt gggagggagt caaataagat gtatgcattt tgccatgtaa 47160gtcctaacca
agccatctgc tgtggtattt tcctgagttt ggttctgccc ctataggcag 47220agtctgtcat
cacagataat tgcattttga acttgagcat ctcccttcct tctttgtctg 47280cctgaaaaag
tctctttata aaaaaatgta atgttaattt aaaaagtatt cattattctt 47340gtgttgtgat
acatgagtat atatatgcta tgatgcatat gtgcaggttg gaggacaact 47400ttctgtagtt
ggttctctct ttctcccttc atgtaggttc tggggatcga acccaagtca 47460tcaagcttgc
acaacagcac ctttaccttc taagccttct catcagccct ttttttattg 47520attgattggt
tgattgattg attgattgat gctagggata gagcctaggg tcttttacat 47580gctaagaaaa
tgctctacca ctgaactgca ctcctagccc aacctgctaa attcttacac 47640tgtcttcaaa
aagaagctct gatgctggat tctgcaaagt ccatttttat ccctaaattc 47700ctaaagctgt
ttaaatctcg tgagtcttac tgtacagacc agctctgtgc accatcttcc 47760acaatctcca
tgacctcctc aggatgggct ggtatctctg cagctctgcc cagtgcctac 47820caggaactta
caggtgtcac caatgaattt attggtgcat gctcacttca tcttgtccct 47880atccactttc
tgctttgact ccttctggta agagacaagt gtgttaacta cttgtgctat 47940caccacacag
aaatccatat cccataatct tagtcctttt tatttactta tttttgagac 48000agggtcacac
tctgtagctc ccacactggc cttaaacact gacctcgaac tcatggtgat 48060tctcctgcct
aaacttctca aataccatga ttacaagagt gacacaccat gctgggagtc 48120ataatcttaa
gtttaaaagt gagggactgg tcagtttact gtgctaggtt gacattgtat 48180agaaatgaac
agccatgttg gtctggaaat gttcctagtt ttcatttgta caaggatatg 48240cagtgtgtga
aatagggaga gtcttaccta tgtgggtttg atcacagcaa ttaataaaat 48300atgctctaaa
taatgaaaaa agccagtaac tagtagtgtt tctgaatcct cactaaagct 48360ttaatacatc
ataaataata tatcactgca gattatgtct acatgttata catatcacat 48420ttatagtaca
atctgatctt tgtcacctac tgtaagcaca actgaaaaac aaattttctc 48480atagctcaat
attaagtcat tattatcccc ataataagta attattatcc ccataatgaa 48540actatctatt
gagggagtca gaatctgaga tagttaaata aatttaagca tgtattttta 48600gtgtcaatgg
taaaaattaa atgttcataa agcctgtatg actcctttta aagtagtttt 48660aattttatgt
gtatacatat atgcatgttt tgccttcttg tatgtctgag taccacttgt 48720atgtctggtg
cctgaggagg ccagaacgta tcagatcccc tgaaactggt attacagttt 48780tgagctacta
tgtggctgtt gggaattgaa cctggatgct ctgaaagagc agccagtgct 48840cttaatgact
aggccatctc tccattttct taaaaaaaaa tttaaaacat ttactctaag 48900atttactttt
atgtaggtgc gtgtgtgaat gtgtatggtt tatgcattgg ggtggggagg 48960atggattagc
acagtcacag aagactagag gagggtctct actattgctt tctgtcttct 49020acccttgaga
cagggtctct cactaaacct gaaactcacc tttgcagctg gggtagctgg 49080tcagaaagat
cctggaatct gtctttctcc ctggccctaa tgcttgagtt acaggcccat 49140gtgaccatac
ctgtcgtttt actggggttc tacagagtca aacccaagtc ctcacgcttg 49200catagccagc
gattttaccg actgagacat ttatctgccc caattcataa ttcttctctg 49260cttccattaa
taatcccatc tatgtcccct tcatacatat ttctgaaata gacaaaatga 49320atacaagtta
gacatcgagt ctgattaatc ttcaacttct ttgataacca ggtattgatt 49380tctgactttt
gaagatggat gaaggcacag aagtctccac tgatggaaat tccctgatca 49440aagctgtcca
tcagagccgg cttcgcctca caagactttt gctcgaaggt ggtgcttaca 49500tcaacgagag
caatgaccgt ggcgaaacac ctttaatgat tgcttgtaag accaaacaca 49560ttgaccagca
gagcgttggt agagccaaga tggttaaata ccttctagag aacagtgctg 49620accccaacat
ccaggacaaa tctgggaaaa gcgctctgat gcacgcatgc ttggaaagag 49680cgggcccgga
agtggtttcc ttgctgctca agagtggggc tgacctcagc ttgcaggacc 49740attctggcta
ctcagctctg gtgtatgcta taaatgcaga agacagagat accctcaaag 49800tcctccttag
tgcttgccag gcgaaaggaa aagaggtcat tatcataacc acagcaaagt 49860caccctctgg
gaggcatacc acccagcagt acctcaacat gcctcccgca gacatggatg 49920agagccatcc
gccagccacg ccttcagaaa ttgacatcaa gacagcctcc ttgccactct 49980catgttcttc
agagacggac c
50001350000DNACricetulus griseusmodified_base(13508)..(13531)a, c, t, g,
unknown or othermisc_feature(13508)..(13531)n is a, c, g, or
tmodified_base(21446)..(21479)a, c, t, g, unknown or
othermisc_feature(21446)..(21479)n is a, c, g, or t 3gggctcaggc
atttatcgtt cagagattga ctgagctgta aagatggaaa gacaaacttt 60tttttttttt
gattgagtcg gggtttctct atgtaacagc cctggctgtc caggaactca 120ctctgtagac
caggctggcc ttgaactcac agagatctgc ctgcccctgc ctgtcgaatg 180ttgggattaa
aggtgtgagc caccaccgcc ccgctgacaa actagacttt tagaatgtat 240tatgagataa
ggttttgtta tgttgcccag gctggactca gatctgtagc aatctatctg 300ctccagactc
ctgagtgctg ggatatacag acctgagtta cctgtacagc tttctaatca 360tcccccgctc
ccccagagac agggtttctc tttattgttt tggagcctgt cctggcactg 420gcactcactc
tgtagaccag gttggcctcg aactcacaga gatccacctg tctctgcctc 480ctgagtgccg
agattaaagg tgtgcaccac caacacccta ctttctaatt cttaaagcaa 540ggctcccaac
tcctcccttg tgtgtaatca acaaggttct tagaccctgt ctgcagtgtg 600gattcccact
aataagacag tggcggcaca gtgctgtgtg gcagagcaag cgtccatcta 660gttcctattg
tcattctatg atttgctctt ctgggagcct tgtcattcag caagttcctg 720ggcttgtctt
gggattgcaa tgtgcctcag cttggctagt tcctctgcgg cagaagcagt 780gtttgaactc
agtgggcact cagtcactac atctaacttg tttgagggct ctctgcattt 840gctttccaat
taaggtttag gatgactcct ccctgtgact cttatcatcc tgcctattaa 900tgctaaatta
gagaggcatt caagataact gccgaagatc taataaataa atggggtggg 960tgggtaggac
tataaaccag tttatagcat gcaagaaagc tctgagcacc acattcaaaa 1020ataaagtgct
gtgagcctgg tggtggtggc tcacaccctg atcccagaac tcaagaagta 1080gacagaaggc
tcagattcaa gattcaagtt cttccactat acagccaatt tgaagtcagc 1140ccagactaca
tgagaccctg tctcaactaa gcaaatgaaa gcaaactggg gtccaaatag 1200gcactattcg
atgttttgat gcaagtttgt gactgaggag tggaggtggc aaatgaagac 1260ttttttcttc
ctcttcttct tcctcctggg tcccgttttt tttagggtgt tcttaggata 1320tgtatgtctc
attggcacta ctaagaagtg tggggtctag ggaacttcct gttatgtata 1380caagctaatc
ttcaaacaat tgtgtgggct gttttggtaa ctactcaaat aatgctatag 1440aaaattgtac
aatatattgg ggaaggaagg gagttttaca caggagtcaa catgactctt 1500gtctctggaa
agcaacttgt gatccaatga ggagctaaat ttagagacac aattcaggaa 1560gagaatccaa
tcagagcttc cttgtaaaac aactcacctt cacaaacaag ttcattccta 1620atcgaattta
aggtctagaa actgccaacc tattaatgtt tctataaata cacttggggt 1680caactacgta
gccaaggaaa tctttaataa attgaacaca aattgtcagg ggaaggttat 1740tgctgggact
cctggaagca tgtataagca gggtaggggt gacatagggg tggggggcag 1800ttaactcaca
gatattagtc tcagatatta atggcttgtg tgtgagctgt ctgccacact 1860taatgtcagt
caccttgccc ggaactattt ttctctctga ttccaaatgt agctattggt 1920ctattaaatg
attaacttcc acagaaactg ataatatcct tatggaatct gactgtggta 1980agcctgtaca
cccccgcccc aatttccttc tagatttaga attccattcc atgagccatc 2040acacccacgc
tgaaaaaaga aaacctgttg aatcaaattt gtgttttgga gggtaagagc 2100cacccttcca
atttataagg ctgtctattt ctttgggggg ggggaaatga accagtatct 2160tctattagta
aaaggagtgt ttgagcatgg gcactacaac ccacttcttt cagggagatt 2220catttttctc
tgagaactca gcctctctgt gctggtgcca caggaattct taaactcttt 2280caactctcca
attaaccaga gagcaaaccc agcactttcc atctatgaga aatctacacc 2340actcatggaa
tcattgtgtg ccctctctca ctgcctaaca ggggtaccct tgccaaagaa 2400aagcaactta
atgccaaaaa ggtgcatcac ctggcactgc ttccgaggat gggcaatgtg 2460caagcacttt
gttcagtggc tctgccttgg ggtctcttga ggggcggcag gttacctggg 2520gtgggggcgc
acactctctg aaggtgggct gcgttcagtt tcctgcttca ggggctcctt 2580catagtaccg
ccccctgatg agtttctgct cagactggaa ggtgtcaggt cccaaagaaa 2640cctgggacaa
ggctcactca gtacctgtcg cttctcccag cacgtctcac cccaccccta 2700ccctaaactt
ctctagccca gaggctgggc tccccctttc tctttcctac ataaccctgc 2760cattttagct
gtgagctctc tccgtcttta gctcctctac tgttctttta tcctctcttt 2820tctctctcct
cttcttctct cacccccacc cccaccccca tctctccccc catggtctgg 2880ttcagtctgg
accctttcag atgcctctgt ctgaactctc cctcatatct caataaaacc 2940cttctcttca
gccacgcctt ggagaggtca taggctcatt ttcgttcaga aggcctatca 3000aagaatctgt
gggcttatct ttacattcac aataggcagc ttggccctga gaccacagtc 3060caggttaaag
tgttaccttg gaaagaaagt cttttattca aggtgtctgg tttcttttct 3120tgtttttgtt
tttgtttttg gagacagggt ttctctgtat tattttggag gctgtcctgg 3180aactcgctct
gtagaccagg ctggccttga actcacagag atccgcctgc ctctacctcc 3240tgagtgctgg
gattaaaggc gtgagccacc aacgcccggc tcaagtgtct ggtttctttt 3300gatgtcttta
gtttctttaa tcccataatt cctttaatta taccctcttg tctgtcggag 3360aatgacatca
aggatatcca gttcaaggtt tcctatgtag ttcagtcata gagtgcttgc 3420ccagctgcca
gactctgtca gatgcccagc accacacaca tacaaagcat ttccagctct 3480gtgtctgtgt
caattactcc tgtctgcttc tccatcccca gacaccagga gggcccacaa 3540gaagcttgga
gcagggaaga ataaagagac aatatccata gacacacaaa acctccaaag 3600tacttatgca
ttgaggaatt acagcttaca aatccagtca cagtatctat attcatgtta 3660gcctgatttc
aatcccccag ctacatattc ttccatgagc tagctccttt cctattcaag 3720actcccttga
taatagttgt tatcagactt tacccctatt aaaatatttg gaccgtttga 3780gagcaatagc
tcacctctat aatctagaac ccaggaagtt aaaacaagat gtttgctgca 3840agtttgatgc
cagcctgggc tacatagcaa tttccagaac atcctgagct acagggcaaa 3900attctatctt
aaaaaacaaa aagtagacag atcaggtgtt tcaccttgtt tcaaaaaatg 3960caaaaaatat
tttttaattg tagaaatata tacgctaatt cctttggtac cctaggccaa 4020gtgactagat
gggttagtct tccttctggt cctcacagaa gaaagttaag ttctcagcag 4080gaataataaa
aaatattaaa aaaaaaaaca agctgcaaaa ttctgttgtg gttctgccaa 4140agtgttctca
ggagtgaggg catactggga tttagtcaag cagatatttc tgtttgaata 4200actaggatct
gggagccatg ggacaccacc cccacccata agggctactg aaaaccaccc 4260ctggaaatct
gtaaatattg ctaaggctct acccttttgc tcagagaaca accacccaca 4320aggatagggg
ataagttagt tctgtagtag agtgcttgct tagcacacag aaagtctttc 4380tctctctgtc
tttctctctg tctctgtctc tgtctctctc tctctctctc tctcacacac 4440acacacacac
acaaacaaac acatgagtgc acaagaaact tctaggtgct actaaactaa 4500tgtaaaatca
tgcaaagttc atagagaatt caacagctag tgacaggatg acccgaacac 4560aagattctgc
cctagtcctt gtattctgta gtccccagtt tctctttact gccacagtct 4620cctatctctg
acagcctccc tctttgcaga tctggcagtt tctgggcctg gaactgcttt 4680ggtagaatgt
ctgtacagca tgcactaggc actgggtttg atccccagca ctgcataaat 4740caactttgat
gtcacaccta taatttcagc acttggcagg gatcgaagca ggaggatcag 4800aggtgaatca
aggccagcct gggctacttg aaaccctggg gagagggata gaagaagggg 4860gaggggggag
ggagaagaaa ggaaggaggg ggagggaaga ggagaggaag agaggaggga 4920gagggaggga
aacagggagg gaggaagaga aggagggaga gagggaggag ggagggagag 4980actagtgtaa
gcagaacctg taagttctct cctcagcctc aacacacccc agctccctgc 5040tgtctcccgg
tccagggctt cagggcctgg caggacaggc agcaggttgt tttgctctca 5100taaagccatg
ttacataact aactaatgtt ttgagcagtg gagctgagcc aatctaggtc 5160acatcaagag
ggaatgggga aagaggatga tcacggaagt ggtgagagga agggaaacaa 5220gaagggagga
ataaaaaaaa gaggcgagag tggaaatggg gtgcgattat ttaatatctg 5280ctgcctgttc
atagttcctg gtccttaggg acagcatata ttatcctgaa aagtcctctc 5340tctattttat
ctaggcattc tgtcatccta tagcccccac tctggatggc tgaactctgt 5400gccagcagcc
tgcaggtatc accccttatt ggagtgaggt ctattcctta ttggaagcag 5460tggcaggctg
gtaggaaaca aacaggcctg gtgttgtgga atgctgtcct cccagcatga 5520ccatcattag
accttatgga agcagagcga ggggggcatt gtcctcctcc ccaggctcct 5580gcaagcctac
tcagctcaac tggttccccg ggccagactt aggtgcaaga gttgctttgg 5640tttgttattg
gtggcctgtg tagctgagta gacacatgct cacctacatg atatatgatg 5700gcttgcaacc
ttctaaaagt tcagtttcag gagatccaga accctctttt gccctccaag 5760gacaccagac
acccatgtgg tacccatacg tacatgcggg caaaacactt gtgcatataa 5820aataaaaaga
gatggctccg tggctaagaa tgctccctac ctccagctca cccacatctt 5880cacaactgac
tgtgaatcca tccatggttc tcttctgacc tcggagggca cctgtgccca 5940tggggcatac
acatacacat acacaaaaca agtatgtaaa taaataaata tttaaaattg 6000gggctggaga
tggcttagtg gttgagagca ctggctgatc ctccagaggt ccagagttca 6060attcccagca
cctacatggt ggctcccaat cacctaaagt gggacctgat gtcctcttct 6120gacataaggt
catacatgca gatagaggac tcaaatgcat aaaataaata aataaatctt 6180tagaaaataa
gtacataata aataaatatt taaaatgacc caaattaaga aaaaaatgaa 6240gccaggcagt
ggtggtacac tcagaaggca gaggcaggca gatctctgag tttgagacca 6300gcagttccag
gacagccaga gttacacaga gaaactctgt ctcaaaaaaa aaaaagaaaa 6360aaaaacagag
aaagaagaga ggagaaaaac aagaacaaaa aataacaaaa caaaaacatg 6420gctttccctt
catggcatct gcttcatctg cctatttggt aatgatcagg gcactacaca 6480cccagtgctt
cataccctgg ccatgtttct gttcttggtg tcaccaccaa gtttactaaa 6540gatggttcca
gagtgacatt agcagcccca caccccaatt gcagctagca gttgaggaga 6600tttctggctt
tttgtctaag aggaaggttc tttggctagg agatatactg agaaggacta 6660ggaaaagggg
tgtctaagaa acttggagag cacatttttc aagtcagaaa gaacatagac 6720atattctggg
ggtgggggta gtaagataat ggaccctcct aagggaagga ttgtggggtt 6780tgcctgaagg
ggctgaagca gaccactgag caggccagac caccagcagc ttttgagagg 6840tgggaacact
gcagctgaag tcacttgtca ccttcccagg tagttcttac ttccagctct 6900ggcagggcta
gatagcctag gaactcccag ataggagttc tagttcttct tctcccaagc 6960tgacagaacg
tgagctcaga gtctagggac actccaggtt aaggacgggg ccattcttga 7020ttgtcagcac
agatagattt taattagaga gcaatgacat gacagataaa cagcccctta 7080tctaaagggg
tacatcccaa gaccctggag gactcttgaa aacccagata ggagccagcc 7140acggaagcat
atacctttaa tcctaagatt tgggaggctg aggtaggagg atctctgtga 7200gtttgaggcc
agtcttgtct acaaagtgaa ttttgggaca gctacacaga gaaaccctgt 7260aagaaaaaaa
aaaaaaagaa agaaaggaag gaaggaagga aggaaggaag gaaggaaggg 7320aaaggaagaa
aaagataaag gaagaaaatc caaataggaa agaatcccat atataccata 7380tttttcttaa
acatacatag gtttattcat tctctctgtg tctgtgtgtc tgtgtgtctg 7440tgtgtctgtg
tgtctgtgtc tgtctgtctg tctgtctgtc tgtctctctc tctctctctc 7500tctttctctc
cctctctctc tctttcttgt ctcataaatc tcaacactca gggacccaga 7560agatatccca
gtggttaaga atacacactg ctcttgcaga cctaaactca gttccttgtc 7620cctacttggg
gcagctcaca accacacctg taagtctagc tccagggaat ccacaccttc 7680tggcctgtgc
aggcacctgt gtgaaggagc acatatcctt ccccataatt aaaaaacaat 7740cattgaaaaa
taaaactcaa ccccctcccc cgggactcaa accagaggta gtctccctgc 7800cgtaggcgct
caaaaactgg actttcaggt gtgagcctct aggccaggct gcttttctta 7860actggctacc
gtgctcttgc ctgaaacttc cagcttgaga cctcatagta aaaagaacat 7920acacgtcttc
tgtctgtact attttacaga cggctgacat gttcatacca cgtattttag 7980caatttcagc
acttggtata ttttctgtca ttctcaaata actttcacct tgccacttag 8040ggcagtccaa
ggctcctctt agatatatcc aaattatcag ccaccacttc tgcctttact 8100aagtaagaca
gggtacttaa catggagtac ttaacacaag cactgtgatc tgaaggtgga 8160gactgcttgc
tactcagtca cagcttagca ttgctagaac aaatcctgaa caaagggtaa 8220ttcatgaccc
aggcagggca gaggcggatg gctgttcttg ctcctcagaa acccctgtgt 8280ataatttcaa
gcttaggagt tgtttgtctt tggatggaga gggtcagacc tagggcttca 8340ctcacactag
gcaagcaccg caggtctacc ttcgaagaga agaattttca cttagcgttt 8400tcagatatag
gtcaacctca gctggctgaa actttgacta agtgagcaac tgtgagggtg 8460gggaacacat
gcatgcattt cttcatgtta taacatctat ttatacataa acatatcata 8520taaatatatt
ctattgcata taaatataca taaatgcaca ctcatgtata gatatcaatc 8580acataattta
tgcttttatt catagattat ctctgggagg tgtacaatta ctgacaatac 8640ctgcacatga
tagtacacgt tgttctagtt aggtttcttt tgctgtgaca aacaccacaa 8700ccaaaagcaa
cttgcagagg gaagggttta tttcagctta cagttgtatt cattatgaag 8760agttgggaag
tcaggacagg aacctggagg caggaactga agcagaaacc atggaataat 8820gctgcttact
ggtttaccca ccatgactca acctgctttc ttatatcacc aggactgctt 8880gcccagggat
agaaccacac atggggactg tacctcccac aacaatcatt gatcaagaaa 8940tgccctagag
tcagggatgg tggcaaatgc ttttaatccc agcactcggg aggcagaacc 9000aggccttgac
tgtgaggtca aggccaggct ggtctacaga ttgagttcca ggacagccag 9060ggctactcag
agaaaccatg tctcatggaa aagaaaagga ggaggaggag aaaggagaag 9120gaaaaagagg
aggaggagga ggaggaggag gaggaggagg aggaggaaag aagaagaaga 9180agaagaagaa
gaagaagaag aagaagaaga agaagtagaa gaagaagtgt ccactggaca 9240atctgatggt
ggcgtttccc aattgaagtt ccccttccaa gataactcca ggatgtgtca 9300agcagacaaa
aacaagaacc aagacacatg tttataatcc caacactggg gaagtggaat 9360aagaggtttg
gcagtttaag gccattttca gctacatagg gagttccaga ctatcctggc 9420tacatgagac
cctgtctcaa aacaccaaaa tgcaagggaa aaacaaaaag caaaataatg 9480agtacaaata
gcagtgacat tctggggaga cagcctggag ggggggattg cttattatct 9540ctccctaccg
tttggagttt ttaaaatcat gaatctaacc ccagaaaaaa aagcattgag 9600attctgggac
actcgggtgg tagagaagat catctgatcc tgtcaccttt cgggtacgtc 9660actttattaa
tctctctgag attcagtttc atcacctctg aagtggtttg tgtcgacgta 9720cagtcctcag
gactaagtaa ggccacttgg tggctgtgcc aaagcactgt gtcagggaca 9780cggcagatgt
ctgacacatc ttgttagatt ccttttctgt cctccgctcc cctaccccag 9840aggtgggtac
agccccatgg cacctcatct ttaatggctt gggtttcttt tctccagcca 9900ggaaagttgt
cgctttggtg acagctattt taagtcaact gacctttcct gcaaatgatc 9960cagatgcctc
tatcttaggc tggtgatgac gaagatggcc tatgacgggg ttcctggggg 10020tgtgttggga
ggtggggcag gggtggggcc cggcatttgt cagacccata tgatcttctg 10080gctcccgggc
tctgcagatt tctcctgctg gagatgccta cctgccagca atcttggaga 10140agacagaaat
agcagctttg ggttccaggt cccctcctcc ctttggccca atgtagctag 10200agctttggtt
tcctgctgct gtcttggtgc ctggagccct ctctggatgg tcatggagtc 10260ttgtcagaga
agcaactttg ggctggcaga cagtcattcc agaagacatg atctggaaaa 10320actgcttcat
cgtttccttc agaggcactg tcccgagccc atttccttgt ctggttcctg 10380aaatctcagg
gatgccatca gaagaaggtg ttcttgtgtt tactttggac atggttttct 10440gtagtgcaga
ctgcccttaa actctacgta gctgaaaatg accttggtct ccagacctct 10500tgatctgtca
gcatccctgg gaaatccagg gttctgtaat cctcccctct caccttgact 10560tactgtacca
gcatcaaaca tcctaaacaa atccagtgtt tagccaaata cagcggtgca 10620tgtctgtaat
cccagccacc tgggaagccg aggcagaagg attaagggag ctggaggcca 10680gtctgtgcaa
tttagcagga ctgtctcaaa acaaaattta atggttaggg gtgggcatgt 10740catttatttg
actcttatca catgaacaca cctgtaatct catcacgaaa cgacaaggca 10800ggaaaatcaa
aagttcaaag tcatctttgg ctacatagca agttctaacc tgacctaggg 10860tatgtaagac
cttgtctcaa aagcaaacaa acaaacccca aataacaaca acaacaaaac 10920aaaaagcaaa
caaggagagg gtgtgcagct agggatataa ttcaatgggt gagggcttac 10980ctcacatgca
cgaggccttg gtttcaactt ccagttgaaa tgaagtttag tggtagagtt 11040ctgtgcaagg
ctgtagtttc agctctccat actgcaaact ggaaagaaca acagtgacaa 11100acagaaacaa
aaaaccccca caaacaatgt gctttctcac tcaataaaac cacctcttta 11160catacaacta
caactgctaa gaaagttctt cagtgttcta gagcctgagc acctcaaatg 11220gtttccataa
agctgtatgc aaacactgat aagccacgag aagcaactgt acaaagcacc 11280ctttgatttt
catagtttat ctacacaagg attctaggaa agtgtgctag gaaaatttta 11340tgtatcagcc
ttgcgggttt gtccaatagt tttagatttt gccagtgaag attttccttt 11400ctttattttt
tacatgggaa ggaagtttaa ttgggggaag ggacgggagt gggctttatt 11460tttatttttt
aatgagacta gcatttgcat tggtggacat tgaaggaaac agtttcccct 11520ccctaatgtg
tgtgggcctc acctaactca ttgaaagtct tagataaaac taagctgagt 11580gagtgagttg
gcccatacct gtagatggaa ggaaaagggt cttgagtttt ggtttatcct 11640agagagaact
tgatccccca aacaccaaac tttcaaacca aaccccagcc tcctcagtgt 11700gaagggatgc
tgttacatga ccacctatgg actcagacaa cctctcttcc ctgagtctgc 11760tggcttactc
atcagagtct gggctcacga agccgccaca catatatgag cctcgttctc 11820cccactcttc
tcttgtggca ctgaggttca aaccaaggac ctcgcacatg atagcaaata 11880ctgtactgaa
ccatagagcc agcccttgtc agtttcttaa cacaaacata tagatgtata 11940tgtatatgaa
tatttccatg ctaccaattc cattttctca gagaaccaaa gaatacacca 12000agtagtcaca
cttgaaattc tgttctgaga ttgaataaaa cctgatcaaa tgtgaattcg 12060gtcccttctc
ccccatccct gacgccacca cgttgctata cagaccaggc acaaactctt 12120ctccttgtga
atgtgtgtaa cacatgttac cactgtgctt ggcttttgta gttagaaggt 12180tggttgatat
ttaaaaaaaa actttaatat ttagtcatta ctttttagta aagatttgcc 12240ttgcttttat
tttattcatg tgcatgtgtg tgtatctgtg tgagtgtatg ccacgtgtgt 12300ttgggtgcct
ctggagattg gaaaagaatg tcaaaatccc aggacctgga gttccaggca 12360gttgtaaact
tcccaatgtg ggtaattata atgaacttgg atcctctaaa agagcagaac 12420tcactcttaa
ctgatgagtt atccttctac ccccaaattt atttgttttg tttatttgtt 12480tatttatttg
agagggtctc actgtgtagc tctgacagta ttagaattta ctatgtagac 12540cagacttgat
aaatgtctaa ccctagaaaa aaatagtttt gttttgattt tatgtctgtg 12600ccatccactc
cttgaacata tatttggtat ctgtgaagcc agtgaaggct gttggttccc 12660ttaggactgg
agttacagat ggctctgagc taccatgtgc atgctgggaa acaaactcag 12720gtcctttgga
agagcaaaaa atgtcctttg atggtggtgg tttgaatgag aattgcccta 12780tcgagcataa
aaacttggca gctttggcta catggttctg gattaagagt caagaaggat 12840acaagaaagc
ggttgtggaa tcatccccca tggttaagga aaaccaccaa agccaggctt 12900gtggcagggg
agttcctgca tggaggccaa gagaagccac tatgtcaagc tgtgaaggtg 12960aagcctggat
tgtgttggag acccaagcta ctggagatgt aagagatgtg agataatgcc 13020caggagagct
gcagacaggg catggaatca ggccaagcga gagaagtgtg ttgcagtcag 13080cagaactggg
agggaagagt catctaagtc ctttgtcatc agacatagag atacaggatc 13140tgaaatttgc
tctgctgggt tttggtcttg atttggccca gtacttccta actatgtccc 13200cttttctccc
ttttagaata ctaatttata ttctgtgcca ttgccggtgg atcaggatgg 13260ttctcagata
ctgttttagt tccatgcctg tctacttccc gtcatgacag tcatgcacta 13320acactctaaa
actgtaagca agctcccaat gaaatgtttt catttataga ggtgccttga 13380tcatgctgtc
tcttcacagc aatacaacag tgattaagtc agctgctgag caatctctct 13440ggccccagaa
gtatgcatgt gtgcaattgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 13500gtgtgtgnnn
nnnnnnnnnn nnnnnnnnnn naggaaatgt cattctgtaa atatgtttat 13560cttattggtt
gatgaataaa acactgttgg ccaatagggc aacaaaatag gtggggccag 13620gatataagga
ggattttggg aagtgtaggc agaggggaat tgtcatatga tcccaggaag 13680agacatagat
gggcagaaac tgcctctagc taaccataga ggtctggagg tctgtacaga 13740caggcaggaa
gtgatgtagc tggaagaatc agaatataag caggaacaaa caggaaatcg 13800agctcttctt
ctctctccac ttcagagatg ctgaacagtt gagatgcagg atgccagaag 13860agtaagaggt
ccctggacct ttctccagta agataagacc atgtggaaat agattgatag 13920aaatgggtta
gagattaagt cagagctagc caataagaag ccgtagatat tggccaaccg 13980tttcataatt
aatatagcat ctgtgtattt atttggggga cctggtagac cagaaaactc 14040gtgttagaga
catcttatca aagttgaaaa aagaaaaaat gtgataaagt taggaaaaaa 14100tatagtaaat
gttaaaagct aaattctaaa actacaactt atttatcatt tcctaaatgt 14160ttaaaaatat
tattttataa tgaagatact taaaattcat ttctctgtct tttgagacag 14220ggtctcagtg
tcctggaact cattatatac agcaggctgg cttggaactc acagagatcc 14280acctgcctct
gtctcctaaa tgctgggatt aaaggtgtgt gccaccaagc ctcaattaaa 14340atgcgtttct
ttttctttct ttcttcctgt ctttcatttt tttgtttgtt tagatttttt 14400tttttagaca
gggtttctct gttagcatta gttgtactgg aactcactct gtagaccagg 14460ctggccatga
actgagagat ctgcctgcct ctgccttctg agtgctagga ttaaaggcat 14520gcaccaccac
tgccaggctt aaaatgtatt tcttttttta atttagaaat ttattctgtt 14580taatccacac
gctttatata gctttagtta agaaataaaa taaaatgaaa cagtgaaacc 14640aagagactat
gtccaagtcc aggtcctccc agcctgccaa tgccaagagc tctttagttc 14700tgtgtaccaa
ttggaagagt aagaaaaaaa tatggatggg aaccacacag tttcataaaa 14760cagatttatg
gaactgaagg gtccttgctg agtctagcaa attgccttta caaaagagaa 14820agaaaaaagg
gggaggtaga aaaacaaaac aaatcaaccc aaagaggaca aaatcccaga 14880gttctaaatt
gacttaggaa cctgtcacac tgggacagaa gcttcagcat ccatgagctg 14940tgcctcccct
gctctctaga gctgggatct cgaggtgtca gcagagaccc cacaggtaac 15000aggagcaaaa
acactcactc agacctttgt ggtacttcaa cagtggtctc acttctgggc 15060aagcttacaa
acctatacaa agttgaaggt gtactttaca tgagtgctaa acttcaagag 15120gaaggaagaa
aaaaagggag gtggagggga cagagagaga gagaaaaaaa caaaacaaaa 15180caaaaacaac
cacctcagga gaggcaaggg catttaaagg aaccacaaga atgccaacga 15240tattaaaatg
tatttcttaa tagtaaattt tatgggaaaa gagagtctcc tcttcctcca 15300agtaggctag
gtaagtacct tgccactgag ctctatctat acccttcaaa gtggacaaaa 15360tgacaaagat
agttcatctc ccccaaaggc cctgttgggg tgctgattgt cacatctggt 15420gagatttctg
tttttgtttt tatttcaaga cagggcctct ctacatagat agtcctggct 15480gccctggaac
tcactctgta gaccaggctg gcctggaact catagaccca cttgcttctg 15540tctcccaagt
gctggtgcta aaggtgtgca ctgccactct ttttaagtaa ctatgagttt 15600caaaacaaat
taaagagcac tgttaaagtg gcttgttgtg taagcctagc ttcaagtcaa 15660aggcccgagg
ctcccctacc aaccagctgc tatcacctag acactgtctg tagatcttgc 15720actgactcaa
aactgtggcc taaggtcaaa ataatggtct tcctggattc tgatgtgagt 15780gagattgtgt
aggagggctg gccgctggcc tggcttgagt cactctcagc tggtttcatc 15840ccattcctgc
aactctgtgt aagaggtgga tgatccttgc ttaactgatg aagaaaccaa 15900agctgtagaa
aggatcattt gcttaactct tcacagatgg caagaggcag agtcaggatt 15960ggcagagtca
cttctgccaa cttcaccctc ctgctaactc caccctcctg ctaactccac 16020cctcttgctt
atacttgaca gtggaggaaa agccactgag ggaattaaaa gttgttactg 16080gtaatggtca
ggaaaaaagc tgaacaaagg agattagatt cagggatctt tttctgaaaa 16140gaaagaaaga
aagggggact atagtctaga aatgctgaga taaaagggtg gattatcata 16200tctactctca
aactaaagaa gcaactacta gtctcaaata ctttatattg gtatggattt 16260ttgtgtattg
gtacaaattt aaggttattt ttgttatact gtatatatgt ttttctttct 16320tgtttaaggt
attgtacctg tatagcttat ttaaaaatgc aatgtaaaca tatagtcctt 16380gaaaactatt
taagataata aagaaataca ggttaatagt catctatagc aatcaaactt 16440atagtcatgt
taggtatgtt ttcaagggca tacagaaata aatttgagat agataggtca 16500tcttcaaaca
ctccagagat ctacagaaaa tggcatttat aaaatgtttt aatgacataa 16560gatttttcat
gatagtgaga aatgtctact cttggcagca ccaatttact tcaaaaatgg 16620acaatgggca
ttgaagaaac tccatgtgga ttttgctttc tttgtggcaa aaatctagct 16680atctgggcaa
gaaacttccc ttaccttgac tgctgtccta actggacaag caggacataa 16740aagaaattga
ctgctgaact ttgccaagat agtatacatt agtctttcaa aaatccctgc 16800tttacaaaaa
agtctatcag atattctaag cttctaggcc aaagatggat gcttcaatgt 16860taacagagga
atcttctgtg actgatgttt ctgtcatttc tatagttttg aaaattgctt 16920gctctgttct
tccctgtttg ctcaggtagt attatttcct tcttgagtgt ctaatggagt 16980taaagactag
atagttatag ctacagtttt ccttgtaacc aaattcagaa aagaaactcc 17040caaaagaggt
gtaaaagtat gaggctgaga aatataaaaa cttaaattta tctaagaaaa 17100tgttttgtta
tctaaaaaaa aataattttg ggttagtaat acaagttagg atagaaaatg 17160aattaggtac
aaaactttgg actcatcaag aaaaaataga taatggagta ttttctctga 17220atttgccaaa
tacaaataga ctgggtattg taaatgtaat tcttacttga taattgttct 17280tattgtttat
agtttattat gttagagtca aaacctttct tttttattta gacaaaaagg 17340gggaatgtag
aatatttctt tacactgtgt gaagatgtat cactgtgatt ggtttaataa 17400agagctgaat
agccaatagt taggcaggaa gaggttaggt gagacttctg ggaacagaag 17460tctcagggaa
ggaaacaggc taggtcacca gctaaatgaa gaggaaatag gacactcagg 17520aggagaggta
acagccacaa gccaagtggt ggaatataga tgaatggaaa tgggttaatt 17580taagtcatag
gagctagtta gaaacaagcc tgagctaaag ctgagctgtc ataactaaaa 17640gtggagcttt
cataattagt aagtctctgt gtcatgattt gggggctgac ggcccaaaaa 17700agcctgctac
ccaagttctt ttcaattttc aagttctagg attctggcct tttattggaa 17760aacactgtca
agtttctata gaggtctgac tccacagtgt tgcctgtgca atgaaattta 17820tttaatttat
tccgaggcct tgtgcactct ggataatcac tgtaccactt aatctatatt 17880cccatccttc
attataattt aaaatggtct tattaatctg gtcacttggc tttttttttt 17940ttttttttct
gagacaggat ttctctgtgt agccttggcc atcctagaac ttgctctgta 18000gaccagcctg
gcctggaact cacagagatc cacctgcctc ccctccagag ttctgggatt 18060aaaggcgtgt
gccaccacct cccagtgagt ttatgtcttt gcaaattata catggtttca 18120gttttttttt
ctgtttgtaa gtcactttat ttcaaatgta aagtttaaaa caagaagcaa 18180attactatga
atttttgtta acagtcattt tccttaacta ataagtttta aattttcatt 18240aatatgtttt
gatcatattt tttccatgcc ccaacacctc caaaatctcc ccactcattc 18300agttctttct
ctatctcaaa aaatgaaaaa tccaagcaaa caaccattag acaaaaaata 18360acaaaacaaa
acaaagcaaa gcaaaataaa agcacacggg ctggagagat ggctcagagg 18420ttaagagcac
cgactgctct tccagaggtc ctgagttcaa ttcccagcaa ccacatggtg 18480gctcacaacc
atctgtaatg agatctggtg ccctcttctg gtgtacagat atacatggaa 18540gcagaatgtt
gtatacataa taaataaata aaatctaaaa aaaaaaaaga aaaaagcaca 18600caaaaaaccc
agagagtgtg tattgagttg gttaacccct actcctctgg agtgtgattg 18660atacagccag
tgccgctatt ggagaacact gattgtccct gtccttacag gtatcaattg 18720tgtgtagctc
cttggttagg aatggggctt tgtgtgcact tcccctttca gctttgtaaa 18780gggtgtccga
ttgaagttcg tatcttctgg gagagcataa aatcaaaaaa agataaatgg 18840actccagtga
aaaaggagca agcggcacct atctttaagg tagagaggca gaggagtgtg 18900gtgtggcctg
tcacaaacac ccaattccca atcagctggc gtctaccagg ctgctttcac 18960ttagatgaac
cctgacctcc atgtctcctt aacattgcca ttgtttaact gttagtgagt 19020ctgccctctg
ttcactgaaa gactttcaga aggtggtgtc gcctgccttt aatcctagca 19080ctcgggagtc
agaagcaggt agatagagct ctgtgagttt gaggccaggc tggtctgcag 19140agttccagga
caggctacag agtgaaaccc agtctcacaa acaccgcctc caccacaaaa 19200aaaaaaggaa
acaagataga gtgaacaaac ccagctacct agacatctat ctggtaaact 19260gactcatccc
aatcctccct gccctcccaa agagcttggc tggctcactt ccccaaatgc 19320tcttcccctt
taacatttaa ctagttcttg tctcttgtat ggtttccttt taactgtatc 19380caccacccct
accttgactt ttgtcctggt tggtttttaa ttgtaaactt gacacacaaa 19440gtcacctggg
aaaagggaac cttaattgaa gaattgtctt agattggcct gtgggtgtat 19500ttatagggca
ttgtcttgat tgccaattga ttcggggtgg ggagtgggag ggtagggtgg 19560gggtgggagc
agcccactat gggactcact ttccctaggc agatggctat attagaaagg 19620tagctgagcc
taagccagcg ggtgagccga gccagcaagt agcattcttc tatggtttct 19680ttctttcttt
ttctttttct ttttcttttt ctttttcttt ttctttttct ctttcttttc 19740ttttcttttt
ttttttttct tcccgagaca gggtttcttt gtgtagcttt ggagcctatc 19800ctggcactcg
ctctggagac caggctggcc tcaaactcac agagatcctc ctgcctctgc 19860ctcccgagtg
ctgggattaa aggcatgcgt caccaacgcc cagctcttct gtggtttctg 19920cttcagattt
ctgctttgag ttcctgtctg acttccctca ataattgttt gtaacctagg 19980agtgtaagac
aaatgaaccc tttcatcccc aagtagctat ggatttagag tggtttatca 20040cagccacaga
gtgaaaccag aacaactttc tagtagcctc ttgttctact ccagctgctc 20100ctctgactat
tcctaaaagg tagttgggct cagggaacca catcccgaga gattcagccc 20160atatgaaaat
agctccattg tgttgaagaa atgtgaccct ccaggatttc aggcatcagg 20220attccatgtt
gaaaatgaaa acaattattt tcctctctct caagattcct ttagtcacct 20280tcccttaccc
cagttcctgg ctttccttct aaacaaatgt tcagggaggt tcaaacaaac 20340agctgtgaag
agcagcatcc cataccccca ccttccgacc caacacttgc cagtgctata 20400agtagactgg
gatcatccct ggacactgtg ttaaattacc catgaccaac cttctagcaa 20460gctctccttt
tcaggatttt gttgtttgtt tgggtttgtt tgtttgtgac ttgatctcat 20520gtaagctgac
ctggaatttg cttaatagcc aaggatagac ttacaacctg tgatgctcca 20580gcctctgact
cctgagtacc agggattaca catgtgtggc atcacaatga aagattttag 20640tttgctgaga
gaaaaagttt ttaaagattt tagttcacag agagaataag tttcccacag 20700gccttggtcc
aggacaagga agttggtccc aacccgaggg cagacaaaca atcctttttg 20760ggtcacacct
ggctggccaa cagacaataa aggacttctc agggtacatt ctatggttga 20820ccactctaac
atgagatcat actttgtaat caatcacttt gtgccccttg cctgtatgct 20880gatctgcggt
tttttacagg ctcctatata aggagtctgt aacccttgct ggggtgtgca 20940gcttccccga
tattgctgac acccgaatga gcattcgttc aataaaccct cttgcttttg 21000cagctcttgg
tctggtttct gagtcttggg gcctccttgg gatcctgaga cccttaaggg 21060tctgggggtc
tttcaacact taactttcct gtttttaagt aggaagatct gaaatcccag 21120attcctgact
ccattgcaca ttttctgtat tagaggctgt agctctgtat agtgggttgt 21180gtggcttaca
catgctctga gctggagatt ctagggacac ttagggtaaa gtggagtgtc 21240agcccctttc
cctgctagac tgaggccttt ctgttctttc ctaactggga ggctgtatag 21300cacccaatgt
gttcattaaa ctccatatgt tagcactgca tggaatctga cacacacaca 21360cacacacaca
caccctctac caccaccatc atcagcacca cccccatcag caccaccctc 21420atccccccac
cccccaccct gccccnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc 21480aactggaggg
tagcattagc acccagatgc cattaatgtg ccaaatattt gcttgcttgc 21540ttgcttgttt
gttccagcat ccttagtgaa tgctcctgcc ctcctggtta aagatggctt 21600tggcatctct
tggcatcttt cttgtattct aggcctgaaa tagggatgaa tggtgaaggg 21660caaggagctc
aagtgtcact taccacctgc acttgtccct ttaaggggtt tccctagaag 21720cagtctacat
ttcattagcc agagctttgt cacctggcta cttgtgaagg aggtggtgaa 21780gaagccttac
ctttgactct gccacttgga gccaagtcag gattctctcc ctggaaagga 21840aatggaagat
taataccttg ttggttgtta gacctagccc attatgcgcc atgaggaaag 21900agagacaaca
gtgggtcact gattgatcag ggttacagga caaggagcct tgtttctcct 21960aacagctctg
agcggagaca gaagtggagt atataggcat aaaattcaca aacatttgct 22020gccacgttac
aggtacattt tttcaccagt cagaaatcaa agattaggga ctttgcttgt 22080gtgttccatc
actgtcaact gacatacacg gcaagccttt tagtccaacc aatcagaatc 22140atttgttcct
tctgttgtta ggagcagcca taatgattct aaagaactaa caatgcataa 22200tgactatttt
tgtagtttag ggatgaggta tgtcagccat tggacagttc tcagctcccc 22260tagggcttgg
gaacttgaac tttatttcat cctgcatgta atggagtctg aagtcaaaat 22320ggcagtactt
aggtcaaggt gctcgtgcct gctgccttca aggtggtttc ccattcccac 22380cataccagag
acttcctact gcatctccag tcaaggacac aaacactttt aagtcctgac 22440tgttgattca
atctatatag ttaccagcat agaggctaag agtcacactg gcttgcaggg 22500gacttctcta
gcatatgtga agccccgttt gaatcctaaa cacaagagtc taagctttgg 22560agtcagagac
aagcatgttc aaatctgtac gtcaccaccc tatagacata gacaagtccc 22620ttgggctcag
ttttttcact acagagagta attgttattt cagattccta gggttgtggt 22680aattaaatag
ttgaaagata tagcccatgg aacataaaaa aaactcaaaa ccaggcacag 22740tggcacatgt
ctttaatttc agcactcaag agacagaggc aagtggatct ctgtgagttt 22800gaggccaggc
tggtctatat agagagttcc aggtctacac agagaaacag gctcaaaacc 22860aaagcaaaag
caaaacctca actaatgttc ataaaattat gaaattgctg gtaccagtga 22920catgactcat
tggtaaagac acttgctagc aagtttaatg atctgagttt tatctccggg 22980atctacaatg
tagaagaaga aaaacaactc tcaagagttg tcctctgatt tccacttatg 23040caaaatagca
tgggaacaca cttaagcagg taggtaggta ggtagataga tagatagata 23100gatagataga
tagatagata atagacataa ttaagaacgt tcagttgcag cacagttcat 23160actgaactgc
atttggacac ctctgtgaaa agtcaggagc tctcctgtcc tcctggtgac 23220atttaaacat
tgaaggcaac tattttaact gtcagttata tacaaatcca ctggccttgt 23280aaaattttaa
aacataacag aggaggctaa agtcctgttt aacaaccctc tccttttacc 23340atcccaggaa
gccaaaattg ttcacaattt gttctcttcc ctcaggcctt ccatatttca 23400aataccacat
aaaacaccta tggaaaaaca tgaggtatta aaaatgtcac ttggaaatcc 23460ttcttcaaac
aagcttgttc tttctttttt cttttatgta cagtgaatgg aatccaggac 23520ctttgcagat
gctaggcgag tcctttacct cattcctctt tcgatttaaa actttttctt 23580gttttgtgga
gacagggttt ctctgtgtag ccatagatgt cctagaacta gctctgtaga 23640ctaggctggt
ctcaaattca gaagccagtc tgcctctgcc tcgggagcgc taggattaaa 23700ggtgtgggca
gagtgctagg atgaaaggta tgcacaccac cactcctggt tgattttaaa 23760aagatgcttt
ttaaaaaaaa tgatgtgtag gtagtggggg gagagacggt ttcatgccta 23820agagcactga
cagctcttct agaggactca ggttcaattc ccagcaccca catggcagct 23880cataaccatc
tgtaaccccg gtcccaggga atccaacacc ctcttctggt ctctgtgaat 23940gacagatatg
catgggatat acaaacatat acgcagacaa aacactgtat acattaaata 24000agtacaaatt
taaaatatgt gtaggcatgt atgtctgcat gtgggtatgt gtacactgaa 24060tgcaagttca
cttggaggcc agagatatat agatcccctg gagttgcagt tacagatact 24120tgcgagctgc
tgtgagtgtg ctgggaacca aatcctctgg aacagcagca agtgctctca 24180cctgctgagc
catttcttca cccgcttctt tctacttttt attttgagac aaggtcttac 24240taagttatat
attcacttgg ggcttgaatt cattttgtca gcaggcagac cataaacttg 24300ccttcctctt
gcctcgggct cctgagtagc tgagacttca ccatgaggtc tggctttgat 24360tacatttttc
tttgttttct ttttgggggt ggggctgatc atgaactcta aatagccaag 24420gattgatagt
gaagtccaga ttcccccacc tatcaccggg tggaattaca ggtgtgcact 24480accacaccca
atttggtttg attttttttt tttttttcag gacaagctct ccttttatag 24540ctctgactgg
gttggaattt actatgtaga ctaggctagt gtcaaaatca cagagatctt 24600cctgtccctg
cttcctgagt actgggatta aaggcatgta ccaccacacc ttcgggtgtg 24660gtgatgcaca
gctttaatcc cagcactcag gcaggcgaat ctctctgagt ttgaggctag 24720cctagtcttc
agagtgagtt ccagaacagc caaggctaca cagagacact ttgtttcgaa 24780aaacaaacaa
aaacaaaaga ggctagcctg aaactcctga ttctaccagc acctcccaag 24840ggctgggatg
acaggttgtg gccccatgct ctctgccggg gcctctcttt tctttcttct 24900gtttgaggta
gaggcttact aggttggctg ggtgagttgt gaactcactc tgcagcccac 24960acaggaactg
atcttgtgat cctcctgcct cagtctccct agcagctagg attgcaggcc 25020tgcaccatca
ggcccatcgt acactgtttt ctgagtttga aaattgcctc tgttgttgac 25080tataaggcat
gctctcctcc taacattgtc cttggtgcct ctgccaccct ttgggactag 25140agagaacaga
tcttattcct atttcacatg ctgtgccaac ccagtaacaa actcagattc 25200ctgcttccgc
ccccaccacc cccatctaat tgttcagtgt ttctgtgaag ataaacacga 25260tcatctttgt
gaaagccact taagttcctt tcaaggttgg gatataagtt agagtgatag 25320cttgttccca
gggtggggag agcatgtgaa ttcccctctc gctcaagtag gctatactaa 25380ttttcattta
gatatttctg aggcaaagtc tcatgctggc catccacctg ccttagcttc 25440tcaagtgctt
ggattacagg catgagctac aatatctggc ttagtttcaa ggttgtgaaa 25500attatactgt
gttctgatga cctgagttca attccctgga cctgggtgat ggacggagag 25560gacagacccc
tgcagattgt cctttgacct ccctgtcact atgtgaacac tcgtgtacac 25620acacacacac
acacacacac acacacacta aatgaatgta ataaaatata aaaaggtgtt 25680cactagttaa
taagacatga gagaaaaagc ttaccatccc taatcaatgg ggaagcattg 25740aatataagtg
actgtggtca tggaaagcag tatagaggtt cctcaataaa ctggaatata 25800gcagcatata
cttgtaagcc tcccacaaca ggagaaaggt aaagaggggc ggccactctg 25860gaatattatt
aatatcctgt ttcataaaca agtaaataga acaaacccct caacaacaag 25920aaccggtgtg
ctggcacaca cctgcaatcc cagcatttgg gacttggagg cagcacaatt 25980gaagttcgtt
cttggtcatc ctcagctatg tatgaaatct gaagcctgcc tggcctacag 26040gagaccctgt
ctcaaaaaaa taaactaaat agattaaaat gaaaattaga agcaggtagt 26100gtggaagttg
aataagaata gccgccatgg gctcatgtat ttgaatgttt agtggcacaa 26160cttgagtgag
ttaggaggtg tggcctgttg gagttgtgtg tcactgggag tgagctttgg 26220gattttagaa
gcccaagcca ggcccaggga cttgctctct tcctgcgatc tgaggaactg 26280gatgtagaac
gcttagctac ttcttcagca ccatgtctgc ctgcatgctg ccatgttccc 26340tgtcaaaatg
ataatggact gaccctctga aacttggtct cttttggctg aggagttagc 26400aaggtaagag
gtggctgtgg cttgctcttg tttctctctc tctgatcttt catcattttc 26460tcccgtatct
ggctgtgggt ttttattatt aagagtaatt agaactcatg ttacagtggt 26520acatgcatgc
cacagaccca gtgtggatgc cagaggacaa catgtgtaaa ttttttcttt 26580ccttgtatgt
gcgtccaggc tagtttcaga cttgtgggct tctgcttcag cctcccaaag 26640gtggggacca
caggcttata tacctacact cacctcttta ttcccagtgg atgtgtgtgt 26700gtgtgtgtgt
gtgtgtgtgt gtgtgtgtgt gtgtttgtgt gttttacaca gacctgtacc 26760acattcattt
ggttactttt ttttcctgca ttttgttttt aggtagggtc tcactatgta 26820accctgactg
tcctggaaca tgctatttag attagactga cctgctggtc cctaccttcc 26880gagtgctggg
attaaaggtg tgtactacca tacctggtga ttagtttgtc ttttgagact 26940gggtctcttg
tagcccaggt tggtcttgaa ctcctggttt tccagactct accttccaaa 27000tattgatatt
gcaggtggtc actaccatgt gtggaattta tttttgagca gtgttctgtg 27060ggtggatgat
aaggtcatgt ctatggtaaa attgtttcta ataatgatga atagcttcat 27120gtgtgtatgc
atctatcagg tttgttcaac ctgaagtgta ggcctaatat ttggatttat 27180ttagccagtg
atagctatga attgagccca gaaaaaatca taaacttgac taaaacatct 27240taagaatttt
gtaacttctt ttgtaactca actgtattgt ttctgagcat gaatgttgta 27300aatgacaatg
tcagctgcca tgtcaaaagg ttgaacatta cttggcagtg gtggcacaca 27360cctttaactc
caacactcag gaggcagagg caggcagatc tctgagttag aggccagcct 27420ggtccacata
gggagttcca caccagctaa ggtgacagag tgagaccttg tctaattttt 27480ttttaaggtt
ggacatgtat aattccagag aataattttt cactaatcgg aaaagaggca 27540gtttcaactt
ggagttcaca agatttaatc tttctttgaa gatttattta tttttagtta 27600tgtgtgtgta
tatatgtatg tatgtatgta tgtattggtg tgttaaaccc ctggggctgg 27660aattacaggt
ggttgtgaac ctgatgttgt aataagctcc cagaccgtag cacaaatgac 27720tctatgaaga
aagtaccatt caggctgtaa aatccacata gacagcacca cctggaaaaa 27780ctaaaacaaa
aatccaatcc atcaaactcc acagatctgg gaaagtatct aaatgcacta 27840accttgattt
ttggcttctg tagttctgct tctggctaac tattcttgtt aactgaagta 27900tgtgaaccca
caacatggtt tttgtgctta aaagttctct gttctacaga atgaattcca 27960ggacagccag
agctgcatgg agaaaatctg cctcaaaaca aaacaaacaa ataaaaacct 28020tgagaaaggc
tcagggctat actggtatcc catacactca gtgtagtcgc caactgtcaa 28080agactttttg
ttgacttaaa cccatttcta agcagtattc tcttatggat accccttaca 28140agtgggtgct
gggacttgaa ctcaggtcct ctggaaaagc agaggatttc tcacctgctg 28200agcacctctc
caggcccata agatctatct taagacaaga cctgagcagc cttatggaga 28260tggcagtctg
gggaaccact ggtgcgcctt ttcttctgct ggtcacaaac tgctgtggga 28320atttccatct
gaagttcctg cctcttctca cattccatga tatgagaaag ctatcaatgt 28380tctaaatctg
tttgctttct gctttgcaag acctttctct ttcctaggtc accctccaag 28440agttcttgac
ctcagccccg actggtgtct tgggatgggt gactgggttc tgggggcttc 28500cctgtgcctt
ggaatatggt aaaagagcat ctcaggtatt cactcagtag atgctagtag 28560cactccctcc
ctccatttct gtctacagat gttgctagct ggcccctatg aggtagtctt 28620tgcccctttg
ttattgctgc agactcagaa aaaagaggaa atatagaact cctcgtggtc 28680ttctactcaa
tatccaagca agggggaaca actgagcatc catacactgc tgttttggct 28740tctcaattgc
ttgcttgtac atcaccaaga agctttcatt ggtcagtgta aacaagatct 28800gggagttgat
ggtagagcag ttggatgagt gactctgtct ttcacctttg ttgagtcatt 28860tggtgtgtgc
acattgtggg tccctgcctc gcttcccatt aaatgtcaag gtgaacttta 28920tgaggttgaa
acttttatat gtagtgcaac tgtactcctt cctctctatc tcttccttca 28980tttttcttcc
ttcaccttct cttcctttaa aaaaagaaaa actttaaaaa atgtgaatct 29040gatgtatccc
aggatggcct caaactgttt gctttctcag aagatgacct tgaactttca 29100atcctcctgc
ctccacctcc caaatgctgg gcttacagga attcatcacc atgcctggtt 29160ttcctctctc
ctggtgagtg aatccagggc ttcatgcttg ccaggcaagt gttctgctga 29220ctgagttaca
tgcttagcct gtatccacat cttgactgag taatttctgc accaaaactt 29280taggtttcat
ctcagtgact ctgccaatgt gtttccattt tagagtgacg actggcctta 29340gaggagagtg
taagagaaat agagtctctt tccttggtct gctttttaaa ttttaatttc 29400tttttagaca
tcttatattt attcatgcat gtgtgtgtat aactagcaga actcagctgt 29460ctctttctac
cactcaggtc accaggcttg gtggcaggga ctcttacctg ccttcgagca 29520ggctctgccc
tccttttgga gaaactggtt tgcagaagga agagacagca cagctcagaa 29580gacagccgtg
ctttcagatg cctgagaatc ctgccaagga cactgctgca ttctcctatt 29640cttttgtaag
ggtcccatct ctgctgagct aaactgggct ttctcagccc ttctcctctg 29700acagtatttt
aaaaccctac ctaaaggggg atggagagat ggctcagcaa ttaggagcat 29760atcctactct
tccggagacc cctacttctg ttcccagcac caatgctggt caatttacaa 29820ctgtaactct
gctccaggtc atcggatgct gctatcctcc tcaggcaact tcactcatgt 29880gcacatacac
atacttaaaa acaaaataag tctttaaaaa tcacctaaga aatataaagg 29940cacatatcat
aattcagcct gctgtgacgt atagctatag tcccagaatt ctgaaggcag 30000aggcaagagg
atcacctcaa gcttggggcc agcgtggtct acagtgagac cctggagact 30060ttaatctcaa
aatatgtaac aaaacaaata tgtaaataga catatatcac aatttatatt 30120taagtaaaat
ggggggcatt ggagagatag ctttgtggtt aagagcatgt actgttcttg 30180tcaaggaccc
aagtttgatt cccagtgtct acactggttg gtctccaacc caattccaag 30240agatctgctg
ccttcttctc ctctctactg gaactgcatt catgtgcaaa tgtccatatg 30300cacacacata
cccacatgca tacacacaaa cacatacata ctcattttgc ctgacatcgt 30360ggtaaagtgg
gaagacttgt tgccctatta cttggtcttc atttgcctat gagcaccatg 30420ttggcatgaa
ctcattcatt aatatctttc ctgtacaact ccccaataac caagatgaca 30480cttggcacac
attaattgct aagtataatg aaaatttagt ttaaattagc taaataattt 30540aaagttcccc
ctcaagcctc atgcctgatt taaagtagta cttattaatg ctgggcctgg 30600tggcatacat
ttctaattct aacacttagg aggctgaggc aggaggatgg ccaattcaag 30660gccagcttag
ccagcttagt aagaccttgt ctccaagcaa attacagcaa agtctgagat 30720atagttcagt
aattagggtg tttgtctacc atgtgtgaag acctgagttc agtttctaac 30780aacaaaacaa
aactaaacaa accagaacct agaggttatc atttattttt ttatttttat 30840ttttttttgg
agtttatgcc tttggattat ccattctatg tccagacatc agtactgcca 30900tgttacagtc
aataaaagtc ttccttcatc acccttaatc ttatcaccac taaagtctct 30960acttgacaga
catgccatac ataattatag ctgttacctt ctatcataaa gtagacattt 31020tattttattt
gtgtattcat tttcatttat tttgttgttg ttgttgtttt atgagacaga 31080gtttctctgt
gcagccctgg ttatcctgga actcactctg cagaccaggc tggcttcaaa 31140cacacagaga
tccacctgcc tctgcctcct gagtgctaag attaaaggag tgtgctgcca 31200tcttcccagc
aacattctaa attatttttt gtttatgttt tgaaatggtc taatgtagct 31260gaggtgggcc
tcaagcttgt tatatagctg gggaaccttg aacttgtgtt cttcctacct 31320ctagaactct
ggagtgctgg aattacaggt atgaaccatc acattccagt tttaatcaaa 31380tccagacttc
atgggtacta ggaaagcact ctacaaatta aacttcaccc ctagttcata 31440tatatatatg
tgtgtgtgtg tccatgtatg tatgcctaca tgattttatg tgtgccacat 31500gtgtgcaggt
gctcttggag gtcagagggt gtcaaatccc ctggcacctg agttataggt 31560ggttgtgagc
cacctgatgt ggattctggg aactgaactt tggtcctctg caggagaagt 31620cactgttcct
ctgagtgaac gtttctactt tttaatatac ttcccattcg aattagaaag 31680tagaagctct
cggaggttga gaccttacct aaagtcaccc aactagtaag aaaactaaaa 31740tatcaacttg
gttttctgag ttttaaatat tttttcccaa tgtgtaatta cacaggagaa 31800ttaatgggga
cacttcaagg taaaacagaa gctttagaca tagcaaggca tggtggcaca 31860catcccattg
agaggcagga ggatcaggag gccagctttg gctgcatact taagaggcat 31920ccagggctac
atgaggcgct acctaaaaaa attaaattag gcagggcgtt ggtggcgcac 31980gcctttaatc
ccagcactcg ggaggcagag gcaggcggat ctctgtgagt tcaaggctag 32040cctggtcttc
agagcgagtg ccaggatagg ctccaaagct acacagagaa accctgtctt 32100gaaaaaccaa
aaaagcactg gtcattgtca ttttctttcc taacagggca ctggaaccct 32160gatgttggtt
ggctcctaga tttcttctcc acagcagaga gttcttgccc tgttagagcc 32220agaaggatgc
tctggagagt cagtatatag caaagcaggg tcatctggag tagtaaaaac 32280cctctggcac
agtcagacct catttcctct tgtcctgtgc tcgtggctct agcattatgc 32340aaggagaggc
gcaaacagca aacaatttgg aagggctagc acttgagcaa ctctttgtag 32400cttcctcttc
tctactcttt tgcccctggc ttctactgga acaggtgact ttccattgca 32460ttgcattctc
caaactcaga tgattttgag aatgtggcac tactaaaagt cacatggaca 32520tacaaggtac
aactagaact atcccgggaa acagtgatac acgatctagt ttgaggcctt 32580gagccatagc
ttgtcagaag ctcagaaatg attgagtctc tgggagccct cacctcagca 32640tccctgcttg
caaaaggctt cttgaagtag taaaaactgc tgggaccttg tctaggctgg 32700gtaaccttgc
ataattactc aaccttactg agctcagtcc cctcctctat aaaataagtg 32760caacagtatt
taccttagtg gcccacctga aaacatcaca gctgccatag ctagctcttg 32820gcttttgttc
tatctcctcc tccccctact ttctcttccc tccctccctc cctccctcat 32880ttttctttat
tcctttcttt gtattttttt cttttttctt cctcacacct ctccttattc 32940cccaccctcc
tctctctctc tcccttccca cttctctttc tttcatggca ggatatcatg 33000tatcctagct
atacttgaat tcactatata gctgaagagg agcttccagc ccttttgcct 33060ctgcctccca
agtgctgaga ttataggtgt ccacctccac gtctacttat gctttgctaa 33120ggatcaaacc
agggctttgt atgtgcatgc taggcaagag ccaactacat cgccagacct 33180atataatacc
cctttctcag cgaaactggg gttgctgatg gctggtgttg ggggaaggca 33240ctaaatattt
agcagaagta taggaaaact ctagaagtct agagatcctc aaagtaagtt 33300tggagagcct
tggccttttc ttagttgaaa gtcatggtgc ctactcactt tgactgctca 33360aggaatatcc
attcaccacc tggaaataag aaaggaggga gaaccagcta gggatgtgac 33420ttagtagtag
agcacttgtc tagcatgagc gtggtcctgg gttcaagctc cagtacaaag 33480gctgggtggg
ggggtggaga aaggcttctt tcccatggcg ttctagagat ggcggggaga 33540aaccaccaat
ccacatctat ctacaacagt tcaagtagaa ctaatcttgg tggtatggct 33600atagtagtcc
taatcccatc tcagggatgc ttctctttgc aattgataca aaacacatta 33660cagaaaacca
cagtgaatca aaatgcagag ttgtggtgcc tagttccaat ggatgcatct 33720acagtacaac
tcccatgcct aaggctcagg gatcattgtg gaagacaaag atcctcccag 33780gagatcaggg
agtttgctgt ctcctaggaa tttcagaaaa tacatctgta aaggctcacc 33840aacgtgaatt
cctaaacatg agctgaacaa ggatgacaat agacatgcta acaaggatgg 33900gaaaaagccc
ttgaagcctc agacctacac aaagagccgc agttgattaa ggaatgctga 33960ttgtgggaga
aaccatcttc ccaaattgtt atctaatacc acatagtcag ccctgaaaac 34020acacatgcaa
ataagattat acaaaacaag ggggttgtac atatgtattt aggaatatat 34080atatatatat
atatatatat atatatatat gtaacaataa ttaatagaaa aagagaccat 34140gaatttgaaa
aagaacaagg aggggtacat ggaagggttt aggatgcttt gaccctttaa 34200tatagtttct
tgtgttgtgg tgaccccaat cataaaatta tttttgttgc tagttcacaa 34260ctgtaatttt
gctgctgtta tgaattgtaa agtaaatacc tatggttttt gatgatctta 34320ggcaatccct
gttaaactgt cattcagtcc ccaaaggggt caagacccac aggttgagaa 34380ctgctgattt
agagagagga aagggaaggg ggggtgaaat gctgtaatta taattccaaa 34440aaaaaatttt
taaaaatttc ttaaaggaac tgaagaaaag agctgaacat tctaagctta 34500aggggggaaa
ggttctggaa tgttacattt ttctggtttc cttagtctca gcaacaggct 34560cccagccttc
tgtttggaca gtggtttaca ggcatgtgag ctcagggaac actcttccaa 34620gtgaatcaga
cttcaggaga agacattcag ttcagggccc tggggaaagt aaggacagaa 34680ctccattcct
gagaattacc aggtttgctc agaagataaa actggtgagc ccaatggctg 34740tgtgcacaac
cctgacctca gtgtctagga tagctggact ctagctgcta gaagatagtc 34800agagggccat
cctttccctg aggctaatct gtgaatcaag taaactacag tcaggaaggg 34860agctggagat
gggggcccag caaacaggtc ccccttaaag cccagcacat aggtggggaa 34920cccaacctcc
cattttgtct tcaccccacc accaggcctt taccaaggcc cgaggttgcc 34980actattttca
gcttgccagg ctctttgcag ttttaggggg atgaggagga gatgctctga 35040ggtgctggga
ggcacatggc gggtgctatt tatggcttgg gctgaactcc gatgtcctag 35100aaagagtgtt
tctgacactt tctgccttct gggaatcagg agactcatga caaacactgc 35160ctggcagtgt
ttctttcttg ttcacagcaa gaagtgtgca gtccatggca cgaaagaggc 35220ctgagcaggg
caagatggac acgatgacat cactgaagga gcttcccagg ggctgtcttg 35280actgcttcat
taactcattc atgcagttta ttcagcagct atgcctgtca gaccccattc 35340tgtctgcaca
agacacatgg caacaaagga gacttactat tcccatcttc atgggtttta 35400tgttctggca
agaggaagat agtaataatt tttaaaaagt aaccagtctt gagagcatga 35460taaatatggt
tgataacaat atgctatatt ttaaaagttg tgagatagta tactttaagt 35520gttctcaaaa
caaaatgatg aatatgggtg atataacatg ttaattggtt taatttagcc 35580atgcctttgt
gaacatactg tatcgtgtat cataattgtg catgacttta tttatgagct 35640aaataaatga
atggaaaaaa aagtaaccag tcttgatgct tacctgccat cctggaagga 35700aatggaaata
ggatctgccg ccgcagcatt gccctatgct cttatttctt ctcttgaaga 35760ggtaggggtg
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgttactaga gactgagcta 35820ccggcctcac
acattctagg caaatgctct actttatatt aaacacttta taaaacatta 35880agcctttcag
ggtcagcaag gtagctcaga gagtccgggc atttgctacc aagcctgaca 35940acctgagttc
gattgatgat cccccagact cacgtgatag gaggaagctg acacctgtgg 36000gttgtcctct
gactataggc atgcacacac atcccatgaa taaatattta tacattttca 36060aatcatactt
attttacaat gatttttatt tgtttgcctg tctttctgtc tgtgtagaga 36120caaggtttca
tgcagctcag gttggcctca aactcactct gtggcaagga tgccttaact 36180tcaggtcttc
caggtccagg taacaaaatg ttcaggagga acctggtacc tcatcataac 36240cggttctaga
tggtcttccc agggctgctg taagaaagtg ctacacgacg agttatttca 36300aacattctca
cagttctggg gattagaagt ttgaaactaa ggtgctgaag agattagttc 36360cttctggaag
ctcagaagag ccatctggtc catacttttc tccaggtttc tcttagtttt 36420tggcaatcct
tggaatccct tggtttgtag atgcagcttc caaagctcaa gatctctctc 36480caatgctgtg
tggcatttcc ccgtgtttat gagtgtctaa atggctttta aaaacatttt 36540tgagatgtga
aattctggct gacccagaat atataaacca ggctgacctt tgtctcccag 36600agatctccct
gcctctgctt cccaaacctt ttgattaaag gtgtgtgtca agtgcccaga 36660ccaaatgccc
ttcttgtaag gacaacggtc atattggatt tagtgtctaa gtgagtcccc 36720tatgaactca
tctcgaactc agtttgcata gaacactgta ccatgcaaaa taaatgacac 36780agagactgat
attggggttc acacttcaag ctgaaggtca gaaaagcaaa gcattgggcc 36840actagctctt
accactacct caggctgaac gggctgatcc tgctgcctct cctcagcatg 36900gctggagaat
atcttcatat cctcattgtg gctggaaaat gaatgcctga tatggagaac 36960ttgctcctgt
tttatataac tccctaatgc tgggattaaa gatgtgtgat cccaggtgct 37020gagatcatct
ttgtgtgagc tgtttctctt taggactgga tcaattttgt gtagatctgg 37080atggctttgg
gctcactgag atctatctac ctcttaatcc ctggtcctag gattaaaggt 37140atgtaccacc
acatcctagc ttctggctgc tgggattaaa ggtgtatgcc tggcttcgat 37200ggcttgtggc
tgactttgct ttctgaatcc gcaggcaagc ttaaaaaaat cataaataat 37260atatcaccat
agaccacact tccaaatagg cttccattta gaggcgccag tgggtgataa 37320tgtaggcggt
tttactcagt tttgtgcaga tggctggcgt cctgtctggt gagttcagat 37380tttttttttt
ttttttttta agttcagaat cttacccagc tcagcttttc aggctgcatt 37440cagtgtccgg
cttttttctc accgtcttga cttcctgtcc tgcatcccat ttctcagcct 37500ggaccctgcc
agtctatcag atagataaca taaacaaaat tgtactggat taatgggagc 37560tgtttggaca
tttcctactt ttgccttttc accaatgatt tgcatactta agcctgcaac 37620tacagccccg
atgcagtaag ctcagtctct ggcaagcaaa ggtctctctg gggtcttgtt 37680taagaaccag
ctcaggctgc tggctctgtt ggcagtggag gtatttccta taatgggatg 37740atgggatggg
ttattcacac acatctcagt tactgggcta catggatcca aatcagccac 37800ccaagggttt
gcagtcacat gtgagtcact tagcacagag aaagaagcct ggaggaggag 37860gggtcctccc
agcttcagga gggttttcca ggatataggc ttctagtctc gttttggatc 37920aatttatcag
ttttggattg ggtctaataa ctctttcctg agcctggact gggctcaaag 37980gcatgagtat
gtgaggggaa tttactagaa ttcacctgta gtttctgtat cattcctaga 38040gaaggggaag
tagagacact ggtgatggga aataaaaaca aaacaaaacc taaatattgg 38100gagcacagag
gtccttgttc cacagctctt gatagaagtc aggaatgtta tgtatgtaca 38160attgcccttg
aaaaggaaag gatgtatgac ctgtttttct gtcccgaagg ctgggaactg 38220gggatgatta
acagcctgtt gatctgcatt atctgaaggg ctaggccata tcaagctccc 38280acagctagca
ctgaaggaga atagggcctt acaaagggaa ttccctcttt ggatcgaacc 38340taggaacatc
ttctgtttta ccgctctctc cttgtttcat ctgcaaaggg aggagcttgg 38400tagtgatgtt
gaggcaggca ccacttgtat ttttctaagc cacagagact gtttccctac 38460cttacaaaca
tccctgtgca tcactgcagc tctgtctctt atggcagtgt ctcagttagg 38520gcttctattg
ctgcgactaa acaccatgac caaaaaagct cacacttcca tactcctgtt 38580cattattgaa
gaatgtcagg actggagcgc aaacagggca gggtcctgga ggcaggagct 38640gatgcagagg
tcatggagga aggctgctta ctggcttgct ctccatggct tgctcagcct 38700gctttcttat
agaacccagg accacctgcc cagggatgac accacctaca atgggctggg 38760cgctaatatg
agggatcaaa gagatggagt tgtgggaggg acagaggggg agagcaatga 38820aagagataat
cttgatagag ggagccgtta tggggttagg gagaaacctg gtgctagaga 38880aattcccagg
aatccacaag gaagacccca gctaagactc ctagcaataa tgaagaggat 38940gtctgaacgg
gtcttcccct ttaatcagat tagtgactac cctaattgtc atcacagaac 39000ctacatccag
taactgatgg aagcagatgc agtgatccac agccaagcac tgggctgagc 39060ttcgggagtt
cagttgaaga gagaagggat catgtgagca agggggtggg ggaagtcaag 39120atcatgatgg
ggaaaaccac agagacagct gacccgagct agtgggagct catggactat 39180gaaacgccag
acgttgtaga ctccctaagg aaggccttac cccctctgaa gagtggatgg 39240ggggtgggaa
gtggggacgc tgggggacag gagaaaggga gggaggggga actgggttgg 39300tttgtaaaat
gaaaaaatag atttttttta aataaaaaaa gaaagtgctt tacatctgga 39360tttcatggag
gcattttctt aactgaagct ccttcctctc tggcgactct agtttgtgtc 39420aagttaacac
agaaccagcc agtacaggca gcagaaatac cttgcagaaa tatcttagtt 39480caggagtcca
cggtggtctc agtcacttcc tcatgtgcca cctgagttta acattcccca 39540aaacttggaa
cacaggccac cacatcatgg agccctggct taaagctcaa gttttatggt 39600attttctttt
atcactgtct ataattccta aacatgctac aatgttgtga gccctcaccg 39660tctcctaggt
ccatagtgac ttcctggcat taatagactg tgccccaaga gctctatggc 39720cacgaccacc
acctgccatt cccctccccc tccatggtcc cagcctcact tcttcacttc 39780ctggtccttc
cgagcccaat gtgcaaaccc acagaatctg tctgcttatg taagtttcct 39840ggtcactgag
tggggtgact cagcaccaag gtggtgccct gcgatttccc agccccaggc 39900agcagaacaa
ctgaaatgga aaacaagtcc cgttaatagg gtccagctga gagcctccct 39960ttctcaggga
gtctggcaaa tctactcctc ggggaactgc cctgggcagt ggaattctcc 40020agctccctgc
tcatttccta gttcctcttc cctcttctca cctttggctg aggatcagaa 40080aggttcccac
tgaggtctgc tttgccctgg gcctgctctt ttcagagtcc catttttgga 40140atgaattttt
tttgtctcct actttcaagt tcacatattg aagccattat tgccaaggtg 40200atggtatcag
aaggagggac ctttgggaga tgaatggatg gattccaaga ggttatgtgg 40260gcagagcacc
catgatgggg ttggtgcctt cataggaaga agacacagta gaagggaaag 40320agatgccgac
tgaaaaacag gaagtctcct ggagtaggcc actcagccta tgacacgcca 40380gcactcagat
ctcggacttc ccatctccca aatggtgata aacaaatgct gttgtccagg 40440ctgcacagtc
tacggcattt tgttgcaagg gcctggacca accaggctca ggcaggaagt 40500gaatctagtg
tgggaggatg tacagactgc cactcagtct ggacacaaac tgtcctcagg 40560gatcacctga
gccacatcta cctaagaatg gctattcttt ccatttgtta acatcaaatg 40620ccaagcccct
actgtatgta ggctcttgct agcagtggat atgatgctat gtgagatggg 40680agcaatcctc
tctgcacaga actatacata gaactatgca tagaagacca acagggagac 40740atcagataac
tattaactgt gatagctctg tgggagacaa acagaatgag ggaatggaca 40800atgactttga
ggaaaaacta tgattgaaaa tactctatct ggctgggcgg tggtggcgca 40860tgcctttaat
cccagcactt gggaggcaga ggcaggtaga tctctgtgag ttcgagacca 40920gcctggtcta
taagagctag ttccaggaca gcctccaaag ccacagagaa accctgtctc 40980aaaaaaaaca
aaacaaacac acaaaaaaga aaatattctg tgaggtaaac aagcatctgg 41040aagggttggg
agataatgca ggcaaaaatg cattagacag cacacagtac aacacagcaa 41100tcaaacttaa
tataaacaca gcaaatgtca tctttgggct ttgccccatt tcctgatctg 41160accataacag
cctagtgtct ggaaagcaca ctaaagccat ttacgtcaca caggagttca 41220atgttgagtt
cagagggagg gggtggaggg cagattagcg aggtacaagt tctggtccct 41280ttgatgaagt
gttgatgtac ccatcgacac cacacaaata taccatcatg ctccatgtta 41340gggtcagtga
aggattgcat atgtgacggt ggcccactgg gctgagaaag ccctattgct 41400tagtgacatc
tgtgataatg acatgcgagc cctattgctt agtgacatca ctcttctcat 41460agtgtgggat
ccaatgtgtt tcttgtacac ttgtgataat gacatgcaaa caagtctatt 41520gtgcggccag
tcacacaaaa aatatattat gtgcagtcag gaacagtcca tagtacttga 41580ttgggacagc
acaagtctgt gttgctggtt cacacattaa tcattaccac tgttttagtg 41640tgctcctata
tatatatatt taaaaattac tataaaatga tacaccgtgc tgagcaatag 41700cacctcttat
accttgtgtt tactggatgt actcaagcta ttttctcttg tgcttgattt 41760atttgtattt
gtatttttga gagaacctca tctagtccat gctggcttca aacttgttat 41820aaagctgagg
atggcttcga actcctgatc ccccagcctc tgcctcccaa atgatgagat 41880tacaggcata
tgctaccaaa catgactttt atttattttt attacttagg tggtatgggt 41940ggtttgaatg
agactgtccc ctttggctta tatatttgta ggtggacctt tggaaaggtt 42000taacaggtat
gaccatagtg gaggcagtgt gtcagtaggg gaggtctttg gggaacccaa 42060tactcaatca
attccaagtt agggctgtct gtctgtctgt cccctgattg tgtcacaagg 42120cagaaactct
cagctactgc tctagttcta tgcctaccca cctgttgcca tggtccctgc 42180catgatggtc
atgtacttca accctttgga taggtggccc ccaaattaaa tggtttcttt 42240tataagttgc
cttggtcatg gtgttttgtc atggcgataa gaaagtgact gagacaggtt 42300tgttgctgtt
gttacaaggt ttagtccagg catctggcac cacctctggc ctgtgcttga 42360ttcaatcatg
ttacctttag aaatagcagg ctaaaggaca tatacctgtg tacgtatatg 42420tgtacgtata
tattagctgt atagtctaag tgtgcacctg actctaatat ctaggtttgt 42480gtaagtagac
tccaccaagc tcactaagca atggtatcac agttttcaga tagtgttcag 42540cgatgcttgg
ctgagtgtta gttctttttt taatatttta tttatttatt atgtatacaa 42600cattctgctt
ccatgtatct ctgcacacca gaagaggaca ccaaatctca taacggatgg 42660ttttgagcca
ccatgtggtt gctgggaatt gaactcagga cctctggaag agcagtcggt 42720gctcttaacc
tctgagccat ctctccagcc cctgagtgtt tttaaatcaa ggaaaaaagc 42780ctgagggaag
ggagctcagg ctgaagggga ggagtcaaga cagtctgacc ccaaggcatt 42840gtgggacgta
aagagttctg ggacaagact gaggtctctt ccttctcaga gactgtgggc 42900ttcagtttcc
ttggtagccg gaagcaaagc taatccatgg cttaaaatat aatactcagt 42960gtaaccttgt
gttgtagaag tgacttgctt gtcttcttcc ataattctaa aacatcttta 43020agagcaggat
ccaggaaggg aaaaggagag attctcatct tcttcaaaag gcagctttcc 43080ctaaagcatt
ttctgatgaa atttaagttc taaaaccagc agtggtataa tcccatcatg 43140aatggggatc
tctgagttta aggccagcct ggtctacaga gcaagttcca ggacagccac 43200ggttacacaa
agaaatcctg tcttaaaaca aaacaaaacc caaaacaaac ataaacaaaa 43260actatccaaa
accaaccaac ccccccaact cagaaagaaa gaaagaaaga aatcaagaaa 43320gaactgccca
ccgggtgttg gtggtgcaag cctttaatcc cagcactcgg gaggcagagg 43380caggcagatc
tctgtgagtt tgaggccaac ctgttctcca gaaagagtgc caggataggc 43440tccaaagcta
cacagagaaa ccctgtcttg aaaaaagaaa agaaagaact acccatgacc 43500aaacagttcc
atggccaggt agagaatgag gacgctgaaa gtcacacctt ctcagagtct 43560caaactgcac
atctggcctc aaagtccaga aatgagtgca agaccattaa tgacagtctt 43620tggaaacaaa
ccagaccaaa gaacatttgg ctcctgatac atattctgag ggtcacatag 43680aaagaaagat
ctgcctttgg ccacctcctt ttgaagtggg gaattttatt ttcttctgca 43740tggaaacttc
atgtaggtat ttgagaatac atacagacat gcaggtgcac atgcacggac 43800atgaacacac
acatacaccc cgggtaggca ggcaagaaag tgtgtggaat aacacttgaa 43860cttcccttcc
agaacagaag ccctctgaag tgtgacattc atgctggctg catggggtct 43920gatcagtact
agtgagtgga ggtggagggg taggaaacat ggggatgata ataggttgtc 43980aggaaagtgg
tgccccaggt agcacagagt agaaatttgt cccccaaaat ccttttgaac 44040ccagttgatt
tgaatgccgt gcccctgcca cccaggcttc agagctaagt gacttatgtc 44100ttcaggtcag
tgatgattac cacggttgca gtgctaacac agatgcttta tctaccagga 44160cagaaacaag
aaagatgctc cttcccaggc cccttagcac tctctgggtg gggaggattg 44220ccccaccttc
caaaaataga atactgtttt ggtaaacagc cactttgagc ccatgaggat 44280atcttcatta
gctatggaga caggttttag taagaaagca agatgagagg ctaaaaaacc 44340cttggggagc
aggaactggg aagactgtgg taccttgttc ccagatccac cagaaacctt 44400gccaccagac
gatgtgtcca ggccccacat atttcacaaa aagttggatc tgataacaat 44460gaggatggaa
tcccggtctt aaggtgggtt tggggtggga agaggcggga taatgggtga 44520gagggtcggt
ggggacaggt gagatggggt atggtgggga gaggtggaat ggggtggggt 44580gggttgagat
ggagtatggt acagcgggga gggatagaat tgtcttttcc ctgtaccaca 44640gagaagtttg
actgctaccc ttggcaatta atcaattata gaaaatgcaa ctttgctttt 44700aaaatgtgtc
tatttccaaa ggcttcttcc cctcccctac ctagggagaa ggaaagaatg 44760gataatgcta
ctgtagagga gggtagcatc actatagagg cctcagtatc tgccccaggg 44820agctgggaga
gagttctatc acacaaacac agcccgagtc acatactcaa caaaccccac 44880aaaacaaaac
aacaataatg aagatacaaa atctcattat gtagcccagg ctagtcctag 44940atttctgttt
tctttttttg tttttcgaga cagggtttct ctgtgtagct ttggagccta 45000tcctggcact
tgctctgaag cccaggctgc cctcactcac agagatccgc ctgcctctgt 45060ctccagagtg
ctgggattaa aggcgtgcac cactaatgcc tggctagtcc tagatttttt 45120tatcctcctg
cctcaggctc ccaactgttg ggtttacttt tgggagtcca ttttcttcca 45180gcatggattc
tttgaattga aattcagatt atcaggtttc tgtagcaatc ccaccagccc 45240atttttttgt
ctgacactgc ttgttttgag acacagtctc ccactgctgt agcccaggct 45300gccctagatt
ttctatgtag cccaggctgg ccttgaactc ccaggagtcc tctggcctct 45360cccttttgat
tactggaact agaagaagtc actatgcttg acttggaact aatattagaa 45420caaaatatat
ttttcattga gattcaactt tgaaatcctg atgctcctgc ctcactcagg 45480tcatcagggt
tggcagcaag agcctttatc cactgagtca tattgggccc tgacctgctt 45540ttaaattttg
cctttagggc tggagatgta gctcggctgg ttcagtgctt gcctggtacc 45600cacgaagccc
tgggtttgat ctacaacaca gtataagcca ggcctgatgg cgtatacatg 45660taatcctaac
acttggggag caagagggag gccaaagcca tcctctgcta cttggtgagc 45720ttgaggccag
cctgggatcc ttgagaccct gtttcaaaac aataacaaca aacacagact 45780actaaaaaaa
attaataagg gccagactgg gtggtgtatt cctttaatcc aagcaatgag 45840gaggcagagg
caggcaaatg tctgtgagtc tggggacagc ctggtctact gagcagcagg 45900ccaactaagg
ctacatagtg agactatctc aaaaaaagca aaataacaat aaacagacca 45960gttccccatc
tcctattttg cctttacctc ctattccctg ctcagcaggt tattttttgt 46020tcctgcatct
tggttcactg atctgtaaac ttgtctgaat aagtaggtac agggttgttt 46080taaaattaga
taatatattc aatgagaagg gctaccaagt gctcaaccaa tgtatgcata 46140tgtatgtatg
tatgtatgta tgtatgtatt tatttttgtt ttgtttttca agataaggtt 46200tctctgtgta
gttttagagt ctgtcctaga acttgctctg aagagcaggc tggtcttgaa 46260ctcacaaaga
tccacctgtc tctgcctccc aagtgctggg attaaaggca tgtgacacca 46320cccccaaagc
caatgttctt ataggcatct ttgatttttt ttctctttct ttgagtggag 46380tctgactaag
tagcccatac tagctctgca tttacaatct gaacacatgg ataagagtgg 46440tgaaaattat
caagatcatg ttatgctatg cctcctgagt caccatgccc tgcttcagac 46500ttctttgtat
taaagaactg tgtaaaaaaa aaaaaaaaga catttgaagg cacataatca 46560gaggaatttg
tcagtgattt ttcacatact gtcttatttg tggccaaggt aagcctagag 46620agtatttctt
aaaattaaaa atagtgggca gattttggag gcgatctgat atgaaaatcc 46680cttcccaccc
caggtagtca tgggctgact atcaaggata cattctgaga catatatcct 46740caagcagttt
ctgccttacg caaatatcat aggtcatagc acactgagac tatgtggcag 46800tctatgtgtc
tatatacaca tggtgtggcc tattgttccc atggtcacaa agaacaaaac 46860aactttttca
caaggcttta cccctagagg aagagctaca ggcaatcaat ggttgctgag 46920aggagtatca
gtcttctcca gggacttagc caatcccaag aggtcagcca cgcataggaa 46980cgcttagcca
cgcttgtata gaacatctca aacaacaacc acctcagtgt aaagcaagca 47040cacaaggaac
tgatgcaact aagagacaaa gggcccggtg tgtgtggccc gtagctgtca 47100tcccagcact
tgagactaag gaaggaaggt tgagaatttg aggccagcat ggactccaca 47160gaaagaccgt
tttctttctc agaaaaaaga agcaaaaacc aagaacaagg tgtatgggaa 47220tgctactgtc
ttggcatatt gtttatagaa aactttttta tatataaaag gaatgcacta 47280caaaaattat
aaactactgt aatattaact gcatagatct ataacatggt catttattat 47340tgagtatgat
tatctatcta cccacgctgc aggtttagac agttgcacta cagtagatct 47400gtttgcagta
gcatcattat tagacatttt ggacaaagcc aagtggtaat ggcacatgcc 47460tttaatccca
gcacttggga agcagaggta ggcggatctc tgtgagtcag agaccagcct 47520ggtctacaaa
gaactagttc caggagagtc tccaaggcca cagagaaacc ctgtctcgaa 47580aaaccaaaag
aaaaaaagaa aacaaaaaac taaaaaataa ataaatttgg ggcaatatct 47640tgtcctatga
tgttactggg taatgggatt tcctcctctt gtattatttt ttctttgggg 47700gttttactta
ttatttactt gagacagagt ctcatttatg acaggctggc ctcaaacagg 47760aaatgaagcc
aaggaagacc ttgaagacct aatccttctg tttcttcctc ctatatggtg 47820agttaaaggc
atacagtacc atgcccagtc tattcactgc ccagggcttc atgcatgcta 47880gcaaagcacc
aactgagctg catccccacc cctcctcctg gcttccatct ccttatgtag 47940ctagaaatga
gcctgtctgt ctcaaatact gggattatgg gtgtgtgcca ccacacctgg 48000cttcctatta
tagccttgtg ggatcactgt tgtttactga agcattgtga cacactgcag 48060attgctggaa
cagcgtctgc catcatcatg acacaacttc agagaaagag agagttccca 48120accagccaca
cacttaactc aatgcctgta gcccttattc tgttaagacg atttcctgcc 48180atcttactca
aagaccctct ttaactcggt aggaacatct gttacactga aagtcctgcc 48240tgttgctcca
ctgacctcct tcacaaatta ttatattttg gagccaattc tgaacccagg 48300ttttctgagt
gacacatttt agtatttttt ttttctttct attttctttc atggaaagtc 48360tcttgttact
gttcacatga ccaaggatca ctgcatcatc ttccaaggcc aattttggat 48420gtttcagcaa
gggagactga agatcctgag tctcagtgtt gatctccttt agaatgtcct 48480ctggagaagg
tagtgacaac actgcaagga taataggtga ataaagggaa gccagagtgt 48540cctctgggat
gtgcggcact tacatgaagg attcatttat aaattttaag ttatggagta 48600taataataag
actaaatatg tagtgtcgta attttataac tatacatatg tatatagtaa 48660atataaattt
atatgtaatg tatttatagt aagtgtacat agaattgaac atatgttaca 48720taaatggcag
aaaggaatga ttctcaattg ctttttttct aattataatt tctattgctc 48780tttgtggatt
tcacaccatg cattctgatc ccacttatct ccttgtctcc ttgcatttgc 48840cctctgccct
tgcaacctca cccccaaatc aaagccaaat ttaaaaaaaa aaccaaaatc 48900caaacaaaac
agagacaaaa caaaaataaa agcaacaaca aaaaaaggag aatcttgtca 48960tggtagctgt
agtgtggcct gttgaatcac acagtatacc ctttagtcca ttcatctttt 49020cttccaagtg
ttcattgata caagtcacgg tctggctcga ggattctggt ttctgctata 49080ttactaataa
tgggctctca ctggggctcc ccttggatat cctattgtcc tgtgttatgg 49140agagcctgct
gttttggata tgtaggtttg tccccttcac atgctataac aattcataaa 49200ttcagtgaat
gttggggtgg gccaactcat agccctggtt ctgggcttgg gtggtattat 49260taaacccact
gatggagaat aagaccacta ccataattta aaagccaaat tgaagcaagt 49320tttaattcaa
tactgcccag gtggacaggc tctggctagg tccatctctg agtttccagg 49380aggtggccct
gactcacggt ttacagtggc ttgagtattt tccataaggt ccaatcaggg 49440gcaagcatac
atcctgatgt acctccagtc tatatccaat cgggggcaag tgtacatctt 49500gatgtatttc
ctgcctgtga acctactgcc cacatgtgat caagcacatc cggtgcagtt 49560gggtcaaaca
gacttgttta gggcaatgaa aaacacatgg ctttttatct cccataaaca 49620atagcctcca
gcggttcagg gactatttgt ccttgggcaa ggaatttaca gatcctatag 49680gtgagtcagg
gtcagcatcc tgctctcatg ccctcagggc tggctcactt gttacctccc 49740cgaccctctc
tcaacagggt cagctctgag gtgctgccca ggtggggtgc agggcctact 49800cttccgcatg
ttgcagctgg tcagggttag ttctctcata tgccacaggt ggcaatgggt 49860gaagggggag
ggcatgtttc cctcatcaac gccattacat ggggggatgg ggtcagctct 49920catgccctta
gggttggctc acctgcatcc ttgaccatag ggtcagctct agtatgctgc 49980tcaagtgagg
cgcacaccta
50000434DNAEnterobacteria phage P1 4ataacttcgt atagcataca ttatacgaag ttat
34534DNASaccharomyces cerevisiae
5gaagttccta ttctctagaa agtataggaa cttc
34623DNAEscherichia coli 6cctgcttttt tatactaact tga
23722DNACricetulus griseus 7taaggcctca tatgaaaata
ta 22822DNACricetulus
griseus 8atagatgtct tgcatactct ag
229364PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 9Met Ala Pro Lys Lys Lys Arg Lys Val His Met
Asn Thr Lys Tyr Asn1 5 10
15Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser
20 25 30Ile Lys Ala Gln Ile Phe Pro
Asn Gln Cys Tyr Lys Phe Lys His Gln 35 40
45Leu Arg Leu Arg Phe Gln Val Thr Gln Lys Thr Gln Arg Arg Trp
Phe 50 55 60Leu Asp Lys Leu Val Asp
Glu Ile Gly Val Gly Tyr Val Thr Asp Arg65 70
75 80Gly Ser Val Ser Asp Tyr Met Leu Ser Gln Ile
Lys Pro Leu His Asn 85 90
95Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln Ala
100 105 110Asn Leu Val Leu Lys Ile
Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser 115 120
125Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile
Ala Ala 130 135 140Leu Asn Asp Ser Lys
Thr Arg Lys Thr Thr Ser Glu Thr Val Arg Ala145 150
155 160Val Leu Asp Ser Leu Pro Gly Ser Val Gly
Gly Leu Ser Pro Ser Gln 165 170
175Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser Ser Pro Gly Ser Gly Ile
180 185 190Ser Glu Ala Leu Arg
Ala Gly Ala Gly Ser Gly Thr Gly Tyr Asn Lys 195
200 205Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly
Asp Gly Ser Ile 210 215 220Ile Ala Gln
Ile Lys Pro Gly Gln Ser Tyr Lys Phe Lys His Thr Leu225
230 235 240Gln Leu Val Phe Gln Val Thr
Gln Lys Thr Gln Arg Arg Trp Phe Leu 245
250 255Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val
Ile Asp Arg Gly 260 265 270Ser
Ala Ser Asp Tyr Arg Leu Ser Glu Ile Lys Pro Leu His Asn Phe 275
280 285Leu Thr Gln Leu Gln Pro Phe Leu Lys
Leu Lys Gln Lys Gln Ala Asn 290 295
300Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser Pro305
310 315 320Asp Lys Phe Leu
Glu Val Cys Thr Trp Val Asp Gln Ile Ala Ala Leu 325
330 335Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser
Glu Thr Val Arg Ala Val 340 345
350Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro 355
36010364PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 10Met Ala Pro Lys Lys Lys Arg Lys Val His Met
Asn Thr Lys Tyr Asn1 5 10
15Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser
20 25 30Ile Ile Ala Gln Ile Pro Pro
Asn Gln Ser Cys Lys Phe Lys His Gln 35 40
45Leu Arg Leu Thr Phe Gln Val Thr Gln Lys Thr Gln Arg Arg Trp
Phe 50 55 60Leu Asp Lys Leu Val Asp
Glu Ile Gly Val Gly Tyr Val Arg Asp Arg65 70
75 80Gly Ser Val Ser Asp Tyr Ile Leu Ser Glu Ile
Lys Pro Leu His Asn 85 90
95Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln Ala
100 105 110Asn Leu Val Leu Lys Ile
Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser 115 120
125Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile
Ala Ala 130 135 140Leu Asn Asp Ser Lys
Thr Arg Lys Thr Thr Ser Glu Thr Val Arg Ala145 150
155 160Val Leu Asp Ser Leu Pro Gly Ser Val Gly
Gly Leu Ser Pro Ser Gln 165 170
175Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser Ser Pro Gly Ser Gly Ile
180 185 190Ser Glu Ala Leu Arg
Ala Gly Ala Gly Ser Gly Thr Gly Tyr Asn Lys 195
200 205Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly
Asp Gly Ser Ile 210 215 220Tyr Ala Gly
Ile Ala Pro Asn Gln Ser Cys Lys Phe Lys His Gln Leu225
230 235 240Arg Leu Trp Phe Val Val Ser
Gln Lys Thr Gln Arg Arg Trp Phe Leu 245
250 255Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val
Ile Asp Asn Gly 260 265 270Ser
Val Ser His Tyr Arg Leu Ser Glu Ile Lys Pro Leu His Asn Phe 275
280 285Leu Thr Gln Leu Gln Pro Phe Leu Lys
Leu Lys Gln Lys Gln Ala Asn 290 295
300Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser Pro305
310 315 320Asp Lys Phe Leu
Glu Val Cys Thr Trp Val Asp Gln Ile Ala Ala Leu 325
330 335Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser
Glu Thr Val Arg Ala Val 340 345
350Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro 355
360113663DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 11tcgcgcgttt cggtgatgac ggtgaaaacc
tctgacacat gcagctcccg gagacggtca 60cagcttgtct gtaagcggat gccgggagca
gacaagcccg tcagggcgcg tcagcgggtg 120ttggcgggtg tcggggctgg cttaactatg
cggcatcaga gcagattgta ctgagagtgc 180accatatgcg gtgtgaaata ccgcacagat
gcgtaaggag aaaataccgc atcaggcgcc 240attcgccatt caggctgcgc aactgttggg
aagggcgatc ggtgcgggcc tcttcgctat 300tacgccagct ggcgaaaggg ggatgtgctg
caaggcgatt aagttgggta acgccagggt 360tttcccagtc acgacgttgt aaaacgacgg
ccagtgaatt cgagctcggt acccagaaac 420ctttcaacca gcttttgagc taatgataga
gagaagctca aggaattgga gcaatgcttg 480actagggatg tcagagggag gctatccaga
ggagcttaca actgaggtaa acttaaaagt 540tagggagttt gtcaacttca acccacagaa
tagagcagag ccaggaggag ctgaggcttc 600tgagtgttat ggtggaagca tcaccccaac
ccttgacatc catatgcctg aagagtctgg 660aatgttatgg tggaagttcc acccaagcct
cccttcccgg tcgccctcca aaccctgcta 720catctcagaa atcccaccaa atgatgactc
cctcccccag agatattcaa gaccactccc 780acagggtatt taaactgccc cccaaccccc
agaaaataga tgtgtggttt tccaatctct 840ctttcctatc acgtctctgg ggagctggca
ggccatttgg gagcattgta tccattaaac 900gacttctcag tggagactct gaaagccaga
agagcctaga cagatagatg tcttgcgaat 960tcttgcatac tctagagact acagatgccg
gcccagacta ttatatccag caaaagtttc 1020aaacaccata caaagtcaaa tttaaacagt
atctatctac aaatccaata ttacagaagg 1080tgctagtagg aaaactccaa actaagatta
actatacctg tgaagacaca ggaaataatc 1140tcacactggc aaaagaagaa aaacctctct
ctctctctcc tctctctctc tctctctctc 1200tctctctctc tctctctctc tctctctctc
tcacacacac acacacacac acacacacac 1260accaacacca ataccatgaa caacaaaata
acaggaatta acaataattg atgtgtgtgt 1320atgtccctgt gtgtgtgtcc ttgtgtgtgt
ctgtttgtgt gtctgtgtat atgtttgtca 1380cctgaggggt ggctcttcct tggtttgtga
ggtttctacc caaaagcttg gcgtaatcat 1440ggtcatagct gtttcctgtg tgaaattgtt
atccgctcac aattccacac aacatacgag 1500ccggaagcat aaagtgtaaa gcctggggtg
cctaatgagt gagctaactc acattaattg 1560cgttgcgctc actgcccgct ttccagtcgg
gaaacctgtc gtgccagctg cattaatgaa 1620tcggccaacg cgcggggaga ggcggtttgc
gtattgggcg ctcttccgct tcctcgctca 1680ctgactcgct gcgctcggtc gttcggctgc
ggcgagcggt atcagctcac tcaaaggcgg 1740taatacggtt atccacagaa tcaggggata
acgcaggaaa gaacatgtga gcaaaaggcc 1800agcaaaaggc caggaaccgt aaaaaggccg
cgttgctggc gtttttccat aggctccgcc 1860cccctgacga gcatcacaaa aatcgacgct
caagtcagag gtggcgaaac ccgacaggac 1920tataaagata ccaggcgttt ccccctggaa
gctccctcgt gcgctctcct gttccgaccc 1980tgccgcttac cggatacctg tccgcctttc
tcccttcggg aagcgtggcg ctttctcata 2040gctcacgctg taggtatctc agttcggtgt
aggtcgttcg ctccaagctg ggctgtgtgc 2100acgaaccccc cgttcagccc gaccgctgcg
ccttatccgg taactatcgt cttgagtcca 2160acccggtaag acacgactta tcgccactgg
cagcagccac tggtaacagg attagcagag 2220cgaggtatgt aggcggtgct acagagttct
tgaagtggtg gcctaactac ggctacacta 2280gaagaacagt atttggtatc tgcgctctgc
tgaagccagt taccttcgga aaaagagttg 2340gtagctcttg atccggcaaa caaaccaccg
ctggtagcgg tggttttttt gtttgcaagc 2400agcagattac gcgcagaaaa aaaggatctc
aagaagatcc tttgatcttt tctacggggt 2460ctgacgctca gtggaacgaa aactcacgtt
aagggatttt ggtcatgaga ttatcaaaaa 2520ggatcttcac ctagatcctt ttaaattaaa
aatgaagttt taaatcaatc taaagtatat 2580atgagtaaac ttggtctgac agttaccaat
gcttaatcag tgaggcacct atctcagcga 2640tctgtctatt tcgttcatcc atagttgcct
gactccccgt cgtgtagata actacgatac 2700gggagggctt accatctggc cccagtgctg
caatgatacc gcgagaccca cgctcaccgg 2760ctccagattt atcagcaata aaccagccag
ccggaagggc cgagcgcaga agtggtcctg 2820caactttatc cgcctccatc cagtctatta
attgttgccg ggaagctaga gtaagtagtt 2880cgccagttaa tagtttgcgc aacgttgttg
ccattgctac aggcatcgtg gtgtcacgct 2940cgtcgtttgg tatggcttca ttcagctccg
gttcccaacg atcaaggcga gttacatgat 3000cccccatgtt gtgcaaaaaa gcggttagct
ccttcggtcc tccgatcgtt gtcagaagta 3060agttggccgc agtgttatca ctcatggtta
tggcagcact gcataattct cttactgtca 3120tgccatccgt aagatgcttt tctgtgactg
gtgagtactc aaccaagtca ttctgagaat 3180agtgtatgcg gcgaccgagt tgctcttgcc
cggcgtcaat acgggataat accgcgccac 3240atagcagaac tttaaaagtg ctcatcattg
gaaaacgttc ttcggggcga aaactctcaa 3300ggatcttacc gctgttgaga tccagttcga
tgtaacccac tcgtgcaccc aactgatctt 3360cagcatcttt tactttcacc agcgtttctg
ggtgagcaaa aacaggaagg caaaatgccg 3420caaaaaaggg aataagggcg acacggaaat
gttgaatact catactcttc ctttttcaat 3480attattgaag catttatcag ggttattgtc
tcatgagcgg atacatattt gaatgtattt 3540agaaaaataa acaaataggg gttccgcgca
catttccccg aaaagtgcca cctgacgtct 3600aagaaaccat tattatcatg acattaacct
ataaaaatag gcgtatcacg aggccctttc 3660gtc
36631222DNACricetulus griseus
12tacatgtatg tacaaaatat at
2213364PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 13Met Ala Pro Lys Lys Lys Arg Lys Val His Met Asn Thr
Lys Tyr Asn1 5 10 15Lys
Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser 20
25 30Ile Phe Ala Ser Ile Thr Pro Arg
Gln Cys Tyr Lys Phe Lys His Glu 35 40
45Leu Gln Leu Thr Phe Val Val Thr Gln Lys Thr Gln Arg Arg Trp Phe
50 55 60Leu Asp Lys Leu Val Asp Glu Ile
Gly Val Gly Tyr Val Ile Asp Gln65 70 75
80Gly Ser Val Ser His Tyr Arg Leu Ser Glu Ile Lys Pro
Leu His Asn 85 90 95Phe
Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln Ala
100 105 110Asn Leu Val Leu Lys Ile Ile
Glu Gln Leu Pro Ser Ala Lys Glu Ser 115 120
125Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala
Ala 130 135 140Leu Asn Asp Ser Lys Thr
Arg Lys Thr Thr Ser Glu Thr Val Arg Ala145 150
155 160Val Leu Asp Ser Leu Pro Gly Ser Val Gly Gly
Leu Ser Pro Ser Gln 165 170
175Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser Ser Pro Gly Ser Gly Ile
180 185 190Ser Glu Ala Leu Arg Ala
Gly Ala Gly Ser Gly Thr Gly Tyr Asn Lys 195 200
205Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly
Ser Ile 210 215 220Ile Ala Gln Ile Lys
Pro Asn Gln Ser Cys Lys Phe Lys His Gln Leu225 230
235 240Met Leu Thr Phe Thr Val Ala Gln Lys Thr
Gln Arg Arg Trp Phe Leu 245 250
255Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Ile Asp Ile Gly
260 265 270Ser Val Ser Glu Tyr
Arg Leu Ser Gln Ile Lys Pro Leu His Asn Phe 275
280 285Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln
Lys Gln Ala Asn 290 295 300Leu Val Leu
Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser Pro305
310 315 320Asp Lys Phe Leu Glu Val Cys
Thr Trp Val Asp Gln Ile Ala Ala Leu 325
330 335Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr
Val Arg Ala Val 340 345 350Leu
Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro 355
3601422DNACricetulus griseus 14aaggcactcg tgtaaacgga ta
2215364PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 15Met Ala Pro Lys Lys Lys
Arg Lys Val His Met Asn Thr Lys Tyr Asn1 5
10 15Lys Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp
Gly Asp Gly Ser 20 25 30Ile
Lys Ala Ile Ile Arg Pro Glu Gln Ser Tyr Lys Phe Lys His Arg 35
40 45Leu Arg Leu Val Phe Gln Val Thr Gln
Lys Thr Gln Arg Arg Trp Phe 50 55
60Leu Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp Arg65
70 75 80Gly Ser Val Ser Asp
Tyr Tyr Leu Ser Glu Ile Lys Pro Leu His Asn 85
90 95Phe Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu
Lys Gln Lys Gln Ala 100 105
110Asn Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser
115 120 125Pro Asp Lys Phe Leu Glu Val
Cys Thr Trp Val Asp Gln Ile Ala Ala 130 135
140Leu Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr Val Arg
Ala145 150 155 160Val Leu
Asp Ser Leu Pro Gly Ser Val Gly Gly Leu Ser Pro Ser Gln
165 170 175Ala Ser Ser Ala Ala Ser Ser
Ala Ser Ser Ser Pro Gly Ser Gly Ile 180 185
190Ser Glu Ala Leu Arg Ala Gly Ala Gly Ser Gly Thr Gly Tyr
Asn Lys 195 200 205Glu Phe Leu Leu
Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser Ile 210
215 220Trp Ala Arg Ile Lys Pro Gly Gln Ser Tyr Lys Phe
Lys His Thr Leu225 230 235
240Glu Leu Val Phe Gln Val Thr Gln Lys Thr Gln Arg Arg Trp Ile Leu
245 250 255Asp Lys Leu Val Asp
Glu Ile Gly Val Gly Tyr Val Thr Asp Ala Gly 260
265 270Ser Ala Ser Val Tyr Arg Leu Ser Glu Ile Lys Pro
Leu His Asn Phe 275 280 285Leu Thr
Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln Ala Asn 290
295 300Leu Val Leu Lys Ile Ile Glu Gln Leu Pro Ser
Ala Lys Glu Ser Pro305 310 315
320Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala Ala Leu
325 330 335Asn Asp Ser Lys
Thr Arg Lys Thr Thr Ser Glu Thr Val Arg Ala Val 340
345 350Leu Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser
Pro 355 3601629DNAArtificial SequenceDescription
of Artificial Sequence Synthetic primer 16ggagggacat taatctgcat
gcagtgatc 291729DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
17gtcttggttt gggttgtcta agcaacctc
291839DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 18cacaggtgtc cactcccagt tcaattacag ctcttaagg
391927DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 19cgatggccca ctacgtgaac catcacc
27201821DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 20cacaggtgtc cactcccagt
tcaattacag ctcttaaggc tagagtactt aatacgactc 60actataggct agcctcgagc
cgccaccatg gcaccgaaga agaagcgcaa ggtgcatatg 120gcaccgaaga agaagcgcaa
ggtgcatatg aacaccaagt acaacaagga gttcctgctc 180tacctggcgg gcttcgtcga
cggggacggc tccatcaagg cccagatctt tccgaaccag 240tgctacaagt tcaagcatca
gctgaggctc cgtttccagg tcacccagaa gacacagcgc 300cgttggttcc tcgacaagct
ggtggacgag atcggggtgg gctacgtgac tgaccgcggc 360agcgtctccg actacatgct
gagccagatc aagcctctgc acaacttcct gacccagctc 420cagcccttcc tgaagctcaa
gcagaagcag gccaacctcg tgctgaagat catcgagcag 480ctgccctccg ccaaggaatc
cccggacaag ttcctggagg tgtgcacgtg ggtggaccag 540atcgcggccc tcaacgacag
caagacccgc aagacgacct cggagacggt gcgggcggtc 600ctggactccc tcccaggatc
cgtgggaggt ctatcgccat ctcaggcatc cagcgccgca 660tcctcggctt cctcaagccc
gggttcaggg atctccgaag cactcagagc tggagcaggt 720tccggcactg gatacaacaa
ggaattcctg ctctacctgg cgggcttcgt ggacggggac 780ggctccatca tcgcccagat
caagccgggt cagtcctaca agttcaagca taccctgcag 840ctcgttttcc aggtcacgca
gaagacacag cgccgttgga tcctcgacaa gctggtggac 900gagatcgggg tgggctatgt
gatcgaccgc ggcagcgcct ccgactaccg cctgagcgag 960atcaagcctc tgcacaactt
cctgacccag ctccagccct tcctgaagct caagcagaag 1020caggccaacc tcgtgctgaa
gatcatcgag cagctgccct ccgccaagga atccccggac 1080aagttcctgg aggtgtgcac
ctgggtggac cagatcgccg ctctgaacga ctccaagacc 1140cgcaagacca cttccgagac
cgtccgcgcc gttctagaca gtctctccga gaagaagaag 1200tcgtccccct agacagtctc
tccgagaaga agaagtcgtc cccctagcgg ccgcttcgag 1260cagacatgat aagatacatt
gatgagtttg gacaaaccac aactagaatg cagtgaaaaa 1320aatgctttat ttgtgaaatt
tgtgatgcta ttgctttatt tgtaaccatt ataagctgca 1380ataaacaagt taacaacaac
aattgcattc attttatgtt tcaggttcag ggggagatgt 1440gggaggtttt ttaaagcaag
taaaacctct acaaatgtgg taaaatcgat aagatcttga 1500tccgggctgg cgtaatagcg
aagaggcccg caccgatcgc ccttcccaac agttgcgcag 1560cctgaatggc gaatggacgc
gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt 1620acgcgcagcg tgaccgctac
acttgccagc gccctagcgc ccgctccttt cgctttcttc 1680ccttcctttc tcgccacgtt
cgccggcttt ccccgtcaag ctctaaatcg ggggctccct 1740ttagggttcc gatttagtgc
tttacggcac ctcgacccca aaaaacttga ttagggtgat 1800ggttcacgta gtgggccatc g
1821211821DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
21cacaggtgtc cactcccagt tcaattacag ctcttaaggc tagagtactt aatacgactc
60actataggct agcctcgagc cgccaccatg gcaccgaaga agaagcgcaa ggtgcatatg
120gcaccgaaga agaagcgcaa ggtgcatatg aacaccaagt acaacaagga gttcctgctc
180tacctggcgg gcttcgtgga cggggacggc tccatcatcg cccagatccc gccgaaccag
240tcctgcaagt tcaagcatca gctgcgcctc accttccagg tcacgcagaa gacacagcgc
300cgttggttcc tcgacaagct ggtggacgag atcggggtgg gctacgtgcg cgaccgcggc
360agcgtctccg actacatcct gagcgagatc aagcctctgc acaacttcct gacccagctc
420cagcccttcc tgaagctcaa gcagaagcag gccaacctcg tgctgaagat catcgagcag
480ctgccctccg ccaaggaatc cccggacaag ttcctggagg tgtgcacctg ggtggaccag
540atcgccgctc tgaacgactc caagacccgc aagaccactt ccgagactgt ccgcgccgtt
600ctagacagtc tcccaggatc cgtgggaggt ctatcgccat ctcaggcatc cagcgccgca
660tcctcggctt cctcaagccc gggttcaggg atctccgaag cactcagagc tggagcaggt
720tccggcactg gatacaacaa ggaattcctg ctctacctgg cgggcttcgt ggacggggac
780ggctccatct acgccgggat cgcgccgaac cagtcctgca agttcaagca tcagctgcgc
840ctctggttcg tggtcagcca gaagacacag cgccgttggt tcctcgacaa gctggtggac
900gagatcgggg tgggctacgt gattgacaat ggcagcgtct cccattaccg cctgagcgag
960atcaagcctc tgcacaactt cctgacccag ctccagccct tcctgaagct caagcagaag
1020caggccaacc tcgtgctgaa gatcatcgag cagctgccct ccgccaagga atccccggac
1080aagttcctgg aggtgtgcac ctgggtggac cagatcgccg ctttgaacga ctccaagacc
1140cgcaagacca cttccgagac tgtccgcgcc gttctagaca gtctctccga gaagaagaag
1200tcgtccccct agacagtctc tccgagaaga agaagtcgtc cccctagcgg ccgcttcgag
1260cagacatgat aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa
1320aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt ataagctgca
1380ataaacaagt taacaacaac aattgcattc attttatgtt tcaggttcag ggggagatgt
1440gggaggtttt ttaaagcaag taaaacctct acaaatgtgg taaaatcgat aagatcttga
1500tccgggctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag
1560cctgaatggc gaatggacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt
1620acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc
1680ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct
1740ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat
1800ggttcacgta gtgggccatc g
1821221821DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 22cacaggtgtc cactcccagt tcaattacag
ctcttaaggc tagagtactt aatacgactc 60actataggct agcctcgagc cgccaccatg
gcaccgaaga agaagcgcaa ggtgcatatg 120gcaccgaaga agaagcgcaa ggtgcatatg
aacaccaagt acaacaagga gttcctgctc 180tacctggcgg gcttcgtcga cggggacggc
tccatcaagg ccattatccg gccagagcag 240tcctacaagt tcaagcatcg cctgcggctc
gttttccagg tcacgcagaa gacacagcgc 300cgttggttcc tcgacaagct ggtggacgag
atcggggtgg gctacgtgta cgaccgcggc 360agcgtctccg actactatct gagcgagatc
aagcctctgc acaacttcct gacccagctc 420cagcccttcc tgaagctcaa gcagaagcag
gccaacctcg tgctgaagat catcgagcag 480ctgccctccg ccaaggaatc cccggacaag
ttcctggagg tgtgcacgtg ggtggaccag 540atcgcggccc tcaacgacag caagacccgc
aagacgacct cggagacggt gcgagcggtc 600ctggactccc tcccaggatc cgtgggaggt
ctatcgccat ctcaggcatc cagcgccgca 660tcctcggctt cctcaagccc gggttcaggg
atctccgaag cactcagagc tggagcaggt 720tccggcactg gatacaacaa ggaattcctg
ctctacctgg cgggcttcgt ggacggggac 780ggctccatct gggcccggat caagccgggg
cagtcctaca agttcaagca taccctggag 840ctcgtgttcc aggtcaccca gaagacacag
cgccgttgga tcctcgacaa gctggtggac 900gagatcgggg tgggctacgt gaccgacgcc
ggcagcgcct ccgtctaccg cctgagcgag 960atcaagcctc tgcacaactt cctgacccag
ctccagccct tcctgaagct caagcagaag 1020caggccaacc tcgtgctgaa gatcatcgag
cagctgccct ccgccaagga atccccggac 1080aagttcctgg aggtgtgcac ctgggtggac
cagatcgccg ctctgaacga ctccaagacc 1140cgcaagacca cttccgagac cgtccgcgcc
gttctagaca gtctctccga gaagaagaag 1200tcgtccccct agacagtctc tccgagaaga
agaagtcgtc cccctagcgg ccgcttcgag 1260cagacatgat aagatacatt gatgagtttg
gacaaaccac aactagaatg cagtgaaaaa 1320aatgctttat ttgtgaaatt tgtgatgcta
ttgctttatt tgtaaccatt ataagctgca 1380ataaacaagt taacaacaac aattgcattc
attttatgtt tcaggttcag ggggagatgt 1440gggaggtttt ttaaagcaag taaaacctct
acaaatgtgg taaaatcgat aagatcttga 1500tccgggctgg cgtaatagcg aagaggcccg
caccgatcgc ccttcccaac agttgcgcag 1560cctgaatggc gaatggacgc gccctgtagc
ggcgcattaa gcgcggcggg tgtggtggtt 1620acgcgcagcg tgaccgctac acttgccagc
gccctagcgc ccgctccttt cgctttcttc 1680ccttcctttc tcgccacgtt cgccggcttt
ccccgtcaag ctctaaatcg ggggctccct 1740ttagggttcc gatttagtgc tttacggcac
ctcgacccca aaaaacttga ttagggtgat 1800ggttcacgta gtgggccatc g
18212334DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
23tgacagctct ggccttaagt gcctacgaaa ctag
342439DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 24gtctttcctc tttgctgtag ccttggtaga actactgcc
39255653DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 25tcgcgcgttt cggtgatgac ggtgaaaacc
tctgacacat gcagctcccg gagacggtca 60cagcttgtct gtaagcggat gccgggagca
gacaagcccg tcagggcgcg tcagcgggtg 120ttggcgggtg tcggggctgg cttaactatg
cggcatcaga gcagattgta ctgagagtgc 180accatatgcg gtgtgaaata ccgcacagat
gcgtaaggag aaaataccgc atcaggcgcc 240attcgccatt caggctgcgc aactgttggg
aagggcgatc ggtgcgggcc tcttcgctat 300tacgccagct ggcgaaaggg ggatgtgctg
caaggcgatt aagttgggta acgccagggt 360tttcccagtc acgacgttgt aaaacgacgg
ccagtgaatt ccatacccag gggagctgta 420ctgggctgca gccctgcgcc attcagccat
gcaccaggct actccctcct cttccagctt 480tctccttctg atggccatag gattagaaga
taagggactc tagtgcaggt caactgctga 540ccagtgtgaa aatgcacaga ctacatgctg
gtagatcagc acttcaaact actgttcacc 600atcatctctg gaataagcac tacatttaca
gggttcaaac ctcaatgaat ataaacaaac 660aaaacacacc tcccttcctt cactgtctcc
catttctttg gttcccatct ccacatagaa 720tttataatta aaatttctaa gtatctttcc
agaaatactt cacacatgtt ataagcaaat 780gtgcttttaa agatactatt ttaaattatg
aaaatggtta tattagttga gataaaagaa 840tagaatggga agttccagaa tttaaggcct
catatggatc ccagctgtgg aatgtgtgtc 900agttagggtg tggaaagtcc ccaggctccc
cagcaggcag aagtatgcaa agcatgcatc 960tcaattagtc agcaaccagg tgtggaaagt
ccccaggctc cccagcaggc agaagtatgc 1020aaagcatgca tctcaattag tcagcaacca
tagtcccgcc cctaactccg cccatcccgc 1080ccctaactcc gcccagttcc gcccattctc
cgccccatgg ctgactaatt ttttttattt 1140atgcagaggc cgaggccgcc tcggcctctg
agctattcca gaagtagtga ggaggctttt 1200ttggaggcta ccatggagaa gttactattc
cgaagttcct attctctaga aagtatagga 1260acttcaagct tggcactggg taccgccaag
ttgaccagtg ccgttccggt gctcaccgcg 1320cgcgacgtcg ccggagcggt cgagttctgg
accgaccggc tcgggttctc ccgggacttc 1380gtggaggacg acttcgccgg tgtggtccgg
gacgacgtga ccctgttcat cagcgcggtc 1440caggaccagg tggtgccgga caacaccctg
gcctgggtgt gggtgcgcgg cctggacgag 1500ctgtacgccg agtggtcgga ggtcgtgtcc
acgaacttcc gggacgcctc cgggccggcc 1560atgaccgaga tcggcgagca gccgtggggg
cgggagttcg ccctgcgcga cccggccggc 1620aactgcgtgc acttcgtggc cgaggagcag
gactgacacc cgagcgaaaa cggtctgcgc 1680tgcgggacgc gcgaattgaa ttatggccca
caccagtggc gcggcgactt ccagttcaac 1740atcagccgct acagtcaaca gcaactgatg
gaaaccagcc atcgccatct gctgcacgcg 1800gaagaaggca catggctgaa tatcgacggt
ttccatatgg ggattggtgg cgacgactcc 1860tggagcccgt cagtatcggc ggaattccag
ctgagcgccg gtcgctacca ttaccagttg 1920gtctggtgtc aaaaataata ataaccgggc
aggggggatc tgcatggatc tttgtgaagg 1980aaccttactt ctgtggtgtg acataattgg
acaaactacc tacagagatt taaagctcta 2040aggtaaatat aaaattttta agtgtataat
gtgttaaact actgattcta attgtttgtg 2100tattttagat tccaacctat ggaactgatg
aatgggagca gtggtggaat gcctttaatg 2160aggaaaacct gttttgctca gaagaaatgc
catctagtga tgatgaggct actgctgact 2220ctcaacattc tactcctcca aaaaagaaga
gaaaggtaga agaccccaag gactttcctt 2280cagaattgct aagttttttg agtcatgctg
tgtttagtaa tagaactctt gcttgctttg 2340ctatttacac cacaaaggaa aaagctgcac
tgctatacaa gaaaattatg gaaaaatatt 2400ctgtaacctt tataagtagg cataacagtt
ataatcataa catactgttt tttcttactc 2460cacacaggca tagagtgtct gctattaata
actatgctca aaaattgtgt acctttagct 2520ttttaatttg taaaggggtt aataaggaat
atttgatgta tagtgccttg actagagatc 2580ataatcagcc ataccacatt tgtagaggtt
ttacttgctt taaaaaacct cccacacctc 2640cccctgaacc tgaaacataa aatgaatgca
attgttgttg ttaacttgtt tattgcagct 2700tataatggtt acaaataaag caatagcatc
acaaatttca caaataaagc atttttttca 2760ctgcattcta gttgtggttt gtccaaactc
atcaatgtat cttatcatgt ctggatcccc 2820aggaagctcc tctgtgtcct cataaaccct
aacctcctct acttgagagg acattccaat 2880cataggctgc ccatccaccc tactagtata
tgaaaatata aagcgctttc tcttttaagt 2940ctagggtagg tgtactagat cagcgctcag
ctccatacca tgaagccatc caggagtcag 3000acctctctga cagccctgcc attgtcacag
agaagtttct gtcaccagtg ctcatgctgt 3060cagaggagcg aaggagaaaa gatgtgagac
ctcccaagtc aaagtcatct atggataaaa 3120ccttagttgc atggcacacc agtgttaggg
agtcggggaa acacagccat agcccagctt 3180cctctctgtt cttgctctta ttaccaccag
aaagaggttg cttagacaac ccaaaccaag 3240acacagggct ctgtgggagg gaatcagtcc
caggcttctg gcacatgcta tgtcaccgga 3300aagccccagc cctactccga atccccacaa
gtacagcaaa tatcagatta tagcatttaa 3360aggggcactc ttgccaaaga gaagcaccat
tggaatagcc atgcttgaga actaagcttg 3420gcgtaatcat ggtcatagct gtttcctgtg
tgaaattgtt atccgctcac aattccacac 3480aacatacgag ccggaagcat aaagtgtaaa
gcctggggtg cctaatgagt gagctaactc 3540acattaattg cgttgcgctc actgcccgct
ttccagtcgg gaaacctgtc gtgccagctg 3600cattaatgaa tcggccaacg cgcggggaga
ggcggtttgc gtattgggcg ctcttccgct 3660tcctcgctca ctgactcgct gcgctcggtc
gttcggctgc ggcgagcggt atcagctcac 3720tcaaaggcgg taatacggtt atccacagaa
tcaggggata acgcaggaaa gaacatgtga 3780gcaaaaggcc agcaaaaggc caggaaccgt
aaaaaggccg cgttgctggc gtttttccat 3840aggctccgcc cccctgacga gcatcacaaa
aatcgacgct caagtcagag gtggcgaaac 3900ccgacaggac tataaagata ccaggcgttt
ccccctggaa gctccctcgt gcgctctcct 3960gttccgaccc tgccgcttac cggatacctg
tccgcctttc tcccttcggg aagcgtggcg 4020ctttctcata gctcacgctg taggtatctc
agttcggtgt aggtcgttcg ctccaagctg 4080ggctgtgtgc acgaaccccc cgttcagccc
gaccgctgcg ccttatccgg taactatcgt 4140cttgagtcca acccggtaag acacgactta
tcgccactgg cagcagccac tggtaacagg 4200attagcagag cgaggtatgt aggcggtgct
acagagttct tgaagtggtg gcctaactac 4260ggctacacta gaagaacagt atttggtatc
tgcgctctgc tgaagccagt taccttcgga 4320aaaagagttg gtagctcttg atccggcaaa
caaaccaccg ctggtagcgg tggttttttt 4380gtttgcaagc agcagattac gcgcagaaaa
aaaggatctc aagaagatcc tttgatcttt 4440tctacggggt ctgacgctca gtggaacgaa
aactcacgtt aagggatttt ggtcatgaga 4500ttatcaaaaa ggatcttcac ctagatcctt
ttaaattaaa aatgaagttt taaatcaatc 4560taaagtatat atgagtaaac ttggtctgac
agttaccaat gcttaatcag tgaggcacct 4620atctcagcga tctgtctatt tcgttcatcc
atagttgcct gactccccgt cgtgtagata 4680actacgatac gggagggctt accatctggc
cccagtgctg caatgatacc gcgagaccca 4740cgctcaccgg ctccagattt atcagcaata
aaccagccag ccggaagggc cgagcgcaga 4800agtggtcctg caactttatc cgcctccatc
cagtctatta attgttgccg ggaagctaga 4860gtaagtagtt cgccagttaa tagtttgcgc
aacgttgttg ccattgctac aggcatcgtg 4920gtgtcacgct cgtcgtttgg tatggcttca
ttcagctccg gttcccaacg atcaaggcga 4980gttacatgat cccccatgtt gtgcaaaaaa
gcggttagct ccttcggtcc tccgatcgtt 5040gtcagaagta agttggccgc agtgttatca
ctcatggtta tggcagcact gcataattct 5100cttactgtca tgccatccgt aagatgcttt
tctgtgactg gtgagtactc aaccaagtca 5160ttctgagaat agtgtatgcg gcgaccgagt
tgctcttgcc cggcgtcaat acgggataat 5220accgcgccac atagcagaac tttaaaagtg
ctcatcattg gaaaacgttc ttcggggcga 5280aaactctcaa ggatcttacc gctgttgaga
tccagttcga tgtaacccac tcgtgcaccc 5340aactgatctt cagcatcttt tactttcacc
agcgtttctg ggtgagcaaa aacaggaagg 5400caaaatgccg caaaaaaggg aataagggcg
acacggaaat gttgaatact catactcttc 5460ctttttcaat attattgaag catttatcag
ggttattgtc tcatgagcgg atacatattt 5520gaatgtattt agaaaaataa acaaataggg
gttccgcgca catttccccg aaaagtgcca 5580cctgacgtct aagaaaccat tattatcatg
acattaacct ataaaaatag gcgtatcacg 5640aggccctttc gtc
56532629DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
26agatgcatgc tttgcatact tctgcctgc
29275785DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 27gacggatcgg gagatctccc gatcccctat
ggtgcactct cagtacaatc tgctctgatg 60ccgcatagtt aagccagtat ctgctccctg
cttgtgtgtt ggaggtcgct gagtagtgcg 120cgagcaaaat ttaagctaca acaaggcaag
gcttgaccga caattgcatg aagaatctgc 180ttagggttag gcgttttgcg ctgcttcgcg
atgtacgggc cagatatacg cgttgacatt 240gattattgac tagttattaa tagtaatcaa
ttacggggtc attagttcat agcccatata 300tggagttccg cgttacataa cttacggtaa
atggcccgcc tggctgaccg cccaacgacc 360cccgcccatt gacgtcaata atgacgtatg
ttcccatagt aacgccaata gggactttcc 420attgacgtca atgggtggag tatttacggt
aaactgccca cttggcagta catcaagtgt 480atcatatgcc aagtacgccc cctattgacg
tcaatgacgg taaatggccc gcctggcatt 540atgcccagta catgacctta tgggactttc
ctacttggca gtacatctac gtattagtca 600tcgctattac catggtgatg cggttttggc
agtacatcaa tgggcgtgga tagcggtttg 660actcacgggg atttccaagt ctccacccca
ttgacgtcaa tgggagtttg ttttggcacc 720aaaatcaacg ggactttcca aaatgtcgta
acaactccgc cccattgacg caaatgggcg 780gtaggcgtgt acggtgggag gtctatataa
gcagagctct ctggctaact agagaaccca 840ctgcttactg gcttatcgaa attaatacga
ctcactatag ggagacccaa gctggctagc 900gtttaaactt aagcttagcc accatggtga
gcaagggcga ggagctgttc accggggtgg 960tgcccatcct ggtcgagctg gacggcgacg
taaacggcca caagttcagc gtgtccggcg 1020agggcgaggg cgatgccacc tacggcaagc
tgaccctgaa gttcatctgc accaccggca 1080agctgcccgt gccctggccc accctcgtga
ccaccctgac ctacggagtg cagtgcttca 1140gccgctaccc cgaccacatg aagcagcacg
acttcttcaa gtccgccatg cccgaaggct 1200acgtccagga gcgcaccatc ttcttcaagg
acgacggcaa ctacaagacc cgcgccgagg 1260tgaagttcga gggcgacacc ctggtgaacc
gcatcgagct gaagggcatc gacttcaagg 1320aggacggcaa catcctgggg cacaagctgg
agtacaacta caacagccac aacgtctata 1380tcatggccga caagcagaag aacggcatca
aggtgaactt caagatccgc cacaacatcg 1440aggacggcag cgtgcagctc gccgaccact
accagcagaa cacccccatc ggcgacggcc 1500ccgtgctgct gcccgacaac cactacctga
gcacccagtc cgccctgagc aaagacccca 1560acgagaagcg cgatcacatg gtcctgctgg
agttcgtgac cgccgccggg atcactctcg 1620gcatggacga gctgtacaag taaggatcca
ctagtccagt gtggtggaat tctgcagata 1680tccagcacag tggcggccgc tcgagtctag
agggcccgtt taaacccgct gatcagcctc 1740gactgtgcct tctagttgcc agccatctgt
tgtttgcccc tcccccgtgc cttccttgac 1800cctggaaggt gccactccca ctgtcctttc
ctaataaaat gaggaaattg catcgcattg 1860tctgagtagg tgtcattcta ttctgggggg
tggggtgggg caggacagca agggggagga 1920ttgggaagac aatagcaggc atgctgggga
tgcggtgggc tctatggctt ctgaggcgga 1980aagaaccagc tggggctcta gggggtatcc
ccacgcgccc tgtagcggcg cattaagcgc 2040ggcgggtgtg gtggttacgc gcagcgtgac
cgctacactt gccagcgccc tagcgcccgc 2100tcctttcgct ttcttccctt cctttctcgc
cacgttcgcc ggctttcccc gtcaagctct 2160aaatcggggg ctccctttag ggttccgatt
tagtgcttta cggcacctcg accccaaaaa 2220acttgattag ggtgatggtt cacgtaccta
gaagttccta ttccgaagtt cctattctct 2280agaaagtata ggaacttcct tggccaaaaa
gcctgaactc accgcgacgt ctgtcgagaa 2340gtttctgatc gaaaagttcg acagcgtctc
cgacctgatg cagctctcgg agggcgaaga 2400atctcgtgct ttcagcttcg atgtaggagg
gcgtggatat gtcctgcggg taaatagctg 2460cgccgatggt ttctacaaag atcgttatgt
ttatcggcac tttgcatcgg ccgcgctccc 2520gattccggaa gtgcttgaca ttggggaatt
cagcgagagc ctgacctatt gcatctcccg 2580ccgtgcacag ggtgtcacgt tgcaagacct
gcctgaaacc gaactgcccg ctgttctgca 2640gccggtcgcg gaggccatgg atgcgatcgc
tgcggccgat cttagccaga cgagcgggtt 2700cggcccattc ggaccgcaag gaatcggtca
atacactaca tggcgtgatt tcatatgcgc 2760gattgctgat ccccatgtgt atcactggca
aactgtgatg gacgacaccg tcagtgcgtc 2820cgtcgcgcag gctctcgatg agctgatgct
ttgggccgag gactgccccg aagtccggca 2880cctcgtgcac gcggatttcg gctccaacaa
tgtcctgacg gacaatggcc gcataacagc 2940ggtcattgac tggagcgagg cgatgttcgg
ggattcccaa tacgaggtcg ccaacatctt 3000cttctggagg ccgtggttgg cttgtatgga
gcagcagacg cgctacttcg agcggaggca 3060tccggagctt gcaggatcgc cgcggctccg
ggcgtatatg ctccgcattg gtcttgacca 3120actctatcag agcttggttg acggcaattt
cgatgatgca gcttgggcgc agggtcgatg 3180cgacgcaatc gtccgatccg gagccgggac
tgtcgggcgt acacaaatcg cccgcagaag 3240cgcggccgtc tggaccgatg gctgtgtaga
agtactcgcc gatagtggaa accgacgccc 3300cagcactcgt ccgagggcaa aggaatagca
cgtactacga gatttcgatt ccaccgccgc 3360cttctatgaa aggttgggct tcggaatcgt
tttccgggac gccggctgga tgatcctcca 3420gcgcggggat ctcatgctgg agttcttcgc
ccaccccaac ttgtttattg cagcttataa 3480tggttacaaa taaagcaata gcatcacaaa
tttcacaaat aaagcatttt tttcactgca 3540ttctagttgt ggtttgtcca aactcatcaa
tgtatcttat catgtctgta taccgtcgac 3600ctctagctag agcttggcgt aatcatggtc
atagctgttt cctgtgtgaa attgttatcc 3660gctcacaatt ccacacaaca tacgagccgg
aagcataaag tgtaaagcct ggggtgccta 3720atgagtgagc taactcacat taattgcgtt
gcgctcactg cccgctttcc agtcgggaaa 3780cctgtcgtgc cagctgcatt aatgaatcgg
ccaacgcgcg gggagaggcg gtttgcgtat 3840tgggcgctct tccgcttcct cgctcactga
ctcgctgcgc tcggtcgttc ggctgcggcg 3900agcggtatca gctcactcaa aggcggtaat
acggttatcc acagaatcag gggataacgc 3960aggaaagaac atgtgagcaa aaggccagca
aaaggccagg aaccgtaaaa aggccgcgtt 4020gctggcgttt ttccataggc tccgcccccc
tgacgagcat cacaaaaatc gacgctcaag 4080tcagaggtgg cgaaacccga caggactata
aagataccag gcgtttcccc ctggaagctc 4140cctcgtgcgc tctcctgttc cgaccctgcc
gcttaccgga tacctgtccg cctttctccc 4200ttcgggaagc gtggcgcttt ctcatagctc
acgctgtagg tatctcagtt cggtgtaggt 4260cgttcgctcc aagctgggct gtgtgcacga
accccccgtt cagcccgacc gctgcgcctt 4320atccggtaac tatcgtcttg agtccaaccc
ggtaagacac gacttatcgc cactggcagc 4380agccactggt aacaggatta gcagagcgag
gtatgtaggc ggtgctacag agttcttgaa 4440gtggtggcct aactacggct acactagaag
gacagtattt ggtatctgcg ctctgctgaa 4500gccagttacc ttcggaaaaa gagttggtag
ctcttgatcc ggcaaacaaa ccaccgctgg 4560tagcggtggt ttttttgttt gcaagcagca
gattacgcgc agaaaaaaag gatctcaaga 4620agatcctttg atcttttcta cggggtctga
cgctcagtgg aacgaaaact cacgttaagg 4680gattttggtc atgagattat caaaaaggat
cttcacctag atccttttaa attaaaaatg 4740aagttttaaa tcaatctaaa gtatatatga
gtaaacttgg tctgacagtt accaatgctt 4800aatcagtgag gcacctatct cagcgatctg
tctatttcgt tcatccatag ttgcctgact 4860ccccgtcgtg tagataacta cgatacggga
gggcttacca tctggcccca gtgctgcaat 4920gataccgcga gacccacgct caccggctcc
agatttatca gcaataaacc agccagccgg 4980aagggccgag cgcagaagtg gtcctgcaac
tttatccgcc tccatccagt ctattaattg 5040ttgccgggaa gctagagtaa gtagttcgcc
agttaatagt ttgcgcaacg ttgttgccat 5100tgctacaggc atcgtggtgt cacgctcgtc
gtttggtatg gcttcattca gctccggttc 5160ccaacgatca aggcgagtta catgatcccc
catgttgtgc aaaaaagcgg ttagctcctt 5220cggtcctccg atcgttgtca gaagtaagtt
ggccgcagtg ttatcactca tggttatggc 5280agcactgcat aattctctta ctgtcatgcc
atccgtaaga tgcttttctg tgactggtga 5340gtactcaacc aagtcattct gagaatagtg
tatgcggcga ccgagttgct cttgcccggc 5400gtcaatacgg gataataccg cgccacatag
cagaacttta aaagtgctca tcattggaaa 5460acgttcttcg gggcgaaaac tctcaaggat
cttaccgctg ttgagatcca gttcgatgta 5520acccactcgt gcacccaact gatcttcagc
atcttttact ttcaccagcg tttctgggtg 5580agcaaaaaca ggaaggcaaa atgccgcaaa
aaagggaata agggcgacac ggaaatgttg 5640aatactcata ctcttccttt ttcaatatta
ttgaagcatt tatcagggtt attgtctcat 5700gagcggatac atatttgaat gtatttagaa
aaataaacaa ataggggttc cgcgcacatt 5760tccccgaaaa gtgccacctg acgtc
57852830DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
28cagaaacttc tcgacagacg tcgcggtgag
3029364PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 29Met Ala Pro Lys Lys Lys Arg Lys Val His Met Asn Thr
Lys Tyr Asn1 5 10 15Lys
Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly Ser 20
25 30Ile Cys Ala Ser Ile Arg Pro Glu
Gln Glu Arg Lys Phe Lys His Arg 35 40
45Leu Val Leu Arg Phe Glu Val Thr Gln Lys Thr Gln Arg Arg Trp Phe
50 55 60Leu Asp Lys Leu Val Asp Glu Ile
Gly Val Gly Tyr Val Tyr Asp Ser65 70 75
80Gly Ser Val Ser Arg Tyr Tyr Leu Ser Gln Ile Lys Pro
Leu His Asn 85 90 95Phe
Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln Lys Gln Ala
100 105 110Asn Leu Val Leu Lys Ile Ile
Glu Gln Leu Pro Ser Ala Lys Glu Ser 115 120
125Pro Asp Lys Phe Leu Glu Val Cys Thr Trp Val Asp Gln Ile Ala
Ala 130 135 140Leu Asn Asp Ser Lys Thr
Arg Lys Thr Thr Ser Glu Thr Val Arg Ala145 150
155 160Val Leu Asp Ser Leu Pro Gly Ser Val Gly Gly
Leu Ser Pro Ser Gln 165 170
175Ala Ser Ser Ala Ala Ser Ser Ala Ser Ser Ser Pro Gly Ser Gly Ile
180 185 190Ser Glu Ala Leu Arg Ala
Gly Ala Gly Ser Gly Thr Gly Tyr Asn Lys 195 200
205Glu Phe Leu Leu Tyr Leu Ala Gly Phe Val Asp Gly Asp Gly
Ser Ile 210 215 220Phe Ala Thr Ile Cys
Pro Arg Gln Gln Tyr Lys Phe Lys His Gln Leu225 230
235 240Arg Leu Arg Phe Glu Val Asp Gln Lys Thr
Gln Arg Arg Trp Phe Leu 245 250
255Asp Lys Leu Val Asp Glu Ile Gly Val Gly Tyr Val Tyr Asp Leu Gly
260 265 270Ser Val Ser Arg Tyr
Gly Leu Ser Glu Ile Lys Pro Leu His Asn Phe 275
280 285Leu Thr Gln Leu Gln Pro Phe Leu Lys Leu Lys Gln
Lys Gln Ala Asn 290 295 300Leu Val Leu
Lys Ile Ile Glu Gln Leu Pro Ser Ala Lys Glu Ser Pro305
310 315 320Asp Lys Phe Leu Glu Val Cys
Thr Trp Val Asp Gln Ile Ala Ala Leu 325
330 335Asn Asp Ser Lys Thr Arg Lys Thr Thr Ser Glu Thr
Val Arg Ala Val 340 345 350Leu
Asp Ser Leu Ser Glu Lys Lys Lys Ser Ser Pro 355
3603022DNAArtificial SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 30cagcacgtct caccccaccc ct
223128DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 31ggaatctgac tgtggtaagc ctgtacac
283224DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 32cagcactcag gaggtagagg cagg
243336DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
33tcttactgac atccactttg cctttctctc cacagg
3634130DNAArtificial SequenceDescription of Artificial Sequence Synthetic
polynucleotide 34acttgtttat tgcagcttat aatggttaca aataaagcaa
tagcatcaca aatttcacaa 60ataaagcatt tttttcactg cattctagtt gtggtttgtc
caaactcatc aatgtatctt 120atcatgtctg
13035225DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 35ctgtgccttc tagttgccag
ccatctgttg tttgcccctc ccccgtgcct tccttgaccc 60tggaaggtgc cactcccact
gtcctttcct aataaaatga ggaaattgca tcgcattgtc 120tgagtaggtg tcattctatt
ctggggggtg gggtggggca ggacagcaag ggggaggatt 180gggaagacaa tagcaggcat
gctggggatg cggtgggctc tatgg 225369PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 36Met
Ala Pro Lys Lys Lys Arg Lys Val1 53753DNACricetulus griseus
37gaatgggaag ttccagaatt taaggcctca tatgaaaata taaagcgctt tct
533836DNACricetulus griseus 38gaatgggaag ttccagaatt taataaagcg ctttct
363928DNACricetulus griseus 39gaatgggaag
ttccagaaag cgctttct
284027DNACricetulus griseus 40gaatgggaag ttccagaagc gctttct
274169DNACricetulus griseus 41tctgaaagcc
agaagagcct agacagatag atgtcttgca tactctagag actacagatg 60ccggcccag
694258DNACricetulus griseus 42tctgaaagcc agaagagcct agacatgcat actctagaga
ctacagatgc cggcccag 584349DNACricetulus griseus 43tctgaaagcc
agaagagcct agacagatag atgtcagatg ccggcccag
494436DNACricetulus griseus 44tctgaaagcc agaagagcct acagatgccg gcccag
364551DNACricetulus griseus 45tctgaaagcc
agaagagcct agacagatag atgtcttgca tgccggccca g
514665DNACricetulus griseus 46tctgaaagcc agaagagcct agacagatag atgtcttgct
ctagagacta cagatgccgg 60cccag
654767DNACricetulus griseus 47tctgaaagcc
agaagagcct agacagatag atgtcttata ctctagagac tacagatgcc 60ggcccag
674838DNACricetulus griseus 48tctgaaagcc agaagagcct agacagatgc cggcccag
384954DNACricetulus griseus 49tctgaaagcc
agaagagcct agacagatag atgtctttac agatgccgac ccag
545065DNACricetulus griseus 50tctgaaagcc agaagagcct agacagatag atgtcttgct
ctagagacta cagatgccgg 60cccag
655169DNACricetulus griseus 51tctgaaagcc
agaagagcct agacagatag atgtctttgc atactctaga gactacagat 60gccggccca
695275DNACricetulus griseus 52gatgctttat tcctagagac caatttaagg cactcgtgta
aacggataat ggacatggtg 60agcaaccagc acccc
755369DNACricetulus griseus 53gatgctttat
tcctagagac caatttaagg cactcgtgtg taatggacat ggtgagcaac 60cagcacccc
695474DNACricetulus griseus 54gatgctttat tcctagagac caatttaagg cactcgtgta
acggataatg gacatggtga 60gcaaccagca cccc
745571DNACricetulus griseus 55gatgctttat
tcctagagac caatttaagg cactcaaacg gataatggac atggtgagca 60accagcaccc c
715673DNACricetulus griseus 56gatgctttat tcctagagac caatttaagg cactcgtaaa
cggataatgg acatggtgag 60caaccagcac ccc
735774DNACricetulus griseus 57gatgctttat
tcctagagac caatttaagg cgctgtgtaa acggataatg gacatggtga 60gcaaccagca
cccc
745875DNACricetulus griseus 58gatgctttat tcctagagac caatttaagg cattcgtgta
aacggataat ggacatggtg 60agcaaccagc acccc
755975DNACricetulus griseus 59gatgctttat
tcctagagac caatttaagg cactcatgta aacggataat ggacatggtg 60agcaaccagc
acccc
756075DNACricetulus griseus 60gatgctttat tcctagagac caatttaagg cgctcgtgta
aacggataat ggacatggtg 60agcaaccagc acccc
756175DNACricetulus griseus 61gatgctttat
tcctagagac caatttaagg cactcgcgta aacggataat ggacatggtg 60agcaaccagc
acccc
756275DNACricetulus griseus 62gatgctttat tcctagagac caatttaagg cacacgtgta
aacggataat ggacatggtg 60agcaaccagc acccc
756374DNACricetulus griseus 63gatgctttat
tcctagagac caatttaagg cactcgtgtg taaacggata atggacatgg 60tgagcaacca
gcac
746457DNACricetulus griseus 64gatgctttat tcctagagac caatttaagg catggtgagt
aaccgagcaa ccagcac 57
User Contributions:
Comment about this patent or add new information about this topic: