Patent application title: INDUCIBLE PROMOTER SEQUENCES FOR REGULATED EXPRESSION AND METHODS OF USE
Inventors:
Andrew Cigan (Johnston, IA, US)
Erica Unger-Wallace (Ames, IA, US)
IPC8 Class: AC12N1582FI
USPC Class:
800278
Class name: Multicellular living organisms and unmodified parts thereof and related processes method of introducing a polynucleotide molecule into or rearrangement of genetic material within a plant or plant part
Publication date: 2015-04-09
Patent application number: 20150101077
Abstract:
The plant promoter of a CBSU-Anther_Subtraction library (CAS1) gene
encoding a mannitol dehydrogenase, and fragments thereof, and their use
in promoting the expression of one or more heterologous nucleic acid
fragments in an inducible manner in plants are described. These promoter
fragments are also useful in creating recombinant DNA constructs
comprising nucleic acid sequences encoding a desired gene product
operably linked to such promoter fragments which can be utilized to
transform plants and bring the expression of the gene product under
external chemical and/or heat control in monocotyledonous and
dicotyledonous plants.Claims:
1. A recombinant DNA construct comprising: a) a nucleotide sequence
comprising any one of the sequences set forth in SEQ ID NO:9 or SEQ ID
NO:10, or a full-length complement thereof; b) a nucleotide sequence
comprising a functional fragment of SEQ ID NO:10, or a full-length
complement thereof; c) a nucleotide sequence comprising a sequence having
at least 85% sequence identity, based on the BLASTN method of alignment,
when compared to the nucleotide sequence of (a) or (b); d) a nucleotide
sequence which hybridizes to SEQ ID NO:9 under highly stringent
conditions of a wash of 0.1 SSC, 0.1% (w/v) SDS at 65.degree. C.; e) a
nucleotide sequence comprising all or a fragment of a 1.7 kb 5'
non-coding sequence of a mannitol dehydrogenase; or, f) a derivative of
one of the nucleotide sequences indicated in (a), (b), (c), (d) or (e)
obtained by substitution, addition and/or deletion of one or more
nucleotides; and, wherein said nucleotide sequence is an inducible
promoter.
2. The recombinant DNA construct of claim 1, wherein the nucleotide sequence of c) has at least 90% identity, based on the BLASTN method of alignment, when compared to the sequence set forth in SEQ ID NO:1.
3-4. (canceled)
5. The recombinant DNA construct of claim 1 wherein said inducible promoter is induced by a chemical or stress treatment.
6. The recombinant DNA construct of claim 1 wherein said inducible promoter is induced by a safener or heat treatment.
7. (canceled)
8. The recombinant DNA construct of claim 6, wherein said heat treatment comprises a temperature greater than 26.degree. C.
9-11. (canceled)
12. A vector comprising the recombinant DNA construct of claim 1.
13. A cell comprising the recombinant DNA construct of claim 1.
14. The cell of claim 13, wherein the cell is a plant cell.
15. (canceled)
16. A transgenic plant having stably incorporated into its genome the recombinant DNA construct of claim 1.
17-20. (canceled)
21. Transgenic seed produced by the transgenic plant of claim 16.
22. A plant stably transformed with a recombinant expression construct comprising a plant promoter and a heterologous nucleic acid fragment operably linked to said promoter, wherein said promoter is an inducible promoter and capable of controlling expression of said heterologous nucleic acid fragment in a plant cell, and further wherein said promoter comprises a fragment of SEQ ID NO:10.
23. A method of expressing a coding sequence or a functional RNA in a plant cell comprising: a) introducing the recombinant DNA construct of claim 1 into a plant cell, wherein the at least one heterologous sequence comprises a coding sequence or a functional RNA; b) growing the plant cell of step a); c) induction of the inducible promoter by chemical or stress treatment on the plant cell of b); and, d) selecting a plant cell displaying expression of the coding sequence or the functional RNA of the recombinant DNA construct.
24-29. (canceled)
30. A method for altering expression of at least one heterologous nucleic acid fragment in a plant comprising: (a) transforming a plant cell with the recombinant expression construct of claim 1; (b) induction of the inducible promoter by chemical or stress treatment on the cell of (a) (c) growing fertile mature plants from the transformed plant cell of step (a); and, (d) selecting plants containing the transformed plant cell wherein the expression of the heterologous nucleic acid fragment is increased or decreased.
31. A method of transgenically altering a marketable plant trait, comprising: a) introducing a recombinant DNA construct of claim 1 into a plant; b) induction of the inducible promoter by chemical or stress treatment on the plant of (a); c) growing a fertile, mature plant resulting from step b); and d) selecting a plant expressing the at least one heterologous nucleotide sequence in at least one plant tissue based on the altered marketable trait.
32. (canceled)
33. A recombinant DNA construct comprising: a) a nucleotide sequence comprising all or a functional fragment of SEQ ID NO: 19 or SEQ ID NO: 22; b) a nucleotide sequence comprising a full-length complement of the nucleotide sequence (a); or, c) a nucleotide sequence comprising a sequence having at least 90% sequence identity, based on the BLASTN method of alignment, when compared to the nucleotide sequence of (a) or (b); and, wherein said nucleotide sequence is a promoter.
Description:
[0001] This application claims the benefit of U.S. Provisional Application
No. 61/648,758, filed May 18, 2012, the entire content of which is herein
incorporated by reference.
FIELD OF THE INVENTION
[0002] The present invention relates to a plant promoter, and fragments thereof, and their use in altering expression of at least one heterologous nucleic acid sequence in plants in an inducible manner. These promoter fragments are also useful in creating recombinant DNA constructs comprising nucleic acid sequences encoding a desired gene product operably linked to such promoter fragments which can be utilized to transform plants and bring the expression of the gene product under external chemical and/or stress control in monocotyledonous and dicotyledonous plants.
BACKGROUND OF THE INVENTION
[0003] Recent advances in plant genetic engineering have opened new doors to engineer plants to have improved characteristics or traits, such as plant disease resistance, insect resistance, herbicidal resistance, yield improvement, improvement of the nutritional quality of the edible portions of the plant, and enhanced stability or shelf-life of the ultimate consumer product obtained from the plants. Thus, a desired gene (or genes) with the molecular function to impart different or improved characteristics or qualities, can be incorporated properly into the plant's genome. The newly integrated gene (or genes) coding sequence can then be expressed in the plant cell to exhibit the desired new trait or characteristics. It is important that appropriate regulatory signals must be present in proper configurations in order to obtain the expression of the newly inserted gene coding sequence in the plant cell. These regulatory signals typically include a promoter region, a 5' non-translated leader sequence and a 3' transcription termination/polyadenylation sequence.
[0004] A promoter is a non-coding genomic DNA sequence, usually upstream (5') to the relevant coding sequence, to which RNA polymerase binds before initiating transcription. This binding aligns the RNA polymerase so that transcription will initiate at a specific transcription initiation site. The nucleotide sequence of the promoter determines the nature of the enzyme and other related protein factors that attach to it and the rate of RNA synthesis. The RNA is processed to produce messenger RNA (mRNA) which serves as a template for translation of the RNA sequence into the amino acid sequence of the encoded polypeptide. The 5' non-translated leader sequence is a region of the mRNA upstream of the coding region that may play a role in initiation and translation of the mRNA. The 3' transcription termination/polyadenylation signal is a non-translated region downstream of the coding region that functions in the plant cell to cause termination of the RNA synthesis and the addition of polyadenylate nucleotides to the 3' end.
[0005] It has been shown that certain promoters are able to direct RNA synthesis at a higher rate than others. These are called "strong promoters". Certain other promoters have been shown to direct RNA synthesis at higher levels only in particular types of cells or tissues and are often referred to as "tissue specific promoters", or "tissue-preferred promoters" if the promoters direct RNA synthesis preferably in certain tissues but also in other tissues at reduced levels. Certain promoters are able to direct RNA synthesis at relatively similar levels across all tissues of a plant. These are called "constitutive promoters" or "tissue-independent" promoters. Constitutive promoters can be divided into strong, moderate and weak according to their effectiveness to direct RNA synthesis. In some cases promoters are able to direct RNA synthesis when they are induced by external stimuli such as chemicals, stress, or biotic stimuli. These are called "inducible promoters".
[0006] The ability to externally control the expression of selected genes and thereby their gene products in plant cells and/or field grown plants can provide important agronomic and foodstuff benefits. This control is desirable for the regulation of genes that might be placed into transgenic plants and has many applications including, but not limited to, (1) prolonging or extending the accumulation of desirable nutritional food reserve in seeds, roots, (2) producing and accumulating products in plant tissues at a defined time in the developmental cycle such that these products are convenient for harvest and/or isolation, and (3) initiating the expression a pest-specific toxin at the site of pathogen attack. There is an ongoing interest in the isolation of novel inducible promoters which are capable of controlling the expression of a chimeric gene or (genes) at certain levels in a plant cell when exposed to external stimuli.
SUMMARY OF THE INVENTION
[0007] This invention relates to a plant promoter of a CBSU-Anther_Subtraction library (CAS1) gene encoding a mannitol dehydrogenase, and functional fragments thereof, and their use in promoting the expression of one or more heterologous nucleic acid fragments in an inducible manner in plants. These promoter fragments are also useful in creating recombinant DNA constructs comprising nucleic acid sequences encoding a desired gene product operably linked to such promoter fragments which can be utilized to transform plants and bring the expression of the gene product under external chemical and/or heat control in monocotyledonous and dicotyledonous plants. One embodiment of the invention concerns an isolated nucleic acid fragment comprising an inducible ZmCAS1 promoter wherein said promoter consists essentially of the nucleotide sequence set forth in SEQ ID NOs: 9 or 10, or said promoter consists essentially of a fragment that is substantially similar and functionally equivalent to the nucleotide sequence set forth in SEQ ID NOs: 9 or 10. The ZmCAS1 promoter can be induced by a chemical or stress treatment. The chemical can be a safener such as, but not limited to, N-(aminocarbonyl)-2-chlorobenzenesulfonamide (2-CBSU). The stress treatment can be a treatment such as, but not limited to, a heat shock treatment of a temperature greater than 26° C.
[0008] The invention also concerns a recombinant DNA construct comprising at least one heterologous nucleic acid fragment operably linked to the promoter of the invention.
[0009] In another embodiment, this invention concems a cell, plant, or seed comprising a recombinant expression construct of the present disclosure.
[0010] In another embodiment, this invention concerns a plant stably transformed with a recombinant expression construct comprising a plant promoter and a heterologous nucleic acid fragment operably linked to said promoter, wherein said promoter is an inducible promoter and capable of controlling expression of said heterologous nucleic acid fragment in a plant cell, and further wherein said promoter comprises a fragment of SEQ ID NOs: 9 or 10.
[0011] In another embodiment, this invention concerns a method of expressing a coding sequence or a functional RNA in a plant cell comprising: a) introducing the recombinant DNA construct of the current disclosure into a plant cell, wherein at least one heterologous sequence comprises a coding sequence or a functional RNA, b) growing the plant cell of step a); c) induction of the inducible promoter by chemical or stress treatment on the plant cell of b); and, d) selecting a plant cell displaying expression of the coding sequence or the functional RNA of the recombinant DNA construct. In another embodiment, this invention concerns a method of expressing a coding sequence or a functional RNA driven by the promoter of the current invention in anther, callus, leaf or root cells.
BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE LISTINGS
[0012] The invention can be more fully understood from the following detailed description and the accompanying drawings and Sequence Listing that form a part of this application.
[0013] FIG. 1: Alignment the amino acid sequence encoded by the ZmCAS1cDNA (SEQ ID NO:5) with a maize mannitol dehydrogenase (GI:226528549; SEQ ID NO:6) (A) and percent identity (B).
[0014] FIG. 2: Norther blot of maize anther RNA of wild-type fertile (F) and sterile (S) maize control plants (-) and maize CBSU treated plants (+). Maize anther RNA was analyzed with probes specific for ZmCAS1, IN2-2, 5126, MS45, ACTIN and UBI gene expression.
[0015] FIG. 3: Northern blot of maize callus (C), leaf (L) and anther (A) RNAs from wild-type maize tissues and CBSU-treated (+) tissues. Maize RNA was analyzed with probes specific for IN2-2 and ZmCAS1.
[0016] FIG. 4 shows A) maize callus transformed with PHP16975 comprising the 1.7 kb ZmCAS1 promoter for three different events (1, 2, 3) C=control maintenance media; 10=10 mg/l CBSU, 100=100 mg/l CBSU; B) maize callus transformed with PHP16974 comprising the truncated 1.0 kb ZmCAS1 promoter and induced with either CBSU or heat (37° C.); 26C=control callus at 26° C.; 26C+CBSU=CBSU treated callus at 26° C.; 37C=callus induced by heat treatment of 37° C. Results from seven events (1-7) are shown.
[0017] FIG. 5 shows maize leaf punches from three (1, 2, 3) maize plants transformed with PHP16975 and induced with CBSU. Leaf punches from plants regenerated from the 3 bialophos-resistant events were collected pre- (C) and post-watering (S).
[0018] FIG. 6 shows a Northern blot of maize callus RNA from five (1, 2, 3, 4 and 5) events transformed with PHP16972 and treated with (+) or without (-) CBSU.
[0019] FIG. 7 shows a Western analysis of leaves from ms45/ms45 maize plants transformed with PHP16973 using antibodies directed against the maize MS45 protein. C=leaves from uninduced control plants, +=leaves from CBSU induced plants. Whole-cell anther extract from a wild-type MS45 plant is shown in Lane 1 and used to identify the mobility of the immunoreactive MS45 protein as indicated by the arrow.
[0020] FIG. 8 shows a Western analysis of anthers from ms45/ms45 maize plants transformed with PHP16973 using antibodies directed against the maize MS45 protein. C=leaves from uninduced control plants, +=leaves from CBSU induced plants.
[0021] FIG. 9: Rice events transformed with PHP16974 show GUS expression when driven by the 1.0 kb ZmCAS1 promoter and induced by CBSU.
[0022] FIG. 10: Rice seedlings transformed with PHP16974 show GUS expression when driven by the 1.0 kb ZmCAS1 promoter and induced by CBSU.
[0023] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0024] The sequence descriptions summarize the Sequence Listing attached hereto. The Sequence Listing contains one letter codes for nucleotide sequence characters and the single and three letter codes for amino acids as defined in the IUPAC-IUB standards described in Nucleic Acids Research 13:3021-3030 (1985) and in the Biochemical Journal 219(2):345-373 (1984). The symbols and format used for nucleotide and amino acid sequence data comply with the rules set forth in 37 C.F.R. §1.822.
[0025] SEQ ID NO:1 DNA insert comprising the ZmCAS1c-1 cDNA.
[0026] SEQ ID NO:2 DNA insert comprising the ZmCAS1c-2 cDNA.
[0027] SEQ ID NO:3 A 1354 bp (base pair) SalI-NotI DNA insert comprising the maize ZmCAS1 full length cDNA.
[0028] SEQ ID NO:4 the 1338 bp maize ZmCAS1 full length cDNA.
[0029] SEQ ID NO:5 the amino acid sequence encoded by SEQ ID NO:4
[0030] SEQ ID NO:6 the amino acid sequence of a maize mannitol dehydrogenase (GI number 226528549, NP--001147757.1)
[0031] SEQ ID NO:7 a 4069 bp DNA fragment comprising the maize B73 ZmCAS1 promoter
[0032] SEQ ID NO:8 is the DNA sequence of the oligonucleotide used for mutagenesis to introduce RCAI DNA restriction site.
[0033] SEQ ID NO:9 is a 1049 bp truncated form of the maize ZmCAS1 promoter (bp 698-1746 of SEQ ID NO:9) also referred to as the 1.0 kb ZmCAS1 promoter.
[0034] SEQ ID NO:10 is 1746 bp maize ZmCAS1 promoter, also referred to as the 1.7 kb ZmCAS1 promoter.
[0035] SEQ ID NO: 11 is the nucleotide sequence of PHP16974 comprising the 1.0 kb ZmCAS1 promoter.
[0036] SEQ ID NO: 12 is the nucleotide sequence of PHP16975 comprising the 1.7 kb ZmCAS1 promoter.
[0037] SEQ ID NO: 13 is the nucleotide sequence of PHP16972 comprising the 1.0 kb ZmCAS1 promoter.
[0038] SEQ ID NO: 14 is the nucleotide sequence of PHP16973 comprising the 1.7 kb ZmCAS1 promoter.
[0039] SEQ ID NO: 15 is the HindIII-Rca1 fragment (ZMCAS1HINDIIIPRO) comprising the 1.0 kb ZmCAS1 promoter of SEQ ID NO:9.
[0040] SEQ ID NO: 16 is the BamH1-Rca1 fragment (ZMCAS1BAMPRO) comprising the 1.7 kb ZmCAS1 promoter of SEQ ID NO:10.
[0041] SEQ ID NO: 17 is the amino acid sequence of a mannitol dehydrogenase (AAP52597) from rice (Oryza sativa).
[0042] SEQ ID NO: 18 is a nudeotide sequence from a mannitol dehydrogenase gene region (DP000086) from rice (Oryza sativa).
[0043] SEQ ID NO: 19 is a nucleotide sequence of a putative 5'UTR-Promoter region from a mannitol dehydrogenase gene (DP000086) from rice (Oryza sativa).
[0044] SEQ ID NO: 20 is the amino acid sequence of a mannitol dehydrogenase (XP-002436634) from Sorghum.
[0045] SEQ ID NO: 21 is a nucleotide sequence from a mannitol dehydrogenase gene region (NC-012879) from Sorghum.
[0046] SEQ ID NO: 22 is a nucleotide sequence of a putative 5'UTR-Promoter region from a mannitol dehydrogenase gene (NC-012879) from Sorghum.
DETAILED DESCRIPTION OF THE INVENTION
[0047] The disclosure of all patents, patent applications, and publications cited herein are incorporated by reference in their entirety.
[0048] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Unless mentioned otherwise, the techniques employed or contemplated herein are standard methodologies well known to one of ordinary skill in the art. The materials, methods and examples are illustrative only and not limiting.
[0049] As used herein and in the appended claims, the singular forms "a", "an", and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a plant" includes a plurality of such plants, reference to "a cell" includes one or more cells and equivalents thereof known to those skilled in the art, and so forth.
[0050] In the context of this disclosure, a number of terms shall be utilized. As used herein, a "ZmCAS1 promoter" refers to one type of inducible promoter. The native ZmCAS1 promoter is the promoter of a maize gene isolated from a CBSU-Anther_Subtraction library with significant homology to mannitol dehydrogenase genes identified in various plant species including maize that are deposited in National Center for Biotechnology Information (NCBI) database. The "ZmCAS1 promoter", as used herein, also refers to fragments of the full-length native promoter that retain significant promoter activity. For example, a ZmCAS1 promoter can be 1.7 kb in length (SEQ ID NO: 10) or a promoter-functioning fragment thereof, which includes, among others, the polynucleotide of SEQ ID NO: 9. A ZmCAS1 promoter also includes variants that are substantially similar and functionally equivalent to any portion of the nucleotide sequence, in increments of one base pair, between the 1.0 kb (SQE ID NO:9) and 1.7 kb (SEQ ID NO:10) fragments and sequences.
[0051] The term "Promoter" refers to a nucleotide sequence capable of regulating the expression of a coding sequence or functional RNA. Functional RNA includes, but is not limited to, transfer RNA (tRNA) and ribosomal RNA (rRNA). The promoter sequence consists of proximal and more distal upstream elements, the latter elements often referred to as enhancers. The promoter usually comprises a TATA box capable of directing RNA polymerase II to initiate RNA synthesis at the appropriate transcription initiation site for a particular coding sequence. A promoter can additionally comprise other recognition sequences generally positioned upstream or 5' to the TATA box, referred to as upstream promoter elements, which influence the transcription initiation rate. It is recognized that having identified the nucleotide sequences for the promoter region disclosed herein, it is within the state of the art to isolate and identify further regulatory elements in the region upstream of the TATA box from the particular promoter region identified herein. Accordingly, an "enhancer" is a DNA sequence which can stimulate promoter activity and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue-specificity of a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental or abiotic conditions.
[0052] The promoter elements which enable the inducible expression in the desired tissue can be identified, isolated, and used with other core promoters to confirm inducible expression. By core promoter is meant the minimal sequence required to initiate transcription, such as the sequence called the TATA box which is common to promoters in genes encoding proteins. Thus, the ZmCAS1 promoter can optionally be used in conjunction with its own or core promoters from other sources. The promoter may be native or non-native to the cell in which it is found.
[0053] Promoters which cause a gene to be expressed in most cell types at most times are commonly referred to as "constitutive promoters". New promoters of various types useful in plant cells are constantly being discovered; numerous examples may be found in the compilation by Okamuro and Goldberg (Biochemistry of Plants 15:1-82 (1989)). It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of some variation may have identical promoter activity.
[0054] High level, constitutive expression of the candidate gene under control of the 35S or UBI promoter may have pleiotropic effects, although candidate gene efficacy may be estimated when driven by a constitutive promoter. Use of inducible or stress-specific promoters may eliminate undesirable effects but retain the ability to enhance drought tolerance. This effect has been observed in Arabidopsis (Kasuga et al. (1999) Nature Biotechnol. 17:287-91).
[0055] The term "inducible promoter" refers to promoters that selectively express a coding sequence or functional RNA in response to the presence of an endogenous or exogenous stimulus, for example by chemical compounds (chemical inducers) or in response to environmental, hormonal, chemical, and/or developmental signals. Inducible or regulated promoters include, for example, promoters induced or regulated by light, heat, stress, flooding or drought, salt stress, osmotic stress, phytohormones, wounding, or chemicals such as ethanol, abscisic acid (ABA), jasmonate, salicylic acid, or safeners.
[0056] An example of a stress-inducible is RD29A promoter (Kasuga et al. (1999) Nature Biotechnol. 17:287-91). One of ordinary skill in the art is familiar with protocols for simulating drought conditions and for evaluating drought tolerance of plants that have been subjected to simulated or naturally-occurring drought conditions. For example, one can simulate drought conditions by giving plants less water than normally required or no water over a period of time, and one can evaluate drought tolerance by looking for differences in physiological and/or physical condition, including (but not limited to) vigor, growth, size, or root length, or in particular, leaf color or leaf area size. Other techniques for evaluating drought tolerance include measuring chlorophyll fluorescence, photosynthetic rates and gas exchange rates. Also, one of ordinary skill in the art is familiar with protocols for simulating stress conditions such as osmotic stress, salt stress and temperature stress and for evaluating stress tolerance of plants that have been subjected to simulated or naturally-occurring stress conditions.
[0057] The sequences of the invention may be isolated from any plant, including, but not limited to corn (Zea mays), canola (Brassica napus, Brassica rapa ssp.), alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), sunflower (Helianthus annuus), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), millet (Panicum spp.), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (Ilpomoea batatus), cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), oats (Avena sativa), barley (Hordeum vulgare), vegetables, ornamentals, and conifers. Preferably, plants include corn, soybean, sunflower, safflower, canola, wheat, barley, rye, alfalfa, rice, cotton and sorghum.
[0058] This invention concerns an isolated nucleic acid fragment comprising an inducible ZmCAS1 promoter. This invention also concerns an isolated nucleic acid fragment comprising a promoter wherein said promoter consists essentially of the nucleotide sequence set forth in SEQ ID NO:9, or said promoter consists essentially of a fragment that is substantially similar and functionally equivalent to the nucleotide sequence set forth in SEQ ID NO:10. A nucleic acid fragment that is functionally equivalent to the instant ZmCAS1 promoter is any nucleic acid fragment that is capable of controlling the expression of a coding sequence or functional RNA in a similar manner to the ZmCAS1 promoter. The expression patterns of ZmCAS1 gene and its promoter are set forth in Examples 1-3.
[0059] The promoter activity of the maize genomic DNA fragment SEQ ID NO:9 or SEQ ID NO:10 upstream of the ZmCAS1 protein coding sequence was assessed by linking the fragment to a GUS gene or a MS45 gene, transforming the promoter:GUS (or MS45) expression cassette into maize, and analyzing GUS (or MS45) expression in various cell types of the transgenic plants (Examples 1-3). These results indicated that the nucleic acid fragment contained an inducible promoter.
[0060] In one embodiment, the invention is an isolated polynucleotide comprising, or consisting essentially of or consisting of:
[0061] a) a nucleotide sequence comprising the sequence set forth in SEQ ID NO:9 or a full-length complement thereof;
[0062] b) a nucleotide sequence comprising a fragment of SEQ ID NO:10, or a full-length complement thereof
[0063] c) a nucleotide sequence comprising a sequence having at least 90% sequence identity, based on the BLASTN method of alignment, when compared to the nucleotide sequence of (a) or (b);
[0064] d) a nucleotide sequence comprising all or a fragment of a 1.7 kb 5' non-coding sequence of a mannitol dehydrogenase; or,
[0065] e) a derivative of one of the nucleotide sequences indicated in (a), (b), or (c) obtained by substitution, addition and/or deletion of one or more nucleotides; and,
[0066] wherein said nucleotide sequence is an inducible promoter.
[0067] In another embodiment of the invention the ZmCAS1 promoter is induced by a safener treatment of N-(aminocarbonyl)-2-chlorobenzenesulfonamide (2-CBSU). In another embodiment of the invention the ZmCAS1 promoter is induced by a heat treatment of a temperature greater than 26° C. and up to and including 37° C.
[0068] The terms "N-(aminocarbonyl)-2-chlorobenzenesulfonamide", 2-CBSU" and "CBSU" are used interchangeably herein.
[0069] The promoter nucleotide sequences and methods disclosed herein are useful in regulating inducible expression of any heterologous nucleotide sequences in a host plant in order to alter the phenotype of a plant.
[0070] Various changes in phenotype are of interest including, but not limited to, modifying the fatty acid composition in a plant, altering the amino acid content of a plant, altering a plant's pathogen defense mechanism, and the like. These results can be achieved by providing expression of heterologous products or increased expression of endogenous products in plants. Alternatively, the results can be achieved by providing for a reduction of expression of one or more endogenous products, particularly enzymes or cofactors in the plant. These changes result in a change in phenotype of the transformed plant.
[0071] Genes of interest are reflective of the commercial markets and interests of those involved in the development of the crop. Crops and markets of interest change, and as developing nations open up world markets, new crops and technologies will emerge also. In addition, as our understanding of agronomic characteristics and traits such as yield and heterosis increase, the choice of genes for transformation will change accordingly. General categories of genes of interest include, but are not limited to, those genes involved in information, such as zinc fingers, those involved in communication, such as kinases, and those involved in housekeeping, such as heat shock proteins. Other gene of interest are genes allowing for site specific gene integration and gene stacking include, but not limited to, double-strand break inducing genes and recombinase genes. More specific categories of transgenes, for example, include, but are not limited to, genes encoding important traits for agronomics, insect resistance, disease resistance, herbicide resistance, sterility, grain or seed characteristics, and commercial products. Genes of interest include, generally, those involved in oil, starch, carbohydrate, or nutrient metabolism as well as those affecting seed size, plant development, plant growth regulation, and yield improvement. Plant development and growth regulation also refer to the development and growth regulation of various parts of a plant, such as the flower, seed, root, leaf and shoot.
[0072] Other commercially desirable traits are genes and proteins conferring cold, heat, salt, and drought resistance.
[0073] One embodiment of the invention relates to a recombinant DNA comprising the isolated polynucleotide of the invention operably linked to at least one heterologous nucleic acid sequence, wherein the heterologous nucleic acid sequence codes for a gene selected from the group consisting of: a double-strand break inducing gene, a recombinase gene, a reporter gene, a selection marker, a disease resistance conferring gene, a herbicide resistance conferring gene, an insect resistance conferring gene; a gene involved in carbohydrate metabolism, a gene involved in fatty acid metabolism, a gene involved in amino acid metabolism, a gene involved in plant development, a gene involved in plant growth regulation, a gene involved in yield improvement, a gene involved in drought resistance, a gene involved in cold resistance, a gene involved in heat and salt resistance in plants.
[0074] Another embodiment of the invention relates to a recombinant DNA comprising the isolated polynucleotide of the invention operably linked to at least one heterologous nucleic acid sequence, wherein the heterologous nucleic acid sequence encodes a protein selected from the group consisting of: a double-strand break inducing protein, a recombinase protein, a reporter protein, a selection marker, a protein conferring disease resistance, protein conferring herbicide resistance, protein conferring insect resistance; protein involved in carbohydrate metabolism, protein involved in fatty acid metabolism, protein involved in amino acid metabolism, protein involved in plant development, protein involved in plant growth regulation, protein involved in yield improvement, protein involved in drought resistance, protein involved in cold resistance, protein involved in heat resistance and salt resistance in plants.
[0075] One embodiment of the invention, comprises a plant (for example, maize or a soybean plant) comprising in its genome a recombinant DNA construct comprising a polynucleotide operably linked to a promoter fragment of the invention, wherein said promoter fragment comprises at least 40%, 45%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity, based on the Clustal V method of alignment, when compared to SEQ ID NO: 9 or 10, and wherein said plant exhibits an alteration of at least one agronomic characteristic when compared to a control plant not comprising said recombinant DNA construct.
[0076] Another embodiment of the invention, comprises a plant (for example, maize or a soybean plant) comprising in its genome a suppression DNA construct comprising a promoter fragment of the invention, wherein said promoter fragment comprises at least 40%, 45%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity, based on the Clustal V method of alignment, when compared to SEQ ID NOs: 9 or 10, and wherein said plant exhibits an alteration of at least one agronomic characteristic when compared to a control plant not comprising said recombinant DNA construct.
[0077] In any of the foregoing embodiments or any other embodiments of the present invention, the at least one agronomic characteristic may be selected from the group consisting of greenness, yield, growth rate, biomass, fresh weight at maturation, dry weight at maturation, fruit yield, seed yield, total plant nitrogen content, fruit nitrogen content, seed nitrogen content, nitrogen content in a vegetative tissue, total plant free amino acid content, fruit free amino acid content, seed free amino acid content, free amino acid content in a vegetative tissue, total plant protein content, fruit protein content, seed protein content, protein content in a vegetative tissue, drought tolerance, nitrogen uptake, root lodging, harvest index, stalk lodging, plant height, ear height, ear length, early seedling vigor and seedling emergence under low temperature stress. For example, the alteration of at least one agronomic characteristic may be an increase in yield, greenness or biomass.
[0078] Disease and/or insect resistance genes may encode resistance to pests that have great yield drag such as for example, anthracnose, soybean mosaic virus, soybean cyst nematode, root-knot nematode, brown leaf spot, Downy mildew, purple seed stain, seed decay and seedling diseases caused commonly by the fungi--Pythium sp., Phytophthora sp., Rhizoctonia sp., Diaporthe sp. Bacterial blight caused by the bacterium Pseudomonas syringae pv. Glycinea. Genes conferring insect resistance include, for example, Bacillus thuringiensis toxic protein genes (U.S. Pat. Nos. 5,366,892; 5,747,450; 5,737,514; 5,723,756; 5,593,881; and Geiser et al (1986) Gene 48:109); lectins (Van Damme et al. (1994) Plant Mol. Biol. 24:825); and the like.
[0079] Herbicide resistance traits may include genes coding for resistance to herbicides that act to inhibit the action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g., the acetolactate synthase ALS gene containing mutations leading to such resistance, in particular the S4 and/or HRA mutations). The ALS-gene mutants encode resistance to the herbicide chlorsulfuron. Glyphosate acetyl transferase (GAT) is an N-acetyltransferase from Bacillus licheniformis that was optimized by gene shuffling for acetylation of the broad spectrum herbicide, glyphosate, forming the basis of a novel mechanism of glyphosate tolerance in transgenic plants (Castle et al. (2004) Science 304, 1151-1154).
[0080] Antibiotic resistance genes include, for example, neomycin phosphotransferase (npt) and hygromycin phosphotransferase (hpt). Two neomycin phosphotransferase genes are used in selection of transformed organisms: the neomycin phosphotransferase I (nptI) gene and the neomycin phosphotransferase II (nptII) gene. The second one is more widely used. It was initially isolated from the transposon Tn5 that was present in the bacterium strain Escherichia coli K12. The gene codes for the aminoglycoside 3'-phosphotransferase (denoted aph(3')-II or NPTII) enzyme, which inactivates by phosphorylation a range of aminoglycoside antibiotics such as kanamycin, neomycin, geneticin and paroromycin. NPTII is widely used as a selectable marker for plant transformation. It is also used in gene expression and regulation studies in different organisms in part because N-terminal fusions can be constructed that retain enzyme activity. NPTII protein activity can be detected by enzymatic assay. In other detection methods, the modified substrates, the phosphorylated antibiotics, are detected by thin-layer chromatography, dot-blot analysis or polyacrylamide gel electrophoresis. Plants such as maize, cotton, tobacco, Arabidopsis, flax, soybean and many others have been successfully transformed with the nptII gene.
[0081] The hygromycin phosphotransferase (denoted hpt, hph or aphIV) gene was originally derived from Escherichia coli. The gene codes for hygromycin phosphotransferase (HPT), which detoxifies the aminocyclitol antibiotic hygromycin B. A large number of plants have been transformed with the hpt gene and hygromycin B has proved very effective in the selection of a wide range of plants, including monocotyledonous. Most plants exhibit higher sensitivity to hygromycin B than to kanamycin, for instance cereals. Likewise, the hpt gene is used widely in selection of transformed mammalian cells. The sequence of the hpt gene has been modified for its use in plant transformation. Deletions and substitutions of amino acid residues close to the carboxy (C)-terminus of the enzyme have increased the level of resistance in certain plants, such as tobacco. At the same time, the hydrophilic C-terminus of the enzyme has been maintained and may be essential for the strong activity of HPT. HPT activity can be checked using an enzymatic assay. A non-destructive callus induction test can be used to verify hygromycin resistance.
[0082] Genes involved in plant growth and development have been identified in plants. One such gene, which is involved in cytokinin biosynthesis, is isopentenyl transferase (IPT). Cytokinin plays a critical role in plant growth and development by stimulating cell division and cell differentiation (Sun et al. (2003), Plant Physiol. 131: 167-176).
[0083] Calcium-dependent protein kinases (CDPK), a family of serine-threonine kinase found primarily in the plant kingdom, are likely to function as sensor molecules in calcium-mediated signaling pathways. Calcium ions are important second messengers during plant growth and development (Harper et al. Science 252, 951-954 (1993); Roberts et al. Curr Opin Cell Biol 5, 242-246 (1993); Roberts et al. Annu Rev Plant Mol Biol 43, 375-414 (1992)).
[0084] Nematode responsive protein (NRP) is produced by soybean upon the infection of soybean cyst nematode. NRP has homology to a taste-modifying glycoprotein miraculin and the NF34 protein involved in tumor formation and hyper response induction. NRP is believed to function as a defense-inducer in response to nematode infection (Tenhaken et al. BMC Bioinformatics 6:169 (2005)).
[0085] The quality of seeds and grains is reflected in traits such as levels and types of fatty acids or oils, saturated and unsaturated, quality and quantity of essential amino acids, and levels of carbohydrates. Therefore, commercial traits can also be encoded on a gene or genes that could increase for example methionine and cysteine, two sulfur containing amino acids that are present in low amounts in soybeans. Cystathionine gamma synthase (CGS) and serine acetyl transferase (SAT) are proteins involved in the synthesis of methionine and cysteine, respectively.
[0086] Other commercial traits can encode genes to increase for example monounsaturated fatty acids, such as oleic acid, in oil seeds. Soybean oil for example contains high levels of polyunsaturated fatty acids and is more prone to oxidation than oils with higher levels of monounsaturated and saturated fatty acids. High oleic soybean seeds can be prepared by recombinant manipulation of the activity of oleoyl 12-desaturase (Fad2). High oleic soybean oil can be used in applications that require a high degree of oxidative stability, such as cooking for a long period of time at an elevated temperature.
[0087] Raffinose saccharides accumulate in significant quantities in the edible portion of many economically significant crop species, such as soybean (Glycine max L. Merrill), sugar beet (Beta vulgaris), cotton (Gossypium hirsutum L.), canola (Brassica sp.) and all of the major edible leguminous crops including beans (Phaseolus sp.), chick pea (Cicer arietinum), cowpea (Vigna unguiculata), mung bean (Vigna radiata), peas (Pisum sativum), lentil (Lens culinaris) and lupine (Lupinus sp.). Although abundant in many species, raffinose saccharides are an obstacle to the efficient utilization of some economically important crop species.
[0088] Down regulation of the expression of the enzymes involved in raffinose saccharide synthesis, such as galactinol synthase for example, would be a desirable trait.
[0089] In certain embodiments, the present invention contemplates the transformation of a recipient cell with more than one advantageous transgene. Two or more transgenes can be supplied in a single transformation event using either distinct transgene-encoding vectors, or a single vector incorporating two or more gene coding sequences. Any two or more transgenes of any description, such as those conferring herbicide, insect, disease (viral, bacterial, fungal, and nematode) or drought resistance, oil quantity and quality, or those increasing yield or nutritional quality may be employed as desired.
[0090] The term "Anther" or "Anther tissue" refers to male plant tissue encompassing cells, cell-layers and cell types that give rise to pollen grains capable of effecting fertilization. These cells include but are not limited to archesporial cells, pollen mother cells, meiocytes, microspores, tapetum, supporting cell layers, pollen and cells derived from these cell types.
[0091] An "isolated nucleic acid fragment" refers to a polymer of ribonucleotides (RNA) or deoxyribonucleotides (DNA) that is single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases. An isolated nucleic acid fragment in the form of DNA may be comprised of one or more segments of cDNA, genomic DNA or synthetic DNA.
[0092] The terms "polynucleotide", "polynucleotide sequence", "nucleic acid sequence", and "nucleic acid fragment"/"isolated nucleic acid fragment" are used interchangeably herein. These terms encompass nucleotide sequences and the like. A polynucleotide may be a polymer of RNA or DNA that is single- or double-stranded, that optionally contains synthetic, non-natural or altered nucleotide bases. A polynucleotide in the form of a polymer of DNA may be comprised of one or more segments of cDNA, genomic DNA, synthetic DNA, or mixtures thereof. Nucleotides (usually found in their 5'-monophosphate form) are referred to by a single letter designation as follows: "A" for adenylate or deoxyadenylate (for RNA or DNA, respectively), "C" for cytidylate or deoxycytidylate, "G" for guanylate or deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" for purines (A or G), "Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N" for any nucleotide.
[0093] A "heterologous nucleic acid fragment" refers to a sequence that is not naturally occurring with the plant promoter sequence of the invention. While this nucleotide sequence is heterologous to the promoter sequence, it may be homologous, or native, or heterologous, or foreign, to the plant host. However, it is recognized that the instant promoters may be used with their native coding sequences to increase or decrease expression resulting in a change in phenotype in the transformed seed.
[0094] The terms "fragment (or variant) that is functionally equivalent" and "functionally equivalent fragment (or variant)" are used interchangeably herein. These terms refer to a portion or subsequence or variant of the promoter sequence of the present invention in which the ability to initiate transcription or drive gene expression (such as to produce a certain phenotype) is retained. Fragments and variants can be obtained via methods such as site-directed mutagenesis and synthetic construction. As with the provided promoter sequences described herein, the contemplated fragments and variants operate to promote inducible expression of an operably linked heterologous nucleic acid sequence, forming a recombinant DNA construct (also, a chimeric gene). For example, the fragment or variant can be used in the design of recombinant DNA constructs to produce the desired phenotype in a transformed plant. Recombinant DNA constructs can be designed for use in co-suppression or antisense by linking a promoter fragment or variant thereof in the appropriate orientation relative to a heterologous nucleotide sequence.
[0095] A functional fragment of the regulatory sequence can be formed by one or more deletions from a larger sequence. For example, the 5' portion of a promoter up to the TATA box near the transcription start site can be deleted without abolishing promoter activity, as described by Opsahl-Sorteberg, H-G. et al., "Identification of a 49-bp fragment of the HvLTP2 promoter directing aleruone cell specific expression" Gene 341:49-58 (2004). Such variants should retain promoter activity. Activity can be measured by Norther blot analysis, reporter activity measurements when using transcriptional fusions, and the like. See, for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.), herein incorporated by reference.
[0096] Sequences which hybridize to the regulatory sequences of the present invention are within the scope of the invention. Sequences that correspond to the promoter sequences of the present invention and hybridize to the promoter sequences disclosed herein will be at least 40% homologous, 50% homologous, 70% homologous, and even 85% 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% homologous or more with the disclosed sequence.
[0097] Smaller fragments may yet contain the regulatory properties of the promoter so identified and deletion analysis is one method of identifying essential regions. Deletion analysis can occur from both the 5' and 3' ends of the regulatory region. Fragments can be obtained by site-directed mutagenesis, mutagenesis using the polymerase chain reaction and the like. (See, Directed Mutagenesis: A Practical Approach IRL Press (1991)).
[0098] In some aspects of the present invention, the promoter fragments can comprise at least about 20 contiguous nucleotides, or at least about 50 contiguous nucleotides, or at least about 75 contiguous nucleotides, or at least about 100, 150, 200, 250, 300, 350, 400, 450, 500 contiguous nucleotides of SEQ ID NO:8 or up to the number of nucleotides present in a full-length nucleotide sequence disclosed herein (for example 1746, SEQ ID NO: 10).
[0099] In another aspect, a promoter fragment is the nucleotide sequence set forth in SEQ ID NO: 9 or SEQ ID NO: 10. The nucleotides of such fragments will usually comprise the TATA recognition sequence of the particular promoter sequence. Such fragments may be obtained by use of restriction enzymes to cleave the naturally occurring promoter nucleotide sequences disclosed herein, by synthesizing a nucleotide sequence from the naturally occurring promoter DNA sequence, or may be obtained through the use of PCR technology. See particularly, Mullis et al., Methods Enzymol. 155:335-350 (1987), and Higuchi, R. In PCR Technology: Principles and Applications for DNA Amplifications; Erlich, H. A., Ed.; Stockton Press Inc.: New York, 1989.
[0100] The isolated promoter sequences of the present invention can be modified to provide a range of inducible expression levels of the heterologous nucleotide sequence. Thus, less than the entire promoter regions may be utilized and the ability to drive expression of the coding sequence retained. As described in Examples 1-3, the 1.0 kb ZmCAS1 promoter fragment as well as the longer 1.7 kb ZmCAS1 promoter fragment were able to drive gene expression when induced by a chemical or stress treatment.
[0101] Modifications of the isolated promoter sequences of the present invention can provide for a range of inducible expression of the heterologous nucleotide sequence. Thus, they may be modified to be weak inducible promoters or strong inducible promoters. Generally, by "weak promoter" is intended a promoter that drives expression of a coding sequence at a low level. By "low level" is intended at levels about 1/10,000 transcripts to about 1/100,000 transcripts to about 1/500,000 transcripts. Conversely, a strong promoter drives expression of a coding sequence at high level, or at about 1/10 transcripts to about 1/100 transcripts to about 1/1,000 transcripts.
[0102] The terms "substantially similar" and "corresponding substantially" as used herein refer to nucleic acid fragments wherein changes in one or more nucleotide bases do not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype. These terms also refer to modifications of the nucleic acid fragments of the instant invention such as deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the initial, unmodified fragment. It is therefore understood, as those skilled in the art will appreciate, that the invention encompasses more than the specific exemplary sequences.
[0103] Moreover, the skilled artisan recognizes that substantially similar nucleic acid sequences encompassed by this invention are also defined by their ability to hybridize, under moderately stringent conditions (for example, 0.5×SSC, 0.1% SDS, 60° C.) with the sequences exemplified herein, or to any portion of the nucleotide sequences reported herein and which are functionally equivalent to the promoter of the invention. Estimates of such homology are provided by either DNA-DNA or DNA-RNA hybridization under conditions of stringency as is well understood by those skilled in the art (Hames and Higgins, Eds.; In Nucleic Acid Hybridisation; IRL Press: Oxford, U.K., 1985). Stringency conditions can be adjusted to screen for moderately similar fragments, such as homologous sequences from distantly related organisms, to highly similar fragments, such as genes that duplicate functional enzymes from closely related organisms. Post-hybridization washes partially determine stringency conditions. One set of conditions uses a series of washes starting with 6×SSC, 0.5% SDS at room temperature for 15 min, then repeated with 2×SSC, 0.5% SDS at 45° C. for 30 min, and then repeated twice with 0.2×SSC, 0.5% SDS at 50° C. for 30 min. Another set of stringent conditions uses higher temperatures in which the washes are identical to those above except for the temperature of the final two 30 min washes in 0.2×SSC, 0.5% SDS was increased to 60° C. Another set of highly stringent conditions uses two final washes in 0.1×SSC, 0.1% SDS at 65° C.
[0104] In general, sequences that correspond to the nucleotide sequences of the present invention and hybridize to the nucleotide sequence disclosed herein will be at least 40% homologous, 50% homologous, 70% homologous, and even 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% homologous or more with the disclosed sequence. That is, the sequence similarity between probe and target may range, sharing at least about 40%, about 50%, about 70%, and even about 85% or more sequence similarity.
[0105] Preferred substantially similar nucleic acid sequences encompassed by this invention are those sequences that are 80% identical to the nucleic acid fragments reported herein or which are 80% identical to any portion of the nucleotide sequences reported herein. More preferred are nucleic acid fragments which are 90% identical to the nucleic acid sequences reported herein, or which are 90% identical to any portion of the nucleotide sequences reported herein. Most preferred are nucleic acid fragments which are 95% identical to the nucleic acid sequences reported herein, or which are 95% identical to any portion of the nucleotide sequences reported herein. It is well understood by one skilled in the art that many levels of sequence identity are useful in identifying related polynucleotide sequences. Useful examples of percent identities are those listed above, or also preferred is any integer percentage from 80% to 100%, such as 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98 and 99%.
[0106] A "substantially homologous sequence" refers to variants of the disclosed sequences such as those that result from site-directed mutagenesis, as well as synthetically derived sequences. A substantially homologous sequence of the present invention also refers to those fragments of a particular promoter nucleotide sequence disclosed herein that operate to promote the inducible expression of an operably linked heterologous nucleic acid fragment. These promoter fragments will comprise at least about 20 contiguous nucleotides, preferably at least about 50 contiguous nucleotides, more preferably at least about 75 contiguous nucleotides, even more preferably at least about 100 contiguous nucleotides of the particular promoter nucleotide sequence disclosed herein. The nucleotides of such fragments will usually comprise the TATA recognition sequence of the particular promoter sequence. Such fragments may be obtained by use of restriction enzymes to cleave the naturally occurring promoter nucleotide sequences disclosed herein; by synthesizing a nucleotide sequence from the naturally occurring promoter DNA sequence; or may be obtained through the use of PCR technology. See particularly, Mullis et al., Methods Enzymol. 155:335-350 (1987), and Higuchi, R. In PCR Technology: Principles and Applications for DNA Amplifications; Erlich, H. A., Ed.; Stockton Press Inc.: New York, 1989. Again, variants of these promoter fragments, such as those resulting from site-directed mutagenesis, are encompassed by the compositions of the present invention.
[0107] "Codon degeneracy" refers to divergence in the genetic code permitting variation of the nucleotide sequence without affecting the amino acid sequence of an encoded polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, it is desirable to design the nucleic acid fragment such that its frequency of codon usage approaches the frequency of preferred codon usage of the host cell.
[0108] Sequence alignments and percent similarity calculations may be determined using the Megalign program of the LASARGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.) or using the AlignX program of the Vector NTI bioinformatics computing suite (Invitrogen). Multiple alignment of the sequences are performed using the Clustal method of alignment (Higgins and Sharp, CABIOS 5:151-153 (1989)) with the default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwise alignments and calculation of percent identity of protein sequences using the Clustal method are KTUPLE=1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. For nucleic acids these parameters are GAP PENALTY=10, GAP LENGTH PENALTY=10, KTUPLE=2, GAP PENALTY=5. WINDOW=4 and DIAGONALS SAVED=4. A "substantial portion" of an amino acid or nucleotide sequence comprises enough of the amino acid sequence of a polypeptide or the nucleotide sequence of a gene to afford putative identification of that polypeptide or gene, either by manual evaluation of the sequence by one skilled in the art, or by computer-automated sequence comparison and identification using algorithms such as BLAST (Altschul, S. F. et al., J. Mol. Biol. 215:403-410 (1993)) and Gapped Blast (Altschul, S. F. et al., Nucleic Acids Res. 25:3389-3402 (1997)). BLASTN refers to a BLAST program that compares a nucleotide query sequence against a nucleotide sequence database.
[0109] "Gene" refers to a nucleic acid fragment that expresses a specific protein, including regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its own regulatory sequences. "Chimeric gene" or "recombinant expression construct", which are used interchangeably, refers to any gene that is not a native gene, comprising regulatory and coding sequences that are not found together in nature. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. "Endogenous gene" refers to a native gene in its natural location in the genome of an organism. A "foreign" gene refers to a gene not normally found in the host organism, but that is introduced into the host organism by gene transfer. Foreign genes can comprise native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene that has been introduced into the genome by a transformation procedure.
[0110] "Coding sequence" refers to a DNA sequence which codes for a specific amino acid sequence. "Regulatory sequences" refer to nucleotide sequences located upstream (5' non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.
[0111] The "5' non-coding sequences" refer to DNA sequences located upstream of a coding sequence which influence the transcription, RNA processing or stability, or translation of the associated coding sequence.
[0112] An "intron" is an intervening sequence in a gene that is transcribed into RNA but is then excised in the process of generating the mature mRNA. The term is also used for the excised RNA sequences. An "exon" is a portion of the sequence of a gene that is transcribed and is found in the mature messenger RNA derived from the gene, but is not necessarily a part of the sequence that encodes the final gene product.
[0113] The term "constitutive promoter" refers to promoters active in all or most tissues of a plant at all or most developing stages. As with other promoters classified as "constitutive" (e.g. ubiquitin), some variation in absolute levels of expression can exist among different tissues or stages.
[0114] The term "constitutive promoter" or "tissue-independent" are used interchangeably herein
[0115] The term "tissue specific promoter" refers to promoters that have been shown to direct RNA synthesis at higher levels only in particular types of cells or tissues and are often referred to as "tissue specific promoters", or "tissue-preferred promoters" if the promoters direct RNA synthesis preferably in certain tissues but also in other tissues at reduced levels.
[0116] Among the most commonly used promoters are the nopaline synthase (NOS) promoter (Ebert et al., Proc. Natl. Acad. Sci. U.S.A. 84:5745-5749 (1987)), the octapine synthase (OCS) promoter, caulimovirus promoters such as the cauliflower mosaic virus (CaMV) 19S promoter (Lawton et al., Plant Mol. Biol. 9:315-324 (1987)), the CaMV 35S promoter (Odell et al., Nature 313:810-812 (1985)), and the figwort mosaic virus 35S promoter (Sanger et al., Plant Mol. Biol. 14:433-43 (1990)), the light inducible promoter from the small subunit of rubisco, the Adh promoter (Walker et al., Proc. Natl. Acad. Sci. U.S.A. 84:6624-66280 (1987), the sucrose synthase promoter (Yang et al., Proc. Natl. Acad. Sci. U.S.A. 87:4144-4148 (1990)), the R gene complex promoter (Chandler et al., Plant Cell 1:1175-1183 (1989)), the chlorophyll a/b binding protein gene promoter, etc. Other commonly used promoters are, the promoters for the potato tuber ADPGPP genes, the sucrose synthase promoter, the granule bound starch synthase promoter, the glutelin gene promoter, the maize waxy promoter, Brittle gene promoter, and Shrunken 2 promoter, the acid chitinase gene promoter, and the zein gene promoters (15 kD, 16 kD, 19 kD, 22 kD, and 27 kD; Perdersen et al., Cell 29:1015-1026 (1982)). A plethora of promoters is described in PCT Publication No. WO 00/18963 published on Apr. 6, 2000, the disclosure of which is hereby incorporated by reference.
[0117] The "translation leader sequence" refers to a DNA sequence located between the promoter sequence of a gene and the coding sequence. The translation leader sequence is present in the fully processed mRNA upstream of the translation start sequence. The translation leader sequence may affect processing of the primary transcript to mRNA, mRNA stability or translation efficiency. Examples of translation leader sequences have been described (Turner, R. and Foster, G. D., Molecular Biotechnology 3:225 (1995)).
[0118] The "3' non-coding sequences" refer to DNA sequences located downstream of a coding sequence and include polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exemplified by Ingelbrecht et al., Plant Cell 1:671-680 (1989).
[0119] "RNA transcript" refers to a product resulting from RNA polymerase-catalyzed transcription of a DNA sequence. When an RNA transcript is a perfect complementary copy of a DNA sequence, it is referred to as a primary transcript or it may be a RNA sequence derived from posttranscriptional processing of a primary transcript and is referred to as a mature RNA. "Messenger RNA" ("mRNA") refers to RNA that is without introns and that can be translated into protein by the cell. "cDNA" refers to a DNA that is complementary to and synthesized from an mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into the double-stranded by using the Klenow fragment of DNA polymerase I. "Sense" RNA refers to RNA transcript that includes mRNA and so can be translated into protein within a cell or in vitro. "Antisense RNA" refers to a RNA transcript that is complementary to all or part of a target primary transcript or mRNA and that blocks expression or transcripts accumulation of a target gene (U.S. Pat. No. 5,107,065). The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e. at the 5' non-coding sequence, 3' non-coding sequence, introns, or the coding sequence. "Functional RNA" refers to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet has an effect on cellular processes.
[0120] The term "operably linked" refers to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. For example, a promoter is operably linked with a coding sequence when it is capable of affecting the expression of that coding sequence (i.e., that the coding sequence is under the transcriptional control of the promoter). Coding sequences can be operably linked to regulatory sequences in sense or antisense orientation.
[0121] The term "expression", as used herein, refers to the production of a functional end-product e.g., an mRNA or a protein (precursor or mature).
[0122] The term "expression cassette" as used herein, refers to a discrete nucleic acid fragment into which a nucleic acid sequence or fragment can be moved.
[0123] Expression or overexpression of a gene involves transcription of the gene and translation of the mRNA into a precursor or mature protein. "Antisense inhibition" refers to the production of antisense RNA transcripts capable of suppressing the expression of the target protein. "Overexpression" refers to the production of a gene product in transgenic organisms that exceeds levels of production in normal or non-transformed organisms. "Co-suppression" refers to the production of sense RNA transcripts capable of suppressing the expression or transcript accumulation of identical or substantially similar foreign or endogenous genes (U.S. Pat. No. 5,231,020). The mechanism of co-suppression may be at the DNA level (such as DNA methylation), at the transcriptional level, or at post-transcriptional level.
[0124] Co-suppression constructs in plants previously have been designed by focusing on overexpression of a nucleic acid sequence having homology to an endogenous mRNA, in the sense orientation, which results in the reduction of all RNA having homology to the overexpressed sequence (see Vaucheret et al., Plant J. 16:651-659 (1998); and Gura, Nature 404:804-808 (2000)). The overall efficiency of this phenomenon is low, and the extent of the RNA reduction is widely variable. Recent work has described the use of "hairpin" structures that incorporate all, or part, of an mRNA encoding sequence in a complementary orientation that results in a potential "stem-loop" structure for the expressed RNA (PCT Publication No. WO 99/53050 published on Oct. 21, 1999; and PCT Publication No. WO 02/00904 published on Jan. 3, 2002). This increases the frequency of co-suppression in the recovered transgenic plants. Another variation describes the use of plant viral sequences to direct the suppression, or "silencing", of proximal mRNA encoding sequences (PCT Publication No. WO 98/36083 published on Aug. 20, 1998). Genetic and molecular evidences have been obtained suggesting that dsRNA mediated mRNA cleavage may have been the conserved mechanism underlying these gene silencing phenomena (Elmayan et al., Plant Cell 10:1747-1757 (1998); Galun, In Vitro Cell. Dev. Biol. Plant 41(2):113-123 (2005); Pickford et al, Cell. Mol. Life Sci. 60(5):871-882 (2003)).
[0125] As stated herein, "suppression" refers to a reduction of the level of enzyme activity or protein functionality (e.g., a phenotype associated with a protein) detectable in a transgenic plant when compared to the level of enzyme activity or protein functionality detectable in a non-transgenic or wild type plant with the native enzyme or protein. The level of enzyme activity in a plant with the native enzyme is referred to herein as "wild type" activity. The level of protein functionality in a plant with the native protein is referred to herein as "wild type" functionality. The term "suppression" includes lower, reduce, decline, decrease, inhibit, eliminate and prevent. This reduction may be due to a decrease in translation of the native mRNA into an active enzyme or functional protein. It may also be due to the transcription of the native DNA into decreased amounts of mRNA and/or to rapid degradation of the native mRNA. The term "native enzyme" refers to an enzyme that is produced naturally in a non-transgenic or wild type cell. The terms "non-transgenic" and "wild type" are used interchangeably herein.
[0126] "Altering expression" refers to the production of gene product(s) in transgenic organisms in amounts or proportions that differ significantly from the amount of the gene product(s) produced by the corresponding wild-type organisms (i.e., expression is increased or decreased).
[0127] "Transformation" refers to the transfer of a nucleic acid fragment into the genome of a host organism, resulting in genetically stable inheritance. Host organisms containing the transformed nucleic acid fragments are referred to as "transgenic" organisms. The preferred method of soybean cell transformation is the use of particle-accelerated or "gene gun" transformation technology (Klein, T., Nature (London) 327:70-73 (1987); U.S. Pat. No. 4,945,050).
[0128] "Transient expression" refers to the temporary expression of often reporter genes such as β-glucuronidase (GUS), fluorescent protein genes GFP, ZS-YELLOW1 N1, AM-CYAN1, DS-RED in selected certain cell types of the host organism in which the transgenic gene is introduced temporally by a transformation method. The transformed materials of the host organism are subsequently discarded after the transient gene expression assay.
[0129] Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J. et al., In Molecular Cloning: A Laboratory Manual; 2nd ed.; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, N.Y., 1989 (hereinafter "Sambrook et al., 1989") or Ausubel, F. M., Brent, R., Kingston, R. E., Moore, D. D., Seidman, J. G., Smith, J. A. and Struhl, K., Eds.; In Current Protocols in Molecular Biology; John Wiley and Sons: New York, 1990 (hereinafter "Ausubel et al., 1990").
[0130] "PCR" or "Polymerase Chain Reaction" is a technique for the synthesis of large quantities of specific DNA segments, consisting of a series of repetitive cycles (Perkin Elmer Cetus Instruments, Norwalk, Conn.). Typically, the double stranded DNA is heat denatured, the two primers complementary to the 3' boundaries of the target segment are annealed at low temperature and then extended at an intermediate temperature. One set of these three consecutive steps comprises a cycle.
[0131] The terms "recombinant polynucleotide", "recombinant nucleotide", "recombinant DNA", "recombinant DNA construct" and "recombinant expression construct" are used interchangeably herein. A recombinant DNA construct comprises an artificial or heterologous combination of nucleic acid sequences, e.g., regulatory and coding sequences that are not found together in nature. For example, a recombinant DNA construct can comprise a plasmid vector or a fragment thereof comprising the instant inducible promoter and a heterologous polynucleotide of interest. In other embodiments, a recombinant construct may comprise regulatory sequences and coding sequences that are derived from maize, rice, sorghum or different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that found in nature. Such a construct may be used by itself or may be used in conjunction with a vector. If a vector is used, then the choice of vector is dependent upon the method that will be used to transform host cells as is well known to those skilled in the art. For example, a plasmid vector can be used. The skilled artisan is well aware of the genetic elements that must be present on the vector in order to successfully transform, select and propagate host cells comprising any of the isolated nucleic acid fragments provided herein. The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression (Jones et al., EMBO J. 4:2411-2418 (1985); De Almeida et al., Mol. Gen. Genetics 218:78-86 (1989)), and thus that multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by Southern analysis of DNA, Northern analysis of mRNA expression, immunoblotting analysis of protein expression, or phenotypic analysis, among others.
[0132] It is demonstrated herein that the maize mannitol dehydrogenase gene promoter ZmCAS1 can, in fact, be used as an inducible promoter to drive efficient expression of transgenes, and that such promoter can be isolated and used by one skilled in the art. Induced GUS and MS45 expression has been observed in sink tissues such as anthers, callus, root and shoots of seedlings as well as developing leaves (Examples 1-3)`
[0133] Mannitol metabolism plays an important role in plant responses to both biotic and abiotic stresses. (Stoop et al. 2001, Trends in Plant Science, Volume 1, Issue 5, May 1996, Pages 139-144). Celery plants exposed to high salinity showed an increased mannitol accumulation primarily caused by a decrease in mannitol dehydrogenase activity in sink tissues (Stoop and Pharr. 1993 Plant Physiol. 103:1001-1008). As shown in FIG. 1 B, the ZmCAS1cDNA (SEQ ID NO:5) showed a high % identity with a maize mannitol dehydrogenase (GI:226528549; SEQ ID NO:6), FIG. 1(B). Taken together with our observations that the ZmCAS1 promoter can be induced by a chemical such as a safener, or a stress such as a heat treatment, one can further test the ability of the ZmCAS1 promoter to be responsive to stresses such as, but not limited to, drought, osmotic or salt stress, or a combination thereof.
[0134] It is clear from the disclosure set forth herein that one of ordinary skill in the art could perform the following procedure:
[0135] 1) operably linking the nucleic acid fragment containing the ZMCAS1 promoter sequence to a suitable reporter gene; there are a variety of reporter genes that are well known to those skilled in the art, including the bacterial GUS gene, the firefly luciferase gene, and the cyan, green, red, and yellow fluorescent protein genes; any gene for which an easy and reliable assay is available can serve as the reporter gene.
[0136] 2) transforming a chimeric ZmCAS1 promoter:reporter gene expression cassette into an appropriate plant for expression of the promoter. There are a variety of appropriate plants which can be used as a host for transformation that are well known to those skilled in the art, including the dicots, Arabidopsis, tobacco, soybean, oilseed rape, peanut, sunflower, safflower, cotton, tomato, potato, cocoa and the monocots, corn, wheat, rice, barley and palm.
[0137] 3) testing for expression of the ZmCAS1 promoter in various cell types of transgenic plant tissues, e.g., leaves, roots, flowers, seeds, transformed with the chimeric ZmCAS1 promoter:reporter gene expression cassette by assaying for expression of the reporter gene product.
[0138] In another aspect, this invention concerns a recombinant DNA construct comprising at least one heterologous nucleic acid fragment operably linked to any promoter, or combination of promoter elements, of the present invention. Recombinant DNA constructs can be constructed by operably linking the nucleic acid fragment of the invention promoter or a fragment that is substantially similar and functionally equivalent to any portion of the nucleotide sequence set forth in SEQ ID NOs: 9 or 10 to a heterologous nucleic acid fragment. Any heterologous nucleic acid fragment can be used to practice the invention. The selection will depend upon the desired application or phenotype to be achieved. The various nucleic acid sequences can be manipulated so as to provide for the nucleic acid sequences in the proper orientation. It is believed that various combinations of promoter elements as described herein may be useful in practicing the present invention.
[0139] In another aspect, this invention concerns a recombinant DNA construct comprising at least one acetolactate synthase (ALS) nucleic acid fragment operably linked to ZmCAS1 promoter, or combination of promoter elements, of the present invention. The acetolactate synthase gene is involved in the biosynthesis of branched chain amino acids in plants and is the site of action of several herbicides including sulfonyl urea. Expression of a mutated acetolactate synthase gene encoding a protein that can no longer bind the herbicide will enable the transgenic plants to be resistant to the herbicide (U.S. Pat. No. 5,605,011, U.S. Pat. No. 5,378,824). The mutated acetolactate synthase gene is also widely used in plant transformation to select transgenic plants.
[0140] In another embodiment, this invention concerns host cells comprising either the recombinant DNA constructs of the invention as described herein or isolated polynucleotides of the invention as described herein. Examples of host cells which can be used to practice the invention include, but are not limited to, yeast, bacteria, and plants.
[0141] Plasmid vectors comprising the instant recombinant expression construct can be constructed. The choice of plasmid vector is dependent upon the method that will be used to transform host cells. The skilled artisan is well aware of the genetic elements that must be present on the plasmid vector in order to successfully transform, select and propagate host cells containing the chimeric gene.
[0142] The method of transformation/transfection is not critical to the instant invention; various methods of transformation or transfection are currently available. As newer methods are available to transform crops or other host cells they may be directly applied. Accordingly, a wide variety of methods have been developed to insert a DNA sequence into the genome of a host cell to obtain the transcription or transcript and translation of the sequence to effect phenotypic changes in the organism. Thus, any method which provides for efficient transformation/transfection may be employed.
[0143] Methods for introducing expression vectors into plant tissue available to one skilled in the art are varied and will depend on the plant selected. Procedures for transforming a wide variety of plant species are well known and described throughout the literature. See, for example, Miki et al, "Procedures for Introducing Foreign DNA into Plants" in Methods in Plant Molecular Biotechnology, supra; Klein et al, Bio/Technology 10:268 (1992); and Weising et al., Ann. Rev. Genet. 22: 421-477 (1988). For example, the DNA construct may be introduced into the genomic DNA of the plant cell using techniques such as microprojectile-mediated delivery, Klein et al., Nature 327: 70-73 (1987); electroporation, Fromm et al., Proc. Natl. Acad. Sci. 82: 5824 (1985); polyethylene glycol (PEG) precipitation, Paszkowski et al., EMBO J. 3: 2717-2722 (1984); direct gene transfer WO 85/01856 and EP No. 0 275 069; in vitro protoplast transformation, U.S. Pat. No. 4,684,611; and microinjection of plant cell protoplasts or embryogenic callus, Crossway, Mol. Gen. Genetics 202:179-185 (1985). Co-cultivation of plant tissue with Agrobacterium tumefaciens is another option, where the DNA constructs are placed into a binary vector system. See e.g., U.S. Pat. No. 5,591,616; Ishida et al., "High Efficiency Transformation of Maize (Zea mays L.) mediated by Agrobacterium tumefaciens" Nature Biotechnology 14:745-750 (1996). The virulence functions of the Agrobacterium tumefaciens host will direct the insertion of the construct into the plant cell DNA when the cell is infected by the bacteria. See, for example Horsch et al., Science 233: 496-498 (1984), and Fraley et al., Proc. Natl. Acad. Sci. 80: 4803 (1983).
[0144] Standard methods for transformation of canola are described at Moloney et al. "High Efficiency Transformation of Brassica napus using Agrobacterium Vectors" Plant Cell Reports 8:238-242 (1989). Corn transformation is described by Fromm et al, Bio/Technology 8:833 (1990). Agrobacterium is primarily used in dicots, but certain monocots such as maize can be transformed by Agrobacterium (U.S. Pat. No. 5,550,318). Rice transformation is described by Hiei et al., "Efficient Transformation of Rice (Oryza sativa L.) Mediated by Agrobacterium and Sequence Analysis of the Boundaries of the T-DNA" The Plant Joumal 6(2): 271-282 (1994, Christou et al, Trends in Biotechnology 10:239 (1992) and Lee et al, Proc. Nat'l Acad. Sci. USA 88:6389 (1991). Wheat can be transformed by techniques similar to those used for transforming com or rice. Sorghum transformation is described at Casas et al, supra and sorghum by Wan et al, PlantPhysiol. 104:37 (1994). Soybean transformation is described in a number of publications, including U.S. Pat. No. 5,015,580.
[0145] When referring to "introduction" of the nucleotide sequence into a plant, it is meant that this can occur by direct transformation methods, such as Agrobacterium transformation of plant tissue, microprojectile bombardment, electroporation, or any one of many methods known to one skilled in the art; or, it can occur by crossing a plant having the heterologous nucleotide sequence with another plant so that progeny have the nucleotide sequence incorporated into their genomes. Such breeding techniques are well known to one skilled in the art.
[0146] Methods for transforming dicots, primarily by use of Agrobacterium tumefaciens, and obtaining transgenic plants have been published, among others, for cotton (U.S. Pat. No. 5,004,863, U.S. Pat. No. 5,159,135); soybean (U.S. Pat. No. 5,569,834, U.S. Pat. No. 5,416,011); Brassica (U.S. Pat. No. 5,463,174); peanut (Cheng et al., Plant Cell Rep. 15:653-657 (1996), McKently et al., Plant Cell Rep. 14:699-703 (1995)); papaya (Ling et al., Bio/technology 9:752-758 (1991)); and pea (Grant et al., Plant Cell Rep. 15:254-258 (1995)). For a review of other commonly used methods of plant transformation see Newell, C. A., Mol. Biotechnol. 16:53-65 (2000). One of these methods of transformation uses Agrobacterium rhizogenes (Tepfler, M. and Casse-Delbart, F., Microbiol. Sci. 4:24-28 (1987)). Transformation of soybeans using direct delivery of DNA has been published using PEG fusion (PCT Publication No. WO 92/17598), electroporation (Chowrira et al., Mol. Biotechnol. 3:17-23 (1995); Christou et al., Proc. Natl. Acad. Sci. U.S.A. 84:3962-3966 (1987)), microinjection, or particle bombardment (McCabe et al., BiolTechnology 6:923 (1988); Christou et al., Plant Physiol. 87:671-674 (1988)).
[0147] There are a variety of methods for the regeneration of plants from plant tissues. The particular method of regeneration will depend on the starting plant tissue and the particular plant species to be regenerated. The regeneration, development and cultivation of plants from single plant protoplast transformants or from various transformed explants is well known in the art (Weissbach and Weissbach, Eds.; In Methods for Plant Molecular Biology; Academic Press, Inc.: San Diego, Calif., 1988). This regeneration and growth process typically includes the steps of selection of transformed cells, culturing those individualized cells through the usual stages of embryonic development or through the rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil. Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants. Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important lines. Conversely, pollen from plants of these important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.
[0148] In addition to the above discussed procedures, practitioners are familiar with the standard resource materials which describe specific conditions and procedures for the construction, manipulation and isolation of macromolecules (e.g., DNA molecules, plasmids, etc.), generation of recombinant DNA fragments and recombinant expression constructs and the screening and isolating of clones, (see for example, Sambrook, J. et al., In Molecular Cloning: A Laboratory Manual; 2nd ed.; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, N.Y., 1989; Maliga et al., In Methods in Plant Molecular Biology; Cold Spring Harbor Press, 1995; Birren et al., In Genome Analysis: Detecting Genes, 1; Cold Spring Harbor N.Y., 1998; Birren et al., In Genome Analysis: Analyzing DNA, 2; Cold Spring Harbor: New York, 1998; Clark, Ed., In Plant Molecular Biology: A Laboratory Manual; Springer: New York, 1997).
[0149] The skilled artisan will also recognize that different independent transformation events will result in different levels and patterns of expression of the chimeric genes (Jones et al., EMBO J. 4:2411-2418 (1985); De Almeida et al., Mol. Gen. Genetics 218:78-86 (1989)). Thus, multiple events must be screened in order to obtain lines displaying the desired expression level and pattern. Such screening may be accomplished by Northern analysis of mRNA expression, Western analysis of protein expression, or phenotypic analysis. Also of interest are seeds obtained from transformed plants displaying the desired gene expression profile.
[0150] Inducible expression of chimeric genes in most plant cells makes the ZmCAS1 promoter of the instant invention especially useful when inducible expression of a target heterologous nucleic acid fragment is required.
[0151] Another general application of the ZmCAS1 promoter of the invention is to construct chimeric genes that can be used to reduce expression of at least one heterologous nucleic acid fragment in a plant cell. To accomplish this, a chimeric gene designed for gene silencing of a heterologous nucleic acid fragment can be constructed by linking the fragment to the ZmCAS1 promoter of the present invention. (See U.S. Pat. No. 5,231,020, and PCT Publication No. WO 99/53050 published on Oct. 21, 1999, PCT Publication No. WO 02/00904 published on Jan. 3, 2002, and PCT Publication No. WO 98/36083 published on Aug. 20, 1998, for methodology to block plant gene expression via cosuppression.) Alternatively, a chimeric gene designed to express antisense RNA for a heterologous nucleic acid fragment can be constructed by linking the fragment in reverse orientation to the ZmCAS1 promoter of the present invention. (See U.S. Pat. No. 5,107,065 for methodology to block plant gene expression via antisense RNA.) Either the cosuppression or antisense chimeric gene can be introduced into plants via transformation. Transformants wherein expression of the heterologous nucleic acid fragment is decreased or eliminated are then selected.
[0152] This invention also concerns a method of altering (increasing or decreasing) the expression of at least one heterologous nucleic acid fragment in a plant cell which comprises:
[0153] (a) transforming a plant cell with the recombinant expression construct of described herein;
[0154] (b) induction of the inducible promoter by chemical or stress treatment on the cell of (a)
[0155] (c) growing fertile mature plants from the transformed plant cell of step (a); and,
[0156] (d) selecting plants containing the transformed plant cell wherein the expression of the heterologous nucleic acid fragment is increased or decreased. Transformation and selection can be accomplished using methods well-known to those skilled in the art including, but not limited to, the methods described herein.
[0157] Non-limiting examples of compositions and methods disclosed herein are as follows:
[0158] 1. An isolated polynucleotide comprising:
[0159] a) a nucleotide sequence comprising the sequence set forth in SEQ ID NO:9 or SEQ ID NO:10, or a full-length complement thereof;
[0160] b) a nucleotide sequence comprising a functional fragment of SEQ ID NO:10, or a full-length complement thereof;
[0161] c) a nucleotide sequence comprising a sequence having at least 85% sequence identity, based on the BLASTN method of alignment, when compared to the nucleotide sequence of (a) or (b);
[0162] d) a nucleotide sequence which hybridizes to SEQ ID NO:9 under highly stringent conditions of a wash of 0.1 SSC, 0.1% (w/v) SDS at 65° C.;
[0163] e) a nucleotide sequence comprising all or a fragment of a 1.7 kb 5' non-coding sequence of a mannitol dehydrogenase; or,
[0164] f) a derivative of one of the nucleotide sequences indicated in (a), (b), (c), (d) or (e) obtained by substitution, addition and/or deletion of one or more nucleotides; and,
[0165] wherein said nucleotide sequence is an inducible promoter.
[0166] 2. The isolated polynucleotide of embodiment 1, wherein the nucleotide sequence of c) has at least 90% identity, based on the BLASTN method of alignment, when compared to the sequence set forth in SEQ ID NO:1.
[0167] 3. The isolated polynucleotide of embodiment 1, wherein the nucleotide sequence of c) has at least 95% identity, based on the BLASTN method of alignment, when compared to the sequence set forth in SEQ ID NO:1.
[0168] 4. The isolated polynucleotide of embodiment 1, wherein the nucleotide sequence of c) has at least 98% identity, based on the BLASTN method of alignment, when compared to the sequence set forth in SEQ ID NO:1.
[0169] 5. The isolated polynucleotide of embodiment 1 wherein said inducible promoter is induced by a chemical or stress treatment.
[0170] 6. The isolated polynucleotide of embodiment 1 wherein said inducible promoter is induced by a safener or heat treatment.
[0171] 7. The isolated polynucleotide of embodiment 6, wherein the safener is N-(aminocarbonyl)-2-chlorobenzenesulfonamide.
[0172] 8. The isolated polynucleotide of embodiment 6, wherein said heat treatment comprises a temperature greater than 26° C.
[0173] 9. A recombinant DNA construct comprising the isolated polynucleotide of embodiment 1 operably linked to at least one heterologous nucleic acid sequence.
[0174] 10. The recombinant DNA construct of embodiment 9, wherein the heterologous nucleic acid sequence codes for a gene selected from the group consisting of: a double-strand break inducing gene, a recombinase gene, a reporter gene, a selection marker, a disease resistance conferring gene, a herbicide resistance conferring gene, an insect resistance conferring gene; a gene involved in carbohydrate metabolism, a gene involved in fatty acid metabolism, a gene involved in amino acid metabolism, a gene involved in plant development, a gene involved in plant growth regulation, a gene involved in yield improvement, a gene involved in drought resistance, a gene involved in cold resistance, a gene involved in heat and salt resistance in plants.
[0175] 11. The recombinant DNA construct of embodiment 9, wherein the heterologous nucleic acid sequence encodes a protein selected from the group consisting of: a double-strand break inducing protein, a recombinase protein, a reporter protein, a selection marker, a protein conferring disease resistance, protein conferring herbicide resistance, protein conferring insect resistance; protein involved in carbohydrate metabolism, protein involved in fatty acid metabolism, protein involved in amino acid metabolism, protein involved in plant development, protein involved in plant growth regulation, protein involved in yield improvement, protein involved in drought resistance, protein involved in cold resistance, protein involved in heat resistance and salt resistance in plants.
[0176] 12. A vector comprising the recombinant DNA construct of embodiment 9.
[0177] 13. A cell comprising the recombinant DNA construct of embodiment 9.
[0178] 14. The cell of embodiment 13, wherein the cell is a plant cell.
[0179] 15. The plant cell of embodiment 14 having stably incorporated into its genome the recombinant DNA construct of embodiment 9.
[0180] 16. A transgenic plant having stably incorporated into its genome the recombinant DNA construct of embodiment 9.
[0181] 17. The transgenic plant of embodiment 16 wherein said plant is a monocot plant.
[0182] 18. The transgenic plant of embodiment 17, wherein said monocot is selected from the group comprising: maize, wheat, rice, barley, sorghum, millet, sugarcane and rye.
[0183] 19. The transgenic plant of embodiment 16, wherein said plant is a dicot plant.
[0184] 20. The transgenic plant of embodiment 19, wherein said dicot is selected from the group comprising: soy, Brassica sp., cotton, safflower, tobacco, alfalfa and sunflower.
[0185] 21. Transgenic seed produced by the transgenic plant of embodiment 16.
[0186] 22. A plant stably transformed with a recombinant expression construct comprising a plant promoter and a heterologous nucleic acid fragment operably linked to said promoter, wherein said promoter is an inducible promoter and capable of controlling expression of said heterologous nucleic acid fragment in a plant cell, and further wherein said promoter comprises a fragment of SEQ ID NO:10.
[0187] 23. A method of expressing a coding sequence or a functional RNA in a plant cell comprising:
[0188] a) introducing the recombinant DNA construct of embodiment 9 into a plant cell, wherein the at least one heterologous sequence comprises a coding sequence or a functional RNA;
[0189] b) growing the plant cell of step a);
[0190] c) induction of the inducible promoter by chemical or stress treatment on the plant cell of b); and,
[0191] d) selecting a plant cell displaying expression of the coding sequence or the functional RNA of the recombinant DNA construct.
[0192] 24. The method of embodiment 23, wherein the chemical is a safener.
[0193] 25. The method of embodiment 23 wherein the stress treatment is a heat treatment.
[0194] 26. The method of embodiment 23 further comprising growing the plant cell of d) into a plant.
[0195] 27. A method of expressing a coding sequence or a functional RNA in anther cells, said method comprising:
[0196] a) introducing the recombinant DNA construct of embodiment 9 into a plant cell, wherein the at least one heterologous sequence comprises a coding sequence or a functional RNA;
[0197] b) growing the plant cell of step a);
[0198] c) induction of the inducible promoter by chemical or stress treatment on the plant cell of b); and,
[0199] d) identification of anther cells displaying expression of the coding sequence or the functional RNA of the recombinant DNA construct.
[0200] 28. The method of embodiment 23 or embodiment 27 wherein the at least one heterologous sequence is transiently expressed.
[0201] 29. The method of embodiment 23 or embodiment 27 wherein the at least one heterologous sequence is stably incorporated in the plant cell.
[0202] 30. A method for altering expression of at least one heterologous nucleic acid fragment in a plant comprising:
[0203] (a) transforming a plant cell with the recombinant expression construct of embodiment 9;
[0204] (b) induction of the inducible promoter by chemical or stress treatment on the cell of (a)
[0205] (c) growing fertile mature plants from the transformed plant cell of step (a); and,
[0206] (d) selecting plants containing the transformed plant cell wherein the expression of the heterologous nucleic acid fragment is increased or decreased.
[0207] 31. A method of transgenically altering a marketable plant trait, comprising:
[0208] a) introducing a recombinant DNA construct of embodiment 9 into a plant;
[0209] b) induction of the inducible promoter by chemical or stress treatment on the plant of (a);
[0210] c) growing a fertile, mature plant resulting from step b); and
[0211] d) selecting a plant expressing the at least one heterologous nucleotide sequence in at least one plant tissue based on the altered marketable trait.
[0212] 32. The method of embodiment 31 wherein the marketable trait is selected from the group consisting of: disease resistance, herbicide resistance, insect resistance carbohydrate metabolism, fatty acid metabolism, amino acid metabolism, plant development, plant growth regulation, yield improvement, drought resistance, cold resistance, heat resistance, and salt resistance.
[0213] 33. An isolated polynucleotide comprising:
[0214] a) a nucleotide sequence comprising all or a functional fragment of SEQ ID NO:19 or SEQ ID NO:22;
[0215] b) a nucleotide sequence comprising a full-length complement of the nucleotide sequence (a); or,
[0216] c) a nucleotide sequence comprising a sequence having at least 90% sequence identity, based on the BLASTN method of alignment, when compared to the nucleotide sequence of (a) or (b); and, wherein said nucleotide sequence is a promoter.
[0217] 34. The isolate polynucleotide of embodiment 33 wherein said promoter is an inducible promoter.
EXAMPLES
[0218] The present invention is further defined in the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. Sequences of promoters, cDNA, adaptors, and primers listed in this invention all are in the 5' to 3' orientation unless described otherwise. Techniques in molecular biology were typically performed as described in Ausubel, F. M. et al., In Current Protocols in Molecular Biology; John Wiley and Sons: New York, 1990 or Sambrook, J. et al., In Molecular Cloning: A Laboratory Manual; 2nd ed.; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, N.Y., 1989 (hereinafter "Sambrook et al., 1989"). It should be understood that these Examples, while indicating preferred embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Thus, various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.
[0219] The disclosure of each reference set forth herein is incorporated herein by reference in its entirety.
Example 1
Identification of Safener-Inducible cDNAs Expressed in Microspores and/or Tapetum
[0220] Strategy Design for the Identification of Safener-Inducible cDNAs.
[0221] The isolation of conditionally regulated promoters with tissue specificity in plants which are different than the safener induced promoter ZmlN2-2 (Hershey et al. U.S. Pat. No. 5,364,780 Nov. 15, 1994) would enable conditional regulation of genes in microspores and/or the tapetum. Previously, it has been demonstrated that while ZmlN2-2 transcript expression increases in callus, leaf and anther tissues in maize after safener treatment, genes regulated by this promoter do not express in maize tapetal cells (Cigan et al. 2001. Sex. Plant Reprod. 14, 135-142). Immunolocalization studies demonstrated that genes regulated by ZmlN2-2 are present in all anther cell types except the tapetum or microspores. To date, no promoters that respond to CBSU (Chlorobenzenesulfonamide) safener and are specifically expressed in tapetal cells or microspores at the tetrad stage of microsporogenesis have been identified. To enable the isolation of safener-inducible candidate promoters that are expressed in microspores or tapetum, a strategy was designed which takes advantage of two fundamental observations made of plants transformed with the E. coli DAMethylase gene expressed from the maize anther-specific promoter 5126 (5126:DAM; Unger et al., 2001, Transgenic Res. 10, 409-422). First, cytological examination of tetrad staged anthers from male-sterile plants expressing 5126:DAM revealed abnormal microspores and nearly ablated tapetal cells in otherwise structurally normal appearing anthers. Second, Northern analysis of mRNA isolated from 5126:DAM sterile anthers indicates a loss of two tapetal-specific transcripts, 5126 and MS45, while a transcript not expected to be the tapetal-specific (maize actin), is easily detected (Cigan et al. 2001. Sex. Plant Reprod. 14, 135-142). Therefore, anthers isolated from 5126:DAM sterile plants should be reduced or perhaps completely devoid of tapetal- and/or microspore-specific mRNAs.
[0222] In addition, comparison of the ZmlN2-2 transcript expression from RNAs isolated from wild-type male-fertile CBSU-treated plants to RNAs isolated from male-sterile CBSU-treated 5126:DAM plants showed, that in contrast the MS45 and 5126 tapetal-specific mRNAs, the ZmlN2-2 was not reduced in anther RNAs isolated from 5126:DAM CBSU-treated plants (Cigan et al. 2001. Sex. Plant Reprod. 14, 135-142).
[0223] A strategy was designed using sterile plants which were reduced or devoid of tapetal- and/or microspore-specific mRNAs. The strategy involved treating maize plants with CBSU and comparing anther mRNA transcript profiles from these treated control plants with treated 5126:DAM plants. Such a strategy did lead to the identification and isolation of mRNAs and, ultimately, promoters which are responsive to the safener and are microspore- or tapetum-expressed as described below.
[0224] Toward this end, differential RNA hybridization was used to enrich for maize anther or callus mRNAs that are increased by safener or heat treatment. Subsequently, these mRNAs were used as probes to isolate cDNAs from anther cDNA libraries prepared from CBSU-treated maize plants. These cDNAs were then used to screen mRNAs isolated from male-fertile and male-sterile 5126:DAM control and safener-treated plants as a means to identify transcripts which are induced by CBSU or heat treatment or and expressed in the tapetum or microspores as described below.
Maize Anther cDNA Library Construction from CBSU-Treated Wild-Type Plants and Isolation of Safener Inducible cDNA's.
[0225] Wild type maize plants were grown to the meiosis stage of microspore development. Plants were watered with 30 mg 2-CBSU and allowed to develop to the quartet and early vacuolate stage of microspore development. PolyA+ anther RNA was isolated from wild type control and CBSU treated plants and stored. A cDNA library was constructed from mRNAs isolated from CBSU-treated plants, arrayed onto nylon filters and stored. A cDNA subtraction library was generated using the Clontech PCR-Select cDNA Subtraction Kit (#K1804-1) following the manufacturer's instructions to enrich for CBSU-specific transcripts. Using this approach anther PolyA+mRNA from wild-type plants was used to enrich for transcripts found in the anther PolyA+mRNA from CBSU treated plants. This cDNA subtraction library was subcloned into pSPORT (BRL) vector, colonies plated, picked and sequenced. Among the vector inserts sequenced, two DNA sequences were present at a high proportion, at more than 10% of the inserts sequenced, and referred to as ZmCAS1c-1 (477 bp; SEQ ID NO: 1) and ZmCAS1c-2 (438 bp; SEQ ID NO: 2) (QBSU-Anther-Subtract 1). Both ZmCAS1c-1 and ZmCAS1c-2 had sequence identity to mannitol dehydrogenases from plants. ZmCASlc-1 and ZmCAS1c-2 DNA fragments were used as hybridization probes to screen the filter arrayed cDNA CBSU-anther library described above to isolate full-length cDNA containing hybridizing clones. Both ZmCAS1c-1 and ZmCAS1c-2 identified identical cDNA clones. One cDNA clone contained a 1354 bp SalI-NotI insert (SEQ ID NO: 3) that was sequenced and identified as a 1338 bp full length cDNA clone referred to ZmCAS1cDNA (SEQ ID NO: 4). ZmCAS1cDNA is capable of encoding a 354 amino acid sequence (SEQ ID NO: 5) with 99.7% identity to a maize mannitol dehydogenase (GI number 226528549, NP--001147757.1, SEQ ID NO:6; FIG. 1).
[0226] To determine whether the ZmCAS1 cDNA was 1) induced in maize anthers by CBSU-treatment and 2) reduced or absent in tapetum and microspore-ablated maize anthers from CBSU-treated 5126:DAM plants, a 477 bp ZmCAS1c-1 DNA fragment (SEQ ID NO:1), as well as DNA fragments from ZmMS45, Zm5126, ZmActin, and ZmUbiquitin (Cigan et al. 2001. Sex. Plant Reprod. 14, 135-142) were used as a hybridization probes against maize anther mRNAs isolated from male-fertile (F) and male-sterile (S), control (-) or CBSU-treated (+) plants. As shown in FIG. 2, the constitutively expressed maize actin (ACTIN) and ubiquitin (UBI) transcripts were easily detected and did not change their steady-state level across all anther RNA samples and treatments. MS45 and 5126 transcripts were easily detected in anther RNAs from male-fertile plants but absent in anther RNAs from male-sterile plants (FIG. 2) further supporting the observation that these RNAs are localized to the maize tapetum (Cigan et al. 2001. Sex. Plant Reprod. 14, 135-142).
[0227] Anther RNAs from control male-fertile (-, F) and control sterile plants (-, S) did not reveal detectable levels of the IN2-2 transcript, while strong hybridization signals were detected in mRNAs from anthers derived from CBSU-treated control male-fertile and male-sterile plants (FIG. 2). In contrast strong hybridization signals for ZmCAS1 are only revealed in mRNAs from anthers derived from male-fertile CBSU-treated plants (+, F). The reduced ZmCAS1 signal observed in mRNAs from anthers of male-sterile CBSU-treated plants (+, S) indicates that this ZmCAS1 transcript was present in cell layers of anthers. This observation indicates that ZmCAS1 tissue specific expression is different from the IN2-2 expression and thus makes the ZmCAS1 promoter a candidate which differs from the IN2-2 5 promoter in spatial expression in anthers.
[0228] When the ZmCAS1 probe was used to hybridize to maize callus or maize leaf treated with CBSU, strong hybridization was observed in mRNAs from callus, leaf and anther (FIG. 3), similar to the IN2-2 probe, suggesting that in addition to expression in anthers, ZmCAS1 transcript is also expressed in callus and leaf in response to safener, CBSU, treatment.
Isolation of the 1.7 Kb and Truncated 1.0 Kb ZmCAS1 Promoter Fraaments
[0229] In order to isolate DNA sequences which correspond to the ZmCAS1 promoter, subgenomic SaullIA genomic phage libraries from the maize line B73 were screened using the 477 bp ZmCAS1c-1 DNA fragment (SEQ ID NO:1) as a hybridization probe. A phage which contained a 4069 bp maize B73 DNA fragment (SEQ ID NO: 7). and hybridized to the ZmCAS1c-1 probe was isolated, plasmid excised and sequenced. DNA sequence analysis of this 4069 bp genomic DNA identified several regions of sequence identity to the ZmCAS1C cDNA. For promoter studies, oligonucleotide directed mutagenesis was used to introduce an RcaI DNA restriction site at nucleotide positions 3447-3452 in SEQ ID NO:7 using the MORPH Site-Specific Plasmid DNA Mutagenesis Kit 5 Prime-3 Prime (Boulder, Colo.) according to the vendors instructions using the oligonucleotide 5'-GCAGTTCATTCCTCATGACTGCTGCAGCAGAGC-3'(SEQ ID NO:8). A HindIII-RcaI fragment (ZmCAS1HindIII Pro, SEQ ID NO: 15) comprising the truncated 1.0 kb maize ZmCAS1 promoter of SEQ ID NO:9 and a BamHI-RcaI (ZmCAS1BamPro, SEQ ID NO: 16) fragment comprising the 1.7 kb maize ZmCAS1 promoter of SEQ ID NO:10 were isolated and used for promoter studies in plants.
[0230] As shown in FIG. 4, Example 2, and other examples described herein, both the 1.7 kb ZmCAS1 promoter (SEQ ID NO:10) and the truncated 1.0 kb ZmCAS1 promoter (SEQ ID NO:9) were active in plant cells and induced by both a safener (CBSU, FIG. 4A, 4B) and/or a heat treatment (FIG. 4B).
Example 2
Increased Expression of GUS and ZmMS45 is Observed in Maize Cells and Plants when these Aenes are Placed Under the Transcriptional Control of the ZmCAS1 Promoter in Response to the Safener. CBSU or a Heat Treatment
[0231] Agrobacterium-mediated transformation of immature embryos was used to generate integrated copies of PHP16974 (SEQ ID NO:11; ZmCAS1HindIII Pro:GUS/35SPAT) comprising the 1.0 kb ZmCAS1 promoter (SEQ ID NO:9), PHP16975 (SEQ ID NO:12, ZmCAS1BamPro:GUS/35SPAT) comprising the 1.7 kb ZmCAS1 promoter (SEQ ID NO:10) and PHP16972 (SEQ ID NO:13, ZmCAS1 HindIII Pro:MS45/35SPAT) comprising the 1.0 kb ZmCAS1 promoter or PHP16973 (SEQ ID NO:14, ZmCASIBamPro:MS45/35SPAT:) comprising the 1.7 kb ZmCAS1 promoter.
[0232] As described in Example 1 and FIG. 3, in addition to expression in anthers, ZmCAS1 transcript was also expressed in callus and leaf tissue in response to safener, CBSU, treatment. Bialophos-resistant callus events were selected for analysis and plant regeneration.
[0233] To determine whether the 1.7 kb or the 1.0 kb ZmCAS1 promoter from maize could direct induced expression of the GUS reporter, three bialophos-resistant callus events were placed onto maintenance media and maintenance media containing increasing amounts of the safener CBSU-2 for at room temperature for 18 hours, removed and stained with X-Gluc to detect GUS activity. As shown in FIG. 4A, slight GUS expression is detected in callus grown on maintenance media (FIG. 4A: C). In contrast, low levels of GUS expression are detected in callus grown on 10 mg/I CBSU (FIG. 4A: 10) while strong GUS expression is observed in PHP16975 callus events grown on 100 mg/I CBSU (FIG. 4A: 100). This data indicated that the 1.7 kb ZmCAS1 promoter is active in maize callus and can be induced by safener treatment.
[0234] Seven random bialophos-resistant callus events containing PHP16974 which contains a truncated 1.0 kb fragment of the ZmCAS1 promoter driving the GUS reporter were capable of inducible GUS expression when incubated in the presence of 100 mg/liter CBSU at room temperature (FIG. 4B; 26C+CBSU). In a separate experiment, when these 7 callus events were grown on maintenance media without CBSU but incubated for 2 days at 37° C. returned to room temperature and then stained with X-gluc, increased GUS expression was also observed (FIG. 4B; 37C). This data indicated that the truncated 1.0 kb ZmCAS1 promoter is active in maize callus and can be induced by safener and/or heat treatment.
[0235] Plants were also regenerated from callus events containing PHP16975 and grown in the greenhouse to approximately the 5 leaf stage. At this stage of development, leaf punches from plants regenerated from the 3 bialophos-resistant events shown in FIG. 5 were collected pre- (C) and post-watering (S) with 30 mg of 2-CBSU to examine GUS expression in leaf in response to application of the safener. As shown in FIG. 5 strong GUS expression was detected in leaf punches 2 days after watering (S) with CBSU across the 3 PHP16975 transformed plants analyzed.). This data further indicates that the 1.7 kb ZmCAS1 promoter is active in maize leaves and can be induced by safener treatment.
[0236] T-DNA vectors PHP16972 (SEQ ID NO:13, ZmCAS1HindIII Pro: MS45/35S:PAT) and PHP16973 (SEQ ID NO: 14, ZmCAS1 BamPro:MS45/35S:PAT) were used to transform maize callus which was generated to contain a segregating population of MS45/ms45 heterozygous and ms45/ms45 homozygous mutant plants. In order to detect MS45 RNA or protein expression under the control of the 1.0 or 1.7 kb ZmCAS promoter in maize anthers, plants containing a naturally occurring mutation in the maize MS45 gene which results in loss of MS45 RNA and protein were used for these studies. Pollen from MS45/ms45 plants were used to fertilize male-sterile ms45/ms45 plants for the purpose of generating embryos which would be ms45/ms45 as described in Cigan et al 2001. By placing the maize MS45 gene under the control of the ZmCAS1 promoter in these transformation vectors, genes other than GUS could be tested for transcriptional-induction in response to safener in callus, leaf and maize anthers as has been previously demonstrated for Zmln2-2:MS45 regulated expression (Cigan et al 2001. Sex. Plant Reprod. 14, 135-142). Five random bialophos-resistant callus events containing integrated copies of PHP16972 were placed onto maintenance media (FIG. 5, (-)) and maintenance media containing 100 mg/liter safener CBSU-2 (FIG. 5 (+)) at room temperature for 18 hours, removed and PolyA+RNA prepared and used for RNA analysis as described (Cigan et al. 2001. Sex. Plant Reprod. 14, 135-142) using the ZmMS45 and ZmActin probes for hybridization analysis. As shown in FIG. 6, strong induction of a hybridization signal corresponding to the MS45 mRNA is detected within RNA transcripts from ms45/ms45 callus grown on CBSU (+). A very low signal was observed when callus was grown in the absence of CBSU. Actin was used as a control probe to show nearly equivalent RNA levels were present in all samples. Multiple plants were regenerated from ms45/ms45 callus events transformed with PHP16973 and grown in the greenhouse. Plants were watered with 30 mg of CBSU at the meiosis stage of microspore development. Leaf and anthers (quartet, early uninucleate microspore stage) were collected 2 days later from control and CBSU-treated plants and whole-cell protein extracts were prepared from 4 leaf punches or 6 anthers as described (Cigan et al 2001). Leaf and anther proteins were electrophoresed on 10% SDS-denaturing polyacrylamide gels, transferred to supported nitrocellulose, and used for Western analysis using antibodies directed against the maize MS45 protein. Examination of leaf extracts from PHP16973 control (C) and treated (+) plants (FIG. 7) demonstrates increased steady-state levels of the MS45 protein in leaf extracts derived from CBSU-treated plants (lanes 3, 5, 7, 9). Increased MS45 protein is also detected in anther extracts (FIG. 8) derived from PHP16973 CBSU treated plants (Lane 2, 4, 7). This data further supports that the ZmCAS1 promoter is active in maize cells such as anthem, callus and leaves when induced by a safener.
[0237] Taken together the GUS and MS45 results described herein support that genes can be transcriptionally-induced when placed under the control of either the 1.7 kb or the 1.0 Kb ZmCAS1 promoter in maize cells and plants and transcription can be increased in callus, leaf and anthem in response to application of the safener CBSU and or heat treatment.
Example 3
Heat Treatment of Rice Plants Transformed with PHP16974 Comprising the Truncated 1.0 Kb ZmCAS1 Promoter Driving GUS Expression Results in GUS Expression in Germinating Seedlings
[0238] To determine whether the ZmCAS1 promoter could conditionally regulate expression in response to safener treatment or heat treatment in plant species other than maize, Agrobacterium-mediated transformation was used to generate integrated copies of PHP16974 (ZmCAS1HindIII Pro:GUS) comprising the truncated 1.0 kb ZmCAS1 promoter for studies in rice. Scutellum from 10-14 day old germinating seeds (Oryza sativa cv.Kitaake) was used for rice transformation experiment (Toki. 1997, PI Mol. Biol Reporter 15:16-21). Bialophos-resistant callus events containing PHP16974 were selected and screened for their ability to respond to safener application. Four independent bialophos-resistant events were grown on maintenance media or maintenance media containing 100 mg/liter CBSU for 24 hours, removed and stained with X-Gluc. As shown FIG. 9, strong GUS expression is observed in PHP16974 callus events when grown on media containing the CBSU safener (FIG. 9). PHP16974 bialophos-resistant events were regenerated into plants. Leaf tissue was collected from these plants and used for DNA hybridization analyses to identify single-copy PHP16974 insertions. These plants were allowed to set selfed seed and were used for subsequent studies to monitor GUS expression under the transcriptional control of the ZmCAS1 promoter. Sixteen seed were selected from 2 single-copy PHP16974 events were sterilized, grown on hormone-free media at 28° C. or 37° C. for 48 hours and allowed to germinate. Germinating seed was then incubated at 28° C. for 2 additional days and histochemically stained with X-Gluc to detect GUS activity (Reference). As shown in FIGS. 10A and 10C, seedlings germinated at 28° C. exhibit very low levels of detectable Gus staining. In contrast, rice seedlings germinated at 37° C. show pronounced blue staining and root and at the base of shoots (FIGS. 10B and 10D). These results are consistent with observations in maize. That is when the GUS gene is regulated by the 1.0 kb ZmCAS1 promoter, incubation at 37 C resulted in increased Gus activity even in the absence of safener treatment.
Sequence CWU
1
1
221477DNAartificial sequenceZmCAS1C-1 insert 1gtacctccga caatactacc
agacaaggtc cgcccaccac ggttgaggct cgcagggtgc 60atcttgatct cacttggaaa
gcacactatt gtcatcacac cgccaacctt aaggagcgag 120agatacggat cgaatggatg
gtcaccagaa gcagtgtcaa ctatgaagtg cagggagttt 180ttcagggcct ccatctgctg
cgtgtctgac gatataacaa agttatctgc tccaaggagg 240ttaatggctt cttccctctt
cgattcgctc gtgctgaaaa ccgtgacctt gagaccaaag 300gccttaccaa atttgaccgc
catgtggcct angccaccga gcccaatgac tcccagcgac 360ttcccaggct gcttcatgtt
gtgccgcgcc attggggtat agacggtgat tccagcacac 420anaagaagcg ctgcctgcgc
cagagggtaa ccatcgggta tcttgaaaca atacctc 4772438DNAartificial
sequenceZmCAS1C-2insert 2gggggnaggg gcttttnnng ganganaatg tnagtggngt
atccccccnt tgtgaagggn 60ccancaacaa tnnttgtgtt gtaaatnaaa aacgnttccg
ggnaatnatt ncccnaaaan 120ccgntncnat tnccccnatn gccggaacaa tttnagttag
gtncnaaccc ctgnttggtc 180accaaccttt gaaaggcctt gacgtctgcg ccaaacctca
atcacgaacc caagctntct 240catgcgcagg aancanagga taaactgagt cgttgtgctt
gttctgtatc cacataacgt 300caccgtaaca caccccgcag tatatgatct tcagcgaaac
gtcgctgctt tgcactgccc 360tgcggttaaa cttgtatggt gagagaaccc agaagggtct
ctggccgccc aagcatcgca 420nttgccacct cgggcgct
43831354DNAartificial seqeunceZmCAScDNA insert
3gtcgacccac gcgtccgcgg acgctgggca gacacagact ccaccacccc gcttcgatct
60tcttgttgca gctgaaatct gtcagattct gcagttcatt ccaaatggct gctgcagcag
120agcacggcaa ctgcgatgct tgggcggcca gagacccttc tggggttctc tcaccataca
180agtttaaccg cagggcagtg caaagcagcg acgtttcgct gaagatcata tactgcgggg
240tgtgttacgg tgacgttatg tggatacaga acaagcacaa cgactcagtt tatcctctgg
300ttcctgggca tgagatagct ggggttgtga ctgaggttgg cgcagacgtc aaggccttca
360aggtgggtga ccacgcaggc gttggaacct acgtgaactc gtgccggcac tgcgagaact
420gcaacagctc tctggagaac tactgcccag aaacagtttt cacttacaac acaactgatg
480ctgatgggac catcacaaag gggggctact ccactcacat tgtcgtccat gaaaggtact
540gcttcaagat acccgatggc taccctctgg cgcaggcagc gcctcttctg tgtgctggaa
600tcaccgtcta taccccaatg gcgcggcaca acatgaagca gcctgggaag tcgctgggag
660tcattgggct cggtggccta ggccacatgg cggtcaaatt tggtaaggcc tttggtctca
720aggtcacggt tttcagcacg agcgaatcga agagggaaga agccattaac ctccttggag
780cagataactt tgttatatcg tcagacacgc agcagatgga ggccctgaaa aactccctgc
840acttcatagt tgacactgct tctggtgacc atccattcga tccgtatctc tcgctcctta
900aggttggcgg tgtgatgaca atagtgtgct ttccaagtga gatcaagatg caccctgcga
960gcctcaaccg tggtgggcgg accttgtctg gtagtattgt cggaggtaca aaagacatcc
1020aggagatggt taacttttgc gcggagaaca aaatctatcc agagatcgag atcatcaaga
1080tggattatat caacgaggct ctcgccaggc ttgttaaccg agacgtgaaa taccgctttg
1140tcatcgacat caagaactct ttcgagtagc atgctcattc acatgatccc tgtcttcttt
1200gtcaatgtat gagagataat gagtcgtttc gaataaagcg tagacatgat aaataacaag
1260tatgcttgtg attgtaaaat cgttctataa ataaatgcgc tgtgttgaac gttaaaaaaa
1320aaaaaaaaaa aaaaaaaaaa aaaagggcgg ccgc
135441338DNAZea mays 4ccacgcgtcc gcggacgctg ggcagacaca gactccacca
ccccgcttcg atcttcttgt 60tgcagctgaa atctgtcaga ttctgcagtt cattccaaat
ggctgctgca gcagagcacg 120gcaactgcga tgcttgggcg gccagagacc cttctggggt
tctctcacca tacaagttta 180accgcagggc agtgcaaagc agcgacgttt cgctgaagat
catatactgc ggggtgtgtt 240acggtgacgt tatgtggata cagaacaagc acaacgactc
agtttatcct ctggttcctg 300ggcatgagat agctggggtt gtgactgagg ttggcgcaga
cgtcaaggcc ttcaaggtgg 360gtgaccacgc aggcgttgga acctacgtga actcgtgccg
gcactgcgag aactgcaaca 420gctctctgga gaactactgc ccagaaacag ttttcactta
caacacaact gatgctgatg 480ggaccatcac aaaggggggc tactccactc acattgtcgt
ccatgaaagg tactgcttca 540agatacccga tggctaccct ctggcgcagg cagcgcctct
tctgtgtgct ggaatcaccg 600tctatacccc aatggcgcgg cacaacatga agcagcctgg
gaagtcgctg ggagtcattg 660ggctcggtgg cctaggccac atggcggtca aatttggtaa
ggcctttggt ctcaaggtca 720cggttttcag cacgagcgaa tcgaagaggg aagaagccat
taacctcctt ggagcagata 780actttgttat atcgtcagac acgcagcaga tggaggccct
gaaaaactcc ctgcacttca 840tagttgacac tgcttctggt gaccatccat tcgatccgta
tctctcgctc cttaaggttg 900gcggtgtgat gacaatagtg tgctttccaa gtgagatcaa
gatgcaccct gcgagcctca 960accgtggtgg gcggaccttg tctggtagta ttgtcggagg
tacaaaagac atccaggaga 1020tggttaactt ttgcgcggag aacaaaatct atccagagat
cgagatcatc aagatggatt 1080atatcaacga ggctctcgcc aggcttgtta accgagacgt
gaaataccgc tttgtcatcg 1140acatcaagaa ctctttcgag tagcatgctc attcacatga
tccctgtctt ctttgtcaat 1200gtatgagaga taatgagtcg tttcgaataa agcgtagaca
tgataaataa caagtatgct 1260tgtgattgta aaatcgttct ataaataaat gcgctgtgtt
gaacgttaaa aaaaaaaaaa 1320aaaaaaaaaa aaaaaaaa
13385354PRTZea mays 5Met Ala Ala Ala Ala Glu His
Gly Asn Cys Asp Ala Trp Ala Ala Arg 1 5
10 15 Asp Pro Ser Gly Val Leu Ser Pro Tyr Lys Phe
Asn Arg Arg Ala Val 20 25
30 Gln Ser Ser Asp Val Ser Leu Lys Ile Ile Tyr Cys Gly Val Cys
Tyr 35 40 45 Gly
Asp Val Met Trp Ile Gln Asn Lys His Asn Asp Ser Val Tyr Pro 50
55 60 Leu Val Pro Gly His Glu
Ile Ala Gly Val Val Thr Glu Val Gly Ala 65 70
75 80 Asp Val Lys Ala Phe Lys Val Gly Asp His Ala
Gly Val Gly Thr Tyr 85 90
95 Val Asn Ser Cys Arg His Cys Glu Asn Cys Asn Ser Ser Leu Glu Asn
100 105 110 Tyr Cys
Pro Glu Thr Val Phe Thr Tyr Asn Thr Thr Asp Ala Asp Gly 115
120 125 Thr Ile Thr Lys Gly Gly Tyr
Ser Thr His Ile Val Val His Glu Arg 130 135
140 Tyr Cys Phe Lys Ile Pro Asp Gly Tyr Pro Leu Ala
Gln Ala Ala Pro 145 150 155
160 Leu Leu Cys Ala Gly Ile Thr Val Tyr Thr Pro Met Ala Arg His Asn
165 170 175 Met Lys Gln
Pro Gly Lys Ser Leu Gly Val Ile Gly Leu Gly Gly Leu 180
185 190 Gly His Met Ala Val Lys Phe Gly
Lys Ala Phe Gly Leu Lys Val Thr 195 200
205 Val Phe Ser Thr Ser Glu Ser Lys Arg Glu Glu Ala Ile
Asn Leu Leu 210 215 220
Gly Ala Asp Asn Phe Val Ile Ser Ser Asp Thr Gln Gln Met Glu Ala 225
230 235 240 Leu Lys Asn Ser
Leu His Phe Ile Val Asp Thr Ala Ser Gly Asp His 245
250 255 Pro Phe Asp Pro Tyr Leu Ser Leu Leu
Lys Val Gly Gly Val Met Thr 260 265
270 Ile Val Cys Phe Pro Ser Glu Ile Lys Met His Pro Ala Ser
Leu Asn 275 280 285
Arg Gly Gly Arg Thr Leu Ser Gly Ser Ile Val Gly Gly Thr Lys Asp 290
295 300 Ile Gln Glu Met Val
Asn Phe Cys Ala Glu Asn Lys Ile Tyr Pro Glu 305 310
315 320 Ile Glu Ile Ile Lys Met Asp Tyr Ile Asn
Glu Ala Leu Ala Arg Leu 325 330
335 Val Asn Arg Asp Val Lys Tyr Arg Phe Val Ile Asp Ile Lys Asn
Ser 340 345 350 Phe
Glu 6354PRTZea mays 6Met Ala Ala Ala Ala Glu His Gly Asn Cys Asp Ala Trp
Ala Ala Arg 1 5 10 15
Asp Pro Ser Gly Val Leu Ser Pro Tyr Lys Phe Asn Arg Arg Ala Val
20 25 30 Gln Ser Ser Asp
Val Ser Leu Lys Ile Ile Tyr Cys Gly Val Cys Tyr 35
40 45 Gly Asp Val Met Trp Ile Gln Asn Lys
His Asn Asp Ser Val Tyr Pro 50 55
60 Leu Val Pro Gly His Glu Ile Ala Gly Val Val Thr Glu
Val Gly Ala 65 70 75
80 Asp Val Lys Ala Phe Lys Val Gly Asp His Ala Gly Val Gly Thr Tyr
85 90 95 Val Asn Ser Cys
Arg His Cys Glu Asn Cys Asn Ser Ser Leu Glu Asn 100
105 110 Tyr Cys Pro Glu Thr Val Phe Thr Tyr
Asn Thr Thr Asp Ala Asp Gly 115 120
125 Thr Ile Thr Lys Gly Gly Tyr Ser Thr His Ile Val Val His
Glu Arg 130 135 140
Tyr Cys Phe Lys Ile Pro Asp Gly Tyr Pro Leu Ala Lys Ala Ala Pro 145
150 155 160 Leu Leu Cys Ala Gly
Ile Thr Val Tyr Thr Pro Met Ala Arg His Asn 165
170 175 Met Lys Gln Pro Gly Lys Ser Leu Gly Val
Ile Gly Leu Gly Gly Leu 180 185
190 Gly His Met Ala Val Lys Phe Gly Lys Ala Phe Gly Leu Lys Val
Thr 195 200 205 Val
Phe Ser Thr Ser Glu Ser Lys Arg Glu Glu Ala Ile Asn Leu Leu 210
215 220 Gly Ala Asp Asn Phe Val
Ile Ser Ser Asp Thr Gln Gln Met Glu Ala 225 230
235 240 Leu Lys Asn Ser Leu His Phe Ile Val Asp Thr
Ala Ser Gly Asp His 245 250
255 Pro Phe Asp Pro Tyr Leu Ser Leu Leu Lys Val Gly Gly Val Met Thr
260 265 270 Ile Val
Cys Phe Pro Ser Glu Ile Lys Met His Pro Ala Ser Leu Asn 275
280 285 Arg Gly Gly Arg Thr Leu Ser
Gly Ser Ile Val Gly Gly Thr Lys Asp 290 295
300 Ile Gln Glu Met Val Asn Phe Cys Ala Glu Asn Lys
Ile Tyr Pro Glu 305 310 315
320 Ile Glu Ile Ile Lys Met Asp Tyr Ile Asn Glu Ala Leu Ala Arg Leu
325 330 335 Val Asn Arg
Asp Val Lys Tyr Arg Phe Val Ile Asp Ile Lys Asn Ser 340
345 350 Phe Glu 74069DNAZea mays
7cctcttaagg tcctggtaca tctcctcgat caacggcttc cacagacgaa ccctcgcgtt
60tatgaaccaa ttcgaaacct ggtacaaacg caaaaccact cgaatccatc attaccccat
120ctcgccaaaa aagaatcgaa tttgaaacga gtctggtggg atttcgatct gtgcctggtt
180cctggtgagg ctgctccttg ccgccagcac gtccttctca tggtccttgg gatacctgag
240cagagcagag cagagcgcga ttctgatcat ggccaacaat gtcgatagac gatttgcgca
300acaacgtcgc gggacagact tacgggtgca ggaagttctc gaacatccac gccttgagca
360cggcaacaga cttctctggc agcccgcact ggggccgcca gcactgctgc tcggtgcgcc
420acagctgctg cgccgaccag tgcttctgta tgaaggcgac tccagctctg ctccacgtcg
480ccgcccgcgg ccaccgtcac cgacgacgac gactcgcccc agcagctggc cctgctcgcc
540gtagccatga tctcgccggc gagccaccgc ctcagcccac ggtacatcgc cgacacggcg
600cggtgcgcga acggcgcgca gatgccaccg ccgcccgggg gcgagtgcat cagggtgttg
660aacttggctg tcgtgctctg gatcttgtcc aagcactggt tactcgtcca tctgcaaacc
720atcatccaaa tccaacatgc ttcacaaatc tgctccagaa ttcatataac taatagtgtg
780aacgaacgag tggtcaccag gaagaactcc aacatgcttc acaaaccatc gtccaactcc
840aacctggcat gtggccgacg acgtcgttga gcaaatcctg gaccacggct gcgtaccgcg
900agcgcaccac caccgcggtg aagtgcagcg ccatccgcga ccgcgaccgc gcccgcggca
960gctcggtcag ggcggaccgg ctcgctcccg aggagcacta ctccgtcgcg ttcagcgtgc
1020tgtccgagga agcgatggag ttggagcaca gcgtcggctc gctcgccacc accgccgcgg
1080acgcgttggc gccactgcag gcggcgagcg ggtagtggag gcctcctgcg tcggagagcc
1140tggccccggc gaggtggtat gtctcatggg ccggcacggg tactggtccg ccggcgctgg
1200tccagatacc gtacggcttc ttggacgacg accagaccga catggccccg gattggctgg
1260ggcaccgcaa gctcgccgcg acgcggctgt cgcagccggc gacgccgtac agctcgtcgt
1320ccatctcgtc cggcggcgtc ctcccgccgg cacgatcatc tcgcggcggc gcgggggcgt
1380tagcgaggag ctgggcagct ggctggcgaa ggaggccagc acgatgttgt tcgccgcgta
1440gtgcgcgccc agatccgccg acgaccgccg tgttgccgag gccgaacccg aagccgccct
1500ggtcaagaag cacgcagggg tggaagtgcg gcacctcagc gctcataatg ccgccacctc
1560tgtagcccag gccgaggcgg aggatggatg ggactcgaat ccgggggcgg aggtggagga
1620cccttccatg gagaccgagc ggggtttagg gatgagcgag catcacatac gccttccatg
1680gagagcatgg atgcggacgc ggatccgggg cggaagatgg cagggacgcg gattcagggc
1740ggacgcgctt gccgagggcg cgggggacca cagcgtgcgt tacggggaca gggcgggcat
1800cgcgaggacg ggtgcgggag cggagccaca tctggtggtg gacgcctact ttgctctctt
1860atagagtagt aaagattcgt ggaccaaaca acaccctagc ttgtacaaat attcttaggc
1920agttgctact gatgagagaa aaataacatc actccactgc atttgcgtga tttattgaac
1980agatcacaat tacatctatt caaatttatt tacctgtacg tgtccgattt ttaggggagg
2040atttttttac ggtatttttt ttttaaaaaa ataaatttag gcaacaattt tatagaatcg
2100agtgctttat ctattatctt ttacaaggca cacgcgtaca ataaggtttg gtcgttcgtg
2160acttggatag tggttttggt tgcaattccg taattcttgg cataggatac agcccaaccc
2220agaaaaaaat aatgttgcgg tcagttctgg ctttgagatt cggagtacca cgtggcgtaa
2280aggcaggccg tgtcttacag atgaataaag gacctgggtc tcacgtgatt ggtttccagt
2340ttcgtgcatc aagatgtgga attttcaaac tgccgtcgtg tttgtttcgt cacataaaag
2400ctttttggaa ggctaaggag aggaagccgg cgagaaggag ggggcgtttt acgtgtcact
2460gtcctgtcgt gttggctgtt gacacgaatc atttcttccg cgcgtgggaa gaagaagatg
2520cacattagcg gcctgaagta gagatgtcaa tggggaattc cccagcgggg attaactccc
2580cagacccgta cccatgaaca tagaccggcc cccatccccg aacccgaacc cgacctcggg
2640tacgaaaatc ctcccatacc cattcccgac cgggtactaa atacccatgg gtatccatac
2700ccgacccgat tattcaaaaa ttaatgggct ttttatttgt taaccggcgg acgcaatgct
2760tgggactcta ggttttttta ctttgttgac cggctggcgg ctgggctttt tcctacaggc
2820ccaaagttgg tcggcagcca ctaggccaca cgtcacaggc agcccacaag taaatgtcgt
2880tggattgctg gatggtggaa taaaaatcct agatgctaga ttgttctggt tccgggtatt
2940tttctccatg gctaatcggg tttgggttta gccctcccaa acccgaaccc gccatacccg
3000atgggtaagg gatttattcc aaatctatac ccatggggat ttgttttaac ccatacctta
3060accctaatag aggaattccc cacgggtaat cgggtttcgg ggcccattga catctctaga
3120ctgaaggcgt ccaactcaaa tcattaaaaa gtgttgacgc acgcgctgat gcgccggccg
3180cacagcacag gctgcacagc ccgtttaatc agcgatggag ccccggccgt cagccagcca
3240ggtccggcgt ccgggtctgc gccctgcggc gtcactgctg tcgccaccgt ctccgatggt
3300cccacatcca tccagcgggc cgcgcgtggt acaaaaggct cttcctcgcc gtcaggtgca
3360gctgcccaaa caccagacac agactccacc accccgcttc gatcttctgt tgcagctgaa
3420atctgtcaga ttctgcagtt cattccaaat ggctgctgca gcagagcacg gcaactgcga
3480tgcttgggcg gccagagacc cttctggggt tctctcacca tacaagttta accgcaggtg
3540aggtggctcc ccctcctcat ttggtgcaca actcatcatc atcatgtttt tttatcgcct
3600tttcagaaca ctgctgtctg tcttctcgct atgctagaat tctttggaat agctgccagt
3660gttgttttcc ttgcctttta catgtacatc atgttttttt tcctgcctgt ggagtgggtt
3720gatttaaatc actagcaagt caaaaattat ttctaatttc tttcgatctc attaaattca
3780cacagaacgg gaataaccga acacagccta aaatcgtgaa actgatgcat agcagacggt
3840agaccatccg taaccgtaca tattcataac gtctactatg ccttctggta tttttttttt
3900ccagggcagt gcaaagcagc gacgtttcgc tgaagatcat atactgcggg gtgtgttacg
3960gtgacgttat gtggatacag aacaagcaca acgactcagt ttatcctctg gttcctgggt
4020aagaacatgt acactgaacc cctagcttag tagagctagg tagggagct
4069833DNAartificial sequenceCAS1_RCA_MUTAGENESIS_PRIMER 8gcagttcatt
cctcatgact gctgcagcag agc 3391049DNAZea
mays 9aagctttttg gaaggctaag gagaggaagc cggcgagaag gagggggcgt tttacgtgtc
60actgtcctgt cgtgttggct gttgacacga atcatttctt ccgcgcgtgg gaagaagaag
120atgcacatta gcggcctgaa gtagagatgt caatggggaa ttccccagcg gggattaact
180ccccagaccc gtacccatga acatagaccg gcccccatcc ccgaacccga acccgacctc
240gggtacgaaa atcctcccat acccattccc gaccgggtac taaataccca tgggtatcca
300tacccgaccc gattattcaa aaattaatgg gctttttatt tgttaaccgg cggacgcaat
360gcttgggact ctaggttttt ttactttgtt gaccggctgg cggctgggct ttttcctaca
420ggcccaaagt tggtcggcag ccactaggcc acacgtcaca ggcagcccac aagtaaatgt
480cgttggattg ctggatggtg gaataaaaat cctagatgct agattgttct ggttccgggt
540atttttctcc atggctaatc gggtttgggt ttagccctcc caaacccgaa cccgccatac
600ccgatgggta agggatttat tccaaatcta tacccatggg gatttgtttt aacccatacc
660ttaaccctaa tagaggaatt ccccacgggt aatcgggttt cggggcccat tgacatctct
720agactgaagg cgtccaactc aaatcattaa aaagtgttga cgcacgcgct gatgcgccgg
780ccgcacagca caggctgcac agcccgttta atcagcgatg gagccccggc cgtcagccag
840ccaggtccgg cgtccgggtc tgcgccctgc ggcgtcactg ctgtcgccac cgtctccgat
900ggtcccacat ccatccagcg ggccgcgcgt ggtacaaaag gctcttcctc gccgtcaggt
960gcagctgccc aaacaccaga cacagactcc accaccccgc ttcgatcttc tgttgcagct
1020gaaatctgtc agattctgca gttcattcc
1049101746DNAZea mays 10ggatccgggg cggaagatgg cagggacgcg gattcagggc
ggacgcgctt gccgagggcg 60cgggggacca cagcgtgcgt tacggggaca gggcgggcat
cgcgaggacg ggtgcgggag 120cggagccaca tctggtggtg gacgcctact ttgctctctt
atagagtagt aaagattcgt 180ggaccaaaca acaccctagc ttgtacaaat attcttaggc
agttgctact gatgagagaa 240aaataacatc actccactgc atttgcgtga tttattgaac
agatcacaat tacatctatt 300caaatttatt tacctgtacg tgtccgattt ttaggggagg
atttttttac ggtatttttt 360ttttaaaaaa ataaatttag gcaacaattt tatagaatcg
agtgctttat ctattatctt 420ttacaaggca cacgcgtaca ataaggtttg gtcgttcgtg
acttggatag tggttttggt 480tgcaattccg taattcttgg cataggatac agcccaaccc
agaaaaaaat aatgttgcgg 540tcagttctgg ctttgagatt cggagtacca cgtggcgtaa
aggcaggccg tgtcttacag 600atgaataaag gacctgggtc tcacgtgatt ggtttccagt
ttcgtgcatc aagatgtgga 660attttcaaac tgccgtcgtg tttgtttcgt cacataaaag
ctttttggaa ggctaaggag 720aggaagccgg cgagaaggag ggggcgtttt acgtgtcact
gtcctgtcgt gttggctgtt 780gacacgaatc atttcttccg cgcgtgggaa gaagaagatg
cacattagcg gcctgaagta 840gagatgtcaa tggggaattc cccagcgggg attaactccc
cagacccgta cccatgaaca 900tagaccggcc cccatccccg aacccgaacc cgacctcggg
tacgaaaatc ctcccatacc 960cattcccgac cgggtactaa atacccatgg gtatccatac
ccgacccgat tattcaaaaa 1020ttaatgggct ttttatttgt taaccggcgg acgcaatgct
tgggactcta ggttttttta 1080ctttgttgac cggctggcgg ctgggctttt tcctacaggc
ccaaagttgg tcggcagcca 1140ctaggccaca cgtcacaggc agcccacaag taaatgtcgt
tggattgctg gatggtggaa 1200taaaaatcct agatgctaga ttgttctggt tccgggtatt
tttctccatg gctaatcggg 1260tttgggttta gccctcccaa acccgaaccc gccatacccg
atgggtaagg gatttattcc 1320aaatctatac ccatggggat ttgttttaac ccatacctta
accctaatag aggaattccc 1380cacgggtaat cgggtttcgg ggcccattga catctctaga
ctgaaggcgt ccaactcaaa 1440tcattaaaaa gtgttgacgc acgcgctgat gcgccggccg
cacagcacag gctgcacagc 1500ccgtttaatc agcgatggag ccccggccgt cagccagcca
ggtccggcgt ccgggtctgc 1560gccctgcggc gtcactgctg tcgccaccgt ctccgatggt
cccacatcca tccagcgggc 1620cgcgcgtggt acaaaaggct cttcctcgcc gtcaggtgca
gctgcccaaa caccagacac 1680agactccacc accccgcttc gatcttctgt tgcagctgaa
atctgtcaga ttctgcagtt 1740cattcc
17461147747DNAartificial sequencePHP16974
ZmCAS1HindIII ProGUS/35SPAT 11tctagagctc gttcctcgag gaacggtacc tgcggggaag
cttacaataa tgtgtgttgt 60taagtcttgt tgcctgtcat cgtctgactg actttcgtca
taaatcccgg cctccgtaac 120ccagctttgg gcaagctcac ggatttgatc cggcggaacg
ggaatatcga gatgccgggc 180tgaacgctgc agttccagct ttccctttcg ggacaggtac
tccagctgat tgattatctg 240ctgaagggtc ttggttccac ctcctggcac aatgcgaatg
attacttgag cgcgatcggg 300catccaattt tctcccgtca ggtgcgtggt caagtgctac
aaggcacctt tcagtaacga 360gcgaccgtcg atccgtcgcc gggatacgga caaaatggag
cgcagtagtc catcgagggc 420ggcgaaagcc tcgccaaaag caatacgttc atctcgcaca
gcctccagat ccgatcgagg 480gtcttcggcg taggcagata gaagcatgga tacattgctt
gagagtattc cgatggactg 540aagtatggct tccatctttt ctcgtgtgtc tgcatctatt
tcgagaaagc ccccgatgcg 600gcgcaccgca acgcgaattg ccatactatc cgaaagtccc
agcaggcgcg cttgatagga 660aaaggtttca tactcggccg atcgcagacg ggcactcacg
accttgaacc cttcaacttt 720cagggatcga tgctggttga tggtagtctc actcgacgtg
gctctggtgt gttttgacat 780agcttcctcc aaagaaagcg gaaggtctgg atactccagc
acgaaatgtg cccgggtaga 840cggatggaag tctagccctg ctcaatatga aatcaacagt
acatttacag tcaatactga 900atatacttgc tacatttgca attgtcttat aacgaatgtg
aaataaaaat agtgtaacaa 960cgcttttact catcgataat cacaaaaaca tttatacgaa
caaaaataca aatgcactcc 1020ggtttcacag gataggcggg atcagaatat gcaacttttg
acgttttgtt ctttcaaagg 1080gggtgctggc aaaaccaccg cactcatggg cctttgcgct
gctttggcaa atgacggtaa 1140acgagtggcc ctctttgatg ccgacgaaaa ccggcctctg
acgcgatgga gagaaaacgc 1200cttacaaagc agtactggga tcctcgctgt gaagtctatt
ccgccgacga aatgcccctt 1260cttgaagcag cctatgaaaa tgccgagctc gaaggatttg
attatgcgtt ggccgatacg 1320cgtggcggct cgagcgagct caacaacaca atcatcgcta
gctcaaacct gcttctgatc 1380cccaccatgc taacgccgct cgacatcgat gaggcactat
ctacctaccg ctacgtcatc 1440gagctgctgt tgagtgaaaa tttggcaatt cctacagctg
ttttgcgcca acgcgtcccg 1500gtcggccgat tgacaacatc gcaacgcagg atgtcagaga
cgctagagag ccttccagtt 1560gtaccgtctc ccatgcatga aagagatgca tttgccgcga
tgaaagaacg cggcatgttg 1620catcttacat tactaaacac gggaactgat ccgacgatgc
gcctcataga gaggaatctt 1680cggattgcga tggaggaagt cgtggtcatt tcgaaactga
tcagcaaaat cttggaggct 1740tgaagatggc aattcgcaag cccgcattgt cggtcggcga
agcacggcgg cttgctggtg 1800ctcgacccga gatccaccat cccaacccga cacttgttcc
ccagaagctg gacctccagc 1860acttgcctga aaaagccgac gagaaagacc agcaacgtga
gcctctcgtc gccgatcaca 1920tttacagtcc cgatcgacaa cttaagctaa ctgtggatgc
ccttagtcca cctccgtccc 1980cgaaaaagct ccaggttttt ctttcagcgc gaccgcccgc
gcctcaagtg tcgaaaacat 2040atgacaacct cgttcggcaa tacagtccct cgaagtcgct
acaaatgatt ttaaggcgcg 2100cgttggacga tttcgaaagc atgctggcag atggatcatt
tcgcgtggcc ccgaaaagtt 2160atccgatccc ttcaactaca gaaaaatccg ttctcgttca
gacctcacgc atgttcccgg 2220ttgcgttgct cgaggtcgct cgaagtcatt ttgatccgtt
ggggttggag accgctcgag 2280ctttcggcca caagctggct accgccgcgc tcgcgtcatt
ctttgctgga gagaagccat 2340cgagcaattg gtgaagaggg acctatcgga acccctcacc
aaatattgag tgtaggtttg 2400aggccgctgg ccgcgtcctc agtcaccttt tgagccagat
aattaagagc caaatgcaat 2460tggctcaggc tgccatcgtc cccccgtgcg aaacctgcac
gtccgcgtca aagaaataac 2520cggcacctct tgctgttttt atcagttgag ggcttgacgg
atccgcctca agtttgcggc 2580gcagccgcaa aatgagaaca tctatactcc tgtcgtaaac
ctcctcgtcg cgtactcgac 2640tggcaatgag aagttgctcg cgcgatagaa cgtcgcgggg
tttctctaaa aacgcgagga 2700gaagattgaa ctcacctgcc gtaagtttca cctcaccgcc
agcttcggac atcaagcgac 2760gttgcctgag attaagtgtc cagtcagtaa aacaaaaaga
ccgtcggtct ttggagcgga 2820caacgttggg gcgcacgcgc aaggcaaccc gaatgcgtgc
aagaaactct ctcgtactaa 2880acggcttagc gataaaatca cttgctccta gctcgagtgc
aacaacttta tccgtctcct 2940caaggcggtc gccactgata attatgattg gaatatcaga
ctttgccgcc agatttcgaa 3000cgatctcaag cccatcttca cgacctaaat ttagatcaac
aaccacgaca tcgaccgtcg 3060cggaagagag tactctagtg aactgggtgc tgtcggctac
cgcggtcact ttgaaggcgt 3120ggatcgtaag gtattcgata ataagatgcc gcatagcgac
atcgtcatcg ataagaagaa 3180cgtgtttcaa cggctcacct ttcaatctaa aatctgaacc
cttgttcaca gcgcttgaga 3240aattttcacg tgaaggatgt acaatcatct ccagctaaat
gggcagttcg tcagaattgc 3300ggctgaccgc ggatgacgaa aatgcgaacc aagtatttca
attttatgac aaaagttctc 3360aatcgttgtt acaagtgaaa cgcttcgagg ttacagctac
tattgattaa ggagatcgcc 3420tatggtctcg ccccggcgtc gtgcgtccgc cgcgagccag
atctcgccta cttcataaac 3480gtcctcatag gcacggaatg gaatgatgac atcgatcgcc
gtagagagca tgtcaatcag 3540tgtgcgatct tccaagctag caccttgggc gctacttttg
acaagggaaa acagtttctt 3600gaatccttgg attggattcg cgccgtgtat tgttgaaatc
gatcccggat gtcccgagac 3660gacttcactc agataagccc atgctgcatc gtcgcgcatc
tcgccaagca atatccggtc 3720cggccgcata cgcagacttg cttggagcaa gtgctcggcg
ctcacagcac ccagcccagc 3780accgttcttg gagtagagta gtctaacatg attatcgtgt
ggaatgacga gttcgagcgt 3840atcttctatg gtgattagcc tttcctgggg ggggatggcg
ctgatcaagg tcttgctcat 3900tgttgtcttg ccgcttccgg tagggccaca tagcaacatc
gtcagtcggc tgacgacgca 3960tgcgtgcaga aacgcttcca aatccccgtt gtcaaaatgc
tgaaggatag cttcatcatc 4020ctgattttgg cgtttccttc gtgtctgcca ctggttccac
ctcgaagcat cataacggga 4080ggagacttct ttaagaccag aaacacgcga gcttggccgt
cgaatggtca agctgacggt 4140gcccgaggga acggtcggcg gcagacagat ttgtagtcgt
tcaccaccag gaagttcagt 4200ggcgcagagg gggttacgtg gtccgacatc ctgctttctc
agcgcgcccg ctaaaatagc 4260gatatcttca agatcatcat aagagacggg caaaggcatc
ttggtaaaaa tgccggcttg 4320gcgcacaaat gcctctccag gtcgattgat cgcaatttct
tcagtcttcg ggtcatcgag 4380ccattccaaa atcggcttca gaagaaagcg tagttgcgga
tccacttcca tttacaatgt 4440atcctatctc taagcggaaa tttgaattca ttaagagcgg
cggttcctcc cccgcgtggc 4500gccgccagtc aggcggagct ggtaaacacc aaagaaatcg
aggtcccgtg ctacgaaaat 4560ggaaacggtg tcaccctgat tcttcttcag ggttggcggt
atgttgatgg ttgccttaag 4620ggctgtctca gttgtctgct caccgttatt ttgaaagctg
ttgaagctca tcccgccacc 4680cgagctgccg gcgtaggtgc tagctgcctg gaaggcgcct
tgaacaacac tcaagagcat 4740agctccgcta aaacgctgcc agaagtggct gtcgaccgag
cccggcaatc ctgagcgacc 4800gagttcgtcc gcgcttggcg atgttaacga gatcatcgca
tggtcaggtg tctcggcgcg 4860atcccacaac acaaaaacgc gcccatctcc ctgttgcaag
ccacgctgta tttcgccaac 4920aacggtggtg ccacgatcaa gaagcacgat attgttcgtt
gttccacgaa tatcctgagg 4980caagacacac tttacatagc ctgccaaatt tgtgtcgatt
gcggtttgca agatgcacgg 5040aattattgtc ccttgcgtta ccataaaatc ggggtgcggc
aagagcgtgg cgctgctggg 5100ctgcagctcg gtgggtttca tacgtatcga caaatcgttc
tcgccggaca cttcgccatt 5160cggcaaggag ttgtcgtcac gcttgccttc ttgtcttcgg
cccgtgtcgc cctgaatggc 5220gcgtttgctg accccttgat cgccgctgct atatgcaaaa
atcggtgttt cttccggccg 5280tggctcatgc cgctccggtt cgcccctcgg cggtagagga
gcagcaggct gaacagcctc 5340ttgaaccgct ggaggatccg gcggcacctc aatcggagct
ggatgaaatg gcttggtgtt 5400tgttgcgatc aaagttgacg gcgatgcgtt ctcattcacc
ttcttttggc gcccacctag 5460ccaaatgagg cttaatgata acgcgagaac gacacctccg
acgatcaatt tctgagaccc 5520cgaaagacgc cggcgatgtt tgtcggagac cagggatcca
gatgcatcaa cctcatgtgc 5580cgcttgctga ctatcgttat tcatcccttc gcccccttca
ggacgcgttt cacatcgggc 5640ctcaccgtgc ccgtttgcgg cctttggcca acgggatcgt
aagcggtgtt ccagatacat 5700agtactgtgt ggccatccct cagacgccaa cctcgggaaa
ccgaagaaat ctcgacatcg 5760ctccctttaa ctgaatagtt ggcaacagct tccttgccat
caggattgat ggtgtagatg 5820gagggtatgc gtacattgcc cggaaagtgg aataccgtcg
taaatccatt gtcgaagact 5880tcgagtggca acagcgaacg atcgccttgg gcgacgtagt
gccaattact gtccgccgca 5940ccaagggctg tgacaggctg atccaataaa ttctcagctt
tccgttgata ttgtgcttcc 6000gcgtgtagtc tgtccacaac agccttctgt tgtgcctccc
ttcgccgagc cgccgcatcg 6060tcggcggggt aggcgaattg gacgctgtaa tagagatcgg
gctgctcttt atcgaggtgg 6120gacagagtct tggaacttat actgaaaaca taacggcgca
tcccggagtc gcttgcggtt 6180agcacgatta ctggctgagg cgtgaggacc tggcttgcct
tgaaaaatag ataatttccc 6240cgcggtaggg ctgctagatc tttgctattt gaaacggcaa
ccgctgtcac cgtttcgttc 6300gtggcgaatg ttacgaccaa agtagctcca accgccgtcg
agaggcgcac cacttgatcg 6360ggattgtaag ccaaataacg catgcgcgga tctagcttgc
ccgccattgg agtgtcttca 6420gcctccgcac cagtcgcagc ggcaaataaa catgctaaaa
tgaaaagtgc ttttctgatc 6480atggttcgct gtggcctacg tttgaaacgg tatcttccga
tgtctgatag gaggtgacaa 6540ccagacctgc cgggttggtt agtctcaatc tgccgggcaa
gctggtcacc ttttcgtagc 6600gaactgtcgc ggtccacgta ctcaccacag gcattttgcc
gtcaacgacg agggtccttt 6660tatagcgaat ttgctgcgtg cttggagtta catcatttga
agcgatgtgc tcgacctcca 6720ccctgccgcg tttgccaaga atgacttgag gcgaactggg
attgggatag ttgaagaatt 6780gctggtaatc ctggcgcact gttggggcac tgaagttcga
taccaggtcg taggcgtact 6840gagcggtgtc ggcatcataa ctctcgcgca ggcgaacgta
ctcccacaat gaggcgttaa 6900cgacggcctc ctcttgagtt gcaggcaatc gcgagacaga
cacctcgctg tcaacggtgc 6960cgtccggccg tatccataga tatacgggca caagcctgct
caacggcacc attgtggcta 7020tagcgaacgc ttgagcaaca tttcccaaaa tcgcgatagc
tgcgacagct gcaatgagtt 7080tggagagacg tcgcgccgat ttcgctcgcg cggtttgaaa
ggcttctact tccttatagt 7140gctcggcaag gctttcgcgc gccactagca tggcatattc
aggccccgtc atagcgtcca 7200cccgaattgc cgagctgaag atctgacgga gtaggctgcc
atcgccccac attcagcggg 7260aagatcgggc ctttgcagct cgctaatgtg tcgtttgtct
ggcagccgct caaagcgaca 7320actaggcaca gcaggcaata cttcatagaa ttctccattg
aggcgaattt ttgcgcgacc 7380tagcctcgct caacctgagc gaagcgacgg tacaagctgc
tggcagattg ggttgcgccg 7440ctccagtaac tgcctccaat gttgccggcg atcgccggca
aagcgacaat gagcgcatcc 7500cctgtcagaa aaaacatatc gagttcgtaa agaccaatga
tcttggccgc ggtcgtaccg 7560gcgaaggtga ttacaccaag cataagggtg agcgcagtcg
cttcggttag gatgacgatc 7620gttgccacga ggtttaagag gagaagcaag agaccgtagg
tgataagttg cccgatccac 7680ttagctgcga tgtcccgcgt gcgatcaaaa atatatccga
cgaggatcag aggcccgatc 7740gcgagaagca ctttcgtgag aattccaacg gcgtcgtaaa
ctccgaaggc agaccagagc 7800gtgccgtaaa ggacccactg tgccccttgg aaagcaagga
tgtcctggtc gttcatcgga 7860ccgatttcgg atgcgatttt ctgaaaaacg gcctgggtca
cggcgaacat tgtatccaac 7920tgtgccggaa cagtctgcag aggcaagccg gttacactaa
actgctgaac aaagtttggg 7980accgtctttt cgaagatgga aaccacatag tcttggtagt
tagcctgccc aacaattaga 8040gcaacaacga tggtgaccgt gatcacccga gtgataccgc
tacgggtatc gacttcgccg 8100cgtatgacta aaataccctg aacaataatc caaagagtga
cacaggcgat caatggcgca 8160ctcaccgcct cctggatagt ctcaagcatc gagtccaagc
ctgtcgtgaa ggctacatcg 8220aagatcgtat gaatggccgt aaacggcgcc ggaatcgtga
aattcatcga ttggacctga 8280acttgactgg tttgtcgcat aatgttggat aaaatgagct
cgcattcggc gaggatgcgg 8340gcggatgaac aaatcgccca gccttagggg agggcaccaa
agatgacagc ggtcttttga 8400tgctccttgc gttgagcggc cgcctcttcc gcctcgtgaa
ggccggcctg cgcggtagtc 8460atcgttaata ggcttgtcgc ctgtacattt tgaatcattg
cgtcatggat ctgcttgaga 8520agcaaaccat tggtcacggt tgcctgcatg atattgcgag
atcgggaaag ctgagcagac 8580gtatcagcat tcgccgtcaa gcgtttgtcc atcgtttcca
gattgtcagc cgcaatgcca 8640gcgctgtttg cggaaccggt gatctgcgat cgcaacaggt
ccgcttcagc atcactaccc 8700acgactgcac gatctgtatc gctggtgatc gcacgtgccg
tggtcgacat tggcattcgc 8760ggcgaaaaca tttcattgtc taggtccttc gtcgaaggat
actgattttt ctggttgagc 8820gaagtcagta gtccagtaac gccgtaggcc gacgtcaaca
tcgtaaccat cgctatagtc 8880tgagtgagat tctccgcagt cgcgagcgca gtcgcgagcg
tctcagcctc cgttgccggg 8940tcgctaacaa caaactgcgc ccgcgcgggc tgaatatata
gaaagctgca ggtcaaaact 9000gttgcaataa gttgcgtcgt cttcatcgtt tcctacctta
tcaatcttct gcctcgtggt 9060gacgggccat gaattcgctg agccagccag atgagttgcc
ttcttgtgcc tcgcgtagtc 9120gagttgcaaa gcgcaccgtg ttggcacgcc ccgaaagcac
ggcgacatat tcacgcatat 9180cccgcagatc aaattcgcag atgacgcttc cactttctcg
tttaagaaga aacttacggc 9240tgccgaccgt catgtcttca cggatcgcct gaaattcctt
ttcggtacat ttcagtccat 9300cgacataagc cgatcgatct gcggttggtg atggatagaa
aatcttcgtc atacattgcg 9360caaccaagct ggctcctagc ggcgattcca gaacatgctc
tggttgctgc gttgccagta 9420ttagcatccc gttgtttttt cgaacggtca ggaggaattt
gtcgacgaca gtcgaaaatt 9480tagggtttaa caaataggcg cgaaactcat cgcagctcat
cacaaaacgg cggccgtcga 9540tcatggctcc aatccgatgc aggagatatg ctgcagcggg
agcgcatact tcctcgtatt 9600cgagaagatg cgtcatgtcg aagccggtaa tcgacggatc
taactttact tcgtcaactt 9660cgccgtcaaa tgcccagcca agcgcatggc cccggcacca
gcgttggagc cgcgctcctg 9720cgccttcggc gggcccatgc aacaaaaatt cacgtaaccc
cgcgattgaa cgcatttgtg 9780gatcaaacga gagctgacga tggataccac ggaccagacg
gcggttctct tccggagaaa 9840tcccaccccg accatcactc tcgatgagag ccacgatcca
ttcgcgcaga aaatcgtgtg 9900aggctgctgt gttttctagg ccacgcaacg gcgccaaccc
gctgggtgtg cctctgtgaa 9960gtgccaaata tgttcctcct gtggcgcgaa ccagcaattc
gccaccccgg tccttgtcaa 10020agaacacgac cgtacctgca cggtcgacca tgctctgttc
gagcatggct agaacaaaca 10080tcatgagcgt cgtcttaccc ctcccgatag gcccgaatat
tgccgtcatg ccaacatcgt 10140gctcatgcgg gatatagtcg aaaggcgttc cgccattggt
acgaaatcgg gcaatcgcgt 10200tgccccagtg gcctgagctg gcgccctctg gaaagttttc
gaaagagaca aaccctgcga 10260aattgcgtga agtgattgcg ccagggcgtg tgcgccactt
aaaattcccc ggcaattggg 10320accaataggc cgcttccata ccaatacctt cttggacaac
cacggcacct gcatccgcca 10380ttcgtgtccg agcccgcgcg cccctgtccc caagactatt
gagatcgtct gcatagacgc 10440aaaggctcaa atgatgtgag cccataacga attcgttgct
cgcaagtgcg tcctcagcct 10500cggataattt gccgatttga gtcacggctt tatcgccgga
actcagcatc tggctcgatt 10560tgaggctaag tttcgcgtgc gcttgcgggc gagtcaggaa
cgaaaaactc tgcgtgagaa 10620caagtggaaa atcgagggat agcagcgcgt tgagcatgcc
cggccgtgtt tttgcagggt 10680attcgcgaaa cgaatagatg gatccaacgt aactgtcttt
tggcgttctg atctcgagtc 10740ctcgcttgcc gcaaatgact ctgtcggtat aaatcgaagc
gccgagtgag ccgctgacga 10800ccggaaccgg tgtgaaccga ccagtcatga tcaaccgtag
cgcttcgcca atttcggtga 10860agagcacacc ctgcttctcg cggatgccaa gacgatgcag
gccatacgct ttaagagagc 10920cagcgacaac atgccaaaga tcttccatgt tcctgatctg
gcccgtgaga tcgttttccc 10980tttttccgct tagcttggtg aacctcctct ttaccttccc
taaagccgcc tgtgggtaga 11040caatcaacgt aaggaagtgt tcattgcgga ggagttggcc
ggagagcacg cgctgttcaa 11100aagcttcgtt caggctagcg gcgaaaacac tacggaagtg
tcgcggcgcc gatgatggca 11160cgtcggcatg acgtacgagg tgagcatata ttgacacatg
atcatcagcg atattgcgca 11220acagcgtgtt gaacgcacga caacgcgcat tgcgcatttc
agtttcctca agctcgaatg 11280caacgccatc aattctcgca atggtcatga tcgatccgtc
ttcaagaagg acgatatggt 11340cgctgaggtg gccaatataa gggagataga tctcaccgga
tctttcggtc gttccactcg 11400cgccgagcat cacaccattc ctctccctcg tgggggaacc
ctaattggat ttgggctaac 11460agtagcgccc ccccaaactg cactatcaat gcttcttccc
gcggtccgca aaaatagcag 11520gacgacgctc gccgcattgt agtctcgctc cacgatgagc
cgggctgcaa accataacgg 11580cacgagaacg acttcgtaga gcgggttctg aacgataacg
atgacaaagc cggcgaacat 11640catgaataac cctgccaatg tcagtggcac cccaagaaac
aatgcgggcc gtgtggctgc 11700gaggtaaagg gtcgattctt ccaaacgatc agccatcaac
taccgccagt gagcgtttgg 11760ccgaggaagc tcgccccaaa catgataaca atgccgccga
cgacgccggc aaccagccca 11820agcgaagccc gcccgaacat ccaggagatc ccgatagcga
caatgccgag aacagcgagt 11880gactggccga acggaccaag gataaacgtg catatattgt
taaccattgt ggcggggtca 11940gtgccgccac ccgcagattg cgctgcggcg ggtccggatg
aggaaatgct ccatgcaatt 12000gcaccgcaca agcttggggc gcagctcgat atcacgcgca
tcatcgcatt cgagagcgag 12060aggcgattta gatgtaaacg gtatctctca aagcatcgca
tcaatgcgca cctccttagt 12120ataagtcgaa taagacttga ttgtcgtctg cggatttgcc
gttgtcctgg tgtggcggtg 12180gcggagcgat taaaccgcca gcgccatcct cctgcgagcg
gcgctgatat gacccccaaa 12240catcccacgt ctcttcggat tttagcgcct cgtgatcgtc
ttttggaggc tcgattaacg 12300cgggcaccag cgattgagca gctgtttcaa cttttcgcac
gtagccgttt gcaaaaccgc 12360cgatgaaatt accggtgttg taagcggaga tcgcccgacg
aagcgcaaat tgcttctcgt 12420caatcgtttc gccgcctgca taacgacttt tcagcatgtt
tgcagcggca gataatgatg 12480tgcacgcctg gagcgcaccg tcaggtgtca gaccgagcat
agaaaaattt cgagagttta 12540tttgcatgag gccaacatcc agcgaatgcc gtgcatcgag
acggtgcctg acgacttggg 12600ttgcttggct gtgatcttgc cagtgaagcg tttcgccggt
cgtgttgtca tgaatcgcta 12660aaggatcaaa gcgactctcc accttagcta tcgccgcaag
cgtagatgtc gcaactgatg 12720gggcacactt gcgagcaaca tggtcaaact cagcagatga
gagtggcgtg gcaaggctcg 12780acgaacagaa ggagaccatc aaggcaagag aaagcgaccc
cgatctctta agcatacctt 12840atctccttag ctcgcaacta acaccgcctc tcccgttgga
agaagtgcgt tgttttatgt 12900tgaagattat cgggagggtc ggttactcga aaattttcaa
ttgcttcttt atgatttcaa 12960ttgaagcgag aaacctcgcc cggcgtcttg gaacgcaaca
tggaccgaga accgcgcatc 13020catgactaag caaccggatc gacctattca ggccgcagtt
ggtcaggtca ggctcagaac 13080gaaaatgctc ggcgaggtta cgctgtctgt aaacccattc
gatgaacggg aagcttcctt 13140ccgattgctc ttggcaggaa tattggccca tgcctgcttg
cgctttgcaa atgctcttat 13200cgcgttggta tcatatgcct tgtccgccag cagaaacgca
ctctaagcga ttatttgtaa 13260aaatgtttcg gtcatgcggc ggtcatgggc ttgacccgct
gtcagcgcaa gacggatcgg 13320tcaaccgtcg gcatcgacaa cagcgtgaat cttggtggtc
aaaccgccac gggaacgtcc 13380catacagcca tcgtcttgat cccgctgttt cccgtcgccg
catgttggtg gacgcggaca 13440caggaactgt caatcatgac gacattctat cgaaagcctt
ggaaatcaca ctcagaatat 13500gatcccagac gtctgcctca cgccatcgta caaagcgatt
gtagcaggtt gtacaggaac 13560cgtatcgatc aggaacgtct gcccagggcg ggcccgtccg
gaagcgccac aagatgacat 13620tgatcacccg cgtcaacgcg cggcacgcga cgcggcttat
ttgggaacaa aggactgaac 13680aacagtccat tcgaaatcgg tgacatcaaa gcggggacgg
gttatcagtg gcctccaagt 13740caagcctcaa tgaatcaaaa tcagaccgat ttgcaaacct
gatttatgag tgtgcggcct 13800aaatgatgaa atcgtccttc tagatcgcct ccgtggtgta
gcaacacctc gcagtatcgc 13860cgtgctgacc ttggccaggg aattgactgg caagggtgct
ttcacatgac cgctcttttg 13920gccgcgatag atgatttcgt tgctgctttg ggcacgtaga
aggagagaag tcatatcgga 13980gaaattcctc ctggcgcgag agcctgctct atcgcgacgg
catcccactg tcgggaacag 14040accggatcat tcacgaggcg aaagtcgtca acacatgcgt
tataggcatc ttcccttgaa 14100ggatgatctt gttgctgcca atctggaggt gcggcagccg
caggcagatg cgatctcagc 14160gcaacttgcg gcaaaacatc tcactcacct gaaaaccact
agcgagtctc gcgatcagac 14220gaaggccttt tacttaacga cacaatatcc gatgtctgca
tcacaggcgt cgctatccca 14280gtcaatacta aagcggtgca ggaactaaag attactgatg
acttaggcgt gccacgaggc 14340ctgagacgac gcgcgtagac agttttttga aatcattatc
aaagtgatgg cctccgctga 14400agcctatcac ctctgcgccg gtctgtcgga gagatgggca
agcattatta cggtcttcgc 14460gcccgtacat gcattggacg attgcagggt caatggatct
gagatcatcc agaggattgc 14520cgcccttacc ttccgtttcg agttggagcc agcccctaaa
tgagacgaca tagtcgactt 14580gatgtgacaa tgccaagaga gagatttgct taacccgatt
tttttgctca agcgtaagcc 14640tattgaagct tgccggcatg acgtccgcgc cgaaagaata
tcctacaagt aaaacattct 14700gcacaccgaa atgcttggtg tagacatcga ttatgtgacc
aagatcctta gcagtttcgc 14760ttggggaccg ctccgaccag aaataccgaa gtgaactgac
gccaatgaca ggaatccctt 14820ccgtctgcag ataggtacca tcgatagatc tgctgcctcg
cgcgtttcgg tgatgacggt 14880gaaaacctct gacacatgca gctcccggag acggtcacag
cttgtctgta agcggatgcc 14940gggagcagac aagcccgtca gggcgcgtca gcgggtgttg
gcgggtgtcg gggcgcagcc 15000atgacccagt cacgtagcga tagcggagtg tatactggct
taactatgcg gcatcagagc 15060agattgtact gagagtgcac catatgcggt gtgaaatacc
gcacagatgc gtaaggagaa 15120aataccgcat caggcgctct tccgcttcct cgctcactga
ctcgctgcgc tcggtcgttc 15180ggctgcggcg agcggtatca gctcactcaa aggcggtaat
acggttatcc acagaatcag 15240gggataacgc aggaaagaac atgtgagcaa aaggccagca
aaaggccagg aaccgtaaaa 15300aggccgcgtt gctggcgttt ttccataggc tccgcccccc
tgacgagcat cacaaaaatc 15360gacgctcaag tcagaggtgg cgaaacccga caggactata
aagataccag gcgtttcccc 15420ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc
gcttaccgga tacctgtccg 15480cctttctccc ttcgggaagc gtggcgcttt ctcatagctc
acgctgtagg tatctcagtt 15540cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga
accccccgtt cagcccgacc 15600gctgcgcctt atccggtaac tatcgtcttg agtccaaccc
ggtaagacac gacttatcgc 15660cactggcagc agccactggt aacaggatta gcagagcgag
gtatgtaggc ggtgctacag 15720agttcttgaa gtggtggcct aactacggct acactagaag
gacagtattt ggtatctgcg 15780ctctgctgaa gccagttacc ttcggaaaaa gagttggtag
ctcttgatcc ggcaaacaaa 15840ccaccgctgg tagcggtggt ttttttgttt gcaagcagca
gattacgcgc agaaaaaaag 15900gatctcaaga agatcctttg atcttttcta cggggtctga
cgctcagtgg aacgaaaact 15960cacgttaagg gattttggtc atgagattat caaaaaggat
cttcacctag atccttttaa 16020attaaaaatg aagttttaaa tcaatctaaa gtatatatga
gtaaacttgg tctgacagtt 16080accaatgctt aatcagtgag gcacctatct cagcgatctg
tctatttcgt tcatccatag 16140ttgcctgact ccccgtcgtg tagataacta cgatacggga
gggcttacca tctggcccca 16200gtgctgcaat gataccgcga gacccacgct caccggctcc
agatttatca gcaataaacc 16260agccagccgg aagggccgag cgcagaagtg gtcctgcaac
tttatccgcc tccatccagt 16320ctattaattg ttgccgggaa gctagagtaa gtagttcgcc
agttaatagt ttgcgcaacg 16380ttgttgccat tgctgcaggg gggggggggg ggggggactt
ccattgttca ttccacggac 16440aaaaacagag aaaggaaacg acagaggcca aaaagcctcg
ctttcagcac ctgtcgtttc 16500ctttcttttc agagggtatt ttaaataaaa acattaagtt
atgacgaaga agaacggaaa 16560cgccttaaac cggaaaattt tcataaatag cgaaaacccg
cgaggtcgcc gccccgtaac 16620ctgtcggatc accggaaagg acccgtaaag tgataatgat
tatcatctac atatcacaac 16680gtgcgtggag gccatcaaac cacgtcaaat aatcaattat
gacgcaggta tcgtattaat 16740tgatctgcat caacttaacg taaaaacaac ttcagacaat
acaaatcagc gacactgaat 16800acggggcaac ctcatgtccc cccccccccc ccccctgcag
ggcatcgtgg tgtcacgctc 16860gtcgtttggt atggcttcat tcagctccgg ttcccaacga
tcaaggcgag ttacatgatc 16920ccccatgttg tgcaaaaaag cggttagctc cttcggtcct
ccgatcgttg tcagaagtaa 16980gttggccgca gtgttatcac tcatggttat ggcagcactg
cataattctc ttactgtcat 17040gccatccgta agatgctttt ctgtgactgg tgagtactca
accaagtcat tctgagaata 17100gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca
cgggataata ccgcgccaca 17160tagcagaact ttaaaagtgc tcatcattgg aaaacgttct
tcggggcgaa aactctcaag 17220gatcttaccg ctgttgagat ccagttcgat gtaacccact
cgtgcaccca actgatcttc 17280agcatctttt actttcacca gcgtttctgg gtgagcaaaa
acaggaaggc aaaatgccgc 17340aaaaaaggga ataagggcga cacggaaatg ttgaatactc
atactcttcc tttttcaata 17400ttattgaagc atttatcagg gttattgtct catgagcgga
tacatatttg aatgtattta 17460gaaaaataaa caaatagggg ttccgcgcac atttccccga
aaagtgccac ctgacgtcta 17520agaaaccatt attatcatga cattaaccta taaaaatagg
cgtatcacga ggccctttcg 17580tcttcaagaa ttggtcgacg atcttgctgc gttcggatat
tttcgtggag ttcccgccac 17640agacccggat tgaaggcgag atccagcaac tcgcgccaga
tcatcctgtg acggaacttt 17700ggcgcgtgat gactggccag gacgtcggcc gaaagagcga
caagcagatc acgcttttcg 17760acagcgtcgg atttgcgatc gaggattttt cggcgctgcg
ctacgtccgc gaccgcgttg 17820agggatcaag ccacagcagc ccactcgacc ttctagccga
cccagacgag ccaagggatc 17880tttttggaat gctgctccgt cgtcaggctt tccgacgttt
gggtggttga acagaagtca 17940ttatcgtacg gaatgccaag cactcccgag gggaaccctg
tggttggcat gcacatacaa 18000atggacgaac ggataaacct tttcacgccc ttttaaatat
ccgttattct aataaacgct 18060cttttctctt aggtttaccc gccaatatat cctgtcaaac
actgatagtt taaactgaag 18120gcgggaaacg acaatctgat catgagcgga gaattaaggg
agtcacgtta tgacccccgc 18180cgatgacgcg ggacaagccg ttttacgttt ggaactgaca
gaaccgcaac gttgaaggag 18240ccactcagcc caagcttttt ggaaggctaa ggagaggaag
ccggcgagaa ggagggggcg 18300ttttacgtgt cactgtcctg tcgtgttggc tgttgacacg
aatcatttct tccgcgcgtg 18360ggaagaagaa gatgcacatt agcggcctga agtagagatg
tcaatgggga attccccagc 18420ggggattaac tccccagacc cgtacccatg aacatagacc
ggcccccatc cccgaacccg 18480aacccgacct cgggtacgaa aatcctccca tacccattcc
cgaccgggta ctaaataccc 18540atgggtatcc atacccgacc cgattattca aaaattaatg
ggctttttat ttgttaaccg 18600gcggacgcaa tgcttgggac tctaggtttt tttactttgt
tgaccggctg gcggctgggc 18660tttttcctac aggcccaaag ttggtcggca gccactaggc
cacacgtcac aggcagccca 18720caagtaaatg tcgttggatt gctggatggt ggaataaaaa
tcctagatgc tagattgttc 18780tggttccggg tatttttctc catggctaat cgggtttggg
tttagccctc ccaaacccga 18840acccgccata cccgatgggt aagggattta ttccaaatct
atacccatgg ggatttgttt 18900taacccatac cttaacccta atagaggaat tccccacggg
taatcgggtt tcggggccca 18960ttgacatctc tagactgaag gcgtccaact caaatcatta
aaaagtgttg acgcacgcgc 19020tgatgcgccg gccgcacagc acaggctgca cagcccgttt
aatcagcgat ggagccccgg 19080ccgtcagcca gccaggtccg gcgtccgggt ctgcgccctg
cggcgtcact gctgtcgcca 19140ccgtctccga tggtcccaca tccatccagc gggccgcgcg
tggtacaaaa ggctcttcct 19200cgccgtcagg tgcagctgcc caaacaccag acacagactc
caccaccccg cttcgatctt 19260ctgttgcagc tgaaatctgt cagattctgc agttcattcc
tcatggtccg tcctgtagaa 19320accccaaccc gtgaaatcaa aaaactcgac ggcctgtggg
cattcagtct ggatcgcgaa 19380aactgtggaa ttgatcagcg ttggtgggaa agcgcgttac
aagaaagccg ggcaattgct 19440gtgccaggca gttttaacga tcagttcgcc gatgcagata
ttcgtaatta tgcgggcaac 19500gtctggtatc agcgcgaagt ctttataccg aaaggttggg
caggccagcg tatcgtgctg 19560cgtttcgatg cggtcactca ttacggcaaa gtgtgggtca
ataatcagga agtgatggag 19620catcagggcg gctatacgcc atttgaagcc gatgtcacgc
cgtatgttat tgccgggaaa 19680agtgtacgta tcaccgtttg tgtgaacaac gaactgaact
ggcagactat cccgccggga 19740atggtgatta ccgacgaaaa cggcaagaaa aagcagtctt
acttccatga tttctttaac 19800tatgccggaa tccatcgcag cgtaatgctc tacaccacgc
cgaacacctg ggtggacgat 19860atcaccgtgg tgacgcatgt cgcgcaagac tgtaaccacg
cgtctgttga ctgccaggtg 19920gtggccaatg gtgatgtcag cgttgaactg cgtgatgcgg
atcaacaggt ggttgcaact 19980ggacaaggca ctagcgggac tttgcaagtg gtgaatccgc
acctctgcca accgggtgaa 20040ggttatctct atgaactgtg cgtcacagcc aaaagccaga
cagagtgtga tatctacccg 20100cttcgcgtcg gcatccggtc agtggcagtg aagggccaac
agttcctgat taaccacaaa 20160ccgttctact ttactggctt tggtcgtcat gaagatgcgg
acttacgtgg caaaggattc 20220gataacgtgc tgatggtgca cgaccacgca ttaatggact
ggattggggc caactcctac 20280cgtacctcgc attaccctta cgctgaagag atgctcgact
gggcagatga acatggcatc 20340gtggtgattg atgaaactgc tgctgtcggc tttaacctct
ctttaggcat tggtttcgaa 20400gcgggcaaca agccgaaaga actgtacagc gaagaggcag
tcaacgggga aactcagcaa 20460gcgcacttac aggcgattaa agagctgata gcgcgtgaca
aaaaccaccc aagcgtggtg 20520atgtggagta ttgccaacga accggatacc cgtccgcaag
tgcacgggaa tatttcgcca 20580ctggcggaag caacgcgtaa actcgacccg acgcgtccga
tcacctgcgt caatgtaatg 20640ttctgcgacg ctcacaccga taccatcagc gatctctttg
atgtgctgtg cctgaaccgt 20700tattacggat ggtatgtcca aagcggcgat ttggaaacgg
cagagaaggt actggaaaaa 20760gaacttctgg cctggcagga gaaactgcat cagccgatta
tcatcaccga atacggcgtg 20820gatacgttag ccgggctgca ctcaatgtac accgacatgt
ggagtgaaga gtatcagtgt 20880gcatggctgg atatgtatca ccgcgtcttt gatcgcgtca
gcgccgtcgt cggtgaacag 20940gtatggaatt tcgccgattt tgcgacctcg caaggcatat
tgcgcgttgg cggtaacaag 21000aaagggatct tcactcgcga ccgcaaaccg aagtcggcgg
cttttctgct gcaaaaacgc 21060tggactggca tgaacttcgg tgaaaaaccg cagcagggag
gcaaacaatg aatcaacaac 21120tctcctggcg caccatcgtc ggctacagcc tcggtgacgt
ggggcaacct agacttgtcc 21180atcttctgga ttggccaact taattaatgt atgaaataaa
aggatgcaca catagtgaca 21240tgctaatcac tataatgtgg gcatcaaagt tgtgtgttat
gtgtaattac tagttatctg 21300aataaaagag aaagagatca tccatatttc ttatcctaaa
tgaatgtcac gtgtctttat 21360aattctttga tgaaccagat gcatttcatt aaccaaatcc
atatacatat aaatattaat 21420catatataat taatatcaat tgggttagca aaacaaatct
agtctaggtg tgttttgcga 21480attgcggccc catggagtca aagattcaaa tagaggacct
aacagaactc gccgtaaaga 21540ctggcgaaca gttcatacag agtctcttac gactcaatga
caagaagaaa atcttcgtca 21600acatggtgga gcacgacacg cttgtctact ccaaaaatat
caaagataca gtctcagaag 21660accaaagggc aattgagact tttcaacaaa gggtaatatc
cggaaacctc ctcggattcc 21720attgcccagc tatctgtcac tttattgtga agatagtgga
aaaggaaggt ggctcctaca 21780aatgccatca ttgcgataaa ggaaaggcca tcgttgaaga
tgcctctgcc gacagtggtc 21840ccaaagatgg acccccaccc acgaggagca tcgtggaaaa
agaagacgtt ccaaccacgt 21900cttcaaagca agtggattga tgtgatatct ccactgacgt
aagggatgac gcacaatccc 21960actatccttc gcaagaccct tcctctatat aaggaagttc
atttcatttg gagaggacag 22020ggtacccggg gatccaccat gtctccggag aggagaccag
ttgagattag gccagctaca 22080gcagctgata tggccgcggt ttgtgatatc gttaaccatt
acattgagac gtctacagtg 22140aactttagga cagagccaca aacaccacaa gagtggattg
atgatctaga gaggttgcaa 22200gatagatacc cttggttggt tgctgaggtt gagggtgttg
tggctggtat tgcttacgct 22260gggccctgga aggctaggaa cgcttacgat tggacagttg
agagtactgt ttacgtgtca 22320cataggcatc aaaggttggg cctaggatcc acattgtaca
cacatttgct taagtctatg 22380gaggcgcaag gttttaagtc tgtggttgct gttataggcc
ttccaaacga tccatctgtt 22440aggttgcatg aggctttggg atacacagcc cggggtacat
tgcgcgcagc tggatacaag 22500catggtggat ggcatgatgt tggtttttgg caaagggatt
ttgagttgcc agctcctcca 22560aggccagtta ggccagttac ccagatctga gtcgacctgc
aggcatgccg ctgaaatcac 22620cagtctctct ctacaaatct atctctctct ataataatgt
gtgagtagtt cccagataag 22680ggaattaggg ttcttatagg gtttcgctca tgtgttgagc
atataagaaa cccttagtat 22740gtatttgtat ttgtaaaata cttctatcaa taaaatttct
aattcctaaa accaaaatcc 22800agtgggtacc gagctcgaat tcagtacatt aaaaacgtcc
gcaatgtgtt attaagttgt 22860ctaagcgtca atttgtttac accacaatat atcctgccac
cagccagcca acagctcccc 22920gaccggcagc tcggcacaaa atcaccactc gatacaggca
gcccatcagt ccgggacggc 22980gtcagcggga gagccgttgt aaggcggcag actttgctca
tgttaccgat gctattcgga 23040agaacggcaa ctaagctgcc gggtttgaaa cacggatgat
ctcgcggagg gtagcatgtt 23100gattgtaacg atgacagagc gttgctgcct gtgatcaaat
atcatctccc tcgcagagat 23160ccgaattatc agccttctta ttcatttctc gcttaaccgt
gacaggctgt cgatcttgag 23220aactatgccg acataatagg aaatcgctgg ataaagccgc
tgaggaagct gagtggcgct 23280atttctttag aagtgaacgt tgacgatcgt cgaccgtacc
ccgatgaatt aattcggacg 23340tacgttctga acacagctgg atacttactt gggcgattgt
catacatgac atcaacaatg 23400tacccgtttg tgtaaccgtc tcttggaggt tcgtatgaca
ctagtggttc ccctcagctt 23460gcgactagat gttgaggcct aacattttat tagagagcag
gctagttgct tagatacatg 23520atcttcaggc cgttatctgt cagggcaagc gaaaattggc
catttatgac gaccaatgcc 23580ccgcagaagc tcccatcttt gccgccatag acgccgcgcc
ccccttttgg ggtgtagaac 23640atccttttgc cagatgtgga aaagaagttc gttgtcccat
tgttggcaat gacgtagtag 23700ccggcgaaag tgcgagaccc atttgcgcta tatataagcc
tacgatttcc gttgcgacta 23760ttgtcgtaat tggatgaact attatcgtag ttgctctcag
agttgtcgta atttgatgga 23820ctattgtcgt aattgcttat ggagttgtcg tagttgcttg
gagaaatgtc gtagttggat 23880ggggagtagt catagggaag acgagcttca tccactaaaa
caattggcag gtcagcaagt 23940gcctgccccg atgccatcgc aagtacgagg cttagaacca
ccttcaacag atcgcgcata 24000gtcttcccca gctctctaac gcttgagtta agccgcgccg
cgaagcggcg tcggcttgaa 24060cgaattgtta gacattattt gccgactacc ttggtgatct
cgcctttcac gtagtgaaca 24120aattcttcca actgatctgc gcgcgaggcc aagcgatctt
cttgtccaag ataagcctgc 24180ctagcttcaa gtatgacggg ctgatactgg gccggcaggc
gctccattgc ccagtcggca 24240gcgacatcct tcggcgcgat tttgccggtt actgcgctgt
accaaatgcg ggacaacgta 24300agcactacat ttcgctcatc gccagcccag tcgggcggcg
agttccatag cgttaaggtt 24360tcatttagcg cctcaaatag atcctgttca ggaaccggat
caaagagttc ctccgccgct 24420ggacctacca aggcaacgct atgttctctt gcttttgtca
gcaagatagc cagatcaatg 24480tcgatcgtgg ctggctcgaa gatacctgca agaatgtcat
tgcgctgcca ttctccaaat 24540tgcagttcgc gcttagctgg ataacgccac ggaatgatgt
cgtcgtgcac aacaatggtg 24600acttctacag cgcggagaat ctcgctctct ccaggggaag
ccgaagtttc caaaaggtcg 24660ttgatcaaag ctcgccgcgt tgtttcatca agccttacgg
tcaccgtaac cagcaaatca 24720atatcactgt gtggcttcag gccgccatcc actgcggagc
cgtacaaatg tacggccagc 24780aacgtcggtt cgagatggcg ctcgatgacg ccaactacct
ctgatagttg agtcgatact 24840tcggcgatca ccgcttccct catgatgttt aactcctgaa
ttaagccgcg ccgcgaagcg 24900gtgtcggctt gaatgaattg ttaggcgtca tcctgtgctc
ccgagaacca gtaccagtac 24960atcgctgttt cgttcgagac ttgaggtcta gttttatacg
tgaacaggtc aatgccgccg 25020agagtaaagc cacattttgc gtacaaattg caggcaggta
cattgttcgt ttgtgtctct 25080aatcgtatgc caaggagctg tctgcttagt gcccactttt
tcgcaaattc gatgagactg 25140tgcgcgactc ctttgcctcg gtgcgtgtgc gacacaacaa
tgtgttcgat agaggctaga 25200tcgttccatg ttgagttgag ttcaatcttc ccgacaagct
cttggtcgat gaatgcgcca 25260tagcaagcag agtcttcatc agagtcatca tccgagatgt
aatccttccg gtaggggctc 25320acacttctgg tagatagttc aaagccttgg tcggataggt
gcacatcgaa cacttcacga 25380acaatgaaat ggttctcagc atccaatgtt tccgccacct
gctcagggat caccgaaatc 25440ttcatatgac gcctaacgcc tggcacagcg gatcgcaaac
ctggcgcggc ttttggcaca 25500aaaggcgtga caggtttgcg aatccgttgc tgccacttgt
taaccctttt gccagatttg 25560gtaactataa tttatgttag aggcgaagtc ttgggtaaaa
actggcctaa aattgctggg 25620gatttcagga aagtaaacat caccttccgg ctcgatgtct
attgtagata tatgtagtgt 25680atctacttga tcgggggatc tgctgcctcg cgcgtttcgg
tgatgacggt gaaaacctct 25740gacacatgca gctcccggag acggtcacag cttgtctgta
agcggatgcc gggagcagac 25800aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg
gggcgcagcc atgacccagt 25860cacgtagcga tagcggagtg tatactggct taactatgcg
gcatcagagc agattgtact 25920gagagtgcac catatgcggt gtgaaatacc gcacagatgc
gtaaggagaa aataccgcat 25980caggcgctct tccgcttcct cgctcactga ctcgctgcgc
tcggtcgttc ggctgcggcg 26040agcggtatca gctcactcaa aggcggtaat acggttatcc
acagaatcag gggataacgc 26100aggaaagaac atgtgagcaa aaggccagca aaaggccagg
aaccgtaaaa aggccgcgtt 26160gctggcgttt ttccataggc tccgcccccc tgacgagcat
cacaaaaatc gacgctcaag 26220tcagaggtgg cgaaacccga caggactata aagataccag
gcgtttcccc ctggaagctc 26280cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga
tacctgtccg cctttctccc 26340ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg
tatctcagtt cggtgtaggt 26400cgttcgctcc aagctgggct gtgtgcacga accccccgtt
cagcccgacc gctgcgcctt 26460atccggtaac tatcgtcttg agtccaaccc ggtaagacac
gacttatcgc cactggcagc 26520agccactggt aacaggatta gcagagcgag gtatgtaggc
ggtgctacag agttcttgaa 26580gtggtggcct aactacggct acactagaag gacagtattt
ggtatctgcg ctctgctgaa 26640gccagttacc ttcggaaaaa gagttggtag ctcttgatcc
ggcaaacaaa ccaccgctgg 26700tagcggtggt ttttttgttt gcaagcagca gattacgcgc
agaaaaaaag gatctcaaga 26760agatcctttg atcttttcta cggggtctga cgctcagtgg
aacgaaaact cacgttaagg 26820gattttggtc atgagattat caaaaaggat cttcacctag
atccttttaa attaaaaatg 26880aagttttaaa tcaatctaaa gtatatatga gtaaacttgg
tctgacagtt accaatgctt 26940aatcagtgag gcacctatct cagcgatctg tctatttcgt
tcatccatag ttgcctgact 27000ccccgtcgtg tagataacta cgatacggga gggcttacca
tctggcccca gtgctgcaat 27060gataccgcga gacccacgct caccggctcc agatttatca
gcaataaacc agccagccgg 27120aagggccgag cgcagaagtg gtcctgcaac tttatccgcc
tccatccagt ctattaattg 27180ttgccgggaa gctagagtaa gtagttcgcc agttaatagt
ttgcgcaacg ttgttgccat 27240tgctgcaggg gggggggggg ggggggactt ccattgttca
ttccacggac aaaaacagag 27300aaaggaaacg acagaggcca aaaagcctcg ctttcagcac
ctgtcgtttc ctttcttttc 27360agagggtatt ttaaataaaa acattaagtt atgacgaaga
agaacggaaa cgccttaaac 27420cggaaaattt tcataaatag cgaaaacccg cgaggtcgcc
gccccgtaac ctgtcggatc 27480accggaaagg acccgtaaag tgataatgat tatcatctac
atatcacaac gtgcgtggag 27540gccatcaaac cacgtcaaat aatcaattat gacgcaggta
tcgtattaat tgatctgcat 27600caacttaacg taaaaacaac ttcagacaat acaaatcagc
gacactgaat acggggcaac 27660ctcatgtccc cccccccccc ccccctgcag gcatcgtggt
gtcacgctcg tcgtttggta 27720tggcttcatt cagctccggt tcccaacgat caaggcgagt
tacatgatcc cccatgttgt 27780gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt
cagaagtaag ttggccgcag 27840tgttatcact catggttatg gcagcactgc ataattctct
tactgtcatg ccatccgtaa 27900gatgcttttc tgtgactggt gagtactcaa ccaagtcatt
ctgagaatag tgtatgcggc 27960gaccgagttg ctcttgcccg gcgtcaacac gggataatac
cgcgccacat agcagaactt 28020taaaagtgct catcattgga aaacgttctt cggggcgaaa
actctcaagg atcttaccgc 28080tgttgagatc cagttcgatg taacccactc gtgcacccaa
ctgatcttca gcatctttta 28140ctttcaccag cgtttctggg tgagcaaaaa caggaaggca
aaatgccgca aaaaagggaa 28200taagggcgac acggaaatgt tgaatactca tactcttcct
ttttcaatat tattgaagca 28260tttatcaggg ttattgtctc atgagcggat acatatttga
atgtatttag aaaaataaac 28320aaataggggt tccgcgcaca tttccccgaa aagtgccacc
tgacgtctaa gaaaccatta 28380ttatcatgac attaacctat aaaaataggc gtatcacgag
gccctttcgt cttcaagaat 28440tcggagcttt tgccattctc accggattca gtcgtcactc
atggtgattt ctcacttgat 28500aaccttattt ttgacgaggg gaaattaata ggttgtattg
atgttggacg agtcggaatc 28560gcagaccgat accaggatct tgccatccta tggaactgcc
tcggtgagtt ttctccttca 28620ttacagaaac ggctttttca aaaatatggt attgataatc
ctgatatgaa taaattgcag 28680tttcatttga tgctcgatga gtttttctaa tcagaattgg
ttaattggtt gtaacactgg 28740cagagcatta cgctgacttg acgggacggc ggctttgttg
aataaatcga acttttgctg 28800agttgaagga tcagatcacg catcttcccg acaacgcaga
ccgttccgtg gcaaagcaaa 28860agttcaaaat caccaactgg tccacctaca acaaagctct
catcaaccgt ggctccctca 28920ctttctggct ggatgatggg gcgattcagg cctggtatga
gtcagcaaca ccttcttcac 28980gaggcagacc tcagcgccag aaggccgcca gagaggccga
gcgcggccgt gaggcttgga 29040cgctagggca gggcatgaaa aagcccgtag cgggctgcta
cgggcgtctg acgcggtgga 29100aagggggagg ggatgttgtc tacatggctc tgctgtagtg
agtgggttgc gctccggcag 29160cggtcctgat caatcgtcac cctttctcgg tccttcaacg
ttcctgacaa cgagcctcct 29220tttcgccaat ccatcgacaa tcaccgcgag tccctgctcg
aacgctgcgt ccggaccggc 29280ttcgtcgaag gcgtctatcg cggcccgcaa cagcggcgag
agcggagcct gttcaacggt 29340gccgccgcgc tcgccggcat cgctgtcgcc ggcctgctcc
tcaagcacgg ccccaacagt 29400gaagtagctg attgtcatca gcgcattgac ggcgtccccg
gccgaaaaac ccgcctcgca 29460gaggaagcga agctgcgcgt cggccgtttc catctgcggt
gcgcccggtc gcgtgccggc 29520atggatgcgc gcgccatcgc ggtaggcgag cagcgcctgc
ctgaagctgc gggcattccc 29580gatcagaaat gagcgccagt cgtcgtcggc tctcggcacc
gaatgcgtat gattctccgc 29640cagcatggct tcggccagtg cgtcgagcag cgcccgcttg
ttcctgaagt gccagtaaag 29700cgccggctgc tgaaccccca accgttccgc cagtttgcgt
gtcgtcagac cgtctacgcc 29760gacctcgttc aacaggtcca gggcggcacg gatcactgta
ttcggctgca actttgtcat 29820gcttgacact ttatcactga taaacataat atgtccacca
acttatcagt gataaagaat 29880ccgcgcgttc aatcggacca gcggaggctg gtccggaggc
cagacgtgaa acccaacata 29940cccctgatcg taattctgag cactgtcgcg ctcgacgctg
tcggcatcgg cctgattatg 30000ccggtgctgc cgggcctcct gcgcgatctg gttcactcga
acgacgtcac cgcccactat 30060ggcattctgc tggcgctgta tgcgttggtg caatttgcct
gcgcacctgt gctgggcgcg 30120ctgtcggatc gtttcgggcg gcggccaatc ttgctcgtct
cgctggccgg cgccactgtc 30180gactacgcca tcatggcgac agcgcctttc ctttgggttc
tctatatcgg gcggatcgtg 30240gccggcatca ccggggcgac tggggcggta gccggcgctt
atattgccga tatcactgat 30300ggcgatgagc gcgcgcggca cttcggcttc atgagcgcct
gtttcgggtt cgggatggtc 30360gcgggacctg tgctcggtgg gctgatgggc ggtttctccc
cccacgctcc gttcttcgcc 30420gcggcagcct tgaacggcct caatttcctg acgggctgtt
tccttttgcc ggagtcgcac 30480aaaggcgaac gccggccgtt acgccgggag gctctcaacc
cgctcgcttc gttccggtgg 30540gcccggggca tgaccgtcgt cgccgccctg atggcggtct
tcttcatcat gcaacttgtc 30600ggacaggtgc cggccgcgct ttgggtcatt ttcggcgagg
atcgctttca ctgggacgcg 30660accacgatcg gcatttcgct tgccgcattt ggcattctgc
attcactcgc ccaggcaatg 30720atcaccggcc ctgtagccgc ccggctcggc gaaaggcggg
cactcatgct cggaatgatt 30780gccgacggca caggctacat cctgcttgcc ttcgcgacac
ggggatggat ggcgttcccg 30840atcatggtcc tgcttgcttc gggtggcatc ggaatgccgg
cgctgcaagc aatgttgtcc 30900aggcaggtgg atgaggaacg tcaggggcag ctgcaaggct
cactggcggc gctcaccagc 30960ctgacctcga tcgtcggacc cctcctcttc acggcgatct
atgcggcttc tataacaacg 31020tggaacgggt gggcatggat tgcaggcgct gccctctact
tgctctgcct gccggcgctg 31080cgtcgcgggc tttggagcgg cgcagggcaa cgagccgatc
gctgatcgtg gaaacgatag 31140gcctatgcca tgcgggtcaa ggcgacttcc ggcaagctat
acgcgcccta ggagtgcggt 31200tggaacgttg gcccagccag atactcccga tcacgagcag
gacgccgatg atttgaagcg 31260cactcagcgt ctgatccaag aacaaccatc ctagcaacac
ggcggtcccc gggctgagaa 31320agcccagtaa ggaaacaact gtaggttcga gtcgcgagat
cccccggaac caaaggaagt 31380aggttaaacc cgctccgatc aggccgagcc acgccaggcc
gagaacattg gttcctgtag 31440gcatcgggat tggcggatca aacactaaag ctactggaac
gagcagaagt cctccggccg 31500ccagttgcca ggcggtaaag gtgagcagag gcacgggagg
ttgccacttg cgggtcagca 31560cggttccgaa cgccatggaa accgcccccg ccaggcccgc
tgcgacgccg acaggatcta 31620gcgctgcgtt tggtgtcaac accaacagcg ccacgcccgc
agttccgcaa atagccccca 31680ggaccgccat caatcgtatc gggctaccta gcagagcggc
agagatgaac acgaccatca 31740gcggctgcac agcgcctacc gtcgccgcga ccccgcccgg
caggcggtag accgaaataa 31800acaacaagct ccagaatagc gaaatattaa gtgcgccgag
gatgaagatg cgcatccacc 31860agattcccgt tggaatctgt cggacgatca tcacgagcaa
taaacccgcc ggcaacgccc 31920gcagcagcat accggcgacc cctcggcctc gctgttcggg
ctccacgaaa acgccggaca 31980gatgcgcctt gtgagcgtcc ttggggccgt cctcctgttt
gaagaccgac agcccaatga 32040tctcgccgtc gatgtaggcg ccgaatgcca cggcatctcg
caaccgttca gcgaacgcct 32100ccatgggctt tttctcctcg tgctcgtaaa cggacccgaa
catctctgga gctttcttca 32160gggccgacaa tcggatctcg cggaaatcct gcacgtcggc
cgctccaagc cgtcgaatct 32220gagccttaat cacaattgtc aattttaatc ctctgtttat
cggcagttcg tagagcgcgc 32280cgtgcgtccc gagcgatact gagcgaagca agtgcgtcga
gcagtgcccg cttgttcctg 32340aaatgccagt aaagcgctgg ctgctgaacc cccagccgga
actgacccca caaggcccta 32400gcgtttgcaa tgcaccaggt catcattgac ccaggcgtgt
tccaccaggc cgctgcctcg 32460caactcttcg caggcttcgc cgacctgctc gcgccacttc
ttcacgcggg tggaatccga 32520tccgcacatg aggcggaagg tttccagctt gagcgggtac
ggctcccggt gcgagctgaa 32580atagtcgaac atccgtcggg ccgtcggcga cagcttgcgg
tacttctccc atatgaattt 32640cgtgtagtgg tcgccagcaa acagcacgac gatttcctcg
tcgatcagga cctggcaacg 32700ggacgttttc ttgccacggt ccaggacgcg gaagcggtgc
agcagcgaca ccgattccag 32760gtgcccaacg cggtcggacg tgaagcccat cgccgtcgcc
tgtaggcgcg acaggcattc 32820ctcggccttc gtgtaatacc ggccattgat cgaccagccc
aggtcctggc aaagctcgta 32880gaacgtgaag gtgatcggct cgccgatagg ggtgcgcttc
gcgtactcca acacctgctg 32940ccacaccagt tcgtcatcgt cggcccgcag ctcgacgccg
gtgtaggtga tcttcacgtc 33000cttgttgacg tggaaaatga ccttgttttg cagcgcctcg
cgcgggattt tcttgttgcg 33060cgtggtgaac agggcagagc gggccgtgtc gtttggcatc
gctcgcatcg tgtccggcca 33120cggcgcaata tcgaacaagg aaagctgcat ttccttgatc
tgctgcttcg tgtgtttcag 33180caacgcggcc tgcttggcct cgctgacctg ttttgccagg
tcctcgccgg cggtttttcg 33240cttcttggtc gtcatagttc ctcgcgtgtc gatggtcatc
gacttcgcca aacctgccgc 33300ctcctgttcg agacgacgcg aacgctccac ggcggccgat
ggcgcgggca gggcaggggg 33360agccagttgc acgctgtcgc gctcgatctt ggccgtagct
tgctggacca tcgagccgac 33420ggactggaag gtttcgcggg gcgcacgcat gacggtgcgg
cttgcgatgg tttcggcatc 33480ctcggcggaa aaccccgcgt cgatcagttc ttgcctgtat
gccttccggt caaacgtccg 33540attcattcac cctccttgcg ggattgcccc gactcacgcc
ggggcaatgt gcccttattc 33600ctgatttgac ccgcctggtg ccttggtgtc cagataatcc
accttatcgg caatgaagtc 33660ggtcccgtag accgtctggc cgtccttctc gtacttggta
ttccgaatct tgccctgcac 33720gaataccagc gaccccttgc ccaaatactt gccgtgggcc
tcggcctgag agccaaaaca 33780cttgatgcgg aagaagtcgg tgcgctcctg cttgtcgccg
gcatcgttgc gccactcttc 33840attaaccgct atatcgaaaa ttgcttgcgg cttgttagaa
ttgccatgac gtacctcggt 33900gtcacgggta agattaccga taaactggaa ctgattatgg
ctcatatcga aagtctcctt 33960gagaaaggag actctagttt agctaaacat tggttccgct
gtcaagaact ttagcggcta 34020aaattttgcg ggccgcgacc aaaggtgcga ggggcggctt
ccgctgtgta caaccagata 34080tttttcacca acatccttcg tctgctcgat gagcggggca
tgacgaaaca tgagctgtcg 34140gagagggcag gggtttcaat ttcgttttta tcagacttaa
ccaacggtaa ggccaacccc 34200tcgttgaagg tgatggaggc cattgccgac gccctggaaa
ctcccctacc tcttctcctg 34260gagtccaccg accttgaccg cgaggcactc gcggagattg
cgggtcatcc tttcaagagc 34320agcgtgccgc ccggatacga acgcatcagt gtggttttgc
cgtcacataa ggcgtttatc 34380gtaaagaaat ggggcgacga cacccgaaaa aagctgcgtg
gaaggctctg acgccaaggg 34440ttagggcttg cacttccttc tttagccgct aaaacggccc
cttctctgcg ggccgtcggc 34500tcgcgcatca tatcgacatc ctcaacggaa gccgtgccgc
gaatggcatc gggcgggtgc 34560gctttgacag ttgttttcta tcagaacccc tacgtcgtgc
ggttcgatta gctgtttgtc 34620ttgcaggcta aacactttcg gtatatcgtt tgcctgtgcg
ataatgttgc taatgatttg 34680ttgcgtaggg gttactgaaa agtgagcggg aaagaagagt
ttcagaccat caaggagcgg 34740gccaagcgca agctggaacg cgacatgggt gcggacctgt
tggccgcgct caacgacccg 34800aaaaccgttg aagtcatgct caacgcggac ggcaaggtgt
ggcacgaacg ccttggcgag 34860ccgatgcggt acatctgcga catgcggccc agccagtcgc
aggcgattat agaaacggtg 34920gccggattcc acggcaaaga ggtcacgcgg cattcgccca
tcctggaagg cgagttcccc 34980ttggatggca gccgctttgc cggccaattg ccgccggtcg
tggccgcgcc aacctttgcg 35040atccgcaagc gcgcggtcgc catcttcacg ctggaacagt
acgtcgaggc gggcatcatg 35100acccgcgagc aatacgaggt cattaaaagc gccgtcgcgg
cgcatcgaaa catcctcgtc 35160attggcggta ctggctcggg caagaccacg ctcgtcaacg
cgatcatcaa tgaaatggtc 35220gccttcaacc cgtctgagcg cgtcgtcatc atcgaggaca
ccggcgaaat ccagtgcgcc 35280gcagagaacg ccgtccaata ccacaccagc atcgacgtct
cgatgacgct gctgctcaag 35340acaacgctgc gtatgcgccc cgaccgcatc ctggtcggtg
aggtacgtgg ccccgaagcc 35400cttgatctgt tgatggcctg gaacaccggg catgaaggag
gtgccgccac cctgcacgca 35460aacaacccca aagcgggcct gagccggctc gccatgctta
tcagcatgca cccggattca 35520ccgaaaccca ttgagccgct gattggcgag gcggttcatg
tggtcgtcca tatcgccagg 35580acccctagcg gccgtcgagt gcaagaaatt ctcgaagttc
ttggttacga gaacggccag 35640tacatcacca aaaccctgta aggagtattt ccaatgacaa
cggctgttcc gttccgtctg 35700accatgaatc gcggcatttt gttctacctt gccgtgttct
tcgttctcgc tctcgcgtta 35760tccgcgcatc cggcgatggc ctcggaaggc accggcggca
gcttgccata tgagagctgg 35820ctgacgaacc tgcgcaactc cgtaaccggc ccggtggcct
tcgcgctgtc catcatcggc 35880atcgtcgtcg ccggcggcgt gctgatcttc ggcggcgaac
tcaacgcctt cttccgaacc 35940ctgatcttcc tggttctggt gatggcgctg ctggtcggcg
cgcagaacgt gatgagcacc 36000ttcttcggtc gtggtgccga aatcgcggcc ctcggcaacg
gggcgctgca ccaggtgcaa 36060gtcgcggcgg cggatgccgt gcgtgcggta gcggctggac
ggctcgccta atcatggctc 36120tgcgcacgat ccccatccgt cgcgcaggca accgagaaaa
cctgttcatg ggtggtgatc 36180gtgaactggt gatgttctcg ggcctgatgg cgtttgcgct
gattttcagc gcccaagagc 36240tgcgggccac cgtggtcggt ctgatcctgt ggttcggggc
gctctatgcg ttccgaatca 36300tggcgaaggc cgatccgaag atgcggttcg tgtacctgcg
tcaccgccgg tacaagccgt 36360attacccggc ccgctcgacc ccgttccgcg agaacaccaa
tagccaaggg aagcaatacc 36420gatgatccaa gcaattgcga ttgcaatcgc gggcctcggc
gcgcttctgt tgttcatcct 36480ctttgcccgc atccgcgcgg tcgatgccga actgaaactg
aaaaagcatc gttccaagga 36540cgccggcctg gccgatctgc tcaactacgc cgctgtcgtc
gatgacggcg taatcgtggg 36600caagaacggc agctttatgg ctgcctggct gtacaagggc
gatgacaacg caagcagcac 36660cgaccagcag cgcgaagtag tgtccgcccg catcaaccag
gccctcgcgg gcctgggaag 36720tgggtggatg atccatgtgg acgccgtgcg gcgtcctgct
ccgaactacg cggagcgggg 36780cctgtcggcg ttccctgacc gtctgacggc agcgattgaa
gaagagcgct cggtcttgcc 36840ttgctcgtcg gtgatgtact tcaccagctc cgcgaagtcg
ctcttcttga tggagcgcat 36900ggggacgtgc ttggcaatca cgcgcacccc ccggccgttt
tagcggctaa aaaagtcatg 36960gctctgccct cgggcggacc acgcccatca tgaccttgcc
aagctcgtcc tgcttctctt 37020cgatcttcgc cagcagggcg aggatcgtgg catcaccgaa
ccgcgccgtg cgcgggtcgt 37080cggtgagcca gagtttcagc aggccgccca ggcggcccag
gtcgccattg atgcgggcca 37140gctcgcggac gtgctcatag tccacgacgc ccgtgatttt
gtagccctgg ccgacggcca 37200gcaggtaggc cgacaggctc atgccggccg ccgccgcctt
ttcctcaatc gctcttcgtt 37260cgtctggaag gcagtacacc ttgataggtg ggctgccctt
cctggttggc ttggtttcat 37320cagccatccg cttgccctca tctgttacgc cggcggtagc
cggccagcct cgcagagcag 37380gattcccgtt gagcaccgcc aggtgcgaat aagggacagt
gaagaaggaa cacccgctcg 37440cgggtgggcc tacttcacct atcctgcccg gctgacgccg
ttggatacac caaggaaagt 37500ctacacgaac cctttggcaa aatcctgtat atcgtgcgaa
aaaggatgga tataccgaaa 37560aaatcgctat aatgaccccg aagcagggtt atgcagcgga
aaagcgctgc ttccctgctg 37620ttttgtggaa tatctaccga ctggaaacag gcaaatgcag
gaaattactg aactgagggg 37680acaggcgaga gacgatgcca aagagctaca ccgacgagct
ggccgagtgg gttgaatccc 37740gcgcggccaa gaagcgccgg cgtgatgagg ctgcggttgc
gttcctggcg gtgagggcgg 37800atgtcgaggc ggcgttagcg tccggctatg cgctcgtcac
catttgggag cacatgcggg 37860aaacggggaa ggtcaagttc tcctacgaga cgttccgctc
gcacgccagg cggcacatca 37920aggccaagcc cgccgatgtg cccgcaccgc aggccaaggc
tgcggaaccc gcgccggcac 37980ccaagacgcc ggagccacgg cggccgaagc aggggggcaa
ggctgaaaag ccggcccccg 38040ctgcggcccc gaccggcttc accttcaacc caacaccgga
caaaaaggat ctactgtaat 38100ggcgaaaatt cacatggttt tgcagggcaa gggcggggtc
ggcaagtcgg ccatcgccgc 38160gatcattgcg cagtacaaga tggacaaggg gcagacaccc
ttgtgcatcg acaccgaccc 38220ggtgaacgcg acgttcgagg gctacaaggc cctgaacgtc
cgccggctga acatcatggc 38280cggcgacgaa attaactcgc gcaacttcga caccctggtc
gagctgattg cgccgaccaa 38340ggatgacgtg gtgatcgaca acggtgccag ctcgttcgtg
cctctgtcgc attacctcat 38400cagcaaccag gtgccggctc tgctgcaaga aatggggcat
gagctggtca tccataccgt 38460cgtcaccggc ggccaggctc tcctggacac ggtgagcggc
ttcgcccagc tcgccagcca 38520gttcccggcc gaagcgcttt tcgtggtctg gctgaacccg
tattgggggc ctatcgagca 38580tgagggcaag agctttgagc agatgaaggc gtacacggcc
aacaaggccc gcgtgtcgtc 38640catcatccag attccggccc tcaaggaaga aacctacggc
cgcgatttca gcgacatgct 38700gcaagagcgg ctgacgttcg accaggcgct ggccgatgaa
tcgctcacga tcatgacgcg 38760gcaacgcctc aagatcgtgc ggcgcggcct gtttgaacag
ctcgacgcgg cggccgtgct 38820atgagcgacc agattgaaga gctgatccgg gagattgcgg
ccaagcacgg catcgccgtc 38880ggccgcgacg acccggtgct gatcctgcat accatcaacg
cccggctcat ggccgacagt 38940gcggccaagc aagaggaaat ccttgccgcg ttcaaggaag
agctggaagg gatcgcccat 39000cgttggggcg aggacgccaa ggccaaagcg gagcggatgc
tgaacgcggc cctggcggcc 39060agcaaggacg caatggcgaa ggtaatgaag gacagcgccg
cgcaggcggc cgaagcgatc 39120cgcagggaaa tcgacgacgg ccttggccgc cagctcgcgg
ccaaggtcgc ggacgcgcgg 39180cgcgtggcga tgatgaacat gatcgccggc ggcatggtgt
tgttcgcggc cgccctggtg 39240gtgtgggcct cgttatgaat cgcagaggcg cagatgaaaa
agcccggcgt tgccgggctt 39300tgtttttgcg ttagctgggc ttgtttgaca ggcccaagct
ctgactgcgc ccgcgctcgc 39360gctcctgggc ctgtttcttc tcctgctcct gcttgcgcat
cagggcctgg tgccgtcggg 39420ctgcttcacg catcgaatcc cagtcgccgg ccagctcggg
atgctccgcg cgcatcttgc 39480gcgtcgccag ttcctcgatc ttgggcgcgt gaatgcccat
gccttccttg atttcgcgca 39540ccatgtccag ccgcgtgtgc agggtctgca agcgggcttg
ctgttgggcc tgctgctgct 39600gccaggcggc ctttgtacgc ggcagggaca gcaagccggg
ggcattggac tgtagctgct 39660gcaaacgcgc ctgctgacgg tctacgagct gttctaggcg
gtcctcgatg cgctccacct 39720ggtcatgctt tgcctgcacg tagagcgcaa gggtctgctg
gtaggtctgc tcgatgggcg 39780cggattctaa gagggcctgc tgttccgtct cggcctcctg
ggccgcctgt agcaaatcct 39840cgccgctgtt gccgctggac tgctttactg ccggggactg
ctgttgccct gctcgcgccg 39900tcgtcgcagt tcggcttgcc cccactcgat tgactgcttc
atttcgagcc gcagcgatgc 39960gatctcggat tgcgtcaacg gacggggcag cgcggaggtg
tccggcttct ccttgggtga 40020gtcggtcgat gccatagcca aaggtttcct tccaaaatgc
gtccattgct ggaccgtgtt 40080tctcattgat gcccgcaagc atcttcggct tgaccgccag
gtcaagcgcg ccttcatggg 40140cggtcatgac ggacgccgcc atgaccttgc cgccgttgtt
ctcgatgtag ccgcgtaatg 40200aggcaatggt gccgcccatc gtcagcgtgt catcgacaac
gatgtacttc tggccgggga 40260tcacctcccc ctcgaaagtc gggttgaacg ccaggcgatg
atctgaaccg gctccggttc 40320gggcgacctt ctcccgctgc acaatgtccg tttcgacctc
aaggccaagg cggtcggcca 40380gaacgaccgc catcatggcc ggaatcttgt tgttccccgc
cgcctcgacg gcgaggactg 40440gaacgatgcg gggcttgtcg tcgccgatca gcgtcttgag
ctgggcaaca gtgtcgtccg 40500aaatcaggcg ctcgaccaaa ttaagcgccg cttccgcgtc
gccctgcttc gcagcctggt 40560attcaggctc gttggtcaaa gaaccaaggt cgccgttgcg
aaccaccttc gggaagtctc 40620cccacggtgc gcgctcggct ctgctgtagc tgctcaagac
gcctcccttt ttagccgcta 40680aaactctaac gagtgcgccc gcgactcaac ttgacgcttt
cggcacttac ctgtgccttg 40740ccacttgcgt cataggtgat gcttttcgca ctcccgattt
caggtacttt atcgaaatct 40800gaccgggcgt gcattacaaa gttcttcccc acctgttggt
aaatgctgcc gctatctgcg 40860tggacgatgc tgccgtcgtg gcgctgcgac ttatcggcct
tttgggccat atagatgttg 40920taaatgccag gtttcagggc cccggcttta tctaccttct
ggttcgtcca tgcgccttgg 40980ttctcggtct ggacaattct ttgcccattc atgaccagga
ggcggtgttt cattgggtga 41040ctcctgacgg ttgcctctgg tgttaaacgt gtcctggtcg
cttgccggct aaaaaaaagc 41100cgacctcggc agttcgaggc cggctttccc tagagccggg
cgcgtcaagg ttgttccatc 41160tattttagtg aactgcgttc gatttatcag ttactttcct
cccgctttgt gtttcctccc 41220actcgtttcc gcgtctagcc gacccctcaa catagcggcc
tcttcttggg ctgcctttgc 41280ctcttgccgc gcttcgtcac gctcggcttg caccgtcgta
aagcgctcgg cctgcctggc 41340cgcctcttgc gccgccaact tcctttgctc ctggtgggcc
tcggcgtcgg cctgcgcctt 41400cgctttcacc gctgccaact ccgtgcgcaa actctccgct
tcgcgcctgg tggcgtcgcg 41460ctcgccgcga agcgcctgca tttcctggtt ggccgcgtcc
agggtcttgc ggctctcttc 41520tttgaatgcg cgggcgtcct ggtgagcgta gtccagctcg
gcgcgcagct cctgcgctcg 41580acgctccacc tcgtcggccc gctgcgtcgc cagcgcggcc
cgctgctcgg ctcctgccag 41640ggcggtgcgt gcttcggcca gggcttgccg ctggcgtgcg
gccagctcgg ccgcctcggc 41700ggcctgctgc tctagcaatg taacgcgcgc ctgggcttct
tccagctcgc gggcctgcgc 41760ctcgaaggcg tcggccagct ccccgcgcac ggcttccaac
tcgttgcgct cacgatccca 41820gccggcttgc gctgcctgca acgattcatt ggcaagggcc
tgggcggctt gccagagggc 41880ggccacggcc tggttgccgg cctgctgcac cgcgtccggc
acctggactg ccagcggggc 41940ggcctgcgcc gtgcgctggc gtcgccattc gcgcatgccg
gcgctggcgt cgttcatgtt 42000gacgcgggcg gccttacgca ctgcatccac ggtcgggaag
ttctcccggt cgccttgctc 42060gaacagctcg tccgcagccg caaaaatgcg gtcgcgcgtc
tctttgttca gttccatgtt 42120ggctccggta attggtaaga ataataatac tcttacctac
cttatcagcg caagagttta 42180gctgaacagt tctcgactta acggcaggtt ttttagcggc
tgaagggcag gcaaaaaaag 42240ccccgcacgg tcggcggggg caaagggtca gcgggaaggg
gattagcggg cgtcgggctt 42300cttcatgcgt cggggccgcg cttcttggga tggagcacga
cgaagcgcgc acgcgcatcg 42360tcctcggccc tatcggcccg cgtcgcggtc aggaacttgt
cgcgcgctag gtcctccctg 42420gtgggcacca ggggcatgaa ctcggcctgc tcgatgtagg
tccactccat gaccgcatcg 42480cagtcgaggc cgcgttcctt caccgtctct tgcaggtcgc
ggtacgcccg ctcgttgagc 42540ggctggtaac gggccaattg gtcgtaaatg gctgtcggcc
atgagcggcc tttcctgttg 42600agccagcagc cgacgacgaa gccggcaatg caggcccctg
gcacaaccag gccgacgccg 42660ggggcagggg atggcagcag ctcgccaacc aggaaccccg
ccgcgatgat gccgatgccg 42720gtcaaccagc ccttgaaact atccggcccc gaaacacccc
tgcgcattgc ctggatgctg 42780cgccggatag cttgcaacat caggagccgt ttcttttgtt
cgtcagtcat ggtccgccct 42840caccagttgt tcgtatcggt gtcggacgaa ctgaaatcgc
aagagctgcc ggtatcggtc 42900cagccgctgt ccgtgtcgct gctgccgaag cacggcgagg
ggtccgcgaa cgccgcagac 42960ggcgtatccg gccgcagcgc atcgcccagc atggccccgg
tcagcgagcc gccggccagg 43020tagcccagca tggtgctgtt ggtcgccccg gccaccaggg
ccgacgtgac gaaatcgccg 43080tcattccctc tggattgttc gctgctcggc ggggcagtgc
gccgcgccgg cggcgtcgtg 43140gatggctcgg gttggctggc ctgcgacggc cggcgaaagg
tgcgcagcag ctcgttatcg 43200accggctgcg gcgtcggggc cgccgccttg cgctgcggtc
ggtgttcctt cttcggctcg 43260cgcagcttga acagcatgat cgcggaaacc agcagcaacg
ccgcgcctac gcctcccgcg 43320atgtagaaca gcatcggatt cattcttcgg tcctccttgt
agcggaaccg ttgtctgtgc 43380ggcgcgggtg gcccgcgccg ctgtctttgg ggatcagccc
tcgatgagcg cgaccagttt 43440cacgtcggca aggttcgcct cgaactcctg gccgtcgtcc
tcgtacttca accaggcata 43500gccttccgcc ggcggccgac ggttgaggat aaggcgggca
gggcgctcgt cgtgctcgac 43560ctggacgatg gcctttttca gcttgtccgg gtccggctcc
ttcgcgccct tttccttggc 43620gtccttaccg tcctggtcgc cgtcctcgcc gtcctggccg
tcgccggcct ccgcgtcacg 43680ctcggcatca gtctggccgt tgaaggcatc gacggtgttg
ggatcgcggc ccttctcgtc 43740caggaactcg cgcagcagct tgaccgtgcc gcgcgtgatt
tcctgggtgt cgtcgtcaag 43800ccacgcctcg acttcctccg ggcgcttctt gaaggccgtc
accagctcgt tcaccacggt 43860cacgtcgcgc acgcggccgg tgttgaacgc atcggcgatc
ttctccggca ggtccagcag 43920cgtgacgtgc tgggtgatga acgccggcga cttgccgatt
tccttggcga tatcgccttt 43980cttcttgccc ttcgccagct cgcggccaat gaagtcggca
atttcgcgcg gggtcagctc 44040gttgcgttgc aggttctcga taacctggtc ggcttcgttg
tagtcgttgt cgatgaacgc 44100cgggatggac ttcttgccgg cccacttcga gccacggtag
cggcgggcgc cgtgattgat 44160gatatagcgg cccggctgct cctggttctc gcgcaccgaa
atgggtgact tcaccccgcg 44220ctctttgatc gtggcaccga tttccgcgat gctctccggg
gaaaagccgg ggttgtcggc 44280cgtccgcggc tgatgcggat cttcgtcgat caggtccagg
tccagctcga tagggccgga 44340accgccctga gacgccgcag gagcgtccag gaggctcgac
aggtcgccga tgctatccaa 44400ccccaggccg gacggctgcg ccgcgcctgc ggcttcctga
gcggccgcag cggtgttttt 44460cttggtggtc ttggcttgag ccgcagtcat tgggaaatct
ccatcttcgt gaacacgtaa 44520tcagccaggg cgcgaacctc tttcgatgcc ttgcgcgcgg
ccgttttctt gatcttccag 44580accggcacac cggatgcgag ggcatcggcg atgctgctgc
gcaggccaac ggtggccgga 44640atcatcatct tggggtacgc ggccagcagc tcggcttggt
ggcgcgcgtg gcgcggattc 44700cgcgcatcga ccttgctggg caccatgcca aggaattgca
gcttggcgtt cttctggcgc 44760acgttcgcaa tggtcgtgac catcttcttg atgccctgga
tgctgtacgc ctcaagctcg 44820atgggggaca gcacatagtc ggccgcgaag agggcggccg
ccaggccgac gccaagggtc 44880ggggccgtgt cgatcaggca cacgtcgaag ccttggttcg
ccagggcctt gatgttcgcc 44940ccgaacagct cgcgggcgtc gtccagcgac agccgttcgg
cgttcgccag taccgggttg 45000gactcgatga gggcgaggcg cgcggcctgg ccgtcgccgg
ctgcgggtgc ggtttcggtc 45060cagccgccgg cagggacagc gccgaacagc ttgcttgcat
gcaggccggt agcaaagtcc 45120ttgagcgtgt aggacgcatt gccctggggg tccaggtcga
tcacggcaac ccgcaagccg 45180cgctcgaaaa agtcgaaggc aagatgcaca agggtcgaag
tcttgccgac gccgcctttc 45240tggttggccg tgaccaaagt tttcatcgtt tggtttcctg
ttttttcttg gcgtccgctt 45300cccacttccg gacgatgtac gcctgatgtt ccggcagaac
cgccgttacc cgcgcgtacc 45360cctcgggcaa gttcttgtcc tcgaacgcgg cccacacgcg
atgcaccgct tgcgacactg 45420cgcccctggt cagtcccagc gacgttgcga acgtcgcctg
tggcttccca tcgactaaga 45480cgccccgcgc tatctcgatg gtctgctgcc ccacttccag
cccctggatc gcctcctgga 45540actggctttc ggtaagccgt ttcttcatgg ataacaccca
taatttgctc cgcgccttgg 45600ttgaacatag cggtgacagc cgccagcaca tgagagaagt
ttagctaaac atttctcgca 45660cgtcaacacc tttagccgct aaaactcgtc cttggcgtaa
caaaacaaaa gcccggaaac 45720cgggctttcg tctcttgccg cttatggctc tgcacccggc
tccatcacca acaggtcgcg 45780cacgcgcttc actcggttgc ggatcgacac tgccagccca
acaaagccgg ttgccgccgc 45840cgccaggatc gcgccgatga tgccggccac accggccatc
gcccaccagg tcgccgcctt 45900ccggttccat tcctgctggt actgcttcgc aatgctggac
ctcggctcac cataggctga 45960ccgctcgatg gcgtatgccg cttctcccct tggcgtaaaa
cccagcgccg caggcggcat 46020tgccatgctg cccgccgctt tcccgaccac gacgcgcgca
ccaggcttgc ggtccagacc 46080ttcggccacg gcgagctgcg caaggacata atcagccgcc
gacttggctc cacgcgcctc 46140gatcagctct tgcactcgcg cgaaatcctt ggcctccacg
gccgccatga atcgcgcacg 46200cggcgaaggc tccgcagggc cggcgtcgtg atcgccgccg
agaatgccct tcaccaagtt 46260cgacgacacg aaaatcatgc tgacggctat caccatcatg
cagacggatc gcacgaaccc 46320gctgaattga acacgagcac ggcacccgcg accactatgc
caagaatgcc caaggtaaaa 46380attgccggcc ccgccatgaa gtccgtgaat gccccgacgg
ccgaagtgaa gggcaggccg 46440ccacccaggc cgccgccctc actgcccggc acctggtcgc
tgaatgtcga tgccagcacc 46500tgcggcacgt caatgcttcc gggcgtcgcg ctcgggctga
tcgcccatcc cgttactgcc 46560ccgatcccgg caatggcaag gactgccagc gctgccattt
ttggggtgag gccgttcgcg 46620gccgaggggc gcagcccctg gggggatggg aggcccgcgt
tagcgggccg ggagggttcg 46680agaagggggg gcacccccct tcggcgtgcg cggtcacgcg
cacagggcgc agccctggtt 46740aaaaacaagg tttataaata ttggtttaaa agcaggttaa
aagacaggtt agcggtggcc 46800gaaaaacggg cggaaaccct tgcaaatgct ggattttctg
cctgtggaca gcccctcaaa 46860tgtcaatagg tgcgcccctc atctgtcagc actctgcccc
tcaagtgtca aggatcgcgc 46920ccctcatctg tcagtagtcg cgcccctcaa gtgtcaatac
cgcagggcac ttatccccag 46980gcttgtccac atcatctgtg ggaaactcgc gtaaaatcag
gcgttttcgc cgatttgcga 47040ggctggccag ctccacgtcg ccggccgaaa tcgagcctgc
ccctcatctg tcaacgccgc 47100gccgggtgag tcggcccctc aagtgtcaac gtccgcccct
catctgtcag tgagggccaa 47160gttttccgcg aggtatccac aacgccggcg gccgcggtgt
ctcgcacacg gcttcgacgg 47220cgtttctggc gcgtttgcag ggccatagac ggccgccagc
ccagcggcga gggcaaccag 47280cccggtgagc gtcggaaagg cgctggaagc cccgtagcga
cgcggagagg ggcgagacaa 47340gccaagggcg caggctcgat gcgcagcacg acatagccgg
ttctcgcaag gacgagaatt 47400tccctgcggt gcccctcaag tgtcaatgaa agtttccaac
gcgagccatt cgcgagagcc 47460ttgagtccac gctagatgag agctttgttg taggtggacc
agttggtgat tttgaacttt 47520tgctttgcca cggaacggtc tgcgttgtcg ggaagatgcg
tgatctgatc cttcaactca 47580gcaaaagttc gatttattca acaaagccac gttgtgtctc
aaaatctctg atgttacatt 47640gcacaagata aaaatatatc atcatgaaca ataaaactgt
ctgcttacat aaacagtaat 47700acaaggggtg ttatgagcca tattcaacgg gaaacgtctt
gctcgac 477471248474DNAartificial sequencePHP16975
ZmCAS1BamProGUS/35SPAT 12tctagagctc gttcctcgag gaacggtacc tgcggggaag
cttacaataa tgtgtgttgt 60taagtcttgt tgcctgtcat cgtctgactg actttcgtca
taaatcccgg cctccgtaac 120ccagctttgg gcaagctcac ggatttgatc cggcggaacg
ggaatatcga gatgccgggc 180tgaacgctgc agttccagct ttccctttcg ggacaggtac
tccagctgat tgattatctg 240ctgaagggtc ttggttccac ctcctggcac aatgcgaatg
attacttgag cgcgatcggg 300catccaattt tctcccgtca ggtgcgtggt caagtgctac
aaggcacctt tcagtaacga 360gcgaccgtcg atccgtcgcc gggatacgga caaaatggag
cgcagtagtc catcgagggc 420ggcgaaagcc tcgccaaaag caatacgttc atctcgcaca
gcctccagat ccgatcgagg 480gtcttcggcg taggcagata gaagcatgga tacattgctt
gagagtattc cgatggactg 540aagtatggct tccatctttt ctcgtgtgtc tgcatctatt
tcgagaaagc ccccgatgcg 600gcgcaccgca acgcgaattg ccatactatc cgaaagtccc
agcaggcgcg cttgatagga 660aaaggtttca tactcggccg atcgcagacg ggcactcacg
accttgaacc cttcaacttt 720cagggatcga tgctggttga tggtagtctc actcgacgtg
gctctggtgt gttttgacat 780agcttcctcc aaagaaagcg gaaggtctgg atactccagc
acgaaatgtg cccgggtaga 840cggatggaag tctagccctg ctcaatatga aatcaacagt
acatttacag tcaatactga 900atatacttgc tacatttgca attgtcttat aacgaatgtg
aaataaaaat agtgtaacaa 960cgcttttact catcgataat cacaaaaaca tttatacgaa
caaaaataca aatgcactcc 1020ggtttcacag gataggcggg atcagaatat gcaacttttg
acgttttgtt ctttcaaagg 1080gggtgctggc aaaaccaccg cactcatggg cctttgcgct
gctttggcaa atgacggtaa 1140acgagtggcc ctctttgatg ccgacgaaaa ccggcctctg
acgcgatgga gagaaaacgc 1200cttacaaagc agtactggga tcctcgctgt gaagtctatt
ccgccgacga aatgcccctt 1260cttgaagcag cctatgaaaa tgccgagctc gaaggatttg
attatgcgtt ggccgatacg 1320cgtggcggct cgagcgagct caacaacaca atcatcgcta
gctcaaacct gcttctgatc 1380cccaccatgc taacgccgct cgacatcgat gaggcactat
ctacctaccg ctacgtcatc 1440gagctgctgt tgagtgaaaa tttggcaatt cctacagctg
ttttgcgcca acgcgtcccg 1500gtcggccgat tgacaacatc gcaacgcagg atgtcagaga
cgctagagag ccttccagtt 1560gtaccgtctc ccatgcatga aagagatgca tttgccgcga
tgaaagaacg cggcatgttg 1620catcttacat tactaaacac gggaactgat ccgacgatgc
gcctcataga gaggaatctt 1680cggattgcga tggaggaagt cgtggtcatt tcgaaactga
tcagcaaaat cttggaggct 1740tgaagatggc aattcgcaag cccgcattgt cggtcggcga
agcacggcgg cttgctggtg 1800ctcgacccga gatccaccat cccaacccga cacttgttcc
ccagaagctg gacctccagc 1860acttgcctga aaaagccgac gagaaagacc agcaacgtga
gcctctcgtc gccgatcaca 1920tttacagtcc cgatcgacaa cttaagctaa ctgtggatgc
ccttagtcca cctccgtccc 1980cgaaaaagct ccaggttttt ctttcagcgc gaccgcccgc
gcctcaagtg tcgaaaacat 2040atgacaacct cgttcggcaa tacagtccct cgaagtcgct
acaaatgatt ttaaggcgcg 2100cgttggacga tttcgaaagc atgctggcag atggatcatt
tcgcgtggcc ccgaaaagtt 2160atccgatccc ttcaactaca gaaaaatccg ttctcgttca
gacctcacgc atgttcccgg 2220ttgcgttgct cgaggtcgct cgaagtcatt ttgatccgtt
ggggttggag accgctcgag 2280ctttcggcca caagctggct accgccgcgc tcgcgtcatt
ctttgctgga gagaagccat 2340cgagcaattg gtgaagaggg acctatcgga acccctcacc
aaatattgag tgtaggtttg 2400aggccgctgg ccgcgtcctc agtcaccttt tgagccagat
aattaagagc caaatgcaat 2460tggctcaggc tgccatcgtc cccccgtgcg aaacctgcac
gtccgcgtca aagaaataac 2520cggcacctct tgctgttttt atcagttgag ggcttgacgg
atccgcctca agtttgcggc 2580gcagccgcaa aatgagaaca tctatactcc tgtcgtaaac
ctcctcgtcg cgtactcgac 2640tggcaatgag aagttgctcg cgcgatagaa cgtcgcgggg
tttctctaaa aacgcgagga 2700gaagattgaa ctcacctgcc gtaagtttca cctcaccgcc
agcttcggac atcaagcgac 2760gttgcctgag attaagtgtc cagtcagtaa aacaaaaaga
ccgtcggtct ttggagcgga 2820caacgttggg gcgcacgcgc aaggcaaccc gaatgcgtgc
aagaaactct ctcgtactaa 2880acggcttagc gataaaatca cttgctccta gctcgagtgc
aacaacttta tccgtctcct 2940caaggcggtc gccactgata attatgattg gaatatcaga
ctttgccgcc agatttcgaa 3000cgatctcaag cccatcttca cgacctaaat ttagatcaac
aaccacgaca tcgaccgtcg 3060cggaagagag tactctagtg aactgggtgc tgtcggctac
cgcggtcact ttgaaggcgt 3120ggatcgtaag gtattcgata ataagatgcc gcatagcgac
atcgtcatcg ataagaagaa 3180cgtgtttcaa cggctcacct ttcaatctaa aatctgaacc
cttgttcaca gcgcttgaga 3240aattttcacg tgaaggatgt acaatcatct ccagctaaat
gggcagttcg tcagaattgc 3300ggctgaccgc ggatgacgaa aatgcgaacc aagtatttca
attttatgac aaaagttctc 3360aatcgttgtt acaagtgaaa cgcttcgagg ttacagctac
tattgattaa ggagatcgcc 3420tatggtctcg ccccggcgtc gtgcgtccgc cgcgagccag
atctcgccta cttcataaac 3480gtcctcatag gcacggaatg gaatgatgac atcgatcgcc
gtagagagca tgtcaatcag 3540tgtgcgatct tccaagctag caccttgggc gctacttttg
acaagggaaa acagtttctt 3600gaatccttgg attggattcg cgccgtgtat tgttgaaatc
gatcccggat gtcccgagac 3660gacttcactc agataagccc atgctgcatc gtcgcgcatc
tcgccaagca atatccggtc 3720cggccgcata cgcagacttg cttggagcaa gtgctcggcg
ctcacagcac ccagcccagc 3780accgttcttg gagtagagta gtctaacatg attatcgtgt
ggaatgacga gttcgagcgt 3840atcttctatg gtgattagcc tttcctgggg ggggatggcg
ctgatcaagg tcttgctcat 3900tgttgtcttg ccgcttccgg tagggccaca tagcaacatc
gtcagtcggc tgacgacgca 3960tgcgtgcaga aacgcttcca aatccccgtt gtcaaaatgc
tgaaggatag cttcatcatc 4020ctgattttgg cgtttccttc gtgtctgcca ctggttccac
ctcgaagcat cataacggga 4080ggagacttct ttaagaccag aaacacgcga gcttggccgt
cgaatggtca agctgacggt 4140gcccgaggga acggtcggcg gcagacagat ttgtagtcgt
tcaccaccag gaagttcagt 4200ggcgcagagg gggttacgtg gtccgacatc ctgctttctc
agcgcgcccg ctaaaatagc 4260gatatcttca agatcatcat aagagacggg caaaggcatc
ttggtaaaaa tgccggcttg 4320gcgcacaaat gcctctccag gtcgattgat cgcaatttct
tcagtcttcg ggtcatcgag 4380ccattccaaa atcggcttca gaagaaagcg tagttgcgga
tccacttcca tttacaatgt 4440atcctatctc taagcggaaa tttgaattca ttaagagcgg
cggttcctcc cccgcgtggc 4500gccgccagtc aggcggagct ggtaaacacc aaagaaatcg
aggtcccgtg ctacgaaaat 4560ggaaacggtg tcaccctgat tcttcttcag ggttggcggt
atgttgatgg ttgccttaag 4620ggctgtctca gttgtctgct caccgttatt ttgaaagctg
ttgaagctca tcccgccacc 4680cgagctgccg gcgtaggtgc tagctgcctg gaaggcgcct
tgaacaacac tcaagagcat 4740agctccgcta aaacgctgcc agaagtggct gtcgaccgag
cccggcaatc ctgagcgacc 4800gagttcgtcc gcgcttggcg atgttaacga gatcatcgca
tggtcaggtg tctcggcgcg 4860atcccacaac acaaaaacgc gcccatctcc ctgttgcaag
ccacgctgta tttcgccaac 4920aacggtggtg ccacgatcaa gaagcacgat attgttcgtt
gttccacgaa tatcctgagg 4980caagacacac tttacatagc ctgccaaatt tgtgtcgatt
gcggtttgca agatgcacgg 5040aattattgtc ccttgcgtta ccataaaatc ggggtgcggc
aagagcgtgg cgctgctggg 5100ctgcagctcg gtgggtttca tacgtatcga caaatcgttc
tcgccggaca cttcgccatt 5160cggcaaggag ttgtcgtcac gcttgccttc ttgtcttcgg
cccgtgtcgc cctgaatggc 5220gcgtttgctg accccttgat cgccgctgct atatgcaaaa
atcggtgttt cttccggccg 5280tggctcatgc cgctccggtt cgcccctcgg cggtagagga
gcagcaggct gaacagcctc 5340ttgaaccgct ggaggatccg gcggcacctc aatcggagct
ggatgaaatg gcttggtgtt 5400tgttgcgatc aaagttgacg gcgatgcgtt ctcattcacc
ttcttttggc gcccacctag 5460ccaaatgagg cttaatgata acgcgagaac gacacctccg
acgatcaatt tctgagaccc 5520cgaaagacgc cggcgatgtt tgtcggagac cagggatcca
gatgcatcaa cctcatgtgc 5580cgcttgctga ctatcgttat tcatcccttc gcccccttca
ggacgcgttt cacatcgggc 5640ctcaccgtgc ccgtttgcgg cctttggcca acgggatcgt
aagcggtgtt ccagatacat 5700agtactgtgt ggccatccct cagacgccaa cctcgggaaa
ccgaagaaat ctcgacatcg 5760ctccctttaa ctgaatagtt ggcaacagct tccttgccat
caggattgat ggtgtagatg 5820gagggtatgc gtacattgcc cggaaagtgg aataccgtcg
taaatccatt gtcgaagact 5880tcgagtggca acagcgaacg atcgccttgg gcgacgtagt
gccaattact gtccgccgca 5940ccaagggctg tgacaggctg atccaataaa ttctcagctt
tccgttgata ttgtgcttcc 6000gcgtgtagtc tgtccacaac agccttctgt tgtgcctccc
ttcgccgagc cgccgcatcg 6060tcggcggggt aggcgaattg gacgctgtaa tagagatcgg
gctgctcttt atcgaggtgg 6120gacagagtct tggaacttat actgaaaaca taacggcgca
tcccggagtc gcttgcggtt 6180agcacgatta ctggctgagg cgtgaggacc tggcttgcct
tgaaaaatag ataatttccc 6240cgcggtaggg ctgctagatc tttgctattt gaaacggcaa
ccgctgtcac cgtttcgttc 6300gtggcgaatg ttacgaccaa agtagctcca accgccgtcg
agaggcgcac cacttgatcg 6360ggattgtaag ccaaataacg catgcgcgga tctagcttgc
ccgccattgg agtgtcttca 6420gcctccgcac cagtcgcagc ggcaaataaa catgctaaaa
tgaaaagtgc ttttctgatc 6480atggttcgct gtggcctacg tttgaaacgg tatcttccga
tgtctgatag gaggtgacaa 6540ccagacctgc cgggttggtt agtctcaatc tgccgggcaa
gctggtcacc ttttcgtagc 6600gaactgtcgc ggtccacgta ctcaccacag gcattttgcc
gtcaacgacg agggtccttt 6660tatagcgaat ttgctgcgtg cttggagtta catcatttga
agcgatgtgc tcgacctcca 6720ccctgccgcg tttgccaaga atgacttgag gcgaactggg
attgggatag ttgaagaatt 6780gctggtaatc ctggcgcact gttggggcac tgaagttcga
taccaggtcg taggcgtact 6840gagcggtgtc ggcatcataa ctctcgcgca ggcgaacgta
ctcccacaat gaggcgttaa 6900cgacggcctc ctcttgagtt gcaggcaatc gcgagacaga
cacctcgctg tcaacggtgc 6960cgtccggccg tatccataga tatacgggca caagcctgct
caacggcacc attgtggcta 7020tagcgaacgc ttgagcaaca tttcccaaaa tcgcgatagc
tgcgacagct gcaatgagtt 7080tggagagacg tcgcgccgat ttcgctcgcg cggtttgaaa
ggcttctact tccttatagt 7140gctcggcaag gctttcgcgc gccactagca tggcatattc
aggccccgtc atagcgtcca 7200cccgaattgc cgagctgaag atctgacgga gtaggctgcc
atcgccccac attcagcggg 7260aagatcgggc ctttgcagct cgctaatgtg tcgtttgtct
ggcagccgct caaagcgaca 7320actaggcaca gcaggcaata cttcatagaa ttctccattg
aggcgaattt ttgcgcgacc 7380tagcctcgct caacctgagc gaagcgacgg tacaagctgc
tggcagattg ggttgcgccg 7440ctccagtaac tgcctccaat gttgccggcg atcgccggca
aagcgacaat gagcgcatcc 7500cctgtcagaa aaaacatatc gagttcgtaa agaccaatga
tcttggccgc ggtcgtaccg 7560gcgaaggtga ttacaccaag cataagggtg agcgcagtcg
cttcggttag gatgacgatc 7620gttgccacga ggtttaagag gagaagcaag agaccgtagg
tgataagttg cccgatccac 7680ttagctgcga tgtcccgcgt gcgatcaaaa atatatccga
cgaggatcag aggcccgatc 7740gcgagaagca ctttcgtgag aattccaacg gcgtcgtaaa
ctccgaaggc agaccagagc 7800gtgccgtaaa ggacccactg tgccccttgg aaagcaagga
tgtcctggtc gttcatcgga 7860ccgatttcgg atgcgatttt ctgaaaaacg gcctgggtca
cggcgaacat tgtatccaac 7920tgtgccggaa cagtctgcag aggcaagccg gttacactaa
actgctgaac aaagtttggg 7980accgtctttt cgaagatgga aaccacatag tcttggtagt
tagcctgccc aacaattaga 8040gcaacaacga tggtgaccgt gatcacccga gtgataccgc
tacgggtatc gacttcgccg 8100cgtatgacta aaataccctg aacaataatc caaagagtga
cacaggcgat caatggcgca 8160ctcaccgcct cctggatagt ctcaagcatc gagtccaagc
ctgtcgtgaa ggctacatcg 8220aagatcgtat gaatggccgt aaacggcgcc ggaatcgtga
aattcatcga ttggacctga 8280acttgactgg tttgtcgcat aatgttggat aaaatgagct
cgcattcggc gaggatgcgg 8340gcggatgaac aaatcgccca gccttagggg agggcaccaa
agatgacagc ggtcttttga 8400tgctccttgc gttgagcggc cgcctcttcc gcctcgtgaa
ggccggcctg cgcggtagtc 8460atcgttaata ggcttgtcgc ctgtacattt tgaatcattg
cgtcatggat ctgcttgaga 8520agcaaaccat tggtcacggt tgcctgcatg atattgcgag
atcgggaaag ctgagcagac 8580gtatcagcat tcgccgtcaa gcgtttgtcc atcgtttcca
gattgtcagc cgcaatgcca 8640gcgctgtttg cggaaccggt gatctgcgat cgcaacaggt
ccgcttcagc atcactaccc 8700acgactgcac gatctgtatc gctggtgatc gcacgtgccg
tggtcgacat tggcattcgc 8760ggcgaaaaca tttcattgtc taggtccttc gtcgaaggat
actgattttt ctggttgagc 8820gaagtcagta gtccagtaac gccgtaggcc gacgtcaaca
tcgtaaccat cgctatagtc 8880tgagtgagat tctccgcagt cgcgagcgca gtcgcgagcg
tctcagcctc cgttgccggg 8940tcgctaacaa caaactgcgc ccgcgcgggc tgaatatata
gaaagctgca ggtcaaaact 9000gttgcaataa gttgcgtcgt cttcatcgtt tcctacctta
tcaatcttct gcctcgtggt 9060gacgggccat gaattcgctg agccagccag atgagttgcc
ttcttgtgcc tcgcgtagtc 9120gagttgcaaa gcgcaccgtg ttggcacgcc ccgaaagcac
ggcgacatat tcacgcatat 9180cccgcagatc aaattcgcag atgacgcttc cactttctcg
tttaagaaga aacttacggc 9240tgccgaccgt catgtcttca cggatcgcct gaaattcctt
ttcggtacat ttcagtccat 9300cgacataagc cgatcgatct gcggttggtg atggatagaa
aatcttcgtc atacattgcg 9360caaccaagct ggctcctagc ggcgattcca gaacatgctc
tggttgctgc gttgccagta 9420ttagcatccc gttgtttttt cgaacggtca ggaggaattt
gtcgacgaca gtcgaaaatt 9480tagggtttaa caaataggcg cgaaactcat cgcagctcat
cacaaaacgg cggccgtcga 9540tcatggctcc aatccgatgc aggagatatg ctgcagcggg
agcgcatact tcctcgtatt 9600cgagaagatg cgtcatgtcg aagccggtaa tcgacggatc
taactttact tcgtcaactt 9660cgccgtcaaa tgcccagcca agcgcatggc cccggcacca
gcgttggagc cgcgctcctg 9720cgccttcggc gggcccatgc aacaaaaatt cacgtaaccc
cgcgattgaa cgcatttgtg 9780gatcaaacga gagctgacga tggataccac ggaccagacg
gcggttctct tccggagaaa 9840tcccaccccg accatcactc tcgatgagag ccacgatcca
ttcgcgcaga aaatcgtgtg 9900aggctgctgt gttttctagg ccacgcaacg gcgccaaccc
gctgggtgtg cctctgtgaa 9960gtgccaaata tgttcctcct gtggcgcgaa ccagcaattc
gccaccccgg tccttgtcaa 10020agaacacgac cgtacctgca cggtcgacca tgctctgttc
gagcatggct agaacaaaca 10080tcatgagcgt cgtcttaccc ctcccgatag gcccgaatat
tgccgtcatg ccaacatcgt 10140gctcatgcgg gatatagtcg aaaggcgttc cgccattggt
acgaaatcgg gcaatcgcgt 10200tgccccagtg gcctgagctg gcgccctctg gaaagttttc
gaaagagaca aaccctgcga 10260aattgcgtga agtgattgcg ccagggcgtg tgcgccactt
aaaattcccc ggcaattggg 10320accaataggc cgcttccata ccaatacctt cttggacaac
cacggcacct gcatccgcca 10380ttcgtgtccg agcccgcgcg cccctgtccc caagactatt
gagatcgtct gcatagacgc 10440aaaggctcaa atgatgtgag cccataacga attcgttgct
cgcaagtgcg tcctcagcct 10500cggataattt gccgatttga gtcacggctt tatcgccgga
actcagcatc tggctcgatt 10560tgaggctaag tttcgcgtgc gcttgcgggc gagtcaggaa
cgaaaaactc tgcgtgagaa 10620caagtggaaa atcgagggat agcagcgcgt tgagcatgcc
cggccgtgtt tttgcagggt 10680attcgcgaaa cgaatagatg gatccaacgt aactgtcttt
tggcgttctg atctcgagtc 10740ctcgcttgcc gcaaatgact ctgtcggtat aaatcgaagc
gccgagtgag ccgctgacga 10800ccggaaccgg tgtgaaccga ccagtcatga tcaaccgtag
cgcttcgcca atttcggtga 10860agagcacacc ctgcttctcg cggatgccaa gacgatgcag
gccatacgct ttaagagagc 10920cagcgacaac atgccaaaga tcttccatgt tcctgatctg
gcccgtgaga tcgttttccc 10980tttttccgct tagcttggtg aacctcctct ttaccttccc
taaagccgcc tgtgggtaga 11040caatcaacgt aaggaagtgt tcattgcgga ggagttggcc
ggagagcacg cgctgttcaa 11100aagcttcgtt caggctagcg gcgaaaacac tacggaagtg
tcgcggcgcc gatgatggca 11160cgtcggcatg acgtacgagg tgagcatata ttgacacatg
atcatcagcg atattgcgca 11220acagcgtgtt gaacgcacga caacgcgcat tgcgcatttc
agtttcctca agctcgaatg 11280caacgccatc aattctcgca atggtcatga tcgatccgtc
ttcaagaagg acgatatggt 11340cgctgaggtg gccaatataa gggagataga tctcaccgga
tctttcggtc gttccactcg 11400cgccgagcat cacaccattc ctctccctcg tgggggaacc
ctaattggat ttgggctaac 11460agtagcgccc ccccaaactg cactatcaat gcttcttccc
gcggtccgca aaaatagcag 11520gacgacgctc gccgcattgt agtctcgctc cacgatgagc
cgggctgcaa accataacgg 11580cacgagaacg acttcgtaga gcgggttctg aacgataacg
atgacaaagc cggcgaacat 11640catgaataac cctgccaatg tcagtggcac cccaagaaac
aatgcgggcc gtgtggctgc 11700gaggtaaagg gtcgattctt ccaaacgatc agccatcaac
taccgccagt gagcgtttgg 11760ccgaggaagc tcgccccaaa catgataaca atgccgccga
cgacgccggc aaccagccca 11820agcgaagccc gcccgaacat ccaggagatc ccgatagcga
caatgccgag aacagcgagt 11880gactggccga acggaccaag gataaacgtg catatattgt
taaccattgt ggcggggtca 11940gtgccgccac ccgcagattg cgctgcggcg ggtccggatg
aggaaatgct ccatgcaatt 12000gcaccgcaca agcttggggc gcagctcgat atcacgcgca
tcatcgcatt cgagagcgag 12060aggcgattta gatgtaaacg gtatctctca aagcatcgca
tcaatgcgca cctccttagt 12120ataagtcgaa taagacttga ttgtcgtctg cggatttgcc
gttgtcctgg tgtggcggtg 12180gcggagcgat taaaccgcca gcgccatcct cctgcgagcg
gcgctgatat gacccccaaa 12240catcccacgt ctcttcggat tttagcgcct cgtgatcgtc
ttttggaggc tcgattaacg 12300cgggcaccag cgattgagca gctgtttcaa cttttcgcac
gtagccgttt gcaaaaccgc 12360cgatgaaatt accggtgttg taagcggaga tcgcccgacg
aagcgcaaat tgcttctcgt 12420caatcgtttc gccgcctgca taacgacttt tcagcatgtt
tgcagcggca gataatgatg 12480tgcacgcctg gagcgcaccg tcaggtgtca gaccgagcat
agaaaaattt cgagagttta 12540tttgcatgag gccaacatcc agcgaatgcc gtgcatcgag
acggtgcctg acgacttggg 12600ttgcttggct gtgatcttgc cagtgaagcg tttcgccggt
cgtgttgtca tgaatcgcta 12660aaggatcaaa gcgactctcc accttagcta tcgccgcaag
cgtagatgtc gcaactgatg 12720gggcacactt gcgagcaaca tggtcaaact cagcagatga
gagtggcgtg gcaaggctcg 12780acgaacagaa ggagaccatc aaggcaagag aaagcgaccc
cgatctctta agcatacctt 12840atctccttag ctcgcaacta acaccgcctc tcccgttgga
agaagtgcgt tgttttatgt 12900tgaagattat cgggagggtc ggttactcga aaattttcaa
ttgcttcttt atgatttcaa 12960ttgaagcgag aaacctcgcc cggcgtcttg gaacgcaaca
tggaccgaga accgcgcatc 13020catgactaag caaccggatc gacctattca ggccgcagtt
ggtcaggtca ggctcagaac 13080gaaaatgctc ggcgaggtta cgctgtctgt aaacccattc
gatgaacggg aagcttcctt 13140ccgattgctc ttggcaggaa tattggccca tgcctgcttg
cgctttgcaa atgctcttat 13200cgcgttggta tcatatgcct tgtccgccag cagaaacgca
ctctaagcga ttatttgtaa 13260aaatgtttcg gtcatgcggc ggtcatgggc ttgacccgct
gtcagcgcaa gacggatcgg 13320tcaaccgtcg gcatcgacaa cagcgtgaat cttggtggtc
aaaccgccac gggaacgtcc 13380catacagcca tcgtcttgat cccgctgttt cccgtcgccg
catgttggtg gacgcggaca 13440caggaactgt caatcatgac gacattctat cgaaagcctt
ggaaatcaca ctcagaatat 13500gatcccagac gtctgcctca cgccatcgta caaagcgatt
gtagcaggtt gtacaggaac 13560cgtatcgatc aggaacgtct gcccagggcg ggcccgtccg
gaagcgccac aagatgacat 13620tgatcacccg cgtcaacgcg cggcacgcga cgcggcttat
ttgggaacaa aggactgaac 13680aacagtccat tcgaaatcgg tgacatcaaa gcggggacgg
gttatcagtg gcctccaagt 13740caagcctcaa tgaatcaaaa tcagaccgat ttgcaaacct
gatttatgag tgtgcggcct 13800aaatgatgaa atcgtccttc tagatcgcct ccgtggtgta
gcaacacctc gcagtatcgc 13860cgtgctgacc ttggccaggg aattgactgg caagggtgct
ttcacatgac cgctcttttg 13920gccgcgatag atgatttcgt tgctgctttg ggcacgtaga
aggagagaag tcatatcgga 13980gaaattcctc ctggcgcgag agcctgctct atcgcgacgg
catcccactg tcgggaacag 14040accggatcat tcacgaggcg aaagtcgtca acacatgcgt
tataggcatc ttcccttgaa 14100ggatgatctt gttgctgcca atctggaggt gcggcagccg
caggcagatg cgatctcagc 14160gcaacttgcg gcaaaacatc tcactcacct gaaaaccact
agcgagtctc gcgatcagac 14220gaaggccttt tacttaacga cacaatatcc gatgtctgca
tcacaggcgt cgctatccca 14280gtcaatacta aagcggtgca ggaactaaag attactgatg
acttaggcgt gccacgaggc 14340ctgagacgac gcgcgtagac agttttttga aatcattatc
aaagtgatgg cctccgctga 14400agcctatcac ctctgcgccg gtctgtcgga gagatgggca
agcattatta cggtcttcgc 14460gcccgtacat gcattggacg attgcagggt caatggatct
gagatcatcc agaggattgc 14520cgcccttacc ttccgtttcg agttggagcc agcccctaaa
tgagacgaca tagtcgactt 14580gatgtgacaa tgccaagaga gagatttgct taacccgatt
tttttgctca agcgtaagcc 14640tattgaagct tgccggcatg acgtccgcgc cgaaagaata
tcctacaagt aaaacattct 14700gcacaccgaa atgcttggtg tagacatcga ttatgtgacc
aagatcctta gcagtttcgc 14760ttggggaccg ctccgaccag aaataccgaa gtgaactgac
gccaatgaca ggaatccctt 14820ccgtctgcag ataggtacca tcgatagatc tgctgcctcg
cgcgtttcgg tgatgacggt 14880gaaaacctct gacacatgca gctcccggag acggtcacag
cttgtctgta agcggatgcc 14940gggagcagac aagcccgtca gggcgcgtca gcgggtgttg
gcgggtgtcg gggcgcagcc 15000atgacccagt cacgtagcga tagcggagtg tatactggct
taactatgcg gcatcagagc 15060agattgtact gagagtgcac catatgcggt gtgaaatacc
gcacagatgc gtaaggagaa 15120aataccgcat caggcgctct tccgcttcct cgctcactga
ctcgctgcgc tcggtcgttc 15180ggctgcggcg agcggtatca gctcactcaa aggcggtaat
acggttatcc acagaatcag 15240gggataacgc aggaaagaac atgtgagcaa aaggccagca
aaaggccagg aaccgtaaaa 15300aggccgcgtt gctggcgttt ttccataggc tccgcccccc
tgacgagcat cacaaaaatc 15360gacgctcaag tcagaggtgg cgaaacccga caggactata
aagataccag gcgtttcccc 15420ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc
gcttaccgga tacctgtccg 15480cctttctccc ttcgggaagc gtggcgcttt ctcatagctc
acgctgtagg tatctcagtt 15540cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga
accccccgtt cagcccgacc 15600gctgcgcctt atccggtaac tatcgtcttg agtccaaccc
ggtaagacac gacttatcgc 15660cactggcagc agccactggt aacaggatta gcagagcgag
gtatgtaggc ggtgctacag 15720agttcttgaa gtggtggcct aactacggct acactagaag
gacagtattt ggtatctgcg 15780ctctgctgaa gccagttacc ttcggaaaaa gagttggtag
ctcttgatcc ggcaaacaaa 15840ccaccgctgg tagcggtggt ttttttgttt gcaagcagca
gattacgcgc agaaaaaaag 15900gatctcaaga agatcctttg atcttttcta cggggtctga
cgctcagtgg aacgaaaact 15960cacgttaagg gattttggtc atgagattat caaaaaggat
cttcacctag atccttttaa 16020attaaaaatg aagttttaaa tcaatctaaa gtatatatga
gtaaacttgg tctgacagtt 16080accaatgctt aatcagtgag gcacctatct cagcgatctg
tctatttcgt tcatccatag 16140ttgcctgact ccccgtcgtg tagataacta cgatacggga
gggcttacca tctggcccca 16200gtgctgcaat gataccgcga gacccacgct caccggctcc
agatttatca gcaataaacc 16260agccagccgg aagggccgag cgcagaagtg gtcctgcaac
tttatccgcc tccatccagt 16320ctattaattg ttgccgggaa gctagagtaa gtagttcgcc
agttaatagt ttgcgcaacg 16380ttgttgccat tgctgcaggg gggggggggg ggggggactt
ccattgttca ttccacggac 16440aaaaacagag aaaggaaacg acagaggcca aaaagcctcg
ctttcagcac ctgtcgtttc 16500ctttcttttc agagggtatt ttaaataaaa acattaagtt
atgacgaaga agaacggaaa 16560cgccttaaac cggaaaattt tcataaatag cgaaaacccg
cgaggtcgcc gccccgtaac 16620ctgtcggatc accggaaagg acccgtaaag tgataatgat
tatcatctac atatcacaac 16680gtgcgtggag gccatcaaac cacgtcaaat aatcaattat
gacgcaggta tcgtattaat 16740tgatctgcat caacttaacg taaaaacaac ttcagacaat
acaaatcagc gacactgaat 16800acggggcaac ctcatgtccc cccccccccc ccccctgcag
ggcatcgtgg tgtcacgctc 16860gtcgtttggt atggcttcat tcagctccgg ttcccaacga
tcaaggcgag ttacatgatc 16920ccccatgttg tgcaaaaaag cggttagctc cttcggtcct
ccgatcgttg tcagaagtaa 16980gttggccgca gtgttatcac tcatggttat ggcagcactg
cataattctc ttactgtcat 17040gccatccgta agatgctttt ctgtgactgg tgagtactca
accaagtcat tctgagaata 17100gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca
cgggataata ccgcgccaca 17160tagcagaact ttaaaagtgc tcatcattgg aaaacgttct
tcggggcgaa aactctcaag 17220gatcttaccg ctgttgagat ccagttcgat gtaacccact
cgtgcaccca actgatcttc 17280agcatctttt actttcacca gcgtttctgg gtgagcaaaa
acaggaaggc aaaatgccgc 17340aaaaaaggga ataagggcga cacggaaatg ttgaatactc
atactcttcc tttttcaata 17400ttattgaagc atttatcagg gttattgtct catgagcgga
tacatatttg aatgtattta 17460gaaaaataaa caaatagggg ttccgcgcac atttccccga
aaagtgccac ctgacgtcta 17520agaaaccatt attatcatga cattaaccta taaaaatagg
cgtatcacga ggccctttcg 17580tcttcaagaa ttggtcgacg atcttgctgc gttcggatat
tttcgtggag ttcccgccac 17640agacccggat tgaaggcgag atccagcaac tcgcgccaga
tcatcctgtg acggaacttt 17700ggcgcgtgat gactggccag gacgtcggcc gaaagagcga
caagcagatc acgcttttcg 17760acagcgtcgg atttgcgatc gaggattttt cggcgctgcg
ctacgtccgc gaccgcgttg 17820agggatcaag ccacagcagc ccactcgacc ttctagccga
cccagacgag ccaagggatc 17880tttttggaat gctgctccgt cgtcaggctt tccgacgttt
gggtggttga acagaagtca 17940ttatcgtacg gaatgccaag cactcccgag gggaaccctg
tggttggcat gcacatacaa 18000atggacgaac ggataaacct tttcacgccc ttttaaatat
ccgttattct aataaacgct 18060cttttctctt aggtttaccc gccaatatat cctgtcaaac
actgatagtt taaactgaag 18120gcgggaaacg acaatctgat catgagcgga gaattaaggg
agtcacgtta tgacccccgc 18180cgatgacgcg ggacaagccg ttttacgttt ggaactgaca
gaaccgcaac gttgaaggag 18240ccactcagcc caagcttgat atcgaattcc tgcagcccgg
gggatccggg gcggaagatg 18300gcagggacgc ggattcaggg cggacgcgct tgccgagggc
gcgggggacc acagcgtgcg 18360ttacggggac agggcgggca tcgcgaggac gggtgcggga
gcggagccac atctggtggt 18420ggacgcctac tttgctctct tatagagtag taaagattcg
tggaccaaac aacaccctag 18480cttgtacaaa tattcttagg cagttgctac tgatgagaga
aaaataacat cactccactg 18540catttgcgtg atttattgaa cagatcacaa ttacatctat
tcaaatttat ttacctgtac 18600gtgtccgatt tttaggggag gattttttta cggtattttt
tttttaaaaa aataaattta 18660ggcaacaatt ttatagaatc gagtgcttta tctattatct
tttacaaggc acacgcgtac 18720aataaggttt ggtcgttcgt gacttggata gtggttttgg
ttgcaattcc gtaattcttg 18780gcataggata cagcccaacc cagaaaaaaa taatgttgcg
gtcagttctg gctttgagat 18840tcggagtacc acgtggcgta aaggcaggcc gtgtcttaca
gatgaataaa ggacctgggt 18900ctcacgtgat tggtttccag tttcgtgcat caagatgtgg
aattttcaaa ctgccgtcgt 18960gtttgtttcg tcacataaaa gctttttgga aggctaagga
gaggaagccg gcgagaagga 19020gggggcgttt tacgtgtcac tgtcctgtcg tgttggctgt
tgacacgaat catttcttcc 19080gcgcgtggga agaagaagat gcacattagc ggcctgaagt
agagatgtca atggggaatt 19140ccccagcggg gattaactcc ccagacccgt acccatgaac
atagaccggc ccccatcccc 19200gaacccgaac ccgacctcgg gtacgaaaat cctcccatac
ccattcccga ccgggtacta 19260aatacccatg ggtatccata cccgacccga ttattcaaaa
attaatgggc tttttatttg 19320ttaaccggcg gacgcaatgc ttgggactct aggttttttt
actttgttga ccggctggcg 19380gctgggcttt ttcctacagg cccaaagttg gtcggcagcc
actaggccac acgtcacagg 19440cagcccacaa gtaaatgtcg ttggattgct ggatggtgga
ataaaaatcc tagatgctag 19500attgttctgg ttccgggtat ttttctccat ggctaatcgg
gtttgggttt agccctccca 19560aacccgaacc cgccataccc gatgggtaag ggatttattc
caaatctata cccatgggga 19620tttgttttaa cccatacctt aaccctaata gaggaattcc
ccacgggtaa tcgggtttcg 19680gggcccattg acatctctag actgaaggcg tccaactcaa
atcattaaaa agtgttgacg 19740cacgcgctga tgcgccggcc gcacagcaca ggctgcacag
cccgtttaat cagcgatgga 19800gccccggccg tcagccagcc aggtccggcg tccgggtctg
cgccctgcgg cgtcactgct 19860gtcgccaccg tctccgatgg tcccacatcc atccagcggg
ccgcgcgtgg tacaaaaggc 19920tcttcctcgc cgtcaggtgc agctgcccaa acaccagaca
cagactccac caccccgctt 19980cgatcttctg ttgcagctga aatctgtcag attctgcagt
tcattcctca tggtccgtcc 20040tgtagaaacc ccaacccgtg aaatcaaaaa actcgacggc
ctgtgggcat tcagtctgga 20100tcgcgaaaac tgtggaattg atcagcgttg gtgggaaagc
gcgttacaag aaagccgggc 20160aattgctgtg ccaggcagtt ttaacgatca gttcgccgat
gcagatattc gtaattatgc 20220gggcaacgtc tggtatcagc gcgaagtctt tataccgaaa
ggttgggcag gccagcgtat 20280cgtgctgcgt ttcgatgcgg tcactcatta cggcaaagtg
tgggtcaata atcaggaagt 20340gatggagcat cagggcggct atacgccatt tgaagccgat
gtcacgccgt atgttattgc 20400cgggaaaagt gtacgtatca ccgtttgtgt gaacaacgaa
ctgaactggc agactatccc 20460gccgggaatg gtgattaccg acgaaaacgg caagaaaaag
cagtcttact tccatgattt 20520ctttaactat gccggaatcc atcgcagcgt aatgctctac
accacgccga acacctgggt 20580ggacgatatc accgtggtga cgcatgtcgc gcaagactgt
aaccacgcgt ctgttgactg 20640ccaggtggtg gccaatggtg atgtcagcgt tgaactgcgt
gatgcggatc aacaggtggt 20700tgcaactgga caaggcacta gcgggacttt gcaagtggtg
aatccgcacc tctgccaacc 20760gggtgaaggt tatctctatg aactgtgcgt cacagccaaa
agccagacag agtgtgatat 20820ctacccgctt cgcgtcggca tccggtcagt ggcagtgaag
ggccaacagt tcctgattaa 20880ccacaaaccg ttctacttta ctggctttgg tcgtcatgaa
gatgcggact tacgtggcaa 20940aggattcgat aacgtgctga tggtgcacga ccacgcatta
atggactgga ttggggccaa 21000ctcctaccgt acctcgcatt acccttacgc tgaagagatg
ctcgactggg cagatgaaca 21060tggcatcgtg gtgattgatg aaactgctgc tgtcggcttt
aacctctctt taggcattgg 21120tttcgaagcg ggcaacaagc cgaaagaact gtacagcgaa
gaggcagtca acggggaaac 21180tcagcaagcg cacttacagg cgattaaaga gctgatagcg
cgtgacaaaa accacccaag 21240cgtggtgatg tggagtattg ccaacgaacc ggatacccgt
ccgcaagtgc acgggaatat 21300ttcgccactg gcggaagcaa cgcgtaaact cgacccgacg
cgtccgatca cctgcgtcaa 21360tgtaatgttc tgcgacgctc acaccgatac catcagcgat
ctctttgatg tgctgtgcct 21420gaaccgttat tacggatggt atgtccaaag cggcgatttg
gaaacggcag agaaggtact 21480ggaaaaagaa cttctggcct ggcaggagaa actgcatcag
ccgattatca tcaccgaata 21540cggcgtggat acgttagccg ggctgcactc aatgtacacc
gacatgtgga gtgaagagta 21600tcagtgtgca tggctggata tgtatcaccg cgtctttgat
cgcgtcagcg ccgtcgtcgg 21660tgaacaggta tggaatttcg ccgattttgc gacctcgcaa
ggcatattgc gcgttggcgg 21720taacaagaaa gggatcttca ctcgcgaccg caaaccgaag
tcggcggctt ttctgctgca 21780aaaacgctgg actggcatga acttcggtga aaaaccgcag
cagggaggca aacaatgaat 21840caacaactct cctggcgcac catcgtcggc tacagcctcg
gtgacgtggg gcaacctaga 21900cttgtccatc ttctggattg gccaacttaa ttaatgtatg
aaataaaagg atgcacacat 21960agtgacatgc taatcactat aatgtgggca tcaaagttgt
gtgttatgtg taattactag 22020ttatctgaat aaaagagaaa gagatcatcc atatttctta
tcctaaatga atgtcacgtg 22080tctttataat tctttgatga accagatgca tttcattaac
caaatccata tacatataaa 22140tattaatcat atataattaa tatcaattgg gttagcaaaa
caaatctagt ctaggtgtgt 22200tttgcgaatt gcggccccat ggagtcaaag attcaaatag
aggacctaac agaactcgcc 22260gtaaagactg gcgaacagtt catacagagt ctcttacgac
tcaatgacaa gaagaaaatc 22320ttcgtcaaca tggtggagca cgacacgctt gtctactcca
aaaatatcaa agatacagtc 22380tcagaagacc aaagggcaat tgagactttt caacaaaggg
taatatccgg aaacctcctc 22440ggattccatt gcccagctat ctgtcacttt attgtgaaga
tagtggaaaa ggaaggtggc 22500tcctacaaat gccatcattg cgataaagga aaggccatcg
ttgaagatgc ctctgccgac 22560agtggtccca aagatggacc cccacccacg aggagcatcg
tggaaaaaga agacgttcca 22620accacgtctt caaagcaagt ggattgatgt gatatctcca
ctgacgtaag ggatgacgca 22680caatcccact atccttcgca agacccttcc tctatataag
gaagttcatt tcatttggag 22740aggacagggt acccggggat ccaccatgtc tccggagagg
agaccagttg agattaggcc 22800agctacagca gctgatatgg ccgcggtttg tgatatcgtt
aaccattaca ttgagacgtc 22860tacagtgaac tttaggacag agccacaaac accacaagag
tggattgatg atctagagag 22920gttgcaagat agataccctt ggttggttgc tgaggttgag
ggtgttgtgg ctggtattgc 22980ttacgctggg ccctggaagg ctaggaacgc ttacgattgg
acagttgaga gtactgttta 23040cgtgtcacat aggcatcaaa ggttgggcct aggatccaca
ttgtacacac atttgcttaa 23100gtctatggag gcgcaaggtt ttaagtctgt ggttgctgtt
ataggccttc caaacgatcc 23160atctgttagg ttgcatgagg ctttgggata cacagcccgg
ggtacattgc gcgcagctgg 23220atacaagcat ggtggatggc atgatgttgg tttttggcaa
agggattttg agttgccagc 23280tcctccaagg ccagttaggc cagttaccca gatctgagtc
gacctgcagg catgccgctg 23340aaatcaccag tctctctcta caaatctatc tctctctata
ataatgtgtg agtagttccc 23400agataaggga attagggttc ttatagggtt tcgctcatgt
gttgagcata taagaaaccc 23460ttagtatgta tttgtatttg taaaatactt ctatcaataa
aatttctaat tcctaaaacc 23520aaaatccagt gggtaccgag ctcgaattca gtacattaaa
aacgtccgca atgtgttatt 23580aagttgtcta agcgtcaatt tgtttacacc acaatatatc
ctgccaccag ccagccaaca 23640gctccccgac cggcagctcg gcacaaaatc accactcgat
acaggcagcc catcagtccg 23700ggacggcgtc agcgggagag ccgttgtaag gcggcagact
ttgctcatgt taccgatgct 23760attcggaaga acggcaacta agctgccggg tttgaaacac
ggatgatctc gcggagggta 23820gcatgttgat tgtaacgatg acagagcgtt gctgcctgtg
atcaaatatc atctccctcg 23880cagagatccg aattatcagc cttcttattc atttctcgct
taaccgtgac aggctgtcga 23940tcttgagaac tatgccgaca taataggaaa tcgctggata
aagccgctga ggaagctgag 24000tggcgctatt tctttagaag tgaacgttga cgatcgtcga
ccgtaccccg atgaattaat 24060tcggacgtac gttctgaaca cagctggata cttacttggg
cgattgtcat acatgacatc 24120aacaatgtac ccgtttgtgt aaccgtctct tggaggttcg
tatgacacta gtggttcccc 24180tcagcttgcg actagatgtt gaggcctaac attttattag
agagcaggct agttgcttag 24240atacatgatc ttcaggccgt tatctgtcag ggcaagcgaa
aattggccat ttatgacgac 24300caatgccccg cagaagctcc catctttgcc gccatagacg
ccgcgccccc cttttggggt 24360gtagaacatc cttttgccag atgtggaaaa gaagttcgtt
gtcccattgt tggcaatgac 24420gtagtagccg gcgaaagtgc gagacccatt tgcgctatat
ataagcctac gatttccgtt 24480gcgactattg tcgtaattgg atgaactatt atcgtagttg
ctctcagagt tgtcgtaatt 24540tgatggacta ttgtcgtaat tgcttatgga gttgtcgtag
ttgcttggag aaatgtcgta 24600gttggatggg gagtagtcat agggaagacg agcttcatcc
actaaaacaa ttggcaggtc 24660agcaagtgcc tgccccgatg ccatcgcaag tacgaggctt
agaaccacct tcaacagatc 24720gcgcatagtc ttccccagct ctctaacgct tgagttaagc
cgcgccgcga agcggcgtcg 24780gcttgaacga attgttagac attatttgcc gactaccttg
gtgatctcgc ctttcacgta 24840gtgaacaaat tcttccaact gatctgcgcg cgaggccaag
cgatcttctt gtccaagata 24900agcctgccta gcttcaagta tgacgggctg atactgggcc
ggcaggcgct ccattgccca 24960gtcggcagcg acatccttcg gcgcgatttt gccggttact
gcgctgtacc aaatgcggga 25020caacgtaagc actacatttc gctcatcgcc agcccagtcg
ggcggcgagt tccatagcgt 25080taaggtttca tttagcgcct caaatagatc ctgttcagga
accggatcaa agagttcctc 25140cgccgctgga cctaccaagg caacgctatg ttctcttgct
tttgtcagca agatagccag 25200atcaatgtcg atcgtggctg gctcgaagat acctgcaaga
atgtcattgc gctgccattc 25260tccaaattgc agttcgcgct tagctggata acgccacgga
atgatgtcgt cgtgcacaac 25320aatggtgact tctacagcgc ggagaatctc gctctctcca
ggggaagccg aagtttccaa 25380aaggtcgttg atcaaagctc gccgcgttgt ttcatcaagc
cttacggtca ccgtaaccag 25440caaatcaata tcactgtgtg gcttcaggcc gccatccact
gcggagccgt acaaatgtac 25500ggccagcaac gtcggttcga gatggcgctc gatgacgcca
actacctctg atagttgagt 25560cgatacttcg gcgatcaccg cttccctcat gatgtttaac
tcctgaatta agccgcgccg 25620cgaagcggtg tcggcttgaa tgaattgtta ggcgtcatcc
tgtgctcccg agaaccagta 25680ccagtacatc gctgtttcgt tcgagacttg aggtctagtt
ttatacgtga acaggtcaat 25740gccgccgaga gtaaagccac attttgcgta caaattgcag
gcaggtacat tgttcgtttg 25800tgtctctaat cgtatgccaa ggagctgtct gcttagtgcc
cactttttcg caaattcgat 25860gagactgtgc gcgactcctt tgcctcggtg cgtgtgcgac
acaacaatgt gttcgataga 25920ggctagatcg ttccatgttg agttgagttc aatcttcccg
acaagctctt ggtcgatgaa 25980tgcgccatag caagcagagt cttcatcaga gtcatcatcc
gagatgtaat ccttccggta 26040ggggctcaca cttctggtag atagttcaaa gccttggtcg
gataggtgca catcgaacac 26100ttcacgaaca atgaaatggt tctcagcatc caatgtttcc
gccacctgct cagggatcac 26160cgaaatcttc atatgacgcc taacgcctgg cacagcggat
cgcaaacctg gcgcggcttt 26220tggcacaaaa ggcgtgacag gtttgcgaat ccgttgctgc
cacttgttaa cccttttgcc 26280agatttggta actataattt atgttagagg cgaagtcttg
ggtaaaaact ggcctaaaat 26340tgctggggat ttcaggaaag taaacatcac cttccggctc
gatgtctatt gtagatatat 26400gtagtgtatc tacttgatcg ggggatctgc tgcctcgcgc
gtttcggtga tgacggtgaa 26460aacctctgac acatgcagct cccggagacg gtcacagctt
gtctgtaagc ggatgccggg 26520agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg
ggtgtcgggg cgcagccatg 26580acccagtcac gtagcgatag cggagtgtat actggcttaa
ctatgcggca tcagagcaga 26640ttgtactgag agtgcaccat atgcggtgtg aaataccgca
cagatgcgta aggagaaaat 26700accgcatcag gcgctcttcc gcttcctcgc tcactgactc
gctgcgctcg gtcgttcggc 26760tgcggcgagc ggtatcagct cactcaaagg cggtaatacg
gttatccaca gaatcagggg 26820ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa
ggccaggaac cgtaaaaagg 26880ccgcgttgct ggcgtttttc cataggctcc gcccccctga
cgagcatcac aaaaatcgac 26940gctcaagtca gaggtggcga aacccgacag gactataaag
ataccaggcg tttccccctg 27000gaagctccct cgtgcgctct cctgttccga ccctgccgct
taccggatac ctgtccgcct 27060ttctcccttc gggaagcgtg gcgctttctc atagctcacg
ctgtaggtat ctcagttcgg 27120tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc
ccccgttcag cccgaccgct 27180gcgccttatc cggtaactat cgtcttgagt ccaacccggt
aagacacgac ttatcgccac 27240tggcagcagc cactggtaac aggattagca gagcgaggta
tgtaggcggt gctacagagt 27300tcttgaagtg gtggcctaac tacggctaca ctagaaggac
agtatttggt atctgcgctc 27360tgctgaagcc agttaccttc ggaaaaagag ttggtagctc
ttgatccggc aaacaaacca 27420ccgctggtag cggtggtttt tttgtttgca agcagcagat
tacgcgcaga aaaaaaggat 27480ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc
tcagtggaac gaaaactcac 27540gttaagggat tttggtcatg agattatcaa aaaggatctt
cacctagatc cttttaaatt 27600aaaaatgaag ttttaaatca atctaaagta tatatgagta
aacttggtct gacagttacc 27660aatgcttaat cagtgaggca cctatctcag cgatctgtct
atttcgttca tccatagttg 27720cctgactccc cgtcgtgtag ataactacga tacgggaggg
cttaccatct ggccccagtg 27780ctgcaatgat accgcgagac ccacgctcac cggctccaga
tttatcagca ataaaccagc 27840cagccggaag ggccgagcgc agaagtggtc ctgcaacttt
atccgcctcc atccagtcta 27900ttaattgttg ccgggaagct agagtaagta gttcgccagt
taatagtttg cgcaacgttg 27960ttgccattgc tgcagggggg gggggggggg gggacttcca
ttgttcattc cacggacaaa 28020aacagagaaa ggaaacgaca gaggccaaaa agcctcgctt
tcagcacctg tcgtttcctt 28080tcttttcaga gggtatttta aataaaaaca ttaagttatg
acgaagaaga acggaaacgc 28140cttaaaccgg aaaattttca taaatagcga aaacccgcga
ggtcgccgcc ccgtaacctg 28200tcggatcacc ggaaaggacc cgtaaagtga taatgattat
catctacata tcacaacgtg 28260cgtggaggcc atcaaaccac gtcaaataat caattatgac
gcaggtatcg tattaattga 28320tctgcatcaa cttaacgtaa aaacaacttc agacaataca
aatcagcgac actgaatacg 28380gggcaacctc atgtcccccc cccccccccc cctgcaggca
tcgtggtgtc acgctcgtcg 28440tttggtatgg cttcattcag ctccggttcc caacgatcaa
ggcgagttac atgatccccc 28500atgttgtgca aaaaagcggt tagctccttc ggtcctccga
tcgttgtcag aagtaagttg 28560gccgcagtgt tatcactcat ggttatggca gcactgcata
attctcttac tgtcatgcca 28620tccgtaagat gcttttctgt gactggtgag tactcaacca
agtcattctg agaatagtgt 28680atgcggcgac cgagttgctc ttgcccggcg tcaacacggg
ataataccgc gccacatagc 28740agaactttaa aagtgctcat cattggaaaa cgttcttcgg
ggcgaaaact ctcaaggatc 28800ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg
cacccaactg atcttcagca 28860tcttttactt tcaccagcgt ttctgggtga gcaaaaacag
gaaggcaaaa tgccgcaaaa 28920aagggaataa gggcgacacg gaaatgttga atactcatac
tcttcctttt tcaatattat 28980tgaagcattt atcagggtta ttgtctcatg agcggataca
tatttgaatg tatttagaaa 29040aataaacaaa taggggttcc gcgcacattt ccccgaaaag
tgccacctga cgtctaagaa 29100accattatta tcatgacatt aacctataaa aataggcgta
tcacgaggcc ctttcgtctt 29160caagaattcg gagcttttgc cattctcacc ggattcagtc
gtcactcatg gtgatttctc 29220acttgataac cttatttttg acgaggggaa attaataggt
tgtattgatg ttggacgagt 29280cggaatcgca gaccgatacc aggatcttgc catcctatgg
aactgcctcg gtgagttttc 29340tccttcatta cagaaacggc tttttcaaaa atatggtatt
gataatcctg atatgaataa 29400attgcagttt catttgatgc tcgatgagtt tttctaatca
gaattggtta attggttgta 29460acactggcag agcattacgc tgacttgacg ggacggcggc
tttgttgaat aaatcgaact 29520tttgctgagt tgaaggatca gatcacgcat cttcccgaca
acgcagaccg ttccgtggca 29580aagcaaaagt tcaaaatcac caactggtcc acctacaaca
aagctctcat caaccgtggc 29640tccctcactt tctggctgga tgatggggcg attcaggcct
ggtatgagtc agcaacacct 29700tcttcacgag gcagacctca gcgccagaag gccgccagag
aggccgagcg cggccgtgag 29760gcttggacgc tagggcaggg catgaaaaag cccgtagcgg
gctgctacgg gcgtctgacg 29820cggtggaaag ggggagggga tgttgtctac atggctctgc
tgtagtgagt gggttgcgct 29880ccggcagcgg tcctgatcaa tcgtcaccct ttctcggtcc
ttcaacgttc ctgacaacga 29940gcctcctttt cgccaatcca tcgacaatca ccgcgagtcc
ctgctcgaac gctgcgtccg 30000gaccggcttc gtcgaaggcg tctatcgcgg cccgcaacag
cggcgagagc ggagcctgtt 30060caacggtgcc gccgcgctcg ccggcatcgc tgtcgccggc
ctgctcctca agcacggccc 30120caacagtgaa gtagctgatt gtcatcagcg cattgacggc
gtccccggcc gaaaaacccg 30180cctcgcagag gaagcgaagc tgcgcgtcgg ccgtttccat
ctgcggtgcg cccggtcgcg 30240tgccggcatg gatgcgcgcg ccatcgcggt aggcgagcag
cgcctgcctg aagctgcggg 30300cattcccgat cagaaatgag cgccagtcgt cgtcggctct
cggcaccgaa tgcgtatgat 30360tctccgccag catggcttcg gccagtgcgt cgagcagcgc
ccgcttgttc ctgaagtgcc 30420agtaaagcgc cggctgctga acccccaacc gttccgccag
tttgcgtgtc gtcagaccgt 30480ctacgccgac ctcgttcaac aggtccaggg cggcacggat
cactgtattc ggctgcaact 30540ttgtcatgct tgacacttta tcactgataa acataatatg
tccaccaact tatcagtgat 30600aaagaatccg cgcgttcaat cggaccagcg gaggctggtc
cggaggccag acgtgaaacc 30660caacataccc ctgatcgtaa ttctgagcac tgtcgcgctc
gacgctgtcg gcatcggcct 30720gattatgccg gtgctgccgg gcctcctgcg cgatctggtt
cactcgaacg acgtcaccgc 30780ccactatggc attctgctgg cgctgtatgc gttggtgcaa
tttgcctgcg cacctgtgct 30840gggcgcgctg tcggatcgtt tcgggcggcg gccaatcttg
ctcgtctcgc tggccggcgc 30900cactgtcgac tacgccatca tggcgacagc gcctttcctt
tgggttctct atatcgggcg 30960gatcgtggcc ggcatcaccg gggcgactgg ggcggtagcc
ggcgcttata ttgccgatat 31020cactgatggc gatgagcgcg cgcggcactt cggcttcatg
agcgcctgtt tcgggttcgg 31080gatggtcgcg ggacctgtgc tcggtgggct gatgggcggt
ttctcccccc acgctccgtt 31140cttcgccgcg gcagccttga acggcctcaa tttcctgacg
ggctgtttcc ttttgccgga 31200gtcgcacaaa ggcgaacgcc ggccgttacg ccgggaggct
ctcaacccgc tcgcttcgtt 31260ccggtgggcc cggggcatga ccgtcgtcgc cgccctgatg
gcggtcttct tcatcatgca 31320acttgtcgga caggtgccgg ccgcgctttg ggtcattttc
ggcgaggatc gctttcactg 31380ggacgcgacc acgatcggca tttcgcttgc cgcatttggc
attctgcatt cactcgccca 31440ggcaatgatc accggccctg tagccgcccg gctcggcgaa
aggcgggcac tcatgctcgg 31500aatgattgcc gacggcacag gctacatcct gcttgccttc
gcgacacggg gatggatggc 31560gttcccgatc atggtcctgc ttgcttcggg tggcatcgga
atgccggcgc tgcaagcaat 31620gttgtccagg caggtggatg aggaacgtca ggggcagctg
caaggctcac tggcggcgct 31680caccagcctg acctcgatcg tcggacccct cctcttcacg
gcgatctatg cggcttctat 31740aacaacgtgg aacgggtggg catggattgc aggcgctgcc
ctctacttgc tctgcctgcc 31800ggcgctgcgt cgcgggcttt ggagcggcgc agggcaacga
gccgatcgct gatcgtggaa 31860acgataggcc tatgccatgc gggtcaaggc gacttccggc
aagctatacg cgccctagga 31920gtgcggttgg aacgttggcc cagccagata ctcccgatca
cgagcaggac gccgatgatt 31980tgaagcgcac tcagcgtctg atccaagaac aaccatccta
gcaacacggc ggtccccggg 32040ctgagaaagc ccagtaagga aacaactgta ggttcgagtc
gcgagatccc ccggaaccaa 32100aggaagtagg ttaaacccgc tccgatcagg ccgagccacg
ccaggccgag aacattggtt 32160cctgtaggca tcgggattgg cggatcaaac actaaagcta
ctggaacgag cagaagtcct 32220ccggccgcca gttgccaggc ggtaaaggtg agcagaggca
cgggaggttg ccacttgcgg 32280gtcagcacgg ttccgaacgc catggaaacc gcccccgcca
ggcccgctgc gacgccgaca 32340ggatctagcg ctgcgtttgg tgtcaacacc aacagcgcca
cgcccgcagt tccgcaaata 32400gcccccagga ccgccatcaa tcgtatcggg ctacctagca
gagcggcaga gatgaacacg 32460accatcagcg gctgcacagc gcctaccgtc gccgcgaccc
cgcccggcag gcggtagacc 32520gaaataaaca acaagctcca gaatagcgaa atattaagtg
cgccgaggat gaagatgcgc 32580atccaccaga ttcccgttgg aatctgtcgg acgatcatca
cgagcaataa acccgccggc 32640aacgcccgca gcagcatacc ggcgacccct cggcctcgct
gttcgggctc cacgaaaacg 32700ccggacagat gcgccttgtg agcgtccttg gggccgtcct
cctgtttgaa gaccgacagc 32760ccaatgatct cgccgtcgat gtaggcgccg aatgccacgg
catctcgcaa ccgttcagcg 32820aacgcctcca tgggcttttt ctcctcgtgc tcgtaaacgg
acccgaacat ctctggagct 32880ttcttcaggg ccgacaatcg gatctcgcgg aaatcctgca
cgtcggccgc tccaagccgt 32940cgaatctgag ccttaatcac aattgtcaat tttaatcctc
tgtttatcgg cagttcgtag 33000agcgcgccgt gcgtcccgag cgatactgag cgaagcaagt
gcgtcgagca gtgcccgctt 33060gttcctgaaa tgccagtaaa gcgctggctg ctgaaccccc
agccggaact gaccccacaa 33120ggccctagcg tttgcaatgc accaggtcat cattgaccca
ggcgtgttcc accaggccgc 33180tgcctcgcaa ctcttcgcag gcttcgccga cctgctcgcg
ccacttcttc acgcgggtgg 33240aatccgatcc gcacatgagg cggaaggttt ccagcttgag
cgggtacggc tcccggtgcg 33300agctgaaata gtcgaacatc cgtcgggccg tcggcgacag
cttgcggtac ttctcccata 33360tgaatttcgt gtagtggtcg ccagcaaaca gcacgacgat
ttcctcgtcg atcaggacct 33420ggcaacggga cgttttcttg ccacggtcca ggacgcggaa
gcggtgcagc agcgacaccg 33480attccaggtg cccaacgcgg tcggacgtga agcccatcgc
cgtcgcctgt aggcgcgaca 33540ggcattcctc ggccttcgtg taataccggc cattgatcga
ccagcccagg tcctggcaaa 33600gctcgtagaa cgtgaaggtg atcggctcgc cgataggggt
gcgcttcgcg tactccaaca 33660cctgctgcca caccagttcg tcatcgtcgg cccgcagctc
gacgccggtg taggtgatct 33720tcacgtcctt gttgacgtgg aaaatgacct tgttttgcag
cgcctcgcgc gggattttct 33780tgttgcgcgt ggtgaacagg gcagagcggg ccgtgtcgtt
tggcatcgct cgcatcgtgt 33840ccggccacgg cgcaatatcg aacaaggaaa gctgcatttc
cttgatctgc tgcttcgtgt 33900gtttcagcaa cgcggcctgc ttggcctcgc tgacctgttt
tgccaggtcc tcgccggcgg 33960tttttcgctt cttggtcgtc atagttcctc gcgtgtcgat
ggtcatcgac ttcgccaaac 34020ctgccgcctc ctgttcgaga cgacgcgaac gctccacggc
ggccgatggc gcgggcaggg 34080cagggggagc cagttgcacg ctgtcgcgct cgatcttggc
cgtagcttgc tggaccatcg 34140agccgacgga ctggaaggtt tcgcggggcg cacgcatgac
ggtgcggctt gcgatggttt 34200cggcatcctc ggcggaaaac cccgcgtcga tcagttcttg
cctgtatgcc ttccggtcaa 34260acgtccgatt cattcaccct ccttgcggga ttgccccgac
tcacgccggg gcaatgtgcc 34320cttattcctg atttgacccg cctggtgcct tggtgtccag
ataatccacc ttatcggcaa 34380tgaagtcggt cccgtagacc gtctggccgt ccttctcgta
cttggtattc cgaatcttgc 34440cctgcacgaa taccagcgac cccttgccca aatacttgcc
gtgggcctcg gcctgagagc 34500caaaacactt gatgcggaag aagtcggtgc gctcctgctt
gtcgccggca tcgttgcgcc 34560actcttcatt aaccgctata tcgaaaattg cttgcggctt
gttagaattg ccatgacgta 34620cctcggtgtc acgggtaaga ttaccgataa actggaactg
attatggctc atatcgaaag 34680tctccttgag aaaggagact ctagtttagc taaacattgg
ttccgctgtc aagaacttta 34740gcggctaaaa ttttgcgggc cgcgaccaaa ggtgcgaggg
gcggcttccg ctgtgtacaa 34800ccagatattt ttcaccaaca tccttcgtct gctcgatgag
cggggcatga cgaaacatga 34860gctgtcggag agggcagggg tttcaatttc gtttttatca
gacttaacca acggtaaggc 34920caacccctcg ttgaaggtga tggaggccat tgccgacgcc
ctggaaactc ccctacctct 34980tctcctggag tccaccgacc ttgaccgcga ggcactcgcg
gagattgcgg gtcatccttt 35040caagagcagc gtgccgcccg gatacgaacg catcagtgtg
gttttgccgt cacataaggc 35100gtttatcgta aagaaatggg gcgacgacac ccgaaaaaag
ctgcgtggaa ggctctgacg 35160ccaagggtta gggcttgcac ttccttcttt agccgctaaa
acggcccctt ctctgcgggc 35220cgtcggctcg cgcatcatat cgacatcctc aacggaagcc
gtgccgcgaa tggcatcggg 35280cgggtgcgct ttgacagttg ttttctatca gaacccctac
gtcgtgcggt tcgattagct 35340gtttgtcttg caggctaaac actttcggta tatcgtttgc
ctgtgcgata atgttgctaa 35400tgatttgttg cgtaggggtt actgaaaagt gagcgggaaa
gaagagtttc agaccatcaa 35460ggagcgggcc aagcgcaagc tggaacgcga catgggtgcg
gacctgttgg ccgcgctcaa 35520cgacccgaaa accgttgaag tcatgctcaa cgcggacggc
aaggtgtggc acgaacgcct 35580tggcgagccg atgcggtaca tctgcgacat gcggcccagc
cagtcgcagg cgattataga 35640aacggtggcc ggattccacg gcaaagaggt cacgcggcat
tcgcccatcc tggaaggcga 35700gttccccttg gatggcagcc gctttgccgg ccaattgccg
ccggtcgtgg ccgcgccaac 35760ctttgcgatc cgcaagcgcg cggtcgccat cttcacgctg
gaacagtacg tcgaggcggg 35820catcatgacc cgcgagcaat acgaggtcat taaaagcgcc
gtcgcggcgc atcgaaacat 35880cctcgtcatt ggcggtactg gctcgggcaa gaccacgctc
gtcaacgcga tcatcaatga 35940aatggtcgcc ttcaacccgt ctgagcgcgt cgtcatcatc
gaggacaccg gcgaaatcca 36000gtgcgccgca gagaacgccg tccaatacca caccagcatc
gacgtctcga tgacgctgct 36060gctcaagaca acgctgcgta tgcgccccga ccgcatcctg
gtcggtgagg tacgtggccc 36120cgaagccctt gatctgttga tggcctggaa caccgggcat
gaaggaggtg ccgccaccct 36180gcacgcaaac aaccccaaag cgggcctgag ccggctcgcc
atgcttatca gcatgcaccc 36240ggattcaccg aaacccattg agccgctgat tggcgaggcg
gttcatgtgg tcgtccatat 36300cgccaggacc cctagcggcc gtcgagtgca agaaattctc
gaagttcttg gttacgagaa 36360cggccagtac atcaccaaaa ccctgtaagg agtatttcca
atgacaacgg ctgttccgtt 36420ccgtctgacc atgaatcgcg gcattttgtt ctaccttgcc
gtgttcttcg ttctcgctct 36480cgcgttatcc gcgcatccgg cgatggcctc ggaaggcacc
ggcggcagct tgccatatga 36540gagctggctg acgaacctgc gcaactccgt aaccggcccg
gtggccttcg cgctgtccat 36600catcggcatc gtcgtcgccg gcggcgtgct gatcttcggc
ggcgaactca acgccttctt 36660ccgaaccctg atcttcctgg ttctggtgat ggcgctgctg
gtcggcgcgc agaacgtgat 36720gagcaccttc ttcggtcgtg gtgccgaaat cgcggccctc
ggcaacgggg cgctgcacca 36780ggtgcaagtc gcggcggcgg atgccgtgcg tgcggtagcg
gctggacggc tcgcctaatc 36840atggctctgc gcacgatccc catccgtcgc gcaggcaacc
gagaaaacct gttcatgggt 36900ggtgatcgtg aactggtgat gttctcgggc ctgatggcgt
ttgcgctgat tttcagcgcc 36960caagagctgc gggccaccgt ggtcggtctg atcctgtggt
tcggggcgct ctatgcgttc 37020cgaatcatgg cgaaggccga tccgaagatg cggttcgtgt
acctgcgtca ccgccggtac 37080aagccgtatt acccggcccg ctcgaccccg ttccgcgaga
acaccaatag ccaagggaag 37140caataccgat gatccaagca attgcgattg caatcgcggg
cctcggcgcg cttctgttgt 37200tcatcctctt tgcccgcatc cgcgcggtcg atgccgaact
gaaactgaaa aagcatcgtt 37260ccaaggacgc cggcctggcc gatctgctca actacgccgc
tgtcgtcgat gacggcgtaa 37320tcgtgggcaa gaacggcagc tttatggctg cctggctgta
caagggcgat gacaacgcaa 37380gcagcaccga ccagcagcgc gaagtagtgt ccgcccgcat
caaccaggcc ctcgcgggcc 37440tgggaagtgg gtggatgatc catgtggacg ccgtgcggcg
tcctgctccg aactacgcgg 37500agcggggcct gtcggcgttc cctgaccgtc tgacggcagc
gattgaagaa gagcgctcgg 37560tcttgccttg ctcgtcggtg atgtacttca ccagctccgc
gaagtcgctc ttcttgatgg 37620agcgcatggg gacgtgcttg gcaatcacgc gcaccccccg
gccgttttag cggctaaaaa 37680agtcatggct ctgccctcgg gcggaccacg cccatcatga
ccttgccaag ctcgtcctgc 37740ttctcttcga tcttcgccag cagggcgagg atcgtggcat
caccgaaccg cgccgtgcgc 37800gggtcgtcgg tgagccagag tttcagcagg ccgcccaggc
ggcccaggtc gccattgatg 37860cgggccagct cgcggacgtg ctcatagtcc acgacgcccg
tgattttgta gccctggccg 37920acggccagca ggtaggccga caggctcatg ccggccgccg
ccgccttttc ctcaatcgct 37980cttcgttcgt ctggaaggca gtacaccttg ataggtgggc
tgcccttcct ggttggcttg 38040gtttcatcag ccatccgctt gccctcatct gttacgccgg
cggtagccgg ccagcctcgc 38100agagcaggat tcccgttgag caccgccagg tgcgaataag
ggacagtgaa gaaggaacac 38160ccgctcgcgg gtgggcctac ttcacctatc ctgcccggct
gacgccgttg gatacaccaa 38220ggaaagtcta cacgaaccct ttggcaaaat cctgtatatc
gtgcgaaaaa ggatggatat 38280accgaaaaaa tcgctataat gaccccgaag cagggttatg
cagcggaaaa gcgctgcttc 38340cctgctgttt tgtggaatat ctaccgactg gaaacaggca
aatgcaggaa attactgaac 38400tgaggggaca ggcgagagac gatgccaaag agctacaccg
acgagctggc cgagtgggtt 38460gaatcccgcg cggccaagaa gcgccggcgt gatgaggctg
cggttgcgtt cctggcggtg 38520agggcggatg tcgaggcggc gttagcgtcc ggctatgcgc
tcgtcaccat ttgggagcac 38580atgcgggaaa cggggaaggt caagttctcc tacgagacgt
tccgctcgca cgccaggcgg 38640cacatcaagg ccaagcccgc cgatgtgccc gcaccgcagg
ccaaggctgc ggaacccgcg 38700ccggcaccca agacgccgga gccacggcgg ccgaagcagg
ggggcaaggc tgaaaagccg 38760gcccccgctg cggccccgac cggcttcacc ttcaacccaa
caccggacaa aaaggatcta 38820ctgtaatggc gaaaattcac atggttttgc agggcaaggg
cggggtcggc aagtcggcca 38880tcgccgcgat cattgcgcag tacaagatgg acaaggggca
gacacccttg tgcatcgaca 38940ccgacccggt gaacgcgacg ttcgagggct acaaggccct
gaacgtccgc cggctgaaca 39000tcatggccgg cgacgaaatt aactcgcgca acttcgacac
cctggtcgag ctgattgcgc 39060cgaccaagga tgacgtggtg atcgacaacg gtgccagctc
gttcgtgcct ctgtcgcatt 39120acctcatcag caaccaggtg ccggctctgc tgcaagaaat
ggggcatgag ctggtcatcc 39180ataccgtcgt caccggcggc caggctctcc tggacacggt
gagcggcttc gcccagctcg 39240ccagccagtt cccggccgaa gcgcttttcg tggtctggct
gaacccgtat tgggggccta 39300tcgagcatga gggcaagagc tttgagcaga tgaaggcgta
cacggccaac aaggcccgcg 39360tgtcgtccat catccagatt ccggccctca aggaagaaac
ctacggccgc gatttcagcg 39420acatgctgca agagcggctg acgttcgacc aggcgctggc
cgatgaatcg ctcacgatca 39480tgacgcggca acgcctcaag atcgtgcggc gcggcctgtt
tgaacagctc gacgcggcgg 39540ccgtgctatg agcgaccaga ttgaagagct gatccgggag
attgcggcca agcacggcat 39600cgccgtcggc cgcgacgacc cggtgctgat cctgcatacc
atcaacgccc ggctcatggc 39660cgacagtgcg gccaagcaag aggaaatcct tgccgcgttc
aaggaagagc tggaagggat 39720cgcccatcgt tggggcgagg acgccaaggc caaagcggag
cggatgctga acgcggccct 39780ggcggccagc aaggacgcaa tggcgaaggt aatgaaggac
agcgccgcgc aggcggccga 39840agcgatccgc agggaaatcg acgacggcct tggccgccag
ctcgcggcca aggtcgcgga 39900cgcgcggcgc gtggcgatga tgaacatgat cgccggcggc
atggtgttgt tcgcggccgc 39960cctggtggtg tgggcctcgt tatgaatcgc agaggcgcag
atgaaaaagc ccggcgttgc 40020cgggctttgt ttttgcgtta gctgggcttg tttgacaggc
ccaagctctg actgcgcccg 40080cgctcgcgct cctgggcctg tttcttctcc tgctcctgct
tgcgcatcag ggcctggtgc 40140cgtcgggctg cttcacgcat cgaatcccag tcgccggcca
gctcgggatg ctccgcgcgc 40200atcttgcgcg tcgccagttc ctcgatcttg ggcgcgtgaa
tgcccatgcc ttccttgatt 40260tcgcgcacca tgtccagccg cgtgtgcagg gtctgcaagc
gggcttgctg ttgggcctgc 40320tgctgctgcc aggcggcctt tgtacgcggc agggacagca
agccgggggc attggactgt 40380agctgctgca aacgcgcctg ctgacggtct acgagctgtt
ctaggcggtc ctcgatgcgc 40440tccacctggt catgctttgc ctgcacgtag agcgcaaggg
tctgctggta ggtctgctcg 40500atgggcgcgg attctaagag ggcctgctgt tccgtctcgg
cctcctgggc cgcctgtagc 40560aaatcctcgc cgctgttgcc gctggactgc tttactgccg
gggactgctg ttgccctgct 40620cgcgccgtcg tcgcagttcg gcttgccccc actcgattga
ctgcttcatt tcgagccgca 40680gcgatgcgat ctcggattgc gtcaacggac ggggcagcgc
ggaggtgtcc ggcttctcct 40740tgggtgagtc ggtcgatgcc atagccaaag gtttccttcc
aaaatgcgtc cattgctgga 40800ccgtgtttct cattgatgcc cgcaagcatc ttcggcttga
ccgccaggtc aagcgcgcct 40860tcatgggcgg tcatgacgga cgccgccatg accttgccgc
cgttgttctc gatgtagccg 40920cgtaatgagg caatggtgcc gcccatcgtc agcgtgtcat
cgacaacgat gtacttctgg 40980ccggggatca cctccccctc gaaagtcggg ttgaacgcca
ggcgatgatc tgaaccggct 41040ccggttcggg cgaccttctc ccgctgcaca atgtccgttt
cgacctcaag gccaaggcgg 41100tcggccagaa cgaccgccat catggccgga atcttgttgt
tccccgccgc ctcgacggcg 41160aggactggaa cgatgcgggg cttgtcgtcg ccgatcagcg
tcttgagctg ggcaacagtg 41220tcgtccgaaa tcaggcgctc gaccaaatta agcgccgctt
ccgcgtcgcc ctgcttcgca 41280gcctggtatt caggctcgtt ggtcaaagaa ccaaggtcgc
cgttgcgaac caccttcggg 41340aagtctcccc acggtgcgcg ctcggctctg ctgtagctgc
tcaagacgcc tcccttttta 41400gccgctaaaa ctctaacgag tgcgcccgcg actcaacttg
acgctttcgg cacttacctg 41460tgccttgcca cttgcgtcat aggtgatgct tttcgcactc
ccgatttcag gtactttatc 41520gaaatctgac cgggcgtgca ttacaaagtt cttccccacc
tgttggtaaa tgctgccgct 41580atctgcgtgg acgatgctgc cgtcgtggcg ctgcgactta
tcggcctttt gggccatata 41640gatgttgtaa atgccaggtt tcagggcccc ggctttatct
accttctggt tcgtccatgc 41700gccttggttc tcggtctgga caattctttg cccattcatg
accaggaggc ggtgtttcat 41760tgggtgactc ctgacggttg cctctggtgt taaacgtgtc
ctggtcgctt gccggctaaa 41820aaaaagccga cctcggcagt tcgaggccgg ctttccctag
agccgggcgc gtcaaggttg 41880ttccatctat tttagtgaac tgcgttcgat ttatcagtta
ctttcctccc gctttgtgtt 41940tcctcccact cgtttccgcg tctagccgac ccctcaacat
agcggcctct tcttgggctg 42000cctttgcctc ttgccgcgct tcgtcacgct cggcttgcac
cgtcgtaaag cgctcggcct 42060gcctggccgc ctcttgcgcc gccaacttcc tttgctcctg
gtgggcctcg gcgtcggcct 42120gcgccttcgc tttcaccgct gccaactccg tgcgcaaact
ctccgcttcg cgcctggtgg 42180cgtcgcgctc gccgcgaagc gcctgcattt cctggttggc
cgcgtccagg gtcttgcggc 42240tctcttcttt gaatgcgcgg gcgtcctggt gagcgtagtc
cagctcggcg cgcagctcct 42300gcgctcgacg ctccacctcg tcggcccgct gcgtcgccag
cgcggcccgc tgctcggctc 42360ctgccagggc ggtgcgtgct tcggccaggg cttgccgctg
gcgtgcggcc agctcggccg 42420cctcggcggc ctgctgctct agcaatgtaa cgcgcgcctg
ggcttcttcc agctcgcggg 42480cctgcgcctc gaaggcgtcg gccagctccc cgcgcacggc
ttccaactcg ttgcgctcac 42540gatcccagcc ggcttgcgct gcctgcaacg attcattggc
aagggcctgg gcggcttgcc 42600agagggcggc cacggcctgg ttgccggcct gctgcaccgc
gtccggcacc tggactgcca 42660gcggggcggc ctgcgccgtg cgctggcgtc gccattcgcg
catgccggcg ctggcgtcgt 42720tcatgttgac gcgggcggcc ttacgcactg catccacggt
cgggaagttc tcccggtcgc 42780cttgctcgaa cagctcgtcc gcagccgcaa aaatgcggtc
gcgcgtctct ttgttcagtt 42840ccatgttggc tccggtaatt ggtaagaata ataatactct
tacctacctt atcagcgcaa 42900gagtttagct gaacagttct cgacttaacg gcaggttttt
tagcggctga agggcaggca 42960aaaaaagccc cgcacggtcg gcgggggcaa agggtcagcg
ggaaggggat tagcgggcgt 43020cgggcttctt catgcgtcgg ggccgcgctt cttgggatgg
agcacgacga agcgcgcacg 43080cgcatcgtcc tcggccctat cggcccgcgt cgcggtcagg
aacttgtcgc gcgctaggtc 43140ctccctggtg ggcaccaggg gcatgaactc ggcctgctcg
atgtaggtcc actccatgac 43200cgcatcgcag tcgaggccgc gttccttcac cgtctcttgc
aggtcgcggt acgcccgctc 43260gttgagcggc tggtaacggg ccaattggtc gtaaatggct
gtcggccatg agcggccttt 43320cctgttgagc cagcagccga cgacgaagcc ggcaatgcag
gcccctggca caaccaggcc 43380gacgccgggg gcaggggatg gcagcagctc gccaaccagg
aaccccgccg cgatgatgcc 43440gatgccggtc aaccagccct tgaaactatc cggccccgaa
acacccctgc gcattgcctg 43500gatgctgcgc cggatagctt gcaacatcag gagccgtttc
ttttgttcgt cagtcatggt 43560ccgccctcac cagttgttcg tatcggtgtc ggacgaactg
aaatcgcaag agctgccggt 43620atcggtccag ccgctgtccg tgtcgctgct gccgaagcac
ggcgaggggt ccgcgaacgc 43680cgcagacggc gtatccggcc gcagcgcatc gcccagcatg
gccccggtca gcgagccgcc 43740ggccaggtag cccagcatgg tgctgttggt cgccccggcc
accagggccg acgtgacgaa 43800atcgccgtca ttccctctgg attgttcgct gctcggcggg
gcagtgcgcc gcgccggcgg 43860cgtcgtggat ggctcgggtt ggctggcctg cgacggccgg
cgaaaggtgc gcagcagctc 43920gttatcgacc ggctgcggcg tcggggccgc cgccttgcgc
tgcggtcggt gttccttctt 43980cggctcgcgc agcttgaaca gcatgatcgc ggaaaccagc
agcaacgccg cgcctacgcc 44040tcccgcgatg tagaacagca tcggattcat tcttcggtcc
tccttgtagc ggaaccgttg 44100tctgtgcggc gcgggtggcc cgcgccgctg tctttgggga
tcagccctcg atgagcgcga 44160ccagtttcac gtcggcaagg ttcgcctcga actcctggcc
gtcgtcctcg tacttcaacc 44220aggcatagcc ttccgccggc ggccgacggt tgaggataag
gcgggcaggg cgctcgtcgt 44280gctcgacctg gacgatggcc tttttcagct tgtccgggtc
cggctccttc gcgccctttt 44340ccttggcgtc cttaccgtcc tggtcgccgt cctcgccgtc
ctggccgtcg ccggcctccg 44400cgtcacgctc ggcatcagtc tggccgttga aggcatcgac
ggtgttggga tcgcggccct 44460tctcgtccag gaactcgcgc agcagcttga ccgtgccgcg
cgtgatttcc tgggtgtcgt 44520cgtcaagcca cgcctcgact tcctccgggc gcttcttgaa
ggccgtcacc agctcgttca 44580ccacggtcac gtcgcgcacg cggccggtgt tgaacgcatc
ggcgatcttc tccggcaggt 44640ccagcagcgt gacgtgctgg gtgatgaacg ccggcgactt
gccgatttcc ttggcgatat 44700cgcctttctt cttgcccttc gccagctcgc ggccaatgaa
gtcggcaatt tcgcgcgggg 44760tcagctcgtt gcgttgcagg ttctcgataa cctggtcggc
ttcgttgtag tcgttgtcga 44820tgaacgccgg gatggacttc ttgccggccc acttcgagcc
acggtagcgg cgggcgccgt 44880gattgatgat atagcggccc ggctgctcct ggttctcgcg
caccgaaatg ggtgacttca 44940ccccgcgctc tttgatcgtg gcaccgattt ccgcgatgct
ctccggggaa aagccggggt 45000tgtcggccgt ccgcggctga tgcggatctt cgtcgatcag
gtccaggtcc agctcgatag 45060ggccggaacc gccctgagac gccgcaggag cgtccaggag
gctcgacagg tcgccgatgc 45120tatccaaccc caggccggac ggctgcgccg cgcctgcggc
ttcctgagcg gccgcagcgg 45180tgtttttctt ggtggtcttg gcttgagccg cagtcattgg
gaaatctcca tcttcgtgaa 45240cacgtaatca gccagggcgc gaacctcttt cgatgccttg
cgcgcggccg ttttcttgat 45300cttccagacc ggcacaccgg atgcgagggc atcggcgatg
ctgctgcgca ggccaacggt 45360ggccggaatc atcatcttgg ggtacgcggc cagcagctcg
gcttggtggc gcgcgtggcg 45420cggattccgc gcatcgacct tgctgggcac catgccaagg
aattgcagct tggcgttctt 45480ctggcgcacg ttcgcaatgg tcgtgaccat cttcttgatg
ccctggatgc tgtacgcctc 45540aagctcgatg ggggacagca catagtcggc cgcgaagagg
gcggccgcca ggccgacgcc 45600aagggtcggg gccgtgtcga tcaggcacac gtcgaagcct
tggttcgcca gggccttgat 45660gttcgccccg aacagctcgc gggcgtcgtc cagcgacagc
cgttcggcgt tcgccagtac 45720cgggttggac tcgatgaggg cgaggcgcgc ggcctggccg
tcgccggctg cgggtgcggt 45780ttcggtccag ccgccggcag ggacagcgcc gaacagcttg
cttgcatgca ggccggtagc 45840aaagtccttg agcgtgtagg acgcattgcc ctgggggtcc
aggtcgatca cggcaacccg 45900caagccgcgc tcgaaaaagt cgaaggcaag atgcacaagg
gtcgaagtct tgccgacgcc 45960gcctttctgg ttggccgtga ccaaagtttt catcgtttgg
tttcctgttt tttcttggcg 46020tccgcttccc acttccggac gatgtacgcc tgatgttccg
gcagaaccgc cgttacccgc 46080gcgtacccct cgggcaagtt cttgtcctcg aacgcggccc
acacgcgatg caccgcttgc 46140gacactgcgc ccctggtcag tcccagcgac gttgcgaacg
tcgcctgtgg cttcccatcg 46200actaagacgc cccgcgctat ctcgatggtc tgctgcccca
cttccagccc ctggatcgcc 46260tcctggaact ggctttcggt aagccgtttc ttcatggata
acacccataa tttgctccgc 46320gccttggttg aacatagcgg tgacagccgc cagcacatga
gagaagttta gctaaacatt 46380tctcgcacgt caacaccttt agccgctaaa actcgtcctt
ggcgtaacaa aacaaaagcc 46440cggaaaccgg gctttcgtct cttgccgctt atggctctgc
acccggctcc atcaccaaca 46500ggtcgcgcac gcgcttcact cggttgcgga tcgacactgc
cagcccaaca aagccggttg 46560ccgccgccgc caggatcgcg ccgatgatgc cggccacacc
ggccatcgcc caccaggtcg 46620ccgccttccg gttccattcc tgctggtact gcttcgcaat
gctggacctc ggctcaccat 46680aggctgaccg ctcgatggcg tatgccgctt ctccccttgg
cgtaaaaccc agcgccgcag 46740gcggcattgc catgctgccc gccgctttcc cgaccacgac
gcgcgcacca ggcttgcggt 46800ccagaccttc ggccacggcg agctgcgcaa ggacataatc
agccgccgac ttggctccac 46860gcgcctcgat cagctcttgc actcgcgcga aatccttggc
ctccacggcc gccatgaatc 46920gcgcacgcgg cgaaggctcc gcagggccgg cgtcgtgatc
gccgccgaga atgcccttca 46980ccaagttcga cgacacgaaa atcatgctga cggctatcac
catcatgcag acggatcgca 47040cgaacccgct gaattgaaca cgagcacggc acccgcgacc
actatgccaa gaatgcccaa 47100ggtaaaaatt gccggccccg ccatgaagtc cgtgaatgcc
ccgacggccg aagtgaaggg 47160caggccgcca cccaggccgc cgccctcact gcccggcacc
tggtcgctga atgtcgatgc 47220cagcacctgc ggcacgtcaa tgcttccggg cgtcgcgctc
gggctgatcg cccatcccgt 47280tactgccccg atcccggcaa tggcaaggac tgccagcgct
gccatttttg gggtgaggcc 47340gttcgcggcc gaggggcgca gcccctgggg ggatgggagg
cccgcgttag cgggccggga 47400gggttcgaga agggggggca ccccccttcg gcgtgcgcgg
tcacgcgcac agggcgcagc 47460cctggttaaa aacaaggttt ataaatattg gtttaaaagc
aggttaaaag acaggttagc 47520ggtggccgaa aaacgggcgg aaacccttgc aaatgctgga
ttttctgcct gtggacagcc 47580cctcaaatgt caataggtgc gcccctcatc tgtcagcact
ctgcccctca agtgtcaagg 47640atcgcgcccc tcatctgtca gtagtcgcgc ccctcaagtg
tcaataccgc agggcactta 47700tccccaggct tgtccacatc atctgtggga aactcgcgta
aaatcaggcg ttttcgccga 47760tttgcgaggc tggccagctc cacgtcgccg gccgaaatcg
agcctgcccc tcatctgtca 47820acgccgcgcc gggtgagtcg gcccctcaag tgtcaacgtc
cgcccctcat ctgtcagtga 47880gggccaagtt ttccgcgagg tatccacaac gccggcggcc
gcggtgtctc gcacacggct 47940tcgacggcgt ttctggcgcg tttgcagggc catagacggc
cgccagccca gcggcgaggg 48000caaccagccc ggtgagcgtc ggaaaggcgc tggaagcccc
gtagcgacgc ggagaggggc 48060gagacaagcc aagggcgcag gctcgatgcg cagcacgaca
tagccggttc tcgcaaggac 48120gagaatttcc ctgcggtgcc cctcaagtgt caatgaaagt
ttccaacgcg agccattcgc 48180gagagccttg agtccacgct agatgagagc tttgttgtag
gtggaccagt tggtgatttt 48240gaacttttgc tttgccacgg aacggtctgc gttgtcggga
agatgcgtga tctgatcctt 48300caactcagca aaagttcgat ttattcaaca aagccacgtt
gtgtctcaaa atctctgatg 48360ttacattgca caagataaaa atatatcatc atgaacaata
aaactgtctg cttacataaa 48420cagtaataca aggggtgtta tgagccatat tcaacgggaa
acgtcttgct cgac 484741347505DNAartificial sequencePHP16972
(ZmCAS1HindIII ProMS45/35SPAT) 13tctagagctc gttcctcgag gaacggtacc
tgcggggaag cttacaataa tgtgtgttgt 60taagtcttgt tgcctgtcat cgtctgactg
actttcgtca taaatcccgg cctccgtaac 120ccagctttgg gcaagctcac ggatttgatc
cggcggaacg ggaatatcga gatgccgggc 180tgaacgctgc agttccagct ttccctttcg
ggacaggtac tccagctgat tgattatctg 240ctgaagggtc ttggttccac ctcctggcac
aatgcgaatg attacttgag cgcgatcggg 300catccaattt tctcccgtca ggtgcgtggt
caagtgctac aaggcacctt tcagtaacga 360gcgaccgtcg atccgtcgcc gggatacgga
caaaatggag cgcagtagtc catcgagggc 420ggcgaaagcc tcgccaaaag caatacgttc
atctcgcaca gcctccagat ccgatcgagg 480gtcttcggcg taggcagata gaagcatgga
tacattgctt gagagtattc cgatggactg 540aagtatggct tccatctttt ctcgtgtgtc
tgcatctatt tcgagaaagc ccccgatgcg 600gcgcaccgca acgcgaattg ccatactatc
cgaaagtccc agcaggcgcg cttgatagga 660aaaggtttca tactcggccg atcgcagacg
ggcactcacg accttgaacc cttcaacttt 720cagggatcga tgctggttga tggtagtctc
actcgacgtg gctctggtgt gttttgacat 780agcttcctcc aaagaaagcg gaaggtctgg
atactccagc acgaaatgtg cccgggtaga 840cggatggaag tctagccctg ctcaatatga
aatcaacagt acatttacag tcaatactga 900atatacttgc tacatttgca attgtcttat
aacgaatgtg aaataaaaat agtgtaacaa 960cgcttttact catcgataat cacaaaaaca
tttatacgaa caaaaataca aatgcactcc 1020ggtttcacag gataggcggg atcagaatat
gcaacttttg acgttttgtt ctttcaaagg 1080gggtgctggc aaaaccaccg cactcatggg
cctttgcgct gctttggcaa atgacggtaa 1140acgagtggcc ctctttgatg ccgacgaaaa
ccggcctctg acgcgatgga gagaaaacgc 1200cttacaaagc agtactggga tcctcgctgt
gaagtctatt ccgccgacga aatgcccctt 1260cttgaagcag cctatgaaaa tgccgagctc
gaaggatttg attatgcgtt ggccgatacg 1320cgtggcggct cgagcgagct caacaacaca
atcatcgcta gctcaaacct gcttctgatc 1380cccaccatgc taacgccgct cgacatcgat
gaggcactat ctacctaccg ctacgtcatc 1440gagctgctgt tgagtgaaaa tttggcaatt
cctacagctg ttttgcgcca acgcgtcccg 1500gtcggccgat tgacaacatc gcaacgcagg
atgtcagaga cgctagagag ccttccagtt 1560gtaccgtctc ccatgcatga aagagatgca
tttgccgcga tgaaagaacg cggcatgttg 1620catcttacat tactaaacac gggaactgat
ccgacgatgc gcctcataga gaggaatctt 1680cggattgcga tggaggaagt cgtggtcatt
tcgaaactga tcagcaaaat cttggaggct 1740tgaagatggc aattcgcaag cccgcattgt
cggtcggcga agcacggcgg cttgctggtg 1800ctcgacccga gatccaccat cccaacccga
cacttgttcc ccagaagctg gacctccagc 1860acttgcctga aaaagccgac gagaaagacc
agcaacgtga gcctctcgtc gccgatcaca 1920tttacagtcc cgatcgacaa cttaagctaa
ctgtggatgc ccttagtcca cctccgtccc 1980cgaaaaagct ccaggttttt ctttcagcgc
gaccgcccgc gcctcaagtg tcgaaaacat 2040atgacaacct cgttcggcaa tacagtccct
cgaagtcgct acaaatgatt ttaaggcgcg 2100cgttggacga tttcgaaagc atgctggcag
atggatcatt tcgcgtggcc ccgaaaagtt 2160atccgatccc ttcaactaca gaaaaatccg
ttctcgttca gacctcacgc atgttcccgg 2220ttgcgttgct cgaggtcgct cgaagtcatt
ttgatccgtt ggggttggag accgctcgag 2280ctttcggcca caagctggct accgccgcgc
tcgcgtcatt ctttgctgga gagaagccat 2340cgagcaattg gtgaagaggg acctatcgga
acccctcacc aaatattgag tgtaggtttg 2400aggccgctgg ccgcgtcctc agtcaccttt
tgagccagat aattaagagc caaatgcaat 2460tggctcaggc tgccatcgtc cccccgtgcg
aaacctgcac gtccgcgtca aagaaataac 2520cggcacctct tgctgttttt atcagttgag
ggcttgacgg atccgcctca agtttgcggc 2580gcagccgcaa aatgagaaca tctatactcc
tgtcgtaaac ctcctcgtcg cgtactcgac 2640tggcaatgag aagttgctcg cgcgatagaa
cgtcgcgggg tttctctaaa aacgcgagga 2700gaagattgaa ctcacctgcc gtaagtttca
cctcaccgcc agcttcggac atcaagcgac 2760gttgcctgag attaagtgtc cagtcagtaa
aacaaaaaga ccgtcggtct ttggagcgga 2820caacgttggg gcgcacgcgc aaggcaaccc
gaatgcgtgc aagaaactct ctcgtactaa 2880acggcttagc gataaaatca cttgctccta
gctcgagtgc aacaacttta tccgtctcct 2940caaggcggtc gccactgata attatgattg
gaatatcaga ctttgccgcc agatttcgaa 3000cgatctcaag cccatcttca cgacctaaat
ttagatcaac aaccacgaca tcgaccgtcg 3060cggaagagag tactctagtg aactgggtgc
tgtcggctac cgcggtcact ttgaaggcgt 3120ggatcgtaag gtattcgata ataagatgcc
gcatagcgac atcgtcatcg ataagaagaa 3180cgtgtttcaa cggctcacct ttcaatctaa
aatctgaacc cttgttcaca gcgcttgaga 3240aattttcacg tgaaggatgt acaatcatct
ccagctaaat gggcagttcg tcagaattgc 3300ggctgaccgc ggatgacgaa aatgcgaacc
aagtatttca attttatgac aaaagttctc 3360aatcgttgtt acaagtgaaa cgcttcgagg
ttacagctac tattgattaa ggagatcgcc 3420tatggtctcg ccccggcgtc gtgcgtccgc
cgcgagccag atctcgccta cttcataaac 3480gtcctcatag gcacggaatg gaatgatgac
atcgatcgcc gtagagagca tgtcaatcag 3540tgtgcgatct tccaagctag caccttgggc
gctacttttg acaagggaaa acagtttctt 3600gaatccttgg attggattcg cgccgtgtat
tgttgaaatc gatcccggat gtcccgagac 3660gacttcactc agataagccc atgctgcatc
gtcgcgcatc tcgccaagca atatccggtc 3720cggccgcata cgcagacttg cttggagcaa
gtgctcggcg ctcacagcac ccagcccagc 3780accgttcttg gagtagagta gtctaacatg
attatcgtgt ggaatgacga gttcgagcgt 3840atcttctatg gtgattagcc tttcctgggg
ggggatggcg ctgatcaagg tcttgctcat 3900tgttgtcttg ccgcttccgg tagggccaca
tagcaacatc gtcagtcggc tgacgacgca 3960tgcgtgcaga aacgcttcca aatccccgtt
gtcaaaatgc tgaaggatag cttcatcatc 4020ctgattttgg cgtttccttc gtgtctgcca
ctggttccac ctcgaagcat cataacggga 4080ggagacttct ttaagaccag aaacacgcga
gcttggccgt cgaatggtca agctgacggt 4140gcccgaggga acggtcggcg gcagacagat
ttgtagtcgt tcaccaccag gaagttcagt 4200ggcgcagagg gggttacgtg gtccgacatc
ctgctttctc agcgcgcccg ctaaaatagc 4260gatatcttca agatcatcat aagagacggg
caaaggcatc ttggtaaaaa tgccggcttg 4320gcgcacaaat gcctctccag gtcgattgat
cgcaatttct tcagtcttcg ggtcatcgag 4380ccattccaaa atcggcttca gaagaaagcg
tagttgcgga tccacttcca tttacaatgt 4440atcctatctc taagcggaaa tttgaattca
ttaagagcgg cggttcctcc cccgcgtggc 4500gccgccagtc aggcggagct ggtaaacacc
aaagaaatcg aggtcccgtg ctacgaaaat 4560ggaaacggtg tcaccctgat tcttcttcag
ggttggcggt atgttgatgg ttgccttaag 4620ggctgtctca gttgtctgct caccgttatt
ttgaaagctg ttgaagctca tcccgccacc 4680cgagctgccg gcgtaggtgc tagctgcctg
gaaggcgcct tgaacaacac tcaagagcat 4740agctccgcta aaacgctgcc agaagtggct
gtcgaccgag cccggcaatc ctgagcgacc 4800gagttcgtcc gcgcttggcg atgttaacga
gatcatcgca tggtcaggtg tctcggcgcg 4860atcccacaac acaaaaacgc gcccatctcc
ctgttgcaag ccacgctgta tttcgccaac 4920aacggtggtg ccacgatcaa gaagcacgat
attgttcgtt gttccacgaa tatcctgagg 4980caagacacac tttacatagc ctgccaaatt
tgtgtcgatt gcggtttgca agatgcacgg 5040aattattgtc ccttgcgtta ccataaaatc
ggggtgcggc aagagcgtgg cgctgctggg 5100ctgcagctcg gtgggtttca tacgtatcga
caaatcgttc tcgccggaca cttcgccatt 5160cggcaaggag ttgtcgtcac gcttgccttc
ttgtcttcgg cccgtgtcgc cctgaatggc 5220gcgtttgctg accccttgat cgccgctgct
atatgcaaaa atcggtgttt cttccggccg 5280tggctcatgc cgctccggtt cgcccctcgg
cggtagagga gcagcaggct gaacagcctc 5340ttgaaccgct ggaggatccg gcggcacctc
aatcggagct ggatgaaatg gcttggtgtt 5400tgttgcgatc aaagttgacg gcgatgcgtt
ctcattcacc ttcttttggc gcccacctag 5460ccaaatgagg cttaatgata acgcgagaac
gacacctccg acgatcaatt tctgagaccc 5520cgaaagacgc cggcgatgtt tgtcggagac
cagggatcca gatgcatcaa cctcatgtgc 5580cgcttgctga ctatcgttat tcatcccttc
gcccccttca ggacgcgttt cacatcgggc 5640ctcaccgtgc ccgtttgcgg cctttggcca
acgggatcgt aagcggtgtt ccagatacat 5700agtactgtgt ggccatccct cagacgccaa
cctcgggaaa ccgaagaaat ctcgacatcg 5760ctccctttaa ctgaatagtt ggcaacagct
tccttgccat caggattgat ggtgtagatg 5820gagggtatgc gtacattgcc cggaaagtgg
aataccgtcg taaatccatt gtcgaagact 5880tcgagtggca acagcgaacg atcgccttgg
gcgacgtagt gccaattact gtccgccgca 5940ccaagggctg tgacaggctg atccaataaa
ttctcagctt tccgttgata ttgtgcttcc 6000gcgtgtagtc tgtccacaac agccttctgt
tgtgcctccc ttcgccgagc cgccgcatcg 6060tcggcggggt aggcgaattg gacgctgtaa
tagagatcgg gctgctcttt atcgaggtgg 6120gacagagtct tggaacttat actgaaaaca
taacggcgca tcccggagtc gcttgcggtt 6180agcacgatta ctggctgagg cgtgaggacc
tggcttgcct tgaaaaatag ataatttccc 6240cgcggtaggg ctgctagatc tttgctattt
gaaacggcaa ccgctgtcac cgtttcgttc 6300gtggcgaatg ttacgaccaa agtagctcca
accgccgtcg agaggcgcac cacttgatcg 6360ggattgtaag ccaaataacg catgcgcgga
tctagcttgc ccgccattgg agtgtcttca 6420gcctccgcac cagtcgcagc ggcaaataaa
catgctaaaa tgaaaagtgc ttttctgatc 6480atggttcgct gtggcctacg tttgaaacgg
tatcttccga tgtctgatag gaggtgacaa 6540ccagacctgc cgggttggtt agtctcaatc
tgccgggcaa gctggtcacc ttttcgtagc 6600gaactgtcgc ggtccacgta ctcaccacag
gcattttgcc gtcaacgacg agggtccttt 6660tatagcgaat ttgctgcgtg cttggagtta
catcatttga agcgatgtgc tcgacctcca 6720ccctgccgcg tttgccaaga atgacttgag
gcgaactggg attgggatag ttgaagaatt 6780gctggtaatc ctggcgcact gttggggcac
tgaagttcga taccaggtcg taggcgtact 6840gagcggtgtc ggcatcataa ctctcgcgca
ggcgaacgta ctcccacaat gaggcgttaa 6900cgacggcctc ctcttgagtt gcaggcaatc
gcgagacaga cacctcgctg tcaacggtgc 6960cgtccggccg tatccataga tatacgggca
caagcctgct caacggcacc attgtggcta 7020tagcgaacgc ttgagcaaca tttcccaaaa
tcgcgatagc tgcgacagct gcaatgagtt 7080tggagagacg tcgcgccgat ttcgctcgcg
cggtttgaaa ggcttctact tccttatagt 7140gctcggcaag gctttcgcgc gccactagca
tggcatattc aggccccgtc atagcgtcca 7200cccgaattgc cgagctgaag atctgacgga
gtaggctgcc atcgccccac attcagcggg 7260aagatcgggc ctttgcagct cgctaatgtg
tcgtttgtct ggcagccgct caaagcgaca 7320actaggcaca gcaggcaata cttcatagaa
ttctccattg aggcgaattt ttgcgcgacc 7380tagcctcgct caacctgagc gaagcgacgg
tacaagctgc tggcagattg ggttgcgccg 7440ctccagtaac tgcctccaat gttgccggcg
atcgccggca aagcgacaat gagcgcatcc 7500cctgtcagaa aaaacatatc gagttcgtaa
agaccaatga tcttggccgc ggtcgtaccg 7560gcgaaggtga ttacaccaag cataagggtg
agcgcagtcg cttcggttag gatgacgatc 7620gttgccacga ggtttaagag gagaagcaag
agaccgtagg tgataagttg cccgatccac 7680ttagctgcga tgtcccgcgt gcgatcaaaa
atatatccga cgaggatcag aggcccgatc 7740gcgagaagca ctttcgtgag aattccaacg
gcgtcgtaaa ctccgaaggc agaccagagc 7800gtgccgtaaa ggacccactg tgccccttgg
aaagcaagga tgtcctggtc gttcatcgga 7860ccgatttcgg atgcgatttt ctgaaaaacg
gcctgggtca cggcgaacat tgtatccaac 7920tgtgccggaa cagtctgcag aggcaagccg
gttacactaa actgctgaac aaagtttggg 7980accgtctttt cgaagatgga aaccacatag
tcttggtagt tagcctgccc aacaattaga 8040gcaacaacga tggtgaccgt gatcacccga
gtgataccgc tacgggtatc gacttcgccg 8100cgtatgacta aaataccctg aacaataatc
caaagagtga cacaggcgat caatggcgca 8160ctcaccgcct cctggatagt ctcaagcatc
gagtccaagc ctgtcgtgaa ggctacatcg 8220aagatcgtat gaatggccgt aaacggcgcc
ggaatcgtga aattcatcga ttggacctga 8280acttgactgg tttgtcgcat aatgttggat
aaaatgagct cgcattcggc gaggatgcgg 8340gcggatgaac aaatcgccca gccttagggg
agggcaccaa agatgacagc ggtcttttga 8400tgctccttgc gttgagcggc cgcctcttcc
gcctcgtgaa ggccggcctg cgcggtagtc 8460atcgttaata ggcttgtcgc ctgtacattt
tgaatcattg cgtcatggat ctgcttgaga 8520agcaaaccat tggtcacggt tgcctgcatg
atattgcgag atcgggaaag ctgagcagac 8580gtatcagcat tcgccgtcaa gcgtttgtcc
atcgtttcca gattgtcagc cgcaatgcca 8640gcgctgtttg cggaaccggt gatctgcgat
cgcaacaggt ccgcttcagc atcactaccc 8700acgactgcac gatctgtatc gctggtgatc
gcacgtgccg tggtcgacat tggcattcgc 8760ggcgaaaaca tttcattgtc taggtccttc
gtcgaaggat actgattttt ctggttgagc 8820gaagtcagta gtccagtaac gccgtaggcc
gacgtcaaca tcgtaaccat cgctatagtc 8880tgagtgagat tctccgcagt cgcgagcgca
gtcgcgagcg tctcagcctc cgttgccggg 8940tcgctaacaa caaactgcgc ccgcgcgggc
tgaatatata gaaagctgca ggtcaaaact 9000gttgcaataa gttgcgtcgt cttcatcgtt
tcctacctta tcaatcttct gcctcgtggt 9060gacgggccat gaattcgctg agccagccag
atgagttgcc ttcttgtgcc tcgcgtagtc 9120gagttgcaaa gcgcaccgtg ttggcacgcc
ccgaaagcac ggcgacatat tcacgcatat 9180cccgcagatc aaattcgcag atgacgcttc
cactttctcg tttaagaaga aacttacggc 9240tgccgaccgt catgtcttca cggatcgcct
gaaattcctt ttcggtacat ttcagtccat 9300cgacataagc cgatcgatct gcggttggtg
atggatagaa aatcttcgtc atacattgcg 9360caaccaagct ggctcctagc ggcgattcca
gaacatgctc tggttgctgc gttgccagta 9420ttagcatccc gttgtttttt cgaacggtca
ggaggaattt gtcgacgaca gtcgaaaatt 9480tagggtttaa caaataggcg cgaaactcat
cgcagctcat cacaaaacgg cggccgtcga 9540tcatggctcc aatccgatgc aggagatatg
ctgcagcggg agcgcatact tcctcgtatt 9600cgagaagatg cgtcatgtcg aagccggtaa
tcgacggatc taactttact tcgtcaactt 9660cgccgtcaaa tgcccagcca agcgcatggc
cccggcacca gcgttggagc cgcgctcctg 9720cgccttcggc gggcccatgc aacaaaaatt
cacgtaaccc cgcgattgaa cgcatttgtg 9780gatcaaacga gagctgacga tggataccac
ggaccagacg gcggttctct tccggagaaa 9840tcccaccccg accatcactc tcgatgagag
ccacgatcca ttcgcgcaga aaatcgtgtg 9900aggctgctgt gttttctagg ccacgcaacg
gcgccaaccc gctgggtgtg cctctgtgaa 9960gtgccaaata tgttcctcct gtggcgcgaa
ccagcaattc gccaccccgg tccttgtcaa 10020agaacacgac cgtacctgca cggtcgacca
tgctctgttc gagcatggct agaacaaaca 10080tcatgagcgt cgtcttaccc ctcccgatag
gcccgaatat tgccgtcatg ccaacatcgt 10140gctcatgcgg gatatagtcg aaaggcgttc
cgccattggt acgaaatcgg gcaatcgcgt 10200tgccccagtg gcctgagctg gcgccctctg
gaaagttttc gaaagagaca aaccctgcga 10260aattgcgtga agtgattgcg ccagggcgtg
tgcgccactt aaaattcccc ggcaattggg 10320accaataggc cgcttccata ccaatacctt
cttggacaac cacggcacct gcatccgcca 10380ttcgtgtccg agcccgcgcg cccctgtccc
caagactatt gagatcgtct gcatagacgc 10440aaaggctcaa atgatgtgag cccataacga
attcgttgct cgcaagtgcg tcctcagcct 10500cggataattt gccgatttga gtcacggctt
tatcgccgga actcagcatc tggctcgatt 10560tgaggctaag tttcgcgtgc gcttgcgggc
gagtcaggaa cgaaaaactc tgcgtgagaa 10620caagtggaaa atcgagggat agcagcgcgt
tgagcatgcc cggccgtgtt tttgcagggt 10680attcgcgaaa cgaatagatg gatccaacgt
aactgtcttt tggcgttctg atctcgagtc 10740ctcgcttgcc gcaaatgact ctgtcggtat
aaatcgaagc gccgagtgag ccgctgacga 10800ccggaaccgg tgtgaaccga ccagtcatga
tcaaccgtag cgcttcgcca atttcggtga 10860agagcacacc ctgcttctcg cggatgccaa
gacgatgcag gccatacgct ttaagagagc 10920cagcgacaac atgccaaaga tcttccatgt
tcctgatctg gcccgtgaga tcgttttccc 10980tttttccgct tagcttggtg aacctcctct
ttaccttccc taaagccgcc tgtgggtaga 11040caatcaacgt aaggaagtgt tcattgcgga
ggagttggcc ggagagcacg cgctgttcaa 11100aagcttcgtt caggctagcg gcgaaaacac
tacggaagtg tcgcggcgcc gatgatggca 11160cgtcggcatg acgtacgagg tgagcatata
ttgacacatg atcatcagcg atattgcgca 11220acagcgtgtt gaacgcacga caacgcgcat
tgcgcatttc agtttcctca agctcgaatg 11280caacgccatc aattctcgca atggtcatga
tcgatccgtc ttcaagaagg acgatatggt 11340cgctgaggtg gccaatataa gggagataga
tctcaccgga tctttcggtc gttccactcg 11400cgccgagcat cacaccattc ctctccctcg
tgggggaacc ctaattggat ttgggctaac 11460agtagcgccc ccccaaactg cactatcaat
gcttcttccc gcggtccgca aaaatagcag 11520gacgacgctc gccgcattgt agtctcgctc
cacgatgagc cgggctgcaa accataacgg 11580cacgagaacg acttcgtaga gcgggttctg
aacgataacg atgacaaagc cggcgaacat 11640catgaataac cctgccaatg tcagtggcac
cccaagaaac aatgcgggcc gtgtggctgc 11700gaggtaaagg gtcgattctt ccaaacgatc
agccatcaac taccgccagt gagcgtttgg 11760ccgaggaagc tcgccccaaa catgataaca
atgccgccga cgacgccggc aaccagccca 11820agcgaagccc gcccgaacat ccaggagatc
ccgatagcga caatgccgag aacagcgagt 11880gactggccga acggaccaag gataaacgtg
catatattgt taaccattgt ggcggggtca 11940gtgccgccac ccgcagattg cgctgcggcg
ggtccggatg aggaaatgct ccatgcaatt 12000gcaccgcaca agcttggggc gcagctcgat
atcacgcgca tcatcgcatt cgagagcgag 12060aggcgattta gatgtaaacg gtatctctca
aagcatcgca tcaatgcgca cctccttagt 12120ataagtcgaa taagacttga ttgtcgtctg
cggatttgcc gttgtcctgg tgtggcggtg 12180gcggagcgat taaaccgcca gcgccatcct
cctgcgagcg gcgctgatat gacccccaaa 12240catcccacgt ctcttcggat tttagcgcct
cgtgatcgtc ttttggaggc tcgattaacg 12300cgggcaccag cgattgagca gctgtttcaa
cttttcgcac gtagccgttt gcaaaaccgc 12360cgatgaaatt accggtgttg taagcggaga
tcgcccgacg aagcgcaaat tgcttctcgt 12420caatcgtttc gccgcctgca taacgacttt
tcagcatgtt tgcagcggca gataatgatg 12480tgcacgcctg gagcgcaccg tcaggtgtca
gaccgagcat agaaaaattt cgagagttta 12540tttgcatgag gccaacatcc agcgaatgcc
gtgcatcgag acggtgcctg acgacttggg 12600ttgcttggct gtgatcttgc cagtgaagcg
tttcgccggt cgtgttgtca tgaatcgcta 12660aaggatcaaa gcgactctcc accttagcta
tcgccgcaag cgtagatgtc gcaactgatg 12720gggcacactt gcgagcaaca tggtcaaact
cagcagatga gagtggcgtg gcaaggctcg 12780acgaacagaa ggagaccatc aaggcaagag
aaagcgaccc cgatctctta agcatacctt 12840atctccttag ctcgcaacta acaccgcctc
tcccgttgga agaagtgcgt tgttttatgt 12900tgaagattat cgggagggtc ggttactcga
aaattttcaa ttgcttcttt atgatttcaa 12960ttgaagcgag aaacctcgcc cggcgtcttg
gaacgcaaca tggaccgaga accgcgcatc 13020catgactaag caaccggatc gacctattca
ggccgcagtt ggtcaggtca ggctcagaac 13080gaaaatgctc ggcgaggtta cgctgtctgt
aaacccattc gatgaacggg aagcttcctt 13140ccgattgctc ttggcaggaa tattggccca
tgcctgcttg cgctttgcaa atgctcttat 13200cgcgttggta tcatatgcct tgtccgccag
cagaaacgca ctctaagcga ttatttgtaa 13260aaatgtttcg gtcatgcggc ggtcatgggc
ttgacccgct gtcagcgcaa gacggatcgg 13320tcaaccgtcg gcatcgacaa cagcgtgaat
cttggtggtc aaaccgccac gggaacgtcc 13380catacagcca tcgtcttgat cccgctgttt
cccgtcgccg catgttggtg gacgcggaca 13440caggaactgt caatcatgac gacattctat
cgaaagcctt ggaaatcaca ctcagaatat 13500gatcccagac gtctgcctca cgccatcgta
caaagcgatt gtagcaggtt gtacaggaac 13560cgtatcgatc aggaacgtct gcccagggcg
ggcccgtccg gaagcgccac aagatgacat 13620tgatcacccg cgtcaacgcg cggcacgcga
cgcggcttat ttgggaacaa aggactgaac 13680aacagtccat tcgaaatcgg tgacatcaaa
gcggggacgg gttatcagtg gcctccaagt 13740caagcctcaa tgaatcaaaa tcagaccgat
ttgcaaacct gatttatgag tgtgcggcct 13800aaatgatgaa atcgtccttc tagatcgcct
ccgtggtgta gcaacacctc gcagtatcgc 13860cgtgctgacc ttggccaggg aattgactgg
caagggtgct ttcacatgac cgctcttttg 13920gccgcgatag atgatttcgt tgctgctttg
ggcacgtaga aggagagaag tcatatcgga 13980gaaattcctc ctggcgcgag agcctgctct
atcgcgacgg catcccactg tcgggaacag 14040accggatcat tcacgaggcg aaagtcgtca
acacatgcgt tataggcatc ttcccttgaa 14100ggatgatctt gttgctgcca atctggaggt
gcggcagccg caggcagatg cgatctcagc 14160gcaacttgcg gcaaaacatc tcactcacct
gaaaaccact agcgagtctc gcgatcagac 14220gaaggccttt tacttaacga cacaatatcc
gatgtctgca tcacaggcgt cgctatccca 14280gtcaatacta aagcggtgca ggaactaaag
attactgatg acttaggcgt gccacgaggc 14340ctgagacgac gcgcgtagac agttttttga
aatcattatc aaagtgatgg cctccgctga 14400agcctatcac ctctgcgccg gtctgtcgga
gagatgggca agcattatta cggtcttcgc 14460gcccgtacat gcattggacg attgcagggt
caatggatct gagatcatcc agaggattgc 14520cgcccttacc ttccgtttcg agttggagcc
agcccctaaa tgagacgaca tagtcgactt 14580gatgtgacaa tgccaagaga gagatttgct
taacccgatt tttttgctca agcgtaagcc 14640tattgaagct tgccggcatg acgtccgcgc
cgaaagaata tcctacaagt aaaacattct 14700gcacaccgaa atgcttggtg tagacatcga
ttatgtgacc aagatcctta gcagtttcgc 14760ttggggaccg ctccgaccag aaataccgaa
gtgaactgac gccaatgaca ggaatccctt 14820ccgtctgcag ataggtacca tcgatagatc
tgctgcctcg cgcgtttcgg tgatgacggt 14880gaaaacctct gacacatgca gctcccggag
acggtcacag cttgtctgta agcggatgcc 14940gggagcagac aagcccgtca gggcgcgtca
gcgggtgttg gcgggtgtcg gggcgcagcc 15000atgacccagt cacgtagcga tagcggagtg
tatactggct taactatgcg gcatcagagc 15060agattgtact gagagtgcac catatgcggt
gtgaaatacc gcacagatgc gtaaggagaa 15120aataccgcat caggcgctct tccgcttcct
cgctcactga ctcgctgcgc tcggtcgttc 15180ggctgcggcg agcggtatca gctcactcaa
aggcggtaat acggttatcc acagaatcag 15240gggataacgc aggaaagaac atgtgagcaa
aaggccagca aaaggccagg aaccgtaaaa 15300aggccgcgtt gctggcgttt ttccataggc
tccgcccccc tgacgagcat cacaaaaatc 15360gacgctcaag tcagaggtgg cgaaacccga
caggactata aagataccag gcgtttcccc 15420ctggaagctc cctcgtgcgc tctcctgttc
cgaccctgcc gcttaccgga tacctgtccg 15480cctttctccc ttcgggaagc gtggcgcttt
ctcatagctc acgctgtagg tatctcagtt 15540cggtgtaggt cgttcgctcc aagctgggct
gtgtgcacga accccccgtt cagcccgacc 15600gctgcgcctt atccggtaac tatcgtcttg
agtccaaccc ggtaagacac gacttatcgc 15660cactggcagc agccactggt aacaggatta
gcagagcgag gtatgtaggc ggtgctacag 15720agttcttgaa gtggtggcct aactacggct
acactagaag gacagtattt ggtatctgcg 15780ctctgctgaa gccagttacc ttcggaaaaa
gagttggtag ctcttgatcc ggcaaacaaa 15840ccaccgctgg tagcggtggt ttttttgttt
gcaagcagca gattacgcgc agaaaaaaag 15900gatctcaaga agatcctttg atcttttcta
cggggtctga cgctcagtgg aacgaaaact 15960cacgttaagg gattttggtc atgagattat
caaaaaggat cttcacctag atccttttaa 16020attaaaaatg aagttttaaa tcaatctaaa
gtatatatga gtaaacttgg tctgacagtt 16080accaatgctt aatcagtgag gcacctatct
cagcgatctg tctatttcgt tcatccatag 16140ttgcctgact ccccgtcgtg tagataacta
cgatacggga gggcttacca tctggcccca 16200gtgctgcaat gataccgcga gacccacgct
caccggctcc agatttatca gcaataaacc 16260agccagccgg aagggccgag cgcagaagtg
gtcctgcaac tttatccgcc tccatccagt 16320ctattaattg ttgccgggaa gctagagtaa
gtagttcgcc agttaatagt ttgcgcaacg 16380ttgttgccat tgctgcaggg gggggggggg
ggggggactt ccattgttca ttccacggac 16440aaaaacagag aaaggaaacg acagaggcca
aaaagcctcg ctttcagcac ctgtcgtttc 16500ctttcttttc agagggtatt ttaaataaaa
acattaagtt atgacgaaga agaacggaaa 16560cgccttaaac cggaaaattt tcataaatag
cgaaaacccg cgaggtcgcc gccccgtaac 16620ctgtcggatc accggaaagg acccgtaaag
tgataatgat tatcatctac atatcacaac 16680gtgcgtggag gccatcaaac cacgtcaaat
aatcaattat gacgcaggta tcgtattaat 16740tgatctgcat caacttaacg taaaaacaac
ttcagacaat acaaatcagc gacactgaat 16800acggggcaac ctcatgtccc cccccccccc
ccccctgcag ggcatcgtgg tgtcacgctc 16860gtcgtttggt atggcttcat tcagctccgg
ttcccaacga tcaaggcgag ttacatgatc 16920ccccatgttg tgcaaaaaag cggttagctc
cttcggtcct ccgatcgttg tcagaagtaa 16980gttggccgca gtgttatcac tcatggttat
ggcagcactg cataattctc ttactgtcat 17040gccatccgta agatgctttt ctgtgactgg
tgagtactca accaagtcat tctgagaata 17100gtgtatgcgg cgaccgagtt gctcttgccc
ggcgtcaaca cgggataata ccgcgccaca 17160tagcagaact ttaaaagtgc tcatcattgg
aaaacgttct tcggggcgaa aactctcaag 17220gatcttaccg ctgttgagat ccagttcgat
gtaacccact cgtgcaccca actgatcttc 17280agcatctttt actttcacca gcgtttctgg
gtgagcaaaa acaggaaggc aaaatgccgc 17340aaaaaaggga ataagggcga cacggaaatg
ttgaatactc atactcttcc tttttcaata 17400ttattgaagc atttatcagg gttattgtct
catgagcgga tacatatttg aatgtattta 17460gaaaaataaa caaatagggg ttccgcgcac
atttccccga aaagtgccac ctgacgtcta 17520agaaaccatt attatcatga cattaaccta
taaaaatagg cgtatcacga ggccctttcg 17580tcttcaagaa ttggtcgacg atcttgctgc
gttcggatat tttcgtggag ttcccgccac 17640agacccggat tgaaggcgag atccagcaac
tcgcgccaga tcatcctgtg acggaacttt 17700ggcgcgtgat gactggccag gacgtcggcc
gaaagagcga caagcagatc acgcttttcg 17760acagcgtcgg atttgcgatc gaggattttt
cggcgctgcg ctacgtccgc gaccgcgttg 17820agggatcaag ccacagcagc ccactcgacc
ttctagccga cccagacgag ccaagggatc 17880tttttggaat gctgctccgt cgtcaggctt
tccgacgttt gggtggttga acagaagtca 17940ttatcgtacg gaatgccaag cactcccgag
gggaaccctg tggttggcat gcacatacaa 18000atggacgaac ggataaacct tttcacgccc
ttttaaatat ccgttattct aataaacgct 18060cttttctctt aggtttaccc gccaatatat
cctgtcaaac actgatagtt taaactgaag 18120gcgggaaacg acaatctgat catgagcgga
gaattaaggg agtcacgtta tgacccccgc 18180cgatgacgcg ggacaagccg ttttacgttt
ggaactgaca gaaccgcaac gttgaaggag 18240ccactcagcc caagcttttt ggaaggctaa
ggagaggaag ccggcgagaa ggagggggcg 18300ttttacgtgt cactgtcctg tcgtgttggc
tgttgacacg aatcatttct tccgcgcgtg 18360ggaagaagaa gatgcacatt agcggcctga
agtagagatg tcaatgggga attccccagc 18420ggggattaac tccccagacc cgtacccatg
aacatagacc ggcccccatc cccgaacccg 18480aacccgacct cgggtacgaa aatcctccca
tacccattcc cgaccgggta ctaaataccc 18540atgggtatcc atacccgacc cgattattca
aaaattaatg ggctttttat ttgttaaccg 18600gcggacgcaa tgcttgggac tctaggtttt
tttactttgt tgaccggctg gcggctgggc 18660tttttcctac aggcccaaag ttggtcggca
gccactaggc cacacgtcac aggcagccca 18720caagtaaatg tcgttggatt gctggatggt
ggaataaaaa tcctagatgc tagattgttc 18780tggttccggg tatttttctc catggctaat
cgggtttggg tttagccctc ccaaacccga 18840acccgccata cccgatgggt aagggattta
ttccaaatct atacccatgg ggatttgttt 18900taacccatac cttaacccta atagaggaat
tccccacggg taatcgggtt tcggggccca 18960ttgacatctc tagactgaag gcgtccaact
caaatcatta aaaagtgttg acgcacgcgc 19020tgatgcgccg gccgcacagc acaggctgca
cagcccgttt aatcagcgat ggagccccgg 19080ccgtcagcca gccaggtccg gcgtccgggt
ctgcgccctg cggcgtcact gctgtcgcca 19140ccgtctccga tggtcccaca tccatccagc
gggccgcgcg tggtacaaaa ggctcttcct 19200cgccgtcagg tgcagctgcc caaacaccag
acacagactc caccaccccg cttcgatctt 19260ctgttgcagc tgaaatctgt cagattctgc
agttcattcc tcatggagaa gaggaacctg 19320cagtggcggc gagggcgtga tggcatcgtg
cagtaccctc acctcttctt cgcggccctg 19380gcgctggccc tcctagtcgc ggaccgttcg
gcctcagtcc gctggccgag gtcgactacc 19440ggccggtgaa gcacgagctc gcgccgtacg
gggaggtcat gggcagctgg cccagagaca 19500atgccagccg gctcaggcgc gggaggctgg
agttcgtcgg cgaggtgttc gggccggagt 19560ctatcgagtt cgatctccag ggccgcgggc
cgtacgccgg cctcgccgac ggccgcgtcg 19620tgcggtggat gggcgaggag gccgggtggg
agacgttcgc cggtcatgaa tcctgactgg 19680taagtgctcg atatgcctcc ggcgtccact
cgttacagtg ctataatata gtagtactaa 19740gatattttga tctgattttt tgcattcttg
ggagaaacgt catgcaaaat ttgttgtttc 19800ttggcaaagg tcagaagaag tctgtgccaa
tggagtgaac tcaacgacga ggaagcagca 19860cgagaaggag gagttctgcg gcggccgctc
ggcctgaggt tccacgggga gaccggcgag 19920ctctacgtcg ccgacgcgta ctacggtctc
atggtcgttg gccagagcgg cggcgtggcg 19980tcctccgtcg cgagggaagc cgacggggac
cccatccggt tcgcgaacga cctcgatgtg 20040cacaggaatg gatccgtatt cttcactgac
acgagcatga gatacagcag aaagtgagca 20100aagcagcgta acaatccggc ttctcatttt
caaacgcctc tgtattctct gctgaaagag 20160tagctcacca gacaagagct gaatttgcag
ggaccatctg aacatcctgt tagaaggaga 20220aggcaccggg aggctgctca ggtatgatcc
agaaacaagc ggtgtccatg tcgtgctcaa 20280ggggctggtg ttcccaaacg gcgtgcagat
ctcagaggac catcagtttc ttctcttctc 20340cgagacaaca aactgcaggt aacaaaaata
ctatctgacg atgctcatga ttctaccgta 20400tccatagtca tgaacacaaa ccacacgaat
ctggccttga ccaggataat gaggtactgg 20460ctggaaggcc caagagcggg cgaggtagag
gtgttcgcga acctgccggg cttccccgac 20520aacgtgcgct ccaacggcag gggccagttc
tgggtggcga tcgactgctg ccggaccgag 20580gaggtgttcc caagagcgtg gctccggacc
ctgtacttca agttcccgct gtcgctcaag 20640gtgctcactt ggaaggccgc caggaggatg
cacacggtgc tcgcgctcct cgacggcgaa 20700gggcgcgtcg tggaggtgct cgaggaccgg
ggccacgagg tgatgaagct ggtgagcgag 20760gtgcgggagg tgggccgcaa gctgtggatc
ggaaccgtgg cgcacaacca catcgccacc 20820atcccctacc ctttagagga ctaaccatga
tctatgctgt ttcaatgcct cctaatctgt 20880gtacgtctat aaatgtctaa tgcagctcat
ggttgtaatc ttgtttgtgt ttggcaaatt 20940ggcataataa tggacagatt caatgggcat
tggtgctgta gtcgcatcac actaattgaa 21000tgggatcatg ttgagctctc actttgctac
aatttgctcc agcttgtacg gttgtaccct 21060cttgctcgtc tatagtaagg gccatctaaa
aaaaactcaa attagatctg caatacaagt 21120atgattgggc cgaatttgga ttgtcacggg
tccgcgaccg cgaattgggc tccggtttga 21180tttagccgac atagtagtga ccgacccgag
ccggccggcg agaccaaacc gagcggacgc 21240ccgccatgca tggagtcaaa gattcaaata
gaggacctaa cagaactcgc cgtaaagact 21300ggcgaacagt tcatacagag tctcttacga
ctcaatgaca agaagaaaat cttcgtcaac 21360atggtggagc acgacacgct tgtctactcc
aaaaatatca aagatacagt ctcagaagac 21420caaagggcaa ttgagacttt tcaacaaagg
gtaatatccg gaaacctcct cggattccat 21480tgcccagcta tctgtcactt tattgtgaag
atagtggaaa aggaaggtgg ctcctacaaa 21540tgccatcatt gcgataaagg aaaggccatc
gttgaagatg cctctgccga cagtggtccc 21600aaagatggac ccccacccac gaggagcatc
gtggaaaaag aagacgttcc aaccacgtct 21660tcaaagcaag tggattgatg tgatatctcc
actgacgtaa gggatgacgc acaatcccac 21720tatccttcgc aagacccttc ctctatataa
ggaagttcat ttcatttgga gaggacaggg 21780tacccgggga tccaccatgt ctccggagag
gagaccagtt gagattaggc cagctacagc 21840agctgatatg gccgcggttt gtgatatcgt
taaccattac attgagacgt ctacagtgaa 21900ctttaggaca gagccacaaa caccacaaga
gtggattgat gatctagaga ggttgcaaga 21960tagataccct tggttggttg ctgaggttga
gggtgttgtg gctggtattg cttacgctgg 22020gccctggaag gctaggaacg cttacgattg
gacagttgag agtactgttt acgtgtcaca 22080taggcatcaa aggttgggcc taggatccac
attgtacaca catttgctta agtctatgga 22140ggcgcaaggt tttaagtctg tggttgctgt
tataggcctt ccaaacgatc catctgttag 22200gttgcatgag gctttgggat acacagcccg
gggtacattg cgcgcagctg gatacaagca 22260tggtggatgg catgatgttg gtttttggca
aagggatttt gagttgccag ctcctccaag 22320gccagttagg ccagttaccc agatctgagt
cgacctgcag gcatgccgct gaaatcacca 22380gtctctctct acaaatctat ctctctctat
aataatgtgt gagtagttcc cagataaggg 22440aattagggtt cttatagggt ttcgctcatg
tgttgagcat ataagaaacc cttagtatgt 22500atttgtattt gtaaaatact tctatcaata
aaatttctaa ttcctaaaac caaaatccag 22560tgggtaccga gctcgaattc agtacattaa
aaacgtccgc aatgtgttat taagttgtct 22620aagcgtcaat ttgtttacac cacaatatat
cctgccacca gccagccaac agctccccga 22680ccggcagctc ggcacaaaat caccactcga
tacaggcagc ccatcagtcc gggacggcgt 22740cagcgggaga gccgttgtaa ggcggcagac
tttgctcatg ttaccgatgc tattcggaag 22800aacggcaact aagctgccgg gtttgaaaca
cggatgatct cgcggagggt agcatgttga 22860ttgtaacgat gacagagcgt tgctgcctgt
gatcaaatat catctccctc gcagagatcc 22920gaattatcag ccttcttatt catttctcgc
ttaaccgtga caggctgtcg atcttgagaa 22980ctatgccgac ataataggaa atcgctggat
aaagccgctg aggaagctga gtggcgctat 23040ttctttagaa gtgaacgttg acgatcgtcg
accgtacccc gatgaattaa ttcggacgta 23100cgttctgaac acagctggat acttacttgg
gcgattgtca tacatgacat caacaatgta 23160cccgtttgtg taaccgtctc ttggaggttc
gtatgacact agtggttccc ctcagcttgc 23220gactagatgt tgaggcctaa cattttatta
gagagcaggc tagttgctta gatacatgat 23280cttcaggccg ttatctgtca gggcaagcga
aaattggcca tttatgacga ccaatgcccc 23340gcagaagctc ccatctttgc cgccatagac
gccgcgcccc ccttttgggg tgtagaacat 23400ccttttgcca gatgtggaaa agaagttcgt
tgtcccattg ttggcaatga cgtagtagcc 23460ggcgaaagtg cgagacccat ttgcgctata
tataagccta cgatttccgt tgcgactatt 23520gtcgtaattg gatgaactat tatcgtagtt
gctctcagag ttgtcgtaat ttgatggact 23580attgtcgtaa ttgcttatgg agttgtcgta
gttgcttgga gaaatgtcgt agttggatgg 23640ggagtagtca tagggaagac gagcttcatc
cactaaaaca attggcaggt cagcaagtgc 23700ctgccccgat gccatcgcaa gtacgaggct
tagaaccacc ttcaacagat cgcgcatagt 23760cttccccagc tctctaacgc ttgagttaag
ccgcgccgcg aagcggcgtc ggcttgaacg 23820aattgttaga cattatttgc cgactacctt
ggtgatctcg cctttcacgt agtgaacaaa 23880ttcttccaac tgatctgcgc gcgaggccaa
gcgatcttct tgtccaagat aagcctgcct 23940agcttcaagt atgacgggct gatactgggc
cggcaggcgc tccattgccc agtcggcagc 24000gacatccttc ggcgcgattt tgccggttac
tgcgctgtac caaatgcggg acaacgtaag 24060cactacattt cgctcatcgc cagcccagtc
gggcggcgag ttccatagcg ttaaggtttc 24120atttagcgcc tcaaatagat cctgttcagg
aaccggatca aagagttcct ccgccgctgg 24180acctaccaag gcaacgctat gttctcttgc
ttttgtcagc aagatagcca gatcaatgtc 24240gatcgtggct ggctcgaaga tacctgcaag
aatgtcattg cgctgccatt ctccaaattg 24300cagttcgcgc ttagctggat aacgccacgg
aatgatgtcg tcgtgcacaa caatggtgac 24360ttctacagcg cggagaatct cgctctctcc
aggggaagcc gaagtttcca aaaggtcgtt 24420gatcaaagct cgccgcgttg tttcatcaag
ccttacggtc accgtaacca gcaaatcaat 24480atcactgtgt ggcttcaggc cgccatccac
tgcggagccg tacaaatgta cggccagcaa 24540cgtcggttcg agatggcgct cgatgacgcc
aactacctct gatagttgag tcgatacttc 24600ggcgatcacc gcttccctca tgatgtttaa
ctcctgaatt aagccgcgcc gcgaagcggt 24660gtcggcttga atgaattgtt aggcgtcatc
ctgtgctccc gagaaccagt accagtacat 24720cgctgtttcg ttcgagactt gaggtctagt
tttatacgtg aacaggtcaa tgccgccgag 24780agtaaagcca cattttgcgt acaaattgca
ggcaggtaca ttgttcgttt gtgtctctaa 24840tcgtatgcca aggagctgtc tgcttagtgc
ccactttttc gcaaattcga tgagactgtg 24900cgcgactcct ttgcctcggt gcgtgtgcga
cacaacaatg tgttcgatag aggctagatc 24960gttccatgtt gagttgagtt caatcttccc
gacaagctct tggtcgatga atgcgccata 25020gcaagcagag tcttcatcag agtcatcatc
cgagatgtaa tccttccggt aggggctcac 25080acttctggta gatagttcaa agccttggtc
ggataggtgc acatcgaaca cttcacgaac 25140aatgaaatgg ttctcagcat ccaatgtttc
cgccacctgc tcagggatca ccgaaatctt 25200catatgacgc ctaacgcctg gcacagcgga
tcgcaaacct ggcgcggctt ttggcacaaa 25260aggcgtgaca ggtttgcgaa tccgttgctg
ccacttgtta acccttttgc cagatttggt 25320aactataatt tatgttagag gcgaagtctt
gggtaaaaac tggcctaaaa ttgctgggga 25380tttcaggaaa gtaaacatca ccttccggct
cgatgtctat tgtagatata tgtagtgtat 25440ctacttgatc gggggatctg ctgcctcgcg
cgtttcggtg atgacggtga aaacctctga 25500cacatgcagc tcccggagac ggtcacagct
tgtctgtaag cggatgccgg gagcagacaa 25560gcccgtcagg gcgcgtcagc gggtgttggc
gggtgtcggg gcgcagccat gacccagtca 25620cgtagcgata gcggagtgta tactggctta
actatgcggc atcagagcag attgtactga 25680gagtgcacca tatgcggtgt gaaataccgc
acagatgcgt aaggagaaaa taccgcatca 25740ggcgctcttc cgcttcctcg ctcactgact
cgctgcgctc ggtcgttcgg ctgcggcgag 25800cggtatcagc tcactcaaag gcggtaatac
ggttatccac agaatcaggg gataacgcag 25860gaaagaacat gtgagcaaaa ggccagcaaa
aggccaggaa ccgtaaaaag gccgcgttgc 25920tggcgttttt ccataggctc cgcccccctg
acgagcatca caaaaatcga cgctcaagtc 25980agaggtggcg aaacccgaca ggactataaa
gataccaggc gtttccccct ggaagctccc 26040tcgtgcgctc tcctgttccg accctgccgc
ttaccggata cctgtccgcc tttctccctt 26100cgggaagcgt ggcgctttct catagctcac
gctgtaggta tctcagttcg gtgtaggtcg 26160ttcgctccaa gctgggctgt gtgcacgaac
cccccgttca gcccgaccgc tgcgccttat 26220ccggtaacta tcgtcttgag tccaacccgg
taagacacga cttatcgcca ctggcagcag 26280ccactggtaa caggattagc agagcgaggt
atgtaggcgg tgctacagag ttcttgaagt 26340ggtggcctaa ctacggctac actagaagga
cagtatttgg tatctgcgct ctgctgaagc 26400cagttacctt cggaaaaaga gttggtagct
cttgatccgg caaacaaacc accgctggta 26460gcggtggttt ttttgtttgc aagcagcaga
ttacgcgcag aaaaaaagga tctcaagaag 26520atcctttgat cttttctacg gggtctgacg
ctcagtggaa cgaaaactca cgttaaggga 26580ttttggtcat gagattatca aaaaggatct
tcacctagat ccttttaaat taaaaatgaa 26640gttttaaatc aatctaaagt atatatgagt
aaacttggtc tgacagttac caatgcttaa 26700tcagtgaggc acctatctca gcgatctgtc
tatttcgttc atccatagtt gcctgactcc 26760ccgtcgtgta gataactacg atacgggagg
gcttaccatc tggccccagt gctgcaatga 26820taccgcgaga cccacgctca ccggctccag
atttatcagc aataaaccag ccagccggaa 26880gggccgagcg cagaagtggt cctgcaactt
tatccgcctc catccagtct attaattgtt 26940gccgggaagc tagagtaagt agttcgccag
ttaatagttt gcgcaacgtt gttgccattg 27000ctgcaggggg gggggggggg ggggacttcc
attgttcatt ccacggacaa aaacagagaa 27060aggaaacgac agaggccaaa aagcctcgct
ttcagcacct gtcgtttcct ttcttttcag 27120agggtatttt aaataaaaac attaagttat
gacgaagaag aacggaaacg ccttaaaccg 27180gaaaattttc ataaatagcg aaaacccgcg
aggtcgccgc cccgtaacct gtcggatcac 27240cggaaaggac ccgtaaagtg ataatgatta
tcatctacat atcacaacgt gcgtggaggc 27300catcaaacca cgtcaaataa tcaattatga
cgcaggtatc gtattaattg atctgcatca 27360acttaacgta aaaacaactt cagacaatac
aaatcagcga cactgaatac ggggcaacct 27420catgtccccc cccccccccc ccctgcaggc
atcgtggtgt cacgctcgtc gtttggtatg 27480gcttcattca gctccggttc ccaacgatca
aggcgagtta catgatcccc catgttgtgc 27540aaaaaagcgg ttagctcctt cggtcctccg
atcgttgtca gaagtaagtt ggccgcagtg 27600ttatcactca tggttatggc agcactgcat
aattctctta ctgtcatgcc atccgtaaga 27660tgcttttctg tgactggtga gtactcaacc
aagtcattct gagaatagtg tatgcggcga 27720ccgagttgct cttgcccggc gtcaacacgg
gataataccg cgccacatag cagaacttta 27780aaagtgctca tcattggaaa acgttcttcg
gggcgaaaac tctcaaggat cttaccgctg 27840ttgagatcca gttcgatgta acccactcgt
gcacccaact gatcttcagc atcttttact 27900ttcaccagcg tttctgggtg agcaaaaaca
ggaaggcaaa atgccgcaaa aaagggaata 27960agggcgacac ggaaatgttg aatactcata
ctcttccttt ttcaatatta ttgaagcatt 28020tatcagggtt attgtctcat gagcggatac
atatttgaat gtatttagaa aaataaacaa 28080ataggggttc cgcgcacatt tccccgaaaa
gtgccacctg acgtctaaga aaccattatt 28140atcatgacat taacctataa aaataggcgt
atcacgaggc cctttcgtct tcaagaattc 28200ggagcttttg ccattctcac cggattcagt
cgtcactcat ggtgatttct cacttgataa 28260ccttattttt gacgagggga aattaatagg
ttgtattgat gttggacgag tcggaatcgc 28320agaccgatac caggatcttg ccatcctatg
gaactgcctc ggtgagtttt ctccttcatt 28380acagaaacgg ctttttcaaa aatatggtat
tgataatcct gatatgaata aattgcagtt 28440tcatttgatg ctcgatgagt ttttctaatc
agaattggtt aattggttgt aacactggca 28500gagcattacg ctgacttgac gggacggcgg
ctttgttgaa taaatcgaac ttttgctgag 28560ttgaaggatc agatcacgca tcttcccgac
aacgcagacc gttccgtggc aaagcaaaag 28620ttcaaaatca ccaactggtc cacctacaac
aaagctctca tcaaccgtgg ctccctcact 28680ttctggctgg atgatggggc gattcaggcc
tggtatgagt cagcaacacc ttcttcacga 28740ggcagacctc agcgccagaa ggccgccaga
gaggccgagc gcggccgtga ggcttggacg 28800ctagggcagg gcatgaaaaa gcccgtagcg
ggctgctacg ggcgtctgac gcggtggaaa 28860gggggagggg atgttgtcta catggctctg
ctgtagtgag tgggttgcgc tccggcagcg 28920gtcctgatca atcgtcaccc tttctcggtc
cttcaacgtt cctgacaacg agcctccttt 28980tcgccaatcc atcgacaatc accgcgagtc
cctgctcgaa cgctgcgtcc ggaccggctt 29040cgtcgaaggc gtctatcgcg gcccgcaaca
gcggcgagag cggagcctgt tcaacggtgc 29100cgccgcgctc gccggcatcg ctgtcgccgg
cctgctcctc aagcacggcc ccaacagtga 29160agtagctgat tgtcatcagc gcattgacgg
cgtccccggc cgaaaaaccc gcctcgcaga 29220ggaagcgaag ctgcgcgtcg gccgtttcca
tctgcggtgc gcccggtcgc gtgccggcat 29280ggatgcgcgc gccatcgcgg taggcgagca
gcgcctgcct gaagctgcgg gcattcccga 29340tcagaaatga gcgccagtcg tcgtcggctc
tcggcaccga atgcgtatga ttctccgcca 29400gcatggcttc ggccagtgcg tcgagcagcg
cccgcttgtt cctgaagtgc cagtaaagcg 29460ccggctgctg aacccccaac cgttccgcca
gtttgcgtgt cgtcagaccg tctacgccga 29520cctcgttcaa caggtccagg gcggcacgga
tcactgtatt cggctgcaac tttgtcatgc 29580ttgacacttt atcactgata aacataatat
gtccaccaac ttatcagtga taaagaatcc 29640gcgcgttcaa tcggaccagc ggaggctggt
ccggaggcca gacgtgaaac ccaacatacc 29700cctgatcgta attctgagca ctgtcgcgct
cgacgctgtc ggcatcggcc tgattatgcc 29760ggtgctgccg ggcctcctgc gcgatctggt
tcactcgaac gacgtcaccg cccactatgg 29820cattctgctg gcgctgtatg cgttggtgca
atttgcctgc gcacctgtgc tgggcgcgct 29880gtcggatcgt ttcgggcggc ggccaatctt
gctcgtctcg ctggccggcg ccactgtcga 29940ctacgccatc atggcgacag cgcctttcct
ttgggttctc tatatcgggc ggatcgtggc 30000cggcatcacc ggggcgactg gggcggtagc
cggcgcttat attgccgata tcactgatgg 30060cgatgagcgc gcgcggcact tcggcttcat
gagcgcctgt ttcgggttcg ggatggtcgc 30120gggacctgtg ctcggtgggc tgatgggcgg
tttctccccc cacgctccgt tcttcgccgc 30180ggcagccttg aacggcctca atttcctgac
gggctgtttc cttttgccgg agtcgcacaa 30240aggcgaacgc cggccgttac gccgggaggc
tctcaacccg ctcgcttcgt tccggtgggc 30300ccggggcatg accgtcgtcg ccgccctgat
ggcggtcttc ttcatcatgc aacttgtcgg 30360acaggtgccg gccgcgcttt gggtcatttt
cggcgaggat cgctttcact gggacgcgac 30420cacgatcggc atttcgcttg ccgcatttgg
cattctgcat tcactcgccc aggcaatgat 30480caccggccct gtagccgccc ggctcggcga
aaggcgggca ctcatgctcg gaatgattgc 30540cgacggcaca ggctacatcc tgcttgcctt
cgcgacacgg ggatggatgg cgttcccgat 30600catggtcctg cttgcttcgg gtggcatcgg
aatgccggcg ctgcaagcaa tgttgtccag 30660gcaggtggat gaggaacgtc aggggcagct
gcaaggctca ctggcggcgc tcaccagcct 30720gacctcgatc gtcggacccc tcctcttcac
ggcgatctat gcggcttcta taacaacgtg 30780gaacgggtgg gcatggattg caggcgctgc
cctctacttg ctctgcctgc cggcgctgcg 30840tcgcgggctt tggagcggcg cagggcaacg
agccgatcgc tgatcgtgga aacgataggc 30900ctatgccatg cgggtcaagg cgacttccgg
caagctatac gcgccctagg agtgcggttg 30960gaacgttggc ccagccagat actcccgatc
acgagcagga cgccgatgat ttgaagcgca 31020ctcagcgtct gatccaagaa caaccatcct
agcaacacgg cggtccccgg gctgagaaag 31080cccagtaagg aaacaactgt aggttcgagt
cgcgagatcc cccggaacca aaggaagtag 31140gttaaacccg ctccgatcag gccgagccac
gccaggccga gaacattggt tcctgtaggc 31200atcgggattg gcggatcaaa cactaaagct
actggaacga gcagaagtcc tccggccgcc 31260agttgccagg cggtaaaggt gagcagaggc
acgggaggtt gccacttgcg ggtcagcacg 31320gttccgaacg ccatggaaac cgcccccgcc
aggcccgctg cgacgccgac aggatctagc 31380gctgcgtttg gtgtcaacac caacagcgcc
acgcccgcag ttccgcaaat agcccccagg 31440accgccatca atcgtatcgg gctacctagc
agagcggcag agatgaacac gaccatcagc 31500ggctgcacag cgcctaccgt cgccgcgacc
ccgcccggca ggcggtagac cgaaataaac 31560aacaagctcc agaatagcga aatattaagt
gcgccgagga tgaagatgcg catccaccag 31620attcccgttg gaatctgtcg gacgatcatc
acgagcaata aacccgccgg caacgcccgc 31680agcagcatac cggcgacccc tcggcctcgc
tgttcgggct ccacgaaaac gccggacaga 31740tgcgccttgt gagcgtcctt ggggccgtcc
tcctgtttga agaccgacag cccaatgatc 31800tcgccgtcga tgtaggcgcc gaatgccacg
gcatctcgca accgttcagc gaacgcctcc 31860atgggctttt tctcctcgtg ctcgtaaacg
gacccgaaca tctctggagc tttcttcagg 31920gccgacaatc ggatctcgcg gaaatcctgc
acgtcggccg ctccaagccg tcgaatctga 31980gccttaatca caattgtcaa ttttaatcct
ctgtttatcg gcagttcgta gagcgcgccg 32040tgcgtcccga gcgatactga gcgaagcaag
tgcgtcgagc agtgcccgct tgttcctgaa 32100atgccagtaa agcgctggct gctgaacccc
cagccggaac tgaccccaca aggccctagc 32160gtttgcaatg caccaggtca tcattgaccc
aggcgtgttc caccaggccg ctgcctcgca 32220actcttcgca ggcttcgccg acctgctcgc
gccacttctt cacgcgggtg gaatccgatc 32280cgcacatgag gcggaaggtt tccagcttga
gcgggtacgg ctcccggtgc gagctgaaat 32340agtcgaacat ccgtcgggcc gtcggcgaca
gcttgcggta cttctcccat atgaatttcg 32400tgtagtggtc gccagcaaac agcacgacga
tttcctcgtc gatcaggacc tggcaacggg 32460acgttttctt gccacggtcc aggacgcgga
agcggtgcag cagcgacacc gattccaggt 32520gcccaacgcg gtcggacgtg aagcccatcg
ccgtcgcctg taggcgcgac aggcattcct 32580cggccttcgt gtaataccgg ccattgatcg
accagcccag gtcctggcaa agctcgtaga 32640acgtgaaggt gatcggctcg ccgatagggg
tgcgcttcgc gtactccaac acctgctgcc 32700acaccagttc gtcatcgtcg gcccgcagct
cgacgccggt gtaggtgatc ttcacgtcct 32760tgttgacgtg gaaaatgacc ttgttttgca
gcgcctcgcg cgggattttc ttgttgcgcg 32820tggtgaacag ggcagagcgg gccgtgtcgt
ttggcatcgc tcgcatcgtg tccggccacg 32880gcgcaatatc gaacaaggaa agctgcattt
ccttgatctg ctgcttcgtg tgtttcagca 32940acgcggcctg cttggcctcg ctgacctgtt
ttgccaggtc ctcgccggcg gtttttcgct 33000tcttggtcgt catagttcct cgcgtgtcga
tggtcatcga cttcgccaaa cctgccgcct 33060cctgttcgag acgacgcgaa cgctccacgg
cggccgatgg cgcgggcagg gcagggggag 33120ccagttgcac gctgtcgcgc tcgatcttgg
ccgtagcttg ctggaccatc gagccgacgg 33180actggaaggt ttcgcggggc gcacgcatga
cggtgcggct tgcgatggtt tcggcatcct 33240cggcggaaaa ccccgcgtcg atcagttctt
gcctgtatgc cttccggtca aacgtccgat 33300tcattcaccc tccttgcggg attgccccga
ctcacgccgg ggcaatgtgc ccttattcct 33360gatttgaccc gcctggtgcc ttggtgtcca
gataatccac cttatcggca atgaagtcgg 33420tcccgtagac cgtctggccg tccttctcgt
acttggtatt ccgaatcttg ccctgcacga 33480ataccagcga ccccttgccc aaatacttgc
cgtgggcctc ggcctgagag ccaaaacact 33540tgatgcggaa gaagtcggtg cgctcctgct
tgtcgccggc atcgttgcgc cactcttcat 33600taaccgctat atcgaaaatt gcttgcggct
tgttagaatt gccatgacgt acctcggtgt 33660cacgggtaag attaccgata aactggaact
gattatggct catatcgaaa gtctccttga 33720gaaaggagac tctagtttag ctaaacattg
gttccgctgt caagaacttt agcggctaaa 33780attttgcggg ccgcgaccaa aggtgcgagg
ggcggcttcc gctgtgtaca accagatatt 33840tttcaccaac atccttcgtc tgctcgatga
gcggggcatg acgaaacatg agctgtcgga 33900gagggcaggg gtttcaattt cgtttttatc
agacttaacc aacggtaagg ccaacccctc 33960gttgaaggtg atggaggcca ttgccgacgc
cctggaaact cccctacctc ttctcctgga 34020gtccaccgac cttgaccgcg aggcactcgc
ggagattgcg ggtcatcctt tcaagagcag 34080cgtgccgccc ggatacgaac gcatcagtgt
ggttttgccg tcacataagg cgtttatcgt 34140aaagaaatgg ggcgacgaca cccgaaaaaa
gctgcgtgga aggctctgac gccaagggtt 34200agggcttgca cttccttctt tagccgctaa
aacggcccct tctctgcggg ccgtcggctc 34260gcgcatcata tcgacatcct caacggaagc
cgtgccgcga atggcatcgg gcgggtgcgc 34320tttgacagtt gttttctatc agaaccccta
cgtcgtgcgg ttcgattagc tgtttgtctt 34380gcaggctaaa cactttcggt atatcgtttg
cctgtgcgat aatgttgcta atgatttgtt 34440gcgtaggggt tactgaaaag tgagcgggaa
agaagagttt cagaccatca aggagcgggc 34500caagcgcaag ctggaacgcg acatgggtgc
ggacctgttg gccgcgctca acgacccgaa 34560aaccgttgaa gtcatgctca acgcggacgg
caaggtgtgg cacgaacgcc ttggcgagcc 34620gatgcggtac atctgcgaca tgcggcccag
ccagtcgcag gcgattatag aaacggtggc 34680cggattccac ggcaaagagg tcacgcggca
ttcgcccatc ctggaaggcg agttcccctt 34740ggatggcagc cgctttgccg gccaattgcc
gccggtcgtg gccgcgccaa cctttgcgat 34800ccgcaagcgc gcggtcgcca tcttcacgct
ggaacagtac gtcgaggcgg gcatcatgac 34860ccgcgagcaa tacgaggtca ttaaaagcgc
cgtcgcggcg catcgaaaca tcctcgtcat 34920tggcggtact ggctcgggca agaccacgct
cgtcaacgcg atcatcaatg aaatggtcgc 34980cttcaacccg tctgagcgcg tcgtcatcat
cgaggacacc ggcgaaatcc agtgcgccgc 35040agagaacgcc gtccaatacc acaccagcat
cgacgtctcg atgacgctgc tgctcaagac 35100aacgctgcgt atgcgccccg accgcatcct
ggtcggtgag gtacgtggcc ccgaagccct 35160tgatctgttg atggcctgga acaccgggca
tgaaggaggt gccgccaccc tgcacgcaaa 35220caaccccaaa gcgggcctga gccggctcgc
catgcttatc agcatgcacc cggattcacc 35280gaaacccatt gagccgctga ttggcgaggc
ggttcatgtg gtcgtccata tcgccaggac 35340ccctagcggc cgtcgagtgc aagaaattct
cgaagttctt ggttacgaga acggccagta 35400catcaccaaa accctgtaag gagtatttcc
aatgacaacg gctgttccgt tccgtctgac 35460catgaatcgc ggcattttgt tctaccttgc
cgtgttcttc gttctcgctc tcgcgttatc 35520cgcgcatccg gcgatggcct cggaaggcac
cggcggcagc ttgccatatg agagctggct 35580gacgaacctg cgcaactccg taaccggccc
ggtggccttc gcgctgtcca tcatcggcat 35640cgtcgtcgcc ggcggcgtgc tgatcttcgg
cggcgaactc aacgccttct tccgaaccct 35700gatcttcctg gttctggtga tggcgctgct
ggtcggcgcg cagaacgtga tgagcacctt 35760cttcggtcgt ggtgccgaaa tcgcggccct
cggcaacggg gcgctgcacc aggtgcaagt 35820cgcggcggcg gatgccgtgc gtgcggtagc
ggctggacgg ctcgcctaat catggctctg 35880cgcacgatcc ccatccgtcg cgcaggcaac
cgagaaaacc tgttcatggg tggtgatcgt 35940gaactggtga tgttctcggg cctgatggcg
tttgcgctga ttttcagcgc ccaagagctg 36000cgggccaccg tggtcggtct gatcctgtgg
ttcggggcgc tctatgcgtt ccgaatcatg 36060gcgaaggccg atccgaagat gcggttcgtg
tacctgcgtc accgccggta caagccgtat 36120tacccggccc gctcgacccc gttccgcgag
aacaccaata gccaagggaa gcaataccga 36180tgatccaagc aattgcgatt gcaatcgcgg
gcctcggcgc gcttctgttg ttcatcctct 36240ttgcccgcat ccgcgcggtc gatgccgaac
tgaaactgaa aaagcatcgt tccaaggacg 36300ccggcctggc cgatctgctc aactacgccg
ctgtcgtcga tgacggcgta atcgtgggca 36360agaacggcag ctttatggct gcctggctgt
acaagggcga tgacaacgca agcagcaccg 36420accagcagcg cgaagtagtg tccgcccgca
tcaaccaggc cctcgcgggc ctgggaagtg 36480ggtggatgat ccatgtggac gccgtgcggc
gtcctgctcc gaactacgcg gagcggggcc 36540tgtcggcgtt ccctgaccgt ctgacggcag
cgattgaaga agagcgctcg gtcttgcctt 36600gctcgtcggt gatgtacttc accagctccg
cgaagtcgct cttcttgatg gagcgcatgg 36660ggacgtgctt ggcaatcacg cgcacccccc
ggccgtttta gcggctaaaa aagtcatggc 36720tctgccctcg ggcggaccac gcccatcatg
accttgccaa gctcgtcctg cttctcttcg 36780atcttcgcca gcagggcgag gatcgtggca
tcaccgaacc gcgccgtgcg cgggtcgtcg 36840gtgagccaga gtttcagcag gccgcccagg
cggcccaggt cgccattgat gcgggccagc 36900tcgcggacgt gctcatagtc cacgacgccc
gtgattttgt agccctggcc gacggccagc 36960aggtaggccg acaggctcat gccggccgcc
gccgcctttt cctcaatcgc tcttcgttcg 37020tctggaaggc agtacacctt gataggtggg
ctgcccttcc tggttggctt ggtttcatca 37080gccatccgct tgccctcatc tgttacgccg
gcggtagccg gccagcctcg cagagcagga 37140ttcccgttga gcaccgccag gtgcgaataa
gggacagtga agaaggaaca cccgctcgcg 37200ggtgggccta cttcacctat cctgcccggc
tgacgccgtt ggatacacca aggaaagtct 37260acacgaaccc tttggcaaaa tcctgtatat
cgtgcgaaaa aggatggata taccgaaaaa 37320atcgctataa tgaccccgaa gcagggttat
gcagcggaaa agcgctgctt ccctgctgtt 37380ttgtggaata tctaccgact ggaaacaggc
aaatgcagga aattactgaa ctgaggggac 37440aggcgagaga cgatgccaaa gagctacacc
gacgagctgg ccgagtgggt tgaatcccgc 37500gcggccaaga agcgccggcg tgatgaggct
gcggttgcgt tcctggcggt gagggcggat 37560gtcgaggcgg cgttagcgtc cggctatgcg
ctcgtcacca tttgggagca catgcgggaa 37620acggggaagg tcaagttctc ctacgagacg
ttccgctcgc acgccaggcg gcacatcaag 37680gccaagcccg ccgatgtgcc cgcaccgcag
gccaaggctg cggaacccgc gccggcaccc 37740aagacgccgg agccacggcg gccgaagcag
gggggcaagg ctgaaaagcc ggcccccgct 37800gcggccccga ccggcttcac cttcaaccca
acaccggaca aaaaggatct actgtaatgg 37860cgaaaattca catggttttg cagggcaagg
gcggggtcgg caagtcggcc atcgccgcga 37920tcattgcgca gtacaagatg gacaaggggc
agacaccctt gtgcatcgac accgacccgg 37980tgaacgcgac gttcgagggc tacaaggccc
tgaacgtccg ccggctgaac atcatggccg 38040gcgacgaaat taactcgcgc aacttcgaca
ccctggtcga gctgattgcg ccgaccaagg 38100atgacgtggt gatcgacaac ggtgccagct
cgttcgtgcc tctgtcgcat tacctcatca 38160gcaaccaggt gccggctctg ctgcaagaaa
tggggcatga gctggtcatc cataccgtcg 38220tcaccggcgg ccaggctctc ctggacacgg
tgagcggctt cgcccagctc gccagccagt 38280tcccggccga agcgcttttc gtggtctggc
tgaacccgta ttgggggcct atcgagcatg 38340agggcaagag ctttgagcag atgaaggcgt
acacggccaa caaggcccgc gtgtcgtcca 38400tcatccagat tccggccctc aaggaagaaa
cctacggccg cgatttcagc gacatgctgc 38460aagagcggct gacgttcgac caggcgctgg
ccgatgaatc gctcacgatc atgacgcggc 38520aacgcctcaa gatcgtgcgg cgcggcctgt
ttgaacagct cgacgcggcg gccgtgctat 38580gagcgaccag attgaagagc tgatccggga
gattgcggcc aagcacggca tcgccgtcgg 38640ccgcgacgac ccggtgctga tcctgcatac
catcaacgcc cggctcatgg ccgacagtgc 38700ggccaagcaa gaggaaatcc ttgccgcgtt
caaggaagag ctggaaggga tcgcccatcg 38760ttggggcgag gacgccaagg ccaaagcgga
gcggatgctg aacgcggccc tggcggccag 38820caaggacgca atggcgaagg taatgaagga
cagcgccgcg caggcggccg aagcgatccg 38880cagggaaatc gacgacggcc ttggccgcca
gctcgcggcc aaggtcgcgg acgcgcggcg 38940cgtggcgatg atgaacatga tcgccggcgg
catggtgttg ttcgcggccg ccctggtggt 39000gtgggcctcg ttatgaatcg cagaggcgca
gatgaaaaag cccggcgttg ccgggctttg 39060tttttgcgtt agctgggctt gtttgacagg
cccaagctct gactgcgccc gcgctcgcgc 39120tcctgggcct gtttcttctc ctgctcctgc
ttgcgcatca gggcctggtg ccgtcgggct 39180gcttcacgca tcgaatccca gtcgccggcc
agctcgggat gctccgcgcg catcttgcgc 39240gtcgccagtt cctcgatctt gggcgcgtga
atgcccatgc cttccttgat ttcgcgcacc 39300atgtccagcc gcgtgtgcag ggtctgcaag
cgggcttgct gttgggcctg ctgctgctgc 39360caggcggcct ttgtacgcgg cagggacagc
aagccggggg cattggactg tagctgctgc 39420aaacgcgcct gctgacggtc tacgagctgt
tctaggcggt cctcgatgcg ctccacctgg 39480tcatgctttg cctgcacgta gagcgcaagg
gtctgctggt aggtctgctc gatgggcgcg 39540gattctaaga gggcctgctg ttccgtctcg
gcctcctggg ccgcctgtag caaatcctcg 39600ccgctgttgc cgctggactg ctttactgcc
ggggactgct gttgccctgc tcgcgccgtc 39660gtcgcagttc ggcttgcccc cactcgattg
actgcttcat ttcgagccgc agcgatgcga 39720tctcggattg cgtcaacgga cggggcagcg
cggaggtgtc cggcttctcc ttgggtgagt 39780cggtcgatgc catagccaaa ggtttccttc
caaaatgcgt ccattgctgg accgtgtttc 39840tcattgatgc ccgcaagcat cttcggcttg
accgccaggt caagcgcgcc ttcatgggcg 39900gtcatgacgg acgccgccat gaccttgccg
ccgttgttct cgatgtagcc gcgtaatgag 39960gcaatggtgc cgcccatcgt cagcgtgtca
tcgacaacga tgtacttctg gccggggatc 40020acctccccct cgaaagtcgg gttgaacgcc
aggcgatgat ctgaaccggc tccggttcgg 40080gcgaccttct cccgctgcac aatgtccgtt
tcgacctcaa ggccaaggcg gtcggccaga 40140acgaccgcca tcatggccgg aatcttgttg
ttccccgccg cctcgacggc gaggactgga 40200acgatgcggg gcttgtcgtc gccgatcagc
gtcttgagct gggcaacagt gtcgtccgaa 40260atcaggcgct cgaccaaatt aagcgccgct
tccgcgtcgc cctgcttcgc agcctggtat 40320tcaggctcgt tggtcaaaga accaaggtcg
ccgttgcgaa ccaccttcgg gaagtctccc 40380cacggtgcgc gctcggctct gctgtagctg
ctcaagacgc ctcccttttt agccgctaaa 40440actctaacga gtgcgcccgc gactcaactt
gacgctttcg gcacttacct gtgccttgcc 40500acttgcgtca taggtgatgc ttttcgcact
cccgatttca ggtactttat cgaaatctga 40560ccgggcgtgc attacaaagt tcttccccac
ctgttggtaa atgctgccgc tatctgcgtg 40620gacgatgctg ccgtcgtggc gctgcgactt
atcggccttt tgggccatat agatgttgta 40680aatgccaggt ttcagggccc cggctttatc
taccttctgg ttcgtccatg cgccttggtt 40740ctcggtctgg acaattcttt gcccattcat
gaccaggagg cggtgtttca ttgggtgact 40800cctgacggtt gcctctggtg ttaaacgtgt
cctggtcgct tgccggctaa aaaaaagccg 40860acctcggcag ttcgaggccg gctttcccta
gagccgggcg cgtcaaggtt gttccatcta 40920ttttagtgaa ctgcgttcga tttatcagtt
actttcctcc cgctttgtgt ttcctcccac 40980tcgtttccgc gtctagccga cccctcaaca
tagcggcctc ttcttgggct gcctttgcct 41040cttgccgcgc ttcgtcacgc tcggcttgca
ccgtcgtaaa gcgctcggcc tgcctggccg 41100cctcttgcgc cgccaacttc ctttgctcct
ggtgggcctc ggcgtcggcc tgcgccttcg 41160ctttcaccgc tgccaactcc gtgcgcaaac
tctccgcttc gcgcctggtg gcgtcgcgct 41220cgccgcgaag cgcctgcatt tcctggttgg
ccgcgtccag ggtcttgcgg ctctcttctt 41280tgaatgcgcg ggcgtcctgg tgagcgtagt
ccagctcggc gcgcagctcc tgcgctcgac 41340gctccacctc gtcggcccgc tgcgtcgcca
gcgcggcccg ctgctcggct cctgccaggg 41400cggtgcgtgc ttcggccagg gcttgccgct
ggcgtgcggc cagctcggcc gcctcggcgg 41460cctgctgctc tagcaatgta acgcgcgcct
gggcttcttc cagctcgcgg gcctgcgcct 41520cgaaggcgtc ggccagctcc ccgcgcacgg
cttccaactc gttgcgctca cgatcccagc 41580cggcttgcgc tgcctgcaac gattcattgg
caagggcctg ggcggcttgc cagagggcgg 41640ccacggcctg gttgccggcc tgctgcaccg
cgtccggcac ctggactgcc agcggggcgg 41700cctgcgccgt gcgctggcgt cgccattcgc
gcatgccggc gctggcgtcg ttcatgttga 41760cgcgggcggc cttacgcact gcatccacgg
tcgggaagtt ctcccggtcg ccttgctcga 41820acagctcgtc cgcagccgca aaaatgcggt
cgcgcgtctc tttgttcagt tccatgttgg 41880ctccggtaat tggtaagaat aataatactc
ttacctacct tatcagcgca agagtttagc 41940tgaacagttc tcgacttaac ggcaggtttt
ttagcggctg aagggcaggc aaaaaaagcc 42000ccgcacggtc ggcgggggca aagggtcagc
gggaagggga ttagcgggcg tcgggcttct 42060tcatgcgtcg gggccgcgct tcttgggatg
gagcacgacg aagcgcgcac gcgcatcgtc 42120ctcggcccta tcggcccgcg tcgcggtcag
gaacttgtcg cgcgctaggt cctccctggt 42180gggcaccagg ggcatgaact cggcctgctc
gatgtaggtc cactccatga ccgcatcgca 42240gtcgaggccg cgttccttca ccgtctcttg
caggtcgcgg tacgcccgct cgttgagcgg 42300ctggtaacgg gccaattggt cgtaaatggc
tgtcggccat gagcggcctt tcctgttgag 42360ccagcagccg acgacgaagc cggcaatgca
ggcccctggc acaaccaggc cgacgccggg 42420ggcaggggat ggcagcagct cgccaaccag
gaaccccgcc gcgatgatgc cgatgccggt 42480caaccagccc ttgaaactat ccggccccga
aacacccctg cgcattgcct ggatgctgcg 42540ccggatagct tgcaacatca ggagccgttt
cttttgttcg tcagtcatgg tccgccctca 42600ccagttgttc gtatcggtgt cggacgaact
gaaatcgcaa gagctgccgg tatcggtcca 42660gccgctgtcc gtgtcgctgc tgccgaagca
cggcgagggg tccgcgaacg ccgcagacgg 42720cgtatccggc cgcagcgcat cgcccagcat
ggccccggtc agcgagccgc cggccaggta 42780gcccagcatg gtgctgttgg tcgccccggc
caccagggcc gacgtgacga aatcgccgtc 42840attccctctg gattgttcgc tgctcggcgg
ggcagtgcgc cgcgccggcg gcgtcgtgga 42900tggctcgggt tggctggcct gcgacggccg
gcgaaaggtg cgcagcagct cgttatcgac 42960cggctgcggc gtcggggccg ccgccttgcg
ctgcggtcgg tgttccttct tcggctcgcg 43020cagcttgaac agcatgatcg cggaaaccag
cagcaacgcc gcgcctacgc ctcccgcgat 43080gtagaacagc atcggattca ttcttcggtc
ctccttgtag cggaaccgtt gtctgtgcgg 43140cgcgggtggc ccgcgccgct gtctttgggg
atcagccctc gatgagcgcg accagtttca 43200cgtcggcaag gttcgcctcg aactcctggc
cgtcgtcctc gtacttcaac caggcatagc 43260cttccgccgg cggccgacgg ttgaggataa
ggcgggcagg gcgctcgtcg tgctcgacct 43320ggacgatggc ctttttcagc ttgtccgggt
ccggctcctt cgcgcccttt tccttggcgt 43380ccttaccgtc ctggtcgccg tcctcgccgt
cctggccgtc gccggcctcc gcgtcacgct 43440cggcatcagt ctggccgttg aaggcatcga
cggtgttggg atcgcggccc ttctcgtcca 43500ggaactcgcg cagcagcttg accgtgccgc
gcgtgatttc ctgggtgtcg tcgtcaagcc 43560acgcctcgac ttcctccggg cgcttcttga
aggccgtcac cagctcgttc accacggtca 43620cgtcgcgcac gcggccggtg ttgaacgcat
cggcgatctt ctccggcagg tccagcagcg 43680tgacgtgctg ggtgatgaac gccggcgact
tgccgatttc cttggcgata tcgcctttct 43740tcttgccctt cgccagctcg cggccaatga
agtcggcaat ttcgcgcggg gtcagctcgt 43800tgcgttgcag gttctcgata acctggtcgg
cttcgttgta gtcgttgtcg atgaacgccg 43860ggatggactt cttgccggcc cacttcgagc
cacggtagcg gcgggcgccg tgattgatga 43920tatagcggcc cggctgctcc tggttctcgc
gcaccgaaat gggtgacttc accccgcgct 43980ctttgatcgt ggcaccgatt tccgcgatgc
tctccgggga aaagccgggg ttgtcggccg 44040tccgcggctg atgcggatct tcgtcgatca
ggtccaggtc cagctcgata gggccggaac 44100cgccctgaga cgccgcagga gcgtccagga
ggctcgacag gtcgccgatg ctatccaacc 44160ccaggccgga cggctgcgcc gcgcctgcgg
cttcctgagc ggccgcagcg gtgtttttct 44220tggtggtctt ggcttgagcc gcagtcattg
ggaaatctcc atcttcgtga acacgtaatc 44280agccagggcg cgaacctctt tcgatgcctt
gcgcgcggcc gttttcttga tcttccagac 44340cggcacaccg gatgcgaggg catcggcgat
gctgctgcgc aggccaacgg tggccggaat 44400catcatcttg gggtacgcgg ccagcagctc
ggcttggtgg cgcgcgtggc gcggattccg 44460cgcatcgacc ttgctgggca ccatgccaag
gaattgcagc ttggcgttct tctggcgcac 44520gttcgcaatg gtcgtgacca tcttcttgat
gccctggatg ctgtacgcct caagctcgat 44580gggggacagc acatagtcgg ccgcgaagag
ggcggccgcc aggccgacgc caagggtcgg 44640ggccgtgtcg atcaggcaca cgtcgaagcc
ttggttcgcc agggccttga tgttcgcccc 44700gaacagctcg cgggcgtcgt ccagcgacag
ccgttcggcg ttcgccagta ccgggttgga 44760ctcgatgagg gcgaggcgcg cggcctggcc
gtcgccggct gcgggtgcgg tttcggtcca 44820gccgccggca gggacagcgc cgaacagctt
gcttgcatgc aggccggtag caaagtcctt 44880gagcgtgtag gacgcattgc cctgggggtc
caggtcgatc acggcaaccc gcaagccgcg 44940ctcgaaaaag tcgaaggcaa gatgcacaag
ggtcgaagtc ttgccgacgc cgcctttctg 45000gttggccgtg accaaagttt tcatcgtttg
gtttcctgtt ttttcttggc gtccgcttcc 45060cacttccgga cgatgtacgc ctgatgttcc
ggcagaaccg ccgttacccg cgcgtacccc 45120tcgggcaagt tcttgtcctc gaacgcggcc
cacacgcgat gcaccgcttg cgacactgcg 45180cccctggtca gtcccagcga cgttgcgaac
gtcgcctgtg gcttcccatc gactaagacg 45240ccccgcgcta tctcgatggt ctgctgcccc
acttccagcc cctggatcgc ctcctggaac 45300tggctttcgg taagccgttt cttcatggat
aacacccata atttgctccg cgccttggtt 45360gaacatagcg gtgacagccg ccagcacatg
agagaagttt agctaaacat ttctcgcacg 45420tcaacacctt tagccgctaa aactcgtcct
tggcgtaaca aaacaaaagc ccggaaaccg 45480ggctttcgtc tcttgccgct tatggctctg
cacccggctc catcaccaac aggtcgcgca 45540cgcgcttcac tcggttgcgg atcgacactg
ccagcccaac aaagccggtt gccgccgccg 45600ccaggatcgc gccgatgatg ccggccacac
cggccatcgc ccaccaggtc gccgccttcc 45660ggttccattc ctgctggtac tgcttcgcaa
tgctggacct cggctcacca taggctgacc 45720gctcgatggc gtatgccgct tctccccttg
gcgtaaaacc cagcgccgca ggcggcattg 45780ccatgctgcc cgccgctttc ccgaccacga
cgcgcgcacc aggcttgcgg tccagacctt 45840cggccacggc gagctgcgca aggacataat
cagccgccga cttggctcca cgcgcctcga 45900tcagctcttg cactcgcgcg aaatccttgg
cctccacggc cgccatgaat cgcgcacgcg 45960gcgaaggctc cgcagggccg gcgtcgtgat
cgccgccgag aatgcccttc accaagttcg 46020acgacacgaa aatcatgctg acggctatca
ccatcatgca gacggatcgc acgaacccgc 46080tgaattgaac acgagcacgg cacccgcgac
cactatgcca agaatgccca aggtaaaaat 46140tgccggcccc gccatgaagt ccgtgaatgc
cccgacggcc gaagtgaagg gcaggccgcc 46200acccaggccg ccgccctcac tgcccggcac
ctggtcgctg aatgtcgatg ccagcacctg 46260cggcacgtca atgcttccgg gcgtcgcgct
cgggctgatc gcccatcccg ttactgcccc 46320gatcccggca atggcaagga ctgccagcgc
tgccattttt ggggtgaggc cgttcgcggc 46380cgaggggcgc agcccctggg gggatgggag
gcccgcgtta gcgggccggg agggttcgag 46440aagggggggc accccccttc ggcgtgcgcg
gtcacgcgca cagggcgcag ccctggttaa 46500aaacaaggtt tataaatatt ggtttaaaag
caggttaaaa gacaggttag cggtggccga 46560aaaacgggcg gaaacccttg caaatgctgg
attttctgcc tgtggacagc ccctcaaatg 46620tcaataggtg cgcccctcat ctgtcagcac
tctgcccctc aagtgtcaag gatcgcgccc 46680ctcatctgtc agtagtcgcg cccctcaagt
gtcaataccg cagggcactt atccccaggc 46740ttgtccacat catctgtggg aaactcgcgt
aaaatcaggc gttttcgccg atttgcgagg 46800ctggccagct ccacgtcgcc ggccgaaatc
gagcctgccc ctcatctgtc aacgccgcgc 46860cgggtgagtc ggcccctcaa gtgtcaacgt
ccgcccctca tctgtcagtg agggccaagt 46920tttccgcgag gtatccacaa cgccggcggc
cgcggtgtct cgcacacggc ttcgacggcg 46980tttctggcgc gtttgcaggg ccatagacgg
ccgccagccc agcggcgagg gcaaccagcc 47040cggtgagcgt cggaaaggcg ctggaagccc
cgtagcgacg cggagagggg cgagacaagc 47100caagggcgca ggctcgatgc gcagcacgac
atagccggtt ctcgcaagga cgagaatttc 47160cctgcggtgc ccctcaagtg tcaatgaaag
tttccaacgc gagccattcg cgagagcctt 47220gagtccacgc tagatgagag ctttgttgta
ggtggaccag ttggtgattt tgaacttttg 47280ctttgccacg gaacggtctg cgttgtcggg
aagatgcgtg atctgatcct tcaactcagc 47340aaaagttcga tttattcaac aaagccacgt
tgtgtctcaa aatctctgat gttacattgc 47400acaagataaa aatatatcat catgaacaat
aaaactgtct gcttacataa acagtaatac 47460aaggggtgtt atgagccata ttcaacggga
aacgtcttgc tcgac 475051448232DNAartificial
sequencePHP16973 (ZmCAS1BamProMS45/35SPAT) 14tctagagctc gttcctcgag
gaacggtacc tgcggggaag cttacaataa tgtgtgttgt 60taagtcttgt tgcctgtcat
cgtctgactg actttcgtca taaatcccgg cctccgtaac 120ccagctttgg gcaagctcac
ggatttgatc cggcggaacg ggaatatcga gatgccgggc 180tgaacgctgc agttccagct
ttccctttcg ggacaggtac tccagctgat tgattatctg 240ctgaagggtc ttggttccac
ctcctggcac aatgcgaatg attacttgag cgcgatcggg 300catccaattt tctcccgtca
ggtgcgtggt caagtgctac aaggcacctt tcagtaacga 360gcgaccgtcg atccgtcgcc
gggatacgga caaaatggag cgcagtagtc catcgagggc 420ggcgaaagcc tcgccaaaag
caatacgttc atctcgcaca gcctccagat ccgatcgagg 480gtcttcggcg taggcagata
gaagcatgga tacattgctt gagagtattc cgatggactg 540aagtatggct tccatctttt
ctcgtgtgtc tgcatctatt tcgagaaagc ccccgatgcg 600gcgcaccgca acgcgaattg
ccatactatc cgaaagtccc agcaggcgcg cttgatagga 660aaaggtttca tactcggccg
atcgcagacg ggcactcacg accttgaacc cttcaacttt 720cagggatcga tgctggttga
tggtagtctc actcgacgtg gctctggtgt gttttgacat 780agcttcctcc aaagaaagcg
gaaggtctgg atactccagc acgaaatgtg cccgggtaga 840cggatggaag tctagccctg
ctcaatatga aatcaacagt acatttacag tcaatactga 900atatacttgc tacatttgca
attgtcttat aacgaatgtg aaataaaaat agtgtaacaa 960cgcttttact catcgataat
cacaaaaaca tttatacgaa caaaaataca aatgcactcc 1020ggtttcacag gataggcggg
atcagaatat gcaacttttg acgttttgtt ctttcaaagg 1080gggtgctggc aaaaccaccg
cactcatggg cctttgcgct gctttggcaa atgacggtaa 1140acgagtggcc ctctttgatg
ccgacgaaaa ccggcctctg acgcgatgga gagaaaacgc 1200cttacaaagc agtactggga
tcctcgctgt gaagtctatt ccgccgacga aatgcccctt 1260cttgaagcag cctatgaaaa
tgccgagctc gaaggatttg attatgcgtt ggccgatacg 1320cgtggcggct cgagcgagct
caacaacaca atcatcgcta gctcaaacct gcttctgatc 1380cccaccatgc taacgccgct
cgacatcgat gaggcactat ctacctaccg ctacgtcatc 1440gagctgctgt tgagtgaaaa
tttggcaatt cctacagctg ttttgcgcca acgcgtcccg 1500gtcggccgat tgacaacatc
gcaacgcagg atgtcagaga cgctagagag ccttccagtt 1560gtaccgtctc ccatgcatga
aagagatgca tttgccgcga tgaaagaacg cggcatgttg 1620catcttacat tactaaacac
gggaactgat ccgacgatgc gcctcataga gaggaatctt 1680cggattgcga tggaggaagt
cgtggtcatt tcgaaactga tcagcaaaat cttggaggct 1740tgaagatggc aattcgcaag
cccgcattgt cggtcggcga agcacggcgg cttgctggtg 1800ctcgacccga gatccaccat
cccaacccga cacttgttcc ccagaagctg gacctccagc 1860acttgcctga aaaagccgac
gagaaagacc agcaacgtga gcctctcgtc gccgatcaca 1920tttacagtcc cgatcgacaa
cttaagctaa ctgtggatgc ccttagtcca cctccgtccc 1980cgaaaaagct ccaggttttt
ctttcagcgc gaccgcccgc gcctcaagtg tcgaaaacat 2040atgacaacct cgttcggcaa
tacagtccct cgaagtcgct acaaatgatt ttaaggcgcg 2100cgttggacga tttcgaaagc
atgctggcag atggatcatt tcgcgtggcc ccgaaaagtt 2160atccgatccc ttcaactaca
gaaaaatccg ttctcgttca gacctcacgc atgttcccgg 2220ttgcgttgct cgaggtcgct
cgaagtcatt ttgatccgtt ggggttggag accgctcgag 2280ctttcggcca caagctggct
accgccgcgc tcgcgtcatt ctttgctgga gagaagccat 2340cgagcaattg gtgaagaggg
acctatcgga acccctcacc aaatattgag tgtaggtttg 2400aggccgctgg ccgcgtcctc
agtcaccttt tgagccagat aattaagagc caaatgcaat 2460tggctcaggc tgccatcgtc
cccccgtgcg aaacctgcac gtccgcgtca aagaaataac 2520cggcacctct tgctgttttt
atcagttgag ggcttgacgg atccgcctca agtttgcggc 2580gcagccgcaa aatgagaaca
tctatactcc tgtcgtaaac ctcctcgtcg cgtactcgac 2640tggcaatgag aagttgctcg
cgcgatagaa cgtcgcgggg tttctctaaa aacgcgagga 2700gaagattgaa ctcacctgcc
gtaagtttca cctcaccgcc agcttcggac atcaagcgac 2760gttgcctgag attaagtgtc
cagtcagtaa aacaaaaaga ccgtcggtct ttggagcgga 2820caacgttggg gcgcacgcgc
aaggcaaccc gaatgcgtgc aagaaactct ctcgtactaa 2880acggcttagc gataaaatca
cttgctccta gctcgagtgc aacaacttta tccgtctcct 2940caaggcggtc gccactgata
attatgattg gaatatcaga ctttgccgcc agatttcgaa 3000cgatctcaag cccatcttca
cgacctaaat ttagatcaac aaccacgaca tcgaccgtcg 3060cggaagagag tactctagtg
aactgggtgc tgtcggctac cgcggtcact ttgaaggcgt 3120ggatcgtaag gtattcgata
ataagatgcc gcatagcgac atcgtcatcg ataagaagaa 3180cgtgtttcaa cggctcacct
ttcaatctaa aatctgaacc cttgttcaca gcgcttgaga 3240aattttcacg tgaaggatgt
acaatcatct ccagctaaat gggcagttcg tcagaattgc 3300ggctgaccgc ggatgacgaa
aatgcgaacc aagtatttca attttatgac aaaagttctc 3360aatcgttgtt acaagtgaaa
cgcttcgagg ttacagctac tattgattaa ggagatcgcc 3420tatggtctcg ccccggcgtc
gtgcgtccgc cgcgagccag atctcgccta cttcataaac 3480gtcctcatag gcacggaatg
gaatgatgac atcgatcgcc gtagagagca tgtcaatcag 3540tgtgcgatct tccaagctag
caccttgggc gctacttttg acaagggaaa acagtttctt 3600gaatccttgg attggattcg
cgccgtgtat tgttgaaatc gatcccggat gtcccgagac 3660gacttcactc agataagccc
atgctgcatc gtcgcgcatc tcgccaagca atatccggtc 3720cggccgcata cgcagacttg
cttggagcaa gtgctcggcg ctcacagcac ccagcccagc 3780accgttcttg gagtagagta
gtctaacatg attatcgtgt ggaatgacga gttcgagcgt 3840atcttctatg gtgattagcc
tttcctgggg ggggatggcg ctgatcaagg tcttgctcat 3900tgttgtcttg ccgcttccgg
tagggccaca tagcaacatc gtcagtcggc tgacgacgca 3960tgcgtgcaga aacgcttcca
aatccccgtt gtcaaaatgc tgaaggatag cttcatcatc 4020ctgattttgg cgtttccttc
gtgtctgcca ctggttccac ctcgaagcat cataacggga 4080ggagacttct ttaagaccag
aaacacgcga gcttggccgt cgaatggtca agctgacggt 4140gcccgaggga acggtcggcg
gcagacagat ttgtagtcgt tcaccaccag gaagttcagt 4200ggcgcagagg gggttacgtg
gtccgacatc ctgctttctc agcgcgcccg ctaaaatagc 4260gatatcttca agatcatcat
aagagacggg caaaggcatc ttggtaaaaa tgccggcttg 4320gcgcacaaat gcctctccag
gtcgattgat cgcaatttct tcagtcttcg ggtcatcgag 4380ccattccaaa atcggcttca
gaagaaagcg tagttgcgga tccacttcca tttacaatgt 4440atcctatctc taagcggaaa
tttgaattca ttaagagcgg cggttcctcc cccgcgtggc 4500gccgccagtc aggcggagct
ggtaaacacc aaagaaatcg aggtcccgtg ctacgaaaat 4560ggaaacggtg tcaccctgat
tcttcttcag ggttggcggt atgttgatgg ttgccttaag 4620ggctgtctca gttgtctgct
caccgttatt ttgaaagctg ttgaagctca tcccgccacc 4680cgagctgccg gcgtaggtgc
tagctgcctg gaaggcgcct tgaacaacac tcaagagcat 4740agctccgcta aaacgctgcc
agaagtggct gtcgaccgag cccggcaatc ctgagcgacc 4800gagttcgtcc gcgcttggcg
atgttaacga gatcatcgca tggtcaggtg tctcggcgcg 4860atcccacaac acaaaaacgc
gcccatctcc ctgttgcaag ccacgctgta tttcgccaac 4920aacggtggtg ccacgatcaa
gaagcacgat attgttcgtt gttccacgaa tatcctgagg 4980caagacacac tttacatagc
ctgccaaatt tgtgtcgatt gcggtttgca agatgcacgg 5040aattattgtc ccttgcgtta
ccataaaatc ggggtgcggc aagagcgtgg cgctgctggg 5100ctgcagctcg gtgggtttca
tacgtatcga caaatcgttc tcgccggaca cttcgccatt 5160cggcaaggag ttgtcgtcac
gcttgccttc ttgtcttcgg cccgtgtcgc cctgaatggc 5220gcgtttgctg accccttgat
cgccgctgct atatgcaaaa atcggtgttt cttccggccg 5280tggctcatgc cgctccggtt
cgcccctcgg cggtagagga gcagcaggct gaacagcctc 5340ttgaaccgct ggaggatccg
gcggcacctc aatcggagct ggatgaaatg gcttggtgtt 5400tgttgcgatc aaagttgacg
gcgatgcgtt ctcattcacc ttcttttggc gcccacctag 5460ccaaatgagg cttaatgata
acgcgagaac gacacctccg acgatcaatt tctgagaccc 5520cgaaagacgc cggcgatgtt
tgtcggagac cagggatcca gatgcatcaa cctcatgtgc 5580cgcttgctga ctatcgttat
tcatcccttc gcccccttca ggacgcgttt cacatcgggc 5640ctcaccgtgc ccgtttgcgg
cctttggcca acgggatcgt aagcggtgtt ccagatacat 5700agtactgtgt ggccatccct
cagacgccaa cctcgggaaa ccgaagaaat ctcgacatcg 5760ctccctttaa ctgaatagtt
ggcaacagct tccttgccat caggattgat ggtgtagatg 5820gagggtatgc gtacattgcc
cggaaagtgg aataccgtcg taaatccatt gtcgaagact 5880tcgagtggca acagcgaacg
atcgccttgg gcgacgtagt gccaattact gtccgccgca 5940ccaagggctg tgacaggctg
atccaataaa ttctcagctt tccgttgata ttgtgcttcc 6000gcgtgtagtc tgtccacaac
agccttctgt tgtgcctccc ttcgccgagc cgccgcatcg 6060tcggcggggt aggcgaattg
gacgctgtaa tagagatcgg gctgctcttt atcgaggtgg 6120gacagagtct tggaacttat
actgaaaaca taacggcgca tcccggagtc gcttgcggtt 6180agcacgatta ctggctgagg
cgtgaggacc tggcttgcct tgaaaaatag ataatttccc 6240cgcggtaggg ctgctagatc
tttgctattt gaaacggcaa ccgctgtcac cgtttcgttc 6300gtggcgaatg ttacgaccaa
agtagctcca accgccgtcg agaggcgcac cacttgatcg 6360ggattgtaag ccaaataacg
catgcgcgga tctagcttgc ccgccattgg agtgtcttca 6420gcctccgcac cagtcgcagc
ggcaaataaa catgctaaaa tgaaaagtgc ttttctgatc 6480atggttcgct gtggcctacg
tttgaaacgg tatcttccga tgtctgatag gaggtgacaa 6540ccagacctgc cgggttggtt
agtctcaatc tgccgggcaa gctggtcacc ttttcgtagc 6600gaactgtcgc ggtccacgta
ctcaccacag gcattttgcc gtcaacgacg agggtccttt 6660tatagcgaat ttgctgcgtg
cttggagtta catcatttga agcgatgtgc tcgacctcca 6720ccctgccgcg tttgccaaga
atgacttgag gcgaactggg attgggatag ttgaagaatt 6780gctggtaatc ctggcgcact
gttggggcac tgaagttcga taccaggtcg taggcgtact 6840gagcggtgtc ggcatcataa
ctctcgcgca ggcgaacgta ctcccacaat gaggcgttaa 6900cgacggcctc ctcttgagtt
gcaggcaatc gcgagacaga cacctcgctg tcaacggtgc 6960cgtccggccg tatccataga
tatacgggca caagcctgct caacggcacc attgtggcta 7020tagcgaacgc ttgagcaaca
tttcccaaaa tcgcgatagc tgcgacagct gcaatgagtt 7080tggagagacg tcgcgccgat
ttcgctcgcg cggtttgaaa ggcttctact tccttatagt 7140gctcggcaag gctttcgcgc
gccactagca tggcatattc aggccccgtc atagcgtcca 7200cccgaattgc cgagctgaag
atctgacgga gtaggctgcc atcgccccac attcagcggg 7260aagatcgggc ctttgcagct
cgctaatgtg tcgtttgtct ggcagccgct caaagcgaca 7320actaggcaca gcaggcaata
cttcatagaa ttctccattg aggcgaattt ttgcgcgacc 7380tagcctcgct caacctgagc
gaagcgacgg tacaagctgc tggcagattg ggttgcgccg 7440ctccagtaac tgcctccaat
gttgccggcg atcgccggca aagcgacaat gagcgcatcc 7500cctgtcagaa aaaacatatc
gagttcgtaa agaccaatga tcttggccgc ggtcgtaccg 7560gcgaaggtga ttacaccaag
cataagggtg agcgcagtcg cttcggttag gatgacgatc 7620gttgccacga ggtttaagag
gagaagcaag agaccgtagg tgataagttg cccgatccac 7680ttagctgcga tgtcccgcgt
gcgatcaaaa atatatccga cgaggatcag aggcccgatc 7740gcgagaagca ctttcgtgag
aattccaacg gcgtcgtaaa ctccgaaggc agaccagagc 7800gtgccgtaaa ggacccactg
tgccccttgg aaagcaagga tgtcctggtc gttcatcgga 7860ccgatttcgg atgcgatttt
ctgaaaaacg gcctgggtca cggcgaacat tgtatccaac 7920tgtgccggaa cagtctgcag
aggcaagccg gttacactaa actgctgaac aaagtttggg 7980accgtctttt cgaagatgga
aaccacatag tcttggtagt tagcctgccc aacaattaga 8040gcaacaacga tggtgaccgt
gatcacccga gtgataccgc tacgggtatc gacttcgccg 8100cgtatgacta aaataccctg
aacaataatc caaagagtga cacaggcgat caatggcgca 8160ctcaccgcct cctggatagt
ctcaagcatc gagtccaagc ctgtcgtgaa ggctacatcg 8220aagatcgtat gaatggccgt
aaacggcgcc ggaatcgtga aattcatcga ttggacctga 8280acttgactgg tttgtcgcat
aatgttggat aaaatgagct cgcattcggc gaggatgcgg 8340gcggatgaac aaatcgccca
gccttagggg agggcaccaa agatgacagc ggtcttttga 8400tgctccttgc gttgagcggc
cgcctcttcc gcctcgtgaa ggccggcctg cgcggtagtc 8460atcgttaata ggcttgtcgc
ctgtacattt tgaatcattg cgtcatggat ctgcttgaga 8520agcaaaccat tggtcacggt
tgcctgcatg atattgcgag atcgggaaag ctgagcagac 8580gtatcagcat tcgccgtcaa
gcgtttgtcc atcgtttcca gattgtcagc cgcaatgcca 8640gcgctgtttg cggaaccggt
gatctgcgat cgcaacaggt ccgcttcagc atcactaccc 8700acgactgcac gatctgtatc
gctggtgatc gcacgtgccg tggtcgacat tggcattcgc 8760ggcgaaaaca tttcattgtc
taggtccttc gtcgaaggat actgattttt ctggttgagc 8820gaagtcagta gtccagtaac
gccgtaggcc gacgtcaaca tcgtaaccat cgctatagtc 8880tgagtgagat tctccgcagt
cgcgagcgca gtcgcgagcg tctcagcctc cgttgccggg 8940tcgctaacaa caaactgcgc
ccgcgcgggc tgaatatata gaaagctgca ggtcaaaact 9000gttgcaataa gttgcgtcgt
cttcatcgtt tcctacctta tcaatcttct gcctcgtggt 9060gacgggccat gaattcgctg
agccagccag atgagttgcc ttcttgtgcc tcgcgtagtc 9120gagttgcaaa gcgcaccgtg
ttggcacgcc ccgaaagcac ggcgacatat tcacgcatat 9180cccgcagatc aaattcgcag
atgacgcttc cactttctcg tttaagaaga aacttacggc 9240tgccgaccgt catgtcttca
cggatcgcct gaaattcctt ttcggtacat ttcagtccat 9300cgacataagc cgatcgatct
gcggttggtg atggatagaa aatcttcgtc atacattgcg 9360caaccaagct ggctcctagc
ggcgattcca gaacatgctc tggttgctgc gttgccagta 9420ttagcatccc gttgtttttt
cgaacggtca ggaggaattt gtcgacgaca gtcgaaaatt 9480tagggtttaa caaataggcg
cgaaactcat cgcagctcat cacaaaacgg cggccgtcga 9540tcatggctcc aatccgatgc
aggagatatg ctgcagcggg agcgcatact tcctcgtatt 9600cgagaagatg cgtcatgtcg
aagccggtaa tcgacggatc taactttact tcgtcaactt 9660cgccgtcaaa tgcccagcca
agcgcatggc cccggcacca gcgttggagc cgcgctcctg 9720cgccttcggc gggcccatgc
aacaaaaatt cacgtaaccc cgcgattgaa cgcatttgtg 9780gatcaaacga gagctgacga
tggataccac ggaccagacg gcggttctct tccggagaaa 9840tcccaccccg accatcactc
tcgatgagag ccacgatcca ttcgcgcaga aaatcgtgtg 9900aggctgctgt gttttctagg
ccacgcaacg gcgccaaccc gctgggtgtg cctctgtgaa 9960gtgccaaata tgttcctcct
gtggcgcgaa ccagcaattc gccaccccgg tccttgtcaa 10020agaacacgac cgtacctgca
cggtcgacca tgctctgttc gagcatggct agaacaaaca 10080tcatgagcgt cgtcttaccc
ctcccgatag gcccgaatat tgccgtcatg ccaacatcgt 10140gctcatgcgg gatatagtcg
aaaggcgttc cgccattggt acgaaatcgg gcaatcgcgt 10200tgccccagtg gcctgagctg
gcgccctctg gaaagttttc gaaagagaca aaccctgcga 10260aattgcgtga agtgattgcg
ccagggcgtg tgcgccactt aaaattcccc ggcaattggg 10320accaataggc cgcttccata
ccaatacctt cttggacaac cacggcacct gcatccgcca 10380ttcgtgtccg agcccgcgcg
cccctgtccc caagactatt gagatcgtct gcatagacgc 10440aaaggctcaa atgatgtgag
cccataacga attcgttgct cgcaagtgcg tcctcagcct 10500cggataattt gccgatttga
gtcacggctt tatcgccgga actcagcatc tggctcgatt 10560tgaggctaag tttcgcgtgc
gcttgcgggc gagtcaggaa cgaaaaactc tgcgtgagaa 10620caagtggaaa atcgagggat
agcagcgcgt tgagcatgcc cggccgtgtt tttgcagggt 10680attcgcgaaa cgaatagatg
gatccaacgt aactgtcttt tggcgttctg atctcgagtc 10740ctcgcttgcc gcaaatgact
ctgtcggtat aaatcgaagc gccgagtgag ccgctgacga 10800ccggaaccgg tgtgaaccga
ccagtcatga tcaaccgtag cgcttcgcca atttcggtga 10860agagcacacc ctgcttctcg
cggatgccaa gacgatgcag gccatacgct ttaagagagc 10920cagcgacaac atgccaaaga
tcttccatgt tcctgatctg gcccgtgaga tcgttttccc 10980tttttccgct tagcttggtg
aacctcctct ttaccttccc taaagccgcc tgtgggtaga 11040caatcaacgt aaggaagtgt
tcattgcgga ggagttggcc ggagagcacg cgctgttcaa 11100aagcttcgtt caggctagcg
gcgaaaacac tacggaagtg tcgcggcgcc gatgatggca 11160cgtcggcatg acgtacgagg
tgagcatata ttgacacatg atcatcagcg atattgcgca 11220acagcgtgtt gaacgcacga
caacgcgcat tgcgcatttc agtttcctca agctcgaatg 11280caacgccatc aattctcgca
atggtcatga tcgatccgtc ttcaagaagg acgatatggt 11340cgctgaggtg gccaatataa
gggagataga tctcaccgga tctttcggtc gttccactcg 11400cgccgagcat cacaccattc
ctctccctcg tgggggaacc ctaattggat ttgggctaac 11460agtagcgccc ccccaaactg
cactatcaat gcttcttccc gcggtccgca aaaatagcag 11520gacgacgctc gccgcattgt
agtctcgctc cacgatgagc cgggctgcaa accataacgg 11580cacgagaacg acttcgtaga
gcgggttctg aacgataacg atgacaaagc cggcgaacat 11640catgaataac cctgccaatg
tcagtggcac cccaagaaac aatgcgggcc gtgtggctgc 11700gaggtaaagg gtcgattctt
ccaaacgatc agccatcaac taccgccagt gagcgtttgg 11760ccgaggaagc tcgccccaaa
catgataaca atgccgccga cgacgccggc aaccagccca 11820agcgaagccc gcccgaacat
ccaggagatc ccgatagcga caatgccgag aacagcgagt 11880gactggccga acggaccaag
gataaacgtg catatattgt taaccattgt ggcggggtca 11940gtgccgccac ccgcagattg
cgctgcggcg ggtccggatg aggaaatgct ccatgcaatt 12000gcaccgcaca agcttggggc
gcagctcgat atcacgcgca tcatcgcatt cgagagcgag 12060aggcgattta gatgtaaacg
gtatctctca aagcatcgca tcaatgcgca cctccttagt 12120ataagtcgaa taagacttga
ttgtcgtctg cggatttgcc gttgtcctgg tgtggcggtg 12180gcggagcgat taaaccgcca
gcgccatcct cctgcgagcg gcgctgatat gacccccaaa 12240catcccacgt ctcttcggat
tttagcgcct cgtgatcgtc ttttggaggc tcgattaacg 12300cgggcaccag cgattgagca
gctgtttcaa cttttcgcac gtagccgttt gcaaaaccgc 12360cgatgaaatt accggtgttg
taagcggaga tcgcccgacg aagcgcaaat tgcttctcgt 12420caatcgtttc gccgcctgca
taacgacttt tcagcatgtt tgcagcggca gataatgatg 12480tgcacgcctg gagcgcaccg
tcaggtgtca gaccgagcat agaaaaattt cgagagttta 12540tttgcatgag gccaacatcc
agcgaatgcc gtgcatcgag acggtgcctg acgacttggg 12600ttgcttggct gtgatcttgc
cagtgaagcg tttcgccggt cgtgttgtca tgaatcgcta 12660aaggatcaaa gcgactctcc
accttagcta tcgccgcaag cgtagatgtc gcaactgatg 12720gggcacactt gcgagcaaca
tggtcaaact cagcagatga gagtggcgtg gcaaggctcg 12780acgaacagaa ggagaccatc
aaggcaagag aaagcgaccc cgatctctta agcatacctt 12840atctccttag ctcgcaacta
acaccgcctc tcccgttgga agaagtgcgt tgttttatgt 12900tgaagattat cgggagggtc
ggttactcga aaattttcaa ttgcttcttt atgatttcaa 12960ttgaagcgag aaacctcgcc
cggcgtcttg gaacgcaaca tggaccgaga accgcgcatc 13020catgactaag caaccggatc
gacctattca ggccgcagtt ggtcaggtca ggctcagaac 13080gaaaatgctc ggcgaggtta
cgctgtctgt aaacccattc gatgaacggg aagcttcctt 13140ccgattgctc ttggcaggaa
tattggccca tgcctgcttg cgctttgcaa atgctcttat 13200cgcgttggta tcatatgcct
tgtccgccag cagaaacgca ctctaagcga ttatttgtaa 13260aaatgtttcg gtcatgcggc
ggtcatgggc ttgacccgct gtcagcgcaa gacggatcgg 13320tcaaccgtcg gcatcgacaa
cagcgtgaat cttggtggtc aaaccgccac gggaacgtcc 13380catacagcca tcgtcttgat
cccgctgttt cccgtcgccg catgttggtg gacgcggaca 13440caggaactgt caatcatgac
gacattctat cgaaagcctt ggaaatcaca ctcagaatat 13500gatcccagac gtctgcctca
cgccatcgta caaagcgatt gtagcaggtt gtacaggaac 13560cgtatcgatc aggaacgtct
gcccagggcg ggcccgtccg gaagcgccac aagatgacat 13620tgatcacccg cgtcaacgcg
cggcacgcga cgcggcttat ttgggaacaa aggactgaac 13680aacagtccat tcgaaatcgg
tgacatcaaa gcggggacgg gttatcagtg gcctccaagt 13740caagcctcaa tgaatcaaaa
tcagaccgat ttgcaaacct gatttatgag tgtgcggcct 13800aaatgatgaa atcgtccttc
tagatcgcct ccgtggtgta gcaacacctc gcagtatcgc 13860cgtgctgacc ttggccaggg
aattgactgg caagggtgct ttcacatgac cgctcttttg 13920gccgcgatag atgatttcgt
tgctgctttg ggcacgtaga aggagagaag tcatatcgga 13980gaaattcctc ctggcgcgag
agcctgctct atcgcgacgg catcccactg tcgggaacag 14040accggatcat tcacgaggcg
aaagtcgtca acacatgcgt tataggcatc ttcccttgaa 14100ggatgatctt gttgctgcca
atctggaggt gcggcagccg caggcagatg cgatctcagc 14160gcaacttgcg gcaaaacatc
tcactcacct gaaaaccact agcgagtctc gcgatcagac 14220gaaggccttt tacttaacga
cacaatatcc gatgtctgca tcacaggcgt cgctatccca 14280gtcaatacta aagcggtgca
ggaactaaag attactgatg acttaggcgt gccacgaggc 14340ctgagacgac gcgcgtagac
agttttttga aatcattatc aaagtgatgg cctccgctga 14400agcctatcac ctctgcgccg
gtctgtcgga gagatgggca agcattatta cggtcttcgc 14460gcccgtacat gcattggacg
attgcagggt caatggatct gagatcatcc agaggattgc 14520cgcccttacc ttccgtttcg
agttggagcc agcccctaaa tgagacgaca tagtcgactt 14580gatgtgacaa tgccaagaga
gagatttgct taacccgatt tttttgctca agcgtaagcc 14640tattgaagct tgccggcatg
acgtccgcgc cgaaagaata tcctacaagt aaaacattct 14700gcacaccgaa atgcttggtg
tagacatcga ttatgtgacc aagatcctta gcagtttcgc 14760ttggggaccg ctccgaccag
aaataccgaa gtgaactgac gccaatgaca ggaatccctt 14820ccgtctgcag ataggtacca
tcgatagatc tgctgcctcg cgcgtttcgg tgatgacggt 14880gaaaacctct gacacatgca
gctcccggag acggtcacag cttgtctgta agcggatgcc 14940gggagcagac aagcccgtca
gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagcc 15000atgacccagt cacgtagcga
tagcggagtg tatactggct taactatgcg gcatcagagc 15060agattgtact gagagtgcac
catatgcggt gtgaaatacc gcacagatgc gtaaggagaa 15120aataccgcat caggcgctct
tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 15180ggctgcggcg agcggtatca
gctcactcaa aggcggtaat acggttatcc acagaatcag 15240gggataacgc aggaaagaac
atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 15300aggccgcgtt gctggcgttt
ttccataggc tccgcccccc tgacgagcat cacaaaaatc 15360gacgctcaag tcagaggtgg
cgaaacccga caggactata aagataccag gcgtttcccc 15420ctggaagctc cctcgtgcgc
tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 15480cctttctccc ttcgggaagc
gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 15540cggtgtaggt cgttcgctcc
aagctgggct gtgtgcacga accccccgtt cagcccgacc 15600gctgcgcctt atccggtaac
tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 15660cactggcagc agccactggt
aacaggatta gcagagcgag gtatgtaggc ggtgctacag 15720agttcttgaa gtggtggcct
aactacggct acactagaag gacagtattt ggtatctgcg 15780ctctgctgaa gccagttacc
ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 15840ccaccgctgg tagcggtggt
ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 15900gatctcaaga agatcctttg
atcttttcta cggggtctga cgctcagtgg aacgaaaact 15960cacgttaagg gattttggtc
atgagattat caaaaaggat cttcacctag atccttttaa 16020attaaaaatg aagttttaaa
tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 16080accaatgctt aatcagtgag
gcacctatct cagcgatctg tctatttcgt tcatccatag 16140ttgcctgact ccccgtcgtg
tagataacta cgatacggga gggcttacca tctggcccca 16200gtgctgcaat gataccgcga
gacccacgct caccggctcc agatttatca gcaataaacc 16260agccagccgg aagggccgag
cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 16320ctattaattg ttgccgggaa
gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 16380ttgttgccat tgctgcaggg
gggggggggg ggggggactt ccattgttca ttccacggac 16440aaaaacagag aaaggaaacg
acagaggcca aaaagcctcg ctttcagcac ctgtcgtttc 16500ctttcttttc agagggtatt
ttaaataaaa acattaagtt atgacgaaga agaacggaaa 16560cgccttaaac cggaaaattt
tcataaatag cgaaaacccg cgaggtcgcc gccccgtaac 16620ctgtcggatc accggaaagg
acccgtaaag tgataatgat tatcatctac atatcacaac 16680gtgcgtggag gccatcaaac
cacgtcaaat aatcaattat gacgcaggta tcgtattaat 16740tgatctgcat caacttaacg
taaaaacaac ttcagacaat acaaatcagc gacactgaat 16800acggggcaac ctcatgtccc
cccccccccc ccccctgcag ggcatcgtgg tgtcacgctc 16860gtcgtttggt atggcttcat
tcagctccgg ttcccaacga tcaaggcgag ttacatgatc 16920ccccatgttg tgcaaaaaag
cggttagctc cttcggtcct ccgatcgttg tcagaagtaa 16980gttggccgca gtgttatcac
tcatggttat ggcagcactg cataattctc ttactgtcat 17040gccatccgta agatgctttt
ctgtgactgg tgagtactca accaagtcat tctgagaata 17100gtgtatgcgg cgaccgagtt
gctcttgccc ggcgtcaaca cgggataata ccgcgccaca 17160tagcagaact ttaaaagtgc
tcatcattgg aaaacgttct tcggggcgaa aactctcaag 17220gatcttaccg ctgttgagat
ccagttcgat gtaacccact cgtgcaccca actgatcttc 17280agcatctttt actttcacca
gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc 17340aaaaaaggga ataagggcga
cacggaaatg ttgaatactc atactcttcc tttttcaata 17400ttattgaagc atttatcagg
gttattgtct catgagcgga tacatatttg aatgtattta 17460gaaaaataaa caaatagggg
ttccgcgcac atttccccga aaagtgccac ctgacgtcta 17520agaaaccatt attatcatga
cattaaccta taaaaatagg cgtatcacga ggccctttcg 17580tcttcaagaa ttggtcgacg
atcttgctgc gttcggatat tttcgtggag ttcccgccac 17640agacccggat tgaaggcgag
atccagcaac tcgcgccaga tcatcctgtg acggaacttt 17700ggcgcgtgat gactggccag
gacgtcggcc gaaagagcga caagcagatc acgcttttcg 17760acagcgtcgg atttgcgatc
gaggattttt cggcgctgcg ctacgtccgc gaccgcgttg 17820agggatcaag ccacagcagc
ccactcgacc ttctagccga cccagacgag ccaagggatc 17880tttttggaat gctgctccgt
cgtcaggctt tccgacgttt gggtggttga acagaagtca 17940ttatcgtacg gaatgccaag
cactcccgag gggaaccctg tggttggcat gcacatacaa 18000atggacgaac ggataaacct
tttcacgccc ttttaaatat ccgttattct aataaacgct 18060cttttctctt aggtttaccc
gccaatatat cctgtcaaac actgatagtt taaactgaag 18120gcgggaaacg acaatctgat
catgagcgga gaattaaggg agtcacgtta tgacccccgc 18180cgatgacgcg ggacaagccg
ttttacgttt ggaactgaca gaaccgcaac gttgaaggag 18240ccactcagcc caagcttgat
atcgaattcc tgcagcccgg gggatccggg gcggaagatg 18300gcagggacgc ggattcaggg
cggacgcgct tgccgagggc gcgggggacc acagcgtgcg 18360ttacggggac agggcgggca
tcgcgaggac gggtgcggga gcggagccac atctggtggt 18420ggacgcctac tttgctctct
tatagagtag taaagattcg tggaccaaac aacaccctag 18480cttgtacaaa tattcttagg
cagttgctac tgatgagaga aaaataacat cactccactg 18540catttgcgtg atttattgaa
cagatcacaa ttacatctat tcaaatttat ttacctgtac 18600gtgtccgatt tttaggggag
gattttttta cggtattttt tttttaaaaa aataaattta 18660ggcaacaatt ttatagaatc
gagtgcttta tctattatct tttacaaggc acacgcgtac 18720aataaggttt ggtcgttcgt
gacttggata gtggttttgg ttgcaattcc gtaattcttg 18780gcataggata cagcccaacc
cagaaaaaaa taatgttgcg gtcagttctg gctttgagat 18840tcggagtacc acgtggcgta
aaggcaggcc gtgtcttaca gatgaataaa ggacctgggt 18900ctcacgtgat tggtttccag
tttcgtgcat caagatgtgg aattttcaaa ctgccgtcgt 18960gtttgtttcg tcacataaaa
gctttttgga aggctaagga gaggaagccg gcgagaagga 19020gggggcgttt tacgtgtcac
tgtcctgtcg tgttggctgt tgacacgaat catttcttcc 19080gcgcgtggga agaagaagat
gcacattagc ggcctgaagt agagatgtca atggggaatt 19140ccccagcggg gattaactcc
ccagacccgt acccatgaac atagaccggc ccccatcccc 19200gaacccgaac ccgacctcgg
gtacgaaaat cctcccatac ccattcccga ccgggtacta 19260aatacccatg ggtatccata
cccgacccga ttattcaaaa attaatgggc tttttatttg 19320ttaaccggcg gacgcaatgc
ttgggactct aggttttttt actttgttga ccggctggcg 19380gctgggcttt ttcctacagg
cccaaagttg gtcggcagcc actaggccac acgtcacagg 19440cagcccacaa gtaaatgtcg
ttggattgct ggatggtgga ataaaaatcc tagatgctag 19500attgttctgg ttccgggtat
ttttctccat ggctaatcgg gtttgggttt agccctccca 19560aacccgaacc cgccataccc
gatgggtaag ggatttattc caaatctata cccatgggga 19620tttgttttaa cccatacctt
aaccctaata gaggaattcc ccacgggtaa tcgggtttcg 19680gggcccattg acatctctag
actgaaggcg tccaactcaa atcattaaaa agtgttgacg 19740cacgcgctga tgcgccggcc
gcacagcaca ggctgcacag cccgtttaat cagcgatgga 19800gccccggccg tcagccagcc
aggtccggcg tccgggtctg cgccctgcgg cgtcactgct 19860gtcgccaccg tctccgatgg
tcccacatcc atccagcggg ccgcgcgtgg tacaaaaggc 19920tcttcctcgc cgtcaggtgc
agctgcccaa acaccagaca cagactccac caccccgctt 19980cgatcttctg ttgcagctga
aatctgtcag attctgcagt tcattcctca tggagaagag 20040gaacctgcag tggcggcgag
ggcgtgatgg catcgtgcag taccctcacc tcttcttcgc 20100ggccctggcg ctggccctcc
tagtcgcgga ccgttcggcc tcagtccgct ggccgaggtc 20160gactaccggc cggtgaagca
cgagctcgcg ccgtacgggg aggtcatggg cagctggccc 20220agagacaatg ccagccggct
caggcgcggg aggctggagt tcgtcggcga ggtgttcggg 20280ccggagtcta tcgagttcga
tctccagggc cgcgggccgt acgccggcct cgccgacggc 20340cgcgtcgtgc ggtggatggg
cgaggaggcc gggtgggaga cgttcgccgg tcatgaatcc 20400tgactggtaa gtgctcgata
tgcctccggc gtccactcgt tacagtgcta taatatagta 20460gtactaagat attttgatct
gattttttgc attcttggga gaaacgtcat gcaaaatttg 20520ttgtttcttg gcaaaggtca
gaagaagtct gtgccaatgg agtgaactca acgacgagga 20580agcagcacga gaaggaggag
ttctgcggcg gccgctcggc ctgaggttcc acggggagac 20640cggcgagctc tacgtcgccg
acgcgtacta cggtctcatg gtcgttggcc agagcggcgg 20700cgtggcgtcc tccgtcgcga
gggaagccga cggggacccc atccggttcg cgaacgacct 20760cgatgtgcac aggaatggat
ccgtattctt cactgacacg agcatgagat acagcagaaa 20820gtgagcaaag cagcgtaaca
atccggcttc tcattttcaa acgcctctgt attctctgct 20880gaaagagtag ctcaccagac
aagagctgaa tttgcaggga ccatctgaac atcctgttag 20940aaggagaagg caccgggagg
ctgctcaggt atgatccaga aacaagcggt gtccatgtcg 21000tgctcaaggg gctggtgttc
ccaaacggcg tgcagatctc agaggaccat cagtttcttc 21060tcttctccga gacaacaaac
tgcaggtaac aaaaatacta tctgacgatg ctcatgattc 21120taccgtatcc atagtcatga
acacaaacca cacgaatctg gccttgacca ggataatgag 21180gtactggctg gaaggcccaa
gagcgggcga ggtagaggtg ttcgcgaacc tgccgggctt 21240ccccgacaac gtgcgctcca
acggcagggg ccagttctgg gtggcgatcg actgctgccg 21300gaccgaggag gtgttcccaa
gagcgtggct ccggaccctg tacttcaagt tcccgctgtc 21360gctcaaggtg ctcacttgga
aggccgccag gaggatgcac acggtgctcg cgctcctcga 21420cggcgaaggg cgcgtcgtgg
aggtgctcga ggaccggggc cacgaggtga tgaagctggt 21480gagcgaggtg cgggaggtgg
gccgcaagct gtggatcgga accgtggcgc acaaccacat 21540cgccaccatc ccctaccctt
tagaggacta accatgatct atgctgtttc aatgcctcct 21600aatctgtgta cgtctataaa
tgtctaatgc agctcatggt tgtaatcttg tttgtgtttg 21660gcaaattggc ataataatgg
acagattcaa tgggcattgg tgctgtagtc gcatcacact 21720aattgaatgg gatcatgttg
agctctcact ttgctacaat ttgctccagc ttgtacggtt 21780gtaccctctt gctcgtctat
agtaagggcc atctaaaaaa aactcaaatt agatctgcaa 21840tacaagtatg attgggccga
atttggattg tcacgggtcc gcgaccgcga attgggctcc 21900ggtttgattt agccgacata
gtagtgaccg acccgagccg gccggcgaga ccaaaccgag 21960cggacgcccg ccatgcatgg
agtcaaagat tcaaatagag gacctaacag aactcgccgt 22020aaagactggc gaacagttca
tacagagtct cttacgactc aatgacaaga agaaaatctt 22080cgtcaacatg gtggagcacg
acacgcttgt ctactccaaa aatatcaaag atacagtctc 22140agaagaccaa agggcaattg
agacttttca acaaagggta atatccggaa acctcctcgg 22200attccattgc ccagctatct
gtcactttat tgtgaagata gtggaaaagg aaggtggctc 22260ctacaaatgc catcattgcg
ataaaggaaa ggccatcgtt gaagatgcct ctgccgacag 22320tggtcccaaa gatggacccc
cacccacgag gagcatcgtg gaaaaagaag acgttccaac 22380cacgtcttca aagcaagtgg
attgatgtga tatctccact gacgtaaggg atgacgcaca 22440atcccactat ccttcgcaag
acccttcctc tatataagga agttcatttc atttggagag 22500gacagggtac ccggggatcc
accatgtctc cggagaggag accagttgag attaggccag 22560ctacagcagc tgatatggcc
gcggtttgtg atatcgttaa ccattacatt gagacgtcta 22620cagtgaactt taggacagag
ccacaaacac cacaagagtg gattgatgat ctagagaggt 22680tgcaagatag atacccttgg
ttggttgctg aggttgaggg tgttgtggct ggtattgctt 22740acgctgggcc ctggaaggct
aggaacgctt acgattggac agttgagagt actgtttacg 22800tgtcacatag gcatcaaagg
ttgggcctag gatccacatt gtacacacat ttgcttaagt 22860ctatggaggc gcaaggtttt
aagtctgtgg ttgctgttat aggccttcca aacgatccat 22920ctgttaggtt gcatgaggct
ttgggataca cagcccgggg tacattgcgc gcagctggat 22980acaagcatgg tggatggcat
gatgttggtt tttggcaaag ggattttgag ttgccagctc 23040ctccaaggcc agttaggcca
gttacccaga tctgagtcga cctgcaggca tgccgctgaa 23100atcaccagtc tctctctaca
aatctatctc tctctataat aatgtgtgag tagttcccag 23160ataagggaat tagggttctt
atagggtttc gctcatgtgt tgagcatata agaaaccctt 23220agtatgtatt tgtatttgta
aaatacttct atcaataaaa tttctaattc ctaaaaccaa 23280aatccagtgg gtaccgagct
cgaattcagt acattaaaaa cgtccgcaat gtgttattaa 23340gttgtctaag cgtcaatttg
tttacaccac aatatatcct gccaccagcc agccaacagc 23400tccccgaccg gcagctcggc
acaaaatcac cactcgatac aggcagccca tcagtccggg 23460acggcgtcag cgggagagcc
gttgtaaggc ggcagacttt gctcatgtta ccgatgctat 23520tcggaagaac ggcaactaag
ctgccgggtt tgaaacacgg atgatctcgc ggagggtagc 23580atgttgattg taacgatgac
agagcgttgc tgcctgtgat caaatatcat ctccctcgca 23640gagatccgaa ttatcagcct
tcttattcat ttctcgctta accgtgacag gctgtcgatc 23700ttgagaacta tgccgacata
ataggaaatc gctggataaa gccgctgagg aagctgagtg 23760gcgctatttc tttagaagtg
aacgttgacg atcgtcgacc gtaccccgat gaattaattc 23820ggacgtacgt tctgaacaca
gctggatact tacttgggcg attgtcatac atgacatcaa 23880caatgtaccc gtttgtgtaa
ccgtctcttg gaggttcgta tgacactagt ggttcccctc 23940agcttgcgac tagatgttga
ggcctaacat tttattagag agcaggctag ttgcttagat 24000acatgatctt caggccgtta
tctgtcaggg caagcgaaaa ttggccattt atgacgacca 24060atgccccgca gaagctccca
tctttgccgc catagacgcc gcgcccccct tttggggtgt 24120agaacatcct tttgccagat
gtggaaaaga agttcgttgt cccattgttg gcaatgacgt 24180agtagccggc gaaagtgcga
gacccatttg cgctatatat aagcctacga tttccgttgc 24240gactattgtc gtaattggat
gaactattat cgtagttgct ctcagagttg tcgtaatttg 24300atggactatt gtcgtaattg
cttatggagt tgtcgtagtt gcttggagaa atgtcgtagt 24360tggatgggga gtagtcatag
ggaagacgag cttcatccac taaaacaatt ggcaggtcag 24420caagtgcctg ccccgatgcc
atcgcaagta cgaggcttag aaccaccttc aacagatcgc 24480gcatagtctt ccccagctct
ctaacgcttg agttaagccg cgccgcgaag cggcgtcggc 24540ttgaacgaat tgttagacat
tatttgccga ctaccttggt gatctcgcct ttcacgtagt 24600gaacaaattc ttccaactga
tctgcgcgcg aggccaagcg atcttcttgt ccaagataag 24660cctgcctagc ttcaagtatg
acgggctgat actgggccgg caggcgctcc attgcccagt 24720cggcagcgac atccttcggc
gcgattttgc cggttactgc gctgtaccaa atgcgggaca 24780acgtaagcac tacatttcgc
tcatcgccag cccagtcggg cggcgagttc catagcgtta 24840aggtttcatt tagcgcctca
aatagatcct gttcaggaac cggatcaaag agttcctccg 24900ccgctggacc taccaaggca
acgctatgtt ctcttgcttt tgtcagcaag atagccagat 24960caatgtcgat cgtggctggc
tcgaagatac ctgcaagaat gtcattgcgc tgccattctc 25020caaattgcag ttcgcgctta
gctggataac gccacggaat gatgtcgtcg tgcacaacaa 25080tggtgacttc tacagcgcgg
agaatctcgc tctctccagg ggaagccgaa gtttccaaaa 25140ggtcgttgat caaagctcgc
cgcgttgttt catcaagcct tacggtcacc gtaaccagca 25200aatcaatatc actgtgtggc
ttcaggccgc catccactgc ggagccgtac aaatgtacgg 25260ccagcaacgt cggttcgaga
tggcgctcga tgacgccaac tacctctgat agttgagtcg 25320atacttcggc gatcaccgct
tccctcatga tgtttaactc ctgaattaag ccgcgccgcg 25380aagcggtgtc ggcttgaatg
aattgttagg cgtcatcctg tgctcccgag aaccagtacc 25440agtacatcgc tgtttcgttc
gagacttgag gtctagtttt atacgtgaac aggtcaatgc 25500cgccgagagt aaagccacat
tttgcgtaca aattgcaggc aggtacattg ttcgtttgtg 25560tctctaatcg tatgccaagg
agctgtctgc ttagtgccca ctttttcgca aattcgatga 25620gactgtgcgc gactcctttg
cctcggtgcg tgtgcgacac aacaatgtgt tcgatagagg 25680ctagatcgtt ccatgttgag
ttgagttcaa tcttcccgac aagctcttgg tcgatgaatg 25740cgccatagca agcagagtct
tcatcagagt catcatccga gatgtaatcc ttccggtagg 25800ggctcacact tctggtagat
agttcaaagc cttggtcgga taggtgcaca tcgaacactt 25860cacgaacaat gaaatggttc
tcagcatcca atgtttccgc cacctgctca gggatcaccg 25920aaatcttcat atgacgccta
acgcctggca cagcggatcg caaacctggc gcggcttttg 25980gcacaaaagg cgtgacaggt
ttgcgaatcc gttgctgcca cttgttaacc cttttgccag 26040atttggtaac tataatttat
gttagaggcg aagtcttggg taaaaactgg cctaaaattg 26100ctggggattt caggaaagta
aacatcacct tccggctcga tgtctattgt agatatatgt 26160agtgtatcta cttgatcggg
ggatctgctg cctcgcgcgt ttcggtgatg acggtgaaaa 26220cctctgacac atgcagctcc
cggagacggt cacagcttgt ctgtaagcgg atgccgggag 26280cagacaagcc cgtcagggcg
cgtcagcggg tgttggcggg tgtcggggcg cagccatgac 26340ccagtcacgt agcgatagcg
gagtgtatac tggcttaact atgcggcatc agagcagatt 26400gtactgagag tgcaccatat
gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac 26460cgcatcaggc gctcttccgc
ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 26520cggcgagcgg tatcagctca
ctcaaaggcg gtaatacggt tatccacaga atcaggggat 26580aacgcaggaa agaacatgtg
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 26640gcgttgctgg cgtttttcca
taggctccgc ccccctgacg agcatcacaa aaatcgacgc 26700tcaagtcaga ggtggcgaaa
cccgacagga ctataaagat accaggcgtt tccccctgga 26760agctccctcg tgcgctctcc
tgttccgacc ctgccgctta ccggatacct gtccgccttt 26820ctcccttcgg gaagcgtggc
gctttctcat agctcacgct gtaggtatct cagttcggtg 26880taggtcgttc gctccaagct
gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc 26940gccttatccg gtaactatcg
tcttgagtcc aacccggtaa gacacgactt atcgccactg 27000gcagcagcca ctggtaacag
gattagcaga gcgaggtatg taggcggtgc tacagagttc 27060ttgaagtggt ggcctaacta
cggctacact agaaggacag tatttggtat ctgcgctctg 27120ctgaagccag ttaccttcgg
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc 27180gctggtagcg gtggtttttt
tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct 27240caagaagatc ctttgatctt
ttctacgggg tctgacgctc agtggaacga aaactcacgt 27300taagggattt tggtcatgag
attatcaaaa aggatcttca cctagatcct tttaaattaa 27360aaatgaagtt ttaaatcaat
ctaaagtata tatgagtaaa cttggtctga cagttaccaa 27420tgcttaatca gtgaggcacc
tatctcagcg atctgtctat ttcgttcatc catagttgcc 27480tgactccccg tcgtgtagat
aactacgata cgggagggct taccatctgg ccccagtgct 27540gcaatgatac cgcgagaccc
acgctcaccg gctccagatt tatcagcaat aaaccagcca 27600gccggaaggg ccgagcgcag
aagtggtcct gcaactttat ccgcctccat ccagtctatt 27660aattgttgcc gggaagctag
agtaagtagt tcgccagtta atagtttgcg caacgttgtt 27720gccattgctg cagggggggg
gggggggggg gacttccatt gttcattcca cggacaaaaa 27780cagagaaagg aaacgacaga
ggccaaaaag cctcgctttc agcacctgtc gtttcctttc 27840ttttcagagg gtattttaaa
taaaaacatt aagttatgac gaagaagaac ggaaacgcct 27900taaaccggaa aattttcata
aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc 27960ggatcaccgg aaaggacccg
taaagtgata atgattatca tctacatatc acaacgtgcg 28020tggaggccat caaaccacgt
caaataatca attatgacgc aggtatcgta ttaattgatc 28080tgcatcaact taacgtaaaa
acaacttcag acaatacaaa tcagcgacac tgaatacggg 28140gcaacctcat gtcccccccc
cccccccccc tgcaggcatc gtggtgtcac gctcgtcgtt 28200tggtatggct tcattcagct
ccggttccca acgatcaagg cgagttacat gatcccccat 28260gttgtgcaaa aaagcggtta
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc 28320cgcagtgtta tcactcatgg
ttatggcagc actgcataat tctcttactg tcatgccatc 28380cgtaagatgc ttttctgtga
ctggtgagta ctcaaccaag tcattctgag aatagtgtat 28440gcggcgaccg agttgctctt
gcccggcgtc aacacgggat aataccgcgc cacatagcag 28500aactttaaaa gtgctcatca
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt 28560accgctgttg agatccagtt
cgatgtaacc cactcgtgca cccaactgat cttcagcatc 28620ttttactttc accagcgttt
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa 28680gggaataagg gcgacacgga
aatgttgaat actcatactc ttcctttttc aatattattg 28740aagcatttat cagggttatt
gtctcatgag cggatacata tttgaatgta tttagaaaaa 28800taaacaaata ggggttccgc
gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac 28860cattattatc atgacattaa
cctataaaaa taggcgtatc acgaggccct ttcgtcttca 28920agaattcgga gcttttgcca
ttctcaccgg attcagtcgt cactcatggt gatttctcac 28980ttgataacct tatttttgac
gaggggaaat taataggttg tattgatgtt ggacgagtcg 29040gaatcgcaga ccgataccag
gatcttgcca tcctatggaa ctgcctcggt gagttttctc 29100cttcattaca gaaacggctt
tttcaaaaat atggtattga taatcctgat atgaataaat 29160tgcagtttca tttgatgctc
gatgagtttt tctaatcaga attggttaat tggttgtaac 29220actggcagag cattacgctg
acttgacggg acggcggctt tgttgaataa atcgaacttt 29280tgctgagttg aaggatcaga
tcacgcatct tcccgacaac gcagaccgtt ccgtggcaaa 29340gcaaaagttc aaaatcacca
actggtccac ctacaacaaa gctctcatca accgtggctc 29400cctcactttc tggctggatg
atggggcgat tcaggcctgg tatgagtcag caacaccttc 29460ttcacgaggc agacctcagc
gccagaaggc cgccagagag gccgagcgcg gccgtgaggc 29520ttggacgcta gggcagggca
tgaaaaagcc cgtagcgggc tgctacgggc gtctgacgcg 29580gtggaaaggg ggaggggatg
ttgtctacat ggctctgctg tagtgagtgg gttgcgctcc 29640ggcagcggtc ctgatcaatc
gtcacccttt ctcggtcctt caacgttcct gacaacgagc 29700ctccttttcg ccaatccatc
gacaatcacc gcgagtccct gctcgaacgc tgcgtccgga 29760ccggcttcgt cgaaggcgtc
tatcgcggcc cgcaacagcg gcgagagcgg agcctgttca 29820acggtgccgc cgcgctcgcc
ggcatcgctg tcgccggcct gctcctcaag cacggcccca 29880acagtgaagt agctgattgt
catcagcgca ttgacggcgt ccccggccga aaaacccgcc 29940tcgcagagga agcgaagctg
cgcgtcggcc gtttccatct gcggtgcgcc cggtcgcgtg 30000ccggcatgga tgcgcgcgcc
atcgcggtag gcgagcagcg cctgcctgaa gctgcgggca 30060ttcccgatca gaaatgagcg
ccagtcgtcg tcggctctcg gcaccgaatg cgtatgattc 30120tccgccagca tggcttcggc
cagtgcgtcg agcagcgccc gcttgttcct gaagtgccag 30180taaagcgccg gctgctgaac
ccccaaccgt tccgccagtt tgcgtgtcgt cagaccgtct 30240acgccgacct cgttcaacag
gtccagggcg gcacggatca ctgtattcgg ctgcaacttt 30300gtcatgcttg acactttatc
actgataaac ataatatgtc caccaactta tcagtgataa 30360agaatccgcg cgttcaatcg
gaccagcgga ggctggtccg gaggccagac gtgaaaccca 30420acatacccct gatcgtaatt
ctgagcactg tcgcgctcga cgctgtcggc atcggcctga 30480ttatgccggt gctgccgggc
ctcctgcgcg atctggttca ctcgaacgac gtcaccgccc 30540actatggcat tctgctggcg
ctgtatgcgt tggtgcaatt tgcctgcgca cctgtgctgg 30600gcgcgctgtc ggatcgtttc
gggcggcggc caatcttgct cgtctcgctg gccggcgcca 30660ctgtcgacta cgccatcatg
gcgacagcgc ctttcctttg ggttctctat atcgggcgga 30720tcgtggccgg catcaccggg
gcgactgggg cggtagccgg cgcttatatt gccgatatca 30780ctgatggcga tgagcgcgcg
cggcacttcg gcttcatgag cgcctgtttc gggttcggga 30840tggtcgcggg acctgtgctc
ggtgggctga tgggcggttt ctccccccac gctccgttct 30900tcgccgcggc agccttgaac
ggcctcaatt tcctgacggg ctgtttcctt ttgccggagt 30960cgcacaaagg cgaacgccgg
ccgttacgcc gggaggctct caacccgctc gcttcgttcc 31020ggtgggcccg gggcatgacc
gtcgtcgccg ccctgatggc ggtcttcttc atcatgcaac 31080ttgtcggaca ggtgccggcc
gcgctttggg tcattttcgg cgaggatcgc tttcactggg 31140acgcgaccac gatcggcatt
tcgcttgccg catttggcat tctgcattca ctcgcccagg 31200caatgatcac cggccctgta
gccgcccggc tcggcgaaag gcgggcactc atgctcggaa 31260tgattgccga cggcacaggc
tacatcctgc ttgccttcgc gacacgggga tggatggcgt 31320tcccgatcat ggtcctgctt
gcttcgggtg gcatcggaat gccggcgctg caagcaatgt 31380tgtccaggca ggtggatgag
gaacgtcagg ggcagctgca aggctcactg gcggcgctca 31440ccagcctgac ctcgatcgtc
ggacccctcc tcttcacggc gatctatgcg gcttctataa 31500caacgtggaa cgggtgggca
tggattgcag gcgctgccct ctacttgctc tgcctgccgg 31560cgctgcgtcg cgggctttgg
agcggcgcag ggcaacgagc cgatcgctga tcgtggaaac 31620gataggccta tgccatgcgg
gtcaaggcga cttccggcaa gctatacgcg ccctaggagt 31680gcggttggaa cgttggccca
gccagatact cccgatcacg agcaggacgc cgatgatttg 31740aagcgcactc agcgtctgat
ccaagaacaa ccatcctagc aacacggcgg tccccgggct 31800gagaaagccc agtaaggaaa
caactgtagg ttcgagtcgc gagatccccc ggaaccaaag 31860gaagtaggtt aaacccgctc
cgatcaggcc gagccacgcc aggccgagaa cattggttcc 31920tgtaggcatc gggattggcg
gatcaaacac taaagctact ggaacgagca gaagtcctcc 31980ggccgccagt tgccaggcgg
taaaggtgag cagaggcacg ggaggttgcc acttgcgggt 32040cagcacggtt ccgaacgcca
tggaaaccgc ccccgccagg cccgctgcga cgccgacagg 32100atctagcgct gcgtttggtg
tcaacaccaa cagcgccacg cccgcagttc cgcaaatagc 32160ccccaggacc gccatcaatc
gtatcgggct acctagcaga gcggcagaga tgaacacgac 32220catcagcggc tgcacagcgc
ctaccgtcgc cgcgaccccg cccggcaggc ggtagaccga 32280aataaacaac aagctccaga
atagcgaaat attaagtgcg ccgaggatga agatgcgcat 32340ccaccagatt cccgttggaa
tctgtcggac gatcatcacg agcaataaac ccgccggcaa 32400cgcccgcagc agcataccgg
cgacccctcg gcctcgctgt tcgggctcca cgaaaacgcc 32460ggacagatgc gccttgtgag
cgtccttggg gccgtcctcc tgtttgaaga ccgacagccc 32520aatgatctcg ccgtcgatgt
aggcgccgaa tgccacggca tctcgcaacc gttcagcgaa 32580cgcctccatg ggctttttct
cctcgtgctc gtaaacggac ccgaacatct ctggagcttt 32640cttcagggcc gacaatcgga
tctcgcggaa atcctgcacg tcggccgctc caagccgtcg 32700aatctgagcc ttaatcacaa
ttgtcaattt taatcctctg tttatcggca gttcgtagag 32760cgcgccgtgc gtcccgagcg
atactgagcg aagcaagtgc gtcgagcagt gcccgcttgt 32820tcctgaaatg ccagtaaagc
gctggctgct gaacccccag ccggaactga ccccacaagg 32880ccctagcgtt tgcaatgcac
caggtcatca ttgacccagg cgtgttccac caggccgctg 32940cctcgcaact cttcgcaggc
ttcgccgacc tgctcgcgcc acttcttcac gcgggtggaa 33000tccgatccgc acatgaggcg
gaaggtttcc agcttgagcg ggtacggctc ccggtgcgag 33060ctgaaatagt cgaacatccg
tcgggccgtc ggcgacagct tgcggtactt ctcccatatg 33120aatttcgtgt agtggtcgcc
agcaaacagc acgacgattt cctcgtcgat caggacctgg 33180caacgggacg ttttcttgcc
acggtccagg acgcggaagc ggtgcagcag cgacaccgat 33240tccaggtgcc caacgcggtc
ggacgtgaag cccatcgccg tcgcctgtag gcgcgacagg 33300cattcctcgg ccttcgtgta
ataccggcca ttgatcgacc agcccaggtc ctggcaaagc 33360tcgtagaacg tgaaggtgat
cggctcgccg ataggggtgc gcttcgcgta ctccaacacc 33420tgctgccaca ccagttcgtc
atcgtcggcc cgcagctcga cgccggtgta ggtgatcttc 33480acgtccttgt tgacgtggaa
aatgaccttg ttttgcagcg cctcgcgcgg gattttcttg 33540ttgcgcgtgg tgaacagggc
agagcgggcc gtgtcgtttg gcatcgctcg catcgtgtcc 33600ggccacggcg caatatcgaa
caaggaaagc tgcatttcct tgatctgctg cttcgtgtgt 33660ttcagcaacg cggcctgctt
ggcctcgctg acctgttttg ccaggtcctc gccggcggtt 33720tttcgcttct tggtcgtcat
agttcctcgc gtgtcgatgg tcatcgactt cgccaaacct 33780gccgcctcct gttcgagacg
acgcgaacgc tccacggcgg ccgatggcgc gggcagggca 33840gggggagcca gttgcacgct
gtcgcgctcg atcttggccg tagcttgctg gaccatcgag 33900ccgacggact ggaaggtttc
gcggggcgca cgcatgacgg tgcggcttgc gatggtttcg 33960gcatcctcgg cggaaaaccc
cgcgtcgatc agttcttgcc tgtatgcctt ccggtcaaac 34020gtccgattca ttcaccctcc
ttgcgggatt gccccgactc acgccggggc aatgtgccct 34080tattcctgat ttgacccgcc
tggtgccttg gtgtccagat aatccacctt atcggcaatg 34140aagtcggtcc cgtagaccgt
ctggccgtcc ttctcgtact tggtattccg aatcttgccc 34200tgcacgaata ccagcgaccc
cttgcccaaa tacttgccgt gggcctcggc ctgagagcca 34260aaacacttga tgcggaagaa
gtcggtgcgc tcctgcttgt cgccggcatc gttgcgccac 34320tcttcattaa ccgctatatc
gaaaattgct tgcggcttgt tagaattgcc atgacgtacc 34380tcggtgtcac gggtaagatt
accgataaac tggaactgat tatggctcat atcgaaagtc 34440tccttgagaa aggagactct
agtttagcta aacattggtt ccgctgtcaa gaactttagc 34500ggctaaaatt ttgcgggccg
cgaccaaagg tgcgaggggc ggcttccgct gtgtacaacc 34560agatattttt caccaacatc
cttcgtctgc tcgatgagcg gggcatgacg aaacatgagc 34620tgtcggagag ggcaggggtt
tcaatttcgt ttttatcaga cttaaccaac ggtaaggcca 34680acccctcgtt gaaggtgatg
gaggccattg ccgacgccct ggaaactccc ctacctcttc 34740tcctggagtc caccgacctt
gaccgcgagg cactcgcgga gattgcgggt catcctttca 34800agagcagcgt gccgcccgga
tacgaacgca tcagtgtggt tttgccgtca cataaggcgt 34860ttatcgtaaa gaaatggggc
gacgacaccc gaaaaaagct gcgtggaagg ctctgacgcc 34920aagggttagg gcttgcactt
ccttctttag ccgctaaaac ggccccttct ctgcgggccg 34980tcggctcgcg catcatatcg
acatcctcaa cggaagccgt gccgcgaatg gcatcgggcg 35040ggtgcgcttt gacagttgtt
ttctatcaga acccctacgt cgtgcggttc gattagctgt 35100ttgtcttgca ggctaaacac
tttcggtata tcgtttgcct gtgcgataat gttgctaatg 35160atttgttgcg taggggttac
tgaaaagtga gcgggaaaga agagtttcag accatcaagg 35220agcgggccaa gcgcaagctg
gaacgcgaca tgggtgcgga cctgttggcc gcgctcaacg 35280acccgaaaac cgttgaagtc
atgctcaacg cggacggcaa ggtgtggcac gaacgccttg 35340gcgagccgat gcggtacatc
tgcgacatgc ggcccagcca gtcgcaggcg attatagaaa 35400cggtggccgg attccacggc
aaagaggtca cgcggcattc gcccatcctg gaaggcgagt 35460tccccttgga tggcagccgc
tttgccggcc aattgccgcc ggtcgtggcc gcgccaacct 35520ttgcgatccg caagcgcgcg
gtcgccatct tcacgctgga acagtacgtc gaggcgggca 35580tcatgacccg cgagcaatac
gaggtcatta aaagcgccgt cgcggcgcat cgaaacatcc 35640tcgtcattgg cggtactggc
tcgggcaaga ccacgctcgt caacgcgatc atcaatgaaa 35700tggtcgcctt caacccgtct
gagcgcgtcg tcatcatcga ggacaccggc gaaatccagt 35760gcgccgcaga gaacgccgtc
caataccaca ccagcatcga cgtctcgatg acgctgctgc 35820tcaagacaac gctgcgtatg
cgccccgacc gcatcctggt cggtgaggta cgtggccccg 35880aagcccttga tctgttgatg
gcctggaaca ccgggcatga aggaggtgcc gccaccctgc 35940acgcaaacaa ccccaaagcg
ggcctgagcc ggctcgccat gcttatcagc atgcacccgg 36000attcaccgaa acccattgag
ccgctgattg gcgaggcggt tcatgtggtc gtccatatcg 36060ccaggacccc tagcggccgt
cgagtgcaag aaattctcga agttcttggt tacgagaacg 36120gccagtacat caccaaaacc
ctgtaaggag tatttccaat gacaacggct gttccgttcc 36180gtctgaccat gaatcgcggc
attttgttct accttgccgt gttcttcgtt ctcgctctcg 36240cgttatccgc gcatccggcg
atggcctcgg aaggcaccgg cggcagcttg ccatatgaga 36300gctggctgac gaacctgcgc
aactccgtaa ccggcccggt ggccttcgcg ctgtccatca 36360tcggcatcgt cgtcgccggc
ggcgtgctga tcttcggcgg cgaactcaac gccttcttcc 36420gaaccctgat cttcctggtt
ctggtgatgg cgctgctggt cggcgcgcag aacgtgatga 36480gcaccttctt cggtcgtggt
gccgaaatcg cggccctcgg caacggggcg ctgcaccagg 36540tgcaagtcgc ggcggcggat
gccgtgcgtg cggtagcggc tggacggctc gcctaatcat 36600ggctctgcgc acgatcccca
tccgtcgcgc aggcaaccga gaaaacctgt tcatgggtgg 36660tgatcgtgaa ctggtgatgt
tctcgggcct gatggcgttt gcgctgattt tcagcgccca 36720agagctgcgg gccaccgtgg
tcggtctgat cctgtggttc ggggcgctct atgcgttccg 36780aatcatggcg aaggccgatc
cgaagatgcg gttcgtgtac ctgcgtcacc gccggtacaa 36840gccgtattac ccggcccgct
cgaccccgtt ccgcgagaac accaatagcc aagggaagca 36900ataccgatga tccaagcaat
tgcgattgca atcgcgggcc tcggcgcgct tctgttgttc 36960atcctctttg cccgcatccg
cgcggtcgat gccgaactga aactgaaaaa gcatcgttcc 37020aaggacgccg gcctggccga
tctgctcaac tacgccgctg tcgtcgatga cggcgtaatc 37080gtgggcaaga acggcagctt
tatggctgcc tggctgtaca agggcgatga caacgcaagc 37140agcaccgacc agcagcgcga
agtagtgtcc gcccgcatca accaggccct cgcgggcctg 37200ggaagtgggt ggatgatcca
tgtggacgcc gtgcggcgtc ctgctccgaa ctacgcggag 37260cggggcctgt cggcgttccc
tgaccgtctg acggcagcga ttgaagaaga gcgctcggtc 37320ttgccttgct cgtcggtgat
gtacttcacc agctccgcga agtcgctctt cttgatggag 37380cgcatgggga cgtgcttggc
aatcacgcgc accccccggc cgttttagcg gctaaaaaag 37440tcatggctct gccctcgggc
ggaccacgcc catcatgacc ttgccaagct cgtcctgctt 37500ctcttcgatc ttcgccagca
gggcgaggat cgtggcatca ccgaaccgcg ccgtgcgcgg 37560gtcgtcggtg agccagagtt
tcagcaggcc gcccaggcgg cccaggtcgc cattgatgcg 37620ggccagctcg cggacgtgct
catagtccac gacgcccgtg attttgtagc cctggccgac 37680ggccagcagg taggccgaca
ggctcatgcc ggccgccgcc gccttttcct caatcgctct 37740tcgttcgtct ggaaggcagt
acaccttgat aggtgggctg cccttcctgg ttggcttggt 37800ttcatcagcc atccgcttgc
cctcatctgt tacgccggcg gtagccggcc agcctcgcag 37860agcaggattc ccgttgagca
ccgccaggtg cgaataaggg acagtgaaga aggaacaccc 37920gctcgcgggt gggcctactt
cacctatcct gcccggctga cgccgttgga tacaccaagg 37980aaagtctaca cgaacccttt
ggcaaaatcc tgtatatcgt gcgaaaaagg atggatatac 38040cgaaaaaatc gctataatga
ccccgaagca gggttatgca gcggaaaagc gctgcttccc 38100tgctgttttg tggaatatct
accgactgga aacaggcaaa tgcaggaaat tactgaactg 38160aggggacagg cgagagacga
tgccaaagag ctacaccgac gagctggccg agtgggttga 38220atcccgcgcg gccaagaagc
gccggcgtga tgaggctgcg gttgcgttcc tggcggtgag 38280ggcggatgtc gaggcggcgt
tagcgtccgg ctatgcgctc gtcaccattt gggagcacat 38340gcgggaaacg gggaaggtca
agttctccta cgagacgttc cgctcgcacg ccaggcggca 38400catcaaggcc aagcccgccg
atgtgcccgc accgcaggcc aaggctgcgg aacccgcgcc 38460ggcacccaag acgccggagc
cacggcggcc gaagcagggg ggcaaggctg aaaagccggc 38520ccccgctgcg gccccgaccg
gcttcacctt caacccaaca ccggacaaaa aggatctact 38580gtaatggcga aaattcacat
ggttttgcag ggcaagggcg gggtcggcaa gtcggccatc 38640gccgcgatca ttgcgcagta
caagatggac aaggggcaga cacccttgtg catcgacacc 38700gacccggtga acgcgacgtt
cgagggctac aaggccctga acgtccgccg gctgaacatc 38760atggccggcg acgaaattaa
ctcgcgcaac ttcgacaccc tggtcgagct gattgcgccg 38820accaaggatg acgtggtgat
cgacaacggt gccagctcgt tcgtgcctct gtcgcattac 38880ctcatcagca accaggtgcc
ggctctgctg caagaaatgg ggcatgagct ggtcatccat 38940accgtcgtca ccggcggcca
ggctctcctg gacacggtga gcggcttcgc ccagctcgcc 39000agccagttcc cggccgaagc
gcttttcgtg gtctggctga acccgtattg ggggcctatc 39060gagcatgagg gcaagagctt
tgagcagatg aaggcgtaca cggccaacaa ggcccgcgtg 39120tcgtccatca tccagattcc
ggccctcaag gaagaaacct acggccgcga tttcagcgac 39180atgctgcaag agcggctgac
gttcgaccag gcgctggccg atgaatcgct cacgatcatg 39240acgcggcaac gcctcaagat
cgtgcggcgc ggcctgtttg aacagctcga cgcggcggcc 39300gtgctatgag cgaccagatt
gaagagctga tccgggagat tgcggccaag cacggcatcg 39360ccgtcggccg cgacgacccg
gtgctgatcc tgcataccat caacgcccgg ctcatggccg 39420acagtgcggc caagcaagag
gaaatccttg ccgcgttcaa ggaagagctg gaagggatcg 39480cccatcgttg gggcgaggac
gccaaggcca aagcggagcg gatgctgaac gcggccctgg 39540cggccagcaa ggacgcaatg
gcgaaggtaa tgaaggacag cgccgcgcag gcggccgaag 39600cgatccgcag ggaaatcgac
gacggccttg gccgccagct cgcggccaag gtcgcggacg 39660cgcggcgcgt ggcgatgatg
aacatgatcg ccggcggcat ggtgttgttc gcggccgccc 39720tggtggtgtg ggcctcgtta
tgaatcgcag aggcgcagat gaaaaagccc ggcgttgccg 39780ggctttgttt ttgcgttagc
tgggcttgtt tgacaggccc aagctctgac tgcgcccgcg 39840ctcgcgctcc tgggcctgtt
tcttctcctg ctcctgcttg cgcatcaggg cctggtgccg 39900tcgggctgct tcacgcatcg
aatcccagtc gccggccagc tcgggatgct ccgcgcgcat 39960cttgcgcgtc gccagttcct
cgatcttggg cgcgtgaatg cccatgcctt ccttgatttc 40020gcgcaccatg tccagccgcg
tgtgcagggt ctgcaagcgg gcttgctgtt gggcctgctg 40080ctgctgccag gcggcctttg
tacgcggcag ggacagcaag ccgggggcat tggactgtag 40140ctgctgcaaa cgcgcctgct
gacggtctac gagctgttct aggcggtcct cgatgcgctc 40200cacctggtca tgctttgcct
gcacgtagag cgcaagggtc tgctggtagg tctgctcgat 40260gggcgcggat tctaagaggg
cctgctgttc cgtctcggcc tcctgggccg cctgtagcaa 40320atcctcgccg ctgttgccgc
tggactgctt tactgccggg gactgctgtt gccctgctcg 40380cgccgtcgtc gcagttcggc
ttgcccccac tcgattgact gcttcatttc gagccgcagc 40440gatgcgatct cggattgcgt
caacggacgg ggcagcgcgg aggtgtccgg cttctccttg 40500ggtgagtcgg tcgatgccat
agccaaaggt ttccttccaa aatgcgtcca ttgctggacc 40560gtgtttctca ttgatgcccg
caagcatctt cggcttgacc gccaggtcaa gcgcgccttc 40620atgggcggtc atgacggacg
ccgccatgac cttgccgccg ttgttctcga tgtagccgcg 40680taatgaggca atggtgccgc
ccatcgtcag cgtgtcatcg acaacgatgt acttctggcc 40740ggggatcacc tccccctcga
aagtcgggtt gaacgccagg cgatgatctg aaccggctcc 40800ggttcgggcg accttctccc
gctgcacaat gtccgtttcg acctcaaggc caaggcggtc 40860ggccagaacg accgccatca
tggccggaat cttgttgttc cccgccgcct cgacggcgag 40920gactggaacg atgcggggct
tgtcgtcgcc gatcagcgtc ttgagctggg caacagtgtc 40980gtccgaaatc aggcgctcga
ccaaattaag cgccgcttcc gcgtcgccct gcttcgcagc 41040ctggtattca ggctcgttgg
tcaaagaacc aaggtcgccg ttgcgaacca ccttcgggaa 41100gtctccccac ggtgcgcgct
cggctctgct gtagctgctc aagacgcctc cctttttagc 41160cgctaaaact ctaacgagtg
cgcccgcgac tcaacttgac gctttcggca cttacctgtg 41220ccttgccact tgcgtcatag
gtgatgcttt tcgcactccc gatttcaggt actttatcga 41280aatctgaccg ggcgtgcatt
acaaagttct tccccacctg ttggtaaatg ctgccgctat 41340ctgcgtggac gatgctgccg
tcgtggcgct gcgacttatc ggccttttgg gccatataga 41400tgttgtaaat gccaggtttc
agggccccgg ctttatctac cttctggttc gtccatgcgc 41460cttggttctc ggtctggaca
attctttgcc cattcatgac caggaggcgg tgtttcattg 41520ggtgactcct gacggttgcc
tctggtgtta aacgtgtcct ggtcgcttgc cggctaaaaa 41580aaagccgacc tcggcagttc
gaggccggct ttccctagag ccgggcgcgt caaggttgtt 41640ccatctattt tagtgaactg
cgttcgattt atcagttact ttcctcccgc tttgtgtttc 41700ctcccactcg tttccgcgtc
tagccgaccc ctcaacatag cggcctcttc ttgggctgcc 41760tttgcctctt gccgcgcttc
gtcacgctcg gcttgcaccg tcgtaaagcg ctcggcctgc 41820ctggccgcct cttgcgccgc
caacttcctt tgctcctggt gggcctcggc gtcggcctgc 41880gccttcgctt tcaccgctgc
caactccgtg cgcaaactct ccgcttcgcg cctggtggcg 41940tcgcgctcgc cgcgaagcgc
ctgcatttcc tggttggccg cgtccagggt cttgcggctc 42000tcttctttga atgcgcgggc
gtcctggtga gcgtagtcca gctcggcgcg cagctcctgc 42060gctcgacgct ccacctcgtc
ggcccgctgc gtcgccagcg cggcccgctg ctcggctcct 42120gccagggcgg tgcgtgcttc
ggccagggct tgccgctggc gtgcggccag ctcggccgcc 42180tcggcggcct gctgctctag
caatgtaacg cgcgcctggg cttcttccag ctcgcgggcc 42240tgcgcctcga aggcgtcggc
cagctccccg cgcacggctt ccaactcgtt gcgctcacga 42300tcccagccgg cttgcgctgc
ctgcaacgat tcattggcaa gggcctgggc ggcttgccag 42360agggcggcca cggcctggtt
gccggcctgc tgcaccgcgt ccggcacctg gactgccagc 42420ggggcggcct gcgccgtgcg
ctggcgtcgc cattcgcgca tgccggcgct ggcgtcgttc 42480atgttgacgc gggcggcctt
acgcactgca tccacggtcg ggaagttctc ccggtcgcct 42540tgctcgaaca gctcgtccgc
agccgcaaaa atgcggtcgc gcgtctcttt gttcagttcc 42600atgttggctc cggtaattgg
taagaataat aatactctta cctaccttat cagcgcaaga 42660gtttagctga acagttctcg
acttaacggc aggtttttta gcggctgaag ggcaggcaaa 42720aaaagccccg cacggtcggc
gggggcaaag ggtcagcggg aaggggatta gcgggcgtcg 42780ggcttcttca tgcgtcgggg
ccgcgcttct tgggatggag cacgacgaag cgcgcacgcg 42840catcgtcctc ggccctatcg
gcccgcgtcg cggtcaggaa cttgtcgcgc gctaggtcct 42900ccctggtggg caccaggggc
atgaactcgg cctgctcgat gtaggtccac tccatgaccg 42960catcgcagtc gaggccgcgt
tccttcaccg tctcttgcag gtcgcggtac gcccgctcgt 43020tgagcggctg gtaacgggcc
aattggtcgt aaatggctgt cggccatgag cggcctttcc 43080tgttgagcca gcagccgacg
acgaagccgg caatgcaggc ccctggcaca accaggccga 43140cgccgggggc aggggatggc
agcagctcgc caaccaggaa ccccgccgcg atgatgccga 43200tgccggtcaa ccagcccttg
aaactatccg gccccgaaac acccctgcgc attgcctgga 43260tgctgcgccg gatagcttgc
aacatcagga gccgtttctt ttgttcgtca gtcatggtcc 43320gccctcacca gttgttcgta
tcggtgtcgg acgaactgaa atcgcaagag ctgccggtat 43380cggtccagcc gctgtccgtg
tcgctgctgc cgaagcacgg cgaggggtcc gcgaacgccg 43440cagacggcgt atccggccgc
agcgcatcgc ccagcatggc cccggtcagc gagccgccgg 43500ccaggtagcc cagcatggtg
ctgttggtcg ccccggccac cagggccgac gtgacgaaat 43560cgccgtcatt ccctctggat
tgttcgctgc tcggcggggc agtgcgccgc gccggcggcg 43620tcgtggatgg ctcgggttgg
ctggcctgcg acggccggcg aaaggtgcgc agcagctcgt 43680tatcgaccgg ctgcggcgtc
ggggccgccg ccttgcgctg cggtcggtgt tccttcttcg 43740gctcgcgcag cttgaacagc
atgatcgcgg aaaccagcag caacgccgcg cctacgcctc 43800ccgcgatgta gaacagcatc
ggattcattc ttcggtcctc cttgtagcgg aaccgttgtc 43860tgtgcggcgc gggtggcccg
cgccgctgtc tttggggatc agccctcgat gagcgcgacc 43920agtttcacgt cggcaaggtt
cgcctcgaac tcctggccgt cgtcctcgta cttcaaccag 43980gcatagcctt ccgccggcgg
ccgacggttg aggataaggc gggcagggcg ctcgtcgtgc 44040tcgacctgga cgatggcctt
tttcagcttg tccgggtccg gctccttcgc gcccttttcc 44100ttggcgtcct taccgtcctg
gtcgccgtcc tcgccgtcct ggccgtcgcc ggcctccgcg 44160tcacgctcgg catcagtctg
gccgttgaag gcatcgacgg tgttgggatc gcggcccttc 44220tcgtccagga actcgcgcag
cagcttgacc gtgccgcgcg tgatttcctg ggtgtcgtcg 44280tcaagccacg cctcgacttc
ctccgggcgc ttcttgaagg ccgtcaccag ctcgttcacc 44340acggtcacgt cgcgcacgcg
gccggtgttg aacgcatcgg cgatcttctc cggcaggtcc 44400agcagcgtga cgtgctgggt
gatgaacgcc ggcgacttgc cgatttcctt ggcgatatcg 44460cctttcttct tgcccttcgc
cagctcgcgg ccaatgaagt cggcaatttc gcgcggggtc 44520agctcgttgc gttgcaggtt
ctcgataacc tggtcggctt cgttgtagtc gttgtcgatg 44580aacgccggga tggacttctt
gccggcccac ttcgagccac ggtagcggcg ggcgccgtga 44640ttgatgatat agcggcccgg
ctgctcctgg ttctcgcgca ccgaaatggg tgacttcacc 44700ccgcgctctt tgatcgtggc
accgatttcc gcgatgctct ccggggaaaa gccggggttg 44760tcggccgtcc gcggctgatg
cggatcttcg tcgatcaggt ccaggtccag ctcgataggg 44820ccggaaccgc cctgagacgc
cgcaggagcg tccaggaggc tcgacaggtc gccgatgcta 44880tccaacccca ggccggacgg
ctgcgccgcg cctgcggctt cctgagcggc cgcagcggtg 44940tttttcttgg tggtcttggc
ttgagccgca gtcattggga aatctccatc ttcgtgaaca 45000cgtaatcagc cagggcgcga
acctctttcg atgccttgcg cgcggccgtt ttcttgatct 45060tccagaccgg cacaccggat
gcgagggcat cggcgatgct gctgcgcagg ccaacggtgg 45120ccggaatcat catcttgggg
tacgcggcca gcagctcggc ttggtggcgc gcgtggcgcg 45180gattccgcgc atcgaccttg
ctgggcacca tgccaaggaa ttgcagcttg gcgttcttct 45240ggcgcacgtt cgcaatggtc
gtgaccatct tcttgatgcc ctggatgctg tacgcctcaa 45300gctcgatggg ggacagcaca
tagtcggccg cgaagagggc ggccgccagg ccgacgccaa 45360gggtcggggc cgtgtcgatc
aggcacacgt cgaagccttg gttcgccagg gccttgatgt 45420tcgccccgaa cagctcgcgg
gcgtcgtcca gcgacagccg ttcggcgttc gccagtaccg 45480ggttggactc gatgagggcg
aggcgcgcgg cctggccgtc gccggctgcg ggtgcggttt 45540cggtccagcc gccggcaggg
acagcgccga acagcttgct tgcatgcagg ccggtagcaa 45600agtccttgag cgtgtaggac
gcattgccct gggggtccag gtcgatcacg gcaacccgca 45660agccgcgctc gaaaaagtcg
aaggcaagat gcacaagggt cgaagtcttg ccgacgccgc 45720ctttctggtt ggccgtgacc
aaagttttca tcgtttggtt tcctgttttt tcttggcgtc 45780cgcttcccac ttccggacga
tgtacgcctg atgttccggc agaaccgccg ttacccgcgc 45840gtacccctcg ggcaagttct
tgtcctcgaa cgcggcccac acgcgatgca ccgcttgcga 45900cactgcgccc ctggtcagtc
ccagcgacgt tgcgaacgtc gcctgtggct tcccatcgac 45960taagacgccc cgcgctatct
cgatggtctg ctgccccact tccagcccct ggatcgcctc 46020ctggaactgg ctttcggtaa
gccgtttctt catggataac acccataatt tgctccgcgc 46080cttggttgaa catagcggtg
acagccgcca gcacatgaga gaagtttagc taaacatttc 46140tcgcacgtca acacctttag
ccgctaaaac tcgtccttgg cgtaacaaaa caaaagcccg 46200gaaaccgggc tttcgtctct
tgccgcttat ggctctgcac ccggctccat caccaacagg 46260tcgcgcacgc gcttcactcg
gttgcggatc gacactgcca gcccaacaaa gccggttgcc 46320gccgccgcca ggatcgcgcc
gatgatgccg gccacaccgg ccatcgccca ccaggtcgcc 46380gccttccggt tccattcctg
ctggtactgc ttcgcaatgc tggacctcgg ctcaccatag 46440gctgaccgct cgatggcgta
tgccgcttct ccccttggcg taaaacccag cgccgcaggc 46500ggcattgcca tgctgcccgc
cgctttcccg accacgacgc gcgcaccagg cttgcggtcc 46560agaccttcgg ccacggcgag
ctgcgcaagg acataatcag ccgccgactt ggctccacgc 46620gcctcgatca gctcttgcac
tcgcgcgaaa tccttggcct ccacggccgc catgaatcgc 46680gcacgcggcg aaggctccgc
agggccggcg tcgtgatcgc cgccgagaat gcccttcacc 46740aagttcgacg acacgaaaat
catgctgacg gctatcacca tcatgcagac ggatcgcacg 46800aacccgctga attgaacacg
agcacggcac ccgcgaccac tatgccaaga atgcccaagg 46860taaaaattgc cggccccgcc
atgaagtccg tgaatgcccc gacggccgaa gtgaagggca 46920ggccgccacc caggccgccg
ccctcactgc ccggcacctg gtcgctgaat gtcgatgcca 46980gcacctgcgg cacgtcaatg
cttccgggcg tcgcgctcgg gctgatcgcc catcccgtta 47040ctgccccgat cccggcaatg
gcaaggactg ccagcgctgc catttttggg gtgaggccgt 47100tcgcggccga ggggcgcagc
ccctgggggg atgggaggcc cgcgttagcg ggccgggagg 47160gttcgagaag ggggggcacc
ccccttcggc gtgcgcggtc acgcgcacag ggcgcagccc 47220tggttaaaaa caaggtttat
aaatattggt ttaaaagcag gttaaaagac aggttagcgg 47280tggccgaaaa acgggcggaa
acccttgcaa atgctggatt ttctgcctgt ggacagcccc 47340tcaaatgtca ataggtgcgc
ccctcatctg tcagcactct gcccctcaag tgtcaaggat 47400cgcgcccctc atctgtcagt
agtcgcgccc ctcaagtgtc aataccgcag ggcacttatc 47460cccaggcttg tccacatcat
ctgtgggaaa ctcgcgtaaa atcaggcgtt ttcgccgatt 47520tgcgaggctg gccagctcca
cgtcgccggc cgaaatcgag cctgcccctc atctgtcaac 47580gccgcgccgg gtgagtcggc
ccctcaagtg tcaacgtccg cccctcatct gtcagtgagg 47640gccaagtttt ccgcgaggta
tccacaacgc cggcggccgc ggtgtctcgc acacggcttc 47700gacggcgttt ctggcgcgtt
tgcagggcca tagacggccg ccagcccagc ggcgagggca 47760accagcccgg tgagcgtcgg
aaaggcgctg gaagccccgt agcgacgcgg agaggggcga 47820gacaagccaa gggcgcaggc
tcgatgcgca gcacgacata gccggttctc gcaaggacga 47880gaatttccct gcggtgcccc
tcaagtgtca atgaaagttt ccaacgcgag ccattcgcga 47940gagccttgag tccacgctag
atgagagctt tgttgtaggt ggaccagttg gtgattttga 48000acttttgctt tgccacggaa
cggtctgcgt tgtcgggaag atgcgtgatc tgatccttca 48060actcagcaaa agttcgattt
attcaacaaa gccacgttgt gtctcaaaat ctctgatgtt 48120acattgcaca agataaaaat
atatcatcat gaacaataaa actgtctgct tacataaaca 48180gtaatacaag gggtgttatg
agccatattc aacgggaaac gtcttgctcg ac 48232151055DNAartificial
sequenceZMCAS1HINDIIIPRO fragment comprising 1.0 KB ZmCAS1 promoter
15aagctttttg gaaggctaag gagaggaagc cggcgagaag gagggggcgt tttacgtgtc
60actgtcctgt cgtgttggct gttgacacga atcatttctt ccgcgcgtgg gaagaagaag
120atgcacatta gcggcctgaa gtagagatgt caatggggaa ttccccagcg gggattaact
180ccccagaccc gtacccatga acatagaccg gcccccatcc ccgaacccga acccgacctc
240gggtacgaaa atcctcccat acccattccc gaccgggtac taaataccca tgggtatcca
300tacccgaccc gattattcaa aaattaatgg gctttttatt tgttaaccgg cggacgcaat
360gcttgggact ctaggttttt ttactttgtt gaccggctgg cggctgggct ttttcctaca
420ggcccaaagt tggtcggcag ccactaggcc acacgtcaca ggcagcccac aagtaaatgt
480cgttggattg ctggatggtg gaataaaaat cctagatgct agattgttct ggttccgggt
540atttttctcc atggctaatc gggtttgggt ttagccctcc caaacccgaa cccgccatac
600ccgatgggta agggatttat tccaaatcta tacccatggg gatttgtttt aacccatacc
660ttaaccctaa tagaggaatt ccccacgggt aatcgggttt cggggcccat tgacatctct
720agactgaagg cgtccaactc aaatcattaa aaagtgttga cgcacgcgct gatgcgccgg
780ccgcacagca caggctgcac agcccgttta atcagcgatg gagccccggc cgtcagccag
840ccaggtccgg cgtccgggtc tgcgccctgc ggcgtcactg ctgtcgccac cgtctccgat
900ggtcccacat ccatccagcg ggccgcgcgt ggtacaaaag gctcttcctc gccgtcaggt
960gcagctgccc aaacaccaga cacagactcc accaccccgc ttcgatcttc tgttgcagct
1020gaaatctgtc agattctgca gttcattcct catga
1055161752DNAartificial sequenceZMCAS1BAMPRO fragment comprising 1.7 KB
ZmCAS1 promoter 16ggatccgggg cggaagatgg cagggacgcg gattcagggc
ggacgcgctt gccgagggcg 60cgggggacca cagcgtgcgt tacggggaca gggcgggcat
cgcgaggacg ggtgcgggag 120cggagccaca tctggtggtg gacgcctact ttgctctctt
atagagtagt aaagattcgt 180ggaccaaaca acaccctagc ttgtacaaat attcttaggc
agttgctact gatgagagaa 240aaataacatc actccactgc atttgcgtga tttattgaac
agatcacaat tacatctatt 300caaatttatt tacctgtacg tgtccgattt ttaggggagg
atttttttac ggtatttttt 360ttttaaaaaa ataaatttag gcaacaattt tatagaatcg
agtgctttat ctattatctt 420ttacaaggca cacgcgtaca ataaggtttg gtcgttcgtg
acttggatag tggttttggt 480tgcaattccg taattcttgg cataggatac agcccaaccc
agaaaaaaat aatgttgcgg 540tcagttctgg ctttgagatt cggagtacca cgtggcgtaa
aggcaggccg tgtcttacag 600atgaataaag gacctgggtc tcacgtgatt ggtttccagt
ttcgtgcatc aagatgtgga 660attttcaaac tgccgtcgtg tttgtttcgt cacataaaag
ctttttggaa ggctaaggag 720aggaagccgg cgagaaggag ggggcgtttt acgtgtcact
gtcctgtcgt gttggctgtt 780gacacgaatc atttcttccg cgcgtgggaa gaagaagatg
cacattagcg gcctgaagta 840gagatgtcaa tggggaattc cccagcgggg attaactccc
cagacccgta cccatgaaca 900tagaccggcc cccatccccg aacccgaacc cgacctcggg
tacgaaaatc ctcccatacc 960cattcccgac cgggtactaa atacccatgg gtatccatac
ccgacccgat tattcaaaaa 1020ttaatgggct ttttatttgt taaccggcgg acgcaatgct
tgggactcta ggttttttta 1080ctttgttgac cggctggcgg ctgggctttt tcctacaggc
ccaaagttgg tcggcagcca 1140ctaggccaca cgtcacaggc agcccacaag taaatgtcgt
tggattgctg gatggtggaa 1200taaaaatcct agatgctaga ttgttctggt tccgggtatt
tttctccatg gctaatcggg 1260tttgggttta gccctcccaa acccgaaccc gccatacccg
atgggtaagg gatttattcc 1320aaatctatac ccatggggat ttgttttaac ccatacctta
accctaatag aggaattccc 1380cacgggtaat cgggtttcgg ggcccattga catctctaga
ctgaaggcgt ccaactcaaa 1440tcattaaaaa gtgttgacgc acgcgctgat gcgccggccg
cacagcacag gctgcacagc 1500ccgtttaatc agcgatggag ccccggccgt cagccagcca
ggtccggcgt ccgggtctgc 1560gccctgcggc gtcactgctg tcgccaccgt ctccgatggt
cccacatcca tccagcgggc 1620cgcgcgtggt acaaaaggct cttcctcgcc gtcaggtgca
gctgcccaaa caccagacac 1680agactccacc accccgcttc gatcttctgt tgcagctgaa
atctgtcaga ttctgcagtt 1740cattcctcat ga
175217354PRTOryza sativa 17Met Ala Ala Glu Cys Gly
Ser Gly Asn Cys Asp Ala Trp Ala Ala Arg 1 5
10 15 Asp Pro Ser Gly Ile Leu Ser Pro Tyr Lys Phe
Asn Arg Arg Ala Val 20 25
30 Gln Ser Asp Asp Val Ser Leu Arg Ile Thr His Cys Gly Val Cys
Tyr 35 40 45 Ala
Asp Val Ala Trp Thr Arg Asn Ile Leu Asn Asn Ser Met Tyr Pro 50
55 60 Leu Val Pro Gly His Glu
Ile Ala Gly Val Val Thr Glu Val Gly Ala 65 70
75 80 Asp Val Lys Ser Phe Lys Val Gly Asp His Val
Gly Val Gly Thr Tyr 85 90
95 Val Asn Ser Cys Arg Asp Cys Glu Asn Cys Asn Ser Ser Leu Glu Asn
100 105 110 Tyr Cys
Ser Gln His Val Phe Thr Phe Asn Gly Val Asp Thr Asp Gly 115
120 125 Thr Val Thr Lys Gly Gly Tyr
Ser Thr His Ile Val Val His Glu Arg 130 135
140 Tyr Cys Phe Lys Ile Pro Asp Gly Tyr Pro Leu Glu
Lys Ala Ala Pro 145 150 155
160 Leu Leu Cys Ala Gly Ile Thr Val Tyr Ser Pro Met Met Arg His Asn
165 170 175 Met Asn Gln
Pro Gly Lys Ser Leu Gly Val Ile Gly Leu Gly Gly Leu 180
185 190 Gly His Met Ala Val Lys Phe Gly
Lys Ala Phe Gly Leu Lys Val Thr 195 200
205 Val Ile Ser Thr Ser Glu Ser Lys Arg Lys Glu Ala Ile
Asp Leu Leu 210 215 220
Gly Ala Asp Asn Phe Val Val Ser Ser Asp Glu Asn Gln Met Glu Thr 225
230 235 240 Leu Lys Ser Ser
Leu Asn Phe Ile Ile Asp Thr Ala Ser Gly Asp His 245
250 255 Pro Phe Asp Pro Tyr Leu Thr Leu Leu
Lys Val Gly Gly Val Met Ala 260 265
270 Leu Leu Ser Phe Pro Ser Glu Ile Lys Val His Pro Ala Asn
Leu Asn 275 280 285
Leu Gly Gly Arg Ser Leu Ser Gly Ser Val Thr Gly Gly Thr Lys Asp 290
295 300 Ile Gln Glu Met Ile
Asn Phe Cys Ala Ala Asn Lys Ile Tyr Pro Asp 305 310
315 320 Ile Glu Met Ile Lys Ile Asp Tyr Ile Asn
Glu Ala Leu Gln Arg Leu 325 330
335 Val Asp Arg Asp Val Arg Phe Arg Phe Val Ile Asp Ile Glu Asn
Ser 340 345 350 Phe
Lys 186254DNAOryza sativa 18cagtcacaaa ttttttggta attccgagaa aatttccgga
aactccgaga acatttccag 60aaagtttccg accatttccg agttccgacg gaaactgccc
ttatcatttc cgattccgtt 120tccgagaaaa tattttcgaa ttcgtttccg tttccgaaaa
atccgaaaaa attctgaccg 180acagattccg tttccgaaaa taggtccgga atccggaaag
tttccgtacc gttttcaccc 240ctaactggga caaaatgatt attttagtct aacatacatt
gttaagtata ttaaagtcta 300cggtctatct cacagcttac aaatggtgag atatgatatc
ttttaacatt atttcataaa 360acagtaaata agatgcccgg catcaagggg tgaaaacgat
attgaaattt cccgaccata 420ccgataccgt tttcggaaat ctaatataat aataatattt
ttaccaatgg ctaccatttt 480caaaatttag tttttttcta atttctatca gcgtggcatt
gaagttaaca ctaaatatat 540gaattcgtaa acattaaaat tttccgaccg tttcggttcg
ttttcgaaaa aataataaaa 600taatgatatc gtttccgacc atttctgata gttttcatcc
ctaacggcat ccacgtgggc 660tttcttccta gttgagtaaa aagtggagta ttgagagttt
acacgagaga attattagtg 720agattaagat ggaaatttga tttttagtcc taatatcttg
tttagtaaat atggcggaaa 780ttgacaggga gtttcgttgg agattatgtc attaataaaa
acggatggct gagatttgtt 840tatgcaaatt aaaggaggaa ttccccctat ccacgtattg
aaagaggtgt tccttactca 900attctccatt cccatctcat taaaattatc gtgttttcca
actaaacaaa acaaacaaaa 960cagataactt atccatctaa agttctaaac acctaatcac
tcctcaaaaa ctacctccaa 1020ccaaaacagg gccgtaaagg aaaatcttcc gtgtcgtcgt
ctcctccgct ccttgcgcca 1080agacgagtcg cggctcaaca gcggagaggc ggcgatctcc
catctggcga gcagagcagg 1140ggaggggaag ggatcttggt gagcatccac atcctcttcc
tgactcatct ctctcccacc 1200gggagtactt ctgtctggaa tttgcttgcg ttaaccctag
cttctgttct aggtttggaa 1260gaagctcttc tcttaatttc agagccttaa tcttaataca
agtgacagtt tgtttgttcc 1320ccaaaaagct gaatccgccc ctgtccagtg gtacgaattc
agtttctgta gctgccagaa 1380gaagtaaatt aaatttcatg ttataccatg ttctgaaact
ctcaagaatg ttgatggaag 1440ttgatggggt tctagtgtat aaaactggat cagtatttct
ggttattgat tgggtttcac 1500aaactgtgga agtataaatt ggatattata gagtacaagg
agcagagatg gggcctaaca 1560catgacacgt gcgttacaaa ggagaactgg agaagaggtg
gattccttct caaattcccc 1620gcccacagat tcccatctcc aacctccatc tccgtctcct
cagagatcca catagcgact 1680cccgatccag actaagcttg caaaactccc ttgccccagc
agcgtcgaca gtgtggtgct 1740gaccggcggc ttctgatcaa aatagtcagt tggattggtc
ctgtcctgtc ctgtcctggt 1800cctgaacctc taatctcttc atttcaaaac gaatagatga
ctggattcat tttcctgcat 1860tttaaactac tggtttttct ttttgttgtt tttgacattg
ttcattgctg atataagtat 1920caggggtata agattatgat ggatttatgg cattatgggt
tcatctgagt tgattttttt 1980ctatctgcag ttagagtggc atggctgctg aatgtggaag
tggcaactgt gatgcttggg 2040cagcgagaga tccttcaggg atcctctccc cgtacaagtt
caaccgcagg ttaaatcaca 2100ctacactccc ttgtgtattg attttccttt ttattttctt
gatatgaatg tctttccagc 2160actgttaaga aatcttcctt gtttccaagt gttttctcta
actgcagaat gcactattct 2220ggcaactaga atcatcacga ttgcagtttc agttagctgt
gtgcttatct ggaacaagaa 2280tgtgaacttg tcaagttacc tgttttgaat tttctaactt
ttgcgcaaca tgataaactg 2340gattacaaaa tctctatagt actttgctga atgttgtgat
taactactca cagacaagat 2400tttaaaatga tcttttccat ttttagtacc tattcttatg
aattgttaat aacctggttt 2460attattacca gggctgtaca aagtgacgat gtttccttga
ggatcacaca ctgtggtgtt 2520tgttatgctg atgttgcatg gacaaggaat atactcaaca
attcgatgta ccctttagtc 2580cctgggtaaa attattaact ttgttacttg aattttaaca
aaagtatgga aaagaataaa 2640cttctgttac agactataaa ttcatccatt tagctactgt
atttgttgta aagataaaag 2700ttcatgaaat tgtgtctaaa tttcaatatg agtaggcatg
agatagcagg agttgtaact 2760gaggttggtg cagacgtcaa gagcttcaaa gtgggtgacc
atgtaggtgt tggcacatac 2820gtgaattcat gccgggactg tgagaactgc aatagctctc
tagagaacta ctgctcacaa 2880catgtcttca ctttcaatgg tgttgacact gatgggactg
tcacaaaggg aggatattct 2940actcacatag tagtacatga gaggtatgga ttttgaccat
gttttcttct gaaagttttc 3000tgacaagaca acgaaaaatt tcatatactt atttctaatg
gttgctcaac aatgttgtca 3060caaaatcatc ctacattctg ctatgtaaat tttatatagg
aaacaaatcc catgatgttc 3120tgctaatgtc tctcttatga atatagcagc tattagttat
tcctgtttac ttaattcaaa 3180aaagaaccta tgagaaaatt gcatcttcct ttggaatggc
aggtattgct ttaaaatacc 3240tgatggctac cctttggaaa aggcagcacc tttactttgt
gctggcatca ctgtatatag 3300tccgatgatg cggcataata tgaaccagcc agggaagtca
ctcggcgtca ttggacttgg 3360tggcttgggt cacatggcag taaaatttgg gaaagccttt
ggactgaaag tcacagttat 3420tagtactagt gaatcaaaga gaaaagaagc tattgacctt
cttggtgcag ataatttcgt 3480ggtgtcatcg gatgaaaatc agatggaggt aatacacatt
ctacattatg ttttacccca 3540ttgttcacag ttatctacta tgaccatacc atgagtctta
tcagtcaaat gccatttggt 3600aaagatagct aacctgtact tctcatttat cttacgtact
gaaagtggta atgacagatt 3660cttcattgtt gcagaccttg aaaagctctc tgaacttcat
tattgacaca gcctccggcg 3720atcacccatt cgatccttat ctcacgcttc tgaaagttgg
tggtgtaatg gcactactta 3780gcttcccaag tgaaatcaaa gtgcatcctg caaaccttaa
tctcggtaat gcacgtttct 3840cataccaaaa tatatatggc tttttccaga gaggaaatat
atctgtctgc taggatgggg 3900agaaaattaa ttaattacaa ctaatctgga ctcatgtagg
tgggcggagt ttatctggta 3960gtgtaactgg aggtacgaag gacatccagg agatgataaa
cttctgtgcg gcaaacaaaa 4020tctacccaga tatcgagatg atcaagatag attacatcaa
cgaggctctt cagaggcttg 4080ttgaccggga tgtcagattt cgctttgtaa tcgacattga
gaactcgttc aagtagtatc 4140ttgatatctt cagtacacca ttaatgatag aagtgtatgc
aataataata agtttaaatg 4200tcctacaaga tgatgagacc atgagacagt tccagaacaa
aatgttccat tttaagaagc 4260aatttggttc tgttttcagt tattttcagg agtaaattgt
attggtggta cacaaactta 4320tttggtgggt gtaatttaat acataaactt gtttgttggg
tgtaatttag tatataaatt 4380tgtttgttgg gtgtagttta gtacataaac taaacttgtg
aagtacttat tttggtacat 4440gaacttatct aattcgtatg aatcacaatc aaaatggctt
ggcaaagttg atccttcaaa 4500gttgattttg attaggatat agcatgcaca agtggtgagt
acgcataaga tatgctatat 4560ttataaactt tgtttctact tttcatcatc attaaaatga
tgaatttcct atgtttctat 4620ctcttgttaa atggttttca tatatttttc tataatcatt
atgtctcgat ctatcttttt 4680attgaacata ctagagaaaa tgcctgtgcg ttgcaacggg
tgaaaatgtt tgtggatcgc 4740tatatattat acttctttta ggttttgggt cgaaagctca
tcccatcggc cccaacccag 4800gctttgagcc gaaagctcat ctcattgagc ccaacccaat
ccaatatttc cctttgggcc 4860tcatcgggcc gaaagctcat ctcatcggtc cctacccagg
ctttgggccg aaagctcatc 4920tcatcggtcc ctacctaggc tttgggccga aagctcatct
catcgggccc aacccagtcc 4980gatatttccc tttgctcgac ttctctcttc ggctgatatt
tccctttgtt cgacttcggt 5040tcaaacctgg agctactggc ccgactattt ccatttgttc
gacttcggtt caaacctgga 5100gctgcaggcc cgactacgtc catcaatacc catctcttcc
ttctttttca acccaacccg 5160cgatattttc gctgactatt gtccatcaat acccatctct
tccttctttt ttaagcacaa 5220cccagtccgt ctcattcctt cctttttcaa cccaatctag
tccgatattt ccttttgctc 5280gacctttttc agccgaaccc agttcgatat tttcctctac
tcaacctttc tcttcctccc 5340ctccaggccc gactacgtcc atcaataccc atctcttcct
tctttttcaa cccaacccgc 5400gatattttcc ccgactattg tccatcaata cccatctctt
ccttcttttt taagcacaac 5460ccagtccgtc tcatcccttc ctttttcaac ccaatctagt
ccgatattaa tacccatctc 5520ttccttcttt ttcaacccaa cccgcgatat tttccccgac
tattgtccat caatacccat 5580ctcttccttc ttttttaagc acaacccagt ccgtctcatc
ccttcctttt tcaacccaat 5640atagtccgat atttcctttt gctcgacctt tttcagccga
acccagttcc atattttcct 5700ctactcaacc tttctcttcc tcccatcgtt aacccagtcc
gatactttcc tctgctcgac 5760ctttctcttc ctcccatcat tcgatcgagc ccatctctcc
ctttattttc agtccaatcc 5820agtccgatat ttctgcttgc tcaacctttt tcagcccaat
ccagtccaat atttccgtct 5880gcttgacctg tttcttgctc ccatcatttg atcaagccca
tctattactt tattttcagc 5940ccaacccagt ccgatatttc cctctactcg actattttca
acccagttcg atatgtccct 6000ttgctcagac ctttttcagc ccaacccagt tcactatctc
tcttcggctc aaaccacaag 6060tggagctaca ggtctagcga ctgtcatctt caacctacga
aagacattcc cgtgcacata 6120cgatgtcgaa gtcgttcctc acctcaataa ttccatgtct
taccgtgcac atgcgatgtt 6180ggagccgatc ctcccatcaa ttacgataaa atatttcttg
atctcccaaa attggtctct 6240cttcggctca aacc
6254192000DNAOryza sativa 19cagtcacaaa ttttttggta
attccgagaa aatttccgga aactccgaga acatttccag 60aaagtttccg accatttccg
agttccgacg gaaactgccc ttatcatttc cgattccgtt 120tccgagaaaa tattttcgaa
ttcgtttccg tttccgaaaa atccgaaaaa attctgaccg 180acagattccg tttccgaaaa
taggtccgga atccggaaag tttccgtacc gttttcaccc 240ctaactggga caaaatgatt
attttagtct aacatacatt gttaagtata ttaaagtcta 300cggtctatct cacagcttac
aaatggtgag atatgatatc ttttaacatt atttcataaa 360acagtaaata agatgcccgg
catcaagggg tgaaaacgat attgaaattt cccgaccata 420ccgataccgt tttcggaaat
ctaatataat aataatattt ttaccaatgg ctaccatttt 480caaaatttag tttttttcta
atttctatca gcgtggcatt gaagttaaca ctaaatatat 540gaattcgtaa acattaaaat
tttccgaccg tttcggttcg ttttcgaaaa aataataaaa 600taatgatatc gtttccgacc
atttctgata gttttcatcc ctaacggcat ccacgtgggc 660tttcttccta gttgagtaaa
aagtggagta ttgagagttt acacgagaga attattagtg 720agattaagat ggaaatttga
tttttagtcc taatatcttg tttagtaaat atggcggaaa 780ttgacaggga gtttcgttgg
agattatgtc attaataaaa acggatggct gagatttgtt 840tatgcaaatt aaaggaggaa
ttccccctat ccacgtattg aaagaggtgt tccttactca 900attctccatt cccatctcat
taaaattatc gtgttttcca actaaacaaa acaaacaaaa 960cagataactt atccatctaa
agttctaaac acctaatcac tcctcaaaaa ctacctccaa 1020ccaaaacagg gccgtaaagg
aaaatcttcc gtgtcgtcgt ctcctccgct ccttgcgcca 1080agacgagtcg cggctcaaca
gcggagaggc ggcgatctcc catctggcga gcagagcagg 1140ggaggggaag ggatcttggt
gagcatccac atcctcttcc tgactcatct ctctcccacc 1200gggagtactt ctgtctggaa
tttgcttgcg ttaaccctag cttctgttct aggtttggaa 1260gaagctcttc tcttaatttc
agagccttaa tcttaataca agtgacagtt tgtttgttcc 1320ccaaaaagct gaatccgccc
ctgtccagtg gtacgaattc agtttctgta gctgccagaa 1380gaagtaaatt aaatttcatg
ttataccatg ttctgaaact ctcaagaatg ttgatggaag 1440ttgatggggt tctagtgtat
aaaactggat cagtatttct ggttattgat tgggtttcac 1500aaactgtgga agtataaatt
ggatattata gagtacaagg agcagagatg gggcctaaca 1560catgacacgt gcgttacaaa
ggagaactgg agaagaggtg gattccttct caaattcccc 1620gcccacagat tcccatctcc
aacctccatc tccgtctcct cagagatcca catagcgact 1680cccgatccag actaagcttg
caaaactccc ttgccccagc agcgtcgaca gtgtggtgct 1740gaccggcggc ttctgatcaa
aatagtcagt tggattggtc ctgtcctgtc ctgtcctggt 1800cctgaacctc taatctcttc
atttcaaaac gaatagatga ctggattcat tttcctgcat 1860tttaaactac tggtttttct
ttttgttgtt tttgacattg ttcattgctg atataagtat 1920caggggtata agattatgat
ggatttatgg cattatgggt tcatctgagt tgattttttt 1980ctatctgcag ttagagtggc
200020354PRTSorghum bicolor
20Met Ala Ala Glu Ser Glu His Gly Asn Cys Asn Ala Trp Ala Ala Arg 1
5 10 15 Asp Pro Ser Gly
Val Leu Ser Pro Tyr Ser Phe Asn Arg Arg Pro Val 20
25 30 Gln Ser Ser Asp Val Ala Leu Lys Ile
Leu Tyr Cys Gly Val Cys Tyr 35 40
45 Ala Asp Val Val Trp Thr Arg Asn Met His His Asp Ser Lys
Tyr Pro 50 55 60
Val Val Pro Gly His Glu Ile Ala Gly Val Val Thr Gln Val Gly Ala 65
70 75 80 Asp Val Lys Gly Phe
Lys Val Gly Asp His Val Gly Val Gly Thr Tyr 85
90 95 Val Asn Ser Cys Arg Asp Cys Glu Asn Cys
Asn Ser Ser Leu Glu Asn 100 105
110 His Cys Pro Lys Gly Val Tyr Thr Phe Asn Gly Ile Asp Thr Asp
Gly 115 120 125 Thr
Val Thr Lys Gly Gly Tyr Ser Thr His Ile Val Val His Glu Arg 130
135 140 Tyr Cys Phe Gln Ile Pro
Asp Gly Tyr Pro Leu Ala Lys Ala Ala Pro 145 150
155 160 Leu Leu Cys Ala Gly Ile Thr Val Tyr Thr Pro
Met Met Arg His Asn 165 170
175 Met Asn Gln Pro Gly Lys Ser Leu Gly Val Ile Gly Leu Gly Gly Leu
180 185 190 Gly His
Met Ala Val Lys Phe Gly Lys Ala Phe Gly Leu Lys Val Thr 195
200 205 Val Leu Ser Thr Ser Glu Ser
Lys Arg His Glu Ala Ile Ser Leu Leu 210 215
220 Gly Ala Asp Asn Phe Val Ile Ser Ser Asp Thr Gln
Gln Met Glu Ser 225 230 235
240 Leu Arg Asn Ser Leu His Phe Ile Val Asp Thr Ala Ser Gly Asp His
245 250 255 Pro Phe Asp
Pro Tyr Leu Ser Leu Leu Met Val Gly Gly Val Met Ala 260
265 270 Ile Val Gly Phe Pro Ser Glu Ile
Lys Met His Pro Ala Ser Leu Asn 275 280
285 Leu Gly Ala Arg Thr Leu Ser Gly Ser Val Thr Gly Gly
Thr Lys Asp 290 295 300
Ile Gln Glu Met Val Asn Phe Cys Ala Ala Asn Lys Ile Ser Pro Glu 305
310 315 320 Ile Glu Ile Ile
Lys Ile Asp Tyr Ile Asn Glu Ala Leu Thr Arg Leu 325
330 335 Val Asn Arg Asp Val Lys Tyr Arg Phe
Val Ile Asp Ile Glu Asn Ser 340 345
350 Phe Lys 215242DNASorghum bicolor 21aaaagtcaaa
cgtcttataa tttgggatgg agtattaaac atgcataaac ctaggtaaat 60taatatttca
atcatacatg gtccaatgag tcttaatttt ttagcatagc taaagcatac 120aattagtagc
ctaccataaa agttttacca caatcaaagc acaaagagct acaattatga 180attaaacaca
aaaacacata tatctacagc tacaacaaaa atgttaaact aaagcatgtc 240atattatata
tctattatat agagcttgtc atgtggagtc taacaaaact gaattttcgt 300ttttacaaat
tttctatgat tttatagaat ttttttaaag ttttagtcag tttatgaaat 360aaaaaagagc
atgtgggcca catcagcaac cacattggca ggagagtcct tcgtgattgg 420atggtggcta
ggttctagac aaagtttatg gatctagatg agcgaaaaaa gtttatggac 480atatgtgaga
agtggagaaa agtttaagaa ccctcaacac atttgactca agaaagaata 540acataaaacc
aaccacacac taaatttaag gttgaaatcc aaatgatatg atatatgatg 600aaatacaaac
attcatgaga tttttataaa agaataaaga catagtaaat aaagcaaaga 660atatataaaa
aataagtata tagttatctg ttgacaaatt ttatatagta tattttataa 720atacgaacta
gtagatagtt ctagccaatc ctaaaacata atcaaacaca tagaaatcta 780acatatttaa
ggggtgtttg gttggaggtg ttaaagttta atatgtatta taacattttc 840gttttatttg
acaattagtg tctaatcatt gactaactag gcttaaaaga tttgtctggc 900aaattacttt
ctagttatgt ttttagtttc ataaataatc tatatttagt actttatgca 960tatgtccaaa
cattcgatgt gacgagagtt aaagtttaac tatggaaacc aaggccctta 1020ctaaagaaat
ctgttaacag atcaataaca gaagtcatat aaaaatcaaa tatcagaaaa 1080taatgatcta
tctatattat gaaaaaaaca agcgttcatg agtttgagtt ttttttaccg 1140tgtttgttat
ttactaaaag aaaactacaa gtacttttat ctgttctatt tacaaggcac 1200gggtctaatt
ataagtttct tttacctttg aaattcttaa tattccgacg gtagtattag 1260tcgtatgata
caatactccc attcggccat tccattagct gaggccttgt ttagttccta 1320aaaagttttg
ccaaattttc agatttttcg tcacatcaaa ttttacggca tatgcatgaa 1380acattaaata
tagataaaaa gaataactaa ttacatagtt taactgtaat ttgtgagacg 1440aatttttaaa
atctaattaa tctataatta aataatattt gtcaaataca aacgtggtat 1500aatgcctatt
ttactaattt ttttgaaact acttcctccg tcccataaat aattgcattt 1560ctttaacttc
tatagtttat gtttgaccgt tcgtcttatt aaaaaatttt taataaatat 1620tatttatttt
ttcatgactt attttattgt tagatatatt ttttatgata atttatttat 1680tttattattt
gtacaaaaat ttaaataaaa taaatggtca aacattacta ttagcagtcg 1740aagaaatgca
tttatttatg ggacgaagag agtaaacaag gcctgaagct gaagcctgaa 1800ggaagaaagg
gaaaggggat tgacgcatgc actgatggtt cggccaggca caatcagcga 1860tgaactcatg
gctgacgaga agccccgtaa ctccggaagc caggtccgcg acgtcactgc 1920tgtcggcacc
ctctccaatc caatccatcc acccgtcaac ggagcatgat ccggtctgcg 1980cccttccaac
ttcttcccca ttataatttg ctcactcgcc ggcggccgct acctgctgca 2040acccggcgac
acccgaccga cagcctgctt gccacgcttg catcgaggag agttcgggga 2100ccgcccggtg
cagaaggaat cgacgcaggg gtgagtcttt tcccctcccc tcccatgtac 2160ttcccgcctc
gattcgactc caagggtcag cgatttcact gcgcttttcc ttagcatagg 2220ctatctgaaa
tttccttctg ttagagccga gtctttcgat ctgctgactt gtcacgaatt 2280cattctgcag
ttcagttcga atggctgctg aatcagagca cggcaactgc aatgcttggg 2340cagcgagaga
tccttctgga gttctctcac catacagctt taaccgcagg tggcttccca 2400ccccccccca
cccccccccc cccccccccc cgcgtctcct gatctggaga aaagttgcgt 2460catgtttagc
actctgaata ctgtattctt gctatgctag actgtttctt tgggatagct 2520atggaccgta
ttgtttccat gtctttctgt gtacacgcta tatgtttggt tttcagtgtg 2580tttcatacct
gttatcaaat gagcaatatc ttggtgtgga gcaagtagtg cagctatttc 2640tgctcgtggt
ctgtcaaaca aaatcatgaa attgatgctt actatgggca ttgggtgaac 2700attttaggtc
tgatgggatt gcagaatttt gttcaacgtt ggcaggatct gccttttatt 2760attaaaaaaa
taagggaagc gacctaagtc aacaactata cagattgatt gcagaaattt 2820gagtacaaat
agcaacgtac ttgtcttttt gttcagctag cttgtctgtt tcatttttta 2880taacgtcttc
ttatgagagc tactatgtgt aataattttc agaccagtgc aaagcagcga 2940tgttgcgttg
aagatcttat actgtggagt ctgttatgct gacgttgtct ggacacggaa 3000tatgcaccat
gattcaaaat atcctgtggt tcctgggtaa gcgtctgaca acaatttaga 3060cttgtatcag
ggaagggcat gtttcttatc aatcgcgaat tgcatttcaa gatccaactc 3120tgaattgacc
aggactagta gttttactag gtgttctaat ctaaggggca gaatatgact 3180aatagttaca
gactaacagc ccagatcaat acattgaacc cctctagagc aaacaaaaca 3240tcatagtcag
gagtatcgat taagtgtttg cccattgtcc aaaaatgatg tctgtgtact 3300atatatttgt
caatgatatt attcattggt gcaggcatga gatagctgga gtcgtaactc 3360aggttggtgc
agatgtcaaa ggctttaaag tgggtgacca tgtaggtgtt ggaacttatg 3420tgaactcatg
ccgagattgt gagaactgca atagctctct agagaaccac tgtccaaaag 3480gagtttatac
tttcaatggc attgatacag atggaactgt cacaaagggg ggttactcca 3540ctcacattgt
agtccatgaa aggtatggca gtactgtttt acttggctgt tcaaacaaga 3600ctctttttat
attgacaaaa gctattttag ggatgaaaac aggttccact gtgatttcat 3660ttgcctattt
aatcacactg tgtcgtgttc aagatgttat atacttctgt atgaaagtaa 3720actctgggtc
attgctgtcc ataaattttc atcacaatca atacatgatc aaagaattat 3780atatgggtct
atataagtca ttattgacac cttgatgctg attcgataca acctgtcaga 3840taagggatgg
catcttctgt tgattaattt gaactttagg tattgcttcc aaatacctga 3900tggctatcct
ttggcgaagg cagcacctct tctgtgtgct ggaataactg tgtatactcc 3960aatgatgcga
cacaacatga accaacctgg aaagtcactt ggggtcattg gactcggtgg 4020tctaggtcac
atggcagtga aatttggtaa agcatttggt ctgaaggtca cagttttgag 4080tacaagtgaa
tcaaagagac atgaagccat cagcctcctt ggagcggata attttgttat 4140atcatcagat
acacagcaga tggaggtact gctctggaca ttgcgtaatc tgaacgtacc 4200taggcacctc
gtacttcatc tgttccgaat atgagaaatt ttatttttgt cctaagtcaa 4260aaaactctat
aattgaccta gtttatagaa aaaagcacta gtatctacaa gaccaaattt 4320attccattga
atctcctgta gaatgtgttt ggtattgtag atgtcaatat atttttctat 4380aaacttagtc
aaagctagac aaaattgact taggataaaa ctaaaacatc ttacatttga 4440aagggagaga
atataccacg tttcccttct tgattatatg acaccatatt ttggtggggt 4500tctttttgtt
tctttatttt attgccaaag atgcaagcac atcacattgt tttcacatgg 4560aacgaacaga
tagcagatca ctttcgttct cattaatttg ctacttgact ttaaagttcc 4620aggcagccag
ttcagtatat gacattggat agttttctta tgctgagaat aaagataaca 4680gctaagtact
ggctgttctg ctttacctga aaactgaccg tgaatatatg atctcttgct 4740tttgcagtcc
ctgagaaact ccctgcactt catagttgac accgcttctg gcgaccatcc 4800atttgatcca
tatctctctc tccttatggt tggtggtgtg atggcaattg tgggctttcc 4860aagtgagatc
aaaatgcatc ctgcaagcct taatcttggt aattctctgt tctgacactc 4920tgtactaaaa
agaaacgtgg tattagaagc caaattattc agatttaacc tgaactgttt 4980tttaggtgca
cggaccttgt ctggtagtgt tactggaggt acaaaagaca tccaagaaat 5040ggttaacttc
tgtgcggcga acaaaatctc tccagagatt gagatcatta agatagatta 5100tatcaatgag
gctctcacga ggcttgttaa ccgagatgtg aaataccgct ttgttatcga 5160catcgagaac
tctttcaagt aacatgctgg tctacatgct ctcagtcttt ctattaatat 5220actagagcaa
acacaaatgt ga
5242222300DNASorghum bicolor 22aaaagtcaaa cgtcttataa tttgggatgg
agtattaaac atgcataaac ctaggtaaat 60taatatttca atcatacatg gtccaatgag
tcttaatttt ttagcatagc taaagcatac 120aattagtagc ctaccataaa agttttacca
caatcaaagc acaaagagct acaattatga 180attaaacaca aaaacacata tatctacagc
tacaacaaaa atgttaaact aaagcatgtc 240atattatata tctattatat agagcttgtc
atgtggagtc taacaaaact gaattttcgt 300ttttacaaat tttctatgat tttatagaat
ttttttaaag ttttagtcag tttatgaaat 360aaaaaagagc atgtgggcca catcagcaac
cacattggca ggagagtcct tcgtgattgg 420atggtggcta ggttctagac aaagtttatg
gatctagatg agcgaaaaaa gtttatggac 480atatgtgaga agtggagaaa agtttaagaa
ccctcaacac atttgactca agaaagaata 540acataaaacc aaccacacac taaatttaag
gttgaaatcc aaatgatatg atatatgatg 600aaatacaaac attcatgaga tttttataaa
agaataaaga catagtaaat aaagcaaaga 660atatataaaa aataagtata tagttatctg
ttgacaaatt ttatatagta tattttataa 720atacgaacta gtagatagtt ctagccaatc
ctaaaacata atcaaacaca tagaaatcta 780acatatttaa ggggtgtttg gttggaggtg
ttaaagttta atatgtatta taacattttc 840gttttatttg acaattagtg tctaatcatt
gactaactag gcttaaaaga tttgtctggc 900aaattacttt ctagttatgt ttttagtttc
ataaataatc tatatttagt actttatgca 960tatgtccaaa cattcgatgt gacgagagtt
aaagtttaac tatggaaacc aaggccctta 1020ctaaagaaat ctgttaacag atcaataaca
gaagtcatat aaaaatcaaa tatcagaaaa 1080taatgatcta tctatattat gaaaaaaaca
agcgttcatg agtttgagtt ttttttaccg 1140tgtttgttat ttactaaaag aaaactacaa
gtacttttat ctgttctatt tacaaggcac 1200gggtctaatt ataagtttct tttacctttg
aaattcttaa tattccgacg gtagtattag 1260tcgtatgata caatactccc attcggccat
tccattagct gaggccttgt ttagttccta 1320aaaagttttg ccaaattttc agatttttcg
tcacatcaaa ttttacggca tatgcatgaa 1380acattaaata tagataaaaa gaataactaa
ttacatagtt taactgtaat ttgtgagacg 1440aatttttaaa atctaattaa tctataatta
aataatattt gtcaaataca aacgtggtat 1500aatgcctatt ttactaattt ttttgaaact
acttcctccg tcccataaat aattgcattt 1560ctttaacttc tatagtttat gtttgaccgt
tcgtcttatt aaaaaatttt taataaatat 1620tatttatttt ttcatgactt attttattgt
tagatatatt ttttatgata atttatttat 1680tttattattt gtacaaaaat ttaaataaaa
taaatggtca aacattacta ttagcagtcg 1740aagaaatgca tttatttatg ggacgaagag
agtaaacaag gcctgaagct gaagcctgaa 1800ggaagaaagg gaaaggggat tgacgcatgc
actgatggtt cggccaggca caatcagcga 1860tgaactcatg gctgacgaga agccccgtaa
ctccggaagc caggtccgcg acgtcactgc 1920tgtcggcacc ctctccaatc caatccatcc
acccgtcaac ggagcatgat ccggtctgcg 1980cccttccaac ttcttcccca ttataatttg
ctcactcgcc ggcggccgct acctgctgca 2040acccggcgac acccgaccga cagcctgctt
gccacgcttg catcgaggag agttcgggga 2100ccgcccggtg cagaaggaat cgacgcaggg
gtgagtcttt tcccctcccc tcccatgtac 2160ttcccgcctc gattcgactc caagggtcag
cgatttcact gcgcttttcc ttagcatagg 2220ctatctgaaa tttccttctg ttagagccga
gtctttcgat ctgctgactt gtcacgaatt 2280cattctgcag ttcagttcga
2300
User Contributions:
Comment about this patent or add new information about this topic: