Patent application title: PLANT REGULATORY SEQUENCE
Inventors:
Syngenta Participations Ag (Basel, CH)
Mikyong Lee (Research Triangle Park, NC, US)
Michael L. Nuccio (Research Triangle Park, NC, US)
Michael L. Nuccio (Research Triangle Park, NC, US)
Joseph Dallas Clarke, V (Research Triangle Park, NC, US)
Assignees:
Syngenta Participations AG
IPC8 Class: AC12N1582FI
USPC Class:
8003201
Class name: Higher plant, seedling, plant seed, or plant part (i.e., angiosperms or gymnosperms) gramineae (e.g., barley, oats, rye, sorghum, millet, etc.) maize
Publication date: 2013-04-25
Patent application number: 20130104265
Abstract:
The present invention relates to regulatory sequences. In particular, the
invention relates to a regulatory nucleotide sequence, at least part of
which has a transcription initiation function directing expression of an
operably associated protein encoding polynucleotide of interest to
non-pollen tissue, but not or substantially not to pollen.
The invention further relates to chimeric genes and expression cassettes
comprising said regulatory sequence and to transgenic plants comprising
said chimeric genes and expression cassettes.Claims:
1. A regulatory nucleotide sequence comprising a fragment of at least 50
consecutive nucleotides of SEQ ID NO:13, wherein the regulatory
nucleotide sequence is operably-linked to a protein encoding
polynucleotide of interest and wherein the protein encoding
polynucleotide is transcribed in leaf tissue and not in pollen.
2. The regulatory nucleotide sequence of claim 1, wherein the protein encoding polynucleotide of interest encodes a protein that confers insect resistance or drought tolerance to a transgenic plant.
4. An expression cassette comprising the regulatory nucleotide sequence of claim 1.
5. A vector molecule comprising the expression cassette according to claim 4.
6. A transgenic plant cell stably transformed with the expression cassette of claim 4.
7. A transgenic plant stably transformed with the expression cassette of claim 4.
8. A seed of the transgenic plant of claim 7 wherein the seed comprises the expression cassette.
9. A progeny of the transgenic plant of claim 7 wherein the progeny comprises the expression cassette.
10. The transgenic plant of claim 7 wherein the plant is Zea mays.
11. A seed of the transgenic plant of claim 10, wherein the seed comprises the expression cassette.
Description:
[0001] This application is a divisional of co-pending U.S. application
Ser. No. 12/172,535, filed Jul. 14, 2008, which is herein incorporated by
reference in its entirety.
[0002] The present invention is in the field of plant biotechnology and relates to regulatory sequences. In particular, the invention relates to a regulatory nucleotide sequence, at least part of which has a transcription initiation function directing expression of an operably associated protein encoding polynucleotide of interest to basically all plant tissues, but essentially excluding expression in the tissues of the reproductive structures of the plant, particularly in the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent. The invention further relates to chimeric genes and expression cassettes comprising said regulatory sequence in association with an expressible protein encoding polynucleotide of interest and to transgenic plants comprising said chimeric genes and expression cassettes, respectively, expressing the protein encoding polynucleotide of interest in basically all plant tissues, but essentially excluding expression in the tissues of the reproductive structures of the plant, particularly in the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
BACKGROUND OF THE INVENTION
[0003] In many agricultural crops such as corn, devastating pests tend to feed on vegetative tissues such as the leaf, stalk and root and also reproductive tissues such as the ear. One technique used to protect plants from pests is the application of chemical compounds. An alternative technique involves genetic recombination, wherein a gene or genes are introduced into the plant to express protein products that are directly or indirectly involved in the control of the pest organisms. Current protein products produced by genetic recombination are expressed constitutively, i.e., throughout the plant at all times and in most tissues and organs. Such protein products are also expressed specifically, either in response to particular stimuli or confined to specific cells or tissues. In contrast, the present invention includes expression of the protein or polynucleotide of interest in basically all plant tissues, but essentially excludes expression in the tissues of the reproductive structures of the plant, particularly in the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0004] Several insect control trait genes target the larval stage of development. Under certain circumstances, these proteins also affect unintended insects, which are not corn pests, but do occasionally feed on corn pollen. These insects may be harmed by insecticidal proteins expressed in pollen tissue. This was seen as a problem in early BT-corn events which had high insecticidal protein expression in pollen. This issue was addressed in later BT-corn events through the development of alternative transgene expression systems. These newer events remained effective against target pests and accumulated less insecticidal protein in pollen, but are still viewed as potentially harmful to non-target pests due to the presence of insecticidal protein in pollen.
[0005] In some instances, useful insect control trait genes may also compromise the development of reproductive structures of the plant such as, for example, the tassel.
[0006] It is, therefore, desirable to provide plants, particularly corn plants that exclude expression of the transgene in the tissues of the reproductive structures of the plant such as the tissues of the pollen and/or the tassel. This could be achieved within the scope of the present invention by providing a regulatory nucleotide sequence, at least part of which has a transcription initiation function directing expression of an operably associated protein encoding a polynucleotide of interest to basically all plant tissues, but essentially excluding expression in the tissues of the male reproductive structures of the plant, particularly in the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent. This regulatory nucleotide sequence can then be used to develop expression systems that enable effective accumulation of the polypeptide or protein of interest such as, for example, an insecticidal protein, in tissues that target pests normally feed on, and eliminate or reduce accumulation of the insecticidal protein in non-target tissues or organs and/or in those tissues that may be compromised by the polypeptide or protein of interest.
SUMMARY OF THE INVENTION
[0007] In one embodiment, the invention relates to a transgenic plant comprising stably integrated in its genome a chimeric polynucleotide construct, particularly a chimeric construct, comprising a polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, associated with and/or under control of a regulatory nucleotide sequence, at least part of which has a transcription initiation function directing expression of said protein encoding polynucleotide of interest to basically all tissues of said plant, particularly the tissues target insects normally feed on, but essentially excluding the tissues of the reproductive plant structures, particularly the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0008] In one embodiment, the polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, is not transcribed to any significant extent in the tissues of the reproductive plant structures, particularly in pollen and/or tassel tissue of the transgenic plant according to the invention. Therefore, essentially no expression of the polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, occurs in the tissues of the male reproductive plant structures, particularly in the tissues of the pollen and/or the tassel, and only residual amounts of the expression product, if any, can be detected in said tissues, which is not sufficient for the expression product to fulfil its envisaged biological function in said tissues, particularly in the tissues of the pollen and/or the tassel, and therefore also does not exhibit any toxic effects on insects feeding on said tissues or on the plant reproductive structures.
[0009] In one embodiment of the invention, a transgenic plant is provided as described herein, wherein a chimeric polynucleotide construct, particularly a chimeric DNA construct, comprises a polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, operably associated with and/or under operable control of a regulatory nucleotide sequence, at least part of which has a transcription initiation function and is obtainable from a gene encoding an actin depolymerizing factor 3 (ABP3), which polypeptide or protein is highly expressed in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent.
[0010] In one embodiment, said actin depolymerizing factor 3 (ABP3) gene is obtainable from maize.
[0011] In one embodiment of the invention, a transgenic plant is provided as described herein, wherein a chimeric polynucleotide construct, particularly a chimeric DNA construct, comprises a polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, operably associated with and/or under operable control of a regulatory nucleotide sequence, at least part of which has a transcription initiation function and is obtainable from a gene represented by a DNA probe, particularly a DNA probe exhibiting a DNA sequence as depicted in SEQ ID NOs: 47 to 56, which DNA probe shows a signal pattern in tissue samples, which is indicative of expression of said gene in all tissues and of no or substantially no expression in pollen.
[0012] In one embodiment, a transgenic plant according to the invention and as described herein is provided comprising a regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, at least part of which has a transcription initiation function and mediates expression of an operably associated protein encoding polynucleotide of interest in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent, which regulatory sequence can be obtained in a PCR reaction from a genomic Zea mays DNA template using
[0013] i) a first primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 1, particularly a first primer of SEQ ID NO: 1; or
[0014] ii) second primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 2, particularly a second primer of; SEQ ID NO: 2; or
[0015] iii) a first primer as a forward primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 1 and a second primer as a reverse primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 2, particularly the forward primer of SEQ ID NO: 1 and the reverse primer of SEQ ID NO: 2.
[0016] In one embodiment, the invention relates to a transgenic plant as described herein, wherein the nucleotide sequence providing the transcription initiation function has at least between 80% and 85% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 13, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 13, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 13, or a fragment thereof, and wherein said regulatory nucleotide sequence or fragment thereof mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide of interest such that said polynucleotide of interest is transcribed in most of the plant tissues but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent.
[0017] In one embodiment, the invention relates to a transgenic plant as described herein, wherein the complementary strand of the nucleotide sequence providing the transcription initiation function is capable of hybridizing with a nucleotide sequence depicted in SEQ ID NO: 13, particularly under moderate hybridization conditions, more particularly under stringent hybridization conditions, and wherein said regulatory nucleotide sequence mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide of interest such that said polynucleotide of interest is transcribed in most of the plant tissues but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent.
[0018] In one embodiment, the invention relates to a transgenic plant as described herein, wherein the nucleotide sequence providing the transcription initiation function is the sequence depicted in SEQ ID NO: 13 or a fragment thereof, which still exhibits the full functionality as a transcription initiation sequence.
[0019] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription termination function obtainable from a gene encoding an actin depolymerizing factor 3 (ABP3), which regulatory sequence mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide molecule of interest such that said polynucleotide of interest is transcribed in most of the plant tissues excluding the tissues of the pollen but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent, particularly from a maize actin depolymerizing factor 3 (ABP3) gene, wherein
[0020] i) said regulatory nucleotide sequence comprises a transcription termination sequence which has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 14; or a fragment thereof, which still exhibits the functionality of a termination sequence; or
[0021] ii) the complementary strand of said regulatory nucleotide sequence hybridizes to a nucleotide sequence depicted in SEQ ID NO: 14, particularly under moderate hybridization conditions, more particularly under moderate-stringent hybridization conditions, particularly under stringent hybridization conditions and mediates termination of transcription of an operably associated protein encoding polynucleotide of interest; or.
[0022] iii) said regulatory nucleotide sequence has a sequence as depicted in SEQ ID NO: 14, including complements thereof.
[0023] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a gene encoding an actin depolymerizing factor 3 (ABP3), which is expressed in most plant tissues but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent, particularly from a maize actin depolymerizing factor 3 (ABP3) gene, and which regulatory nucleotide sequence comprises a transcription initiation sequence and a transcription termination sequence, respectively, which have at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 13 and SEQ ID NO:14, respectively, or a fragment thereof which still exhibits the full functionality as a transcription initiation and a termination sequence, respectively.
[0024] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a gene encoding an actin depolymerizing factor 3 (ABP3), which is expressed in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent, particularly from a maize actin depolymerizing factor 3 (ABP3) gene, and which regulatory nucleotide sequence comprises a transcription initiation sequence as depicted in SEQ ID NO: 13 and a transcription termination sequence as depicted in SEQ ID NO:14.
[0025] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a gene represented by a DNA probe, particularly a DNA probe exhibiting a DNA sequence as depicted in SEQ ID NOs: 47 to 56, which DNA probe shows a signal pattern in tissue samples, which is indicative of expression of said gene in all tissues and of no or substantially no expression in pollen.
[0026] In one embodiment of the invention, a transgenic plant is provided as described herein, wherein the chimeric polynucleotide construct, particularly the chimeric DNA construct, comprises a polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, operably associated with and/or under operable control of a regulatory nucleotide sequence, at least part of which has a transcription initiation function and is obtainable from plant genomic DNA, particularly from maize genomic DNA, which polypeptide or protein is expressed in most tissues of the plant but essentially excluding tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0027] In one embodiment of the invention, a transgenic plant is provided as described herein, wherein the chimeric polynucleotide construct, particularly the chimeric DNA construct, comprises a polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, operably associated with and/or under operable control of a regulatory nucleotide sequence, at least part of which has a transcription initiation function and is obtainable from a gene represented by a DNA probe, particularly a DNA probe exhibiting a DNA sequence as depicted in SEQ ID NOs: 57 to 79, which DNA probe shows a signal pattern in tissue samples, which is indicative of expression of said gene in all tissues and of no or substantially no expression in the tissues of the tassel.
[0028] In one embodiment, a transgenic plant according to the invention and as described herein is provided comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, which regulatory sequence can be obtained in a PCR reaction from a genomic Zea mays DNA template using
[0029] i) a first primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 19, particularly the primer of SEQ ID NO: 19; or
[0030] ii) a second primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 20, particularly the reverse primer of SEQ ID NO: 20; or
[0031] iii) a first primer as a forward primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 19 and a second primer as a reverse primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 20, particularly the forward primer of SEQ ID NO: 19 and the reverse primer of SEQ ID NO: 20.
[0032] In one embodiment, the invention relates to a transgenic plant as described herein, wherein the nucleotide sequence providing the transcription initiation function has at least between 80% and 85% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 35, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 35, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 35, or a fragment thereof, and wherein said regulatory nucleotide sequence or fragment thereof mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide of interest such that said polynucleotide of interest is transcribed in most tissues of the plant but essentially excluding the tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0033] In one embodiment, the invention relates to a transgenic plant as described herein, wherein the complementary strand of the nucleotide sequence providing the transcription initiation function is capable of hybridizing with a nucleotide sequence depicted in SEQ ID NO: 35, particularly under moderate hybridization conditions, more particularly under stringent hybridization conditions and wherein said regulatory nucleotide sequence mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide of interest such that said polynucleotide of interest is transcribed in most tissues of the plant but essentially excluding the tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0034] In one embodiment, the invention relates to a transgenic plant as described herein, wherein the nucleotide sequence providing the transcription initiation function is the sequence depicted in SEQ ID NO: 35 or a fragment thereof, which still exhibits the full functionality as a transcription initiation sequence.
[0035] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription termination function obtainable from a plant genomic DNA, particularly a maize genomic DNA and mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide of interest such that said polynucleotide of interest is transcribed in most tissues of the plant but essentially excluding the tissues of the tassel so that no expression product is present in said tissues to any significant extent, wherein
[0036] i) said regulatory nucleotide sequence comprises a transcription termination sequence which has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 36; or a fragment thereof which still exhibits the full functionality as a transcription initiation sequence; or
[0037] ii) the complementary strand of said regulatory nucleotide sequence hybridizes to a nucleotide sequence depicted in SEQ ID NO: 36, particularly under moderate hybridization conditions, particularly under moderate-stringent hybridization conditions, particularly under stringent hybridization conditions and mediates termination of transcription of an operably associated protein encoding polynucleotide of interest; or
[0038] iii) said regulatory sequence has a sequence as depicted in SEQ ID NO: 36, or a fragment thereof, which still exhibits the full functionality as a transcription initiation sequence including complements thereof.
[0039] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a genomic plant DNA, particularly from a genomic maize DNA and is expressed in most tissues of the plant but essentially excluding tissues of the tassel so that no expression product is present in said tissues to any significant extent, which regulatory nucleotide sequence comprises a transcription initiation sequence and a transcription termination sequence, respectively, which sequences have at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 35 and SEQ ID NO:36, respectively, including a fragment thereof, which still exhibits the full functionality as a transcription initiation sequence and a termination sequence, respectively.
[0040] In one embodiment the invention relates to a transgenic plant according to the invention and as described herein comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a genomic plant DNA, particularly a genomic maize DNA and is expressed in most tissues of the plant but essentially excluding tissues of the tassel so that no expression product is present in said tissues to any significant extent, which regulatory nucleotide sequence comprises a transcription initiation sequence as depicted in SEQ ID NO: 35 and a transcription termination sequence as depicted in and SEQ ID NO:36 respectively, including a fragment thereof, which still exhibits the full functionality as a transcription initiation sequence and a termination sequence, respectively.
[0041] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding polynucleotide of interest encodes a polypeptide product exhibiting an insecticidal activity, particularly an endotoxin of Bacillus thuringiensis.
[0042] In one embodiment, the concentration of the polypeptide product expressed from the protein encoding polynucleotide of interest in the tissues of the plant reproductive structures, particularly in the tissues of the pollen and/or the tassel is such that no insecticidal activity can be detected in a standard insect feeding assay. In particular, the concentration of the expression product in the tassel is below a basic level of not more than 10 ng/mg soluble protein, particularly of not more than 5 ng/mg soluble protein, more particularly of not more than 3 ng/mg soluble protein, but especially of not more than 2 ng/mg soluble protein or less.
[0043] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding polynucleotide of interest encodes an endotoxin of Bacillus thuringiensis which has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO:15.
[0044] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding polynucleotide of interest encodes an endotoxin of Bacillus thuringiensis which has the nucleotide sequence as depicted in SEQ ID NO: 15.
[0045] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding polynucleotide of interest encodes a polypeptide product contributing to the enhancement of drought tolerance, particularly a deregulated form of a H+-pyrophosphatase, wherein said polypeptide or protein is under control of a regulatory sequence according to the invention at least part of which has a transcription initiation function which mediates expression of an operably associated protein encoding polynucleotide of interest in most plant tissues but essentially excluding expression in the tissues of the pollen and/or the tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0046] In one embodiment, the transgenic plant according to the invention and as described herein is a Zea mays plant.
[0047] In one embodiment, the invention relates to a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence, at least part of which has a transcription initiation function which mediates expression of an operably associated protein encoding a polynucleotide of interest in most plant tissues but essentially excluding expression in the tissues of the male reproductive structures, particularly the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0048] In one embodiment of the invention, the regulatory nucleotide sequence is obtainable from a gene represented by a DNA probe, particularly a DNA probe exhibiting a DNA sequence as depicted in SEQ ID NOs: 47 to 56, which DNA probe shows a signal pattern in tissue samples which is indicative of expression of said gene in all tissues and of no or substantially no expression in pollen.
[0049] In one embodiment of the invention, the regulatory nucleotide sequence is obtainable from a gene encoding an actin depolymerizing factor 3, which is expressed in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent, particularly an actin depolymerizing factor 3 gene from maize.
[0050] In one embodiment of the invention, the regulatory nucleotide sequence is obtainable from a gene represented by a DNA probe, particularly a DNA probe exhibiting a DNA sequence as depicted in SEQ ID NOs: 57 to 79, which DNA probe shows a signal pattern in tissue samples, which is indicative of expression of said gene in all tissues and of no or substantially no expression in the tissues of the tassel . . . .
[0051] In one embodiment, the invention provides a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, which sequence is obtainable from a genomic Zea mays DNA template using
[0052] i) a first primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 1, particularly a first primer of SEQ ID NO: 1; or
[0053] ii) second primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 2, particularly a second primer of; SEQ ID NO: 2; or
[0054] iii) a first primer as a forward primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 1 and a second primer as a reverse primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 2, particularly the forward primer of SEQ ID NO: 1 and the reverse primer of SEQ ID NO: 2.
[0055] In one embodiment, the regulatory nucleotide sequence according to the invention and as described herein is modified using one or more oligonucleotides selected from the group of oligonucleotides depicted in SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7 and SEQ ID NO: 8.
[0056] In one embodiment the invention relates to a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, which regulatory nucleotide sequence provides a transcription initiation function, wherein the nucleotide sequence providing said function has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 13 and wherein said regulatory nucleotide sequence mediates transcription of an operably associated protein encoding polynucleotide of interest in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent.
[0057] In one embodiment, the invention relates to a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, which regulatory nucleotide sequence provides a transcription initiation function, wherein the complementary strand of the nucleotide sequence providing said function hybridizes to a nucleotide sequence depicted in SEQ ID NO: 13, particularly under moderate hybridization conditions, more particularly under moderately stringent hybridization conditions and wherein said regulatory nucleotide sequence mediates transcription of an operably associated protein encoding polynucleotide of interest in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent. In particular, said hybridization occurs under stringent hybridization conditions.
[0058] In one embodiment of the invention, the nucleotide sequence providing the transcription initiation function is the sequence depicted in SEQ ID NO: 13, or a fragment thereof which still exhibits full functionality as a transcription initiation sequence, and complements thereof.
[0059] In one embodiment, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence according to the invention and as described herein is provided comprising approximately 1 kb of the nucleotide sequence upstream of the ZmABP3 transcription start site of a ZmABP3 gene, particularly upstream of the ZmABP3 transcription start site of the ZmABP3 gene as depicted in SEQ ID NO: 17.
[0060] In one embodiment of the invention, said regulatory nucleotide sequence comprises in addition the ZmABP3 5'-untranslated sequence, the ZmABP3 first exon, the ZmABP3 first intron and a portion of the ZmABP3 second exon, particularly a portion of the ZmABP3 second exon terminating at the translation initiation codon, particularly a portion of the ZmABP3 second exon comprising between about 10 to about 20 nucleotides, particularly between about 12 and about 16 nucleotides, particularly about 14 nucleotides, of the second exon.
[0061] In one embodiment, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence is provided at least part of which has a transcription termination function, which sequence is obtainable in a PCR amplification reaction from a gDNA template, particularly a maize gDNA template, using a forward primer (P3 (5'-tatatagagctcgcatcatgatcatgcatcatggact-3') which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 9 and a reverse primer (P4 (5'-atatatactagtggcgcgccacactttctgtcgcatgtgatttgca-3') which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 10. In particular, said regulatory nucleotide sequence comprises a transcriptional terminator and poly-adenylation signal. In particular, a forward primer (P3 (5'-tatatagagctcgcatcatgatcatgcatcatggact-3')) which has a nucleotide sequence as depicted in SEQ ID NO: 9 and a reverse primer (P4 (5'-atatatactagtggcgcgccacactlictgtcgcatgtgatttgca-3') which has a nucleotide sequence as depicted in SEQ ID NO: 10 are used.
[0062] In one embodiment of the invention, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence is provided which comprises a transcription termination sequence obtainable from a gene encoding an actin depolymerizing factor 3 (ABP3), which regulatory sequence mediates transcription of an operably associated polynucleotide molecule, particularly of an operably associated protein encoding polynucleotide molecule of interest such that said polynucleotide of interest is transcribed in most of the plant tissues but not or substantially not in the tissues of the pollen so that no expression product is present in said tissues to any significant extent, particularly from a maize actin depolymerizing factor 3 (ABP3) gene, wherein
[0063] i) said regulatory nucleotide sequence comprises a transcription termination sequence which regulatory sequence has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 14; or
[0064] ii) the complementary strand of said regulatory nucleotide sequence hybridizes to a nucleotide sequence depicted in SEQ ID NO: 14, particularly under moderate hybridization conditions, more particularly under moderate-stringent hybridization conditions, particularly under stringent hybridization conditions and mediates termination of transcription of an operably associated protein encoding polynucleotide of interest; or
[0065] iii) said regulatory nucleotide sequence has a sequence as depicted in SEQ ID NO: 14, or a fragment thereof which still exhibits full functionality as a termination sequence, including complements thereof.
[0066] In one embodiment of the invention, a regulatory nucleotide sequence is provided or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a gene encoding an actin depolymerizing factor 3 (ABP3), which is expressed in most tissues of the plant but not or substantially not in the tissues of the pollen so that no expression product is present in said tissues to any significant extent, particularly from a maize actin depolymerizing factor 3 (ABP3) gene, and which regulatory nucleotide sequence comprises a transcription initiation sequence as depicted in SEQ ID NO: 13 and a transcription termination sequence as depicted in SEQ ID NO:14.
[0067] In one embodiment of the invention, the regulatory nucleotide sequence is obtainable from maize genomic DNA, particularly from a putative gene on the maize genome, which is highly expressed in most tissues of the plant but not or substantially not in the tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0068] In one embodiment, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence according to the invention and as described herein is provided comprising approximately 2.6 kb of the 5'-sequence including approximately 2 kb of 5'-non-transcribed sequence, a 5'-UTR, and exon 1 and part of exon 2 and intron 1, particularly approximately 0.6 kb representing exon 1, intron 1 and about 16 bp of exon 2.
[0069] In one embodiment, the invention provides a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence at least part of which has a transcription initiation function as described herein, which regulatory sequence is obtainable from a genomic Zea mays DNA template using
[0070] i) a first primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 19, particularly the primer of SEQ ID NO: 19; or
[0071] ii) a second primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 20, particularly the reverse primer of SEQ ID NO: 20; or
[0072] iii) a first primer as a forward primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 19 and a second primer as a reverse primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 0.97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 20, particularly the forward primer of SEQ ID NO: 19 and the reverse primer of SEQ ID NO: 20.
[0073] In one embodiment, the regulatory nucleotide sequence according to the invention and as described herein is modified using one or more oligonucleotides selected from the group of oligonucleotides depicted in SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25 and SEQ ID NO: 26.
[0074] In one embodiment, the invention relates to a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, which regulatory nucleotide sequence provides a transcription initiation function, wherein the nucleotide sequence providing said function has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 35 and wherein said regulatory nucleotide sequence mediates transcription of an operably associated protein encoding polynucleotide of interest in most tissues of the plant but essentially excluding the tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0075] In one embodiment, the invention relates to a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence as described herein, which regulatory nucleotide sequence provides a transcription initiation function, wherein the complementary strand of the nucleotide sequence providing said function hybridizes to a nucleotide sequence depicted in SEQ ID NO: 35, particularly under moderate hybridization conditions, more particularly under moderately stringent hybridization conditions and wherein said regulatory nucleotide sequence mediates transcription of an operably associated protein encoding polynucleotide of interest in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent. In particular, said hybridization occurs under stringent hybridization conditions.
[0076] In one embodiment of the invention, the nucleotide sequence providing the transcription initiation function is the sequence depicted in SEQ ID NO: 35, or a fragment thereof, which still exhibits the full functionality as a transcription initiation sequence and complements thereof.
[0077] In one embodiment, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence is provided at least part of which has a transcription termination function which sequence is obtainable in a PCR amplification reaction from a gDNA template, particularly a maize gDNA template, using a forward primer which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 29 and a reverse primer, which has at least 90%, particularly at least 91%, particularly at least 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO: 30. In particular, said regulatory nucleotide sequence comprises a transcriptional terminator and poly-adenylation signal. In particular, a forward primer, which has a nucleotide sequence as depicted in SEQ ID NO: 29 and a reverse primer, which has a nucleotide sequence as depicted in SEQ ID NO: 30 are used.
[0078] In one embodiment of the invention, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence is provided wherein
[0079] i) said regulatory nucleotide sequence comprises a transcription termination sequence which has at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 36; or
[0080] ii) the complementary strand of said regulatory nucleotide sequence hybridizes to a nucleotide sequence depicted in SEQ ID NO: 36, particularly under moderate hybridization conditions, particularly under moderate-stringent hybridization conditions, particularly under stringent hybridization conditions and mediates termination of transcription of an operably associated protein encoding polynucleotide of interest; or
[0081] iii) said regulatory sequence has a sequence as depicted in SEQ ID NO: 36, or a fragment thereof, which still exhibits the full functionality as a termination sequence, including complements thereof.
[0082] In one embodiment of the invention, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence, is provided at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a maize genomic DNA, which is expressed in most tissues of the plant but not or substantially not in the tissues of the tassel so that no expression product is present in said tissues to any significant extent, and which regulatory nucleotide sequence comprises a transcription initiation sequence and a transcription termination sequence, respectively, which have at least between 80% and 85% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least between 85% and 90% sequence identity, with all integers falling within this range also being comprised herewith, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence depicted in SEQ ID NO: 35 and SEQ ID NO:36, respectively.
[0083] In one embodiment of the invention, a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence, is provided at least part of which has a transcription initiation function and a termination function, respectively, which regulatory nucleotide sequence is obtainable from a maize genomic DNA, which is expressed in most tissues of the plant but not or substantially not in the tissues of the tassel so that no expression product is present in said tissues to any significant extent, and which regulatory nucleotide sequence comprises a transcription initiation sequence as depicted in SEQ ID NO: 35 and a transcription termination sequence as depicted in SEQ ID NO: 36.
[0084] It is apparent to the skilled artisan that, based on the nucleotide sequences shown in SEQ ID NO: 13, SEQ ID NO 14, SEQ ID NO: 35 and SEQ ID NO: 36, fragments of various length can be obtained from said sequences, for example by using any primer combinations of interest to generate fragments that still exhibit the specific regulatory function according to the invention that is driving expression of an operably associated polynucleotide of interest in most plant tissues but tissues of the pollen and the tassel, respectively. The invention thus includes fragments derived from a full-length transcript promoter and a full-length terminator of the invention and as described herein, respectively that function according to the invention, i.e. are capable of conferring expression and termination of an operably associated nucleotide sequence in most plant tissues but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent and/or the tassel.
[0085] The function of the promoter and terminator fragments, once obtained, can be easily tested by fusing them to a selectable or screenable marker gene and assaying the fusion constructs for retention of the specific promoter activity. Such assays are within the ordinary skill of the person skilled in the art.
[0086] In one embodiment, the invention relates to nucleotide fragments, particularly to nucleotide fragments obtainable from the regulatory sequences of an action depolymerizing factor 3 (ABP3) gene, which nucleotide fragments are of at least about 50 bases, preferably of between about 400 bases and about 650 bases, more preferably of between about 200 bases and about 400 bases and most preferably of about 350 bases in length and still exhibit the specific regulatory function according to the invention that is driving expression of an operably associated polynucleotide of interest in most plant tissues but essentially excluding the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0087] In one embodiment, the invention relates to nucleotide fragment comprising a nucleotide sequence comprising a consecutive stretch of at least 50 nt, particularly of between about 400 nt and about 650 nt, particularly of between about 200 nt and about 400 nt, particularly of about 350 nt in length of the nucleotide sequence depicted in SEQ ID NO:13 and SEQ ID NO: 35, respectively, wherein said nucleotide sequences still exhibit the specific regulatory function according to the invention that is driving expression of an operably associated polynucleotide of interest in most plant tissues but essentially excluding the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0088] It is also clear to the skilled artisan that variant sequences may be obtained without affecting the specific properties of the regulatory sequences according to the invention by introducing mutations, i.e. insertions, deletions and/or substitutions of one or more nucleotides, into the DNA sequences of SEQ ID NO: 13, SEQ ID NO 14, SEQ ID NO: 35 and SEQ ID NO: 36, respectively, using methods known in the art. In addition, an unmodified or modified nucleotide sequence of the present invention may be further varied by shuffling the sequence of the invention. To test for a function of variant DNA sequences according to the invention, the sequence of interest is operably linked to a selectable or screenable marker gene and expression of the marker gene is tested in transient expression assays with protoplasts or in whole plant tissues or in stably transformed plants. It is known to the skilled artisan that DNA sequences capable of driving expression of an operably associated nucleotide sequence are build in a modular way. Accordingly, expression levels from shorter DNA fragments may be different than the one from the longest fragment and may be different from each other. For example, deletion of a down-regulating upstream element will lead to an increase in the expression levels of the associated nucleotide sequence while deletion of an up-regulating element will decrease the expression levels of the associated nucleotide sequence.
[0089] In one embodiment, the invention relates to an expression cassette comprising a regulatory nucleotide sequence or an expression cassette comprising said regulatory nucleotide sequence or a polynucleotide construct, particularly a chimeric polynucleotide construct, comprising said regulatory sequence according to the invention and as described herein.
[0090] In one embodiment, the expression cassette according to the invention comprises about 2.3 kb of the 5'-sequence of ZmABP3 which consists of about 1.1 kb of 5'-non-transcribed sequence, about 0.25 kb of 5'-UTR and about 0.98 kb representing ZmABP3-intron 1, about 1.013 kb of the 3'-sequence starting just past the ABP3 translation stop codon including about 0.3 kb of 3'-UTR and about 0.7 kb of non-transcribed sequence, which functions as the transcriptional terminator and poly-adenylation signal.
[0091] In one embodiment, an expression cassette according to the invention is provided wherein the natural translation start codon is silenced and moved to the second exon, particularly moved within 15 nucleotides of the 5'-end of ZmABP3 exon 2.
[0092] In one embodiment, an expression cassette according to the invention is provided wherein the start codon is preceded by the Kozak sequence 5'- . . . CCACC . . . -3'.
[0093] In one embodiment, the expression cassette according to the invention comprises a regulatory nucleotide sequence comprising approximately 2.6 kb of the 5'-sequence, which consists of approximately 2 kb of 5'-non-transcribed sequence, and about 12 bp of 5'-UTR, approximately 0.6 kb representing exon 1, intron 1 and about 16 bp of exon 2; and approximately 1 kb of the 3'-sequence that begins just past the translation stop codon and includes approximately 0.6 kb of 3'-UTR and about 0.4 kb of non-transcribed sequence, and functions as the transcriptional terminator and poly-adenylation signal.
[0094] In one embodiment, an expression cassette according to the invention is provided wherein the natural translation start codon is silenced and moved to the second exon.
[0095] In one embodiment, a polypeptide or protein encoding nucleotide sequence is provided encoding an endotoxin of Bacillus thuringiensis which has at least 80% sequence identity, particularly at least 85% sequence identity, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO:15.
[0096] In one embodiment, a polypeptide or protein encoding nucleotide sequence is provided encoding an endotoxin of Bacillus thuringiensis which has the nucleotide sequence as depicted in SEQ ID NO: 15.
[0097] In one embodiment, the invention relates to a transgenic plant comprising an expression cassette according to the invention and as described herein.
[0098] In one embodiment, the invention provides a transgenic plant, particularly a transgenic maize plant comprising a regulatory sequence according to the invention and as described herein.
[0099] In one embodiment, the invention provides a transgenic plant, particularly a transgenic maize plant comprising a regulatory sequence according to the invention and as described herein in association with a polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest.
[0100] In one embodiment, the invention provides a transgenic plant, particularly a transgenic maize plant comprising an expression cassette according to the invention and as described herein.
[0101] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding nucleotide sequence encodes an endotoxin of Bacillus thuringiensis which has at least 80% sequence identity, particularly at least 85% sequence identity, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO:15 and is under the control of a regulatory sequences operable in said plant.
[0102] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding nucleotide sequence encodes an endotoxin of Bacillus thuringiensis which has the nucleotide sequence as depicted in SEQ ID NO: 15 and is under the control of a regulatory sequences operable in said plant.
[0103] The invention also provides methods for preparing expression cassettes comprising the regulatory sequence according to the invention comprising linking an expressible polynucleotide encoding a polypeptide or a protein of interest with the regulatory sequence according to the invention and as described herein to obtain an expression construct, wherein the polynucleotide of interest is operably linked or associated with the regulatory sequence such that expression of the polypeptide or a protein of interest is mediated by the regulatory sequence according to the invention and results in the expression of said polypeptide or a protein of interest in essentially all plant tissues, but essentially excludes expression in the tissues of the reproductive structures of the plant, particularly in the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0104] In one embodiment, the invention relates to a method of producing a transgenic plant expressing a DNA sequence of interest in non-pollen tissue but not or substantially not in the tissues of the pollen and/or the tassel, comprising
[0105] a. transforming an expression cassette according to the invention and as described herein into a plant cell which comprises a regulatory nucleotide sequence, at least part of which has a transcription initiation function which mediates expression of an operably associated protein encoding polynucleotide of interest in most plant tissues but essentially excluding the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent; and
[0106] b. regenerating the plant cell transformed in step a) into a plant.
[0107] In one embodiment, the invention relates to a method of controlling insect target-pests feeding on vegetative plant tissues such as the leaf, stalk and root and/or on reproductive tissues such as the ear, but protecting non-target pests feeding on pollen comprising
[0108] a. growing a plant according to the invention and as described herein in an area that is infested with the target pest;
[0109] b. expressing a polypeptide or protein that is capable of controlling said target pest under the control of a regulatory sequence according to the invention and as described herein.
[0110] In one embodiment, the invention relates to a method of protecting the reproductive tissues of a plant, particularly the tissues of the pollen and/or the tassel against damage caused by expression in said tissues of a polypeptide or protein of interest comprising
[0111] a. growing a plant according to the invention and as described herein;
[0112] b. expressing in said plant a polypeptide or protein of interest under the control of a regulatory sequence according to the invention and as described herein.
[0113] In one embodiment the present invention relates to the use of a regulatory sequence according to the present invention and as disclosed herein for protecting the reproductive tissues of a plant, particularly the tissues of the pollen and/or the tassel against damage caused by expression in said tissues of a polypeptide or protein of interest comprising expressing in said plant said polypeptide or protein of interest under the control of a regulatory sequence according to the invention and as described herein.
BRIEF DESCRIPTION OF THE SEQUENCES
[0114] SEQ ID NO: 1 depicts the nucleotide sequence of forward primer P1
[0115] SEQ ID NO: 2 depicts the nucleotide sequence of reverse primer P2
[0116] SEQ ID NO: 3 depicts the nucleotide sequence of oligonucleotide Patg
[0117] SEQ ID NO: 4 depicts the nucleotide sequence of oligonucleotide Pnco
[0118] SEQ ID NO: 5 depicts the nucleotide sequence of oligonucleotide ADPc-1
[0119] SEQ ID NO: 6 depicts the nucleotide sequence of oligonucleotide ADPc-2
[0120] SEQ ID NO: 7 depicts the nucleotide sequence of oligonucleotide ADPc-4
[0121] SEQ ID NO: 8 depicts the nucleotide sequence of oligonucleotide adp3-a
[0122] SEQ ID NO: 9 depicts the nucleotide sequence of forward primer P3
[0123] SEQ ID NO: 10 depicts the nucleotide sequence of reverse primer P4
[0124] SEQ ID NO: 11 depicts the nucleotide sequence of forward primer Tnco
[0125] SEQ ID NO: 12 depicts the nucleotide sequence of forward primer T2
[0126] SEQ ID NO: 13 depicts the nucleotide sequence of modified ZmABP3 regulatory sequence including the transcription initiation sequence
[0127] SEQ ID NO: 14 depicts the nucleotide sequence of ZmABP3 terminal sequence
[0128] SEQ ID NO: 15 depicts the nucleotide sequence of Cry1AbG6
[0129] SEQ ID NO: 16 depicts the nucleotide sequence of maize-optimized AtAVP1D coding sequence
[0130] SEQ ID NO: 17 depicts the nucleotide sequence of the ZmABP3 gene
[0131] SEQ ID NO: 18 depicts the nucleotide sequence of the pNOV1321 plasmid
[0132] SEQ ID NO: 19 depicts the nucleotide sequence of forward primer ABT P1 forw
[0133] SEQ ID NO: 20 depicts the nucleotide sequence of reverse primer ABT P2 rev
[0134] SEQ ID NO: 21 depicts the nucleotide sequence of oligonucleotide pABT mut1
[0135] SEQ ID NO: 22 depicts the nucleotide sequence of oligonucleotide pABT mut2
[0136] SEQ ID NO: 23 depicts the nucleotide sequence of oligonucleotide pABT mut3
[0137] SEQ ID NO: 24 depicts the nucleotide sequence of oligonucleotide pABT mut4
[0138] SEQ ID NO: 25 depicts the nucleotide sequence of oligonucleotide pABT mut5
[0139] SEQ ID NO: 26 depicts the nucleotide sequence of oligonucleotide pABT mut6
[0140] SEQ ID NO: 27 depicts the nucleotide sequence of forward primer pABT amp1
[0141] SEQ ID NO: 28 depicts the nucleotide sequence of reverse primer pABT amp2
[0142] SEQ ID NO: 29 depicts the nucleotide sequence of forward primer ABT P4
[0143] SEQ ID NO: 30 depicts the nucleotide sequence of reverse primer ABT P5
[0144] SEQ ID NO: 31 depicts the nucleotide sequence of oligonucleotide ABTt m1
[0145] SEQ ID NO: 32 depicts the nucleotide sequence of oligonucleotide ABTt m2
[0146] SEQ ID NO: 33 depicts the nucleotide sequence of ZmABT1 cDNA
[0147] SEQ ID NO: 34 depicts the nucleotide sequence of ZmABT2 cDNA
[0148] SEQ ID NO: 35 depicts the nucleotide sequence of the ZmABT promoter
[0149] SEQ ID NO: 36 depicts the nucleotide sequence of the ZmABT terminal sequence.
[0150] SEQ ID NO: 37 depicts the nucleotide sequence of the ZmABP3-Cry1AbG6 Assembly construct.
[0151] SEQ ID NO: 38 depicts the nucleotide sequence of the ZmABP3-Cry1AbG6 binary construct.
[0152] SEQ ID NO: 39 depicts the nucleotide sequence of the enhanced ZmABP3-Cry1AbG6 binary construct.
[0153] SEQ ID NO: 40 depicts the nucleotide sequence of the ZmABP3-AmCyan assembly construct.
[0154] SEQ ID NO: 41 depicts the nucleotide sequence of the ZmABP3-AmCyan binary construct.
[0155] SEQ ID NO: 42 depicts the nucleotide sequence of the ZmABP3-AtAVP1 D assembly construct.
[0156] SEQ ID NO: 43 depicts the nucleotide sequence of the ZmABP3-AtAVP1D binary construct.
[0157] SEQ ID NO: 44 depicts the nucleotide sequence of plasmid 15772 (ZmABT Assembly)
[0158] SEQ ID NO: 45 depicts the nucleotide sequence of plasmid 15773
[0159] SEQ ID NO: 46 depicts the nucleotide sequence of ZmABT gDNA
[0160] SEQ ID NO: 47 depicts the nucleotide sequence of Ctrl_ZMU45855-3_at
[0161] SEQ ID NO: 48 depicts the nucleotide sequence of AF032370_at
[0162] SEQ ID NO: 49 depicts the nucleotide sequence of Zm001747_s_at
[0163] SEQ ID NO: 50 depicts the nucleotide sequence of Zm005803_s_at
[0164] SEQ ID NO: 51 depicts the nucleotide sequence of Zm007728_s_at
[0165] SEQ ID NO: 52 depicts the nucleotide sequence of Zm009722_s_at
[0166] SEQ ID NO: 53 depicts the nucleotide sequence of Zm015335_s_at
[0167] SEQ ID NO: 54 depicts the nucleotide sequence of Zm021004_s_at
[0168] SEQ ID NO: 55 depicts the nucleotide sequence of Zm058948_s_at
[0169] SEQ ID NO: 56 depicts the nucleotide sequence of Zm061393_s_at
[0170] SEQ ID NO: 57 depicts the nucleotide sequence of Zm016864_s_at
[0171] SEQ ID NO: 58 depicts the nucleotide sequence of Zm018791_at
[0172] SEQ ID NO: 59 depicts the nucleotide sequence of ZMMETALL_x_at
[0173] SEQ ID NO: 60 depicts the nucleotide sequence of Zm000019_at
[0174] SEQ ID NO: 61 depicts the nucleotide sequence of Zm002987_at
[0175] SEQ ID NO: 62 depicts the nucleotide sequence of Zm002990_s_at
[0176] SEQ ID NO: 63 depicts the nucleotide sequence of Zm002990_x_at
[0177] SEQ ID NO: 64 depicts the nucleotide sequence of Zm004433_at
[0178] SEQ ID NO: 65 depicts the nucleotide sequence of Zm005761_at
[0179] SEQ ID NO: 66 depicts the nucleotide sequence of Zm006285_at
[0180] SEQ ID NO: 67 depicts the nucleotide sequence of Zm006481_s_at
[0181] SEQ ID NO: 68 depicts the nucleotide sequence of Zm010323_s_at
[0182] SEQ ID NO: 69 depicts the nucleotide sequence of Zm011554_at
[0183] SEQ ID NO: 70 depicts the nucleotide sequence of Zm011554_x_at
[0184] SEQ ID NO: 71 depicts the nucleotide sequence of Zm021403_at
[0185] SEQ ID NO: 72 depicts the nucleotide sequence of Zm028405_s_at
[0186] SEQ ID NO: 73 depicts the nucleotide sequence of Zm032921_s_at
[0187] SEQ ID NO: 74 depicts the nucleotide sequence of Zm033444_s_at
[0188] SEQ ID NO: 75 depicts the nucleotide sequence of Zm035082_s_at
[0189] SEQ ID NO: 76 depicts the nucleotide sequence of Zm040564_x_at
[0190] SEQ ID NO: 77 depicts the nucleotide sequence of Zm054116_s_at
[0191] SEQ ID NO: 78 depicts the nucleotide sequence of Zm066342_at
[0192] SEQ ID NO: 79 depicts the nucleotide sequence of Zm051284_at
[0193] SEQ ID NO: 80 depicts the nucleotide sequence of Vector 15289
[0194] SEQ ID NO: 81 depicts the nucleotide sequence of ZmABP-948-binary
[0195] SEQ ID NO: 82 depicts the nucleotide sequence of ZmABT-990-binary
[0196] SEQ ID NO: 83 depicts the nucleotide sequence of 5' Bfr1 primer
[0197] SEQ ID NO: 84 depicts the nucleotide sequence of 3' Xba1 primer
[0198] SEQ ID NO: 85 depicts the nucleotide sequence of 5'Gfix primer
[0199] SEQ ID NO: 86 depicts the nucleotide sequence of 3'Gfix primer
[0200] SEQ ID NO: 87 depicts the nucleotide sequence of 5'1Ab5XbaI primer
[0201] SEQ ID NO: 88 depicts the nucleotide sequence of 3'1Ab3d6 primer
[0202] SEQ ID NO: 89 depicts the nucleotide sequence of cy2'
[0203] SEQ ID NO: 90 depicts the nucleotide sequence of cy1
[0204] SEQ ID NO: 91 depicts the nucleotide sequence of cy2
DETAILED DESCRIPTION OF THE INVENTION
Definitions
[0205] The technical terms and expressions used within the scope of this application are generally to be given the meaning commonly applied to them in the pertinent art of plant molecular biology if not otherwise indicated herein below.
[0206] As used in this specification and the appended claims, the singular forms "a", "an", and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a plant" includes one or more plants, and reference to "a cell" includes mixtures of cells, tissues, and the like.
[0207] As used in this specification and the appended claims, the plural form "tissues", includes also the singular form unless the context clearly dictates otherwise. Thus, for example, reference to "tissues of the tassel" includes one or more tissues present in the tassel.
[0208] As used in this specification and the appended claims, the phrase "most tissues of the plant" or "essentially all tissues of the plant" is used interchangeably and refers to the majority to the tissues present in the plant with the exception of the tissues of the reproductive structures, particularly the tissues of the pollen and the tassel. In particular, "most tissues" refer to those tissues of the plant where target insects mainly feed on, with the exception of the tissues of the male reproductive structures, such as the tissues of the stalk, the roots, the leaves, the ear, the ear sheath, the silks and the developing kernels.
[0209] The term "polynucleotide" is understood herein to refer to polymeric molecule of high molecular weight which can be single-stranded or double-stranded, composed of monomers (nucleotides) containing a sugar, phosphate and a base which is either a purine or pyrimidine. A "polynucleotide fragment" is a fraction of a given polynucleotide molecule. In higher plants, deoxyribonucleic acid (DNA) is the genetic material while ribonucleic acid (RNA) is involved in the transfer of information contained within DNA into proteins. A "genome" is the entire body of genetic material contained in each cell of an organism, including the genomes of the mitochondria and the plastids. The term "polynucleotide" thus refers to a polymer of DNA or RNA which can be single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases capable of incorporation into DNA or RNA polymers. Unless otherwise indicated, a particular nucleic acid sequence of this invention also implicitly encompasses conservatively modified variants thereof (e.g. degenerate codon substitutions) and complementary sequences and as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer, et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka, et al., J. Biol. Chem. 260:2605-2608 (1985); and Rossolini, et al., Mol. Cell. Probes 8:91-98 (1994)). The term polynucleotide is used interchangeably with nucleic acid, nucleotide sequence and may include genes, cDNAs, and mRNAs encoded by a gene, etc.
[0210] A "regulatory nucleotide sequence at least part of which has a transcription initiation function" is understood herein to refer to a nucleotide sequence, which controls the expression of an operably associated coding sequence by providing the recognition for RNA polymerase and other factors required for proper transcription and is located usually upstream (5') to its coding sequence. "Regulatory nucleotide sequences" include 5' regulatory sequences located proximal and more distal elements upstream of the associated coding region, which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. "Regulatory nucleotide sequences" may further include 3' sequences, including 3' non-translated and/or 3' non-transcribed sequences, located downstream of the associated coding region, and can include a transcription termination site. "Regulatory nucleotide sequences" may include enhancers, promoters, untranslated leader sequences, introns, and polyadenylation signal sequences. They include natural and synthetic sequences as well as sequences that may be a combination of synthetic and natural sequences. An "enhancer" is a DNA sequence that can stimulate promoter activity and may be an innate element of the promoter or a heterologous element inserted to enhance the level or tissue specificity of a promoter. It is capable of operating in both orientations (normal or flipped), and is capable of functioning even when moved either upstream or downstream from the promoter. The meaning of the term "regulatory nucleotide sequences" includes "transcription initiation" or "promoter" sequences and "promoter regulatory sequences." These terms are used interchangeably herein after.
[0211] For purposes of this invention, the definition of the term "3'-nontranscribed sequence" includes modifications to the nucleotide sequence of a 3'-nontranscribed sequence derived from a target gene, provided the modified 3'-nontranscribed sequence does not significantly reduce the activity of its associated 3' regulatory sequence. The 3'-nontranscribed sequence extends approximately 0.5 to 1.5 kb downstream of the transcription termination site.
[0212] The polynucleotide of the invention is understood to be provided in isolated form. The term "isolated" means that the polynucleotide disclosed and claimed herein is not a polynucleotide as it occurs in its natural context, if it indeed has a naturally occurring counterpart. Accordingly, the other compounds of the invention described further below are understood to be isolated. If claimed in the context of a plant genome, the polynucleotide of the invention is distinguished over naturally occurring counterparts by i.e. modifications introduced into the naturally occurring counterpart sequence and/or the insertion side in the genome and the flanking sequences at the insertion side.
[0213] "Operably associated" and "operably-linked" are used interchangeably and refer to the association of nucleic acid sequences on a single nucleic acid fragment so that the function of one is affected by the other. For example, a promoter is associated or operably-linked with a coding sequence or functional RNA when it is capable of affecting the expression of that coding sequence or functional RNA (i.e., that the coding sequence or functional RNA is under the transcriptional control of the promoter). Coding sequences in sense or antisense orientation can be operably-linked to regulatory sequences.
[0214] The term "present to any significant extent" as used within the context of the present invention refers to the fact that only negligible expression occurs in pollen resulting in only minor amounts of the expression product in pollen tissue at concentrations that may be detectable by high-resolution detection methods such as HPLC, ELISA-based assays, Western analysis, insect feeding assays, enzyme activity assays etc., but stay below a certain threshold level that would be needed to effect the envisaged biological function of the expression product. For example, in case of the Cry1AbG6 endotoxin of Bacillus thuringiensis the threshold level is in the range of between 5 ng/mg soluble protein and 60 ng/mg soluble protein, particularly in the range of between 20 ng/mg soluble protein and 50 ng/mg soluble protein.
[0215] The term "chimeric gene" refers to any gene that contains 1) DNA sequences, including regulatory and coding sequences that are not found together in nature in this specific combination or 2) sequences encoding parts of proteins not naturally adjoined, or 3) parts of promoters that are not naturally adjoined. Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that are derived from different sources, or comprise regulatory sequences and coding sequences derived from the same source, but arranged in a manner different from that found in nature.
[0216] The terms "heterologous DNA sequence," "exogenous DNA segment" or "heterologous nucleic acid," as used herein, each refer to a sequence that originates from a source foreign to the particular host cell or, if from the same source, is modified from its original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell but has been modified through, for example, the use of DNA shuffling or mutation. These terms also include non-naturally occurring multiple copies of a naturally occurring DNA sequence. Thus, the terms refer to a DNA segment that is foreign or heterologous to the cell, or homologous to the cell but in a position within the host cell genome in which the element is not ordinarily found. Exogenous DNA segments are expressed to yield exogenous polypeptides. A "homologous" DNA sequence is a DNA sequence that is naturally associated with a host cell into which it is introduced.
[0217] A "transgene" refers to a gene that has been introduced into the genome by transformation and is stably maintained. Transgenes may include, for example, genes that are either heterologous or homologous to the genes of a particular plant to be transformed. Additionally, transgenes may comprise native genes inserted into a non-native organism, or chimeric genes. The term "endogenous gene" refers to a native gene in its natural location in the genome of an organism. A "foreign" gene refers to a gene not normally found in the host organism but that is introduced by gene transfer.
[0218] "Expression cassette" as used herein means a DNA sequence capable of directing expression of a particular nucleotide sequence in an appropriate host cell, comprising a promoter operably linked to the protein encoding polynucleotide of interest which is operably linked to a terminator. It also typically comprises sequences required for proper translation of the nucleotide sequence. The coding region usually codes for a protein of interest but may also code for a functional RNA of interest, for example antisense RNA or a nontranslated RNA, in the sense or antisense direction. The expression cassette comprising the protein encoding polynucleotide of interest may be chimeric.
[0219] "Intron" refers to an intervening section of DNA which occurs almost exclusively within a eukaryotic gene, but which is not translated to amino acid sequences in the gene product. The introns are removed from the pre-mature mRNA through a process called splicing, which leaves the exons untouched, to form an mRNA. For purposes of the present invention, the definition of the term "intron" includes modifications to the nucleotide sequence of an intron derived from a target gene, provided the modified intron does not significantly reduce the activity of its associated 5' regulatory sequence.
[0220] "Exon" refers to a section of DNA which carries the coding sequence for a protein or part of it. Exons are separated by intervening, non-coding sequences (introns). For purposes of the present invention, the definition of the term "exon" includes modifications to the nucleotide sequence of an exon derived from a target gene, provided the modified exon does not significantly reduce the activity of its associated 5' regulatory sequence.
[0221] The terms "protein," "peptide" and "polypeptide" are used interchangeably herein.
[0222] A "probe" as used herein refers to a defined nucleic acid (DNA or RNA) fragment of variable length which may be used to detect in a DNA or RNA containing sample nucleotide sequences that are complementary to the sequence represented by the probe molecule.
[0223] The probe molecules may be used in a microarray set up, where they are covalently attached to a chemical matrix on an inert surface, such as coated glass slides or silicon based gene chips. Hybridization of the probe molecules to a target nucleic acid in the sample usually occurs under high stringency conditions. Probe-target hybridization is usually detected and quantified by fluorescence-based detection of fluorophore-labeled targets to determine relative transcript abundance of nucleic acid sequences in the target. DNA microarrays may be used in expression profiling experiments to quantify transcript abundance for a target molecule in tissue samples such as the tissues of the pollen and/or the tassel, calculated based on the strength of the signal detected in the respective samples.
[0224] The term "hybridize" as used herein refers to conventional hybridization conditions, preferably to hybridization conditions at which 5×SSPE, 1% SDS, 1×Denhardts solution is used as a solution and/or hybridization temperatures are between 35° C. and 70° C., preferably 65° C. After hybridization, washing is preferably carried out first with 2×SSC, 1% SDS and subsequently with 0.2×SSC at temperatures between 35° C. and 75° C., particularly between 45° C. and 65° C., but especially at 59° C. (regarding the definition of SSPE, SSC and Denhardts solution see Sambrook et al. loc. cit.). High stringency hybridization conditions as for instance described in Sambrook et al, supra, are particularly preferred. Particularly preferred stringent hybridization conditions are for instance present if hybridization and washing occur at 65° C. as indicated above. Non-stringent hybridization conditions for instance with hybridization and washing carried out at 45° C. are less preferred and at 35° C. even less.
[0225] "Sequence Homology or Sequence Identity" is used herein interchangeably. The terms "identical" or percent "identity" in the context of two or more nucleic acid or protein sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured using one of the following sequence comparison algorithms or by visual inspection. If two sequences which are to be compared with each other differ in length, sequence identity preferably relates to the percentage of the nucleotide residues of the shorter sequence which are identical with the nucleotide residues of the longer sequence. Sequence identity can be determined conventionally with the use of computer programs such as the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive Madison, Wis. 53711). Bestfit utilizes the local homology algorithm of Smith and Waterman, Advances in Applied Mathematics 2 (1981), 482-489, in order to find the segment having the highest sequence identity between two sequences. When using Bestfit or another sequence alignment program to determine whether a particular sequence has for instance 95% identity with a reference sequence of the present invention, the parameters are preferably so adjusted that the percentage of identity is calculated over the entire length of the reference sequence and that homology gaps of up to 5% of the total number of the nucleotides in the reference sequence are permitted. When using Bestfit, the so-called optional parameters are preferably left at their preset ("default") values. The deviations appearing in the comparison between a given sequence and the above-described sequences of the invention may be caused for instance by addition, deletion, substitution, insertion or recombination. Such a sequence comparison can preferably also be carried out with the program "fasta20u66" (version 2.0u66, September 1998 by William R. Pearson and the University of Virginia; see also W. R. Pearson (1990), Methods in Enzymology 183, 63-98, appended examples and http://workbench.sdsc.edu/). For this purpose, the "default" parameter settings may be used.
[0226] Another indication that two nucleic acid sequences are substantially identical is that the two molecules hybridize to each other under stringent conditions. The phrase: "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially" refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence.
[0227] "Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and Northern hybridizations are sequence dependent, and are different under different environmental parameters. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes part I chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays" Elsevier, New York. Generally, highly stringent hybridization and wash conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Typically, under "stringent conditions" a probe will hybridize to its target subsequence, but to no other sequences.
[0228] The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 42° C., with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.15 M NaCl at 72° C. for about 15 minutes. An example of stringent wash conditions is a 0.2×SSC wash at 65° C. for 15 minutes (see, Sambrook, infra, for a description of SSC buffer). Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is 0.1×SSC at 45° C. for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4-6×SSC at 40° C. for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.0 M Na+ ion, typically about 0.01 to 1.0 M Na+ ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30° C. Stringent conditions can also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2× (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the proteins that they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code.
[0229] A "plant" is any plant at any stage of development, particularly a seed plant.
[0230] A "plant cell" is a structural and physiological unit of a plant, comprising a protoplast and a cell wall. The plant cell may be in form of an isolated single cell or a cultured cell, or as a part of higher organized unit such as, for example, plant tissue, a plant organ, or a whole plant.
[0231] "Plant cell culture" means cultures of plant units such as, for example, protoplasts, cell culture cells, cells in plant tissues, pollen, pollen tubes, ovules, embryo sacs, zygotes and embryos at various stages of development.
[0232] "Plant material" refers to leaves, stems, roots, flowers or flower parts, fruits, pollen, egg cells, zygotes, seeds, cuttings, cell or tissue cultures, or any other part or product of a plant.
[0233] A "plant organ" is a distinct and visibly structured and differentiated part of a plant such as a root, stem, leaf, flower bud, or embryo.
[0234] "Plant tissue" as used herein means a group of plant cells organized into a structural and functional unit. Any tissue of a plant in planta or in culture is included. This term includes, but is not limited to, whole plants, plant organs, plant seeds, tissue culture and any groups of plant cells organized into structural and/or functional units. "Plant tissue" includes differentiated and undifferentiated tissues or plants, including but not limited to roots, stems, shoots, leaves, pollen, seeds, tumor tissue and various forms of cells and culture such as single cells, protoplasts, embryos, and callus tissue. The plant tissue may be in plants or in organ, tissue or cell culture.
[0235] The use of this term in conjunction with, or in the absence of, any specific type of plant tissue as listed above or otherwise embraced by this definition is not intended to be exclusive of any other type of plant tissue.
[0236] The terms "maize", "corn" and "Zea mays" are used herein interchangeably and refer to plants belonging to the genus Zea including, for example, different strains, races or varieties, commercial and non-commercial, of the species Zea mays.
[0237] The present invention relates to a transgenic plant comprising stably integrated in its genome a chimeric polynucleotide construct, particularly a chimeric DNA construct, comprising a protein encoding polynucleotide of interest, particularly a polypeptide or protein encoding polynucleotide of interest, under control of a regulatory nucleotide sequence, at least part of which has a transcription initiation function directing expression of said protein encoding polynucleotide of interest to essentially all tissues of the plant with the exception of the tissues of the male reproductive structures, particularly the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0238] A regulatory nucleotide sequence according to the present invention at least part of which has a transcription initiation function which mediates expression of an operably associated protein encoding polynucleotide of interest in most plant tissues but the tissues of the male reproductive structures, particularly the tissues of the pollen and/or the tassel, may be obtained in an expression profiling experiment to screen for probes that give strong signals in all samples, but only a weak or no signal in the pollen and/or the tassel sample, which is indicative of expression of the respective polynucleotides represented by said probes in most plant tissues and of no or substantially no expression in the tissues of the pollen and/or the tassel. In particular, maize plant tissues and tissues of the reproductive structures, particularly tissues of the pollen and/or the tassel may be screened to identify and obtain a regulatory sequence according to the present invention.
[0239] In particular, samples of all plant tissues, particularly samples of the green tissues and the root of a maize plant, may be directly compared to tissue samples from the male reproductive structures, particularly tissue samples of the pollen and/or the tassel. Probes representing polynucleotides that do not meet the target expression profile are eliminated. Only those probes with the strongest signal across all non-pollen/non-tassel tissues and weak of no signal in pollen and/or the tassel are selected for further analysis that is probes representing polynucleotides that are highly expressed in all tissue samples, but show substantially no expression in pollen and/or the tassel. Said probes may then be aligned with plant cDNA assembly datasets to detect bona fide plant genes, particularly maize genes or putative maize genes.
[0240] The DNA sequence representing probes on the maize chip identified as representing genes that are highly expressed in all tissue samples but essentially not expressed in pollen, particularly probes represented by the DNA sequence as given in SEQ ID NOs: 47 to 56 and those representing genes that are highly expressed in all tissue samples and have essentially no or reduced expression in tassel samples, particularly probes represented by the DNA sequence a given in SEQ ID NOs: 57-79, can easily be extended to designed expression cassettes following the steps outlined in the Examples.
[0241] Probe candidate sequences from the expression profiling analysis for each expression category may be selected and progressed to a finished binary vector with the designed expression cassette linked to a gene of interest such as, for example, a reported gene, i.e., the GUS reporter gene.
[0242] In a first step, each expression cassette is flanked with one or more suitable restriction sites such as, for example, SanDI/RsrII sites and cloned into the vector molecule. The regulatory region including the transcription initiation function typically resides within a fragment of about 1000-1500 bp upstream of the transcription start site and extends into the second exon, or to the natural translation start codon if it is not on the first exon. It typically terminates with the maize optimized Kozak sequence `gtaaaccatgg`. The engineered translation start codon is then embedded in a suitable restriction site such as the NcoI restriction endonuclease site `ccatgg`. All translation start codons in the theoretical transcript that are upstream of the engineered restriction site are eliminated. At least one stop codon should be present in each reading frame upstream of the engineered restriction site. The regulatory region including the transcription initiation function is designed to be flanked by suitable restriction sites such as, for example, XhoI/SanDI sites at the 5'-end and a NcoI site at the 3'-end.
[0243] The Gene Of Interest (GOI) such as the GUS reporter gene is provided as a suitable restriction fragment, in the example given here as a NcoI/SacI fragment. The terminus extends from just after the translation stop codon for about 1 kb downstream. The terminus is designed to be flanked by suitable restriction sites such as, for example, SacI at the 5'-end and RsrII/XmaI at the 3'-end.
[0244] The complete expression cassette is designed to be mobilized as a suitable restriction fragment, such as a SanDI/RsrII fragment, which can be ligated into the corresponding site located on an Agrobacterium binary vector such as the vector given in SEQ ID NO: 80.
[0245] All internal restriction sites used in the cloning steps identified above are mutated by single base substitutions to silence them.
[0246] Through application of these basic steps a plant expression cassette can be designed that corresponds to the respective probe molecules, particularly probe molecules on the maize chip identified as representing genes that are highly expressed in all tissue samples but essentially not expressed in pollen, particularly probes represented by the DNA sequence as given in SEQ ID NOs: 47 to 56 and those identified as representing genes that are highly expressed in all tissue samples and have essentially no or reduced expression in tassel samples, particularly probes represented by the DNA sequence a given in SEQ ID NOs: 57-79. The former is an expression cassette that should be transcribed in all maize tissues and not in pollen. The latter is an expression cassette that should be transcribed in all maize tissues but not or only moderately transcribed in tassels. This design strategy can be applied to all probes identified in an expression profiling experiment.
[0247] In a specific embodiment of the invention, applying the above criteria results in the identification of genes which exhibit the desired expression profile. In particular, a gene is identified which encodes an actin binding protein 3 (ABP3), particularly a actin binding protein 3 of maize (ZmABP3), which is a member of a small gene family that had been previously characterized (Lopez et al., 1996). The gene product has also been called actin depolymerizing factor 3.
[0248] It was shown by southern analysis that there are two ABP3 genes in the maize genome (Lopez et al., 1996), designated herein as ZmABP3-A and ZmABP3-B, respectively. The ZmABP3-A and ZmABP3-B cDNAs encode a protein of 139 amino acids that are identical at all residues, except one. The expression profiling data indicate that ZmABP3-B is highly expressed in most tissues of the plant, but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent, whereas. ZmABP3-A is not as highly expressed.
[0249] A structural analysis of the ZmABP3-B gene reveals that the ZmABP3-B protein coding region is encoded on 3 exons, which are interrupted by two intervening sequences (introns) flanked by the expected GT . . . AG border nucleotides.
[0250] The regulatory sequence is located in the 5'-region of the ABP3 gene immediately upstream of the coding sequence. The size of the regulatory region is in a range of between about 2 kb to 3 kb, particularly between about 2.3 kb and 2.5 kb, and comprises a 5'-non-transcribed sequence, particularly a 5'-non-transcribed sequence of between about 0.9 kb and 1.3 kb, but especial of about 1.1 kb, and a 5'-UTR, particularly between about of 0.1 kb and 0.3 kb, but especially 0.25 kb of the 5'-UTR and all or part of a nucleotide sequence representing ZmABP3-intron 1, particularly a nucleotide sequence of between about 0.7 kb and 1.2 kb, but especially of about 0.98 kb.
[0251] The regulatory sequence according to the invention further comprises part of 3'-sequence that begins just past the ABP3 translation stop codon including transcribed but not translated sequence (UTR) and non-transcribed sequence that functions as the transcriptional terminator and a poly-adenylation signal. In particular, the 3'-sequence is in a range of between about 0.8 kb and about 1.2 kb, particularly between about 0.9 kb and about 1.1 kb, but especially about 1.013 kb. The size of the 3'-UTR is in a range of between about 0.2 kb and about 0.4 kb, but especially about 0.3 kb, and that of the non-transcribed sequence in a range of between about 0.5 kb and about 0.8 kb, but specifically about 0.7 kb. In a specific embodiment of the invention, the regulatory sequence is modified such that the natural translation start codon is silenced in order to move it to the second exon.
[0252] In another embodiment of the invention, candidate probes can be identified on a DNA chip or gene array, particularly a maize DNA chip or gene array such as, for example, the maize Affymetrix® Chip applying the above criteria, which can be used in the identification of genes or putative genes on the maize genome which exhibit the desired expression profile. Two candidate probes were identified which demonstrate virtually no signal in tassel but a high signal in other tissues. This indicates that the gene represented by said candidate probes is not expressed in tassel, but is highly expressed throughout the rest of the plant. The greatest expression differential, 60-fold higher in non-tassel tissue, was observed in candidate probe Zm033444_S_AT. The other candidate probe (Zm040564_X_AT) showed signal variation depending on the development status of the probed plant material, i.e. a low signal in young tassel that gradually increases to a high or strong signal when the plant becomes older. The signal strength between tassel and non-tassel samples differed by less than 10-fold, but the signal strength in non-tassel samples was nearly 10-fold higher as compared to the other candidate probe. The sequence data indicate that neither probe corresponds to a characterized gene. Both probes identify good candidate genes for development of promoters that deliver high expression in non-tassel tissue and little or no expression in tassels. Given the high signal differential between tassel and non-tassel samples, an expression cassette based on probe Zm033444_S_AT was developed.
[0253] Public and proprietary databases can be queried by BLASTN with the candidate probe Zm033444_S_AT sequence to obtain DNA sequence evidence for both transcripts and gDNA corresponding to Zm033444_S_AT. cDNA hits with precise matches to the query sequence fell into two similar contigs. ZmABT1 corresponds to Maize.1482.c47 and Maize.1908.c31, and ZmABT2 corresponds to Maize.1482.c32, Maize.1482.c28, Maize.1482.c53, Maize.1908.c17, Maize.1908.c20, Maize.1908.c37 and A1947567.
[0254] The Zm033444_S_AT, ZmABT1 and ZmABT2 sequences can then be used to query maize genomic DNA sequence databases to identify the regulatory sequence(s) that give high expression in non-tassel tissue and little or no expression in tassels. These queries identified three entries, AZM4--12, ZmGSStuc11-12-04.4740.1 and MAGI--88845, that assemble into a single contig. The ZmABT gDNA sequence is shown in SEQ ID NO: 46. It encodes both the ZmABT1 and ZmABT2 transcript, which suggests that they are alternatively spliced variants of the same transcript.
[0255] ZmABT1 is encoded on 5 exons, and ZmABT2 is encoded on 6 exons. The additional exon lies between exon 1 and exon 2 of ZmABT1. The largest open reading frame on ZmABT1 and ZmABT2 can be used to define their translation start and stop codons and further to define the location of each translation start and stop codon. By this analysis both cDNAs use the same translation start and stop codon.
[0256] In one important aspect of the present invention the regulatory sequence according to the invention can be used in the development of robust expression cassettes that express recombinant genes in most tissues of the plant but not or substantially not in the tissues of the male reproductive structures, particularly the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0257] In a specific embodiment of the invention a regulatory sequence obtainable from a ABP3 gene, more particularly of regulatory sequence obtainable form a Zea mays ABP3 gene, can be used in the development of robust expression cassettes that express recombinant genes in most tissues of the plant but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent.
[0258] The transcription initiation region of the regulatory sequence according to the invention, particularly of regulatory sequence obtainable from a ABP3 gene, more particularly of regulatory sequence obtainable from a Zea mays ABP3 gene can be obtained in a PCR reaction containing a primer pair involving forward primer P1 (5'-atatatgcatgcggcgcgccgaaagtagcaaacaacaggttcatgtgcac-3') as depicted in SEQ ID NO: 1 and reverse primer P2 (5'-tatataccatggtgggtttgcctgcgaccacaagttca-3') as depicted in SEQ ID NO: 2 through amplification from a gDNA template, particularly a maize gDNA template. In a specific embodiment of the invention a thermocycling program is applied involving amplification at about 95° C. for about 15 minutes followed by about 45 cycles at about 94° C. for about 1 minute, at about 64° C. for about 1 minute and at about 72° C. for about 5 minutes. The final extension step is carried out at about 72° C. for about 15 minutes. The reaction product, particularly an about 2.3 kb reaction product, is purified and the DNA extracted using a DNA extraction method known in the art. The DNA is precipitated, recovered and finally cloned into a suitable vector.
[0259] The transcription initiation region according to the invention, particularly a transcription initiation region obtainable from an ABP3 gene, more particularly obtainable from a ZmABP3, may be modified in a series of reactions using at least one of the oligonucleotides selected from the group of oligonucleotides depicted in
TABLE-US-00001 SEQ ID NO: 3 (Patg (5'-cagctcgcccgagttggtaaggccccct-3')), SEQ ID NO: 4 . . . (Pnco (5'-acagattagtccatcgcccacggt-3')), SEQ ID NO: 5 . . . (ADPc-1 (5'-agccctgtccatgacggcccaagcaac-3')), SEQ ID NO: 6 . . . (ADPc-2 (5'-agtagcaattcggtaggcacaggcac-3')), SEQ ID NO: 7 . . . (ADPc-4 (5'-tctatggtctgcgaggtgcggtggc-3')), and SEQ ID NO: 8 . . . (adp3-a (5'-gtccccttcttcgccgcgccagctcgc-3')).
[0260] The terminus of the regulatory sequence according to the invention, particularly a terminal sequence obtainable from an ABP3 gene, more particularly a terminal sequence obtainable from a ZmABP3, can be amplified from a gDNA template, particularly a maize gDNA template, in a DNA polymerase reaction using a forward primer (P3 (5'-tatatagagctcgcatcatgatcatgcatcatggact-3')) as depicted in SEQ ID NO: 9 and a reverse primer (P4 (5'-atatatactagtggcgcgccacactlictgtcgcatgtgatttgca-3')) as depicted in SEQ ID NO: 10. A thermocycling program may be applied comprising a first cycle of about 95° C. for about 5 minutes followed by about 45 cycles of about 94° C. for about 30 seconds, about 50° C. for about 1 minute and about 72° C. for about 4 minutes. The final extension step may be carried out at about 72° C. for about 15 minutes. The about 1 kb reaction product is then purified and the DNA extracted using standard extraction methods. The DNA is precipitated, recovered and cloned into a suitable vector.
[0261] The terminus of the regulatory sequence according to the invention, particularly a terminal sequence obtainable from a ABP3 gene, more particularly a terminal sequence obtainable from a ZmABP3, may be modified to remove an internal restriction site, particularly a NcoI restriction site using a suitable primer pair, particularly primer pair Tnco (5'-Pgtaaaaaaaggtcccttggctcccagaaga-3')/T2 (5'-Pcaatgtgttagactgacgtg-3') as depicted in SEQ ID NO: 11 and SEQ ID NO: 12, respectively, in a DNA polymerase reaction. The thermocycling program employed may comprise a first cycle at about 95° C. for about 5 minutes followed by about 30 cycles of about 95° C. for about 1 minute, about 50° C. for about 1 minute and about 65° C. for about 15 minutes. The product may then be processed and sequenced.
[0262] The present invention is also directed to expression cassettes that incorporate the regulatory mechanisms of a target gene of interest that shows the desired expression profile, that is high expression in most plant tissues but no expression in pollen tissue, particularly an ABP target gene, more particularly of a ZmABP3 target gene, to control in plants the expression of products of nucleic acid molecules of interest in a manner that mimics the expression profile of the original target gene.
[0263] The present invention further includes expression cassettes that incorporate regulatory sequences obtainable from the 5'-region of the target gene, particularly an ABP target gene, more particularly of a ZmABP3 target gene, to express the products of nucleic acid hb molecules of interest in plant tissues but not or substantially not in pollen tissue, The present invention is also directed to expression cassettes incorporating both regulatory sequences obtainable from the 5'-region and the 3'-region of the target gene, particularly an ABP3 target gene, more particularly of a ZmABP3 target gene.
[0264] In another specific embodiment of the invention a regulatory sequence obtainable from maize genomic DNA can be used in the development of robust expression cassettes that transcribe polynucleotides in most tissues of the plant but essentially excluding the tissues of the tassel so that no expression product is present in said tissues to any significant extent.
[0265] An inclusive gene structure-based design strategy may be used to construct such an expression cassette. To incorporate the known alternative splicing of the putative maize gene identified in a method as described above into the expression cassette, the design strategy can be based on the structure of ZmABT1 transcript as shown in SEQ ID NO: 33.
[0266] The transcription initiation region of the regulatory sequence according to the invention, particularly of the ZmABT promoter region can be amplified from a maize gDNA template in a DNA polymerase reaction containing gDNA and a primer pair involving forward primer ABT P1 forw (5'-CGACCAGCGCGACATGCATGGCA-3') as depicted in SEQ ID NO: 19 and ABT P2 rev (5'-ACCCCAGGGCGTACGACAAG-GCC-3') as depicted in SEQ ID NO: 20. In a specific embodiment of the invention a thermocycling program is applied involving amplification at about 95° C. for about 5 minutes followed by about 40 cycles of 94° C. for about 30 seconds, about 67° C. for about 30 seconds and about 72° C. for about 2.5 minutes. The final extension step was done at about 72° C. for about 10 minutes.
[0267] This amplification reaction leads to an amplification product of about 2.6 kb, which can be purified and the DNA extracted using a standard DNA extraction method. The DNA can than be cloned into a suitable vector such as, for example, the pCR-BluntII-TOPO vector.
[0268] The ZmABT promoter can be modified in a series of mutagenesis reactions to silence the endogenous translation start codon, silence a SanDI restriction site and correct point mutations created during amplification. This can be done in a series of reactions using at least one of the oligonucleotides selected from the group of oligonucleotides depicted in
TABLE-US-00002 SEQ ID NO: 21 pABT mut1 (5'-GATGGCCGGATTGGGCTCCCGGGGTGGAG-3') SEQ ID NO: 22 pABT mut2 (5'-CTGGGAGGCGCGCAAGGGGCAGTTCCTCG-3') SEQ ID NO: 23 pABT mut3 (5'-CCCACCGCCGGAGCACCGAAAGGCCCCGCG-3') SEQ ID NO: 24 pABT mut4 (5'-GTCACCCGGGAGCACTTCCCGGCGCCG-3') SEQ ID NO: 25 pABT mut5 (5'-CATTGGGCCGAGCACGGCTTCTTCCGC-3') SEQ ID NO: 26 pABT mut6 (5'-GGGGTACGGTGTTCTTGAGTCGTGAAGCGAC-3')
[0269] The modified ZmABT promoter can the be amplified in another PCR reaction using primers pABT amp1 (5'-GCGTCTAGAGGGACCCCGACCAGCGCGACATGCAT-GGCA-3') as depicted in SEQ ID NO: 27 and pABT amp2 (5'-ACCCCAGGG-CGTACGACAAGGCCCCACCATGGGCGC-3') as depicted in SEQ ID NO: 28. The PCR product can then be purified and the DNA extracted using standard a DNA extraction method. The DNA can be cloned into a suitable vector such as, for example, the pCR-BluntII-TOPO vector, transformed and sequenced. The ZmABT promoter can then be excised, particularly as an XbaI/NcoI fragment and ligated to a suitable expression vector such as, for example, pNOV6901.
[0270] In one embodiment of the invention, an expression cassette is provided comprising a termination sequence which can be obtained form the ZmABT gene identified and described herein above. The ZmABT terminus can be amplified from maize gDNA template in a DNA polymerase reaction containing gDNA and a primer pair involving forward primer ABT P4 (5'-TATATAGAGCTCGAATCGAAGAAGCCACA-CTGTAAATCTGCCGGG-3') as depicted in SEQ ID NO: 29 and reverse primer ABT P5 (5'-AGCAAGGCATATGCAGCAGCTGCTGGTCGGACCGGGCCCTATATA-3') as depicted in SEQ ID NO: 30 resulting in an amplification product of about 1 kb. This reaction product can be purified and the DNA extracted using a standard DNA extraction method. The purified DNA can then be cloned into a suitable vector such as, for example, the pCR4-TOPO-Blunt vector.
[0271] In one embodiment of the invention, the ZmABP3 terminus is modified to remove internal NcoI and XhoI restriction sites. This can be done in a series of reactions using at least one of the oligonucleotides selected from the group of oligonucleotides depicted in.
TABLE-US-00003 SEQ ID NO: 31 ABTt m1 (5'-GTCATGCATGGGCATGTGAAGGAGGAGCC-3') SEQ ID NO: 32 ABTt m2 (5'-GTTGCATGCATGCTGCATGGCGTCGAGAT-3')
[0272] The amplification product can then be processed and sequenced to result in a terminator sequence as shown in SEQ ID NO: 36.
[0273] In one embodiment of the invention, an expression cassette is provided that express recombinant genes in most tissues of the plant but essentially excluding the tissues of the tassel so that no expression product is present in said tissues to any significant extent, comprising both a regulatory sequence at least part of which has a transcription initiation function and a regulatory sequence at least part of which has a termination function, which regulatory sequences can be obtained form the ZmABT gene identified and described herein above.
[0274] In one embodiment of the invention such an expression cassette can be obtained by excising the ZmABT terminus excised and ligating it into a suitable vector already comprising a regulatory sequence at least part of which has a transcription initiation function, particularly the sequence of the ZmABT promoter such as, for example, the pNOV6901-prABT vector as described above.
[0275] In one embodiment, the expression cassette according to the invention comprises a regulatory nucleotide sequence comprising approximately 2.6 kb of the 5'-sequence, which consists of approximately 2 kb of 5'-non-transcribed sequence, and about 12 bp of 5'-UTR, approximately 0.6 kb representing exon 1, intron 1 and about 16 bp of exon 2; and approximately 1 kb of the 3'-sequence that begins just past the translation stop codon and includes approximately 0.6 kb of 3'-UTR and about 0.4 kb of non-transcribed sequence, and functions as the transcriptional terminator and poly-adenylation signal.
[0276] In one embodiment, an expression cassette according to the invention is provided wherein the natural translation start codon is silenced and moved to the second exon
[0277] The complete expression cassette can then be mobilized into a suitable vector for plant transformation and expression such as, for example, an Agrobacterium binary vector, particularly Agrobacterium binary vector 15289.
[0278] The nucleic acid segment of interest can, for example, code for a ribosomal RNA, an antisense RNA or any other type of RNA that is not translated into protein. In another preferred embodiment of the invention, the nucleic acid segment of interest is translated into a protein product. The nucleotide sequence which directs transcription and/or the nucleic acid segment may be of homologous or heterologous origin with respect to the plant to be transformed. A recombinant DNA molecule useful for introduction into plant cells includes that which has been derived or isolated from any source that may be subsequently characterized as to structure size and/or function, chemically altered, and later introduced into plants. Therefore a useful nucleotide sequence, segment or fragment of interest includes completely synthetic DNA, semi-synthetic DNA, DNA isolated from biological sources, etc. Generally, the introduced DNA is not originally resident in the plant genotype which is the recipient of the DNA, but it is within the scope of the invention to isolate a gene from a given plant genotype, and to subsequently introduce multiple copies of the gene into the same genotype, e.g., to enhance production of a given gene product such as a storage protein or a protein that is involved in carbohydrate metabolism or any other gene of interest as provided in the SEQ ID NOs of the sequence listing.
[0279] The introduced recombinant DNA molecule includes but is not limited to, DNA from plant genes, and non-plant genes such as those from bacteria, yeasts, animals or viruses. The introduced DNA can include modified genes, portions of genes, or chimeric genes, including genes from the same or different genotype. The term "chimeric gene" or "chimeric DNA" is defined as a gene or DNA sequence or segment comprising at least two DNA sequences or segments from species which do not combine DNA under natural conditions, or which DNA sequences or segments are positioned or linked in a manner which does not normally occur in the native genome of untransformed plant.
[0280] The introduced recombinant DNA molecule used for transformation herein may be circular or linear, double-stranded or single-stranded. Generally, the DNA is in the form of chimeric DNA, such as plasmid DNA,
[0281] In one embodiment, the regulatory sequences may be operably associated with an expressible polynucleotide of interest. The expressible polynucleotide may encode a polypeptide or protein of interest.
[0282] Such a polypeptide or protein of interest may be one exhibiting a certain biological activity such as, for example, an insecticidal, herbicidal or fungicidal activity or may contribute of an improved performance of a crop plant of agronomic interest in form of improved yield, quality, lodging, biotic and abiotic stress resistance, flowering control, etc.
[0283] In one embodiment, the concentration of the polypeptide product expressed from the protein encoding polynucleotide of interest in the tissues of the reproductive structures, particularly in the tissues of the pollen and/or the tassel, is such that no insecticidal activity can be detected in a standard insect feeding assay. In particular, the concentration of the expression product in the tissues of the male reproductive structures, particularly in the tissues of the pollen and/or the tassel, is below a basic level of about 10 ng/mg soluble protein, particularly of about 5 ng/mg soluble protein, more particularly of about 3 ng/mg soluble protein, but especially of about 2 ng/mg soluble protein or below.
[0284] In one specific embodiment of the invention, the polypeptide or protein of interest is an insecticidally active protein or polypeptide, particularly an insecticidally active protein or polypeptide obtainable from Bacillus thuringiensis, more particularly a Bacillus thuringiensis endotoxin such as, for example, cryIA(b) endotoxin. Other endotoxins known to occur in Bacillus thuringiensis may likewise be used in association with the regulatory sequence according to the invention to obtain toxin expression in most plant tissues except pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0285] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding polynucleotide of interest encodes an endotoxin of Bacillus thuringiensis which has at least 80% sequence identity, particularly at least 85% sequence identity, particularly at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity with a nucleotide sequence as depicted in SEQ ID NO:15.
[0286] In one embodiment, a transgenic plant according to the invention and as described herein is provided, wherein the polypeptide or protein encoding polynucleotide of interest encodes an endotoxin of Bacillus thuringiensis which has the nucleotide sequence as depicted in SEQ ID NO: 15.
[0287] Once completed, the expression cassette may be mobilized into a suitable vector for plant transformation, such as, for example, a binary vector, which may then be mobilized to maize via Agrobacterium-mediated transformation.
[0288] Transgenic plants (or plant cells, or plant explants, or plant tissues) incorporating the polynucleotides of the invention and/or expressing a polypeptide of interest such as, for example, a toxin protein of B. thuringiensis, can be produced by a variety of well established techniques. Following construction of an expression cassette and a vector incorporating the regulatory polynucleotide sequence according to the invention and as described herein, standard techniques can be used to introduce the polynucleotide into a plant, a plant cell, a plant explant or a plant tissue of interest. Optionally, the plant cell, explant or tissue can be regenerated to produce a transgenic plant. The plant can be any higher plant, including gymnosperms, monocotyledonous and dicotyledonous plants. Suitable protocols are available for Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery, parsnip), Cruciferae (cabbage, radish, rapeseed, broccoli, etc.), Curcurbitaceae (melons and cucumber), Gramineae (wheat, corn, rice, barley, millet, etc.), Solanaceae (potato, tomato, tobacco, peppers, etc.), and various other crops. See protocols described in Ammirato et al., eds., (1984) Handbook of Plant Cell Culture--Crop Species, Macmillan Publ. Co., New York, N.Y.; Shimamoto et al. (1989) Nature 338: 274 276; Fromm et al. (1990) Bio/Technol. 8: 833 839; and Vasil et al. (1990) Bio/Technol. 8: 429 434. Transformation and regeneration of both monocotyledonous and dicotyledonous plant cells is now routine, and the selection of the most appropriate transformation technique will be determined by the practitioner. The choice of method will vary with the type of plant to be transformed; those skilled in the art will recognize the suitability of particular methods for given plant types. Suitable methods can include, but are not limited to: electroporation of plant protoplasts; liposome-mediated transformation; polyethylene glycol (PEG) mediated transformation; transformation using viruses; micro-injection of plant cells; micro-projectile bombardment of plant cells; vacuum infiltration; and Agrobacterium tumefaciens mediated transformation.
[0289] Transformation of plants can be undertaken with a single DNA molecule or multiple DNA molecules (i.e., co-transformation), and both these techniques are suitable for use with the expression cassettes of the present invention. Numerous transformation vectors are available for plant transformation, and the expression cassettes of this invention can be used in conjunction with any such vectors. The selection of vector will depend upon the preferred transformation technique and the target species for transformation.
[0290] A variety of techniques are available and known to those skilled in the art for introduction of constructs into a plant cell host. These techniques generally include transformation with DNA employing A. tumefaciens or A. rhizogenes as the transforming agent, liposomes, PEG precipitation, electroporation, DNA injection, direct DNA uptake, microprojectile bombardment, particle acceleration, and the like (See, for example, EP 295959 and EP 138341) (see below). However, cells other than plant cells may be transformed with the expression cassettes of the invention. The general descriptions of plant expression vectors and reporter genes, and Agrobacterium and Agrobacterium-mediated gene transfer, can be found in Gruber et al. (1993).
[0291] Expression vectors containing the regulatory polynucleotide sequence according to the invention can be introduced into protoplasts or into intact tissues or isolated cells. Preferably expression vectors are introduced into intact tissue. General methods of culturing plant tissues are provided for example by Maki et al., (1993); and by Phillips et al. (1988). Preferably, expression vectors are introduced into maize or other plant tissues using a direct gene transfer method such as microprojectile-mediated delivery, DNA injection, electroporation and the like. More preferably expression vectors are introduced into plant tissues using the microprojectile media delivery with the biolistic device. See, for example, Tomes et al. (1995). The vectors of the invention can not only be used for expression of structural genes but may also be used in exon-trap cloning, or promoter trap procedures to detect differential gene expression in varieties of tissues, (Lindsey et al., 1993; Auch & Reth et al.).
[0292] It is particularly preferred to use the binary type vectors of Ti and Ri plasmids of Agrobacterium spp. Ti-derived vectors transform a wide variety of higher plants, including monocotyledonous and dicotyledonous plants, such as soybean, cotton, rape, tobacco, and rice (Pacciotti et al., 1985: Byrne et al., 1987; Sukhapinda et al., 1987; Lorz et al., 1985; Potrykus, 1985; Park et al., 1985: Hiei et al., 1994). The use of T-DNA to transform plant cells has received extensive study and is amply described (EP 120516; Hoekema, 1985; Knauf, et al., 1983; and An et al., 1985). For introduction into plants, the chimeric genes of the invention can be inserted into binary vectors as described in the examples.
[0293] Those skilled in the art will appreciate that the choice of method might depend on the type of plant, i.e., monocotyledonous or dicotyledonous, targeted for transformation. Suitable methods of transforming plant cells include, but are not limited to, microinjection (Crossway et al., 1986), electroporation (Riggs et al., 1986), Agrobacterium-mediated transformation (Hinchee et al., 1988), direct gene transfer (Paszkowski et al., 1984), and ballistic particle acceleration using devices available from Agracetus, Inc., Madison, Wis. and BioRad, Hercules, Calif. (see, for example, Sanford et al., U.S. Pat. No. 4,945,050; and McCabe et al., 1988). Also see, Weissinger et al., 1988; Sanford et al., 1987 (onion); Christou et al., 1988 (soybean); McCabe et al., 1988 (soybean); Datta et al., 1990 (rice); Klein et al., 1988 (maize); Klein et al., 1988 (maize); Klein et al., 1988 (maize); Fromm et al., 1990 (maize); and Gordon-Kamm et al., 1990 (maize); Svab et al., 1990 (tobacco chloroplast); Koziel et al., 1993 (maize); Shimamoto et al., 1989 (rice); Christou et al., 1991 (rice); European Patent Application EP 0 332 581 (orchardgrass and other Pooideae); Vasil et al., 1993 (wheat); Weeks et al., 1993 (wheat). In one embodiment, the protoplast transformation method for maize is employed (European Patent Application EP 0 292 435, U.S. Pat. No. 5,350,689).
[0294] In another embodiment, a nucleotide sequence of the present invention is directly transformed into the plastid genome. Plastid transformation technology is extensively described in U.S. Pat. Nos. 5,451,513, 5,545,817, and 5,545,818, in PCT application no. WO 95/16783, and in McBride et al., 1994. Following transformation, plants are preferably selected using a dominant selectable marker incorporated into the transformation vector. Typically, such a marker will confer antibiotic or herbicide resistance on the transformed plants and selection of transformants can be accomplished by exposing the plants to appropriate concentrations of the antibiotic or herbicide.
[0295] After transformed plant cells or plants are selected and grown to maturity, those plants showing the trait of interest are identified. The trait can be any of those traits described above. Additionally, to confirm that the trait of interest is due to the expression of the introduced polynucleotide of interest under control of the regulatory nucleotide according to the invention, expression levels or activity of the polypeptide or polynucleotide of interest can be determined by analyzing mRNA expression using Northern blots, RT-PCR or microarrays, or protein expression using immunoblots or Western blots or enzyme activity assays.
[0296] The invention thus relates to plant cells and tissues, to plants derived from such cells and tissues, respectively, to plant material, to the progeny and to seeds derived from such plants, and to agricultural products including processed plant products with improved properties obtainable by, for example, any one of the transformation methods described below.
[0297] Once an expression cassette according the present invention and as described herein comprising a regulatory sequence according to the invention in association with a polynucleotide of interest has been transformed into a particular plant species, it may be propagated in that species or moved into other varieties of the same species, particularly including commercial varieties, using traditional breeding techniques. Preferred plants of the invention include gymnosperms, monocots, and dicots, especially agronomically important crop plants, such as rice, wheat, barley, rye, rape, corn, potato, carrot, sweet potato, sugar beet, bean, pea, chicory, lettuce, cabbage, cauliflower, broccoli, turnip, radish, spinach, asparagus, onion, garlic, eggplant, pepper, celery, carrot, squash, pumpkin, zucchini, cucumber, apple, pear, quince, melon, plum, cherry, peach, nectarine, apricot, strawberry, grape, raspberry, blackberry, pineapple, avocado, papaya, mango, banana, soybean, tobacco, tomato, sorghum and sugarcane.
[0298] The genetic properties engineered into the transgenic plants described above are passed on by sexual reproduction or vegetative growth and can thus be maintained and propagated in progeny plants. Generally said maintenance and propagation make use of known agricultural methods developed to fit specific purposes such as tilling, sowing or harvesting. Specialized processes such as hydroponics or greenhouse technologies can also be applied. Use of the advantageous genetic properties of the transgenic plants according to the invention can further be made in plant breeding that aims at the development of plants with improved properties such as tolerance to pests, herbicides, or stress, improved nutritional value, increased yield, or improved structure causing less loss from lodging or shattering. The various breeding steps are characterized by well-defined human intervention such as selecting the lines to be crossed, directing pollination of the parental lines, or selecting appropriate progeny plants. Depending on the desired properties different breeding measures are taken. The relevant techniques are well known in the art and include but are not limited to hybridization, inbreeding, backcross breeding, multiline breeding, variety blend, interspecific hybridization, aneuploid techniques, etc. Hybridization techniques also include the sterilization of plants to yield male or female sterile plants by mechanical, chemical or biochemical means. Cross pollination of a male sterile plant with pollen of a different line assures that the genome of the male sterile but female fertile plant will uniformly obtain properties of both parental lines. Thus, the transgenic plants according to the invention can be used for the breeding of improved plant lines that for example increase the effectiveness of conventional methods such as herbicide or pesticide treatment or allow to dispense with said methods due to their modified genetic properties. Alternatively new crops with improved stress tolerance can be obtained that, due to their optimized genetic "equipment", yield harvested product of better quality than products that were not able to tolerate comparable adverse developmental conditions.
[0299] In one embodiment of the invention, the plant has been transformed with and expresses a polypeptide or protein encoding nucleotide sequence encoding a polypeptide product exhibiting an insecticidal activity, particularly an endotoxin of Bacillus thuringiensis in most tissues of the plant but essentially excluding the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent, where the nucleotide sequence is not transcribed to any significant extent. Therefore, essentially no expression occurs in the pollen and/or the tassel tissue and only residual amounts of the expression product, if any, can be detected in said tissues, which is not sufficient for the expression product to fulfil its envisaged biological function in said tissues or to exhibit any toxic effects either towards insects feeding on these tissues or the plant itself.
[0300] In particular, the concentration of the polypeptide product expressed from the protein encoding polynucleotide of interest in the tissues of the pollen and/or the tassel is such that no insecticidal activity can be detected in a standard insect feeding assay. In one embodiment of the invention, the concentration of the expression product in pollen is below a basic level of about 10 ng/mg soluble protein, particularly of about 5 ng/mg soluble protein, more particularly of about 3 ng/mg soluble protein, but especially of about 2 ng/mg soluble protein or below.
[0301] The invention also provides methods for preparing expression cassettes comprising the regulatory sequence according to the invention comprising linking an expressible polynucleotide encoding a polypeptide or a protein of interest with the regulatory sequence according to the invention and as described herein to obtain an expression construct, wherein the polynucleotide of interest is operably linked or associated with the regulatory sequence such that expression of the polypeptide or a protein of interest is mediated by the regulatory sequence according to the invention and results in the expression of said polypeptide or a protein of interest in essentially all plant tissues, but essentially excludes expression in the tissues of the reproductive structures of the plant, particularly in the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent.
[0302] In one embodiment, the invention relates to a method of producing a transgenic plant expressing a DNA sequence of interest in non-pollen tissue but not or substantially not in the tissues of the pollen and/or the tassel, comprising
[0303] a) transforming an expression cassette according to the invention and as described herein into a plant cell which comprises a regulatory nucleotide sequence, at least part of which has a transcription initiation function which mediates expression of an operably associated protein encoding polynucleotide of interest in most plant tissues but essentially excluding the tissues of the pollen and/or the tassel so that no expression product is present in said tissues to any significant extent; and
[0304] b) regenerating the plant cell transformed in step a) into a plant.
[0305] In one embodiment, the invention relates to a method of controlling insect target-pests feeding on vegetative plant tissues such as the leaf, stalk and root and/or on reproductive tissues such as the ear, but protecting non-target pests feeding on pollen comprising
[0306] a) growing a plant according to the invention and as described herein in an area that is infested with the target pest;
[0307] b) expressing a polypeptide or protein that is capable of controlling said target pest under the control of a regulatory sequence according to the invention and as described herein.
[0308] In one embodiment, the invention relates to a method of protecting the reproductive tissues of a plant, particularly the tissues of the pollen and/or the tassel against damage caused by expression in said tissues of a polypeptide or protein of interest comprising
[0309] a) growing a plant according to the invention and as described herein;
[0310] b) expressing a polypeptide or protein of interest under the control of a regulatory sequence according to the invention and as described herein.
[0311] In one embodiment the present invention relates to the use of a regulatory sequence according to the present invention and as disclosed herein for controlling insect target-pests feeding on vegetative plant tissues such as the leaf, stalk and root and/or on reproductive tissues such as the ear, but protecting non-target pests feeding on pollen comprising
[0312] a) growing a plant according to the invention and as described herein in an area that is infested with the target pest;
[0313] b) expressing a polypeptide or protein that is capable of controlling said target pest under the control of a regulatory sequence according to the invention and as described herein.
[0314] In one embodiment the present invention relates to the use of a regulatory sequence according to the present invention and as disclosed herein for protecting the reproductive tissues of a plant, particularly the tissues of the pollen and/or the tassel against damage caused by expression in said tissues of a polypeptide or protein of interest comprising expressing said polypeptide or protein of interest under the control of a regulatory sequence according to the invention and as described herein.
EXAMPLE
[0315] The following Examples provide illustrative embodiments. In light of the present disclosure and the general level of skill in the art, those of skill will appreciate that the following Examples are intended to be exemplary only and that numerous changes, modifications, and alterations can be employed without departing from the scope of the presently claimed subject matter.
[0316] All manipulations and techniques necessary to construct and propagate strains described in this invention are known to those skilled in the art. Technical details are described e.g. in Ausubel et al 1995; Sambrook, J, 2001 and Miller, J. H. 1992 and in relevant publications cited within this invention.
Example 1
Non-Pollen Expression
Example 1.1
Identification of ZmABP3
[0317] In an expression profiling experiment a maize developmental series was queried on a Zea mays (Zm80K) Affymetrix chip for probes that gave strong signals in all samples, but not or substantially not in the pollen sample. All the green tissue and root samples were directly compared to pollen, and probes representing polynucleotides that did not meet the target expression profile were eliminated. The analysis produced two sets of results. The first set contains 36 probes representing polynucleotides that were highly expressed in all the tissue samples, but very low in pollen. The second set contains 10 probes represented polynucleotides that are highly expressed in all tissue samples, but gave no signal in pollen. Alignment of probe sequence with maize cDNA assembly datasets showed that all 46 probes represent bona fide maize genes. The top 10 probes are those with the strongest signal across all non-pollen tissues and no signal in pollen (see Table A).
[0318] Applying further criteria including determination of the availability of genomic DNA (gDNA) and cDNA sequence for each lead produced Zm07728_s_at as the top candidate that met all promoter development requirements. Literature analysis revealed that this probe represents the gene encoding actin binding protein 3 (ZmABP3) which is a member of a small gene family that had been previously characterized (Lopez et al., 1996). The gene product has also been called actin depolymerizing factor 3. Lopez et al (1996) confirms in FIG. 3 that ZmABP3 is highly expressed in most tissues of the plant examined, except pollen samples.
[0319] Lopez et al (1996) also show by southern analysis that there are two ABP3 genes in the maize genome. The ZmABP3 cDNA they report is GenBank Accession X97726, and it corresponds to the TIGR Accession TC248585. This gene was designated ZmABP3-A. Both ZmABP3 genes are represented on the maize (Zm80K) Affymetrix Chip: ZmABP3-A corresponds to probe Zm007595_at and ZmABP3-B corresponds to Zm07728_s_at. The `Zm07728_s_at` sequence was used to identify the TC248588 in the TIGR database, and MAIZE.974.CB1 in a maize cDNA assembly database. It also identified the MAGI--93606, MAGI--93607, AZM4--39177, ZmGSStuc11-12-04.2725.1, ZmGSStuc11-12-04.2725.2 and CC463190 gDNA sequences. The ZmABP3-A and ZmABP3-B cDNAs encode proteins that are identical at all residues, except one. The expression profiling data indicate that ZmABP3-B is highly expressed in most tissues of the plant, but essentially excluding the tissues of the pollen so that no expression product is present in said tissues to any significant extent. ZmABP3-A is not as highly expressed.
[0320] SEQ ID NO: 16 show that the ZmABP3-B mRNA is encoded on 3 exons. The two intervening sequences (introns) are bracketed by the expected GT . . . AG border nucleotides.
[0321] More specifically, SEQ ID NO: 16 discloses the design of the ZmABP3 expression cassette. The ZmABP3 regulatory components to be included in the construct are 2.3 kb of 5'-sequence (prZmABP3-01) which contains 1.1 kb of 5'-non-transcribed sequence, 0.25 kb of 5'-UTR and 0.98 kb representing ZmABP3-B-intron 1; and 1.013 kb of 3'-sequence (tZmZBP3-01) that begins just past the ABP3-B translation stop codon. This includes about 0.3 kb of 3'-UTR and 0.7 kb of non-transcribed sequence.
[0322] Table A shows a summary of the to 10 candidate probes representing polynucleotides with a high expression level in all maize tissues and no expression signal in pollen
TABLE-US-00004 Average Pollen Expression Zea mays Probe Name Description of Reference Gene Expression (all tissues) TIGR Hit AF032370_at "Zea mays profilin (PRO4) mRNA, complete cds." absent 4208 TC269677 Ctrl_ZmU45855-3_at From 808 to 1307 of glyceraldehyde-3-phosphate absent 4275 TC269361 dehydrogenase GAPC2 (gpc2) mRNA, complete cds. Zm001747_s_at Similar to CAA63903.1 Pennisetum glaucum; heat shock absent 4945 TC268849 protein 17.9; P. glaucum mRNA for heat shock protein, HSP 17.9 Zm005803_s_at "Similar to AAB99745.1 Triticum aestivum; HSP70; absent 4091 TC247918 Triticum aestivum 70 kDa heat shock protein (TaHSP70d) mRNA, complete cds; 70 kDa heat shock protein, molecular chaperone" Zm007728_s_at Similar to SW:ADF3_MAIZE Q41764 zea mays (maize). absent 4805 TC248588 actin-depolymerizing factor 3 (adf 3) (zmabp3) (zmadf3). Zm009722_s_at "Similar to BAC22420.1 Oryza sativa (japonica cultivar- absent 3306 TC248975 group); ; Oryza sativa (japonica cultivar-group) genomic DNA, chromosome 7, PAC clone: P0453E03; contains ESTs C96778(C10671), D22278(C10671) unknown Zm015335_s_at Similar to SW:RS5A_ARATH Q9zut9 arabidopsis thaliana absent 3598 TC269022 (mouse-ear cress). 40s ribosomal protein s5-1. February 2003 Zm021004_s_at "Similar to AAD39835.1 Arabidopsis thaliana; Ran-binding absent 3092 TC259986 protein siRanBP; Arabidopsis thaliana Ran-binding protein (siRanBP) mRNA, complete cds; atranbp1a homolog" Zm058948_s_at No Description absent 4337 TC270333 Zm061393_s_a No Description = sucrose synthase absent 6509 TC258905
Example 1.2
Cry1AbG6 Construction
[0323] Cry1AbG6 (2814 bp) is a modified version of the full-length Cry1Ab (pNOV1321, 3546 bp) gene. The Geiser sequence (81 bp from 4398-4478 in pNOV1321) and the 3'-end (651 bp from 4908-5558 in pNOV1321) were deleted.
[0324] The Cry1AbG6 sequence was constructed from pNOV1321 (source vector for the Cry1Ab full-length gene) as follows: pNOV1321 plasmid DNA was cut with BamHI/SacI. The Cry1Ab full-length gene (3546 bp, named Michigan) was gel purified and ligated to pTrcHisB expression vector (In vitrogen life technologies, Cat# V36020), which was cut with BamHI/SacI. This construct was named as Michigan-pTrcHisB. The Geiser sequence (81 bp) was deleted from Michigan-pTrcHisB by overlapping PCR with the following primers:
TABLE-US-00005 5' Bfr1 (SEQ ID NO: 83) (5'-cctggtggagtgcttaagcgacgagttctgcctgg-3'), 3' Xba1 (SEQ ID NO: 84) (5'-gggcttctcctccaggaactctagattgcccaggcg-3'), 5'Gfix (SEQ ID NO: 85) (5'-catcggcaagtgccaccacagccaccacttcagcctg-3') and 3'Gfix (SEQ ID NO: 86) (5'-gctgtggtggcacttgccgatggggctggg-3').
[0325] PCR product A was made using high-fidelity PCR with Michigan-pTrcHisB as a template, and the 5' Bfr1 and 3' Gfix primers. PCR product B was made using high-fidelity PCR with Michigan-pTrcHisB as a template, and the 5'Gfix and 3'Xba1 primers. The final PCR used products A and B as templates, and the 5'Bfr1 and 3'Xba1 primers. The final PCR band was digested with AfIII/XbaI and gel-purified. This fragment was ligated to Michigan-pTrcHisB that had also been digested with XbaI/AfIII. The correct recombinant DNA product was identified by AfIII/XbaI digestion analysis. This construct was named as Cry1Ab-G.
[0326] A second PCR product was made by high-fidelity PCR using pNOV1321 as a template, the 5'1Ab5XbaI (5'-gcccgcctgggcaatctagagttcctggaggag-3') primer depicted in SEQ ID NO: 87, and the 3'1Ab3d6 (5'-gcgagctcctagatgcggccctcga-gttcctcgaaga-3') primer depicted in SEQ ID NO: 88. The PCR product was digested with XbaI/SacI then ligated to Cry1Ab-G that was also digested with XbaI/SacI. The correct recombinant DNA product was identified using BamHI/SacI restriction analysis. This construct was named as Cry1AbG6.
[0327] The Cry1AbG6 sequence was subjected to QuikChange mutagenesis to remove an internal NcoI site. The 25 μL reaction contained
1 μL Cry1AbG6 template, 2.5 μL 10× QuikChange buffer, 1 μL QuikChange dNTP mix, 1 μL of 20 μM cy2' (5'-Pccctgtacggcacgatgggcaacgctgca-3'; SEQ ID NO: 89), 0.75 μL Quik solution and 1 μL QuikChange DNA polymerase.
[0328] The thermocycling program was 95° C. for 5 minutes followed by 30 cycles of 95° C. for 1 minute, 55° C. for 1 minute and 65° C. for 20 minutes. The product was processed as described by the manufacturer (Stratagene) and completely sequenced.
[0329] The Cry1AbG6 coding sequence was amplified from the mutagenized plasmid template, above, in a 50 μL Pfu turbo (Stratagene) DNA polymerase reaction containing
5 μL template, 5 μL 10×Pfu buffer, 1 μL 10 mM dNTP mix, 1 μL of 20 μM cy1 (5'-atatatccaccatggacaacaaccccaaca-3'; SEQ ID NO: 90), 1 μL of 20 μM cy2 (5'-tatatagagctcctagatgcggccctcgagt-3'; SEQ ID NO: 91) and 1 μL Pfu turbo DNA polymerase.
[0330] The thermocycling program was 95° C. for 2 minutes followed by 40 cycles of 95° C. for 1 minute, 50° C. for 1 minute and 72° C. for 7 minutes. The final extension step was 72° C. for 15 minutes. The 2.8 kb reaction product was gel-purified on 1% TAE agarose, and the DNA was extracted using Qiaprep DNA extraction method. The recovered DNA was digested with NcoI/SacI, then ligated to pNOV6901 vector that was also digested with NcoI/SacI. This operation replaced the GUS coding sequence in pNOV6901 with Cry1AbG6. The Cry1AbG6 sequence is given in SEQ ID NO: 15.
Example 1.3
Construction of the ZmABP3 Expression Cassette
[0331] An inclusive design strategy was used to develop the ZmABP3 expression cassette. The cassette contains 2.3 kb of 5'-sequence which consists of 1.1 kb of 5'-non-transcribed sequence, 0.25 kb of 5'-UTR and 0.98 kb representing ZmABP3-intron 1. The natural translation start codon was silenced in order to move it to the second exon. The expression cassette also contains 1.013 kb of 3'-sequence that begins just past the ABP3 translation stop codon. This includes about 0.3 kb of 3'-UTR and 0.7 kb of non-transcribed sequence, and functions as the transcriptional terminator and poly-adenylation signal.
[0332] The ZmABP3 terminus was amplified from maize gDNA template in a 50 μL Proofstart (Qiagen) DNA polymerase reaction containing
10 μg gDNA, 5 μL 10× Proofstart buffer, 1.5 μL 10 mM dNTP mix, 2.5 μL of 20 μM P3 (5'-tatatagagctcgcatcatgatcatgcatcatggact-3'; SEQ ID NO: 9), 2.5 μL of 20 μM P4 (5'-atatatactagtggcgcgccacactlictgtcgcatgtgatttgca-3'; SEQ ID NO: 10), 10 μL Q solution and 2 μL Proofstart DNA polymerase.
[0333] The thermocycling program was 95° C. for 5 minutes followed by 45 cycles of 94° C. for 30 seconds, 50° C. for 1 minute and 72° C. for 4 minutes. The final extension step was 72° C. for 15 minutes. The 1 kb reaction product was gel-purified on 1% TAE agarose, and the DNA was extracted using Qiaprep DNA extraction method. The DNA was ethanol precipitated and recovered in 4 μL ddH2O, then cloned into the pCR4-TOPO-Blunt vector.
[0334] The ZmABP3 terminus was modified to remove an internal NcoI restriction site using the Stratagene QuikChange Multi-site mutagenesis kit. The 25 μL reaction contained
1 μL pCR4-TOPO-ZmABP3-terminus, 2.5 μL 10× QuikChange buffer, 1 μL QuikChange dNTP mix,
1 μL of 20 μM Tnco (5'-Pgtaaaaaaaggtcccttggctcccagaaga-3'; SEQ ID NO: 11),
1 μL of 20 μM T2 (5'-Pcaatgtgttagactgacgtg-3'; SEQ ID NO: 12),
[0335] 0.75 μL Quik solution and 1 μL QuikChange DNA polymerase.
[0336] The thermocycling program was 95° C. for 5 minutes followed by 30 cycles of 95° C. for 1 minute, 50° C. for 1 minute and 65° C. for 15 minutes. The product was processed as described by the manufacturer (Stratagene) and completely sequenced. The ZmABP3-terminus sequence is shown in SEQ ID NO: 14.
[0337] The ZmABP3 promoter was amplified from maize gDNA template in a 50 μL Hotstart (Qiagen) DNA polymerase reaction containing
10 μg gDNA,
25 μL 2× Hotstart Master Mix,
[0338] 1.25 μL of 20 μM P1 (5'-atatatgcatgcggcgcgccgaaagtagcaaacaacaggttcatgtgcac-3'; SEQ ID NO: 1), 1.25 μL of 20 μM P2 (5'-tatataccatggtgggtttgcctgcgaccacaagttca-3'; SEQ ID NO: 2), 10.5 μL Q solution and
2 μL 25 mM MgCl2.
[0339] The thermocycling program was 95° C. for 15 minutes followed by 45 cycles of 94° C. for 1 minute, 64° C. for 1 minute and 72° C. for 5 minutes. The final extension step was 72° C. for 15 minutes. The 2.3 kb reaction product was gel-purified on 1% TAE agarose, and the DNA was extracted using Qiaprep DNA extraction method. The DNA was ethanol precipitated and recovered in 4 μL ddH2O, then cloned into the pCR4-TOPO vector.
[0340] The ZmABP3 promoter was modified in a series of QuikChange reactions as outlined above using the following oligonucleotides:
TABLE-US-00006 Patg (5'-cagctcgcccgagttggtaaggccccct-3';, SEQ ID NO: 3) Pnco (5'-acagattagtccatcgcccacggt-3';, SEQ ID NO: 4) ADPc-1 (5'-agccctgtccatgacggcccaagcaac-3';, SEQ ID NO: 5) ADPc-2 (5'-agtagcaattcggtaggcacaggcac-3';, SEQ ID NO: 6) ADPc-4 (5'-tctatggtctgcgaggtgcggtggc-3';, SEQ ID NO: 7) and adp3-a (5'-gtccccttcttcgccgcgccagctcgc-3';. SEQ ID NO: 8)
[0341] The ZmABP3 promoter sequence is shown in SEQ ID NO: 13.
[0342] The ZmABP3 terminus was ligated to the pNOV6901-Cry1AbG6 vector (from Example 2) as a SacI/SpeI fragment. The ZmABP3 Promoter was subsequently ligated to the vector as a SphI/NcoI fragment. This produced ZmABP3-Cry1AbG6-assembly, shown in SEQ ID NO: 37. The complete ZmABP3-Cry1AbG6 expression cassette was mobilized into a binary vector, pNOV6900, as an AscI fragment. These constructs, ZmABP3-Cry1AbG6-6900 and enhanced ZmABP3-Cry1AbG6-binary, are shown in SEQ ID NOS: 38 and 39, respectively. The only difference between these vectors is the presence of the CaMV-FMV dual enhancer in enhanced ZmABP3-Cry1AbG6-binary. Both were mobilized to maize via Agrobacterium-mediated transformation.
Example 1.4
Construction of ZmABP3-AmCyan
[0343] The Cry1AbG6 coding sequence was excised from ZmABP3-Cry1AbG6-assembly as an NcoI/SacI fragment. It was replaced with the AmCyan reporter gene coding sequence that was excised from plasmid 13718 as an NcoI/SacI fragment. This produced the ZmABP3-AmCyan-assembly construct shown in SEQ ID NO: 40. The ZmABP3-AmCyan expression cassette was mobilized into a binary vector, pNOV6900, as an AscI fragment. This construct, ZmABP3-AmCyan-binary, is shown in SEQ ID NO: 41. It was mobilized to maize via Agrobacterium-mediated transformation.
Example 1.5
Expression from ZmABP3-AmCyan in Transgenic Maize
[0344] Several transgenic maize events containing the ZmABP3-AmCyan expression cassette were produced. Those containing a single-copy of the transgene and no un-intended vector sequence were analyzed. All transgenic events accumulated AmCyan transcript in leaf tissue (data not shown). Several tissues from a representative event were examined for AmCyan transcript accumulation. Total RNA was prepared using the Plant RNAeasy total RNA isolation system (Qiagen). Pollen total RNA was prepared using the method described by Shirzadegan et al (1991). Preparation quality was assessed by UV spectrophotometry, and 10 μg of total RNA per sample was resolved on a 1% formaldehyde gel then transferred to Nytran SuPerCharge membrane following the recommended protocol (Schleicher & Schuell). The blot was hybridized to a random-primed 32P-labeled AmCyan DNA probe using high stringency conditions. The results clearly show that ZmABP3 promotes transcription in tassel, leaf, silk, ear and root tissue, but does not promote transcription in pollen.
Example 1.6
Expression from ZmABP3-Cry1AbG6 in Transgenic Maize
[0345] Several transgenic maize events containing the ZmABP3-Cry1AbG6 expression cassette were produced. Those containing a single-copy of the transgene and no un-intended vector sequence were analyzed. The T0 events were tested for insecticidal activity against corn earworm twice during the course of development. The first samples were taken at V2-V4, and the second samples were taken at V7-V9. Leaf discs from lower leaf tips were excised and placed on water-moistened Whatman paper in 47×10 mm petri dishes. Ten-to-twenty L1 corn earworm or European corn borer larvae were added to each dish, and they were incubated for 48 hours at 28° C. Leaf discs were then scored for insect damage. Samples with no visible leaf damage and absolute mortality were scored as positive, and those with visible damage were negative. The data obtained show that several transgenic events with activity against both insects were identified.
[0346] Cry1AbG6 protein accumulation was also measured in T0 plants using the enzyme-linked immunosorbent assay (ELISA) with a fully-truncated Cry1Ab standard. The first assay was done on seedling leaf tissue, sampled 1-2 weeks after transfer to soil. The second assay was done on leaf tissue from maturing plants, sampled just prior to the transition to reproductive development. The data in TABLE B show the range of Cry1AbG6 protein accumulated in plants with insecticidal activity. The data indicate that plants require nearly 50 ng (or more) Cry1AbG6 protein/mg extractable protein to have insecticidal activity.
[0347] TABLE B shows the insect control characteristics of greenhouse grown plants.
TABLE-US-00007 Cry1AbG6 (ng/mg Corn Earworm ECB Event Cassette extractable protein) Activity Activity Number Description seedling adult V2-V4 V7-V9 V7-V9 1 ABP3-Cry1Abg6 63 79 + + + 2 ABP3-Cry1Abg6 54 56 + + + 3 ABP3-Cry1Abg6 85 108 + + + 4 ABP3-Cry1Abg6 67 94 + + + 5 ABP3-Cry1Abg6 45 83 + +/- +/- 6 ABP3-Cry1Abg6 68 120 + + + 7 ABP3-Cry1Abg6 133 159 + + + 8 ABP3-Cry1Abg6 96 46 + + + 9 ABP3-Cry1Abg6 138 101 + + + 10 ABP3-Cry1Abg6 131 100 + + + 11 ABP3-Cry1Abg6 94 65 + + + 12 ABP3-Cry1Abg6 111 59 + + + 13 ABP3-Cry1Abg6 139 60 + + + 14 ABP3-Cry1Abg6 121 81 15 ABP3-Cry1Abg6 66 55 + + + 16 ABP3-Cry1Abg6 130 95 + + +
[0348] Leaf tissue from T0 plants was assayed for Cry1AbG6 protein by ELISA using truncated Cry1Ab protein as standard, Corn Earworm activity and European Corn Borer (ECB) activity. The plant developmental stage when sampled is indicated at the top of each column. The older (lower) leaf tissue was sampled. For insect assays a (+) indicates no visible leaf damage and complete and absolute insect mortality. Visible leaf damage produced a (-) score.
Example 1.7
European Cornborer Efficacy of ZmABP3-Cry1AbG6 Events in the Field
[0349] The ECB (European corn borer) field efficacy studies were conducted in Stanton, MN (SMN) and Bloomington, Ill. (BIL) during the 2006 growing season. Near-isogenic hybrids, comprising the ABP3-Cry1AbG6 events listed in TABLE C, Bt11, and a nontransgenic control hybrid were tested. The experimental design was randomized complete block with three replications in each location. A plot consisted of one 5.31 m long row containing 25 plants, with 0.76 m spacing between rows.
[0350] TABLE C shows the performance of ZmABP3-Cry1AbG6 maize in field studies.
TABLE-US-00008 Trial MG371 MG331 Location BIL SMN Trial Type ECB ECB ECBLR ECBKN ECBSN ECBLR ECBKN ECBSN Event Cassette Leaf Feeding CEBSN Ear Stalk Leaf Feeding CEBSN Ear Stalk Number Description Rating Shank (cm) Feeding (cm) Feeding (cm) Rating Shank (cm) Feeding (cm) Feeding (cm) 1 ABP3-Cry1Abg6 1.0 0.00 1.42 0.00 1.1 0.00 0.00 0.30 2 ABP3-Cry1Abg6 1.0 0.00 1.42 0.08 1.0 0.00 0.15 0.10 3 ABP3-Cry1Abg6 1.0 0.00 1.25 0.08 1.0 0.00 0.00 0.80 4 ABP3-Cry1Abg6 1.0 0.00 1.57 0.00 1.0 0.10 0.51 1.10 5 ABP3-Cry1Abg6 1.0 0.00 1.25 0.04 1.0 0.00 0.07 0.20 6 ABP3-Cry1Abg6 1.0 0.00 1.08 0.00 7 ABP3-Cry1Abg6 1.0 0.00 1.31 0.00 1.1 0.10 0.45 0.80 8 ABP3-Cry1Abg6 1.0 0.04 2.00 0.08 1.1 0.00 0.00 0.30 9 ABP3-Cry1Abg6 1.0 0.00 0.92 0.00 1.3 0.00 0.00 0.10 10 ABP3-Cry1Abg6 1.0 0.00 1.42 0.04 1.2 0.00 0.00 0.40 11 ABP3-Cry1Abg6 1.0 0.13 1.17 0.00 1.0 0.00 0.00 0.10 12 ABP3-Cry1Abg6 1.0 0.00 1.62 0.08 1.1 0.00 0.17 0.30 13 ABP3-Cry1Abg6 1.0 0.00 1.29 0.00 1.2 0.00 0.00 0.20 14 ABP3-Cry1Abg6 1.0 0.00 1.10 0.13 1.0 0.00 0.07 0.10 15 ABP3-Cry1Abg6 1.0 0.08 1.33 0.04 1.1 0.00 0.24 0.20 16 ABP3-Cry1Abg6 1.0 0.00 1.33 0.21 1.0 0.00 0.00 0.10 Bt11 1.0 0.00 2.75 0.00 1.3 0.00 0.00 0.00 Negative Check 7.0 0.21 3.00 4.67 4.3 0.40 5.80 13.50 Rep with data 3 3 3 3 3 3 3 3 Loc with data 1 1 1 1 1 1 1 1 Design Used RCB RCB RCB RCB RCB RCB RCB RCB LSD (5%) General EE 0.149 0.923 0.257 0.399 0.200 1.988 0.650 LSD (5%) Excluded Negatives 0.158 0.936 0.255 0.397 0.181 0.505 1.391 CV % 242.21 38.47 72.14 20.10 292.75 138.76 120.87 Probablitiy % 0.90 0.09 0.00 0.00 4.10 0.00 0.00
[0351] Two studies were undertaken in Bloomington, Ill. (BIL) and Stanton, Minn. (SMN) in 2006. Several ZmABP3-Cry1AG6 events were compared to positive and negative benchmarks represented by Bt11 and Negative Check, respectively.
[0352] First-instar ECB larvae were produced from a laboratory colony following procedures outlined in Guthrie (1989) at the Syngenta Seeds, Inc. entomology laboratory in Slater, Iowa. Eggs were incubated at about 28° C. and approximately 80% relative humidity, and neonates were collected from hatching containers approximately 6 hours after hatch. Larvae were healthy and vigorous when placed on the plants as indicated by movement.
[0353] Two ECB application types were performed: ECB1, applied at approximately leaf stage V6-V8 and ECB2, applied at pollen shed. The applications were made with the BioServe Davis Inoculator using 1 ml corn cob grits per application. For ECB1 (first-generation ECB infestation) a total of about 150 larvae were placed into the whorl of each plant, in corn cob grits. Two to four applications were made, with one to six days between each application. The first plant in the row was not treated, and then up to 10 consecutive plants were infested.
[0354] For ECB2 (second-generation ECB infestation) a total of about 200 larvae were applied per plant, placed into the ear leaf axil and leaf axils directly above or below the ear, in corn cob grits. Four applications were made, with one to six days between each application. Up to ten consecutive plants on the opposite end of the row from the ECB1 treatment were infested. The last plant in the row was not treated.
[0355] The following observations were recorded. For ECB1, up to eight consecutive infested plants in the row were evaluated for foliar ECB damage (ECBLR in TABLE C) at least 14 days after the first infestation. The Guthrie scale of 1-9 (Guthrie et al. (1960) was used and one rating, the average for the evaluated plants, was recorded for each plot. For ECB2, approximately 45 days after the plants were infested, up to eight consecutively infested plants on the opposite end of the row from the ECB1 evaluations were dissected to assess ear shank, ear kernel, and stalk feeding, by measuring feeding tunnel lengths (cm).
[0356] ECB2 data were subjected to analyses of variance appropriate for a randomized complete block design. Replications were considered random while all other effects were considered fixed. Mean separation was done using the least significant difference (LSD) procedure, but only if the F-test for entries was significant at the customary 5% significance level. Because there was no variability among the events in the ECB1 data, an analysis of variance was not done for this trait. The data and analysis are summarized in TABLE D. In general, the data show that ZmABP3-Cry1ABG6 affords protection against ECB similar to that observed in Bt11 material.
[0357] TABLE D shows the amount of Cry1AbG6 protein in transgenic maize tissue. The youngest developing leaf was tested for Cry1AbG6 by ELISA at 5 developmental stages (V5-V6, V8, V10, R1, R3-R4) for each plant. Cry1AbG6 was also measured in pollen. Events 5, 12, 15 and 16 express the ABP3-Cry1AbG6 construct, and Events A-D express the enhanced ABP3-Cry1Ab construct. Data shown are the mean±SD (n=8-10).
TABLE-US-00009 Developmental Stage V5-V6 V8 V10 R1 R3-R4 Pollen Event 5 .sup. 39(3.8) .sup. 38(2.7) .sup. 61(8.2) .sup. 75(5.3) .sup. 60(3.5) 1.5(0.14) Event 12 .sup. 61(5.2) .sup. 32(1.9) .sup. 50(6.1) .sup. 44(5.1) .sup. 49(4.4) 1.4(0.39) Event 15 .sup. 45(4.5) .sup. 45(4.8) .sup. 46(4.8) .sup. 38(7.4) .sup. 55(5.4) 1.0(0.14) Event 16 .sup. 58(5.4) .sup. 30(2.9) .sup. 47(5.3) .sup. 53(7.2) .sup. 44(4.6) 1.2(0.17) Event A 260(24) 190(22) 250(18) 200(21) 150(14) 1.3(0.19) Event B 260(22) 227(29) 240(30) 200(23) 150(76) 1.6(0.30) Event C 310(31) 210(26) 270(26) 150(15) 160(16) 1.9(0.31) Event D 310(30) 180(23) 240(15) 170(26) 150(18) 1.4(0.19)
Example 1.8
Use of ZmABP3 Expression Cassette to Improve Drought Tolerance in Maize
[0358] A deregulated form of an Arabidopsis H+-pyrophosphatase (AtAVP1 D) has been shown to improve drought tolerance when over-expressed in several plants (Gaxiola et al., 2001; Park et al., 2005). The improved performance is enabled by high expression throughout the plant. To demonstrate the utility of AtAVP1D to improve drought tolerance in maize, a maize-optimized coding sequence was synthesized. The sequence of the AtAVP1D synthetic gene is shown in SEQ ID NO: 16. It was ligated to the ZmABP3 expression cassette as an NcoI/SacI fragment. The vector map shown in SEQ ID: 42 illustrates the ZmABP3-AtAVP1D expression cassette. The complete ZmABP3-AVP1 D expression cassette was excised from the Assembly vector as a SanDI/RsrII fragment and ligated to the RsrII site of the Agrobacterium binary vector, 15289. A map of the construct is shown in SEQ ID NO: 43.
Example 1.9
Measurement of Cry1AbG6 in Maize Tissue
[0359] Hybrid T1 seed (in the ID5829/AX5707 background) for several ZmABP3-Cry1ABG6 events were produced at a Syngenta field station in Bloomington, Ill. Several seed were germinated in 2 inch pots. Seedlings were tested for transgene zygosity, and only hemizygotes were retained. A minimum of 8 plants per event were transplanted to 3 gallon pots and grown in a temperature controlled greenhouse. Leaf tissue from each plant was sampled and assayed for Cry1AbG6 protein at 5 stages of development, V5-V6, V8, V10, R1, and R3-R4 (Ritchie et al., 1997). Pollen was also collected and assayed for Cry1AbG6 protein.
[0360] At each stage, leaf tissue (minus the collar, midrib and sheath) was sampled from the youngest expanding leaf. Duplicate samples were pulverized in 96-well blocks. The powder was suspended in 500 A-1 mL extraction buffer (0.1 M Sodium Borate, 0.5% Tween 20, 0.2% Polyvinylpyrrolidone, 0.05% Sodium Azide, and 1× protease inhibitor cocktail tablets (Roche)). The mixture was clarified by centrifugation and soluble protein quantified using the BCA assay. Fresh pollen was collected in 1.5 mL Eppendorf tubes. Three 3 mm glass beads were added to each tube and the samples were frozen at -80° C. Samples were then pulverized in a horizontal oscillator at 600 rpm. Protein was extracted by adding 500 4-1 mL extraction buffer and incubating at 4° C. for 30 minutes. The samples were clarified by centrifugation at 4° C., and the soluble protein in each sample was quantified by BCA Assay.
[0361] Samples were normalized for protein content and Cry1AbG6 was quantified by ELISA using fully-truncated Cry1Ab as a standard. Each data point is the mean of duplicate measurements, taken at a different dilution of total protein. Data for each event are reported as the mean±SD for all siblings.
[0362] Results in TABLE D show that the ZmABP3-Cry1AbG6 cassette produces steady Cry1AbG6 protein in leaf tissue throughout development. Some reduction in CryAbG6 protein is evident as the vegetative tissue begins to senesce (R3-R4). Also evident is the 3-5 fold increase in Cry1AbG6 accumulation in events that also have the CaMV-FMV dual-enhancer complex. Finally, the data show virtually no detectable Cry1AbG6 protein in pollen. In all events CryAbG6, on average, accumulates to less than 1.5 ng/mg total soluble protein. Furthermore, the dual-enhancer complex does not influence Cry1AbG6 accumulation in pollen; it is identical between all events. This is consistent with our data showing that ZmABP3 is not transcribed in pollen (Example 1.5). We conclude that detectable Cry1AbG6 in pollen was likely produced in the microspore mother cells or their progenitors, and carried to pollen through cell division.
Example 2
Non-Tassel Expression
Example 2.1
Identification of ZmABT
[0363] 2.1.1 Expression Profiling Experiment:
[0364] A maize developmental series on the Zm80K Affymetrix chip, was queried for probes that gave strong signals in all samples, and a low or no signal in the tassel samples. Twenty-three (23) probes were identified representing polynucleotides that met the expression criteria. To better represent the differential expression signal between the tassel samples and other tissue samples, the ratio of mean signal for other samples and tassel was calculated for each probe. This indicates the expression differential between tassel and other samples. Any signal below 50 is in the experimental noise, which means the gene may not be transcribed or is transcribed at a very low level. To understand the expression level of each gene represented by candidate probes, a second expression profiling study was queried. In this experiment tissues from two maize genotypes were hybridized to the Zm80K Affymetrix chip. In general signals over 1000 indicate high expression and signals over 10,000 indicate very high expression.
[0365] 2.1.2 Identification of Candidate Probes:
[0366] Two top candidate probes were identified. Probe Zm033444_S_AT demonstrates virtually no signal in tassel and a high signal in other tissues. This indicates that the gene represented by Zm033444_S_AT is not expressed in tassel and is highly expressed throughout the rest of the plant. It also demonstrates the greatest expression differential, 60-fold higher in non-tassel tissue. Probe Zm040564_X_AT has a low signal in young tassel that gradually increases to a high or strong signal. The signal strength between tassel and non-tassel samples differs by less than 10-fold. However the signal strength in non-tassel samples is nearly 10-fold higher than Zm033444_S_AT. The sequence data indicate that neither probe corresponds to a characterized gene. Both probes identify good candidate genes for development of promoters that deliver high expression in non-tassel tissue and little or no expression in tassels. Given the high signal differential between tassel and non-tassel samples, an expression cassette based on probe Zm033444_S_AT was developed.
[0367] Table E: shows a summary of the top candidate probes representing polynucleotides with a high expression level in all maize tissues and low expression signal in tassel
TABLE-US-00010 Mean induction in P- BH Q- non-tassel V9 V12 V15 Probe Value Value samples tassel tassel tassel Zm033444_s_at 0.00 0.00 60 16.2 10.2 132 Zm002990_s_at 0.00 0.00 45 32.8 68.7 47.8 Zm006285_at 0.00 0.00 20 37.9 44.1 35.8 Zm000019_at 0.00 0.00 16 117 200 242 Zm006481_s_at 0.00 0.00 14 26.9 32.1 31.5 Zm002987_at 0.00 0.00 14 83.7 80.8 119 Zm004433_at 0.00 0.00 12 53.8 35.3 127 Zm010323_s_at 0.00 0.00 11 45.4 63 71.5 Zm016864_s_at 0.01 0.01 11 89.5 55.6 1280 Zm018791_at 0.01 0.01 11 41.4 34.7 252 Zm028405_s_at 0.00 0.00 10 69 65.1 89 Zm021403_at 0.00 0.00 10 42.2 41.4 71 Zm054116_s_at 0.00 0.00 10 93.3 62.4 219 Zm002990_x_at 0.00 0.00 10 13.6 29.5 29.2 Zm005761_at 0.00 0.00 9.6 33.2 40 46.7 Zm035082_s_at 0.00 0.00 8.5 83 84 143 Zm066342_at 0.00 0.00 8.2 52.9 59.2 199 Zm032921_s_at 0.00 0.00 8.1 57.5 29.8 90.5 Zm040564_x_at 0.01 0.01 7.5 277 143 3710 Zm051284_at 0.01 0.01 6.5 53.2 40 194 Zm011554_at 0.03 0.04 5.4 72.5 64.2 895 Zmmetall_x_at 0.01 0.01 5.3 325 199 2330 Zm011554_x_at 0.04 0.04 4.9 63.5 62.6 664
Example 2.2
Development of an Expression Cassette
[0368] DNA sequence evidence to identify cDNAs corresponding to Zm033444_S_AT was collected. Public and proprietary databases were queried by BLASTN with Zm033444_S_AT sequence. cDNA hits with precise matches to the query sequence fell into two similar contigs. ZmABT1 corresponds to Maize.1482.c47 and Maize.1908.c31, and ZmABT2 corresponds to Maize.1482.c32, Maize.1482.c28, Maize.1482.c53, Maize.1908.c17, Maize.1908.c20, Maize.1908.c37 and A1947567. The Zm033444_S_AT, ZmABT1 and ZmABT2 sequences were used to query maize genomic DNA sequence databases to identify the regulatory sequence(s) that give high expression in non-tassel tissue and little or no expression in tassels. The queries identified three entries, AZM4--12, ZmGSStuc11-12-04.4740.1 and MAGI--88845, that assemble into a single contig. The ZmABT gDNA sequence is shown in SEQ ID NO: 46. It encodes both ZmABT1 and ZmABT2 (SEQ ID NO: 33 and 34, respectively). They are alternatively spliced variants of the same transcript.
[0369] ZmABT1 is encoded on 5 exons, and ZmABT2 is encoded on 6 exons. The additional exon lies between exon 1 and exon 2 of ZmABT1. The largest open reading frame on ZmABT1 and ZmABT2 was used to define their translation start and stop codons. Both cDNAs used the same translation start and stop codon. This information enabled the design of a ZmABT-based expression cassette.
Example 3
Construction of a ZmABT-GUS Expression Cassette
[0370] An inclusive, gene structure-based design strategy was used to construct the ZmABT expression cassette. To incorporate the known alternative splicing of this gene into the expression cassette, the design strategy was based on the structure of ZmABT1. The cassette contains 2.615 kb of 5'-sequence, which consists of 2.020 kb of 5'-non-transcribed sequence, 12 bp of 5'-UTR and 0.58 kb representing exon 1, intron 1 and 16 bp of exon 2. The natural translation start codon was silenced in order to move it to the second exon. The expression cassette also contains 1.039 kb of 3'-sequence that begins just past the translation stop codon. This includes 0.603 kb of 3'-UTR and 0.436 kb of non-transcribed sequence, and functions as the transcriptional terminator and poly-adenylation signal.
[0371] The ZmABT promoter was amplified from maize gDNA template in a 50 μL Proofstart (Qiagen) DNA polymerase reaction containing 10 μg gDNA, 5 μL 10× Proofstart Buffer, 1.0 μL 10 mM dNTP mix, 1.0 μL of 20 μM ABT P1 forw (5'-CGACCAGCGCGACATGCATGGCA-3'; SEQ ID NO: 19), 1.0 μL of 20 μM ABT P2 rev (5'-ACCCCAGGGCGTACGACAAGGCC-3'; SEQ ID NO: 20), and 10.0 μL 5× Q solution. The thermocycling program was 95° C. for 5 minutes followed by 40 cycles of 94° C. for 30 seconds, 67° C. for 30 seconds and 72° C. for 2.5 minutes. The final extension step was 72° C. for 10 minutes. The 2.6 kb reaction product was gel-purified on 1% TBE agarose and the DNA was extracted using Qiaprep DNA extraction method. The DNA was cloned into the pCR-BluntII-TOPO vector.
[0372] The ZmABT promoter was modified in a series of mutagenesis reactions to silence the endogenous START codon, silence a SanDI restriction site and correct point mutations created during amplification. This was done using the Stratagene QuikChange Multi-site mutagenesis kit. The 25 μL reaction contained 1 μL pCR4-TOPO-ZmABT-promoter, 2.5 μL 10× QuikChange buffer, 1 μL QuikChange dNTP mix, 0.75 μL Quik solution, 1 μL QuikChange DNA polymerase and 1 μL of 20 μM of at least one of the following oligonucleotides:
TABLE-US-00011 (SEQ ID NO: 21) pABT mut1 (5'-GATGGCCGGATTGGGCTCCCGGGGTGGAG-3') (SEQ ID NO: 22) pABT mut2 (5'-CTGGGAGGCGCGCAAGGGGCAGTTCCTCG-3') (SEQ ID NO: 23) pABT mut3 (5'-CCCACCGCCGGAGCACCGAAAGGCCCCGCG-3') (SEQ ID NO: 24) pABT mut4 (5'-GTCACCCGGGAGCACTTCCCGGCGCCG-3') (SEQ ID NO: 25) pABT mut5 (5'-CATTGGGCCGAGCACGGCTTCTTCCGC-3') (SEQ ID NO: 26) pABT mut6 (5'-GGGGTACGGTGTTCTTGAGTCGTGAAGCGAC-3')
[0373] The thermocycling program was 95° C. for 1 minute followed by 35 cycles of 95° C. for 1 minute, 50° C. for 1 minute and 65° C. for 12 minutes. The product was processed as described by the manufacturer (Stratagene) and completely sequenced. The ZmABT promoter sequence is shown in SEQ ID NO: 35.
[0374] The corrected ZmABT promoter was PCR amplified from the TOPO vector in a 50 μL Proofstart (Qiagen) DNA polymerase reaction as above using primers pABT amp1 (5'-GCGTCTAGAGGGACCCCGACCAGCGCGACATGCATGGCA-3'), depicted in SEQ ID NO: 27 and pABT amp2 (5'-ACCCCAGGGCGTACGACAA-GGCCCCACCATGGGCGC-3'), depicted in SEQ ID NO: 28. The PCR product was gel-purified on 1% TBE agarose and the DNA was extracted using Qiaprep DNA extraction method. The DNA was cloned into the pCR-BluntII-TOPO vector, transformed and sequenced. The ZmABT promoter was excised as an XbaI/NcoI fragment and ligated to pNOV6901.
[0375] The ZmABT terminus was amplified from maize gDNA template in a 50 μL Extensor (ABgene) DNA polymerase reaction containing 10 μg gDNA, 5 μL 10× Extensor buffer #1, 2.0 μL 10 mM dNTP mix, 2.0 μL of 20 μM ABT P4 (5'-TATATAGAGCTCGAATCGAAGAAGCCACACTGTAAATCTGCCGGG-3'; SEQ ID NO: 29), 2.0 μL of 20 μM ABT P5 (5'-AGCAAGGCATATGCAGCAGCTGCT-GGTCGGACCGGGCCCTATATA-3'; SEQ ID NO: 30), 10 μL 5× Q solution, 0.5 μL Extensor DNA polymerase and 0.5 μL Amplitaq DNA polymerase. The reactions were overlaid with mineral oil and the thermocycling program was 95° C. for 2 minutes followed by 40 cycles of 98° C. for 2 seconds, 63° C. for 1 minute and 68° C. for 4 minutes. The final extension step was 68° C. for 7 minutes. The 1 kb reaction product was gel-purified on 1% TAE agarose, and the DNA was extracted using Qiaprep DNA extraction method. The DNA was ethanol precipitated and recovered in 4 μL ddH2O, then cloned into the pCR4-TOPO-Blunt vector.
[0376] The ZmABT terminus was modified to remove internal NcoI and XhoI restriction sites using the Stratagene QuikChange Multi-site mutagenesis kit, as above. The 25 μL reaction contained 1 μL pCR4-TOPO-ZmABT-promoter, 2.5 μL 10× QuikChange buffer, 1 μL QuikChange dNTP mix, 0.75 μL Quik solution, 1 μL QuikChange DNA polymerase and 1 μL of 20 μM of at least one of the following oligonucleotides:
TABLE-US-00012 (SEQ ID NO: 31) ABTt m1 (5'-GTCATGCATGGGCATGTGAAGGAGGAGCC-3') (SEQ ID NO: 32) ABTt m2 (5'-GTTGCATGCATGCTGCATGGCGTCGAGAT-3')
[0377] The thermocycling program was 95° C. for 1 minute followed by 35 cycles of 95° C. for 1 minute, 50° C. for 1 minute and 65° C. for 13 minutes. The product was processed as described by the manufacturer (Stratagene) and completely sequenced. The ZmABT terminator sequence is shown in SEQ ID NO: 36.
[0378] The ZmABT terminus was excised as a SacI/ApaI fragment and ligated to pNOV6901-prABT vector (above). This produced plasmid 15772 (ZmABT Assembly), and a plasmid map is shown in SEQ ID NO: 44. The complete ZmABT expression cassette was mobilized as a SanDI/RsrII fragment into the RsrII site of the Agrobacterium binary vector 15289. A plasmid map of this construct, 15773, is shown in SEQ ID NO: 45.
Example 4
Extension of DNA Probe Sequences to Designed Expression Cassettes
[0379] DNA sequence representing probes on the maize chip can easily be extended to designed expression cassettes following the steps outlined above. The DNA sequence for probes identified as representing genes that are highly expressed in all tissue samples and not expressed in pollen (Table A) and those that are highly expressed in all tissue samples and have reduced expression in tassel samples (Table E) is reported as SEQ ID NOs: 47-79.
[0380] An additional probe candidate from the expression profiling analysis for each expression category was selected to demonstrate progression from this DNA sequence to a finished binary vector with the designed expression cassette linked to the GUS reporter gene. The method used is identical to that for ZmABP3 and ZmABT. In summary the process steps to be applied are as follows:
1. Flank each expression cassette with SanDI/RsrII sites and report as cloned into the RsrII site of 15289 (SEQ ID NO: 80). 2. Promoter consists of 1000-1500 bp of sequence upstream of the transcription start site and extends 10 bases into the second exon, or to the natural translation start codon if it is not on the first exon. It terminates with the maize optimized Kozak sequence `gtaaaccatgg`. The engineered translation start codon is now embedded in the NcoI restriction endonuclease site `ccatgg`. Mutate all translation start codons in the theoretical transcript that are upstream of the engineered NcoI site. Ensure at least one stop codon is in each reading frame upstream of the engineered NcoI site. The promoter is designed to be flanked by XhoI/SanDI at the 5'-end and NcoI at the 3'-end. 3. The Gene Of Interest (GOI) is represented by the GUS reporter gene as an NcoI/SacI fragment. 4. The terminus extends from just after the translation stop codon for 1 kb downstream. The terminus is designed to be flanked by SacI at the 5'-end and RsrII/XmaI at the 3'-end. 5. The complete expression cassette is designed to be mobilized as a SanDI/RsrII fragment, which can be ligated into an RsrII site located on an Agrobacterium binary vector such as 15289 (SEQ ID NO: 80). 6. Mutate all internal SanDI, RsrII, NcoI, SacI, XhoI and XmaI sites by single base substitution to silence them.
[0381] Through application of these basic steps a plant expression cassette (SEQ ID NO: 81) can be designed that corresponds to probe Zm058948_s_at (SEQ ID NO: 55) and a plant expression cassette (SEQ ID NO: 82) that corresponds to probe Zm002990_s_at (SEQ ID NO: 62). The former is an expression cassette that should be transcribed in all maize tissues and not in pollen. The latter is an expression cassette that should be transcribed in all maize tissues and have reduced transcription in tassels. This design strategy applies to all probes identified in Tables A and E.
[0382] Further details of how to make such expression cassettes are described in US2005235311, which is incorporated herein by reference in its entirety.
REFERENCES
[0383] Ammirato et al., eds., (1984) Handbook of Plant Cell Culture--Crop Species, Macmillan Publ. Co., New York, N.Y.
[0384] An et al., (1985) EMBO J. 4, 277 287
[0385] Auch & Reth et al.
[0386] Batzer, et al., Nucleic Acid Res. 19:5081 (1991)
[0387] Byrne, M. C., McDonnell, R. E., Wright, M. S. and Carnes, M. G., 1987. "Strain and Cultivar Specificity in the Agrobacterium-soybean Interaction." Plant Cell Tissue and Organ Culture 8:3-15
[0388] Christou et al., Plant Physiol. 87:671-674 (1988)
[0389] Christou et al., Biotechnology 9: 957-962 (1991)
[0390] Crossway et al., Bio Techniques 4:320-334 (1986)
[0391] Datta et al., Bio/Technology 8:736-740 (1990)
[0392] Fromm et al., Bio/Technology 8:833-839 (1990)
[0393] Gaxiola, R. A., Li, J., Undurraga, S., Dang, L. M., Allen, G. J. Alper, S. L., Fink, G. R. (2001). Drought- and salt-tolerant plants result from over-expression of the AVP1 H+-pump. Proc. Natl. Acad. Sci. USA 98: 11444-11449.
[0394] Gordon-Kamm et al., Plant Cell 2:603-618 (1990)
[0395] Gruber, et al., "Vectors for Plant Transformation, in Methods in Plant Molecular Biology & Biotechnology" in Glich, et al., (Eds. pp. 89-119, CRC Press, 1993).
[0396] Guthrie, W. D., F. F. Dicke, and C. R. Neiswander (1960) Leaf and sheath feeding resistance to the Eur. corn borer in eight inbred lines of dent corn. Ohio Agric. Exp. Stn. Res. Bull. 860.
[0397] Guthrie, W. D. (1989) Advances in Rearing the European Corn Borer on a Meridic Diet, In: Toward Insect Resistant Maize for the Third World; Proceedings of the International Symposium on Methodologies for Developing Host Plant Resistance to Maize Insects. Mexico, D. F.:CIMMYT
[0398] Hiei et al., (1994) Plant J. 6, 271-282
[0399] Hinchee et al., Biotechnology 6:915-921 (1988)
[0400] Hoekema (1985) The Binary Plant Vector System, Offsetdrukkerij Kanters B. V., Alblasserdam, Chap. V
[0401] Klein et al., Proc. Natl. Acad. Sci. USA, 85:4305-4309 (1988)
[0402] Klein et al., Bio/Technology 6:559-563 (1988)(maize); Klein et al., Plant Physiol. 91:440-444 (1988)
[0403] Knauf, et al., 1983
[0404] Koziel et al., Biotechnology 11: 194-200 (1993)
[0405] Lindsey K, Wei W, Clarke M C, McArdle H F, Rooke L M, Topping J F. Tagging genomic sequences that direct transgene expression by activation of a promoter trap in plants. Transgenic Res. 1993 January; 2(1):33-47.
[0406] Lopez, I, Anthony, R. G., Maciver, S. K., Jiang, C.-J., Khan, S., Weeds, A. G., Hussey, P. J. (1996) Pollen specific expression of maize genes encoding actin depolymerizing factor-like proteins. Proc. Natl. Acad. Sci. USA. 93: 7415-7420.
[0407] Lorz et al. (Mol. Gen. Genet. 199, 178, (1985))
[0408] McBride, et al. (1994) Proc. Natl. Acad. Sci. USA 91:7301-7305
[0409] McCabe et al., Biotechnology 6:923-926 (1988)
[0410] Ohtsuka, et al., J. Biol. Chem. 260:2605-2608 (1985)
[0411] Pacciotti et al. (1985) Bio/Technology 3:241
[0412] Park et al., 1985
[0413] Park, S., Li, J., Pittman, J. K., Berkowitz, G. A., Yang, H., Undurrago, S., Morris, J., Hirschi, K. D., Gaxiola, R. A. (2005). Up-regulation of a H+-pyrophosphatase (H+-PPase) as a strategy to engineer drought-resistant crop plants. Proc. Natl. Acad. Sci. USA 102: 18830-18835.
[0414] Paszkowski et al., EMBO J. 3:2717-2722 (1984)
[0415] Pearson, W. R. (1990), Methods in Enzymology 183, 63-98
[0416] Potrykus, I., Paszkowski, J. P., Saul, M. W., Petruska, P. and Shillito, R. D. 1985. Molecular and general genetics of a hybrid foreign gene introduced into tobacco by direct gene transfer. Mol. Gen. Genet. 199:169-177.
[0417] Ritchie, S. W., Hanway, J. J., Benson, G. O. (1997). How a corn plant develops: Special Report No. 48. Iowa State University of Science and Technology Cooperative Extension service: Ames, Iowa.
[0418] Riggs et al., Proc. Natl. Acad. Sci. USA 83:5602-5606 (1986)
[0419] Rossolini, et al., Mol. Cell. Probes 8:91-98 (1994)
[0420] Sambrook et al. supra; Molecular Cloning, a Laboratory Manual, Maniatis et al. (eds) Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1982); Advanced Bacterial Genetics, Davis et al. (eds.) Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1980)
[0421] Sanford et al., Particulate Science and Technology 5:27-37 (1987)
[0422] Shimamoto et al., Nature 338: 274-277 (1989)
[0423] Shirzadegan, M., Christie, P., Seemann, J. (1991) An efficient method for isolation of RNA from tissue-cultured plant cells. Nucleic Acids Res. 19(21): 6055.
[0424] Smith and Waterman, Advances in Applied Mathematics 2 (1981), 482-489
[0425] Sukhapinda et al., Plant Mol. Biol., vol. 8:209-216, 1987
[0426] Svab et al., Proc. Natl. Acad. Sci. USA 87: 8526-8530 (1990)
[0427] Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes part I chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays" Elsevier, New York.,
[0428] Vasil et al., Biotechnology 11: 1553-1558 (1993)
[0429] Weeks et al., Plant Physiol. 102: 1077-1084 (1993)
[0430] Weissinger et al., Annual Rev. Genet. 22:421-477 (1988)
Patent Literature
[0430]
[0431] EP 0 332 581
[0432] EP 0 292 435
[0433] EP 0 295959
[0434] EP 0 138341
[0435] EP 0 120516
[0436] U.S. Pat. No. 5,451,513
[0437] U.S. Pat. No. 5,545,817
[0438] U.S. Pat. No. 5,545,818
[0439] U.S. Pat. No. 5,350,689
[0440] U.S. Pat. No. 5,451,513,
[0441] U.S. Pat. No. 4,945,050
[0442] WO 95/16783
Sequence CWU
1
1
91150DNAArtificial Sequenceforward primer P1 1atatatgcat gcggcgcgcc
gaaagtagca aacaacaggt tcatgtgcac 50238DNAArtificial
Sequencereverse primer P2 2tatataccat ggtgggtttg cctgcgacca caagttca
38328DNAArtificial Sequenceoligonucleotide Patg
3cagctcgccc gagttggtaa ggccccct
28424DNAArtificial Sequenceoligonucleotide Pnco 4acagattagt ccatcgccca
cggt 24527DNAArtificial
Sequenceoligonucleotide ADPc-1 5agccctgtcc atgacggccc aagcaac
27626DNAArtificial Sequenceoligonucleotide
ADPc-2 6agtagcaatt cggtaggcac aggcac
26725DNAArtificial Sequenceoligonucleotide ADPc-4 7tctatggtct
gcgaggtgcg gtggc
25827DNAArtificial Sequenceoligonucleotide adp3-a 8gtccccttct tcgccgcgcc
agctcgc 27937DNAArtificial
Sequenceforward primer P3 9tatatagagc tcgcatcatg atcatgcatc atggact
371046DNAArtificial Sequencereverse primer P4
10atatatacta gtggcgcgcc acactttctg tcgcatgtga tttgca
461130DNAArtificial Sequenceforward primer Tnco 11gtaaaaaaag gtcccttggc
tcccagaaga 301220DNAArtificial
Sequenceforward primer T2 12caatgtgtta gactgacgtg
20132333DNAZea mays 13gcatgcggcg cgccgaaagt
agcaaacaac aggttcatgt gcactataaa aagacaaaat 60tctcgagttt catcttttat
tccacataag ccttatattt tccattttca tatgattttt 120agtttaagtt tgtgtcttaa
ctttttcgtt aatacgtaat tctatgcatt atggatgcgt 180gaagtatttt tgtttaaaaa
aatgaaatgt caaaatacgt tttgtgatct atttccatgt 240tttcacctaa caggtggttt
ttactatata ttctgccata actctagcct tagatgtaaa 300tcgaaaaaaa atgagagatg
agctggagat agccttagat gaagcgtctg aaatataaaa 360gaaagagtaa tgttgaacgc
agtaggtgta gcagctgtag ttccatctct aggaaaggga 420actgcaatcc gggctccggg
cctcgcgcaa tctggcctgt cgtgtagatg cagccctgtc 480catgacggcc caagcaacgc
ccgcggctct cgatccacca cggaacccac tccgacacac 540actgacacac acatgctgga
tgtggatgtg ctgtccaatt attagtagca attcggtagg 600cacaggcacg tactggccgg
tgttttagct gtaagtaccg aaccaatcac ggttaagaac 660cgattaatcc gtgcccagcc
gccgagtgcg ttcgtacgtg catcggatgc actgcatgaa 720ttgagagcat catcatatca
tacgcaggag tagtacgacg ccgctgctgt cttgtccggc 780taatgctttg ctcacagatt
agtccatcgc ccacggtcgg tgtggtgtgg atcgctgatg 840ccactgcttt ttgtttggtt
tttattcccc tgataatcct ccgcgtccct gaatgtatct 900atttattttc attccgaaat
ccctttcacg aaaaagaaaa cgaataaaaa gagagttacg 960aatacgcttc cggcggccca
catcaccttc cagcgaacat cgcgccgcgc tgacgtgtcg 1020cccatcgcgg ccgtccatat
cgccatccga cgaccgtgga agctggcagc ggccgctccg 1080ttccgtcgaa ggggcaggtc
agtcaggtca cccacacggc cacacccgcg cgggggatac 1140gcggtggaaa acccggcgac
cacatcaaaa cacgaggcgt ctcccgcagg actggtcact 1200cggcacgcag gcagaggcag
cacagcagca gccagctcca tccatcctct ttcccctcct 1260cgcttcgctt cctcggcgga
ttcctcctcc ctcggccgtc cccgtcccct tcttcgccgc 1320gccagctcgc ccgagttggt
aaggccccct ccacccctcc gcttcccctc ccccgggcgc 1380gctctggctt cctccccgga
tcggcgcggg gcgtgctggc tccgcgcctg atttcgggcc 1440ttttgtttcc ttctcgcgga
gcgctcgtgt aacgcttcgg atctagctgg attcaggcgg 1500gatcgcggcc gctcggcttc
ctcgtggcct gattcgtggt tttcctcggg gagggaatcc 1560tgatcggatc atcgggattc
ctcgtgcggc cgggacacgc ttgcgagcca gaaacatagt 1620ctgcgtggcc gggattccac
gatctgtgat ctagacgtcg ggcgcttcgt ctatgtgctc 1680gctgcaggct gtggcgtact
ggcgtggtgc gcggccgcta tggatccgtg cttgtttgtt 1740cgccctgtag cgtgtgaaat
cgagctgtgt agatctatgg tctgcgaggt gcggtggcgg 1800tggaatctcg gttgatcttt
acctcagcgg cgccagtgta gctcgtgtgg ctgcagttca 1860tctgcgaatt tggctctcgg
cggcttaggt cgcggagctt ggattatgga gcaccagctg 1920cagcgtgacc ctgttggttc
tcatgtggat ctgttggctg aggttgcaga cttcaagtgc 1980cactgccatt gaccggagct
gctgcacgat tatactggaa tatctagcgg tagtatactc 2040tgctagtact caatacgggt
ctcctgacaa atgtctttcg tgtttaggga cctagcactc 2100tagtgtcaag actatttgct
ggaatatcta atattagcag tttctgtagt ggctcagttg 2160cagcctggtt tagaatgatg
gggacagttg gctgtgccat gcaaaataaa gtgtgtgaaa 2220gcaactgcct cttaaactat
gggtggtgca agcaggttat ttgaagggac tctccacact 2280gtatctccag ttaactatga
ctgaacttgt ggtcgcaggc aaacccacca tgg 2333141031DNAZea mays
14gagctcgcat catgatcatg catcatggac tcggcctact actgtggatt tgtatgccat
60tatagacttg gtgctgtgaa agactgcttg atgatttgcg ggtttgttgc tgtgtaaaaa
120aaggtccctt ggctcccaga agaccatgaa ggttcggatc tatcatgtaa ttccttgtta
180tctgccaatt atgtatggac tatggacatg tgttgcgctg ttcaacttac tactacaaat
240aagtaatcga tatgttccct tcccatgtct cggtgacaat tgtctggaga agcttagggg
300tcgtttgttt gggattatgt ctggagaaac ttattttaaa ctaagtgtga gttcaagtta
360agttagatta tataatctag gcagattata attccaagcg aacaggtcct tagtgttttt
420ggaaaatcct aggtgttctt ttggctacat tgttgtgtgt gcagatccct tgttggtctg
480taagcgtggg gaagtaagaa tcgtccgttt ctactgaaga cctgctcgag ttaggcaccg
540aggatgccgg taaccaaaca gagcaatagt gtctctgtgg gcacagtgga gtgtgaatct
600gtgtgatgca aatccgtcat ttgtttagca aaatttccag cgttgcatga tgcagtttct
660ttaacacgga cttaagggaa gggaaaaaaa tgttgagcca ggagatcctt caatgtgtta
720gactgacgtg atagccaact aaaccacgac gcaatgttgt cgttaatgac aaaaaaacta
780tttgttccta aatccttggc gacattgcat ggctgtctca tgagataatg gtctcatctc
840ttatttatct cttatttata gccggaagtg gtagtgaccc ctgcttgatt gctcgtatgc
900catctcaagt tctcaaccgt gtcgagcagc cattttccca tctcaagcgc atcatcgttt
960cgtttgacct catctgctat cctgctccta gtgcaaatca catgcgacag aaagtgtggc
1020gcgccactag t
1031151031DNAZea mays 15gagctcgcat catgatcatg catcatggac tcggcctact
actgtggatt tgtatgccat 60tatagacttg gtgctgtgaa agactgcttg atgatttgcg
ggtttgttgc tgtgtaaaaa 120aaggtccctt ggctcccaga agaccatgaa ggttcggatc
tatcatgtaa ttccttgtta 180tctgccaatt atgtatggac tatggacatg tgttgcgctg
ttcaacttac tactacaaat 240aagtaatcga tatgttccct tcccatgtct cggtgacaat
tgtctggaga agcttagggg 300tcgtttgttt gggattatgt ctggagaaac ttattttaaa
ctaagtgtga gttcaagtta 360agttagatta tataatctag gcagattata attccaagcg
aacaggtcct tagtgttttt 420ggaaaatcct aggtgttctt ttggctacat tgttgtgtgt
gcagatccct tgttggtctg 480taagcgtggg gaagtaagaa tcgtccgttt ctactgaaga
cctgctcgag ttaggcaccg 540aggatgccgg taaccaaaca gagcaatagt gtctctgtgg
gcacagtgga gtgtgaatct 600gtgtgatgca aatccgtcat ttgtttagca aaatttccag
cgttgcatga tgcagtttct 660ttaacacgga cttaagggaa gggaaaaaaa tgttgagcca
ggagatcctt caatgtgtta 720gactgacgtg atagccaact aaaccacgac gcaatgttgt
cgttaatgac aaaaaaacta 780tttgttccta aatccttggc gacattgcat ggctgtctca
tgagataatg gtctcatctc 840ttatttatct cttatttata gccggaagtg gtagtgaccc
ctgcttgatt gctcgtatgc 900catctcaagt tctcaaccgt gtcgagcagc cattttccca
tctcaagcgc atcatcgttt 960cgtttgacct catctgctat cctgctccta gtgcaaatca
catgcgacag aaagtgtggc 1020gcgccactag t
1031161031DNAZea mays 16gagctcgcat catgatcatg
catcatggac tcggcctact actgtggatt tgtatgccat 60tatagacttg gtgctgtgaa
agactgcttg atgatttgcg ggtttgttgc tgtgtaaaaa 120aaggtccctt ggctcccaga
agaccatgaa ggttcggatc tatcatgtaa ttccttgtta 180tctgccaatt atgtatggac
tatggacatg tgttgcgctg ttcaacttac tactacaaat 240aagtaatcga tatgttccct
tcccatgtct cggtgacaat tgtctggaga agcttagggg 300tcgtttgttt gggattatgt
ctggagaaac ttattttaaa ctaagtgtga gttcaagtta 360agttagatta tataatctag
gcagattata attccaagcg aacaggtcct tagtgttttt 420ggaaaatcct aggtgttctt
ttggctacat tgttgtgtgt gcagatccct tgttggtctg 480taagcgtggg gaagtaagaa
tcgtccgttt ctactgaaga cctgctcgag ttaggcaccg 540aggatgccgg taaccaaaca
gagcaatagt gtctctgtgg gcacagtgga gtgtgaatct 600gtgtgatgca aatccgtcat
ttgtttagca aaatttccag cgttgcatga tgcagtttct 660ttaacacgga cttaagggaa
gggaaaaaaa tgttgagcca ggagatcctt caatgtgtta 720gactgacgtg atagccaact
aaaccacgac gcaatgttgt cgttaatgac aaaaaaacta 780tttgttccta aatccttggc
gacattgcat ggctgtctca tgagataatg gtctcatctc 840ttatttatct cttatttata
gccggaagtg gtagtgaccc ctgcttgatt gctcgtatgc 900catctcaagt tctcaaccgt
gtcgagcagc cattttccca tctcaagcgc atcatcgttt 960cgtttgacct catctgctat
cctgctccta gtgcaaatca catgcgacag aaagtgtggc 1020gcgccactag t
1031171031DNAZea mays
17gagctcgcat catgatcatg catcatggac tcggcctact actgtggatt tgtatgccat
60tatagacttg gtgctgtgaa agactgcttg atgatttgcg ggtttgttgc tgtgtaaaaa
120aaggtccctt ggctcccaga agaccatgaa ggttcggatc tatcatgtaa ttccttgtta
180tctgccaatt atgtatggac tatggacatg tgttgcgctg ttcaacttac tactacaaat
240aagtaatcga tatgttccct tcccatgtct cggtgacaat tgtctggaga agcttagggg
300tcgtttgttt gggattatgt ctggagaaac ttattttaaa ctaagtgtga gttcaagtta
360agttagatta tataatctag gcagattata attccaagcg aacaggtcct tagtgttttt
420ggaaaatcct aggtgttctt ttggctacat tgttgtgtgt gcagatccct tgttggtctg
480taagcgtggg gaagtaagaa tcgtccgttt ctactgaaga cctgctcgag ttaggcaccg
540aggatgccgg taaccaaaca gagcaatagt gtctctgtgg gcacagtgga gtgtgaatct
600gtgtgatgca aatccgtcat ttgtttagca aaatttccag cgttgcatga tgcagtttct
660ttaacacgga cttaagggaa gggaaaaaaa tgttgagcca ggagatcctt caatgtgtta
720gactgacgtg atagccaact aaaccacgac gcaatgttgt cgttaatgac aaaaaaacta
780tttgttccta aatccttggc gacattgcat ggctgtctca tgagataatg gtctcatctc
840ttatttatct cttatttata gccggaagtg gtagtgaccc ctgcttgatt gctcgtatgc
900catctcaagt tctcaaccgt gtcgagcagc cattttccca tctcaagcgc atcatcgttt
960cgtttgacct catctgctat cctgctccta gtgcaaatca catgcgacag aaagtgtggc
1020gcgccactag t
1031188546DNAArtificial sequenceplasmid pNOV1321 18cttgcatgcc tgcagtgcag
cgtgacccgg tcgtgcccct ctctagagat aatgagcatt 60gcatgtctaa gttataaaaa
attaccacat attttttttg tcacacttgt ttgaagtgca 120gtttatctat ctttatacat
atatttaaac tttactctac gaataatata atctatagta 180ctacaataat atcagtgttt
tagagaatca tataaatgaa cagttagaca tggtctaaag 240gacaattgag tattttgaca
acaggactct acagttttat ctttttagtg tgcatgtgtt 300ctcctttttt tttgcaaata
gcttcaccta tataatactt catccatttt attagtacat 360ccatttaggg tttagggtta
atggttttta tagactaatt tttttagtac atctatttta 420ttctatttta gcctctaaat
taagaaaact aaaactctat tttagttttt ttatttaata 480atttagatat aaaatagaat
aaaataaagt gactaaaaat taaacaaata ccctttaaga 540aattaaaaaa actaaggaaa
catttttctt gtttcgagta gataatgcca gcctgttaaa 600cgccgtcgac gagtctaacg
gacaccaacc agcgaaccag cagcgtcgcg tcgggccaag 660cgaagcagac ggcacggcat
ctctgtcgct gcctctggac ccctctcgag agttccgctc 720caccgttgga cttgctccgc
tgtcggcatc cagaaattgc gtggcggagc ggcagacgtg 780agccggcacg gcaggcggcc
tcctcctcct ctcacggcac cggcagctac gggggattcc 840tttcccaccg ctccttcgct
ttcccttcct cgcccgccgt aataaataga caccccctcc 900acaccctctt tccccaacct
cgtgttgttc ggagcgcaca cacacacaac cagatctccc 960ccaaatccac ccgtcggcac
ctccgcttca aggtacgccg ctcgtcctcc cccccccccc 1020ctctctacct tctctagatc
ggcgttccgg tccatggtta gggcccggta gttctacttc 1080tgttcatgtt tgtgttagat
ccgtgtttgt gttagatccg tgctgctagc gttcgtacac 1140ggatgcgacc tgtacgtcag
acacgttctg attgctaact tgccagtgtt tctctttggg 1200gaatcctggg atggctctag
ccgttccgca gacgggatcg atttcatgat tttttttgtt 1260tcgttgcata gggtttggtt
tgcccttttc ctttatttca atatatgccg tgcacttgtt 1320tgtcgggtca tcttttcatg
cttttttttg tcttggttgt gatgatgtgg tctggttggg 1380cggtcgttct agatcggagt
agaattctgt ttcaaactac ctggtggatt tattaatttt 1440ggatctgtat gtgtgtgcca
tacatattca tagttacgaa ttgaagatga tggatggaaa 1500tatcgatcta ggataggtat
acatgttgat gcgggtttta ctgatgcata tacagagatg 1560ctttttgttc gcttggttgt
gatgatgtgg tgtggttggg cggtcgttca ttcgttctag 1620atcggagtag aatactgttt
caaactacct ggtgtattta ttaattttgg aactgtatgt 1680gtgtgtcata catcttcata
gttacgagtt taagatggat ggaaatatcg atctaggata 1740ggtatacatg ttgatgtggg
ttttactgat gcatatacat gatggcatat gcagcatcta 1800ttcatatgct ctaaccttga
gtacctatct attataataa acaagtatgt tttataatta 1860ttttgatctt gatatacttg
gatgatggca tatgcagcag ctatatgtgg atttttttag 1920ccctgccttc atacgctatt
tatttgcttg gtactgtttc ttttgtcgat gctcaccctg 1980ttgtttggtg ttacttctgc
agggatccaa caatggacaa caaccccaac atcaacgagt 2040gcatccccta caactgcctg
agcaaccccg aggtggaggt gctgggcggc gagcgcatcg 2100agaccggcta cacccccatc
gacatcagcc tgagcctgac ccagttcctg ctgagcgagt 2160tcgtgcccgg cgccggcttc
gtgctgggcc tggtggacat catctggggc atcttcggcc 2220ccagccagtg ggacgccttc
ctggtgcaga tcgagcagtt gataaaccaa cgcatagagg 2280aattcgcccg caaccaggcc
atcagccgcc tggagggcct gagcaacctg taccaaatct 2340acgccgagag cttccgcgag
tgggaggccg accccaccaa ccccgccctg cgcgaggaga 2400tgcgcatcca gttcaacgac
atgaacagcg ccctgaccac cgccatcccc ctgttcgccg 2460tgcagaacta ccaggtgccc
ctgctgagcg tgtacgtgca ggccgccaac ctgcacctga 2520gcgtgctgcg cgacgtcagc
gtgttcggcc agcgctgggg cttcgacgcc gccaccatca 2580acagccgcta caacgacctg
acccgcctga tcggcaacta caccgaccac gccgtgcgct 2640ggtacaacac cggcctggag
cgcgtgtggg gtcccgacag ccgcgactgg atcaggtaca 2700accagttccg ccgcgagctg
accctgaccg tgctggacat cgtgagcctg ttccccaact 2760acgacagccg cacctacccc
atccgcaccg tgagccagct gacccgcgag atttacacca 2820accccgtgct ggagaacttc
gacggcagct tccgcggcag cgcccagggc atcgagggca 2880gcatccgcag cccccacctg
atggacatcc tgaacagcat caccatctac accgacgccc 2940accgcggcga gtactactgg
agcggccacc agatcatggc cagccccgtc ggcttcagcg 3000gccccgagtt caccttcccc
ctgtacggca ccatgggcaa cgctgcacct cagcagcgca 3060tcgtggcaca gctgggccag
ggagtgtacc gcaccctgag cagcaccctg taccgtcgac 3120ctttcaacat cggcatcaac
aaccagcagc tgagcgtgct ggacggcacc gagttcgcct 3180acggcaccag cagcaacctg
cccagcgccg tgtaccgcaa gagcggcacc gtggacagcc 3240tggacgagat cccccctcag
aacaacaacg tgccacctcg acagggcttc agccaccgtc 3300tgagccacgt gagcatgttc
cgcagtggct tcagcaacag cagcgtgagc atcatccgtg 3360cacctatgtt cagctggatt
caccgcagtg ccgagttcaa caacatcatc cccagcagcc 3420agatcaccca gatccccctg
accaagagca ccaacctggg cagcggcacc agcgtggtga 3480agggccccgg cttcaccggc
ggcgacatcc tgcgccgcac cagccccggc cagatcagca 3540ccctgcgcgt gaacatcacc
gcccccctga gccagcgcta ccgcgtccgc atccgctacg 3600ccagcaccac caacctgcag
ttccacacca gcatcgacgg ccgccccatc aaccagggca 3660acttcagcgc caccatgagc
agcggcagca acctgcagag cggcagcttc cgcaccgtgg 3720gcttcaccac ccccttcaac
ttcagcaacg gcagcagcgt gttcaccctg agcgcccacg 3780tgttcaacag cggcaacgag
gtgtacatcg accgcatcga gttcgtgccc gccgaggtga 3840ccttcgaggc cgagtacgac
ctggagaggg ctcagaaggc cgtgaacgag ctgttcacca 3900gcagcaacca gatcggcctg
aagaccgacg tgaccgacta ccacatcgac caggtgagca 3960acctggtgga gtgcttaagc
gacgagttct gcctggacga gaagaaggag ctgagcgaga 4020aggtgaagca cgccaagcgc
ctgagcgacg agcgcaacct gctgcaggac cccaacttcc 4080gcggcatcaa ccgccagctg
gaccgcggct ggcgaggcag caccgatatc accatccagg 4140gcggcgacga cgtgttcaag
gagaactacg tgaccctgct gggcaccttc gacgagtgct 4200accccaccta cctgtaccag
aagatcgacg agagcaagct gaaggcctac acccgctacc 4260agctgcgcgg ctacatcgag
gacagccagg acctggaaat ctacctgatc cgctacaacg 4320cgaagcacga gaccgtgaac
gtgcccggca ccggcagcct gtggcccctg agcgccccca 4380gccccatcgg caagtgcggg
gagccgaatc gatgcgctcc gcacctggag tggaacccgg 4440acctagactg cagctgcagg
gacggggaga agtgcgccca ccacagccac cacttcagcc 4500tggacatcga cgtgggctgc
accgacctga acgaggacct gggcgtgtgg gtgatcttca 4560agatcaagac ccaggacggc
cacgcccgcc tgggcaatct agagttcctg gaggagaagc 4620ccctggtggg cgaggccctg
gcccgcgtga agcgtgctga gaagaagtgg cgcgacaagc 4680gcgagaagct ggagtgggag
accaacatcg tgtacaagga ggccaaggag agcgtggacg 4740ccctgttcgt gaacagccag
tacgaccgcc tgcaggccga caccaacatc gccatgatcc 4800acgccgccga caagcgcgtg
cacagcattc gcgaggccta cctgcccgag ctgagcgtga 4860tccccggtgt gaacgccgcc
atcttcgagg aactcgaggg ccgcatcttc accgccttca 4920gcctgtacga cgcccgcaac
gtgatcaaga acggcgactt caacaacggc ctgagctgct 4980ggaacgtgaa gggccacgtg
gacgtggagg agcagaacaa ccaccgcagc gtgctggtgg 5040tgcccgagtg ggaggccgag
gtgagccagg aggtgcgcgt gtgccccggc cgcggctaca 5100tcctgcgcgt gaccgcctac
aaggagggct acggcgaggg ctgcgtgacc atccacgaga 5160tcgagaacaa caccgacgaa
ctcaagttca gcaactgcgt ggaggaggag gtttacccca 5220acaacaccgt gacctgcaac
gactacaccg cgacccagga ggagtacgaa ggcacctaca 5280cctctcgcaa caggggttac
gacggcgcct acgagtccaa cagctccgtg ccagctgact 5340acgccagcgc ctacgaggag
aaagcctaca ccgacggtag acgcgacaac ccatgtgaga 5400gcaacagagg ctacggcgac
tacacccccc tgcccgctgg atacgtgacc aaggagctgg 5460agtacttccc cgagaccgac
aaggtgtgga tcgagattgg cgagaccgag ggcaccttca 5520tcgtggacag cgtggagctg
ctgctgatgg aggagtagta gatccatctg cagatgagct 5580ctagatcccc gaatttcccc
gatcgttcaa acatttggca ataaagtttc ttaagattga 5640atcctgttgc cggtcttgcg
atgattatca tataatttct gttgaattac gttaagcatg 5700taataattaa catgtaatgc
atgacgttat ttatgagatg ggtttttatg attagagtcc 5760cgcaattata catttaatac
gcgatagaaa acaaaatata gcgcgcaaac taggataaat 5820tatcgcgcgc ggtgtcatct
atgttactag atcgggaatt gggtaccgaa ttcactggcc 5880gtcgttttac aacgtcgtga
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca 5940gcacatcccc ctttcgccag
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc 6000caacagttgc gcagcctgaa
tggcgaatgg cgcctgatgc ggtattttct ccttacgcat 6060ctgtgcggta tttcacaccg
catatggtgc actctcagta caatctgctc tgatgccgca 6120tagttaagcc agccccgaca
cccgccaaca cccgctgacg cgccctgacg ggcttgtctg 6180ctcccggcat ccgcttacag
acaagctgtg accgtctccg ggagctgcat gtgtcagagg 6240ttttcaccgt catcaccgaa
acgcgcgaga cgaaagggcc tcgtgatacg cctattttta 6300taggttaatg tcatgataat
aatggtttct tagacgtcag gtggcacttt tcggggaaat 6360gtgcgcggaa cccctatttg
tttatttttc taaatacatt caaatatgta tccgctcatg 6420agacaataac cctgataaat
gcttcaatgg cgcgccgcgg ccgcttaaga atattgaaaa 6480aggaagagta tgagtattca
acatttccgt gtcgccctta ttcccttttt tgcggcattt 6540tgccttcctg tttttgctca
cccagaaacg ctggtgaaag taaaagatgc tgaagatcag 6600ttgggtgcac gagtgggtta
catcgaactg gatctcaaca gcggtaagat ccttgagagt 6660tttcgccccg aagaacgttt
tccaatgatg agcactttta aagttctgct atgtggcgcg 6720gtattatccc gtattgacgc
cgggcaagag caactcggtc gccgcataca ctattctcag 6780aatgacttgg ttgagtactc
accagtcaca gaaaagcatc ttacggatgg catgacagta 6840agagaattat gcagtgctgc
cataaccatg agtgataaca ctgcggccaa cttacttctg 6900acaacgatcg gaggaccgaa
ggagctaacc gcttttttgc acaacatggg ggatcatgta 6960actcgccttg atcgttggga
accggagctg aatgaagcca taccaaacga cgagcgtgac 7020accacgatgc ctgtagcaat
ggcaacaacg ttgcgcaaac tattaactgg cgaactactt 7080actctagctt cccggcaaca
attaatagac tggatggagg cggataaagt tgcaggacca 7140cttctgcgct cggcccttcc
ggctggctgg tttattgctg ataaatctgg agccggtgag 7200cgtgggtctc gcggtatcat
tgcagcactg gggccagatg gtaagccctc ccgtatcgta 7260gttatctaca cgacggggag
tcaggcaact atggatgaac gaaatagaca gatcgctgag 7320ataggtgcct cactgattaa
gcattggtaa ctgtcagacc aagtttactc atatatactt 7380tagattgatt taaaacttca
tttttaattt aaaaggatct aggtgaagat cctttttgat 7440aatctcatga ccaaaatccc
ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 7500gaaaagatca aaggatcttc
ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 7560acaaaaaaac caccgctacc
agcggtggtt tgtttgccgg atcaagagct accaactctt 7620tttccgaagg taactggctt
cagcagagcg cagataccaa atactgtcct tctagtgtag 7680ccgtagttag gccaccactt
caagaactct gtagcaccgc ctacatacct cgctctgcta 7740atcctgttac cagtggctgc
tgccagtggc gataagtcgt gtcttaccgg gttggactca 7800agacgatagt taccggataa
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 7860cccagcttgg agcgaacgac
ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 7920agcgccacgc ttcccgaagg
gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga 7980acaggagagc gcacgaggga
gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc 8040gggtttcgcc acctctgact
tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc 8100ctatggaaaa acgccagcaa
cgcggccttt ttacggttcc tggccttttg ctggcctttt 8160gctcacatgt tctttcctgc
gttatcccct gattctgtgg ataaccgtat taccgccttt 8220gagtgagctg ataccgctcg
ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 8280gaagcggaag agcttaagcg
gccgcggcgc gccgcccaat acgcaaaccg cctctccccg 8340cgcgttggcc gattcattaa
tgcagctggc acgacaggtt tcccgactgg aaagcgggca 8400gtgagcgcaa cgcaattaat
gtgagttagc tcactcatta ggcaccccag gctttacact 8460ttatgcttcc ggctcgtatg
ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa 8520acagctatga ccatgattac
gccaag 85461923DNAArtificial
sequenceforward primer ABT P1 forw 19cgaccagcgc gacatgcatg gca
232023DNAArtificial sequencereverse
primer ABT P2 rev 20accccagggc gtacgacaag gcc
232129DNAArtificial sequenceoligonucleotide pABT mut1
21gatggccgga ttgggctccc ggggtggag
292229DNAArtificial sequenceoligonucleotide pABT mut2 22ctgggaggcg
cgcaaggggc agttcctcg
292330DNAArtificial sequenceoligonucleotide pABT mut3 23cccaccgccg
gagcaccgaa aggccccgcg
302427DNAArtificial sequenceoligonucleotide pABT mut4 24gtcacccggg
agcacttccc ggcgccg
272527DNAArtificial sequenceoligonucleotide pABT mut5 25cattgggccg
agcacggctt cttccgc
272631DNAArtificial sequenceoligonucleotide pABT mut6 26ggggtacggt
gttcttgagt cgtgaagcga c
312739DNAArtificial sequenceforward primer pABT amp1 27gcgtctagag
ggaccccgac cagcgcgaca tgcatggca
392836DNAArtificial sequencereverse primer pABT amp2 28accccagggc
gtacgacaag gccccaccat gggcgc
362945DNAArtificial sequenceforward primer ABT P4 29tatatagagc tcgaatcgaa
gaagccacac tgtaaatctg ccggg 453045DNAArtificial
sequencereverse primer ABT P5 30agcaaggcat atgcagcagc tgctggtcgg
accgggccct atata 453129DNAArtificial
sequenceoligonucleotide ABTt m1 31gtcatgcatg ggcatgtgaa ggaggagcc
293229DNAArtificial sequenceoligonucleotide
ABTt m2 32gttgcatgca tgctgcatgg cgtcgagat
2933873DNAZea mays 33tgggaggcgc gcatggggca gttcctcggc aagaaggcgt
acgacaaggc cgcgatcaaa 60tgcaacggta gagaggccgt gacgaacttc gagcccagca
cgtacgacgg ggagctgctg 120ctgactgctg aagctagcgc agaagttgct gacgacgttg
atctgaactt gagcatctcg 180caaccggcat cgtcccagag ccccaaaaga gacaagaact
gccttggtcc gcagctccac 240caccaccatg ggcggccgtt tgacggctcc gccgttctga
agaaaaccaa gatcgatgct 300ccgtctgagc tgtcgtcggc gggccgccct caccggtcgt
tcctccctca tctcgtggct 360gccgagcatc taccgcctcg gtctcacccc ttcttcatca
cacaccatga gagtgatgca 420tcaagaagag atcccagctg ggcagcagca gcagcatgga
aggtgaccgc agctgcacct 480cctcctccta ccaccaccct gttgccgttg ccgctgccgt
cgacgtcgtc cgctgcagca 540tcatcaggat tctccaatac cgccacgaca gctgccgccg
ccccatcggc cgcctcctcc 600cgccggttcg acccgccgcc accgtcgtcg tcctcctcct
cgagccatca ccaccaccac 660caccgccgct gagaatcgaa gaagccacac tgtaaatctg
ccgggaagcg gctggtggca 720tccggcccgc tcctccctcc gggcgccgca acttttttcg
atcggttttg cgccgcccgg 780gacgggttgt agttgatcga ttggattctt cataactgta
tttgcgtact gcttacacta 840cccaagtgaa atcgaaaatg gcgccttctc tcg
87334921DNAZea mays 34gaggcgcgca tggggcagtt
cctcggcaag aagtacatat atcttgggct attcgacagc 60gaagtagagg ctgcaagggc
gtacgacaag gccgcgatca aatgcaacgg tagagaggcc 120gtgacgaact tcgagcccag
cacgtacgac ggggagctgc tgctgactgc tgaagctagc 180gcagaagttg ctgacgacgt
tgatctgaac ttgagcatct cgcaaccggc atcgtcccag 240agccccaaaa gagacaagaa
ctgccttggt ccgcagctcc accaccacca tgggcggccg 300tttgacggct ccgccgttct
gaagaaaacc aagatcgatg ctccgtctga gctgtcgtcg 360gcgggccgcc ctcaccggtc
gttcctccct catctcgtgg ctgccgagca tctaccgcct 420cggtctcacc ccttcttcat
cacacaccat gagagtgatg catcaagaag agatcccagc 480tgggcagcag cagcagcatg
gaaggtgacc gcagctgcac ctcctcctcc taccaccacc 540ctgttgccgt tgccgctgcc
gtcgacgtcg tccgctgcag catcatcagg attctccaat 600accgccacga cagctgccgc
cgccccatcg gccgcctcct cccgccggtt cgacccgccg 660ccaccgtcgt cgtcctcctc
ctcgagccat caccaccacc accaccgccg ctgagaatcg 720aagaagccac actgtaaatc
tgccgggaag cggctggtgg catccggccc gctcctccct 780ccgggcgccg caactttttt
cgatcggttt tgcgccgccc gggacgggtt gtagttgatc 840gattggattc ttcataactg
tatttgcgta ctgcttacac tacccaagtg aaatcgaaaa 900tggcgccttc tctcgttgaa t
921352644DNAZea mays
35gcgtctagag ggaccccgac cagcgcgaca tgcatggcat ggcaaactat atatcgtcat
60catcattatt atcatctgac cctctttttt tttcactctc actcccatgt ttttattccc
120gggcggggcc gtgtgggtgt gggttgggat ggccggattg ggctcccggg gtggagaaat
180gacaaatcca ggcccgcagg cggccaccca ccaaatcgga cgacgcaggg tgcccaaatc
240aggaaggatt ttaaggttaa ccggccaccg gcggtgaccg acgccccacc ccactctcct
300tctcctattc tatctatata tcacccgcct cttttttctc cctcactccg ccacaccttc
360cctcttcttc ctcagctccg tcgcccaccg ccggagcacc gaaaggcccc gcgcccgccg
420cctttcctgt aaaaaaccca acctttagct agctaaccgc tcctcttctc cccctactcc
480ccttgcccaa atcagagaag atatttaacg gaggagggga aggagaggat atttagctga
540ttgttgattg gtggtccggg gtacggtgtt cttgagtcgt gaagcgaccg tacagtggct
600agggccgtct ccgggttgcg tgcaggatgg tcgtcagaga tcgggagtga ggaggcagct
660cgtggtcgtg gaggctaaat gtaccgcaag aacgactcgg cactctcctg tttctacctc
720ttcctcctct ggttcttctt cttgaaatag accagcgcca gccaccaggt agctacctac
780tagctagcag cccagttgcg actggggacg ggctgctgct tgcaagttgg aatcttggag
840caggagcaga ggagcgggag atggagctgg atctgaacgt ggccgaggtg gcgccggaga
900agccatcggc ggcgctggag gcgagcgact cggggtcctc gggctcgtcg gtgctgaacg
960cggaggcggc atcggcgggc ggcggggggc ccgcgccggg ggaggagggg tcaagctcga
1020cgccggccgt gctcgagttc agcatcctca ggagcgacag cgacgcggcc ggcgcggacg
1080ccgacgacgg cgacgccacg ccgtcgccac ctcgccacca ccagcagcag ctcgtcaccc
1140gggagcactt cccggcgccg cagcattggg ccgagcacgg cttcttccgc gccggcccgc
1200agcagcagcc ggacatcagg gtcctgccgc acccgcaccc gtacccgccc ccgccgccgc
1260ccgcgcagcc gcagcaggcc aagaagagcc gccgcggccc gcgctcccgc agctcgcagt
1320accgcggcgt caccttctac cgccgcaccg gccgctggga gtcccacatc tggtcagtag
1380cactgcaagc tcaccatgcg ccctttcacc taccgaccaa taatcgcttg tgattctgac
1440acccaaatgt ttcgtcttcc tgtgctgtcc tgttcctcgg aaatggcagg gattgcggga
1500agcaggtgta cttaggtgag cagcaataag cagatcgatc tgcagcataa atttcccgtt
1560attaactagt tcgtgatctc gatcgaatgg cctaattaac cgattcggtg atctggccga
1620tggccaatct acgcaggtgg attcgacact gctcatgccg ctgcaaggta acgatcaatc
1680catccatcca cccttgtcta gctaccccac cgaccggccg gattaatgga ccgctagctc
1740tcgggacggg cttgctgcag ggcgtacgac cgagcggcga tcaagttccg cggcgtcgac
1800gccgacataa acttcaacct cagcgactac gacgacgata tgaagcaggt acatacacga
1860gtgttcttgc agctagcacc gactgaaaca tctgctgaac gtacacgcat ggccctgtgc
1920accagatgaa gagcctgtcc aaggaggagt tcgttcacgc cctgcggcgg cagagcaccg
1980gcttctcccg cggcagctcc aagtacaggg gcgtcaccct gcacaagtgc ggccgctggg
2040aggcgcgcaa ggggcagttc ctcggcaaga agtaagaaac aacacttcgt ttgcaggcgc
2100tgtactttgc tgcagattat ttcatttcat ccttgcatgt gcctttcctt tccatccact
2160cacttgatgg ctgtagtctc gatagagttc gttcgttcgt acttcgcacc agatgaactc
2220ccacgcacat gatttagtac tagttttacc atgcattgtt cagtaaaagt atatgcttgc
2280ttgatcagtg gttgtttcaa tcagaagatt aaaaaaacgg aatattaata taaaaaaaag
2340gggaagtggc tagggaattc ctcagtccta gctagctagc tcaccggtgg gaacgccatg
2400cttggcttgg gtgcaggtac atatatcttg ggctattcga cagcgaagta gaggctgcaa
2460ggttgttcac ctcggacgat tctgccattt gttcatatac accatgcctt ttgatttctc
2520tcttgcaatt tctcttcttt tatcatggct tttgattccc aaagggttga gtaccgactc
2580gatattcgat tctccctgcc gtttcgtgac cccagggcgt acgacaaggc cccaccatgg
2640gcgc
2644361069DNAZea mays 36tatatagagc tcgaatcgaa gaagccacac tgtaaatctg
ccgggaagcg gctggtggca 60tccggcccgc tcctccctcc gggcgccgca acttttttcg
atcggttttg cgccgcccgg 120gacgggttgt agttgatcga ttggattctt cataactgta
tttgcgtact gcttacacta 180cccaagtgaa atcgaaaatg gcgccttctc tcgttgaata
aattgcacgt acgctactcg 240atccgctgcg gctcttgctg gagtggccgc cgccgctata
gatagaagga tcaagccaag 300gaatctgtca tgcatgggca tgtgaaggag gagcctcctg
caatgtttag tcttttttgg 360tcgacgccca ccagagatat acgcactaga tttcatatag
ctgagctaga tcgattccgt 420tgcatgcatg ctgcatggcg tcgagattcg agctagcacc
gcctgttcat catcgaccga 480tccattctga tcgattcccc tctcgagctt tcacgaactg
aacctaccta gtgagggtga 540cgcctaacgc ctagtgcgcg cgcgtgggtc tccgatgtca
gtggccgcac gcgcgcgcgc 600gttctcgaga tcgcatgtgg tcatagcgca gcaggtttgc
cctcagaacc tacagcaact 660cgaccaccgg tttggatttc ttcttttttc aaggatatga
tcggagagag agagctacct 720aggcgtcgtc cttgttttct tgtatcgcat gtggtgtggg
tctctctcct cctttcgtac 780gcacgcatga ttccattctt accccccctc gagatcgaga
ggaaatatat tgctatttta 840tacacacacg gcgcccccag ctatacgtca ctgcttacgt
taattccccc accggatagt 900agttgtttaa tggcccaaac aaaccttgtt gttgcatgca
tcatggacca aacaaaatac 960atagttagtt aaatattact gttatatata caactaataa
taattatatt attagttaaa 1020acaaagcaag gcatatgcag cagctgctgg tcggaccggg
ccctatata 1069378599DNAArtificial sequenceZmABP3-Cry1AbG6
Assembly construct 37cggcgcgccg aaagtagcaa acaacaggtt catgtgcact
ataaaaagac aaaattctcg 60agtttcatct tttattccac ataagcctta tattttccat
tttcatatga tttttagttt 120aagtttgtgt cttaactttt tcgttaatac gtaattctat
gcattatgga tgcgtgaagt 180atttttgttt aaaaaaatga aatgtcaaaa tacgttttgt
gatctatttc catgttttca 240cctaacaggt ggtttttact atatattctg ccataactct
agccttagat gtaaatcgaa 300aaaaaatgag agatgagctg gagatagcct tagatgaagc
gtctgaaata taaaagaaag 360agtaatgttg aacgcagtag gtgtagcagc tgtagttcca
tctctaggaa agggaactgc 420aatccgggct ccgggcctcg cgcaatctgg cctgtcgtgt
agatgcagcc ctgtccatga 480cggcccaagc aacgcccgcg gctctcgatc caccacggaa
cccactccga cacacactga 540cacacacatg ctggatgtgg atgtgctgtc caattattag
tagcaattcg gtaggcacag 600gcacgtactg gccggtgttt tagctgtaag taccgaacca
atcacggtta agaaccgatt 660aatccgtgcc cagccgccga gtgcgttcgt acgtgcatcg
gatgcactgc atgaattgag 720agcatcatca tatcatacgc aggagtagta cgacgccgct
gctgtcttgt ccggctaatg 780ctttgctcac agattagtcc atcgcccacg gtcggtgtgg
tgtggatcgc tgatgccact 840gctttttgtt tggtttttat tcccctgata atcctccgcg
tccctgaatg tatctattta 900ttttcattcc gaaatccctt tcacgaaaaa gaaaacgaat
aaaaagagag ttacgaatac 960gcttccggcg gcccacatca ccttccagcg aacatcgcgc
cgcgctgacg tgtcgcccat 1020cgcggccgtc catatcgcca tccgacgacc gtggaagctg
gcagcggccg ctccgttccg 1080tcgaaggggc aggtcagtca ggtcacccac acggccacac
ccgcgcgggg gatacgcggt 1140ggaaaacccg gcgaccacat caaaacacga ggcgtctccc
gcaggactgg tcactcggca 1200cgcaggcaga ggcagcacag cagcagccag ctccatccat
cctctttccc ctcctcgctt 1260cgcttcctcg gcggattcct cctccctcgg ccgtccccgt
ccccttcttc gccgcgccag 1320ctcgcccgag ttggtaaggc cccctccacc cctccgcttc
ccctcccccg ggcgcgctct 1380ggcttcctcc ccggatcggc gcggggcgtg ctggctccgc
gcctgatttc gggccttttg 1440tttccttctc gcggagcgct cgtgtaacgc ttcggatcta
gctggattca ggcgggatcg 1500cggccgctcg gcttcctcgt ggcctgattc gtggttttcc
tcggggaggg aatcctgatc 1560ggatcatcgg gattcctcgt gcggccggga cacgcttgcg
agccagaaac atagtctgcg 1620tggccgggat tccacgatct gtgatctaga cgtcgggcgc
ttcgtctatg tgctcgctgc 1680aggctgtggc gtactggcgt ggtgcgcggc cgctatggat
ccgtgcttgt ttgttcgccc 1740tgtagcgtgt gaaatcgagc tgtgtagatc tatggtctgc
gaggtgcggt ggcggtggaa 1800tctcggttga tctttacctc agcggcgcca gtgtagctcg
tgtggctgca gttcatctgc 1860gaatttggct ctcggcggct taggtcgcgg agcttggatt
atggagcacc agctgcagcg 1920tgaccctgtt ggttctcatg tggatctgtt ggctgaggtt
gcagacttca agtgccactg 1980ccattgaccg gagctgctgc acgattatac tggaatatct
agcggtagta tactctgcta 2040gtactcaata cgggtctcct gacaaatgtc tttcgtgttt
agggacctag cactctagtg 2100tcaagactat ttgctggaat atctaatatt agcagtttct
gtagtggctc agttgcagcc 2160tggtttagaa tgatggggac agttggctgt gccatgcaaa
ataaagtgtg tgaaagcaac 2220tgcctcttaa actatgggtg gtgcaagcag gttatttgaa
gggactctcc acactgtatc 2280tccagttaac tatgactgaa cttgtggtcg caggcaaacc
caccatggac aacaacccca 2340acatcaacga gtgcatcccc tacaactgcc tgagcaaccc
cgaggtggag gtgctgggcg 2400gcgagcgcat cgagaccggc tacaccccca tcgacatcag
cctgagcctg acccagttcc 2460tgctgagcga gttcgtgccc ggcgccggct tcgtgctggg
cctggtggac atcatctggg 2520gcatcttcgg ccccagccag tgggacgcct tcctggtgca
gatcgagcag ttgataaacc 2580aacgcataga ggaattcgcc cgcaaccagg ccatcagccg
cctggagggc ctgagcaacc 2640tgtaccaaat ctacgccgag agcttccgcg agtgggaggc
cgaccccacc aaccccgccc 2700tgcgcgagga gatgcgcatc cagttcaacg acatgaacag
cgccctgacc accgccatcc 2760ccctgttcgc cgtgcagaac taccaggtgc ccctgctgag
cgtgtacgtg caggccgcca 2820acctgcacct gagcgtgctg cgcgacgtca gcgtgttcgg
ccagcgctgg ggcttcgacg 2880ccgccaccat caacagccgc tacaacgacc tgacccgcct
gatcggcaac tacaccgacc 2940acgccgtgcg ctggtacaac accggcctgg agcgcgtgtg
gggtcccgac agccgcgact 3000ggatcaggta caaccagttc cgccgcgagc tgaccctgac
cgtgctggac atcgtgagcc 3060tgttccccaa ctacgacagc cgcacctacc ccatccgcac
cgtgagccag ctgacccgcg 3120agatttacac caaccccgtg ctggagaact tcgacggcag
cttccgcggc agcgcccagg 3180gcatcgaggg cagcatccgc agcccccacc tgatggacat
cctgaacagc atcaccatct 3240acaccgacgc ccaccgcggc gagtactact ggagcggcca
ccagatcatg gccagccccg 3300tcggcttcag cggccccgag ttcaccttcc ccctgtacgg
cacgatgggc aacgctgcac 3360ctcagcagcg catcgtggca cagctgggcc agggagtgta
ccgcaccctg agcagcaccc 3420tgtaccgtcg acctttcaac atcggcatca acaaccagca
gctgagcgtg ctggacggca 3480ccgagttcgc ctacggcacc agcagcaacc tgcccagcgc
cgtgtaccgc aagagcggca 3540ccgtggacag cctggacgag atcccccctc agaacaacaa
cgtgccacct cgacagggct 3600tcagccaccg tctgagccac gtgagcatgt tccgcagtgg
cttcagcaac agcagcgtga 3660gcatcatccg tgcacctatg ttcagctgga ttcaccgcag
tgccgagttc aacaacatca 3720tccccagcag ccagatcacc cagatccccc tgaccaagag
caccaacctg ggcagcggca 3780ccagcgtggt gaagggcccc ggcttcaccg gcggcgacat
cctgcgccgc accagccccg 3840gccagatcag caccctgcgc gtgaacatca ccgcccccct
gagccagcgc taccgcgtcc 3900gcatccgcta cgccagcacc accaacctgc agttccacac
cagcatcgac ggccgcccca 3960tcaaccaggg caacttcagc gccaccatga gcagcggcag
caacctgcag agcggcagct 4020tccgcaccgt gggcttcacc acccccttca acttcagcaa
cggcagcagc gtgttcaccc 4080tgagcgccca cgtgttcaac agcggcaacg aggtgtacat
cgaccgcatc gagttcgtgc 4140ccgccgaggt gaccttcgag gccgagtacg acctggagag
ggctcagaag gccgtgaacg 4200agctgttcac cagcagcaac cagatcggcc tgaagaccga
cgtgaccgac taccacatcg 4260accaggtgag caacctggtg gagtgcttaa gcgacgagtt
ctgcctggac gagaagaagg 4320agctgagcga gaaggtgaag cacgccaagc gcctgagcga
cgagcgcaac ctgctgcagg 4380accccaactt ccgcggcatc aaccgccagc tggaccgcgg
ctggcgaggc agcaccgata 4440tcaccatcca gggcggcgac gacgtgttca aggagaacta
cgtgaccctg ctgggcacct 4500tcgacgagtg ctaccccacc tacctgtacc agaagatcga
cgagagcaag ctgaaggcct 4560acacccgcta ccagctgcgc ggctacatcg aggacagcca
ggacctggaa atctacctga 4620tccgctacaa cgcgaagcac gagaccgtga acgtgcccgg
caccggcagc ctgtggcccc 4680tgagcgcccc cagccccatc ggcaagtgcc accacagcca
ccacttcagc ctggacatcg 4740acgtgggctg caccgacctg aacgaggacc tgggcgtgtg
ggtgatcttc aagatcaaga 4800cccaggacgg ccacgcccgc ctgggcaatc tagagttcct
ggaggagaag cccctggtgg 4860gcgaggccct ggcccgcgtg aagcgtgctg agaagaagtg
gcgcgacaag cgcgagaagc 4920tggagtggga gaccaacatc gtgtacaagg aggccaagga
gagcgtggac gccctgttcg 4980tgaacagcca gtacgaccgc ctgcaggccg acaccaacat
cgccatgatc cacgccgccg 5040acaagcgcgt gcacagcatt cgcgaggcct acctgcccga
gctgagcgtg atccccggtg 5100tgaacgccgc catcttcgag gaactcgagg gccgcatcta
ggagctcgca tcatgatcat 5160gcatcatgga ctcggcctac tactgtggat ttgtatgcca
ttatagactt ggtgctgtga 5220aagactgctt gatgatttgc gggtttgttg ctgtgtaaaa
aaaggtccct tggctcccag 5280aagaccatga aggttcggat ctatcatgta attccttgtt
atctgccaat tatgtatgga 5340ctatggacat gtgttgcgct gttcaactta ctactacaaa
taagtaatcg atatgttccc 5400ttcccatgtc tcggtgacaa ttgtctggag aagcttaggg
gtcgtttgtt tgggattatg 5460tctggagaaa cttattttaa actaagtgtg agttcaagtt
aagttagatt atataatcta 5520ggcagattat aattccaagc gaacaggtcc ttagtgtttt
tggaaaatcc taggtgttct 5580tttggctaca ttgttgtgtg tgcagatccc ttgttggtct
gtaagcgtgg ggaagtaaga 5640atcgtccgtt tctactgaag acctgctcga gttaggcacc
gaggatgccg gtaaccaaac 5700agagcaatag tgtctctgtg ggcacagtgg agtgtgaatc
tgtgtgatgc aaatccgtca 5760tttgtttagc aaaatttcca gcgttgcatg atgcagtttc
tttaacacgg acttaaggga 5820agggaaaaaa atgttgagcc aggagatcct tcaatgtgtt
agactgacgt gatagccaac 5880taaaccacga cgcaatgttg tcgttaatga caaaaaaact
atttgttcct aaatccttgg 5940cgacattgca tggctgtctc atgagataat ggtctcatct
cttatttatc tcttatttat 6000agccggaagt ggtagtgacc cctgcttgat tgctcgtatg
ccatctcaag ttctcaaccg 6060tgtcgagcag ccattttccc atctcaagcg catcatcgtt
tcgtttgacc tcatctgcta 6120tcctgctcct agtgcaaatc acatgcgaca gaaagtgtgg
cgcgccacta gtcccgggcc 6180catcgatgat atcagatctg gttctatagt gtcacctaaa
tcgtatgtgt atgatacata 6240aggttatgta ttaattgtag ccgcgttcta acgacaatat
gtccatatgg tgcactctca 6300gtacaatctg ctctgatgcc gcatagttaa gccagccccg
acacccgcca acacccgctg 6360acgcgccctg acgggcttgt ctgctcccgg catccgctta
cagacaagct gtgaccgtct 6420ccgggagctg catgtgtcag aggttttcac cgtcatcacc
gaaacgcgcg agacgaaagg 6480gcctcgtgat acgcctattt ttataggtta atgtcatgat
aataatggtt tcttagacgt 6540caggtggcac ttttcgggga aatgtgcgcg gaacccctat
ttgtttattt ttctaaatac 6600attcaaatat gtatccgctc atgagacaat aaccctgata
aatgcttcaa taatattgaa 6660aaaggaagag tatgagtatt caacatttcc gtgtcgccct
tattcccttt tttgcggcat 6720tttgccttcc tgtttttgct cacccagaaa cgctggtgaa
agtaaaagat gctgaagatc 6780agttgggtgc acgagtgggt tacatcgaac tggatctcaa
cagcggtaag atccttgaga 6840gttttcgccc cgaagaacgt tttccaatga tgagcacttt
taaagttctg ctatgtggcg 6900cggtattatc ccgtattgac gccgggcaag agcaactcgg
tcgccgcata cactattctc 6960agaatgactt ggttgagtac tcaccagtca cagaaaagca
tcttacggat ggcatgacag 7020taagagaatt atgcagtgct gccataacca tgagtgataa
cactgcggcc aacttacttc 7080tgacaacgat cggaggaccg aaggagctaa ccgctttttt
gcacaacatg ggggatcatg 7140taactcgcct tgatcgttgg gaaccggagc tgaatgaagc
cataccaaac gacgagcgtg 7200acaccacgat gcctgtagca atggcaacaa cgttgcgcaa
actattaact ggcgaactac 7260ttactctagc ttcccggcaa caattaatag actggatgga
ggcggataaa gttgcaggac 7320cacttctgcg ctcggccctt ccggctggct ggtttattgc
tgataaatct ggagccggtg 7380agcgtgggtc tcgcggtatc attgcagcac tggggccaga
tggtaagccc tcccgtatcg 7440tagttatcta cacgacgggg agtcaggcaa ctatggatga
acgaaataga cagatcgctg 7500agataggtgc ctcactgatt aagcattggt aactgtcaga
ccaagtttac tcatatatac 7560tttagattga tttaaaactt catttttaat ttaaaaggat
ctaggtgaag atcctttttg 7620ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
ccactgagcg tcagaccccg 7680tagaaaagat caaaggatct tcttgagatc ctttttttct
gcgcgtaatc tgctgcttgc 7740aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
ggatcaagag ctaccaactc 7800tttttccgaa ggtaactggc ttcagcagag cgcagatacc
aaatactgtt cttctagtgt 7860agccgtagtt aggccaccac ttcaagaact ctgtagcacc
gcctacatac ctcgctctgc 7920taatcctgtt accagtggct gctgccagtg gcgataagtc
gtgtcttacc gggttggact 7980caagacgata gttaccggat aaggcgcagc ggtcgggctg
aacggggggt tcgtgcacac 8040agcccagctt ggagcgaacg acctacaccg aactgagata
cctacagcgt gagctatgag 8100aaagcgccac gcttcccgaa gggagaaagg cggacaggta
tccggtaagc ggcagggtcg 8160gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
ctggtatctt tatagtcctg 8220tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
atgctcgtca ggggggcgga 8280gcctatggaa aaacgccagc aacgcggcct ttttacggtt
cctggccttt tgctggcctt 8340ttgctcacat gttctttcct gcgttatccc ctgattctgt
ggataaccgt attaccgcct 8400ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
gcgcagcgag tcagtgagcg 8460aggaagcgga agagcgccca atacgcaaac cgcctctccc
cgcgcgttgg ccgattcatt 8520aatgcaggtt aacctggctt atcgaaatta atacgactca
ctatagggag accggcctcg 8580agcagctgaa gcttgcatg
85993815162DNAArtificial sequenceZmABP3-Cry1AbG6
binary construct 38taattcctgt ggttggcatg cacatacaaa tggacgaacg gataaacctt
ttcacgccct 60tttaaatatc cgattattct aataaacgct cttttctctt aggtttaccc
gccaatatat 120cctgtcaaac actgatagtt taaactgaag gcgggaaacg acaatctgat
catgagcgga 180gaattaaggg agtcacgtta tgacccccgc cgatgacgcg ggacaagccg
ttttacgttt 240ggaactgaca gaaccgcaac gctgcaggaa ttggccgcag cggccattta
aatcaattgg 300gcgcgccaca ctttctgtcg catgtgattt gcactaggag caggatagca
gatgaggtca 360aacgaaacga tgatgcgctt gagatgggaa aatggctgct cgacacggtt
gagaacttga 420gatggcatac gagcaatcaa gcaggggtca ctaccacttc cggctataaa
taagagataa 480ataagagatg agaccattat ctcatgagac agccatgcaa tgtcgccaag
gatttaggaa 540caaatagttt ttttgtcatt aacgacaaca ttgcgtcgtg gtttagttgg
ctatcacgtc 600agtctaacac attgaaggat ctcctggctc aacatttttt tcccttccct
taagtccgtg 660ttaaagaaac tgcatcatgc aacgctggaa attttgctaa acaaatgacg
gatttgcatc 720acacagattc acactccact gtgcccacag agacactatt gctctgtttg
gttaccggca 780tcctcggtgc ctaactcgag caggtcttca gtagaaacgg acgattctta
cttccccacg 840cttacagacc aacaagggat ctgcacacac aacaatgtag ccaaaagaac
acctaggatt 900ttccaaaaac actaaggacc tgttcgcttg gaattataat ctgcctagat
tatataatct 960aacttaactt gaactcacac ttagtttaaa ataagtttct ccagacataa
tcccaaacaa 1020acgaccccta agcttctcca gacaattgtc accgagacat gggaagggaa
catatcgatt 1080acttatttgt agtagtaagt tgaacagcgc aacacatgtc catagtccat
acataattgg 1140cagataacaa ggaattacat gatagatccg aaccttcatg gtcttctggg
agccaaggga 1200ccttttttta cacagcaaca aacccgcaaa tcatcaagca gtctttcaca
gcaccaagtc 1260tataatggca tacaaatcca cagtagtagg ccgagtccat gatgcatgat
catgatgcga 1320gctcctagat gcggccctcg agttcctcga agatggcggc gttcacaccg
gggatcacgc 1380tcagctcggg caggtaggcc tcgcgaatgc tgtgcacgcg cttgtcggcg
gcgtggatca 1440tggcgatgtt ggtgtcggcc tgcaggcggt cgtactggct gttcacgaac
agggcgtcca 1500cgctctcctt ggcctccttg tacacgatgt tggtctccca ctccagcttc
tcgcgcttgt 1560cgcgccactt cttctcagca cgcttcacgc gggccagggc ctcgcccacc
aggggcttct 1620cctccaggaa ctctagattg cccaggcggg cgtggccgtc ctgggtcttg
atcttgaaga 1680tcacccacac gcccaggtcc tcgttcaggt cggtgcagcc cacgtcgatg
tccaggctga 1740agtggtggct gtggtggcac ttgccgatgg ggctgggggc gctcaggggc
cacaggctgc 1800cggtgccggg cacgttcacg gtctcgtgct tcgcgttgta gcggatcagg
tagatttcca 1860ggtcctggct gtcctcgatg tagccgcgca gctggtagcg ggtgtaggcc
ttcagcttgc 1920tctcgtcgat cttctggtac aggtaggtgg ggtagcactc gtcgaaggtg
cccagcaggg 1980tcacgtagtt ctccttgaac acgtcgtcgc cgccctggat ggtgatatcg
gtgctgcctc 2040gccagccgcg gtccagctgg cggttgatgc cgcggaagtt ggggtcctgc
agcaggttgc 2100gctcgtcgct caggcgcttg gcgtgcttca ccttctcgct cagctccttc
ttctcgtcca 2160ggcagaactc gtcgcttaag cactccacca ggttgctcac ctggtcgatg
tggtagtcgg 2220tcacgtcggt cttcaggccg atctggttgc tgctggtgaa cagctcgttc
acggccttct 2280gagccctctc caggtcgtac tcggcctcga aggtcacctc ggcgggcacg
aactcgatgc 2340ggtcgatgta cacctcgttg ccgctgttga acacgtgggc gctcagggtg
aacacgctgc 2400tgccgttgct gaagttgaag ggggtggtga agcccacggt gcggaagctg
ccgctctgca 2460ggttgctgcc gctgctcatg gtggcgctga agttgccctg gttgatgggg
cggccgtcga 2520tgctggtgtg gaactgcagg ttggtggtgc tggcgtagcg gatgcggacg
cggtagcgct 2580ggctcagggg ggcggtgatg ttcacgcgca gggtgctgat ctggccgggg
ctggtgcggc 2640gcaggatgtc gccgccggtg aagccggggc ccttcaccac gctggtgccg
ctgcccaggt 2700tggtgctctt ggtcaggggg atctgggtga tctggctgct ggggatgatg
ttgttgaact 2760cggcactgcg gtgaatccag ctgaacatag gtgcacggat gatgctcacg
ctgctgttgc 2820tgaagccact gcggaacatg ctcacgtggc tcagacggtg gctgaagccc
tgtcgaggtg 2880gcacgttgtt gttctgaggg gggatctcgt ccaggctgtc cacggtgccg
ctcttgcggt 2940acacggcgct gggcaggttg ctgctggtgc cgtaggcgaa ctcggtgccg
tccagcacgc 3000tcagctgctg gttgttgatg ccgatgttga aaggtcgacg gtacagggtg
ctgctcaggg 3060tgcggtacac tccctggccc agctgtgcca cgatgcgctg ctgaggtgca
gcgttgccca 3120tcgtgccgta cagggggaag gtgaactcgg ggccgctgaa gccgacgggg
ctggccatga 3180tctggtggcc gctccagtag tactcgccgc ggtgggcgtc ggtgtagatg
gtgatgctgt 3240tcaggatgtc catcaggtgg gggctgcgga tgctgccctc gatgccctgg
gcgctgccgc 3300ggaagctgcc gtcgaagttc tccagcacgg ggttggtgta aatctcgcgg
gtcagctggc 3360tcacggtgcg gatggggtag gtgcggctgt cgtagttggg gaacaggctc
acgatgtcca 3420gcacggtcag ggtcagctcg cggcggaact ggttgtacct gatccagtcg
cggctgtcgg 3480gaccccacac gcgctccagg ccggtgttgt accagcgcac ggcgtggtcg
gtgtagttgc 3540cgatcaggcg ggtcaggtcg ttgtagcggc tgttgatggt ggcggcgtcg
aagccccagc 3600gctggccgaa cacgctgacg tcgcgcagca cgctcaggtg caggttggcg
gcctgcacgt 3660acacgctcag caggggcacc tggtagttct gcacggcgaa cagggggatg
gcggtggtca 3720gggcgctgtt catgtcgttg aactggatgc gcatctcctc gcgcagggcg
gggttggtgg 3780ggtcggcctc ccactcgcgg aagctctcgg cgtagatttg gtacaggttg
ctcaggccct 3840ccaggcggct gatggcctgg ttgcgggcga attcctctat gcgttggttt
atcaactgct 3900cgatctgcac caggaaggcg tcccactggc tggggccgaa gatgccccag
atgatgtcca 3960ccaggcccag cacgaagccg gcgccgggca cgaactcgct cagcaggaac
tgggtcaggc 4020tcaggctgat gtcgatgggg gtgtagccgg tctcgatgcg ctcgccgccc
agcacctcca 4080cctcggggtt gctcaggcag ttgtagggga tgcactcgtt gatgttgggg
ttgttgtcca 4140tggtgggttt gcctgcgacc acaagttcag tcatagttaa ctggagatac
agtgtggaga 4200gtcccttcaa ataacctgct tgcaccaccc atagtttaag aggcagttgc
tttcacacac 4260tttattttgc atggcacagc caactgtccc catcattcta aaccaggctg
caactgagcc 4320actacagaaa ctgctaatat tagatattcc agcaaatagt cttgacacta
gagtgctagg 4380tccctaaaca cgaaagacat ttgtcaggag acccgtattg agtactagca
gagtatacta 4440ccgctagata ttccagtata atcgtgcagc agctccggtc aatggcagtg
gcacttgaag 4500tctgcaacct cagccaacag atccacatga gaaccaacag ggtcacgctg
cagctggtgc 4560tccataatcc aagctccgcg acctaagccg ccgagagcca aattcgcaga
tgaactgcag 4620ccacacgagc tacactggcg ccgctgaggt aaagatcaac cgagattcca
ccgccaccgc 4680acctcgcaga ccatagatct acacagctcg atttcacacg ctacagggcg
aacaaacaag 4740cacggatcca tagcggccgc gcaccacgcc agtacgccac agcctgcagc
gagcacatag 4800acgaagcgcc cgacgtctag atcacagatc gtggaatccc ggccacgcag
actatgtttc 4860tggctcgcaa gcgtgtcccg gccgcacgag gaatcccgat gatccgatca
ggattccctc 4920cccgaggaaa accacgaatc aggccacgag gaagccgagc ggccgcgatc
ccgcctgaat 4980ccagctagat ccgaagcgtt acacgagcgc tccgcgagaa ggaaacaaaa
ggcccgaaat 5040caggcgcgga gccagcacgc cccgcgccga tccggggagg aagccagagc
gcgcccgggg 5100gaggggaagc ggaggggtgg agggggcctt accaactcgg gcgagctggc
gcggcgaaga 5160aggggacggg gacggccgag ggaggaggaa tccgccgagg aagcgaagcg
aggaggggaa 5220agaggatgga tggagctggc tgctgctgtg ctgcctctgc ctgcgtgccg
agtgaccagt 5280cctgcgggag acgcctcgtg ttttgatgtg gtcgccgggt tttccaccgc
gtatcccccg 5340cgcgggtgtg gccgtgtggg tgacctgact gacctgcccc ttcgacggaa
cggagcggcc 5400gctgccagct tccacggtcg tcggatggcg atatggacgg ccgcgatggg
cgacacgtca 5460gcgcggcgcg atgttcgctg gaaggtgatg tgggccgccg gaagcgtatt
cgtaactctc 5520tttttattcg ttttcttttt cgtgaaaggg atttcggaat gaaaataaat
agatacattc 5580agggacgcgg aggattatca ggggaataaa aaccaaacaa aaagcagtgg
catcagcgat 5640ccacaccaca ccgaccgtgg gcgatggact aatctgtgag caaagcatta
gccggacaag 5700acagcagcgg cgtcgtacta ctcctgcgta tgatatgatg atgctctcaa
ttcatgcagt 5760gcatccgatg cacgtacgaa cgcactcggc ggctgggcac ggattaatcg
gttcttaacc 5820gtgattggtt cggtacttac agctaaaaca ccggccagta cgtgcctgtg
cctaccgaat 5880tgctactaat aattggacag cacatccaca tccagcatgt gtgtgtcagt
gtgtgtcgga 5940gtgggttccg tggtggatcg agagccgcgg gcgttgcttg ggccgtcatg
gacagggctg 6000catctacacg acaggccaga ttgcgcgagg cccggagccc ggattgcagt
tccctttcct 6060agagatggaa ctacagctgc tacacctact gcgttcaaca ttactctttc
ttttatattt 6120cagacgcttc atctaaggct atctccagct catctctcat tttttttcga
tttacatcta 6180aggctagagt tatggcagaa tatatagtaa aaaccacctg ttaggtgaaa
acatggaaat 6240agatcacaaa acgtattttg acatttcatt tttttaaaca aaaatacttc
acgcatccat 6300aatgcataga attacgtatt aacgaaaaag ttaagacaca aacttaaact
aaaaatcata 6360tgaaaatgga aaatataagg cttatgtgga ataaaagatg aaactcgaga
attttgtctt 6420tttatagtgc acatgaacct gttgtttgct actttcggcg cgccagctgc
ttgtggggac 6480cagacaaaaa aggaatggtg cagaattgtt aggcgcacct accaaaagca
tctttgcctt 6540tattgcaaag ataaagcaga ttcctctagt acaagtgggg aacaaaataa
cgtggaaaag 6600agctgtcctg acagcccact cactaatgcg tatgacgaac gcagtgacga
ccacaaaact 6660cgagactttt caacaaaggg taatatccgg aaacctcctc ggattccatt
gcccagctat 6720ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc tcctacaaat
gccatcattg 6780cgataaagga aaggctatcg ttgaagatgc ctctgccgac agtggtccca
aagatggacc 6840cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt
caaagcaagt 6900ggattgatgt gatatctcca ctgacgtaag ggatgacgaa caatcccact
atccttcggt 6960accggaccgc gatcgcttaa ttaagcttgc atgcctgcag tgcagcgtga
cccggtcgtg 7020cccctctcta gagataatga gcattgcatg tctaagttat aaaaaattac
cacatatttt 7080ttttgtcaca cttgtttgaa gtgcagttta tctatcttta tacatatatt
taaactttac 7140tctacgaata atataatcta tagtactaca ataatatcag tgttttagag
aatcatataa 7200atgaacagtt agacatggtc taaaggacaa ttgagtattt tgacaacagg
actctacagt 7260tttatctttt tagtgtgcat gtgttctcct ttttttttgc aaatagcttc
acctatataa 7320tacttcatcc attttattag tacatccatt tagggtttag ggttaatggt
ttttatagac 7380taattttttt agtacatcta ttttattcta ttttagcctc taaattaaga
aaactaaaac 7440tctattttag tttttttatt taataattta gatataaaat agaataaaat
aaagtgacta 7500aaaattaaac aaataccctt taagaaatta aaaaaactaa ggaaacattt
ttcttgtttc 7560gagtagataa tgccagcctg ttaaacgccg ccgacgagtc taacggacac
caaccagcga 7620accagcagcg tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg
tcgctgcctc 7680tggacccctc tcgagagttc cgctccaccg ttggacttgc tccgctgtcg
gcatccagaa 7740attgcgtggc ggagcggcag acgtgagccg gcacggcagg cggcctcctc
ctcctctcac 7800ggcaccggca gctacggggg attcctttcc caccgctcct tcgctttccc
ttcctcgccc 7860gccgtaataa atagacaccc cctccacacc ctctttcccc aacctcgtgt
tgttcggagc 7920gcacacacac acaaccagat ctcccccaaa tccacccgtc ggcacctccg
cttcaaggta 7980cgccgctcgt cctccccccc cccccctctc taccttctct agatcggcgt
tccggtccat 8040agttagggcc cggtagttct acttctgttc atgtttgtgt tagatccgtg
tttgtgttag 8100atccgtgctg ttagcgttcg tacacggatg cgacctgtac gtcagacacg
ttctgattgc 8160taacttgcca gtgtttctct ttggggaatc ctgggatggc tctagccgtt
ccgcagacgg 8220gatcgatttc atgatttttt ttgtttcgtt gcatagggtt tggtttgccc
ttttccttta 8280tttcaatata tgccgtgcac ttgtttgtcg ggtcatcttt tcatgctttt
ttttgtcttg 8340gttgtgatga tgtggtctgg ttgggcggtc gttctagatc ggagtagaat
tctgtttcaa 8400actacctggt ggatttatta attttggatc tgtatgtgtg tgccatacat
attcatagtt 8460acgaattgaa gatgatggat ggaaatatcg atctaggata ggtatacatg
ttgatgcggg 8520ttttactgat gcatatacag agatgctttt tgttcgcttg gttgtgatga
tgtggtgtgg 8580ttgggcggtc gttcattcgt tctagatcgg agtagaatac tgtttcaaac
tacctggtgt 8640atttattaat tttggaactg tatgtgtgtg tcatacatct tcatagttac
gagtttaaga 8700tggatggaaa tatcgatcta ggataggtat acatgttgat gtgggtttta
ctgatgcata 8760tacatgatgg catatgcagc atctattcat atgctctaac cttgagtacc
tatctattat 8820aataaacaag tatgttttat aattattttg atcttgatat acttggatga
tggcatatgc 8880agcagctata tgtggatttt tttagccctg ccttcatacg ctatttattt
gcttggtact 8940gtttcttttg tcgatgctca ccctgttgtt tggtgttact tctgcaggga
tccactagtc 9000caccatgtct ccggagagga gaccagttga gattaggcca gctacagcag
ctgatatggc 9060cgcggtttgt gatatcgtta accattacat tgagacgtct acagtgaact
ttaggacaga 9120gccacaaaca ccacaagagt ggattgatga tctagagagg ttgcaagata
gatacccttg 9180gttggttgct gaggttgagg gtgttgtggc tggtattgct tacgctgggc
cctggaaggc 9240taggaacgct tacgattgga cagttgagag tactgtttac gtgtcacata
ggcatcaaag 9300gttgggccta ggatccacat tgtacacaca tttgcttaag tctatggagg
cgcaaggttt 9360taagtctgtg gttgctgtta taggccttcc aaacgatcca tctgttaggt
tgcatgaggc 9420tttgggatac acagcccggg gtacattgcg cgcagctgga tacaagcatg
gtggatggca 9480tgatgttggt ttttggcaaa gggattttga gttgccagct cctccaaggc
cagttaggcc 9540agttacccag atctgaacta gtgatatcgg cgccatgggt cgacctgcag
atcgttcaaa 9600catttggcaa taaagtttct taagattgaa tcctgttgcc ggtcttgcga
tgattatcat 9660ataatttctg ttgaattacg ttaagcatgt aataattaac atgtaatgca
tgacgttatt 9720tatgagatgg gtttttatga ttagagtccc gcaattatac atttaatacg
cgatagaaaa 9780caaaatatag cgcgcaaact aggataaatt atcgcgcgcg gtgtcatcta
tgttactaga 9840tctgctagcc ctgcaggaaa tttaccggtg cccgggcggc cagcatggcc
gtatccgcaa 9900tgtgttatta agttgtctaa gcgtcaattt gtttacacca caatatatcc
tgccaccagc 9960cagccaacag ctccccgacc ggcagctcgg cacaaaatca ccactcgata
caggcagccc 10020atcagaatta attctcatgt ttgacagctt atcatcgact gcacggtgca
ccaatgcttc 10080tggcgtcagg cagccatcgg aagctgtggt atggctgtgc aggtcgtaaa
tcactgcata 10140attcgtgtcg ctcaaggcgc actcccgttc tggataatgt tttttgcgcc
gacatcataa 10200cggttctggc aaatattctg aaatgagctg ttgacaatta atcatccggc
tcgtataatg 10260tgtggaattg tgagcggata acaatttcac acaggaaaca gaccatgagg
gaagcgttga 10320tcgccgaagt atcgactcaa ctatcagagg tagttggcgt catcgagcgc
catctcgaac 10380cgacgttgct ggccgtacat ttgtacggct ccgcagtgga tggcggcctg
aagccacaca 10440gtgatattga tttgctggtt acggtgaccg taaggcttga tgaaacaacg
cggcgagctt 10500tgatcaacga ccttttggaa acttcggctt cccctggaga gagcgagatt
ctccgcgctg 10560tagaagtcac cattgttgtg cacgacgaca tcattccgtg gcgttatcca
gctaagcgcg 10620aactgcaatt tggagaatgg cagcgcaatg acattcttgc aggtatcttc
gagccagcca 10680cgatcgacat tgatctggct atcttgctga caaaagcaag agaacatagc
gttgccttgg 10740taggtccagc ggcggaggaa ctctttgatc cggttcctga acaggatcta
tttgaggcgc 10800taaatgaaac cttaacgcta tggaactcgc cgcccgactg ggctggcgat
gagcgaaatg 10860tagtgcttac gttgtcccgc atttggtaca gcgcagtaac cggcaaaatc
gcgccgaagg 10920atgtcgctgc cgactgggca atggagcgcc tgccggccca gtatcagccc
gtcatacttg 10980aagctaggca ggcttatctt ggacaagaag atcgcttggc ctcgcgcgca
gatcagttgg 11040aagaatttgt tcactacgtg aaaggcgaga tcaccaaagt agtcggcaaa
taaagctcta 11100gtggatctcc gtacccaggg atctggctcg cggcggacgc acgacgccgg
ggcgagacca 11160taggcgatct cctaaatcaa tagtagctgt aacctcgaag cgtttcactt
gtaacaacga 11220ttgagaattt ttgtcataaa attgaaatac ttggttcgca tttttgtcat
ccgcggtcag 11280ccgcaattct gacgaactgc ccatttagct ggagatgatt gtacatcctt
cacgtgaaaa 11340tttctcaagc gctgtgaaca agggttcaga ttttagattg aaaggtgagc
cgttgaaaca 11400cgttcttctt gtcgatgacg acgtcgctat gcggcatctt attattgaat
accttacgat 11460ccacgccttc aaagtgaccg cggtagccga cagcacccag ttcacaagag
tactctcttc 11520cgcgacggtc gatgtcgtgg ttgttgatct agatttaggt cgtgaagatg
ggctcgagat 11580cgttcgtaat ctggcggcaa agtctgatat tccaatcata attatcagtg
gcgaccgcct 11640tgaggagacg gataaagttg ttgcactcga gctaggagca agtgatttta
tcgctaagcc 11700gttcagtatc agagagtttc tagcacgcat tcgggttgcc ttgcgcgtgc
gccccaacgt 11760tgtccgctcc aaagaccgac ggtctttttg ttttactgac tggacactta
atctcaggca 11820acgtcgcttg atgtccgaag ctggcggtga ggtgaaactt acggcaggtg
agttcaatct 11880tctcctcgcg tttttagaga aaccccgcga cgttctatcg cgcgagcaac
ttctcattgc 11940cagtcgagta cgcgacgagg aggtttatga caggagtata gatgttctca
ttttgaggct 12000gcgccgcaaa cttgaggcag atccgtcaag ccctcaactg ataaaaacag
caagaggtgc 12060cggttatttc tttgacgcgg acgtgcaggt ttcgcacggg gggacgatgg
cagcctgagc 12120caattcccag atccccgagg aatcggcgtg agcggtcgca aaccatccgg
cccggtacaa 12180atcggcgcgg cgctgggtga tgacctggtg gagaagttga aggccgcgca
ggccgcccag 12240cggcaacgca tcgaggcaga agcacgcccc ggtgaatcgt ggcaagcggc
cgctgatcga 12300atccgcaaag aatcccggca accgccggca gccggtgcgc cgtcgattag
gaagccgccc 12360aagggcgacg agcaaccaga ttttttcgtt ccgatgctct atgacgtggg
cacccgcgat 12420agtcgcagca tcatggacgt ggccgttttc cgtctgtcga agcgtgaccg
acgagctggc 12480gaggtgatcc gctacgagct tccagacggg cacgtagagg tttccgcagg
gccggccggc 12540atggccagtg tgtgggatta cgacctggta ctgatggcgg tttcccatct
aaccgaatcc 12600atgaaccgat accgggaagg gaagggagac aagcccggcc gcgtgttccg
tccacacgtt 12660gcggacgtac tcaagttctg ccggcgagcc gatggcggaa agcagaaaga
cgacctggta 12720gaaacctgca ttcggttaaa caccacgcac gttgccatgc agcgtacgaa
gaaggccaag 12780aacggccgcc tggtgacggt atccgagggt gaagccttga ttagccgcta
caagatcgta 12840aagagcgaaa ccgggcggcc ggagtacatc gagatcgagc tggctgattg
gatgtaccgc 12900gagatcacag aaggcaagaa cccggacgtg ctgacggttc accccgatta
ctttttgatc 12960gatcccggca tcggccgttt tctctaccgc ctggcacgcc gcgccgcagg
caaggcagaa 13020gccagatggt tgttcaagac gatctacgaa cgcagtggca gcgccggaga
gttcaagaag 13080ttctgtttca ccgtgcgcaa gctgatcggg tcaaatgacc tgccggagta
cgatttgaag 13140gaggaggcgg ggcaggctgg cccgatccta gtcatgcgct accgcaacct
gatcgagggc 13200gaagcatccg ccggttccta atgtacggag cagatgctag ggcaaattgc
cctagcaggg 13260gaaaaaggtc gaaaaggtct ctttcctgtg gatagcacgt acattgggaa
cccaaagccg 13320tacattggga accggaaccc gtacattggg aacccaaagc cgtacattgg
gaaccggtca 13380cacatgtaag tgactgatat aaaagagaaa aaaggcgatt tttccgccta
aaactcttta 13440aaacttatta aaactcttaa aacccgcctg gcctgtgcat aactgtctgg
ccagcgcaca 13500gccgaagagc tgcaaaaagc gcctaccctt cggtcgctgc gctccctacg
ccccgccgct 13560tcgcgtcggc ctatcgcggc cgctggccgc tcaaaaatgg ctggcctacg
gccaggcaat 13620ctaccagggc gcggacaagc cgcgccgtcg ccactcgacc gccggcgctg
aggtctgcct 13680cgtgaagaag gtgttgctga ctcataccag gcctgaatcg ccccatcatc
cagccagaaa 13740gtgagggagc cacggttgat gagagctttg ttgtaggtgg accagttggt
gattttgaac 13800ttttgctttg ccacggaacg gtctgcgttg tcgggaagat gcgtgatctg
atccttcaac 13860tcagcaaaag ttcgatttat tcaacaaagc cgccgtcccg tcaagtcagc
gtaatgctct 13920gccagtgtta caaccaatta accaattctg attagaaaaa ctcatcgagc
atcaaatgaa 13980actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc
cgtttctgta 14040atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg
tatcggtctg 14100cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca
aaaataaggt 14160tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc
aaaagctctg 14220cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg
ctcttccgct 14280tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt
atcagctcac 14340tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa
gaacatgtga 14400gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc
gtttttccat 14460aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag
gtggcgaaac 14520ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt
gcgctctcct 14580gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg
aagcgtggcg 14640ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg
ctccaagctg 14700ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg
taactatcgt 14760cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac
tggtaacagg 14820attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg
gcctaactac 14880ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt
taccttcgga 14940aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg
tggttttttt 15000gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc
tttgatcttt 15060tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt
ggtcatgaga 15120ttatcaaaaa ggatcttcac ctagatcctt ttgatccgga at
151623915162DNAArtificial sequenceenhanced ZmABP3-Cry1AbG6
binary construct 39taattcctgt ggttggcatg cacatacaaa tggacgaacg gataaacctt
ttcacgccct 60tttaaatatc cgattattct aataaacgct cttttctctt aggtttaccc
gccaatatat 120cctgtcaaac actgatagtt taaactgaag gcgggaaacg acaatctgat
catgagcgga 180gaattaaggg agtcacgtta tgacccccgc cgatgacgcg ggacaagccg
ttttacgttt 240ggaactgaca gaaccgcaac gctgcaggaa ttggccgcag cggccattta
aatcaattgg 300gcgcgccaca ctttctgtcg catgtgattt gcactaggag caggatagca
gatgaggtca 360aacgaaacga tgatgcgctt gagatgggaa aatggctgct cgacacggtt
gagaacttga 420gatggcatac gagcaatcaa gcaggggtca ctaccacttc cggctataaa
taagagataa 480ataagagatg agaccattat ctcatgagac agccatgcaa tgtcgccaag
gatttaggaa 540caaatagttt ttttgtcatt aacgacaaca ttgcgtcgtg gtttagttgg
ctatcacgtc 600agtctaacac attgaaggat ctcctggctc aacatttttt tcccttccct
taagtccgtg 660ttaaagaaac tgcatcatgc aacgctggaa attttgctaa acaaatgacg
gatttgcatc 720acacagattc acactccact gtgcccacag agacactatt gctctgtttg
gttaccggca 780tcctcggtgc ctaactcgag caggtcttca gtagaaacgg acgattctta
cttccccacg 840cttacagacc aacaagggat ctgcacacac aacaatgtag ccaaaagaac
acctaggatt 900ttccaaaaac actaaggacc tgttcgcttg gaattataat ctgcctagat
tatataatct 960aacttaactt gaactcacac ttagtttaaa ataagtttct ccagacataa
tcccaaacaa 1020acgaccccta agcttctcca gacaattgtc accgagacat gggaagggaa
catatcgatt 1080acttatttgt agtagtaagt tgaacagcgc aacacatgtc catagtccat
acataattgg 1140cagataacaa ggaattacat gatagatccg aaccttcatg gtcttctggg
agccaaggga 1200ccttttttta cacagcaaca aacccgcaaa tcatcaagca gtctttcaca
gcaccaagtc 1260tataatggca tacaaatcca cagtagtagg ccgagtccat gatgcatgat
catgatgcga 1320gctcctagat gcggccctcg agttcctcga agatggcggc gttcacaccg
gggatcacgc 1380tcagctcggg caggtaggcc tcgcgaatgc tgtgcacgcg cttgtcggcg
gcgtggatca 1440tggcgatgtt ggtgtcggcc tgcaggcggt cgtactggct gttcacgaac
agggcgtcca 1500cgctctcctt ggcctccttg tacacgatgt tggtctccca ctccagcttc
tcgcgcttgt 1560cgcgccactt cttctcagca cgcttcacgc gggccagggc ctcgcccacc
aggggcttct 1620cctccaggaa ctctagattg cccaggcggg cgtggccgtc ctgggtcttg
atcttgaaga 1680tcacccacac gcccaggtcc tcgttcaggt cggtgcagcc cacgtcgatg
tccaggctga 1740agtggtggct gtggtggcac ttgccgatgg ggctgggggc gctcaggggc
cacaggctgc 1800cggtgccggg cacgttcacg gtctcgtgct tcgcgttgta gcggatcagg
tagatttcca 1860ggtcctggct gtcctcgatg tagccgcgca gctggtagcg ggtgtaggcc
ttcagcttgc 1920tctcgtcgat cttctggtac aggtaggtgg ggtagcactc gtcgaaggtg
cccagcaggg 1980tcacgtagtt ctccttgaac acgtcgtcgc cgccctggat ggtgatatcg
gtgctgcctc 2040gccagccgcg gtccagctgg cggttgatgc cgcggaagtt ggggtcctgc
agcaggttgc 2100gctcgtcgct caggcgcttg gcgtgcttca ccttctcgct cagctccttc
ttctcgtcca 2160ggcagaactc gtcgcttaag cactccacca ggttgctcac ctggtcgatg
tggtagtcgg 2220tcacgtcggt cttcaggccg atctggttgc tgctggtgaa cagctcgttc
acggccttct 2280gagccctctc caggtcgtac tcggcctcga aggtcacctc ggcgggcacg
aactcgatgc 2340ggtcgatgta cacctcgttg ccgctgttga acacgtgggc gctcagggtg
aacacgctgc 2400tgccgttgct gaagttgaag ggggtggtga agcccacggt gcggaagctg
ccgctctgca 2460ggttgctgcc gctgctcatg gtggcgctga agttgccctg gttgatgggg
cggccgtcga 2520tgctggtgtg gaactgcagg ttggtggtgc tggcgtagcg gatgcggacg
cggtagcgct 2580ggctcagggg ggcggtgatg ttcacgcgca gggtgctgat ctggccgggg
ctggtgcggc 2640gcaggatgtc gccgccggtg aagccggggc ccttcaccac gctggtgccg
ctgcccaggt 2700tggtgctctt ggtcaggggg atctgggtga tctggctgct ggggatgatg
ttgttgaact 2760cggcactgcg gtgaatccag ctgaacatag gtgcacggat gatgctcacg
ctgctgttgc 2820tgaagccact gcggaacatg ctcacgtggc tcagacggtg gctgaagccc
tgtcgaggtg 2880gcacgttgtt gttctgaggg gggatctcgt ccaggctgtc cacggtgccg
ctcttgcggt 2940acacggcgct gggcaggttg ctgctggtgc cgtaggcgaa ctcggtgccg
tccagcacgc 3000tcagctgctg gttgttgatg ccgatgttga aaggtcgacg gtacagggtg
ctgctcaggg 3060tgcggtacac tccctggccc agctgtgcca cgatgcgctg ctgaggtgca
gcgttgccca 3120tcgtgccgta cagggggaag gtgaactcgg ggccgctgaa gccgacgggg
ctggccatga 3180tctggtggcc gctccagtag tactcgccgc ggtgggcgtc ggtgtagatg
gtgatgctgt 3240tcaggatgtc catcaggtgg gggctgcgga tgctgccctc gatgccctgg
gcgctgccgc 3300ggaagctgcc gtcgaagttc tccagcacgg ggttggtgta aatctcgcgg
gtcagctggc 3360tcacggtgcg gatggggtag gtgcggctgt cgtagttggg gaacaggctc
acgatgtcca 3420gcacggtcag ggtcagctcg cggcggaact ggttgtacct gatccagtcg
cggctgtcgg 3480gaccccacac gcgctccagg ccggtgttgt accagcgcac ggcgtggtcg
gtgtagttgc 3540cgatcaggcg ggtcaggtcg ttgtagcggc tgttgatggt ggcggcgtcg
aagccccagc 3600gctggccgaa cacgctgacg tcgcgcagca cgctcaggtg caggttggcg
gcctgcacgt 3660acacgctcag caggggcacc tggtagttct gcacggcgaa cagggggatg
gcggtggtca 3720gggcgctgtt catgtcgttg aactggatgc gcatctcctc gcgcagggcg
gggttggtgg 3780ggtcggcctc ccactcgcgg aagctctcgg cgtagatttg gtacaggttg
ctcaggccct 3840ccaggcggct gatggcctgg ttgcgggcga attcctctat gcgttggttt
atcaactgct 3900cgatctgcac caggaaggcg tcccactggc tggggccgaa gatgccccag
atgatgtcca 3960ccaggcccag cacgaagccg gcgccgggca cgaactcgct cagcaggaac
tgggtcaggc 4020tcaggctgat gtcgatgggg gtgtagccgg tctcgatgcg ctcgccgccc
agcacctcca 4080cctcggggtt gctcaggcag ttgtagggga tgcactcgtt gatgttgggg
ttgttgtcca 4140tggtgggttt gcctgcgacc acaagttcag tcatagttaa ctggagatac
agtgtggaga 4200gtcccttcaa ataacctgct tgcaccaccc atagtttaag aggcagttgc
tttcacacac 4260tttattttgc atggcacagc caactgtccc catcattcta aaccaggctg
caactgagcc 4320actacagaaa ctgctaatat tagatattcc agcaaatagt cttgacacta
gagtgctagg 4380tccctaaaca cgaaagacat ttgtcaggag acccgtattg agtactagca
gagtatacta 4440ccgctagata ttccagtata atcgtgcagc agctccggtc aatggcagtg
gcacttgaag 4500tctgcaacct cagccaacag atccacatga gaaccaacag ggtcacgctg
cagctggtgc 4560tccataatcc aagctccgcg acctaagccg ccgagagcca aattcgcaga
tgaactgcag 4620ccacacgagc tacactggcg ccgctgaggt aaagatcaac cgagattcca
ccgccaccgc 4680acctcgcaga ccatagatct acacagctcg atttcacacg ctacagggcg
aacaaacaag 4740cacggatcca tagcggccgc gcaccacgcc agtacgccac agcctgcagc
gagcacatag 4800acgaagcgcc cgacgtctag atcacagatc gtggaatccc ggccacgcag
actatgtttc 4860tggctcgcaa gcgtgtcccg gccgcacgag gaatcccgat gatccgatca
ggattccctc 4920cccgaggaaa accacgaatc aggccacgag gaagccgagc ggccgcgatc
ccgcctgaat 4980ccagctagat ccgaagcgtt acacgagcgc tccgcgagaa ggaaacaaaa
ggcccgaaat 5040caggcgcgga gccagcacgc cccgcgccga tccggggagg aagccagagc
gcgcccgggg 5100gaggggaagc ggaggggtgg agggggcctt accaactcgg gcgagctggc
gcggcgaaga 5160aggggacggg gacggccgag ggaggaggaa tccgccgagg aagcgaagcg
aggaggggaa 5220agaggatgga tggagctggc tgctgctgtg ctgcctctgc ctgcgtgccg
agtgaccagt 5280cctgcgggag acgcctcgtg ttttgatgtg gtcgccgggt tttccaccgc
gtatcccccg 5340cgcgggtgtg gccgtgtggg tgacctgact gacctgcccc ttcgacggaa
cggagcggcc 5400gctgccagct tccacggtcg tcggatggcg atatggacgg ccgcgatggg
cgacacgtca 5460gcgcggcgcg atgttcgctg gaaggtgatg tgggccgccg gaagcgtatt
cgtaactctc 5520tttttattcg ttttcttttt cgtgaaaggg atttcggaat gaaaataaat
agatacattc 5580agggacgcgg aggattatca ggggaataaa aaccaaacaa aaagcagtgg
catcagcgat 5640ccacaccaca ccgaccgtgg gcgatggact aatctgtgag caaagcatta
gccggacaag 5700acagcagcgg cgtcgtacta ctcctgcgta tgatatgatg atgctctcaa
ttcatgcagt 5760gcatccgatg cacgtacgaa cgcactcggc ggctgggcac ggattaatcg
gttcttaacc 5820gtgattggtt cggtacttac agctaaaaca ccggccagta cgtgcctgtg
cctaccgaat 5880tgctactaat aattggacag cacatccaca tccagcatgt gtgtgtcagt
gtgtgtcgga 5940gtgggttccg tggtggatcg agagccgcgg gcgttgcttg ggccgtcatg
gacagggctg 6000catctacacg acaggccaga ttgcgcgagg cccggagccc ggattgcagt
tccctttcct 6060agagatggaa ctacagctgc tacacctact gcgttcaaca ttactctttc
ttttatattt 6120cagacgcttc atctaaggct atctccagct catctctcat tttttttcga
tttacatcta 6180aggctagagt tatggcagaa tatatagtaa aaaccacctg ttaggtgaaa
acatggaaat 6240agatcacaaa acgtattttg acatttcatt tttttaaaca aaaatacttc
acgcatccat 6300aatgcataga attacgtatt aacgaaaaag ttaagacaca aacttaaact
aaaaatcata 6360tgaaaatgga aaatataagg cttatgtgga ataaaagatg aaactcgaga
attttgtctt 6420tttatagtgc acatgaacct gttgtttgct actttcggcg cgccagctgc
ttgtggggac 6480cagacaaaaa aggaatggtg cagaattgtt aggcgcacct accaaaagca
tctttgcctt 6540tattgcaaag ataaagcaga ttcctctagt acaagtgggg aacaaaataa
cgtggaaaag 6600agctgtcctg acagcccact cactaatgcg tatgacgaac gcagtgacga
ccacaaaact 6660cgagactttt caacaaaggg taatatccgg aaacctcctc ggattccatt
gcccagctat 6720ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc tcctacaaat
gccatcattg 6780cgataaagga aaggctatcg ttgaagatgc ctctgccgac agtggtccca
aagatggacc 6840cccacccacg aggagcatcg tggaaaaaga agacgttcca accacgtctt
caaagcaagt 6900ggattgatgt gatatctcca ctgacgtaag ggatgacgaa caatcccact
atccttcggt 6960accggaccgc gatcgcttaa ttaagcttgc atgcctgcag tgcagcgtga
cccggtcgtg 7020cccctctcta gagataatga gcattgcatg tctaagttat aaaaaattac
cacatatttt 7080ttttgtcaca cttgtttgaa gtgcagttta tctatcttta tacatatatt
taaactttac 7140tctacgaata atataatcta tagtactaca ataatatcag tgttttagag
aatcatataa 7200atgaacagtt agacatggtc taaaggacaa ttgagtattt tgacaacagg
actctacagt 7260tttatctttt tagtgtgcat gtgttctcct ttttttttgc aaatagcttc
acctatataa 7320tacttcatcc attttattag tacatccatt tagggtttag ggttaatggt
ttttatagac 7380taattttttt agtacatcta ttttattcta ttttagcctc taaattaaga
aaactaaaac 7440tctattttag tttttttatt taataattta gatataaaat agaataaaat
aaagtgacta 7500aaaattaaac aaataccctt taagaaatta aaaaaactaa ggaaacattt
ttcttgtttc 7560gagtagataa tgccagcctg ttaaacgccg ccgacgagtc taacggacac
caaccagcga 7620accagcagcg tcgcgtcggg ccaagcgaag cagacggcac ggcatctctg
tcgctgcctc 7680tggacccctc tcgagagttc cgctccaccg ttggacttgc tccgctgtcg
gcatccagaa 7740attgcgtggc ggagcggcag acgtgagccg gcacggcagg cggcctcctc
ctcctctcac 7800ggcaccggca gctacggggg attcctttcc caccgctcct tcgctttccc
ttcctcgccc 7860gccgtaataa atagacaccc cctccacacc ctctttcccc aacctcgtgt
tgttcggagc 7920gcacacacac acaaccagat ctcccccaaa tccacccgtc ggcacctccg
cttcaaggta 7980cgccgctcgt cctccccccc cccccctctc taccttctct agatcggcgt
tccggtccat 8040agttagggcc cggtagttct acttctgttc atgtttgtgt tagatccgtg
tttgtgttag 8100atccgtgctg ttagcgttcg tacacggatg cgacctgtac gtcagacacg
ttctgattgc 8160taacttgcca gtgtttctct ttggggaatc ctgggatggc tctagccgtt
ccgcagacgg 8220gatcgatttc atgatttttt ttgtttcgtt gcatagggtt tggtttgccc
ttttccttta 8280tttcaatata tgccgtgcac ttgtttgtcg ggtcatcttt tcatgctttt
ttttgtcttg 8340gttgtgatga tgtggtctgg ttgggcggtc gttctagatc ggagtagaat
tctgtttcaa 8400actacctggt ggatttatta attttggatc tgtatgtgtg tgccatacat
attcatagtt 8460acgaattgaa gatgatggat ggaaatatcg atctaggata ggtatacatg
ttgatgcggg 8520ttttactgat gcatatacag agatgctttt tgttcgcttg gttgtgatga
tgtggtgtgg 8580ttgggcggtc gttcattcgt tctagatcgg agtagaatac tgtttcaaac
tacctggtgt 8640atttattaat tttggaactg tatgtgtgtg tcatacatct tcatagttac
gagtttaaga 8700tggatggaaa tatcgatcta ggataggtat acatgttgat gtgggtttta
ctgatgcata 8760tacatgatgg catatgcagc atctattcat atgctctaac cttgagtacc
tatctattat 8820aataaacaag tatgttttat aattattttg atcttgatat acttggatga
tggcatatgc 8880agcagctata tgtggatttt tttagccctg ccttcatacg ctatttattt
gcttggtact 8940gtttcttttg tcgatgctca ccctgttgtt tggtgttact tctgcaggga
tccactagtc 9000caccatgtct ccggagagga gaccagttga gattaggcca gctacagcag
ctgatatggc 9060cgcggtttgt gatatcgtta accattacat tgagacgtct acagtgaact
ttaggacaga 9120gccacaaaca ccacaagagt ggattgatga tctagagagg ttgcaagata
gatacccttg 9180gttggttgct gaggttgagg gtgttgtggc tggtattgct tacgctgggc
cctggaaggc 9240taggaacgct tacgattgga cagttgagag tactgtttac gtgtcacata
ggcatcaaag 9300gttgggccta ggatccacat tgtacacaca tttgcttaag tctatggagg
cgcaaggttt 9360taagtctgtg gttgctgtta taggccttcc aaacgatcca tctgttaggt
tgcatgaggc 9420tttgggatac acagcccggg gtacattgcg cgcagctgga tacaagcatg
gtggatggca 9480tgatgttggt ttttggcaaa gggattttga gttgccagct cctccaaggc
cagttaggcc 9540agttacccag atctgaacta gtgatatcgg cgccatgggt cgacctgcag
atcgttcaaa 9600catttggcaa taaagtttct taagattgaa tcctgttgcc ggtcttgcga
tgattatcat 9660ataatttctg ttgaattacg ttaagcatgt aataattaac atgtaatgca
tgacgttatt 9720tatgagatgg gtttttatga ttagagtccc gcaattatac atttaatacg
cgatagaaaa 9780caaaatatag cgcgcaaact aggataaatt atcgcgcgcg gtgtcatcta
tgttactaga 9840tctgctagcc ctgcaggaaa tttaccggtg cccgggcggc cagcatggcc
gtatccgcaa 9900tgtgttatta agttgtctaa gcgtcaattt gtttacacca caatatatcc
tgccaccagc 9960cagccaacag ctccccgacc ggcagctcgg cacaaaatca ccactcgata
caggcagccc 10020atcagaatta attctcatgt ttgacagctt atcatcgact gcacggtgca
ccaatgcttc 10080tggcgtcagg cagccatcgg aagctgtggt atggctgtgc aggtcgtaaa
tcactgcata 10140attcgtgtcg ctcaaggcgc actcccgttc tggataatgt tttttgcgcc
gacatcataa 10200cggttctggc aaatattctg aaatgagctg ttgacaatta atcatccggc
tcgtataatg 10260tgtggaattg tgagcggata acaatttcac acaggaaaca gaccatgagg
gaagcgttga 10320tcgccgaagt atcgactcaa ctatcagagg tagttggcgt catcgagcgc
catctcgaac 10380cgacgttgct ggccgtacat ttgtacggct ccgcagtgga tggcggcctg
aagccacaca 10440gtgatattga tttgctggtt acggtgaccg taaggcttga tgaaacaacg
cggcgagctt 10500tgatcaacga ccttttggaa acttcggctt cccctggaga gagcgagatt
ctccgcgctg 10560tagaagtcac cattgttgtg cacgacgaca tcattccgtg gcgttatcca
gctaagcgcg 10620aactgcaatt tggagaatgg cagcgcaatg acattcttgc aggtatcttc
gagccagcca 10680cgatcgacat tgatctggct atcttgctga caaaagcaag agaacatagc
gttgccttgg 10740taggtccagc ggcggaggaa ctctttgatc cggttcctga acaggatcta
tttgaggcgc 10800taaatgaaac cttaacgcta tggaactcgc cgcccgactg ggctggcgat
gagcgaaatg 10860tagtgcttac gttgtcccgc atttggtaca gcgcagtaac cggcaaaatc
gcgccgaagg 10920atgtcgctgc cgactgggca atggagcgcc tgccggccca gtatcagccc
gtcatacttg 10980aagctaggca ggcttatctt ggacaagaag atcgcttggc ctcgcgcgca
gatcagttgg 11040aagaatttgt tcactacgtg aaaggcgaga tcaccaaagt agtcggcaaa
taaagctcta 11100gtggatctcc gtacccaggg atctggctcg cggcggacgc acgacgccgg
ggcgagacca 11160taggcgatct cctaaatcaa tagtagctgt aacctcgaag cgtttcactt
gtaacaacga 11220ttgagaattt ttgtcataaa attgaaatac ttggttcgca tttttgtcat
ccgcggtcag 11280ccgcaattct gacgaactgc ccatttagct ggagatgatt gtacatcctt
cacgtgaaaa 11340tttctcaagc gctgtgaaca agggttcaga ttttagattg aaaggtgagc
cgttgaaaca 11400cgttcttctt gtcgatgacg acgtcgctat gcggcatctt attattgaat
accttacgat 11460ccacgccttc aaagtgaccg cggtagccga cagcacccag ttcacaagag
tactctcttc 11520cgcgacggtc gatgtcgtgg ttgttgatct agatttaggt cgtgaagatg
ggctcgagat 11580cgttcgtaat ctggcggcaa agtctgatat tccaatcata attatcagtg
gcgaccgcct 11640tgaggagacg gataaagttg ttgcactcga gctaggagca agtgatttta
tcgctaagcc 11700gttcagtatc agagagtttc tagcacgcat tcgggttgcc ttgcgcgtgc
gccccaacgt 11760tgtccgctcc aaagaccgac ggtctttttg ttttactgac tggacactta
atctcaggca 11820acgtcgcttg atgtccgaag ctggcggtga ggtgaaactt acggcaggtg
agttcaatct 11880tctcctcgcg tttttagaga aaccccgcga cgttctatcg cgcgagcaac
ttctcattgc 11940cagtcgagta cgcgacgagg aggtttatga caggagtata gatgttctca
ttttgaggct 12000gcgccgcaaa cttgaggcag atccgtcaag ccctcaactg ataaaaacag
caagaggtgc 12060cggttatttc tttgacgcgg acgtgcaggt ttcgcacggg gggacgatgg
cagcctgagc 12120caattcccag atccccgagg aatcggcgtg agcggtcgca aaccatccgg
cccggtacaa 12180atcggcgcgg cgctgggtga tgacctggtg gagaagttga aggccgcgca
ggccgcccag 12240cggcaacgca tcgaggcaga agcacgcccc ggtgaatcgt ggcaagcggc
cgctgatcga 12300atccgcaaag aatcccggca accgccggca gccggtgcgc cgtcgattag
gaagccgccc 12360aagggcgacg agcaaccaga ttttttcgtt ccgatgctct atgacgtggg
cacccgcgat 12420agtcgcagca tcatggacgt ggccgttttc cgtctgtcga agcgtgaccg
acgagctggc 12480gaggtgatcc gctacgagct tccagacggg cacgtagagg tttccgcagg
gccggccggc 12540atggccagtg tgtgggatta cgacctggta ctgatggcgg tttcccatct
aaccgaatcc 12600atgaaccgat accgggaagg gaagggagac aagcccggcc gcgtgttccg
tccacacgtt 12660gcggacgtac tcaagttctg ccggcgagcc gatggcggaa agcagaaaga
cgacctggta 12720gaaacctgca ttcggttaaa caccacgcac gttgccatgc agcgtacgaa
gaaggccaag 12780aacggccgcc tggtgacggt atccgagggt gaagccttga ttagccgcta
caagatcgta 12840aagagcgaaa ccgggcggcc ggagtacatc gagatcgagc tggctgattg
gatgtaccgc 12900gagatcacag aaggcaagaa cccggacgtg ctgacggttc accccgatta
ctttttgatc 12960gatcccggca tcggccgttt tctctaccgc ctggcacgcc gcgccgcagg
caaggcagaa 13020gccagatggt tgttcaagac gatctacgaa cgcagtggca gcgccggaga
gttcaagaag 13080ttctgtttca ccgtgcgcaa gctgatcggg tcaaatgacc tgccggagta
cgatttgaag 13140gaggaggcgg ggcaggctgg cccgatccta gtcatgcgct accgcaacct
gatcgagggc 13200gaagcatccg ccggttccta atgtacggag cagatgctag ggcaaattgc
cctagcaggg 13260gaaaaaggtc gaaaaggtct ctttcctgtg gatagcacgt acattgggaa
cccaaagccg 13320tacattggga accggaaccc gtacattggg aacccaaagc cgtacattgg
gaaccggtca 13380cacatgtaag tgactgatat aaaagagaaa aaaggcgatt tttccgccta
aaactcttta 13440aaacttatta aaactcttaa aacccgcctg gcctgtgcat aactgtctgg
ccagcgcaca 13500gccgaagagc tgcaaaaagc gcctaccctt cggtcgctgc gctccctacg
ccccgccgct 13560tcgcgtcggc ctatcgcggc cgctggccgc tcaaaaatgg ctggcctacg
gccaggcaat 13620ctaccagggc gcggacaagc cgcgccgtcg ccactcgacc gccggcgctg
aggtctgcct 13680cgtgaagaag gtgttgctga ctcataccag gcctgaatcg ccccatcatc
cagccagaaa 13740gtgagggagc cacggttgat gagagctttg ttgtaggtgg accagttggt
gattttgaac 13800ttttgctttg ccacggaacg gtctgcgttg tcgggaagat gcgtgatctg
atccttcaac 13860tcagcaaaag ttcgatttat tcaacaaagc cgccgtcccg tcaagtcagc
gtaatgctct 13920gccagtgtta caaccaatta accaattctg attagaaaaa ctcatcgagc
atcaaatgaa 13980actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc
cgtttctgta 14040atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg
tatcggtctg 14100cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca
aaaataaggt 14160tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc
aaaagctctg 14220cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg
ctcttccgct 14280tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt
atcagctcac 14340tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa
gaacatgtga 14400gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc
gtttttccat 14460aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag
gtggcgaaac 14520ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt
gcgctctcct 14580gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg
aagcgtggcg 14640ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg
ctccaagctg 14700ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg
taactatcgt 14760cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac
tggtaacagg 14820attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg
gcctaactac 14880ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt
taccttcgga 14940aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg
tggttttttt 15000gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc
tttgatcttt 15060tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt
ggtcatgaga 15120ttatcaaaaa ggatcttcac ctagatcctt ttgatccgga at
15162406472DNAArtificial sequenceZmABP3-AmCyan assembly
construct 40taatacgact cactataggg agaccggcct cgagcagctg aagcttgcat
gcggcgcgcc 60gaaagtagca aacaacaggt tcatgtgcac tataaaaaga caaaattctc
gagtttcatc 120ttttattcca cataagcctt atattttcca ttttcatatg atttttagtt
taagtttgtg 180tcttaacttt ttcgttaata cgtaattcta tgcattatgg atgcgtgaag
tatttttgtt 240taaaaaaatg aaatgtcaaa atacgttttg tgatctattt ccatgttttc
acctaacagg 300tggtttttac tatatattct gccataactc tagccttaga tgtaaatcga
aaaaaaatga 360gagatgagct ggagatagcc ttagatgaag cgtctgaaat ataaaagaaa
gagtaatgtt 420gaacgcagta ggtgtagcag ctgtagttcc atctctagga aagggaactg
caatccgggc 480tccgggcctc gcgcaatctg gcctgtcgtg tagatgcagc cctgtccatg
acggcccaag 540caacgcccgc ggctctcgat ccaccacgga acccactccg acacacactg
acacacacat 600gctggatgtg gatgtgctgt ccaattatta gtagcaattc ggtaggcaca
ggcacgtact 660ggccggtgtt ttagctgtaa gtaccgaacc aatcacggtt aagaaccgat
taatccgtgc 720ccagccgccg agtgcgttcg tacgtgcatc ggatgcactg catgaattga
gagcatcatc 780atatcatacg caggagtagt acgacgccgc tgctgtcttg tccggctaat
gctttgctca 840cagattagtc catcgcccac ggtcggtgtg gtgtggatcg ctgatgccac
tgctttttgt 900ttggttttta ttcccctgat aatcctccgc gtccctgaat gtatctattt
attttcattc 960cgaaatccct ttcacgaaaa agaaaacgaa taaaaagaga gttacgaata
cgcttccggc 1020ggcccacatc accttccagc gaacatcgcg ccgcgctgac gtgtcgccca
tcgcggccgt 1080ccatatcgcc atccgacgac cgtggaagct ggcagcggcc gctccgttcc
gtcgaagggg 1140caggtcagtc aggtcaccca cacggccaca cccgcgcggg ggatacgcgg
tggaaaaccc 1200ggcgaccaca tcaaaacacg aggcgtctcc cgcaggactg gtcactcggc
acgcaggcag 1260aggcagcaca gcagcagcca gctccatcca tcctctttcc cctcctcgct
tcgcttcctc 1320ggcggattcc tcctccctcg gccgtccccg tccccttctt cgccgcgcca
gctcgcccga 1380gttggtaagg ccccctccac ccctccgctt cccctccccc gggcgcgctc
tggcttcctc 1440cccggatcgg cgcggggcgt gctggctccg cgcctgattt cgggcctttt
gtttccttct 1500cgcggagcgc tcgtgtaacg cttcggatct agctggattc aggcgggatc
gcggccgctc 1560ggcttcctcg tggcctgatt cgtggttttc ctcggggagg gaatcctgat
cggatcatcg 1620ggattcctcg tgcggccggg acacgcttgc gagccagaaa catagtctgc
gtggccggga 1680ttccacgatc tgtgatctag acgtcgggcg cttcgtctat gtgctcgctg
caggctgtgg 1740cgtactggcg tggtgcgcgg ccgctatgga tccgtgcttg tttgttcgcc
ctgtagcgtg 1800tgaaatcgag ctgtgtagat ctatggtctg cgaggtgcgg tggcggtgga
atctcggttg 1860atctttacct cagcggcgcc agtgtagctc gtgtggctgc agttcatctg
cgaatttggc 1920tctcggcggc ttaggtcgcg gagcttggat tatggagcac cagctgcagc
gtgaccctgt 1980tggttctcat gtggatctgt tggctgaggt tgcagacttc aagtgccact
gccattgacc 2040ggagctgctg cacgattata ctggaatatc tagcggtagt atactctgct
agtactcaat 2100acgggtctcc tgacaaatgt ctttcgtgtt tagggaccta gcactctagt
gtcaagacta 2160tttgctggaa tatctaatat tagcagtttc tgtagtggct cagttgcagc
ctggtttaga 2220atgatgggga cagttggctg tgccatgcaa aataaagtgt gtgaaagcaa
ctgcctctta 2280aactatgggt ggtgcaagca ggttatttga agggactctc cacactgtat
ctccagttaa 2340ctatgactga acttgtggtc gcaggcaaac ccaccatggc cctgtccaac
aagttcatcg 2400gcgacgacat gaagatgacc taccacatgg acggctgcgt gaacggccac
tacttcaccg 2460tgaagggcga gggcagcggc aagccctacg agggcaccca gacctccacc
ttcaaggtga 2520cgatggccaa cggcggcccc ctggccttct ccttcgacat cctgtccacc
gtgttcatgt 2580acggcaaccg ctgcttcacc gcctacccca ccagcatgcc cgactacttc
aagcaggcct 2640tccccgacgg catgtcctac gagagaacct tcacctacga ggacggcggc
gtggccaccg 2700ccagctggga gatcagcctg aagggcaact gcttcgagca caagtccacc
ttccacggcg 2760tgaacttccc cgccgacggc cccgtgatgg ccaagaagac caccggctgg
gacccctcct 2820tcgagaagat gaccgtgtgc gacggcatct tgaagggcga cgtgaccgcc
ttcctgatgc 2880tgcagggcgg cggcaactac agatgccagt tccacacctc ctacaagacc
aagaagcccg 2940tgaccatgcc ccccaaccac gtggtggagc accgcatcgc cagaaccgac
ctggacaagg 3000gcggcaacag cgtgcagctg accgagcacg ccgtggccca catcacctcc
gtggtgccct 3060tctgagagct cgcatcatga tcatgcatca tggactcggc ctactactgt
ggatttgtat 3120gccattatag acttggtgct gtgaaagact gcttgatgat ttgcgggttt
gttgctgtgt 3180aaaaaaaggt cccttggctc ccagaagacc atgaaggttc ggatctatca
tgtaattcct 3240tgttatctgc caattatgta tggactatgg acatgtgttg cgctgttcaa
cttactacta 3300caaataagta atcgatatgt tcccttccca tgtctcggtg acaattgtct
ggagaagctt 3360aggggtcgtt tgtttgggat tatgtctgga gaaacttatt ttaaactaag
tgtgagttca 3420agttaagtta gattatataa tctaggcaga ttataattcc aagcgaacag
gtccttagtg 3480tttttggaaa atcctaggtg ttcttttggc tacattgttg tgtgtgcaga
tcccttgttg 3540gtctgtaagc gtggggaagt aagaatcgtc cgtttctact gaagacctgc
tcgagttagg 3600caccgaggat gccggtaacc aaacagagca atagtgtctc tgtgggcaca
gtggagtgtg 3660aatctgtgtg atgcaaatcc gtcatttgtt tagcaaaatt tccagcgttg
catgatgcag 3720tttctttaac acggacttaa gggaagggaa aaaaatgttg agccaggaga
tccttcaatg 3780tgttagactg acgtgatagc caactaaacc acgacgcaat gttgtcgtta
atgacaaaaa 3840aactatttgt tcctaaatcc ttggcgacat tgcatggctg tctcatgaga
taatggtctc 3900atctcttatt tatctcttat ttatagccgg aagtggtagt gacccctgct
tgattgctcg 3960tatgccatct caagttctca accgtgtcga gcagccattt tcccatctca
agcgcatcat 4020cgtttcgttt gacctcatct gctatcctgc tcctagtgca aatcacatgc
gacagaaagt 4080gtggcgcgcc actagtcccg ggcccatcga tgatatcaga tctggttcta
tagtgtcacc 4140taaatcgtat gtgtatgata cataaggtta tgtattaatt gtagccgcgt
tctaacgaca 4200atatgtccat atggtgcact ctcagtacaa tctgctctga tgccgcatag
ttaagccagc 4260cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc
ccggcatccg 4320cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt
tcaccgtcat 4380caccgaaacg cgcgagacga aagggcctcg tgatacgcct atttttatag
gttaatgtca 4440tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg
cgcggaaccc 4500ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga
caataaccct 4560gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat
ttccgtgtcg 4620cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca
gaaacgctgg 4680tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc
gaactggatc 4740tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca
atgatgagca 4800cttttaaagt tctgctatgt ggcgcggtat tatcccgtat tgacgccggg
caagagcaac 4860tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca
gtcacagaaa 4920agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata
accatgagtg 4980ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag
ctaaccgctt 5040ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg
gagctgaatg 5100aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca
acaacgttgc 5160gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta
atagactgga 5220tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct
ggctggttta 5280ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca
gcactggggc 5340cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag
gcaactatgg 5400atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat
tggtaactgt 5460cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt
taatttaaaa 5520ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa
cgtgagtttt 5580cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga
gatccttttt 5640ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg
gtggtttgtt 5700tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc
agagcgcaga 5760taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag
aactctgtag 5820caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc
agtggcgata 5880agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg
cagcggtcgg 5940gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac
accgaactga 6000gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga
aaggcggaca 6060ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt
ccagggggaa 6120acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag
cgtcgatttt 6180tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg
gcctttttac 6240ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta
tcccctgatt 6300ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc
agccgaacga 6360ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc
aaaccgcctc 6420tccccgcgcg ttggccgatt cattaatgca ggttaacctg gcttatcgaa
at 64724113200DNAArtificial sequenceZmABP3-AmCyan binary
construct 41aattaattcc tgtggttggc atgcacatac aaatggacga acggataaac
cttttcacgc 60ccttttaaat atccgattat tctaataaac gctcttttct cttaggttta
cccgccaata 120tatcctgtca aacactgata gtttaaactg aaggcgggaa acgacaatct
gatcatgagc 180ggagaattaa gggagtcacg ttatgacccc cgccgatgac gcgggacaag
ccgttttacg 240tttggaactg acagaaccgc aacgctgcag gaattggccg cagcggccat
ttaaatcaat 300tgggcgcgcc gaaagtagca aacaacaggt tcatgtgcac tataaaaaga
caaaattctc 360gagtttcatc ttttattcca cataagcctt atattttcca ttttcatatg
atttttagtt 420taagtttgtg tcttaacttt ttcgttaata cgtaattcta tgcattatgg
atgcgtgaag 480tatttttgtt taaaaaaatg aaatgtcaaa atacgttttg tgatctattt
ccatgttttc 540acctaacagg tggtttttac tatatattct gccataactc tagccttaga
tgtaaatcga 600aaaaaaatga gagatgagct ggagatagcc ttagatgaag cgtctgaaat
ataaaagaaa 660gagtaatgtt gaacgcagta ggtgtagcag ctgtagttcc atctctagga
aagggaactg 720caatccgggc tccgggcctc gcgcaatctg gcctgtcgtg tagatgcagc
cctgtccatg 780acggcccaag caacgcccgc ggctctcgat ccaccacgga acccactccg
acacacactg 840acacacacat gctggatgtg gatgtgctgt ccaattatta gtagcaattc
ggtaggcaca 900ggcacgtact ggccggtgtt ttagctgtaa gtaccgaacc aatcacggtt
aagaaccgat 960taatccgtgc ccagccgccg agtgcgttcg tacgtgcatc ggatgcactg
catgaattga 1020gagcatcatc atatcatacg caggagtagt acgacgccgc tgctgtcttg
tccggctaat 1080gctttgctca cagattagtc catcgcccac ggtcggtgtg gtgtggatcg
ctgatgccac 1140tgctttttgt ttggttttta ttcccctgat aatcctccgc gtccctgaat
gtatctattt 1200attttcattc cgaaatccct ttcacgaaaa agaaaacgaa taaaaagaga
gttacgaata 1260cgcttccggc ggcccacatc accttccagc gaacatcgcg ccgcgctgac
gtgtcgccca 1320tcgcggccgt ccatatcgcc atccgacgac cgtggaagct ggcagcggcc
gctccgttcc 1380gtcgaagggg caggtcagtc aggtcaccca cacggccaca cccgcgcggg
ggatacgcgg 1440tggaaaaccc ggcgaccaca tcaaaacacg aggcgtctcc cgcaggactg
gtcactcggc 1500acgcaggcag aggcagcaca gcagcagcca gctccatcca tcctctttcc
cctcctcgct 1560tcgcttcctc ggcggattcc tcctccctcg gccgtccccg tccccttctt
cgccgcgcca 1620gctcgcccga gttggtaagg ccccctccac ccctccgctt cccctccccc
gggcgcgctc 1680tggcttcctc cccggatcgg cgcggggcgt gctggctccg cgcctgattt
cgggcctttt 1740gtttccttct cgcggagcgc tcgtgtaacg cttcggatct agctggattc
aggcgggatc 1800gcggccgctc ggcttcctcg tggcctgatt cgtggttttc ctcggggagg
gaatcctgat 1860cggatcatcg ggattcctcg tgcggccggg acacgcttgc gagccagaaa
catagtctgc 1920gtggccggga ttccacgatc tgtgatctag acgtcgggcg cttcgtctat
gtgctcgctg 1980caggctgtgg cgtactggcg tggtgcgcgg ccgctatgga tccgtgcttg
tttgttcgcc 2040ctgtagcgtg tgaaatcgag ctgtgtagat ctatggtctg cgaggtgcgg
tggcggtgga 2100atctcggttg atctttacct cagcggcgcc agtgtagctc gtgtggctgc
agttcatctg 2160cgaatttggc tctcggcggc ttaggtcgcg gagcttggat tatggagcac
cagctgcagc 2220gtgaccctgt tggttctcat gtggatctgt tggctgaggt tgcagacttc
aagtgccact 2280gccattgacc ggagctgctg cacgattata ctggaatatc tagcggtagt
atactctgct 2340agtactcaat acgggtctcc tgacaaatgt ctttcgtgtt tagggaccta
gcactctagt 2400gtcaagacta tttgctggaa tatctaatat tagcagtttc tgtagtggct
cagttgcagc 2460ctggtttaga atgatgggga cagttggctg tgccatgcaa aataaagtgt
gtgaaagcaa 2520ctgcctctta aactatgggt ggtgcaagca ggttatttga agggactctc
cacactgtat 2580ctccagttaa ctatgactga acttgtggtc gcaggcaaac ccaccatggc
cctgtccaac 2640aagttcatcg gcgacgacat gaagatgacc taccacatgg acggctgcgt
gaacggccac 2700tacttcaccg tgaagggcga gggcagcggc aagccctacg agggcaccca
gacctccacc 2760ttcaaggtga cgatggccaa cggcggcccc ctggccttct ccttcgacat
cctgtccacc 2820gtgttcatgt acggcaaccg ctgcttcacc gcctacccca ccagcatgcc
cgactacttc 2880aagcaggcct tccccgacgg catgtcctac gagagaacct tcacctacga
ggacggcggc 2940gtggccaccg ccagctggga gatcagcctg aagggcaact gcttcgagca
caagtccacc 3000ttccacggcg tgaacttccc cgccgacggc cccgtgatgg ccaagaagac
caccggctgg 3060gacccctcct tcgagaagat gaccgtgtgc gacggcatct tgaagggcga
cgtgaccgcc 3120ttcctgatgc tgcagggcgg cggcaactac agatgccagt tccacacctc
ctacaagacc 3180aagaagcccg tgaccatgcc ccccaaccac gtggtggagc accgcatcgc
cagaaccgac 3240ctggacaagg gcggcaacag cgtgcagctg accgagcacg ccgtggccca
catcacctcc 3300gtggtgccct tctgagagct cgcatcatga tcatgcatca tggactcggc
ctactactgt 3360ggatttgtat gccattatag acttggtgct gtgaaagact gcttgatgat
ttgcgggttt 3420gttgctgtgt aaaaaaaggt cccttggctc ccagaagacc atgaaggttc
ggatctatca 3480tgtaattcct tgttatctgc caattatgta tggactatgg acatgtgttg
cgctgttcaa 3540cttactacta caaataagta atcgatatgt tcccttccca tgtctcggtg
acaattgtct 3600ggagaagctt aggggtcgtt tgtttgggat tatgtctgga gaaacttatt
ttaaactaag 3660tgtgagttca agttaagtta gattatataa tctaggcaga ttataattcc
aagcgaacag 3720gtccttagtg tttttggaaa atcctaggtg ttcttttggc tacattgttg
tgtgtgcaga 3780tcccttgttg gtctgtaagc gtggggaagt aagaatcgtc cgtttctact
gaagacctgc 3840tcgagttagg caccgaggat gccggtaacc aaacagagca atagtgtctc
tgtgggcaca 3900gtggagtgtg aatctgtgtg atgcaaatcc gtcatttgtt tagcaaaatt
tccagcgttg 3960catgatgcag tttctttaac acggacttaa gggaagggaa aaaaatgttg
agccaggaga 4020tccttcaatg tgttagactg acgtgatagc caactaaacc acgacgcaat
gttgtcgtta 4080atgacaaaaa aactatttgt tcctaaatcc ttggcgacat tgcatggctg
tctcatgaga 4140taatggtctc atctcttatt tatctcttat ttatagccgg aagtggtagt
gacccctgct 4200tgattgctcg tatgccatct caagttctca accgtgtcga gcagccattt
tcccatctca 4260agcgcatcat cgtttcgttt gacctcatct gctatcctgc tcctagtgca
aatcacatgc 4320gacagaaagt gtggcgcgcc gaattcgagc tcggtaccgg accgcgatcg
cttaattaag 4380cttgcatgcc tgcagtgcag cgtgacccgg tcgtgcccct ctctagagat
aatgagcatt 4440gcatgtctaa gttataaaaa attaccacat attttttttg tcacacttgt
ttgaagtgca 4500gtttatctat ctttatacat atatttaaac tttactctac gaataatata
atctatagta 4560ctacaataat atcagtgttt tagagaatca tataaatgaa cagttagaca
tggtctaaag 4620gacaattgag tattttgaca acaggactct acagttttat ctttttagtg
tgcatgtgtt 4680ctcctttttt tttgcaaata gcttcaccta tataatactt catccatttt
attagtacat 4740ccatttaggg tttagggtta atggttttta tagactaatt tttttagtac
atctatttta 4800ttctatttta gcctctaaat taagaaaact aaaactctat tttagttttt
ttatttaata 4860atttagatat aaaatagaat aaaataaagt gactaaaaat taaacaaata
ccctttaaga 4920aattaaaaaa actaaggaaa catttttctt gtttcgagta gataatgcca
gcctgttaaa 4980cgccgtcgac gagtctaacg gacaccaacc agcgaaccag cagcgtcgcg
tcgggccaag 5040cgaagcagac ggcacggcat ctctgtcgct gcctctggac ccctctcgag
agttccgctc 5100caccgttgga cttgctccgc tgtcggcatc cagaaattgc gtggcggagc
ggcagacgtg 5160agccggcacg gcaggcggcc tcctcctcct ctcacggcac cggcagctac
gggggattcc 5220tttcccaccg ctccttcgct ttcccttcct cgcccgccgt aataaataga
caccccctcc 5280acaccctctt tccccaacct cgtgttgttc ggagcgcaca cacacacaac
cagatctccc 5340ccaaatccac ccgtcggcac ctccgcttca aggtacgccg ctcgtcctcc
cccccccccc 5400ctctctacct tctctagatc ggcgttccgg tccatggtta gggcccggta
gttctacttc 5460tgttcatgtt tgtgttagat ccgtgtttgt gttagatccg tgctgctagc
gttcgtacac 5520ggatgcgacc tgtacgtcag acacgttctg attgctaact tgccagtgtt
tctctttggg 5580gaatcctggg atggctctag ccgttccgca gacgggatcg atttcatgat
tttttttgtt 5640tcgttgcata gggtttggtt tgcccttttc ctttatttca atatatgccg
tgcacttgtt 5700tgtcgggtca tcttttcatg cttttttttg tcttggttgt gatgatgtgg
tctggttggg 5760cggtcgttct agatcggagt agaattctgt ttcaaactac ctggtggatt
tattaatttt 5820ggatctgtat gtgtgtgcca tacatattca tagttacgaa ttgaagatga
tggatggaaa 5880tatcgatcta ggataggtat acatgttgat gcgggtttta ctgatgcata
tacagagatg 5940ctttttgttc gcttggttgt gatgatgtgg tgtggttggg cggtcgttca
ttcgttctag 6000atcggagtag aatactgttt caaactacct ggtgtattta ttaattttgg
aactgtatgt 6060gtgtgtcata catcttcata gttacgagtt taagatggat ggaaatatcg
atctaggata 6120ggtatacatg ttgatgtggg ttttactgat gcatatacat gatggcatat
gcagcatcta 6180ttcatatgct ctaaccttga gtacctatct attataataa acaagtatgt
tttataatta 6240ttttgatctt gatatacttg gatgatggca tatgcagcag ctatatgtgg
atttttttag 6300ccctgccttc atacgctatt tatttgcttg gtactgtttc ttttgtcgat
gctcaccctg 6360ttgtttggtg ttacttctgc agggatcccc gatcatgcaa aaactcatta
actcagtgca 6420aaactatgcc tggggcagca aaacggcgtt gactgaactt tatggtatgg
aaaatccgtc 6480cagccagccg atggccgagc tgtggatggg cgcacatccg aaaagcagtt
cacgagtgca 6540gaatgccgcc ggagatatcg tttcactgcg tgatgtgatt gagagtgata
aatcgactct 6600gctcggagag gccgttgcca aacgctttgg cgaactgcct ttcctgttca
aagtattatg 6660cgcagcacag ccactctcca ttcaggttca tccaaacaaa cacaattctg
aaatcggttt 6720tgccaaagaa aatgccgcag gtatcccgat ggatgccgcc gagcgtaact
ataaagatcc 6780taaccacaag ccggagctgg tttttgcgct gacgcctttc cttgcgatga
acgcgtttcg 6840tgaattttcc gagattgtct ccctactcca gccggtcgca ggtgcacatc
cggcgattgc 6900tcacttttta caacagcctg atgccgaacg tttaagcgaa ctgttcgcca
gcctgttgaa 6960tatgcagggt gaagaaaaat cccgcgcgct ggcgatttta aaatcggccc
tcgatagcca 7020gcagggtgaa ccgtggcaaa cgattcgttt aatttctgaa ttttacccgg
aagacagcgg 7080tctgttctcc ccgctattgc tgaatgtggt gaaattgaac cctggcgaag
cgatgttcct 7140gttcgctgaa acaccgcacg cttacctgca aggcgtggcg ctggaagtga
tggcaaactc 7200cgataacgtg ctgcgtgcgg gtctgacgcc taaatacatt gatattccgg
aactggttgc 7260caatgtgaaa ttcgaagcca aaccggctaa ccagttgttg acccagccgg
tgaaacaagg 7320tgcagaactg gacttcccga ttccagtgga tgattttgcc ttctcgctgc
atgaccttag 7380tgataaagaa accaccatta gccagcagag tgccgccatt ttgttctgcg
tcgaaggcga 7440tgcaacgttg tggaaaggtt ctcagcagtt acagcttaaa ccgggtgaat
cagcgtttat 7500tgccgccaac gaatcaccgg tgactgtcaa aggccacggc cgtttagcgc
gtgtttacaa 7560caagctgtaa gagcttactg aaaaaattaa catctcttgc taagctggga
gctcgatccg 7620tcgacctgca gatcgttcaa acatttggca ataaagtttc ttaagattga
atcctgttgc 7680cggtcttgcg atgattatca tataatttct gttgaattac gttaagcatg
taataattaa 7740catgtaatgc atgacgttat ttatgagatg ggtttttatg attagagtcc
cgcaattata 7800catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat
tatcgcgcgc 7860ggtgtcatct atgttactag atctgctagc cctgcaggaa atttaccggt
gcccgggcgg 7920ccagcatggc cgtatccgca atgtgttatt aagttgtcta agcgtcaatt
tgtttacacc 7980acaatatatc ctgccaccag ccagccaaca gctccccgac cggcagctcg
gcacaaaatc 8040accactcgat acaggcagcc catcagaatt aattctcatg tttgacagct
tatcatcgac 8100tgcacggtgc accaatgctt ctggcgtcag gcagccatcg gaagctgtgg
tatggctgtg 8160caggtcgtaa atcactgcat aattcgtgtc gctcaaggcg cactcccgtt
ctggataatg 8220ttttttgcgc cgacatcata acggttctgg caaatattct gaaatgagct
gttgacaatt 8280aatcatccgg ctcgtataat gtgtggaatt gtgagcggat aacaatttca
cacaggaaac 8340agaccatgag ggaagcgttg atcgccgaag tatcgactca actatcagag
gtagttggcg 8400tcatcgagcg ccatctcgaa ccgacgttgc tggccgtaca tttgtacggc
tccgcagtgg 8460atggcggcct gaagccacac agtgatattg atttgctggt tacggtgacc
gtaaggcttg 8520atgaaacaac gcggcgagct ttgatcaacg accttttgga aacttcggct
tcccctggag 8580agagcgagat tctccgcgct gtagaagtca ccattgttgt gcacgacgac
atcattccgt 8640ggcgttatcc agctaagcgc gaactgcaat ttggagaatg gcagcgcaat
gacattcttg 8700caggtatctt cgagccagcc acgatcgaca ttgatctggc tatcttgctg
acaaaagcaa 8760gagaacatag cgttgccttg gtaggtccag cggcggagga actctttgat
ccggttcctg 8820aacaggatct atttgaggcg ctaaatgaaa ccttaacgct atggaactcg
ccgcccgact 8880gggctggcga tgagcgaaat gtagtgctta cgttgtcccg catttggtac
agcgcagtaa 8940ccggcaaaat cgcgccgaag gatgtcgctg ccgactgggc aatggagcgc
ctgccggccc 9000agtatcagcc cgtcatactt gaagctaggc aggcttatct tggacaagaa
gatcgcttgg 9060cctcgcgcgc agatcagttg gaagaatttg ttcactacgt gaaaggcgag
atcaccaaag 9120tagtcggcaa ataaagctct agtggatctc cgtacccggg gatctggctc
gcggcggacg 9180cacgacgccg gggcgagacc ataggcgatc tcctaaatca atagtagctg
taacctcgaa 9240gcgtttcact tgtaacaacg attgagaatt tttgtcataa aattgaaata
cttggttcgc 9300atttttgtca tccgcggtca gccgcaattc tgacgaactg cccatttagc
tggagatgat 9360tgtacatcct tcacgtgaaa atttctcaag cgctgtgaac aagggttcag
attttagatt 9420gaaaggtgag ccgttgaaac acgttcttct tgtcgatgac gacgtcgcta
tgcggcatct 9480tattattgaa taccttacga tccacgcctt caaagtgacc gcggtagccg
acagcaccca 9540gttcacaaga gtactctctt ccgcgacggt cgatgtcgtg gttgttgatc
tagatttagg 9600tcgtgaagat gggctcgaga tcgttcgtaa tctggcggca aagtctgata
ttccaatcat 9660aattatcagt ggcgaccgcc ttgaggagac ggataaagtt gttgcactcg
agctaggagc 9720aagtgatttt atcgctaagc cgttcagtat cagagagttt ctagcacgca
ttcgggttgc 9780cttgcgcgtg cgccccaacg ttgtccgctc caaagaccga cggtcttttt
gttttactga 9840ctggacactt aatctcaggc aacgtcgctt gatgtccgaa gctggcggtg
aggtgaaact 9900tacggcaggt gagttcaatc ttctcctcgc gtttttagag aaaccccgcg
acgttctatc 9960gcgcgagcaa cttctcattg ccagtcgagt acgcgacgag gaggtttatg
acaggagtat 10020agatgttctc attttgaggc tgcgccgcaa acttgaggca gatccgtcaa
gccctcaact 10080gataaaaaca gcaagaggtg ccggttattt ctttgacgcg gacgtgcagg
tttcgcacgg 10140ggggacgatg gcagcctgag ccaattccca gatccccgag gaatcggcgt
gagcggtcgc 10200aaaccatccg gcccggtaca aatcggcgcg gcgctgggtg atgacctggt
ggagaagttg 10260aaggccgcgc aggccgccca gcggcaacgc atcgaggcag aagcacgccc
cggtgaatcg 10320tggcaagcgg ccgctgatcg aatccgcaaa gaatcccggc aaccgccggc
agccggtgcg 10380ccgtcgatta ggaagccgcc caagggcgac gagcaaccag attttttcgt
tccgatgctc 10440tatgacgtgg gcacccgcga tagtcgcagc atcatggacg tggccgtttt
ccgtctgtcg 10500aagcgtgacc gacgagctgg cgaggtgatc cgctacgagc ttccagacgg
gcacgtagag 10560gtttccgcag ggccggccgg catggccagt gtgtgggatt acgacctggt
actgatggcg 10620gtttcccatc taaccgaatc catgaaccga taccgggaag ggaagggaga
caagcccggc 10680cgcgtgttcc gtccacacgt tgcggacgta ctcaagttct gccggcgagc
cgatggcgga 10740aagcagaaag acgacctggt agaaacctgc attcggttaa acaccacgca
cgttgccatg 10800cagcgtacga agaaggccaa gaacggccgc ctggtgacgg tatccgaggg
tgaagccttg 10860attagccgct acaagatcgt aaagagcgaa accgggcggc cggagtacat
cgagatcgag 10920ctagctgatt ggatgtaccg cgagatcaca gaaggcaaga acccggacgt
gctgacggtt 10980caccccgatt actttttgat cgatcccggc atcggccgtt ttctctaccg
cctggcacgc 11040cgcgccgcag gcaaggcaga agccagatgg ttgttcaaga cgatctacga
acgcagtggc 11100agcgccggag agttcaagaa gttctgtttc accgtgcgca agctgatcgg
gtcaaatgac 11160ctgccggagt acgatttgaa ggaggaggcg gggcaggctg gcccgatcct
agtcatgcgc 11220taccgcaacc tgatcgaggg cgaagcatcc gccggttcct aatgtacgga
gcagatgcta 11280gggcaaattg ccctagcagg ggaaaaaggt cgaaaaggtc tctttcctgt
ggatagcacg 11340tacattggga acccaaagcc gtacattggg aaccggaacc cgtacattgg
gaacccaaag 11400ccgtacattg ggaaccggtc acacatgtaa gtgactgata taaaagagaa
aaaaggcgat 11460ttttccgcct aaaactcttt aaaacttatt aaaactctta aaacccgcct
ggcctgtgca 11520taactgtctg gccagcgcac agccgaagag ctgcaaaaag cgcctaccct
tcggtcgctg 11580cgctccctac gccccgccgc ttcgcgtcgg cctatcgcgg ccgctggccg
ctcaaaaatg 11640gctggcctac ggccaggcaa tctaccaggg cgcggacaag ccgcgccgtc
gccactcgac 11700cgccggcgct gaggtctgcc tcgtgaagaa ggtgttgctg actcatacca
ggcctgaatc 11760gccccatcat ccagccagaa agtgagggag ccacggttga tgagagcttt
gttgtaggtg 11820gaccagttgg tgattttgaa cttttgcttt gccacggaac ggtctgcgtt
gtcgggaaga 11880tgcgtgatct gatccttcaa ctcagcaaaa gttcgattta ttcaacaaag
ccgccgtccc 11940gtcaagtcag cgtaatgctc tgccagtgtt acaaccaatt aaccaattct
gattagaaaa 12000actcatcgag catcaaatga aactgcaatt tattcatatc aggattatca
ataccatatt 12060tttgaaaaag ccgtttctgt aatgaaggag aaaactcacc gaggcagttc
cataggatgg 12120caagatcctg gtatcggtct gcgattccga ctcgtccaac atcaatacaa
cctattaatt 12180tcccctcgtc aaaaataagg ttatcaagtg agaaatcacc atgagtgacg
actgaatccg 12240gtgagaatgg caaaagctct gcattaatga atcggccaac gcgcggggag
aggcggtttg 12300cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt
cgttcggctg 12360cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga
atcaggggat 12420aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg
taaaaaggcc 12480gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa
aaatcgacgc 12540tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt
tccccctgga 12600agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct
gtccgccttt 12660ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct
cagttcggtg 12720taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc
cgaccgctgc 12780gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt
atcgccactg 12840gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc
tacagagttc 12900ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat
ctgcgctctg 12960ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa
acaaaccacc 13020gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa
aaaaggatct 13080caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga
aaactcacgt 13140taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct
tttgatccgg 13200428961DNAArtificial sequenceZmABP3-AtAVP1D assembly
construct 42gggacccaaa gtagcaaaca acaggttcat gtgcactata aaaagacaaa
attctcgagt 60ttcatctttt attccacata agccttatat tttccatttt catatgattt
ttagtttaag 120tttgtgtctt aactttttcg ttaatacgta attctatgca ttatggatgc
gtgaagtatt 180tttgtttaaa aaaatgaaat gtcaaaatac gttttgtgat ctatttccat
gttttcacct 240aacaggtggt ttttactata tattctgcca taactctagc cttagatgta
aatcgaaaaa 300aaatgagaga tgagctggag atagccttag atgaagcgtc tgaaatataa
aagaaagagt 360aatgttgaac gcagtaggtg tagcagctgt agttccatct ctaggaaagg
gaactgcaat 420ccgggctccg ggcctcgcgc aatctggcct gtcgtgtaga tgcagccctg
tccatgacgg 480cccaagcaac gcccgcggct ctcgatccac cacggaaccc actccgacac
acactgacac 540acacatgctg gatgtggatg tgctgtccaa ttattagtag caattcggta
ggcacaggca 600cgtactggcc ggtgttttag ctgtaagtac cgaaccaatc acggttaaga
accgattaat 660ccgtgcccag ccgccgagtg cgttcgtacg tgcatcggat gcactgcatg
aattgagagc 720atcatcatat catacgcagg agtagtacga cgccgctgct gtcttgtccg
gctaatgctt 780tgctcacaga ttagtccatc gcccacggtc ggtgtggtgt ggatcgctga
tgccactgct 840ttttgtttgg tttttattcc cctgataatc ctccgcgtcc ctgaatgtat
ctatttattt 900tcattccgaa atccctttca cgaaaaagaa aacgaataaa aagagagtta
cgaatacgct 960tccggcggcc cacatcacct tccagcgaac atcgcgccgc gctgacgtgt
cgcccatcgc 1020ggccgtccat atcgccatcc gacgaccgtg gaagctggca gcggccgctc
cgttccgtcg 1080aaggggcagg tcagtcaggt cacccacacg gccacacccg cgcgggggat
acgcggtgga 1140aaacccggcg accacatcaa aacacgaggc gtctcccgca ggactggtca
ctcggcacgc 1200aggcagaggc agcacagcag cagccagctc catccatcct ctttcccctc
ctcgcttcgc 1260ttcctcggcg gattcctcct ccctcggccg tccccgtccc cttcttcgcc
gcgccagctc 1320gcccgagttg gtaaggcccc ctccacccct ccgcttcccc tcccccgggc
gcgctctggc 1380ttcctccccg gatcggcgcg gggcgtgctg gctccgcgcc tgatttcggg
ccttttgttt 1440ccttctcgcg gagcgctcgt gtaacgcttc ggatctagct ggattcaggc
gggatcgcgg 1500ccgctcggct tcctcgtggc ctgattcgtg gttttcctcg gggagggaat
cctgatcgga 1560tcatcgggat tcctcgtgcg gccgggacac gcttgcgagc cagaaacata
gtctgcgtgg 1620ccgggattcc acgatctgtg atctagacgt cgggcgcttc gtctatgtgc
tcgctgcagg 1680ctgtggcgta ctggcgtggt gcgcggccgc tatggatccg tgcttgtttg
ttcgccctgt 1740agcgtgtgaa atcgagctgt gtagatctat ggtctgcgag gtgcggtggc
ggtggaatct 1800cggttgatct ttacctcagc ggcgccagtg tagctcgtgt ggctgcagtt
catctgcgaa 1860tttggctctc ggcggcttag gtcgcggagc ttggattatg gagcaccagc
tgcagcgtga 1920ccctgttggt tctcatgtgg atctgttggc tgaggttgca gacttcaagt
gccactgcca 1980ttgaccggag ctgctgcacg attatactgg aatatctagc ggtagtatac
tctgctagta 2040ctcaatacgg gtctcctgac aaatgtcttt cgtgtttagg gacctagcac
tctagtgtca 2100agactatttg ctggaatatc taatattagc agtttctgta gtggctcagt
tgcagcctgg 2160tttagaatga tggggacagt tggctgtgcc atgcaaaata aagtgtgtga
aagcaactgc 2220ctcttaaact atgggtggtg caagcaggtt atttgaaggg actctccaca
ctgtatctcc 2280agttaacttt gactgaactt gtggtcgcag gcaaacccac catggttgca
ccagcattgc 2340ttccggaact gtggacggag atactggtcc caatctgcgc tgtgatcggc
atagccttca 2400gcctgttcca gtggtacgtc gtgtcaaggg tgaagctcac gagcgacttg
ggagccagta 2460gtagcggagg ggcgaacaac gggaagaacg gctatggcga ctatctgatc
gaggaggaag 2520agggtgtgaa cgaccaatca gtggtggcga agtgtgcgga gattcagacc
gccattagcg 2580agggagctac gagcttcctg tttacggagt acaagtacgt gggcgtcttc
atgatcttct 2640tcgctgccgt catcttcgtg ttcctgggtt ctgtcgaagg cttctccacc
gacaacaagc 2700cgtgcactta cgacaccacc agaacctgca aacctgcact ggccactgct
gcgttctcca 2760ccatagcgtt cgtgcttggt gctgtgacaa gcgtcctgag tggcttcttg
gggatgaaga 2820tcgctaccta cgccaatgcc agaaccacac tggaggcaag gaaaggtgtc
gggaaagcct 2880tcatcgtggc ctttcggagt ggtgctgtca tgggcttcct gcttgctgcc
agtggattgc 2940tcgtgctcta catcaccatc aacgtgttca agatctacta cggcgacgat
tgggaagggc 3000tcttcgacgc aatcactggc tatgggttgg gtggctcttc aatggcgctc
ttcggaagag 3060tgggaggtgg catctacacg aaagcggctg atgtgggagc tgacctggtc
gggaagatcg 3120agcgcaacat cccggaagat gacccaagga acccagcagt gatcgccgac
aatgtcggcg 3180acaatgtcgg tgacatagcg ggtatgggaa gcgacctctt tggctcatac
gccgaagcca 3240gctgcgcagc gcttgttgtc gcctccatct ccagcttcgg gatcaaccac
gacttcacag 3300ccatgtgcta tcccctcctg atcagcagca tgggcatact ggtgtgcctc
atcaccacgc 3360tgtttgcgac cgacttcttc gagatcaagc tggtgaagga gatcgaacct
gcgctgaaga 3420accagctgat catctcgacc gtgatcatga ccgttgggat cgccatcgtc
tcatgggtgg 3480gtcttcctac ctcgttcacc atcttcaact ttggcactca gaaggtggtg
aagaactggc 3540agctcttcct ctgcgtttgc gtcggacttt gggctgggct gatcatcggc
tttgtcacgg 3600agtactacac ctccaacgcc tacagtcctg tgcaggatgt ggccgattct
tgccgtactg 3660gtgctgcaac gaacgtcatc ttcggtcttg cactgggcta caagtcggtc
atcatcccca 3720tcttcgccat tgccatctcc atcttcgtga gcttctcgtt cgcagccatg
tacggtgttg 3780ccgttgctgc attgggcatg ctctccacca tcgctactgg cctcgctatt
gacgcgtatg 3840gtccgatttc ggacaatgct ggagggattg ccgagatggc tgggatgtcg
cacaggatca 3900gagagcgtac ggatgcactg gatgctgcag ggaacactac cgctgccatt
ggcaagggct 3960ttgccatagg gtctgctgca ctcgttagcc tggccttgtt tggcgctttc
gtgtcgagag 4020ctggcatcca cacagtggac gttctgactc ccaaggtgat catcggactt
ctggtgggag 4080ctatgctccc gtactggttc tctgcgatga cgatgaagtc ggtcggatca
gcagcgctga 4140agatggtcga ggaggttagg aggcagttca acacgatccc cggattgatg
gagggcacag 4200ctaagccgga ctatgctacc tgcgtgaaga tctccacaga cgcctccatc
aaggagatga 4260tccctccagg gtgcctggtg atgcttactc cgctgattgt gggcttcttc
ttcggcgtgg 4320agacactttc cggcgtgttg gcaggaagcc tcgtgagtgg agtgcagatc
gcgatcagtg 4380ccagcaatac tggaggggca tgggacaacg cgaagaagta catcgaagcc
ggcgtctcag 4440aacacgcgaa gtctctgggt ccgaaagggt cagaacccca taaggccgct
gtgatcggcg 4500atacgattgg cgatcccttg aaggacactt ctggcccatc cctcaacatc
ctgatcaagc 4560tcatggcagt ggagagcctc gttttcgcgc ctttcttcgc gactcatggt
ggcatcctgt 4620tcaagtactt ctagagctcg catcatgatc atgcatcatg gactcggcct
actactgtgg 4680atttgtatgc cattatagac ttggtgctgt gaaagactgc ttgatgattt
gcgggtttgt 4740tgctgtgtaa aaaaaggtcc cttggctccc agaagaccat gaaggttcgg
atctatcatg 4800taattccttg ttatctgcca attatgtatg gactatggac atgtgttgcg
ctgttcaact 4860tactactaca aataagtaat cgatatgttc ccttcccatg tctcggtgac
aattgtctgg 4920agaagcttag gggtcgtttg tttgggatta tgtctggaga aacttatttt
aaactaagtg 4980tgagttcaag ttaagttaga ttatataatc taggcagatt ataattccaa
gcgaacaggt 5040ccttagtgtt tttggaaaat cctaggtgtt cttttggcta cattgttgtg
tgtgcagatc 5100ccttgttggt ctgtaagcgt ggggaagtaa gaatcgtccg tttctactga
agacctgctc 5160gagttaggca ccgaggatgc cggtaaccaa acagagcaat agtgtctctg
tgggcacagt 5220ggagtgtgaa tctgtgtgat gcaaatccgt catttgttta gcaaaatttc
cagcgttgca 5280tgatgcagtt tctttaacac ggacttaagg gaagggaaaa aaatgttgag
ccaggagatc 5340cttcaatgtg ttagactgac gtgatagcca actaaaccac gacgcaatgt
tgtcgttaat 5400gacaaaaaaa ctatttgttc ctaaatcctt ggcgacattg catggctgtc
tcatgagata 5460atggtctcat ctcttattta tctcttattt atagccggaa gtggtagtga
cccctgcttg 5520attgctcgta tgccatctca agttctcaac cgtgtcgagc agccattttc
ccatctcaag 5580cgcatcatcg tttcgtttga cctcatctgc tatcctgctc ctagtgcaaa
tcacatgcga 5640cagaaagtgt cggaccgcag ctccagcttt tgttcccttt agtgagggtt
aattgcgcgc 5700ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct
cacaattcca 5760cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg
agtgagctaa 5820ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct
gtcgtgccag 5880ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg
gcgctcttcc 5940gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc
ggtatcagct 6000cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg
aaagaacatg 6060tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct
ggcgtttttc 6120cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca
gaggtggcga 6180aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct
cgtgcgctct 6240cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc
gggaagcgtg 6300gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt
tcgctccaag 6360ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc
cggtaactat 6420cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc
cactggtaac 6480aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg
gtggcctaac 6540tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc
agttaccttc 6600ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag
cggtggtttt 6660tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga
tcctttgatc 6720ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat
tttggtcatg 6780agattatcaa aaaggatctt cacctagatc cttttcgacc gaataaatac
ctgtgacgga 6840agatcacttc gcagaataaa taaatcctgg tgtccctgtt gataccggga
agccctgggc 6900caacttttgg cgaaaatgag acgttgatcg gcacgtaaga ggttccaact
ttcaccataa 6960tgaaataaga tcactaccgg gcgtattttt tgagttgtcg agattttcag
gagctaagga 7020agctaaaatg gagaaaaaaa tcactggata taccaccgtt gatatatccc
aatggcatcg 7080taaagaacat tttgaggcat ttcagtcagt tgctcaatgt acctataacc
agaccgttca 7140gctggatatt acggcctttt taaagaccgt aaagaaaaat aagcacaagt
tttatccggc 7200ctttattcac attcttgccc gcctgatgaa tgctcatccg gaattacgta
tggcaatgaa 7260agacggtgag ctggtgatat gggatagtgt tcacccttgt tacaccgttt
tccatgagca 7320aactgaaacg ttttcatcgc tctggagtga ataccacgac gatttccggc
agtttctaca 7380catatattcg caagatgtgg cgtgttacgg tgaaaacctg gcctatttcc
ctaaagggtt 7440tattgagaat atgtttttcg tctcagccaa tccctgggtg agtttcacca
gttttgattt 7500aaacgtggcc aatatggaca acttcttcgc ccccgttttc actatgggca
aatattatac 7560gcaaggcgac aaggtgctga tgccgctggc gattcaggtt catcatgccg
tttgtgatgg 7620cttccatgtc ggcagaatgc ttaatgaatt acaacagtac tgcgatgagt
ggcagggcgg 7680ggcgtaattt ttttaaggca gttattggtg cccttaaacg cctggttgct
acgcctgaat 7740aagtgataat aagcggatga atggcagaaa ttcgaaagca aattcgaccc
ggtcgtcggt 7800tcagggcagg gtcgttaaat agccgcttat gtctattgct ggtttaccgg
tttattgact 7860accggaagca gtgtgaccgt gtgcttctca aatgcctgag gccagtttgc
tcaggctctc 7920cccgtggagg taataattga cgatatgatc ctttttttct gatcaaaagt
gctcatcatt 7980ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag
atccagttcg 8040atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac
cagcgtttct 8100gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc
gacacggaaa 8160tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca
agggttattg 8220tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag
gggttccgcg 8280cacatttccc cgaaaagtgc cacctaaatt gtaagcgtta atattttgtt
aaaattcgcg 8340ttaaattttt gttaaatcag ctcatttttt aaccaatagg ccgaaatcgg
caaaatccct 8400tataaatcaa aagaatagac cgagataggg ttgagtgttg ttccagtttg
gaacaagagt 8460ccactattaa agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta
tcagggcgat 8520ggcccactac gtgaaccatc accctaatca agttttttgg ggtcgaggtg
ccgtaaagca 8580ctaaatcgga accctaaagg gagcccccga tttagagctt gacggggaaa
gccggcgaac 8640gtggcgagaa aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct
ggcaagtgta 8700gcggtcacgc tgcgcgtaac caccacaccc gccgcgctta atgcgccgct
acagggcgcg 8760tcccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg
ggcctcttcg 8820ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg
ggtaacgcca 8880gggttttccc agtcacgacg ttgtaaaacg acggccagtg agcgcgcgta
atacgactca 8940ctatagggcg aattgggtac g
89614315301DNAArtificial sequenceZmABP3-AtAVP1D binary
construct 43aattcctgtg gttggcatgc acatacaaat ggacgaacgg ataaaccttt
tcacgccctt 60ttaaatatcc gattattcta ataaacgctc ttttctctta ggtttacccg
ccaatatatc 120ctgtcaaaca ctgatagttt aaactgaagg cgggaaacga caatctgatc
atgagcggag 180aattaaggga gtcacgttat gacccccgcc gatgacgcgg gacaagccgt
tttacgtttg 240gaactgacag aaccgcaacg ctgcaggaat tggccgcagc ggccatttaa
atcaattggg 300cgcgccagct gcttgtgggg accagacaaa aaaggaatgg tgcagaattg
ttaggcgcac 360ctaccaaaag catctttgcc tttattgcaa agataaagca gattcctcta
gtacaagtgg 420ggaacaaaat aacgtggaaa agagctgtcc tgacagccca ctcactaatg
cgtatgacga 480acgcagtgac gaccacaaaa ctcgagactt ttcaacaaag ggtaatatcc
ggaaacctcc 540tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa
aaggaaggtg 600gctcctacaa atgccatcat tgcgataaag gaaaggctat cgttgaagat
gcctctgccg 660acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa
gaagacgttc 720caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta
agggatgacg 780aacaatccca ctatccttcg gtaccggacc caaagtagca aacaacaggt
tcatgtgcac 840tataaaaaga caaaattctc gagtttcatc ttttattcca cataagcctt
atattttcca 900ttttcatatg atttttagtt taagtttgtg tcttaacttt ttcgttaata
cgtaattcta 960tgcattatgg atgcgtgaag tatttttgtt taaaaaaatg aaatgtcaaa
atacgttttg 1020tgatctattt ccatgttttc acctaacagg tggtttttac tatatattct
gccataactc 1080tagccttaga tgtaaatcga aaaaaaatga gagatgagct ggagatagcc
ttagatgaag 1140cgtctgaaat ataaaagaaa gagtaatgtt gaacgcagta ggtgtagcag
ctgtagttcc 1200atctctagga aagggaactg caatccgggc tccgggcctc gcgcaatctg
gcctgtcgtg 1260tagatgcagc cctgtccatg acggcccaag caacgcccgc ggctctcgat
ccaccacgga 1320acccactccg acacacactg acacacacat gctggatgtg gatgtgctgt
ccaattatta 1380gtagcaattc ggtaggcaca ggcacgtact ggccggtgtt ttagctgtaa
gtaccgaacc 1440aatcacggtt aagaaccgat taatccgtgc ccagccgccg agtgcgttcg
tacgtgcatc 1500ggatgcactg catgaattga gagcatcatc atatcatacg caggagtagt
acgacgccgc 1560tgctgtcttg tccggctaat gctttgctca cagattagtc catcgcccac
ggtcggtgtg 1620gtgtggatcg ctgatgccac tgctttttgt ttggttttta ttcccctgat
aatcctccgc 1680gtccctgaat gtatctattt attttcattc cgaaatccct ttcacgaaaa
agaaaacgaa 1740taaaaagaga gttacgaata cgcttccggc ggcccacatc accttccagc
gaacatcgcg 1800ccgcgctgac gtgtcgccca tcgcggccgt ccatatcgcc atccgacgac
cgtggaagct 1860ggcagcggcc gctccgttcc gtcgaagggg caggtcagtc aggtcaccca
cacggccaca 1920cccgcgcggg ggatacgcgg tggaaaaccc ggcgaccaca tcaaaacacg
aggcgtctcc 1980cgcaggactg gtcactcggc acgcaggcag aggcagcaca gcagcagcca
gctccatcca 2040tcctctttcc cctcctcgct tcgcttcctc ggcggattcc tcctccctcg
gccgtccccg 2100tccccttctt cgccgcgcca gctcgcccga gttggtaagg ccccctccac
ccctccgctt 2160cccctccccc gggcgcgctc tggcttcctc cccggatcgg cgcggggcgt
gctggctccg 2220cgcctgattt cgggcctttt gtttccttct cgcggagcgc tcgtgtaacg
cttcggatct 2280agctggattc aggcgggatc gcggccgctc ggcttcctcg tggcctgatt
cgtggttttc 2340ctcggggagg gaatcctgat cggatcatcg ggattcctcg tgcggccggg
acacgcttgc 2400gagccagaaa catagtctgc gtggccggga ttccacgatc tgtgatctag
acgtcgggcg 2460cttcgtctat gtgctcgctg caggctgtgg cgtactggcg tggtgcgcgg
ccgctatgga 2520tccgtgcttg tttgttcgcc ctgtagcgtg tgaaatcgag ctgtgtagat
ctatggtctg 2580cgaggtgcgg tggcggtgga atctcggttg atctttacct cagcggcgcc
agtgtagctc 2640gtgtggctgc agttcatctg cgaatttggc tctcggcggc ttaggtcgcg
gagcttggat 2700tatggagcac cagctgcagc gtgaccctgt tggttctcat gtggatctgt
tggctgaggt 2760tgcagacttc aagtgccact gccattgacc ggagctgctg cacgattata
ctggaatatc 2820tagcggtagt atactctgct agtactcaat acgggtctcc tgacaaatgt
ctttcgtgtt 2880tagggaccta gcactctagt gtcaagacta tttgctggaa tatctaatat
tagcagtttc 2940tgtagtggct cagttgcagc ctggtttaga atgatgggga cagttggctg
tgccatgcaa 3000aataaagtgt gtgaaagcaa ctgcctctta aactatgggt ggtgcaagca
ggttatttga 3060agggactctc cacactgtat ctccagttaa ctttgactga acttgtggtc
gcaggcaaac 3120ccaccatggt tgcaccagca ttgcttccgg aactgtggac ggagatactg
gtcccaatct 3180gcgctgtgat cggcatagcc ttcagcctgt tccagtggta cgtcgtgtca
agggtgaagc 3240tcacgagcga cttgggagcc agtagtagcg gaggggcgaa caacgggaag
aacggctatg 3300gcgactatct gatcgaggag gaagagggtg tgaacgacca atcagtggtg
gcgaagtgtg 3360cggagattca gaccgccatt agcgagggag ctacgagctt cctgtttacg
gagtacaagt 3420acgtgggcgt cttcatgatc ttcttcgctg ccgtcatctt cgtgttcctg
ggttctgtcg 3480aaggcttctc caccgacaac aagccgtgca cttacgacac caccagaacc
tgcaaacctg 3540cactggccac tgctgcgttc tccaccatag cgttcgtgct tggtgctgtg
acaagcgtcc 3600tgagtggctt cttggggatg aagatcgcta cctacgccaa tgccagaacc
acactggagg 3660caaggaaagg tgtcgggaaa gccttcatcg tggcctttcg gagtggtgct
gtcatgggct 3720tcctgcttgc tgccagtgga ttgctcgtgc tctacatcac catcaacgtg
ttcaagatct 3780actacggcga cgattgggaa gggctcttcg acgcaatcac tggctatggg
ttgggtggct 3840cttcaatggc gctcttcgga agagtgggag gtggcatcta cacgaaagcg
gctgatgtgg 3900gagctgacct ggtcgggaag atcgagcgca acatcccgga agatgaccca
aggaacccag 3960cagtgatcgc cgacaatgtc ggcgacaatg tcggtgacat agcgggtatg
ggaagcgacc 4020tctttggctc atacgccgaa gccagctgcg cagcgcttgt tgtcgcctcc
atctccagct 4080tcgggatcaa ccacgacttc acagccatgt gctatcccct cctgatcagc
agcatgggca 4140tactggtgtg cctcatcacc acgctgtttg cgaccgactt cttcgagatc
aagctggtga 4200aggagatcga acctgcgctg aagaaccagc tgatcatctc gaccgtgatc
atgaccgttg 4260ggatcgccat cgtctcatgg gtgggtcttc ctacctcgtt caccatcttc
aactttggca 4320ctcagaaggt ggtgaagaac tggcagctct tcctctgcgt ttgcgtcgga
ctttgggctg 4380ggctgatcat cggctttgtc acggagtact acacctccaa cgcctacagt
cctgtgcagg 4440atgtggccga ttcttgccgt actggtgctg caacgaacgt catcttcggt
cttgcactgg 4500gctacaagtc ggtcatcatc cccatcttcg ccattgccat ctccatcttc
gtgagcttct 4560cgttcgcagc catgtacggt gttgccgttg ctgcattggg catgctctcc
accatcgcta 4620ctggcctcgc tattgacgcg tatggtccga tttcggacaa tgctggaggg
attgccgaga 4680tggctgggat gtcgcacagg atcagagagc gtacggatgc actggatgct
gcagggaaca 4740ctaccgctgc cattggcaag ggctttgcca tagggtctgc tgcactcgtt
agcctggcct 4800tgtttggcgc tttcgtgtcg agagctggca tccacacagt ggacgttctg
actcccaagg 4860tgatcatcgg acttctggtg ggagctatgc tcccgtactg gttctctgcg
atgacgatga 4920agtcggtcgg atcagcagcg ctgaagatgg tcgaggaggt taggaggcag
ttcaacacga 4980tccccggatt gatggagggc acagctaagc cggactatgc tacctgcgtg
aagatctcca 5040cagacgcctc catcaaggag atgatccctc cagggtgcct ggtgatgctt
actccgctga 5100ttgtgggctt cttcttcggc gtggagacac tttccggcgt gttggcagga
agcctcgtga 5160gtggagtgca gatcgcgatc agtgccagca atactggagg ggcatgggac
aacgcgaaga 5220agtacatcga agccggcgtc tcagaacacg cgaagtctct gggtccgaaa
gggtcagaac 5280cccataaggc cgctgtgatc ggcgatacga ttggcgatcc cttgaaggac
acttctggcc 5340catccctcaa catcctgatc aagctcatgg cagtggagag cctcgttttc
gcgcctttct 5400tcgcgactca tggtggcatc ctgttcaagt acttctagag ctcgcatcat
gatcatgcat 5460catggactcg gcctactact gtggatttgt atgccattat agacttggtg
ctgtgaaaga 5520ctgcttgatg atttgcgggt ttgttgctgt gtaaaaaaag gtcccttggc
tcccagaaga 5580ccatgaaggt tcggatctat catgtaattc cttgttatct gccaattatg
tatggactat 5640ggacatgtgt tgcgctgttc aacttactac tacaaataag taatcgatat
gttcccttcc 5700catgtctcgg tgacaattgt ctggagaagc ttaggggtcg tttgtttggg
attatgtctg 5760gagaaactta ttttaaacta agtgtgagtt caagttaagt tagattatat
aatctaggca 5820gattataatt ccaagcgaac aggtccttag tgtttttgga aaatcctagg
tgttcttttg 5880gctacattgt tgtgtgtgca gatcccttgt tggtctgtaa gcgtggggaa
gtaagaatcg 5940tccgtttcta ctgaagacct gctcgagtta ggcaccgagg atgccggtaa
ccaaacagag 6000caatagtgtc tatgtgggca cagtggagtg tgaatctgtg tgatgcaaat
ccgtcatttg 6060tttagcaaaa tttccagcgt tgcatgatgc agtttcttta acacggactt
aagggaaggg 6120aaaaaaatgt tgagccagga gatccttcaa tgtgttagac tgacgtgata
gccaactaaa 6180ccacgacgca atgttgtcgt taatgacaaa aaaactattt gttcctaaat
ccttggcgac 6240attgcatggc tgtctcatga gataatggtc tcatctctta tttatctctt
atttatagcc 6300ggaagtggta gtgacccctg cttgattgct cgtatgccat ctcaagttct
caaccgtgtc 6360gagcagccat tttcccatct caagcgcatc atcgtttcgt ttgacctcat
ctgctatcct 6420gctcctagtg caaatcacat gcgacagaaa gtgtcggacc gcgatcgctt
aattaagctt 6480gcatgcctgc agtgcagcgt gacccggtcg tgcccctctc tagagataat
gagcattgca 6540tgtctaagtt ataaaaaatt accacatatt ttttttgtca cacttgtttg
aagtgcagtt 6600tatctatctt tatacatata tttaaacttt actctacgaa taatataatc
tatagtacta 6660caataatatc agtgttttag agaatcatat aaatgaacag ttagacatgg
tctaaaggac 6720aattgagtat tttgacaaca ggactctaca gttttatctt tttagtgtgc
atgtgttctc 6780cttttttttt gcaaatagct tcacctatat aatacttcat ccattttatt
agtacatcca 6840tttagggttt agggttaatg gtttttatag actaattttt ttagtacatc
tattttattc 6900tattttagcc tctaaattaa gaaaactaaa actctatttt agttttttta
tttaataatt 6960tagatataaa atagaataaa ataaagtgac taaaaattaa acaaataccc
tttaagaaat 7020taaaaaaact aaggaaacat ttttcttgtt tcgagtagat aatgccagcc
tgttaaacgc 7080cgtcgacgag tctaacggac accaaccagc gaaccagcag cgtcgcgtcg
ggccaagcga 7140agcagacggc acggcatctc tgtcgctgcc tctggacccc tctcgagagt
tccgctccac 7200cgttggactt gctccgctgt cggcatccag aaattgcgtg gcggagcggc
agacgtgagc 7260cggcacggca ggcggcctcc tcctcctctc acggcaccgg cagctacggg
ggattccttt 7320cccaccgctc cttcgctttc ccttcctcgc ccgccgtaat aaatagacac
cccctccaca 7380ccctctttcc ccaacctcgt gttgttcgga gcgcacacac acacaaccag
atctccccca 7440aatccacccg tcggcacctc cgcttcaagg tacgccgctc gtcctccccc
cccccccctc 7500tctaccttct ctagatcggc gttccggtcc atggttaggg cccggtagtt
ctacttctgt 7560tcatgtttgt gttagatccg tgtttgtgtt agatccgtgc tgctagcgtt
cgtacacgga 7620tgcgacctgt acgtcagaca cgttctgatt gctaacttgc cagtgtttct
ctttggggaa 7680tcctgggatg gctctagccg ttccgcagac gggatcgatt tcatgatttt
ttttgtttcg 7740ttgcataggg tttggtttgc ccttttcctt tatttcaata tatgccgtgc
acttgtttgt 7800cgggtcatct tttcatgctt ttttttgtct tggttgtgat gatgtggtct
ggttgggcgg 7860tcgttctaga tcggagtaga attctgtttc aaactacctg gtggatttat
taattttgga 7920tctgtatgtg tgtgccatac atattcatag ttacgaattg aagatgatgg
atggaaatat 7980cgatctagga taggtataca tgttgatgcg ggttttactg atgcatatac
agagatgctt 8040tttgttcgct tggttgtgat gatgtggtgt ggttgggcgg tcgttcattc
gttctagatc 8100ggagtagaat actgtttcaa actacctggt gtatttatta attttggaac
tgtatgtgtg 8160tgtcatacat cttcatagtt acgagtttaa gatggatgga aatatcgatc
taggataggt 8220atacatgttg atgtgggttt tactgatgca tatacatgat ggcatatgca
gcatctattc 8280atatgctcta accttgagta cctatctatt ataataaaca agtatgtttt
ataattattt 8340tgatcttgat atacttggat gatggcatat gcagcagcta tatgtggatt
tttttagccc 8400tgccttcata cgctatttat ttgcttggta ctgtttcttt tgtcgatgct
caccctgttg 8460tttggtgtta cttctgcagg gatccccgat catgcaaaaa ctcattaact
cagtgcaaaa 8520ctatgcctgg ggcagcaaaa cggcgttgac tgaactttat ggtatggaaa
atccgtccag 8580ccagccgatg gccgagctgt ggatgggcgc acatccgaaa agcagttcac
gagtgcagaa 8640tgccgccgga gatatcgttt cactgcgtga tgtgattgag agtgataaat
cgactctgct 8700cggagaggcc gttgccaaac gctttggcga actgcctttc ctgttcaaag
tattatgcgc 8760agcacagcca ctctccattc aggttcatcc aaacaaacac aattctgaaa
tcggttttgc 8820caaagaaaat gccgcaggta tcccgatgga tgccgccgag cgtaactata
aagatcctaa 8880ccacaagccg gagctggttt ttgcgctgac gcctttcctt gcgatgaacg
cgtttcgtga 8940attttccgag attgtctccc tactccagcc ggtcgcaggt gcacatccgg
cgattgctca 9000ctttttacaa cagcctgatg ccgaacgttt aagcgaactg ttcgccagcc
tgttgaatat 9060gcagggtgaa gaaaaatccc gcgcgctggc gattttaaaa tcggccctcg
atagccagca 9120gggtgaaccg tggcaaacga ttcgtttaat ttctgaattt tacccggaag
acagcggtct 9180gttctccccg ctattgctga atgtggtgaa attgaaccct ggcgaagcga
tgttcctgtt 9240cgctgaaaca ccgcacgctt acctgcaagg cgtggcgctg gaagtgatgg
caaactccga 9300taacgtgctg cgtgcgggtc tgacgcctaa atacattgat attccggaac
tggttgccaa 9360tgtgaaattc gaagccaaac cggctaacca gttgttgacc cagccggtga
aacaaggtgc 9420agaactggac ttcccgattc cagtggatga ttttgccttc tcgctgcatg
accttagtga 9480taaagaaacc accattagcc agcagagtgc cgccattttg ttctgcgtcg
aaggcgatgc 9540aacgttgtgg aaaggttctc agcagttaca gcttaaaccg ggtgaatcag
cgtttattgc 9600cgccaacgaa tcaccggtga ctgtcaaagg ccacggccgt ttagcgcgtg
tttacaacaa 9660gctgtaagag cttactgaaa aaattaacat ctcttgctaa gctgggagct
cgatccgtcg 9720acctgcagat cgttcaaaca tttggcaata aagtttctta agattgaatc
ctgttgccgg 9780tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa
taattaacat 9840gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc
aattatacat 9900ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat
cgcgcgcggt 9960gtcatctatg ttactagatc tgctagccct gcaggaaatt taccggtgcc
cgggcggcca 10020gcatggccgt atccgcaatg tgttattaag ttgtctaagc gtcaatttgt
ttacaccaca 10080atatatcctg ccaccagcca gccaacagct ccccgaccgg cagctcggca
caaaatcacc 10140actcgataca ggcagcccat cagaattaat tctcatgttt gacagcttat
catcgactgc 10200acggtgcacc aatgcttctg gcgtcaggca gccatcggaa gctgtggtat
ggctgtgcag 10260gtcgtaaatc actgcataat tcgtgtcgct caaggcgcac tcccgttctg
gataatgttt 10320tttgcgccga catcataacg gttctggcaa atattctgaa atgagctgtt
gacaattaat 10380catccggctc gtataatgtg tggaattgtg agcggataac aatttcacac
aggaaacaga 10440ccatgaggga agcgttgatc gccgaagtat cgactcaact atcagaggta
gttggcgtca 10500tcgagcgcca tctcgaaccg acgttgctgg ccgtacattt gtacggctcc
gcagtggatg 10560gcggcctgaa gccacacagt gatattgatt tgctggttac ggtgaccgta
aggcttgatg 10620aaacaacgcg gcgagctttg atcaacgacc ttttggaaac ttcggcttcc
cctggagaga 10680gcgagattct ccgcgctgta gaagtcacca ttgttgtgca cgacgacatc
attccgtggc 10740gttatccagc taagcgcgaa ctgcaatttg gagaatggca gcgcaatgac
attcttgcag 10800gtatcttcga gccagccacg atcgacattg atctggctat cttgctgaca
aaagcaagag 10860aacatagcgt tgccttggta ggtccagcgg cggaggaact ctttgatccg
gttcctgaac 10920aggatctatt tgaggcgcta aatgaaacct taacgctatg gaactcgccg
cccgactggg 10980ctggcgatga gcgaaatgta gtgcttacgt tgtcccgcat ttggtacagc
gcagtaaccg 11040gcaaaatcgc gccgaaggat gtcgctgccg actgggcaat ggagcgcctg
ccggcccagt 11100atcagcccgt catacttgaa gctaggcagg cttatcttgg acaagaagat
cgcttggcct 11160cgcgcgcaga tcagttggaa gaatttgttc actacgtgaa aggcgagatc
accaaagtag 11220tcggcaaata aagctctagt ggatctccgt acccggggat ctggctcgcg
gcggacgcac 11280gacgccgggg cgagaccata ggcgatctcc taaatcaata gtagctgtaa
cctcgaagcg 11340tttcacttgt aacaacgatt gagaattttt gtcataaaat tgaaatactt
ggttcgcatt 11400tttgtcatcc gcggtcagcc gcaattctga cgaactgccc atttagctgg
agatgattgt 11460acatccttca cgtgaaaatt tctcaagcgc tgtgaacaag ggttcagatt
ttagattgaa 11520aggtgagccg ttgaaacacg ttcttcttgt cgatgacgac gtcgctatgc
ggcatcttat 11580tattgaatac cttacgatcc acgccttcaa agtgaccgcg gtagccgaca
gcacccagtt 11640cacaagagta ctctcttccg cgacggtcga tgtcgtggtt gttgatctag
atttaggtcg 11700tgaagatggg ctcgagatcg ttcgtaatct ggcggcaaag tctgatattc
caatcataat 11760tatcagtggc gaccgccttg aggagacgga taaagttgtt gcactcgagc
taggagcaag 11820tgattttatc gctaagccgt tcagtatcag agagtttcta gcacgcattc
gggttgcctt 11880gcgcgtgcgc cccaacgttg tccgctccaa agaccgacgg tctttttgtt
ttactgactg 11940gacacttaat ctcaggcaac gtcgcttgat gtccgaagct ggcggtgagg
tgaaacttac 12000ggcaggtgag ttcaatcttc tcctcgcgtt tttagagaaa ccccgcgacg
ttctatcgcg 12060cgagcaactt ctcattgcca gtcgagtacg cgacgaggag gtttatgaca
ggagtataga 12120tgttctcatt ttgaggctgc gccgcaaact tgaggcagat ccgtcaagcc
ctcaactgat 12180aaaaacagca agaggtgccg gttatttctt tgacgcggac gtgcaggttt
cgcacggggg 12240gacgatggca gcctgagcca attcccagat ccccgaggaa tcggcgtgag
cggtcgcaaa 12300ccatccggcc cggtacaaat cggcgcggcg ctgggtgatg acctggtgga
gaagttgaag 12360gccgcgcagg ccgcccagcg gcaacgcatc gaggcagaag cacgccccgg
tgaatcgtgg 12420caagcggccg ctgatcgaat ccgcaaagaa tcccggcaac cgccggcagc
cggtgcgccg 12480tcgattagga agccgcccaa gggcgacgag caaccagatt ttttcgttcc
gatgctctat 12540gacgtgggca cccgcgatag tcgcagcatc atggacgtgg ccgttttccg
tctgtcgaag 12600cgtgaccgac gagctggcga ggtgatccgc tacgagcttc cagacgggca
cgtagaggtt 12660tccgcagggc cggccggcat ggccagtgtg tgggattacg acctggtact
gatggcggtt 12720tcccatctaa ccgaatccat gaaccgatac cgggaaggga agggagacaa
gcccggccgc 12780gtgttccgtc cacacgttgc ggacgtactc aagttctgcc ggcgagccga
tggcggaaag 12840cagaaagacg acctggtaga aacctgcatt cggttaaaca ccacgcacgt
tgccatgcag 12900cgtacgaaga aggccaagaa cggccgcctg gtgacggtat ccgagggtga
agccttgatt 12960agccgctaca agatcgtaaa gagcgaaacc gggcggccgg agtacatcga
gatcgagcta 13020gctgattgga tgtaccgcga gatcacagaa ggcaagaacc cggacgtgct
gacggttcac 13080cccgattact ttttgatcga tcccggcatc ggccgttttc tctaccgcct
ggcacgccgc 13140gccgcaggca aggcagaagc cagatggttg ttcaagacga tctacgaacg
cagtggcagc 13200gccggagagt tcaagaagtt ctgtttcacc gtgcgcaagc tgatcgggtc
aaatgacctg 13260ccggagtacg atttgaagga ggaggcgggg caggctggcc cgatcctagt
catgcgctac 13320cgcaacctga tcgagggcga agcatccgcc ggttcctaat gtacggagca
gatgctaggg 13380caaattgccc tagcagggga aaaaggtcga aaaggtctct ttcctgtgga
tagcacgtac 13440attgggaacc caaagccgta cattgggaac cggaacccgt acattgggaa
cccaaagccg 13500tacattggga accggtcaca catgtaagtg actgatataa aagagaaaaa
aggcgatttt 13560tccgcctaaa actctttaaa acttattaaa actcttaaaa cccgcctggc
ctgtgcataa 13620ctgtctggcc agcgcacagc cgaagagctg caaaaagcgc ctacccttcg
gtcgctgcgc 13680tccctacgcc ccgccgcttc gcgtcggcct atcgcggccg ctggccgctc
aaaaatggct 13740ggcctacggc caggcaatct accagggcgc ggacaagccg cgccgtcgcc
actcgaccgc 13800cggcgctgag gtctgcctcg tgaagaaggt gttgctgact cataccaggc
ctgaatcgcc 13860ccatcatcca gccagaaagt gagggagcca cggttgatga gagctttgtt
gtaggtggac 13920cagttggtga ttttgaactt ttgctttgcc acggaacggt ctgcgttgtc
gggaagatgc 13980gtgatctgat ccttcaactc agcaaaagtt cgatttattc aacaaagccg
ccgtcccgtc 14040aagtcagcgt aatgctctgc cagtgttaca accaattaac caattctgat
tagaaaaact 14100catcgagcat caaatgaaac tgcaatttat tcatatcagg attatcaata
ccatattttt 14160gaaaaagccg tttctgtaat gaaggagaaa actcaccgag gcagttccat
aggatggcaa 14220gatcctggta tcggtctgcg attccgactc gtccaacatc aatacaacct
attaatttcc 14280cctcgtcaaa aataaggtta tcaagtgaga aatcaccatg agtgacgact
gaatccggtg 14340agaatggcaa aagctctgca ttaatgaatc ggccaacgcg cggggagagg
cggtttgcgt 14400attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt
tcggctgcgg 14460cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc
aggggataac 14520gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa
aaaggccgcg 14580ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa
tcgacgctca 14640agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc
ccctggaagc 14700tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc
cgcctttctc 14760ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag
ttcggtgtag 14820gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga
ccgctgcgcc 14880ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc
gccactggca 14940gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac
agagttcttg 15000aagtggtggc ctaactacgg ctacactaga agaacagtat ttggtatctg
cgctctgctg 15060aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca
aaccaccgct 15120ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa
aggatctcaa 15180gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa
ctcacgttaa 15240gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt
gatccggaat 15300t
15301448342DNAArtificial sequenceplasmid 15772 ZmABT Assembly
44ccccgaccag cgcgacatgc atggcatggc aaactatata tcgtcatcat cattattatc
60atctgaccct cttttttttt cactctcact cccatgtttt tattcccggg cggggccgtg
120tgggtgtggg ttgggatggc cggattgggc tcccggggtg gagaaatgac aaatccaggc
180ccgcaggcgg ccacccacca aatcggacga cgcagggtgc ccaaatcagg aaggatttta
240aggttaaccg gccaccggcg gtgaccgacg ccccacccca ctctccttct cctattctat
300ctatatatca cccgcctctt ttttctccct cactccgcca caccttccct cttcttcctc
360agctccgtcg cccaccgccg gagcaccgaa aggccccgcg cccgccgcct ttcctgtaaa
420aaacccaacc tttagctagc taaccgctcc tcttctcccc ctactcccct tgcccaaatc
480agagaagata tttaacggag gaggggaagg agaggatatt tagctgattg ttgattggtg
540gtccggggta cggtgttctt gagtcgtgaa gcgaccgtac agtggctagg gccgtctccg
600ggttgcgtgc aggatggtcg tcagagatcg ggagtgagga ggcagctcgt ggtcgtggag
660gctaaatgta ccgcaagaac gactcggcac tctcctgttt ctacctcttc ctcctctggt
720tcttcttctt gaaatagacc agcgccagcc accaggtagc tacctactag ctagcagccc
780agttgcgact ggggacgggc tgctgcttgc aagttggaat cttggagcag gagcagagga
840gcgggagatg gagctggatc tgaacgtggc cgaggtggcg ccggagaagc catcggcggc
900gctggaggcg agcgactcgg ggtcctcggg ctcgtcggtg ctgaacgcgg aggcggcatc
960ggcgggcggc ggggggcccg cgccggggga ggaggggtca agctcgacgc cggccgtgct
1020cgagttcagc atcctcagga gcgacagcga cgcggccggc gcggacgccg acgacggcga
1080cgccacgccg tcgccacctc gccaccacca gcagcagctc gtcacccggg agcacttccc
1140ggcgccgcag cattgggccg agcacggctt cttccgcgcc ggcccgcagc agcagccgga
1200catcagggtc ctgccgcacc cgcacccgta cccgcccccg ccgccgcccg cgcagccgca
1260gcaggccaag aagagccgcc gcggcccgcg ctcccgcagc tcgcagtacc gcggcgtcac
1320cttctaccgc cgcaccggcc gctgggagtc ccacatctgg tcagtagcac tgcaagctca
1380ccatgcgccc tttcacctac cgaccaataa tcgcttgtga ttctgacacc caaatgtttc
1440gtcttcctgt gctgtcctgt tcctcggaaa tggcagggat tgcgggaagc aggtgtactt
1500aggtgagcag caataagcag atcgatctgc agcataaatt tcccgttatt aactagttcg
1560tgatctcgat cgaatggcct aattaaccga ttcggtgatc tggccgatgg ccaatctacg
1620caggtggatt cgacactgct catgccgctg caaggtaacg atcaatccat ccatccaccc
1680ttgtctagct accccaccga ccggccggat taatggaccg ctagctctcg ggacgggctt
1740gctgcagggc gtacgaccga gcggcgatca agttccgcgg cgtcgacgcc gacataaact
1800tcaacctcag cgactacgac gacgatatga agcaggtaca tacacgagtg ttcttgcagc
1860tagcaccgac tgaaacatct gctgaacgta cacgcatggc cctgtgcacc agatgaagag
1920cctgtccaag gaggagttcg ttcacgccct gcggcggcag agcaccggct tctcccgcgg
1980cagctccaag tacaggggcg tcaccctgca caagtgcggc cgctgggagg cgcgcaaggg
2040gcagttcctc ggcaagaagt aagaaacaac acttcgtttg caggcgctgt actttgctgc
2100agattatttc atttcatcct tgcatgtgcc tttcctttcc atccactcac ttgatggctg
2160tagtctcgat agagttcgtt cgttcgtact tcgcaccaga tgaactccca cgcacatgat
2220ttagtactag ttttaccatg cattgttcag taaaagtata tgcttgcttg atcagtggtt
2280gtttcaatca gaagattaaa aaaacggaat attaatataa aaaaaagggg aagtggctag
2340ggaattcctc agtcctagct agctagctca ccggtgggaa cgccatgctt ggcttgggtg
2400caggtacata tatcttgggc tattcgacag cgaagtagag gctgcaaggt tgttcacctc
2460ggacgattct gccatttgtt catatacacc atgccttttg atttctctct tgcaatttct
2520cttcttttat catggctttt gattcccaaa gggttgagta ccgactcgat attcgattct
2580ccctgccgtt tcgtgacccc agggcgtacg acaaggcccc accatggtac gtcctgtaga
2640aaccccaacc cgtgaaatca aaaaactcga cggcctgtgg gcattcagtc tggatcgcga
2700aaactgtgga attgatcagc gttggtggga aagcgcgtta caagaaagcc gggcaattgc
2760tgtgccaggc agttttaacg atcagttcgc cgatgcagat attcgtaatt atgcgggcaa
2820cgtctggtat cagcgcgaag tctttatacc gaaaggttgg gcaggccagc gtatcgtgct
2880gcgtttcgat gcggtcactc attacggcaa agtgtgggtc aataatcagg aagtgatgga
2940gcatcagggc ggctatacgc catttgaagc cgatgtcacg ccgtatgtta ttgccgggaa
3000aagtgtacgt atcaccgttt gtgtgaacaa cgaactgaac tggcagacta tcccgccggg
3060aatggtgatt accgacgaaa acggcaagaa aaagcagtct tacttccatg atttctttaa
3120ctatgccgga atccatcgca gcgtaatgct ctacaccacg ccgaacacct gggtggacga
3180tatcaccgtg gtgacgcatg tcgcgcaaga ctgtaaccac gcgtctgttg actggcaggt
3240accaagctgc gaatcttcgt ttttttaagg aattctcgat ctttatggtg tataggctct
3300gggttttctg ttttttgtat ctcttaggat tttgtaaatt ccagatcttt ctatggccac
3360ttagtagtat atttcaaaaa ttctccaatc gagttcttca ttcgcatttt cagtcatttt
3420ctcttcgacg ttgtttttaa gcctgggtat tactcctatt tagttgaact ctgcagcaat
3480cttagaaaat tagggttttg aggtttcgat ttctctaggt aaccgatcta ttgcattcat
3540ctgaatttct gcatatatgt cttagatttc tgataagctt acgatacgtt aggtgtaatt
3600gaagtttatt tttcaagagt gttatttttt gtttctgaat ttttcaggtg gtggccaatg
3660gtgatgtcag cgttgaactg cgtgatgcgg atcaacaggt ggttgcaact ggacaaggca
3720ctagcgggac tttgcaagtg gtgaatccgc acctctggca accgggtgaa ggttatctct
3780atgaactgtg cgtcacagcc aaaagccaga cagagtgtga tatctacccg cttcgcgtcg
3840gcatccggtc agtggcagtg aagggcgaac agttcctgat taaccacaaa ccgttctact
3900ttactggctt tggtcgtcat gaagatgcgg acttgcgtgg caaaggattc gataacgtgc
3960tgatggtgca cgaccacgca ttaatggact ggattggggc caactcctac cgtacctcgc
4020attaccctta cgctgaagag atgctcgact gggcagatga acatggcatc gtggtgattg
4080atgaaactgc tgctgtcggc tttaacctct ctttaggcat tggtttcgaa gcgggcaaca
4140agccgaaaga actgtacagc gaagaggcag tcaacgggga aactcagcaa gcgcacttac
4200aggcgattaa agagctgata gcgcgtgaca aaaaccaccc aagcgtggtg atgtggagta
4260ttgccaacga accggatacc cgtccgcaag gtgcacggga atatttcgcg ccactggcgg
4320aagcaacgcg taaactcgac ccgacgcgtc cgatcacctg cgtcaatgta atgttctgcg
4380acgctcacac cgataccatc agcgatctct ttgatgtgct gtgcctgaac cgttattacg
4440gatggtatgt ccaaagcggc gatttggaaa cggcagagaa ggtactggaa aaagaacttc
4500tggcctggca ggagaaactg catcagccga ttatcatcac cgaatacggc gtggatacgt
4560tagccgggct gcactcaatg tacaccgaca tgtggagtga agagtatcag tgtgcatggc
4620tggatatgta tcaccgcgtc tttgatcgcg tcagcgccgt cgtcggtgaa caggtatgga
4680atttcgccga ttttgcgacc tcgcaaggca tattgcgcgt tggcggtaac aagaaaggga
4740tcttcactcg cgaccgcaaa ccgaagtcgg cggcttttct gctgcaaaaa cgctggactg
4800gcatgaactt cggtgaaaaa ccgcagcagg gaggcaaaca atgagagctc gaatcgaaga
4860agccacactg taaatctgcc gggaagcggc tggtggcatc cggcccgctc ctccctccgg
4920gcgccgcaac ttttttcgat cggttttgcg ccgcccggga cgggttgtag ttgatcgatt
4980ggattcttca taactgtatt tgcgtactgc ttacactacc caagtgaaat cgaaaatggc
5040gccttctctc gttgaataaa ttgcacgtac gctactcgat ccgctgcggc tcttgctgga
5100gtggccgccg ccgctataga tagaaggatc aagccaagga atctgtcatg catgggcatg
5160tgaaggagga gcctcctgca atgtttagtc ttttttggtc gacgcccacc agagatatac
5220gcactagatt tcatatagct gagctagatc gattccgttg catgcatgct gcatggcgtc
5280gagattcgag ctagcaccgc ctgttcatca tcgaccgatc cattctgatc gattcccctc
5340tcgagctttc acgaactgaa cctacctagt gagggtgacg cctaacgcct agtgcgcgcg
5400cgtgggtctc cgatgtcagt ggccgcacgc gcgcgcgcgt tctcgagatc gcatgtggtc
5460atagcgcagc aggtttgccc tcagaaccta cagcaactcg accaccggtt tggatttctt
5520cttttttcaa ggatatgatc ggagagagag agctacctag gcgtcgtcct tgttttcttg
5580tatcgcatgt ggtgtgggtc tctctcctcc tttcgtacgc acgcatgatt ccattcttac
5640cccccctcga gatcgagagg aaatatattg ctattttata cacacacggc gcccccagct
5700atacgtcact gcttacgtta attcccccac cggatagtag ttgtttaatg gcccaaacaa
5760accttgttgt tgcatgcatc atggaccaaa caaaatacat agttagttaa atattactgt
5820tatatataca actaataata attatattat tagttaaaac aaagcaaggc atatgcagca
5880gctgctggtc ggaccgggcc catcgatgat atcagatctg gttctatagt gtcacctaaa
5940tcgtatgtgt atgatacata aggttatgta ttaattgtag ccgcgttcta acgacaatat
6000gtccatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagccccg
6060acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta
6120cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc
6180gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat
6240aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat
6300ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata
6360aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct
6420tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa
6480agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa
6540cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt
6600taaagttctg ctatgtggcg cggtattatc ccgtattgac gccgggcaag agcaactcgg
6660tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca
6720tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa
6780cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt
6840gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc
6900cataccaaac gacgagcgtg acaccacgat gcctgtagca atggcaacaa cgttgcgcaa
6960actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga
7020ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc
7080tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga
7140tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga
7200acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga
7260ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat
7320ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt
7380ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct
7440gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc
7500ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc
7560aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc
7620gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc
7680gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg
7740aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata
7800cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta
7860tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc
7920ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg
7980atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt
8040cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt
8100ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga
8160gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac cgcctctccc
8220cgcgcgttgg ccgattcatt aatgcaggtt aacctggctt atcgaaatta atacgactca
8280ctatagggag accggcctcg agcagctgaa gcttgcatgc ctgcaggtcg actctagagg
8340ga
83424515544DNAArtificial sequenceplasmid 15773 45attcctgtgg ttggcatgca
catacaaatg gacgaacgga taaacctttt cacgcccttt 60taaatatccg attattctaa
taaacgctct tttctcttag gtttacccgc caatatatcc 120tgtcaaacac tgatagttta
aactgaaggc gggaaacgac aatctgatca tgagcggaga 180attaagggag tcacgttatg
acccccgccg atgacgcggg acaagccgtt ttacgtttgg 240aactgacaga accgcaacgc
tgcaggaatt ggccgcagcg gccatttaaa tcaattgggc 300gcgccagctg cttgtgggga
ccagacaaaa aaggaatggt gcagaattgt taggcgcacc 360taccaaaagc atctttgcct
ttattgcaaa gataaagcag attcctctag tacaagtggg 420gaacaaaata acgtggaaaa
gagctgtcct gacagcccac tcactaatgc gtatgacgaa 480cgcagtgacg accacaaaac
tcgagacttt tcaacaaagg gtaatatccg gaaacctcct 540cggattccat tgcccagcta
tctgtcactt tattgtgaag atagtggaaa aggaaggtgg 600ctcctacaaa tgccatcatt
gcgataaagg aaaggctatc gttgaagatg cctctgccga 660cagtggtccc aaagatggac
ccccacccac gaggagcatc gtggaaaaag aagacgttcc 720aaccacgtct tcaaagcaag
tggattgatg tgatatctcc actgacgtaa gggatgacga 780acaatcccac tatccttcgg
taccggaccc cgaccagcgc gacatgcatg gcatggcaaa 840ctatatatcg tcatcatcat
tattatcatc tgaccctctt tttttttcac tctcactccc 900atgtttttat tcccgggcgg
ggccgtgtgg gtgtgggttg ggatggccgg attgggctcc 960cggggtggag aaatgacaaa
tccaggcccg caggcggcca cccaccaaat cggacgacgc 1020agggtgccca aatcaggaag
gattttaagg ttaaccggcc accggcggtg accgacgccc 1080caccccactc tccttctcct
attctatcta tatatcaccc gcctcttttt tctccctcac 1140tccgccacac cttccctctt
cttcctcagc tccgtcgccc accgccggag caccgaaagg 1200ccccgcgccc gccgcctttc
ctgtaaaaaa cccaaccttt agctagctaa ccgctcctct 1260tctcccccta ctccccttgc
ccaaatcaga gaagatattt aacggaggag gggaaggaga 1320ggatatttag ctgattgttg
attggtggtc cggggtacgg tgttcttgag tcgtgaagcg 1380accgtacagt ggctagggcc
gtctccgggt tgcgtgcagg atggtcgtca gagatcggga 1440gtgaggaggc agctcgtggt
cgtggaggct aaatgtaccg caagaacgac tcggcactct 1500cctgtttcta cctcttcctc
ctctggttct tcttcttgaa atagaccagc gccagccacc 1560aggtagctac ctactagcta
gcagcccagt tgcgactggg gacgggctgc tgcttgcaag 1620ttggaatctt ggagcaggag
cagaggagcg ggagatggag ctggatctga acgtggccga 1680ggtggcgccg gagaagccat
cggcggcgct ggaggcgagc gactcggggt cctcgggctc 1740gtcggtgctg aacgcggagg
cggcatcggc gggcggcggg gggcccgcgc cgggggagga 1800ggggtcaagc tcgacgccgg
ccgtgctcga gttcagcatc ctcaggagcg acagcgacgc 1860ggccggcgcg gacgccgacg
acggcgacgc cacgccgtcg ccacctcgcc accaccagca 1920gcagctcgtc acccgggagc
acttcccggc gccgcagcat tgggccgagc acggcttctt 1980ccgcgccggc ccgcagcagc
agccggacat cagggtcctg ccgcacccgc acccgtaccc 2040gcccccgccg ccgcccgcgc
agccgcagca ggccaagaag agccgccgcg gcccgcgctc 2100ccgcagctcg cagtaccgcg
gcgtcacctt ctaccgccgc accggccgct gggagtccca 2160catctggtca gtagcactgc
aagctcacca tgcgcccttt cacctaccga ccaataatcg 2220cttgtgattc tgacacccaa
atgtttcgtc ttcctgtgct gtcctgttcc tcggaaatgg 2280cagggattgc gggaagcagg
tgtacttagg tgagcagcaa taagcagatc gatctgcagc 2340ataaatttcc cgttattaac
tagttcgtga tctcgatcga atggcctaat taaccgattc 2400ggtgatctgg ccgatggcca
atctacgcag gtggattcga cactgctcat gccgctgcaa 2460ggtaacgatc aatccatcca
tccacccttg tctagctacc ccaccgaccg gccggattaa 2520tggaccgcta gctctcggga
cgggcttgct gcagggcgta cgaccgagcg gcgatcaagt 2580tccgcggcgt cgacgccgac
ataaacttca acctcagcga ctacgacgac gatatgaagc 2640aggtacatac acgagtgttc
ttgcagctag caccgactga aacatctgct gaacgtacac 2700gcatggccct gtgcaccaga
tgaagagcct gtccaaggag gagttcgttc acgccctgcg 2760gcggcagagc accggcttct
cccgcggcag ctccaagtac aggggcgtca ccctgcacaa 2820gtgcggccgc tgggaggcgc
gcaaggggca gttcctcggc aagaagtaag aaacaacact 2880tcgtttgcag gcgctgtact
ttgctgcaga ttatttcatt tcatccttgc atgtgccttt 2940cctttccatc cactcacttg
atggctgtag tctcgataga gttcgttcgt tcgtacttcg 3000caccagatga actcccacgc
acatgattta gtactagttt taccatgcat tgttcagtaa 3060aagtatatgc ttgcttgatc
agtggttgtt tcaatcagaa gattaaaaaa acggaatatt 3120aatataaaaa aaaggggaag
tggctaggga attcctcagt cctagctagc tagctcaccg 3180gtgggaacgc catgcttggc
ttgggtgcag gtacatatat cttgggctat tcgacagcga 3240agtagaggct gcaaggttgt
tcacctcgga cgattctgcc atttgttcat atacaccatg 3300ccttttgatt tctctcttgc
aatttctctt cttttatcat ggcttttgat tcccaaaggg 3360ttgagtaccg actcgatatt
cgattctccc tgccgtttcg tgaccccagg gcgtacgaca 3420aggccccacc atggtacgtc
ctgtagaaac cccaacccgt gaaatcaaaa aactcgacgg 3480cctgtgggca ttcagtctgg
atcgcgaaaa ctgtggaatt gatcagcgtt ggtgggaaag 3540cgcgttacaa gaaagccggg
caattgctgt gccaggcagt tttaacgatc agttcgccga 3600tgcagatatt cgtaattatg
cgggcaacgt ctggtatcag cgcgaagtct ttataccgaa 3660aggttgggca ggccagcgta
tcgtgctgcg tttcgatgcg gtcactcatt acggcaaagt 3720gtgggtcaat aatcaggaag
tgatggagca tcagggcggc tatacgccat ttgaagccga 3780tgtcacgccg tatgttattg
ccgggaaaag tgtacgtatc accgtttgtg tgaacaacga 3840actgaactgg cagactatcc
cgccgggaat ggtgattacc gacgaaaacg gcaagaaaaa 3900gcagtcttac ttccatgatt
tctttaacta tgccggaatc catcgcagcg taatgctcta 3960caccacgccg aacacctggg
tggacgatat caccgtggtg acgcatgtcg cgcaagactg 4020taaccacgcg tctgttgact
ggcaggtacc aagctgcgaa tcttcgtttt tttaaggaat 4080tctcgatctt tatggtgtat
aggctctggg ttttctgttt tttgtatctc ttaggatttt 4140gtaaattcca gatctttcta
tggccactta gtagtatatt tcaaaaattc tccaatcgag 4200ttcttcattc gcattttcag
tcattttctc ttcgacgttg tttttaagcc tgggtattac 4260tcctatttag ttgaactctg
cagcaatctt agaaaattag ggttttgagg tttcgatttc 4320tctaggtaac cgatctattg
cattcatctg aatttctgca tatatgtctt agatttctga 4380taagcttacg atacgttagg
tgtaattgaa gtttattttt caagagtgtt attttttgtt 4440tctgaatttt tcaggtggtg
gccaatggtg atgtcagcgt tgaactgcgt gatgcggatc 4500aacaggtggt tgcaactgga
caaggcacta gcgggacttt gcaagtggtg aatccgcacc 4560tctggcaacc gggtgaaggt
tatctctatg aactgtgcgt cacagccaaa agccagacag 4620agtgtgatat ctacccgctt
cgcgtcggca tccggtcagt ggcagtgaag ggcgaacagt 4680tcctgattaa ccacaaaccg
ttctacttta ctggctttgg tcgtcatgaa gatgcggact 4740tgcgtggcaa aggattcgat
aacgtgctga tggtgcacga ccacgcatta atggactgga 4800ttggggccaa ctcctaccgt
acctcgcatt acccttacgc tgaagagatg ctcgactggg 4860cagatgaaca tggcatcgtg
gtgattgatg aaactgctgc tgtcggcttt aacctctctt 4920taggcattgg tttcgaagcg
ggcaacaagc cgaaagaact gtacagcgaa gaggcagtca 4980acggggaaac tcagcaagcg
cacttacagg cgattaaaga gctgatagcg cgtgacaaaa 5040accacccaag cgtggtgatg
tggagtattg ccaacgaacc ggatacccgt ccgcaaggtg 5100cacgggaata tttcgcgcca
ctggcggaag caacgcgtaa actcgacccg acgcgtccga 5160tcacctgcgt caatgtaatg
ttctgcgacg ctcacaccga taccatcagc gatctctttg 5220atgtgctgtg cctgaaccgt
tattacggat ggtatgtcca aagcggcgat ttggaaacgg 5280cagagaaggt actggaaaaa
gaacttctgg cctggcagga gaaactgcat cagccgatta 5340tcatcaccga atacggcgtg
gatacgttag ccgggctgca ctcaatgtac accgacatgt 5400ggagtgaaga gtatcagtgt
gcatggctgg atatgtatca ccgcgtcttt gatcgcgtca 5460gcgccgtcgt cggtgaacag
gtatggaatt tcgccgattt tgcgacctcg caaggcatat 5520tgcgcgttgg cggtaacaag
aaagggatct tcactcgcga ccgcaaaccg aagtcggcgg 5580cttttctgct gcaaaaacgc
tggactggca tgaacttcgg tgaaaaaccg cagcagggag 5640gcaaacaatg agagctcgaa
tcgaagaagc cacactgtaa atctgccggg aagcggctgg 5700tggcatccgg cccgctcctc
cctccgggcg ccgcaacttt tttcgatcgg ttttgcgccg 5760cccgggacgg gttgtagttg
atcgattgga ttcttcataa ctgtatttgc gtactgctta 5820cactacccaa gtgaaatcga
aaatggcgcc ttctctcgtt gaataaattg cacgtacgct 5880actcgatccg ctgcggctct
tgctggagtg gccgccgccg ctatagatag aaggatcaag 5940ccaaggaatc tgtcatgcat
gggcatgtga aggaggagcc tcctgcaatg tttagtcttt 6000tttggtcgac gcccaccaga
gatatacgca ctagatttca tatagctgag ctagatcgat 6060tccgttgcat gcatgctgca
tggcgtcgag attcgagcta gcaccgcctg ttcatcatcg 6120accgatccat tctgatcgat
tcccctctcg agctttcacg aactgaacct acctagtgag 6180ggtgacgcct aacgcctagt
gcgcgcgcgt gggtctccga tgtcagtggc cgcacgcgcg 6240cgcgcgttct cgagatcgca
tgtggtcata gcgcagcagg tttgccctca gaacctacag 6300caactcgacc accggtttgg
atttcttctt ttttcaagga tatgatcgga gagagagagc 6360tacctaggcg tcgtccttgt
tttcttgtat cgcatgtggt gtgggtctct ctcctccttt 6420cgtacgcacg catgattcca
ttcttacccc ccctcgagat cgagaggaaa tatattgcta 6480ttttatacac acacggcgcc
cccagctata cgtcactgct tacgttaatt cccccaccgg 6540atagtagttg tttaatggcc
caaacaaacc ttgttgttgc atgcatcatg gaccaaacaa 6600aatacatagt tagttaaata
ttactgttat atatacaact aataataatt atattattag 6660ttaaaacaaa gcaaggcata
tgcagcagct gctggtcgga ccgcgatcgc ttaattaagc 6720ttgcatgcct gcagtgcagc
gtgacccggt cgtgcccctc tctagagata atgagcattg 6780catgtctaag ttataaaaaa
ttaccacata ttttttttgt cacacttgtt tgaagtgcag 6840tttatctatc tttatacata
tatttaaact ttactctacg aataatataa tctatagtac 6900tacaataata tcagtgtttt
agagaatcat ataaatgaac agttagacat ggtctaaagg 6960acaattgagt attttgacaa
caggactcta cagttttatc tttttagtgt gcatgtgttc 7020tccttttttt ttgcaaatag
cttcacctat ataatacttc atccatttta ttagtacatc 7080catttagggt ttagggttaa
tggtttttat agactaattt ttttagtaca tctattttat 7140tctattttag cctctaaatt
aagaaaacta aaactctatt ttagtttttt tatttaataa 7200tttagatata aaatagaata
aaataaagtg actaaaaatt aaacaaatac cctttaagaa 7260attaaaaaaa ctaaggaaac
atttttcttg tttcgagtag ataatgccag cctgttaaac 7320gccgtcgacg agtctaacgg
acaccaacca gcgaaccagc agcgtcgcgt cgggccaagc 7380gaagcagacg gcacggcatc
tctgtcgctg cctctggacc cctctcgaga gttccgctcc 7440accgttggac ttgctccgct
gtcggcatcc agaaattgcg tggcggagcg gcagacgtga 7500gccggcacgg caggcggcct
cctcctcctc tcacggcacc ggcagctacg ggggattcct 7560ttcccaccgc tccttcgctt
tcccttcctc gcccgccgta ataaatagac accccctcca 7620caccctcttt ccccaacctc
gtgttgttcg gagcgcacac acacacaacc agatctcccc 7680caaatccacc cgtcggcacc
tccgcttcaa ggtacgccgc tcgtcctccc cccccccccc 7740tctctacctt ctctagatcg
gcgttccggt ccatggttag ggcccggtag ttctacttct 7800gttcatgttt gtgttagatc
cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg 7860gatgcgacct gtacgtcaga
cacgttctga ttgctaactt gccagtgttt ctctttgggg 7920aatcctggga tggctctagc
cgttccgcag acgggatcga tttcatgatt ttttttgttt 7980cgttgcatag ggtttggttt
gcccttttcc tttatttcaa tatatgccgt gcacttgttt 8040gtcgggtcat cttttcatgc
ttttttttgt cttggttgtg atgatgtggt ctggttgggc 8100ggtcgttcta gatcggagta
gaattctgtt tcaaactacc tggtggattt attaattttg 8160gatctgtatg tgtgtgccat
acatattcat agttacgaat tgaagatgat ggatggaaat 8220atcgatctag gataggtata
catgttgatg cgggttttac tgatgcatat acagagatgc 8280tttttgttcg cttggttgtg
atgatgtggt gtggttgggc ggtcgttcat tcgttctaga 8340tcggagtaga atactgtttc
aaactacctg gtgtatttat taattttgga actgtatgtg 8400tgtgtcatac atcttcatag
ttacgagttt aagatggatg gaaatatcga tctaggatag 8460gtatacatgt tgatgtgggt
tttactgatg catatacatg atggcatatg cagcatctat 8520tcatatgctc taaccttgag
tacctatcta ttataataaa caagtatgtt ttataattat 8580tttgatcttg atatacttgg
atgatggcat atgcagcagc tatatgtgga tttttttagc 8640cctgccttca tacgctattt
atttgcttgg tactgtttct tttgtcgatg ctcaccctgt 8700tgtttggtgt tacttctgca
gggatccccg atcatgcaaa aactcattaa ctcagtgcaa 8760aactatgcct ggggcagcaa
aacggcgttg actgaacttt atggtatgga aaatccgtcc 8820agccagccga tggccgagct
gtggatgggc gcacatccga aaagcagttc acgagtgcag 8880aatgccgccg gagatatcgt
ttcactgcgt gatgtgattg agagtgataa atcgactctg 8940ctcggagagg ccgttgccaa
acgctttggc gaactgcctt tcctgttcaa agtattatgc 9000gcagcacagc cactctccat
tcaggttcat ccaaacaaac acaattctga aatcggtttt 9060gccaaagaaa atgccgcagg
tatcccgatg gatgccgccg agcgtaacta taaagatcct 9120aaccacaagc cggagctggt
ttttgcgctg acgcctttcc ttgcgatgaa cgcgtttcgt 9180gaattttccg agattgtctc
cctactccag ccggtcgcag gtgcacatcc ggcgattgct 9240cactttttac aacagcctga
tgccgaacgt ttaagcgaac tgttcgccag cctgttgaat 9300atgcagggtg aagaaaaatc
ccgcgcgctg gcgattttaa aatcggccct cgatagccag 9360cagggtgaac cgtggcaaac
gattcgttta atttctgaat tttacccgga agacagcggt 9420ctgttctccc cgctattgct
gaatgtggtg aaattgaacc ctggcgaagc gatgttcctg 9480ttcgctgaaa caccgcacgc
ttacctgcaa ggcgtggcgc tggaagtgat ggcaaactcc 9540gataacgtgc tgcgtgcggg
tctgacgcct aaatacattg atattccgga actggttgcc 9600aatgtgaaat tcgaagccaa
accggctaac cagttgttga cccagccggt gaaacaaggt 9660gcagaactgg acttcccgat
tccagtggat gattttgcct tctcgctgca tgaccttagt 9720gataaagaaa ccaccattag
ccagcagagt gccgccattt tgttctgcgt cgaaggcgat 9780gcaacgttgt ggaaaggttc
tcagcagtta cagcttaaac cgggtgaatc agcgtttatt 9840gccgccaacg aatcaccggt
gactgtcaaa ggccacggcc gtttagcgcg tgtttacaac 9900aagctgtaag agcttactga
aaaaattaac atctcttgct aagctgggag ctcgatccgt 9960cgacctgcag atcgttcaaa
catttggcaa taaagtttct taagattgaa tcctgttgcc 10020ggtcttgcga tgattatcat
ataatttctg ttgaattacg ttaagcatgt aataattaac 10080atgtaatgca tgacgttatt
tatgagatgg gtttttatga ttagagtccc gcaattatac 10140atttaatacg cgatagaaaa
caaaatatag cgcgcaaact aggataaatt atcgcgcgcg 10200gtgtcatcta tgttactaga
tctgctagcc ctgcaggaaa tttaccggtg cccgggcggc 10260cagcatggcc gtatccgcaa
tgtgttatta agttgtctaa gcgtcaattt gtttacacca 10320caatatatcc tgccaccagc
cagccaacag ctccccgacc ggcagctcgg cacaaaatca 10380ccactcgata caggcagccc
atcagaatta attctcatgt ttgacagctt atcatcgact 10440gcacggtgca ccaatgcttc
tggcgtcagg cagccatcgg aagctgtggt atggctgtgc 10500aggtcgtaaa tcactgcata
attcgtgtcg ctcaaggcgc actcccgttc tggataatgt 10560tttttgcgcc gacatcataa
cggttctggc aaatattctg aaatgagctg ttgacaatta 10620atcatccggc tcgtataatg
tgtggaattg tgagcggata acaatttcac acaggaaaca 10680gaccatgagg gaagcgttga
tcgccgaagt atcgactcaa ctatcagagg tagttggcgt 10740catcgagcgc catctcgaac
cgacgttgct ggccgtacat ttgtacggct ccgcagtgga 10800tggcggcctg aagccacaca
gtgatattga tttgctggtt acggtgaccg taaggcttga 10860tgaaacaacg cggcgagctt
tgatcaacga ccttttggaa acttcggctt cccctggaga 10920gagcgagatt ctccgcgctg
tagaagtcac cattgttgtg cacgacgaca tcattccgtg 10980gcgttatcca gctaagcgcg
aactgcaatt tggagaatgg cagcgcaatg acattcttgc 11040aggtatcttc gagccagcca
cgatcgacat tgatctggct atcttgctga caaaagcaag 11100agaacatagc gttgccttgg
taggtccagc ggcggaggaa ctctttgatc cggttcctga 11160acaggatcta tttgaggcgc
taaatgaaac cttaacgcta tggaactcgc cgcccgactg 11220ggctggcgat gagcgaaatg
tagtgcttac gttgtcccgc atttggtaca gcgcagtaac 11280cggcaaaatc gcgccgaagg
atgtcgctgc cgactgggca atggagcgcc tgccggccca 11340gtatcagccc gtcatacttg
aagctaggca ggcttatctt ggacaagaag atcgcttggc 11400ctcgcgcgca gatcagttgg
aagaatttgt tcactacgtg aaaggcgaga tcaccaaagt 11460agtcggcaaa taaagctcta
gtggatctcc gtacccgggg atctggctcg cggcggacgc 11520acgacgccgg ggcgagacca
taggcgatct cctaaatcaa tagtagctgt aacctcgaag 11580cgtttcactt gtaacaacga
ttgagaattt ttgtcataaa attgaaatac ttggttcgca 11640tttttgtcat ccgcggtcag
ccgcaattct gacgaactgc ccatttagct ggagatgatt 11700gtacatcctt cacgtgaaaa
tttctcaagc gctgtgaaca agggttcaga ttttagattg 11760aaaggtgagc cgttgaaaca
cgttcttctt gtcgatgacg acgtcgctat gcggcatctt 11820attattgaat accttacgat
ccacgccttc aaagtgaccg cggtagccga cagcacccag 11880ttcacaagag tactctcttc
cgcgacggtc gatgtcgtgg ttgttgatct agatttaggt 11940cgtgaagatg ggctcgagat
cgttcgtaat ctggcggcaa agtctgatat tccaatcata 12000attatcagtg gcgaccgcct
tgaggagacg gataaagttg ttgcactcga gctaggagca 12060agtgatttta tcgctaagcc
gttcagtatc agagagtttc tagcacgcat tcgggttgcc 12120ttgcgcgtgc gccccaacgt
tgtccgctcc aaagaccgac ggtctttttg ttttactgac 12180tggacactta atctcaggca
acgtcgcttg atgtccgaag ctggcggtga ggtgaaactt 12240acggcaggtg agttcaatct
tctcctcgcg tttttagaga aaccccgcga cgttctatcg 12300cgcgagcaac ttctcattgc
cagtcgagta cgcgacgagg aggtttatga caggagtata 12360gatgttctca ttttgaggct
gcgccgcaaa cttgaggcag atccgtcaag ccctcaactg 12420ataaaaacag caagaggtgc
cggttatttc tttgacgcgg acgtgcaggt ttcgcacggg 12480gggacgatgg cagcctgagc
caattcccag atccccgagg aatcggcgtg agcggtcgca 12540aaccatccgg cccggtacaa
atcggcgcgg cgctgggtga tgacctggtg gagaagttga 12600aggccgcgca ggccgcccag
cggcaacgca tcgaggcaga agcacgcccc ggtgaatcgt 12660ggcaagcggc cgctgatcga
atccgcaaag aatcccggca accgccggca gccggtgcgc 12720cgtcgattag gaagccgccc
aagggcgacg agcaaccaga ttttttcgtt ccgatgctct 12780atgacgtggg cacccgcgat
agtcgcagca tcatggacgt ggccgttttc cgtctgtcga 12840agcgtgaccg acgagctggc
gaggtgatcc gctacgagct tccagacggg cacgtagagg 12900tttccgcagg gccggccggc
atggccagtg tgtgggatta cgacctggta ctgatggcgg 12960tttcccatct aaccgaatcc
atgaaccgat accgggaagg gaagggagac aagcccggcc 13020gcgtgttccg tccacacgtt
gcggacgtac tcaagttctg ccggcgagcc gatggcggaa 13080agcagaaaga cgacctggta
gaaacctgca ttcggttaaa caccacgcac gttgccatgc 13140agcgtacgaa gaaggccaag
aacggccgcc tggtgacggt atccgagggt gaagccttga 13200ttagccgcta caagatcgta
aagagcgaaa ccgggcggcc ggagtacatc gagatcgagc 13260tagctgattg gatgtaccgc
gagatcacag aaggcaagaa cccggacgtg ctgacggttc 13320accccgatta ctttttgatc
gatcccggca tcggccgttt tctctaccgc ctggcacgcc 13380gcgccgcagg caaggcagaa
gccagatggt tgttcaagac gatctacgaa cgcagtggca 13440gcgccggaga gttcaagaag
ttctgtttca ccgtgcgcaa gctgatcggg tcaaatgacc 13500tgccggagta cgatttgaag
gaggaggcgg ggcaggctgg cccgatccta gtcatgcgct 13560accgcaacct gatcgagggc
gaagcatccg ccggttccta atgtacggag cagatgctag 13620ggcaaattgc cctagcaggg
gaaaaaggtc gaaaaggtct ctttcctgtg gatagcacgt 13680acattgggaa cccaaagccg
tacattggga accggaaccc gtacattggg aacccaaagc 13740cgtacattgg gaaccggtca
cacatgtaag tgactgatat aaaagagaaa aaaggcgatt 13800tttccgccta aaactcttta
aaacttatta aaactcttaa aacccgcctg gcctgtgcat 13860aactgtctgg ccagcgcaca
gccgaagagc tgcaaaaagc gcctaccctt cggtcgctgc 13920gctccctacg ccccgccgct
tcgcgtcggc ctatcgcggc cgctggccgc tcaaaaatgg 13980ctggcctacg gccaggcaat
ctaccagggc gcggacaagc cgcgccgtcg ccactcgacc 14040gccggcgctg aggtctgcct
cgtgaagaag gtgttgctga ctcataccag gcctgaatcg 14100ccccatcatc cagccagaaa
gtgagggagc cacggttgat gagagctttg ttgtaggtgg 14160accagttggt gattttgaac
ttttgctttg ccacggaacg gtctgcgttg tcgggaagat 14220gcgtgatctg atccttcaac
tcagcaaaag ttcgatttat tcaacaaagc cgccgtcccg 14280tcaagtcagc gtaatgctct
gccagtgtta caaccaatta accaattctg attagaaaaa 14340ctcatcgagc atcaaatgaa
actgcaattt attcatatca ggattatcaa taccatattt 14400ttgaaaaagc cgtttctgta
atgaaggaga aaactcaccg aggcagttcc ataggatggc 14460aagatcctgg tatcggtctg
cgattccgac tcgtccaaca tcaatacaac ctattaattt 14520cccctcgtca aaaataaggt
tatcaagtga gaaatcacca tgagtgacga ctgaatccgg 14580tgagaatggc aaaagctctg
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 14640gtattgggcg ctcttccgct
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 14700ggcgagcggt atcagctcac
tcaaaggcgg taatacggtt atccacagaa tcaggggata 14760acgcaggaaa gaacatgtga
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 14820cgttgctggc gtttttccat
aggctccgcc cccctgacga gcatcacaaa aatcgacgct 14880caagtcagag gtggcgaaac
ccgacaggac tataaagata ccaggcgttt ccccctggaa 14940gctccctcgt gcgctctcct
gttccgaccc tgccgcttac cggatacctg tccgcctttc 15000tcccttcggg aagcgtggcg
ctttctcata gctcacgctg taggtatctc agttcggtgt 15060aggtcgttcg ctccaagctg
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 15120ccttatccgg taactatcgt
cttgagtcca acccggtaag acacgactta tcgccactgg 15180cagcagccac tggtaacagg
attagcagag cgaggtatgt aggcggtgct acagagttct 15240tgaagtggtg gcctaactac
ggctacacta gaagaacagt atttggtatc tgcgctctgc 15300tgaagccagt taccttcgga
aaaagagttg gtagctcttg atccggcaaa caaaccaccg 15360ctggtagcgg tggttttttt
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 15420aagaagatcc tttgatcttt
tctacggggt ctgacgctca gtggaacgaa aactcacgtt 15480aagggatttt ggtcatgaga
ttatcaaaaa ggatcttcac ctagatcctt ttgatccgga 15540atta
15544467127DNAZea mays
46agagaggaga tattttcgac cagcgcgaca tgcatggcat ggcaaactat atatcgtcat
60catcattatt atcatctgac cctctttttt tttcactctc actcccatgt ttttattccc
120gggcggggcc gtgtgggtgt gggttgggat ggccggattg gggtcccggg gtggagaaat
180gacaaatcca ggcccgcagg cggccaccca ccaaatcgga cgacgcaggg tgcccaaatc
240aggaaggatt ttaaggttaa ccggccaccg gcggtgaccg acgccccacc ccactctcct
300tctcctattc tatctatata tcacccgcct cttttttctc cctcactccg ccacaccttc
360cctcttcttc ctcagctccg tcgcccaccg ccggagctcc gaaaggcccc gcgcccgccg
420cctttcctgt aaaaaaccca acctttagct agctaaccgc tcctcttctc cccctactcc
480ccttgcccaa atcagagaag atatttaacg gaggagggga aggagaggat atttagctga
540ttgttgattg gtggtccggg gtacggtgtt cttgagtcgt gaagcgaccg tacagtggct
600agggccgtct ccgggttgcg tgcaggatgg tcgtcagaga tcgggagtga ggaggcagct
660cgtggtcgtg gaggctaaat gtaccgcaag aacgactcgg cactctcctg tttctacctc
720ttcctcctct ggttcttctt cttgaaatag accagcgcca gccaccaggt agctacctac
780tagctagcag cccagttgcg actggggacg ggctgctgct tgcaagttgg aatcttggag
840caggagcaga ggagcgggag atggagctgg atctgaacgt ggccgaggtg gcgccggaga
900agccatcggc ggcgctggag gcgagcgact cggggtcctc gggctcgtcg gtgctgaacg
960cggaggcggc atcggcgggc ggcggggggc ccgcgccggg ggaggagggg tcaagctcga
1020cgccggccgt gctcgagttc agcatcctca ggagcgacag cgacgcggcc ggcgcggacg
1080ccgacgacgg cgacgccacg ccgtcgccac ctcgccacca ccagcagcag ctcgtcaccc
1140gggagctctt cccggcgccg cagcattggg ccgagctcgg cttcttccgc gccggcccgc
1200agcagcagcc ggacatcagg gtcctgccgc acccgcaccc gtacccgccc ccgccgccgc
1260ccgcgcagcc gcagcaggcc aagaagagcc gccgcggccc gcgctcccgc agctcgcagt
1320accgcggcgt caccttctac cgccgcaccg gccgctggga gtcccacatc tggtcagtag
1380cactgcaagc tcaccatgcg ccctttcacc taccgaccaa taatcgcttg tgattctgac
1440acccaaatgt ttcgtcttcc tgtgctgtcc tgttcctcgg aaatggcagg gattgcggga
1500agcaggtgta cttaggtgag cagcaataag cagatcgatc tgcagcataa atttcccgtt
1560attaactagt tcgtgatctc gatcgaatgg cctaattaac cgattcggtg atctggccga
1620tggccaatct acgcaggtgg attcgacact gctcatgccg ctgcaaggta acgatcaatc
1680catccatcca cccttgtcta gctaccccac cgaccggccg gattaatgga ccgctagctc
1740tcgggacggg cttgctgcag ggcgtacgac cgagcggcga tcaagttccg cggcgtcgac
1800gccgacataa acttcaacct cagcgactac gacgacgata tgaagcaggt acatacacga
1860gtgttcttgc agctagcacc gactgaaaca tctgctgaac gtacacgcat ggccctgtgc
1920accagatgaa gagcctgtcc aaggaggagt tcgttcacgc cctgcggcgg cagagcaccg
1980gcttctcccg cggcagctcc aagtacaggg gcgtcaccct gcacaagtgc ggccgctggg
2040aggcgcgcat ggggcagttc ctcggcaaga agtaagaaac aacacttcgt ttgcaggcgc
2100tgtactttgc tgcagattat ttcatttcat ccttgcatgt gcctttcctt tccatccact
2160cacttgatgg ctgtagtctc gatagagttc gttcgttcgt acttcgcacc agatgaactc
2220ccacgcacat gatttagtac tagttttacc atgcattgtt cagtaaaagt atatgcttgc
2280ttgatcagtg gttgtttcaa tcagaagatt aaaaaaacgg aatattaata taaaaaaaag
2340gggaagtggc tagggaattc ctcagtccta gctagctagc tcaccggtgg gaacgccatg
2400cttggcttgg gtgcaggtac atatatcttg ggctattcga cagcgaagta gaggctgcaa
2460ggttgttcac ctcggacgat tctgccattt gttcatatac accatgcctt ttgatttctc
2520tcttgcaatt tctcttcttt tatcatggct tttgattccc aaagggttga gtaccgactc
2580gatattcgat tctccctgcc gtttcgtgac cccagggcgt acgacaaggc cgcgatcaaa
2640tgcaacggta gagaggccgt gacgaacttc gagcccagca cgtacgacgg ggagctgctg
2700ctgactgctg aagctagcgc agaaggtaat taagtagctg ctcgctgcca tgtaatcttc
2760agatgacgcc gctgttaatt attagctcat cagctttcgg acgatgccct tgtttttcgg
2820ttgaaccggg gtgaactttc tgaatttgag atttgatttt ttttgtttct gcttctgcag
2880ttgctgacga cgttgatctg aacttgagca tctcgcaacc ggcatcgtcc cagagcccca
2940aaagagacaa gaactgcctt ggtccgcagc tccaccacca ccatgggcgg ccgtttgacg
3000gctccgccgt tctgaagaaa accaaggcaa gcgctaagta ataacgctac gtaccttgac
3060aagtatcaaa atcagtaaaa ctttcctctt cgtcaaaccc tatctctacc gacggctgtt
3120agttgcccgg ttttgatcat ttgacaatta aacacatacc ctctcgcaag tcgggatcat
3180ttttagctag gcggactagt ttatcgccaa gcagcgagtt tctctttcgg ggtgggtgat
3240cgcgacagct gagcagaata cttcttcttc gtctactttt tctccttcct cctaccaaaa
3300ttgaattgtt taaggaaaat ttatacagag agcggcgtgg acagctttgg atggagctgc
3360cgataattca actgaaaatc tctcgcttct tcttcttctc atgcagatcg atgctccgtc
3420tgagctgtcg tcggcgggcc gccctcaccg gtcgttcctc cctcatctcg tggctgccga
3480gcatctaccg cctcggtctc accccttctt catcacacac catgaggtta gacgacacta
3540tacagtactg aatcatttgc aaaggtttgt caagctagct agattggcat cataatacac
3600ggatcaggtg tcagattgtt catgcagtgc agtatgcagc ctgaaggtgt atgcagtttc
3660agatagcaga tttttagcag ctggttaatt tctctcttgc gtgcggctgt cagtcagtgt
3720agctctcgtc gtcgcccgct ttatttcctt ggattctagc tagagtccgc ctgtcacccg
3780tcgatttcag tgaagttaat gggatgcgcg aatttttttt ctcccccgta taggccggct
3840gttgaatata tgtgtctatc ttgaattggc ctaatatggg aataatagta ctagcagctt
3900tatggctaga tcagaatatg tacatgtgtt tgattttttt tctctctctc ccttagcttc
3960cttgaaaagg aaaggtccta gacctagcta ccggccagca gcgacacttc aactctaagg
4020gcatgtacag tggagagacg ccaaaacggt tctccaagca taggagacaa ctaagagact
4080ctattgtaca atggagtgtc tctaaacgta gtctattaat aaatacagaa ttaaatgtat
4140ttgtatagca tcagatcgat agaacagacg acaaattcgt acagtgggaa gtgaggcgtc
4200tgttgttact tggtttacga gccagaggcg tctcttcacg gagagacggc tctaagattt
4260ttttgcaaat aaccccctaa aacaccttaa gagcccccac attaaacacc actgtacatg
4320ccctaagccc tgcctggcct gcctaatcaa accctctcgg tcaactatgc tatgcctgcc
4380tgcctgcttt caacacgtac tgttcctttt tcaaaccttc cctggaaacg aaaacagaag
4440atgcatggta tttatgcttg gggatttgcc ttcttttcag tgtactaata agcttggggt
4500ttgtttagtc gttcagcaat caacttggac gagtgttgat aaataaaact cgatctccaa
4560cctttcgttc ataaatgggt cagctaactt tgaggtcggt ctcactctca caccagtgtc
4620gctttctgat tgtattgtat tggacgggaa gagctgaggt cgacgctttt ctgcccccag
4680ctgaactgat gggaaacgct aagctaatta tattggtgga acgagtctcc tgccgtttgc
4740tctctttttt gttttgtttc tcttaaaaaa aacatgcttc catgcatcag aaagcgttat
4800tacttaggat gattaatttg aactgttcat cagttcgttg aattggtcct agggtgaatg
4860aactttcagt ttatttgttg accatgcatg cagagtgatg catcaagaag agatcccagc
4920tgggcagcag cagcagcatg gaaggtgacc gcagctgcac ctcctcctcc taccaccacc
4980ctgttgccgt tgccgctgcc gtcgacgtcg tccgctgcag catcatcagg attctccaat
5040accgccacga cagctgccgc cgccccatcg gccgcctcct cccgccggtt cgacccgccg
5100ccaccgtcgt cgtcctcctc ctcgagccat caccaccacc accaccgccg ctgagaatcg
5160aagaagccac actgtaaatc tgccgggaag cggctggtgg catccggccc gctcctccct
5220ccgggcgccg caactttttt cgatcggttt tgcgccgccc gggacgggtt gtagttgatc
5280gattggattc ttcataactg tatttgcgta ctgcttacac tacccaagtg aaatcgaaaa
5340tggcgccttc tctcgttgaa taaattgcac gtacgctact cgatccgctg cggctcttgc
5400tggagtggcc gccgccgcta tagatagaag gatcaagcca aggaatctgt catgcatggg
5460catgtgaagg aggagcctcc tgcaatgttt agtctttttt ggtcgacgcc caccagagat
5520atacgcacta gatttcatat agctgagcta gatcgattcc gttgcatgca tgctccatgg
5580cgtcgagatt cgagctagca ccgcctgttc atcatcgacc gatccattct gatcgattcc
5640cctctcgagc tttcacgaac tgaacctacc tagtgagggt gacgcctaac gcctagtgcg
5700cgcgcgtggg tctccgatgt cagtggccgc acgcgcgcgc gcgttctcga gatcgcatgt
5760ggtcatagcg cagcaggttt gccctcagaa cctacagcaa ctcgaccacc ggtttggatt
5820tcttcttttt tcaaggatat gatcggagag agagagctac ctaggcgtcg tccttgtttt
5880cttgtatcgc atgtggtgtg ggtctctctc ctcctttcgt acgcacgcat gattccattc
5940ttaccccccc tcgagatcga gaggaaatat attgctattt tatacacaca cggcgccccc
6000agctatacgt cactgcttac gttaattccc ccaccggata gtagttgttt aatggcccaa
6060acaaaccttg ttgttgcatg catcatggac caaacaaaat acatagttag ttaaatatta
6120ctgttatata tacaactaat aataattata ttattagtta aaacaaagca aggcatatgc
6180agcagctgct ggtactaccc agtacatggc acatgcgttt gtttaatccc ctgttgctgt
6240gtgtgtgatt gattccttgt attagctaat aattagttag gtcggtcgtc gtctcccctc
6300taatccctct tcgatttaga attagtagtc ttgtacgttg tttaatatgc ttggacgacg
6360acgctctttg ttgggtgtgc acttcatctt tccatctaca ctagctagct agacacacat
6420gtactatagc tagctacttg ttttagtatg ctgctcttct aattaactaa ccaacatgat
6480tgcactgcta agcaaggcta cctttggtac ggtcttaaac tttgtgtggc ccatatgctg
6540ctatactata tcatgcatgt agattcttcc tgccaaggtg catggttttt ttatgttaat
6600aggtacggtt agttgtcgta gtacatacta aggcatcgat cgtccactta tatatatcaa
6660accctgcagc tcaaacaagc tgcaaataaa aaaaaaactg aagctggtat atgagtgtat
6720attgtatatg aaataataat gcatatgcgg ctgcatgcat cagggagctg agtcagatga
6780caggtgtagg tttgaagcag cttgctgtac gtgtgcaatt tttttctctc cataatgatg
6840tctcagattg gtgatctgat gacgctgtga ttattctatt ctattcatct ttggttgtag
6900acactccttt tcatttgtta atagttttct ggtccagttg atagatagag gttaaataaa
6960agccagttgt agtctacctt aactagtacg atagtacaac aggattggcc ggcggcgtta
7020gtaaatttat aatttcgtat acaagctgtt attgttatta catacactag ccggttactc
7080gtgcttttct atagttgtta tatattatat actcgaggcg tctagag
712747439DNAZea mays 47tattaaggct gcttctgagg gcccactcaa gggtattatg
ggctacgtgg aggaggatct 60ggtttccacc gacttcaccg gtgacagcag gtcgagcatc
ttcgacgcca aggccgggat 120tgccctgaac gaccacttca tcaagctcgt ctcttggtac
gacaacgagt ggggctacag 180caaccgcgtc gtcgacctga tccgccacat gttcaagacc
cagtagagag agatatttct 240gcctccctat cgagggtcgt ccccgatggc ctttggtcgc
agaccatctt tgctgcttgt 300ctatgctgag aataaatgtg aacggtgccc ctggacgctg
gatccatgct ggttttggac 360acggttgtct ttttgtgttt aacttatctg ctgccgtccg
tcctgtaacg aattcgctaa 420gttttagttc ttttgtgct
43948391DNAZea mays 48catgtccttg attattggtg
tctacgacga gccaatgact ccagggcaat gcaacatggt 60ggtggagagg ctcggcgatt
acctgatcga gcagggcttc taaaagttcg tcatgttctg 120ttttggtcat ttgggcacca
aagtttgcgc ctcatttggt tctgtaatcc gtgagctcgt 180gcatgtactt ggcgtattgc
atgcagtgaa taatttagct tgggtttgtt tgttgggggc 240agtgttgggg acggatttgg
attggggttt atgcttggca tcgcgtcgta tcgaaactca 300gctgctgttt cgctgagtaa
tgtacatttc cctggtaatg gtacttgtgg actctgatgc 360ttttatggga acgagtgcat
tttactgcaa a 39149136DNAZea mays
49attgggttac aagaattatg gcgtttgtca atatggtcgt aatgtcgtag gatggtggaa
60tgtggtcaca aactttgcgt atgttgggtc tactggtggt gtctgaatct atgtatggat
120gtcatgagtt tgtcta
1365063DNAZea mays 50ggtgtatccg cgttagaacc ttttgttggt gaacaatatt
atcgtggcac gcgttttaag 60taa
6351632DNAZea mays 51cgctgtgaat gacgagtgca
tgctcaagtt cggcgagctg cagtcgaaga ggctgcaccg 60cttcctaact ttcaagatgg
acgacaagtt caaggagatc gttgtggacc aggtcgggga 120tcgcgctacc agctacgagg
acttcacaaa cagcctcccc gagaatgact gccgatacgc 180gatctatgat ttcgactttg
tcactgcaga agatgtccag aagagcagga tcttctatat 240cctatggtcc ccatcctccg
ccaaggtgaa gagcaagatg ctttatgcaa gctcaaacca 300aaaattcaag agtgggctca
atggcattca ggtggaactg caggctactg atgcaagtga 360aatcagcctt gatgagatca
aggatcgggc tcgctaggca tcatgatcat gcatcatgga 420ctcggcctac tactgtggat
ttgtatgcca ttatagactt ggtgctgtga aagactgctt 480gatgatttgc gggtttgttg
ctgtgtaaaa aaaggtccca tggctcccag aagaccatga 540aggttcggat ctatcatgta
attccttgtt atctgccaat tatgtatgga ctatggacat 600gtgttgcgct gttcaactta
ctactacaaa ta 63252598DNAZea mays
52gggttgaact atgagcgccg tggcggtttc gtcgtcgctg aacccggacg cgccgctctt
60catcccggcg gcgctgctgc aggtggagga cttctcgccg cagtggtggg acctcatcac
120caccactgcc tggttccgcg accactggtc ccgcgagcgc gcccacctgg acgagatggc
180cgagcagatc gacgcggccg gcctcctccc cgacgacgag gacctcttct acgacgacca
240gctcgagcag ggccccgtcg ccgccgccct taagacagat tcggtgctca aggcgctgaa
300catgacctcc ccgaagggcg gcggcgacgc cccgcggggg ttccgggaga aacccaggaa
360cgccgagaag ccgaccaagt acgccggcag ccccaagagc agcgcccccc gcgtgatcca
420ccagcctcgc taggttcgct gggggaactc atcaggaagg ctgctgcccc tcttgcagcc
480ttgctcctgg ctgccgcccg ctgtcgtggt ctgctctttc aagtcgaagt aacggtggtt
540cgagctagtg gatagtgtgg ctcaactgta gaagttcctt ttgtatagca agcaagta
59853516DNAZea mays 53atggctgtcc gcatcatcaa gcataccctg gagatcatcc
acctgctcac cgatgccaac 60cccatccagg tcgtcgtcga cgcgatcatc aacagtggcc
cccgtgagga tgccacccgt 120attggttccg ctggtgttgt gaggaggcag gccgtggata
tctcacccct gaggagggtg 180aaccaggcca tctacctcct caccactggt gccagggaga
gtgctttccg gaacatcaaa 240accattgccg agtgccttgc agatgagctg atcaacgctg
ccaagggctc atccaacagt 300tacgccatca agaagaagga cgagattgag cgtgttgcca
aggccaaccg ttgaactgag 360cttgtatcct ggtgcactct gcgctggaaa cttttatgtc
gctggcagtc gtatcggttc 420ttgttttacc aatgtttaga gttttttgag acctatatgc
ggttttggtt ttcagtgcac 480aattaaaatt actgagtaat gtagttgatt gggaac
5165464DNAZea mays 54gtgttcggtg aaatcagagt
cgtcagtcat ctacatagct tttcttggtt gatagactgt 60tatt
6455195DNAZea mays
55ataaaatagc atgccgtctc tgtcactggc aatggacggt ggtgcctagc gcaactcagc
60gcacaactgt gtgtcttgat ttttcttctg tttatcacgg cattagtgcc atgccgtttt
120atgttacagt gttgtgtgct cgcaagcatc cgaaaatatg cgtctgagtt tagggttggg
180tcaaacttgt cgaat
1955652DNAZea mays 56gagaaccatc gcctgcattt cgatctgttt caccgcaatt
cgcattgtta gt 525790DNAZea mays 57ctatgttgta taaggctagt
gcagctgtgc aggttactct atattcttac tctatatcac 60tatttgtagt ctactcatca
attaataaat 9058549DNAZea mays
58tggtcaacgt gcacgcggtc cacagggacc ccgcggtgtg ggacgacccg gacaggttcg
60tgccggagcg gttcgagggc gccggcggca aggccgaggg gcgcctgctg aagccgttcg
120ggatggggcg gcgcaagtgc cccggggaga cgctcgcgct gcggaccgtc gggctggtgc
180tcgccacgct gctccagtgc ttcgactggg acacggttga tggagctcag gttgacatga
240aggctagcgg cgggctgacc atgccccggg ccgtcccgtt ggaggccatg tgcaggccgc
300gtacagctat gcgtggtgtt cttaagaggc tctgaaaacc tcatggatcg aattgctggc
360atcgtctgaa gggtgtatga cgtagcttcc gagttccgag catatatatt cacttgcctt
420gtactagttg attttcgccg agtgtatgga atggattttc tttttttttc ttgcaatgga
480tgtgaatttt gtttttctcg acgttacaag aagtgaatca acctagcttc tctttgagcg
540acagcaacg
54959538DNAZea mays 59cgacttgttt cattgattct tcaagagatc gagcttcttt
tgcaccacaa ggtcgaggat 60gtcttgcagc tgcggatcaa gctgcggctg cggctcaagc
tgcaagtgcg gcaagaagta 120ccctgacctg gaggagacga gcaccgccgc gcagcccacc
gtcgtcctcg gggtggcccc 180ggagaagaag gccgcgcccg agttcgtcga ggccgcggcg
gagtccggcg aggcggccca 240cggctgcagc tgcggtagcg gctgcaagtg cgacccctgc
aactgctgat cacatcgatc 300gacgaccatg gatgattatt atctatctag cttgtggtgg
tggttgaaca ataataagcg 360aggccgagct ggctgccata cataggtatt gtgtggtgtg
tgtgtgagag agagagaaac 420agagttcttc agtttgctat ctctctctgc atgtttggcg
tcagtctttg tgctcatgta 480cgtgtgtcta catgcatgtt ggttgatccg attgcgtctg
ctgtaaccat atattaat 53860228DNAZea mays 60tctacccgcc cgagaaggtc
tacgacttcg tctgcgggat gaagaagagg ctgggcatcg 60agtagagcat ccatcggtcg
gccggtggct ggccgggagt aataatgacg aaccaataat 120ctagttttgg ttttagtgtg
ctcagcagag cagttcgtgt tcatgagttc gtcgtcgttg 180tattttctat tgtcagcggt
ggcagcgccg tacgtgttgc ctcgtaca 22861737DNAZea mays
61ccgccgagct cgaccgcgtg attggggcac ggccgctggg tcacagagcg cgacctcccg
60gacctcccct acatcgacgc cgtcgtgaag gagacgatgc ggctgcaccc ggtcggcccg
120ctcctcgtcc cgcaccacgc ccgcgagcac acggtggtgg ccggctacga cgtccccgcc
180ggtgcgcgcg tgctggtgaa cgtgtgggcc atcgctcgcg accccgcgtc atggcctgac
240gcgcctgacg cgttccggcc ggagcggttc ttgaacggca gctccggcgc cagcgtcgac
300gtgcgcggcg cgcactttga gctgctgccg ttcggggccg ggcggcggat gtgccccgcg
360cacggcctcg cgatgaagct ggtgaccgct ggcgtggcga acctggtgca cgggttcgcg
420tggcggctgc cggacggtat ggcgccggag gatgtgagca tggaggagct atttgggctt
480tccacgcgcc ggaaggttcc gctcgtcgcc gtcgcggagc ccaggctgcc ggcgcacctc
540tacactaatg tcacgccgcc acagcaggtc gcgggctcca cgattgcgaa cttgtccacc
600aggccggagt acaagctcgt gttctgaatc attcaccgcc actaaaaata aagcaggaaa
660aactacactt cctgcgtgct agacgtccgg gcggaacaca acagtgcttg ctcacgttct
720tctattggtt gtactaa
73762141DNAZea mays 62gcgcaatcgt atcgtacgtg catgatacgc atacatctgg
aaactactat accaatgcaa 60acagagatct atacgtacga gtatgtataa cgacgagtga
tgtttgtatg gatctacgta 120tgtaacaagg acctctcgta g
14163672DNAZea mays 63ctccaagcac ttgttagccg
gcgtacagca agaagaacct cggacgcgac cgacatggtc 60gctctctcag gcgctcacac
aatcgggcag gcccagtgct cgagcttcaa cggccacatc 120tacaacgaca cgaacatcaa
cgcggccttc gcgacgtcgc tcaaggccaa ctgccccatg 180tccggcggca gcagcctggc
gccgctggac accatgaccc cgaccgtgtt cgacaacgac 240tactacaaga acctgctgtc
gcagaagggg ctgctgcact cggaccagga gctgttcaac 300aacggcagca ccgacagcac
ggtcagcaac tttgcgtcca gctcggccgc cttcaccagc 360gccttcacgg cggccatggt
gaagatgggg aacctcggcc cgctcaccgg gaccagtggg 420cagatcaggc tcacctgctg
gaagctcaac tcgtcctaat aattaaggac ggacgtccga 480tagacgatcc tgcgcaatcg
tatcgtacgt gcatgatacg catacatctg gaaactacta 540taccaatgca aacagagatc
tatacgtacg agtatgtata acgacgagtg atgtttgtat 600ggatctacgt atgtaacaag
gacctctcgt agcgcaaagg cgcgcgttgg gagattaatt 660aggtacacaa gc
6726483DNAZea mays
64tacgtatact aaagacctta ctaggtacct cgcgtgattg ttgttcaagt gtactagcta
60ccaagctagt gacaagaatg ttg
8365640DNAZea mays 65tgaggttgcg acagcgtggc taaacaacaa tagcgtcaga
tccgctatcc atgccgaacc 60agtcagttca atcggaccct gggaattatg cacggataaa
ctggattttg atcatgatgc 120cggcagcatg atcatctatc acaagaacct cacgagtcag
ggctaccgtg ctttcatcta 180cagcggcgac catgacatgt gtgtacctta caccgggact
gaagcatgga ctgcgtcttt 240aggctacgcc gtcgttgatc cgtggcgaca gtggattgtc
gacgaacaag ttgccgggta 300cacccaagga tatgaaaagg gccttacttt tgccactatt
aagggtgctg ggcacacagt 360tcctgagtac aaaccacagg aagcactagc tttctacagc
cgttggcttg ccggtgctaa 420actgtgagga ggcctatttt gtgtgcaaag gtcatgcagt
actgaatcaa acagaagttg 480gataaagcat gcagcaataa ggcagtcgaa ggatcaaagt
atccaacgcg ccaactacaa 540tgttgcattc attttcacat gttataccaa tgcagttgct
aattacctgc attgttcatg 600agttcacagt ccatctaatt ggttgaccac accgtcctat
64066125DNAZea mays 66tatcactctc attgtggcta
catatctata tctctgaggc caaatgcttg ggtgtccagt 60actaattaat aataattcag
tgcgtatgca agatttgtgg gcaaatattg gtttacgatt 120tcgga
12567136DNAZea mays
67gcaccacctt ggtttgagca aacgcgcggc gccgtgtttt ggcatctgtc accgtaggtg
60ggcggggata cagtgaagtg ataatgcgct tgtgttaggc gcatgtatat atataataat
120tagatggata cccgtg
13668142DNAZea mays 68gcaccacctt ggtttgagca aacgcgcggc gccgtgtttt
ggcatctgtc accgtaggtg 60ggcggggata cagtgaagtg ataatgcgct tgtgttaggc
gcatgtatat atataataat 120tagatggata cccgtgcgtt ac
14269154DNAZea mays 69gagcggccgc ggatctgttc
aagaaccacg acctcgcttt cgcctcccgc ccacgcagcg 60tgggagggga taagctgatg
tatgagtgca gcaacgtgtc gttcgcgcct tacggcgaga 120actggcgccg gggcaagaag
atcgctgtgg tcca 15470289DNAZea mays
70gagcggccgc ggatctgttc aagaaccacg acctcgcttt cgcctcccgc ccacgcagcg
60tgggagggga taagctgatg tatgagtgca gcaacgtgtc gttcgcgcct tacggcgaga
120actggcgccg gggcaagaag atcgctgtgg tccacctcct ctctccacgg cgcgtggaat
180cgttcgcgcc cgtaagggcc gccgaggtag ccgcgctcgt cgcacggaca cgccgcaccg
240cggaggctgg ggaggccgtg gagttgaggg agctcctgaa cggctacgc
28971686DNAZea mays 71gtagccaggc tctttttgca agatcagact cgaggcatca
caaaccacat cgttgggaca 60ttcggctaca tgtctcccga gtatgtgatg cgtggacaat
actccataaa atctagatgt 120atttagtttc ggcatccttg ttatagagat tgtaacagga
caaaagaaca atgggcatta 180cttcgacgag caaaacgagg atgttgtgag cattgtatgg
aagcactgga gcgagggaac 240acttgcagag attatagatg attctttagg gagaaactac
tcagagactg aggtgctaaa 300atgtgttaac attggcttgt ggtgccttca acagaatcca
atggaccgac ctacaatgtc 360agatgtcatg gtgatgctca atgatgatga tactagttct
ctacctgctg ctgcaaaacc 420aacttttttc ttggatgcaa gctcaggcta ctcttacacc
tcgggcacca tttcacatcc 480ttctgcaagg tagtgtaggc taaggcctaa tgcacacctt
tatatgaata tcgacatatt 540gttgcttgtt tgtttcttat tgtgtattgg ttgaaagaaa
catggaattc accctgaatt 600gtaatagctt gtgctcatta ttagtttctt ccaaatcctc
aaatataaat tttctcttac 660tagatgtcct acaagctttc agaaag
68672337DNAZea mays 72tcaccaccat cctgcgcaag
aagatgggcg acgcgcagct cgtcgaggtc gccgaggaca 60agaagaagga ggagaagaag
cccgaccccg tcgccgaagc tgcggcggcg tactacaacc 120agtactacta ccactaccca
ccgccggccg ccgtcgttta cgacccctac ccacggccgg 180gcaacacctg ctccataatg
tagactcagc ctgtggacat atgcaagtta agttttgtgt 240gtagcggtgc gtgtgtgggg
gaggcgcgca agtgtagttt ctatacggaa ttcttctctt 300atctcccttt tgaggttaag
ggcatgtgca gtcccag 33773186DNAZea mays
73ggttccgcgg ccagtagctg ctgcttgggg ctggtgcacg acctgacgcg ctgcttggcc
60acgctgggca ccgccctcca ctaccgtggt tactacaatg gttgacgttg taacgcggga
120agcttggaaa ttatgcgtgc atagccatag catcggcact ctggagatgg atctcccagc
180tctgaa
18674187DNAZea mays 74accaccgccg ctgagaatcg aagaagccac actgtaaatc
tgccgggaag cggctggtgg 60catccggccc gctcctccct ccgggcgccg caactttttt
cgatcggttt tgcgccgccc 120gggacgggtt gtagttgatc gattggattc ttcataactg
tatttgcgta ctgcttacac 180tacccaa
18775238DNAZea maysvariation184nucleotide a at
this position can be substituted with any nucleotide c, g, or t
75tggtcgttgg gtccgggtgc cacggcgggg accagaccgt gtacgtgctc cgcgaggagg
60gcgggagacc tgcgtcctgg tcgcgcgcgc cgccgccgcc gccggagttc gccgggcacg
120tgcaggcctc ctacttcctt gaactctgaa ctctgaagtg gagggtgtgt acctacacgt
180accagtggtg gctgtgcata catgacggaa ctacgctacc gtacttgttg tgccactg
23876169DNAZea mays 76cttgtttcat tgattcttga agagatcgag cttcttttgc
accacaaggt cgagatgtct 60tgcaactgcg gtggcaactg caagtgcgac ccctgcaact
gctgatcaca tcgatcgacg 120accatggata tgattattat ctatctagct tgtggtggtg
gttgaacaa 16977114DNAZea mays 77cgagaacgat ttcgcaggtg
tatcagtgta gtatgtatag ccgtatagca agtgcgcatc 60tcatctcgtg tacgtgaaat
tagttggtta ggacgaacag cagcgtgtga tgtt 11478916DNAZea mays
78gccattcggc gccacgattg cagagccaga gcgagacgcg actgcttttc tgcttcatcc
60acattggtag ctagctagct tacacgttca cgcatcgctt tccgggccgt ctccggtggt
120ttagctcagc agagcgggga aggaagaaga tgacctccgt gagcgcgagg cccgttggcg
180tggggtactg cttcggcggg gcgaggtgcc agccacggtc gcgggtgcgg gtttcggccg
240cggcctcggc agtggccgcg cccgcgcccg cgatggcggc gacgatgtac gagctgctcg
300ccgtcgagga gacggcgggg cccgacgaga tcaaggcggc gtaccggcgc gccgcgcggc
360ggtggcaccc ggacgcgtgc cccggcggcg ccgaccgctt catggcggcg cgggaggcct
420acgaggtgct gtccgacccc gagcgcaggc gcggctacga catccagctc cgctgcggcg
480cccacttcgg cgacgccggg taccgcgcgg cacgccgcgc cgggttcgcc gactgggagg
540cgcagctgac cgggctgcag tggcgcgcgg cggggcggcg cgggcgcgcc ggcggggaga
600cttggggcag caggatgcgc caggcggccg cgcagccgtc cttgtagcgg cgtcgccggt
660ggctggcctt tgatagttca tacttcgtag tactagtgta ctaccctacc ttcccctttc
720ctcttcgaca atcgaatggc ccgagaagct gtaattgcgc tgttctgcag cgttttctct
780tgccaacacg tcatcctcgt cgcactgttc ggagtgcaga cgagcttgaa gtctagaagc
840agtagacatt ttccccccct ttgaagtgta gtactgtcaa cttttagttc ccactcggtt
900acatacggtt cgaatc
91679446DNAZea mays 79tgctccatga agaagtcggt ccacccaatc tcgctgcggc
gggcgtctgt agagcctgcg 60ttacgtgtac ggcgcgtgta cgtatacggc cgtagcgtac
atgctcgcct ttgcactcag 120atgcacaata taacacacag tcacacacac acacacacac
acacgacaca cgctgtatac 180actggatcct aggtgttttt ttagcttagc taggaatgca
aatttcttga ttcgttggag 240ggtttttttt ctagcacgcg gcgcggccgg tgcccatctg
tctcgcaccg tcgcacgcct 300cttcatacac tctctcctgt actcggctac tagtgctact
gcatgtagac atgtagtgaa 360tgtgaagtac aaagaataca atacacggag tatagtagtg
tagtcttgta tgcatatgta 420aactactata ctctgtttta cgaaat
446809651DNAArtificial sequenceVector 15289
80aattaattcc tgtggttggc atgcacatac aaatggacga acggataaac cttttcacgc
60ccttttaaat atccgattat tctaataaac gctcttttct cttaggttta cccgccaata
120tatcctgtca aacactgata gtttaaactg aaggcgggaa acgacaatct gatcatgagc
180ggagaattaa gggagtcacg ttatgacccc cgccgatgac gcgggacaag ccgttttacg
240tttggaactg acagaaccgc aacgctgcag gaattggccg cagcggccat ttaaatcaat
300tgggcgcgcc agctgcttgt ggggaccaga caaaaaagga atggtgcaga attgttaggc
360gcacctacca aaagcatctt tgcctttatt gcaaagataa agcagattcc tctagtacaa
420gtggggaaca aaataacgtg gaaaagagct gtcctgacag cccactcact aatgcgtatg
480acgaacgcag tgacgaccac aaaactcgag acttttcaac aaagggtaat atccggaaac
540ctcctcggat tccattgccc agctatctgt cactttattg tgaagatagt ggaaaaggaa
600ggtggctcct acaaatgcca tcattgcgat aaaggaaagg ctatcgttga agatgcctct
660gccgacagtg gtcccaaaga tggaccccca cccacgagga gcatcgtgga aaaagaagac
720gttccaacca cgtcttcaaa gcaagtggat tgatgtgata tctccactga cgtaagggat
780gacgaacaat cccactatcc ttcggtaccg gaccgcgatc gcttaattaa gcttgcatgc
840ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga taatgagcat tgcatgtcta
900agttataaaa aattaccaca tatttttttt gtcacacttg tttgaagtgc agtttatcta
960tctttataca tatatttaaa ctttactcta cgaataatat aatctatagt actacaataa
1020tatcagtgtt ttagagaatc atataaatga acagttagac atggtctaaa ggacaattga
1080gtattttgac aacaggactc tacagtttta tctttttagt gtgcatgtgt tctccttttt
1140ttttgcaaat agcttcacct atataatact tcatccattt tattagtaca tccatttagg
1200gtttagggtt aatggttttt atagactaat ttttttagta catctatttt attctatttt
1260agcctctaaa ttaagaaaac taaaactcta ttttagtttt tttatttaat aatttagata
1320taaaatagaa taaaataaag tgactaaaaa ttaaacaaat accctttaag aaattaaaaa
1380aactaaggaa acatttttct tgtttcgagt agataatgcc agcctgttaa acgccgtcga
1440cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc gtcgggccaa gcgaagcaga
1500cggcacggca tctctgtcgc tgcctctgga cccctctcga gagttccgct ccaccgttgg
1560acttgctccg ctgtcggcat ccagaaattg cgtggcggag cggcagacgt gagccggcac
1620ggcaggcggc ctcctcctcc tctcacggca ccggcagcta cgggggattc ctttcccacc
1680gctccttcgc tttcccttcc tcgcccgccg taataaatag acaccccctc cacaccctct
1740ttccccaacc tcgtgttgtt cggagcgcac acacacacaa ccagatctcc cccaaatcca
1800cccgtcggca cctccgcttc aaggtacgcc gctcgtcctc cccccccccc cctctctacc
1860ttctctagat cggcgttccg gtccatggtt agggcccggt agttctactt ctgttcatgt
1920ttgtgttaga tccgtgtttg tgttagatcc gtgctgctag cgttcgtaca cggatgcgac
1980ctgtacgtca gacacgttct gattgctaac ttgccagtgt ttctctttgg ggaatcctgg
2040gatggctcta gccgttccgc agacgggatc gatttcatga ttttttttgt ttcgttgcat
2100agggtttggt ttgccctttt cctttatttc aatatatgcc gtgcacttgt ttgtcgggtc
2160atcttttcat gctttttttt gtcttggttg tgatgatgtg gtctggttgg gcggtcgttc
2220tagatcggag tagaattctg tttcaaacta cctggtggat ttattaattt tggatctgta
2280tgtgtgtgcc atacatattc atagttacga attgaagatg atggatggaa atatcgatct
2340aggataggta tacatgttga tgcgggtttt actgatgcat atacagagat gctttttgtt
2400cgcttggttg tgatgatgtg gtgtggttgg gcggtcgttc attcgttcta gatcggagta
2460gaatactgtt tcaaactacc tggtgtattt attaattttg gaactgtatg tgtgtgtcat
2520acatcttcat agttacgagt ttaagatgga tggaaatatc gatctaggat aggtatacat
2580gttgatgtgg gttttactga tgcatataca tgatggcata tgcagcatct attcatatgc
2640tctaaccttg agtacctatc tattataata aacaagtatg ttttataatt attttgatct
2700tgatatactt ggatgatggc atatgcagca gctatatgtg gattttttta gccctgcctt
2760catacgctat ttatttgctt ggtactgttt cttttgtcga tgctcaccct gttgtttggt
2820gttacttctg cagggatccc cgatcatgca aaaactcatt aactcagtgc aaaactatgc
2880ctggggcagc aaaacggcgt tgactgaact ttatggtatg gaaaatccgt ccagccagcc
2940gatggccgag ctgtggatgg gcgcacatcc gaaaagcagt tcacgagtgc agaatgccgc
3000cggagatatc gtttcactgc gtgatgtgat tgagagtgat aaatcgactc tgctcggaga
3060ggccgttgcc aaacgctttg gcgaactgcc tttcctgttc aaagtattat gcgcagcaca
3120gccactctcc attcaggttc atccaaacaa acacaattct gaaatcggtt ttgccaaaga
3180aaatgccgca ggtatcccga tggatgccgc cgagcgtaac tataaagatc ctaaccacaa
3240gccggagctg gtttttgcgc tgacgccttt ccttgcgatg aacgcgtttc gtgaattttc
3300cgagattgtc tccctactcc agccggtcgc aggtgcacat ccggcgattg ctcacttttt
3360acaacagcct gatgccgaac gtttaagcga actgttcgcc agcctgttga atatgcaggg
3420tgaagaaaaa tcccgcgcgc tggcgatttt aaaatcggcc ctcgatagcc agcagggtga
3480accgtggcaa acgattcgtt taatttctga attttacccg gaagacagcg gtctgttctc
3540cccgctattg ctgaatgtgg tgaaattgaa ccctggcgaa gcgatgttcc tgttcgctga
3600aacaccgcac gcttacctgc aaggcgtggc gctggaagtg atggcaaact ccgataacgt
3660gctgcgtgcg ggtctgacgc ctaaatacat tgatattccg gaactggttg ccaatgtgaa
3720attcgaagcc aaaccggcta accagttgtt gacccagccg gtgaaacaag gtgcagaact
3780ggacttcccg attccagtgg atgattttgc cttctcgctg catgacctta gtgataaaga
3840aaccaccatt agccagcaga gtgccgccat tttgttctgc gtcgaaggcg atgcaacgtt
3900gtggaaaggt tctcagcagt tacagcttaa accgggtgaa tcagcgttta ttgccgccaa
3960cgaatcaccg gtgactgtca aaggccacgg ccgtttagcg cgtgtttaca acaagctgta
4020agagcttact gaaaaaatta acatctcttg ctaagctggg agctcgatcc gtcgacctgc
4080agatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc
4140gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg
4200catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata
4260cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc
4320tatgttacta gatctgctag ccctgcagga aatttaccgg tgcccgggcg gccagcatgg
4380ccgtatccgc aatgtgttat taagttgtct aagcgtcaat ttgtttacac cacaatatat
4440cctgccacca gccagccaac agctccccga ccggcagctc ggcacaaaat caccactcga
4500tacaggcagc ccatcagaat taattctcat gtttgacagc ttatcatcga ctgcacggtg
4560caccaatgct tctggcgtca ggcagccatc ggaagctgtg gtatggctgt gcaggtcgta
4620aatcactgca taattcgtgt cgctcaaggc gcactcccgt tctggataat gttttttgcg
4680ccgacatcat aacggttctg gcaaatattc tgaaatgagc tgttgacaat taatcatccg
4740gctcgtataa tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagaccatga
4800gggaagcgtt gatcgccgaa gtatcgactc aactatcaga ggtagttggc gtcatcgagc
4860gccatctcga accgacgttg ctggccgtac atttgtacgg ctccgcagtg gatggcggcc
4920tgaagccaca cagtgatatt gatttgctgg ttacggtgac cgtaaggctt gatgaaacaa
4980cgcggcgagc tttgatcaac gaccttttgg aaacttcggc ttcccctgga gagagcgaga
5040ttctccgcgc tgtagaagtc accattgttg tgcacgacga catcattccg tggcgttatc
5100cagctaagcg cgaactgcaa tttggagaat ggcagcgcaa tgacattctt gcaggtatct
5160tcgagccagc cacgatcgac attgatctgg ctatcttgct gacaaaagca agagaacata
5220gcgttgcctt ggtaggtcca gcggcggagg aactctttga tccggttcct gaacaggatc
5280tatttgaggc gctaaatgaa accttaacgc tatggaactc gccgcccgac tgggctggcg
5340atgagcgaaa tgtagtgctt acgttgtccc gcatttggta cagcgcagta accggcaaaa
5400tcgcgccgaa ggatgtcgct gccgactggg caatggagcg cctgccggcc cagtatcagc
5460ccgtcatact tgaagctagg caggcttatc ttggacaaga agatcgcttg gcctcgcgcg
5520cagatcagtt ggaagaattt gttcactacg tgaaaggcga gatcaccaaa gtagtcggca
5580aataaagctc tagtggatct ccgtacccgg ggatctggct cgcggcggac gcacgacgcc
5640ggggcgagac cataggcgat ctcctaaatc aatagtagct gtaacctcga agcgtttcac
5700ttgtaacaac gattgagaat ttttgtcata aaattgaaat acttggttcg catttttgtc
5760atccgcggtc agccgcaatt ctgacgaact gcccatttag ctggagatga ttgtacatcc
5820ttcacgtgaa aatttctcaa gcgctgtgaa caagggttca gattttagat tgaaaggtga
5880gccgttgaaa cacgttcttc ttgtcgatga cgacgtcgct atgcggcatc ttattattga
5940ataccttacg atccacgcct tcaaagtgac cgcggtagcc gacagcaccc agttcacaag
6000agtactctct tccgcgacgg tcgatgtcgt ggttgttgat ctagatttag gtcgtgaaga
6060tgggctcgag atcgttcgta atctggcggc aaagtctgat attccaatca taattatcag
6120tggcgaccgc cttgaggaga cggataaagt tgttgcactc gagctaggag caagtgattt
6180tatcgctaag ccgttcagta tcagagagtt tctagcacgc attcgggttg ccttgcgcgt
6240gcgccccaac gttgtccgct ccaaagaccg acggtctttt tgttttactg actggacact
6300taatctcagg caacgtcgct tgatgtccga agctggcggt gaggtgaaac ttacggcagg
6360tgagttcaat cttctcctcg cgtttttaga gaaaccccgc gacgttctat cgcgcgagca
6420acttctcatt gccagtcgag tacgcgacga ggaggtttat gacaggagta tagatgttct
6480cattttgagg ctgcgccgca aacttgaggc agatccgtca agccctcaac tgataaaaac
6540agcaagaggt gccggttatt tctttgacgc ggacgtgcag gtttcgcacg gggggacgat
6600ggcagcctga gccaattccc agatccccga ggaatcggcg tgagcggtcg caaaccatcc
6660ggcccggtac aaatcggcgc ggcgctgggt gatgacctgg tggagaagtt gaaggccgcg
6720caggccgccc agcggcaacg catcgaggca gaagcacgcc ccggtgaatc gtggcaagcg
6780gccgctgatc gaatccgcaa agaatcccgg caaccgccgg cagccggtgc gccgtcgatt
6840aggaagccgc ccaagggcga cgagcaacca gattttttcg ttccgatgct ctatgacgtg
6900ggcacccgcg atagtcgcag catcatggac gtggccgttt tccgtctgtc gaagcgtgac
6960cgacgagctg gcgaggtgat ccgctacgag cttccagacg ggcacgtaga ggtttccgca
7020gggccggccg gcatggccag tgtgtgggat tacgacctgg tactgatggc ggtttcccat
7080ctaaccgaat ccatgaaccg ataccgggaa gggaagggag acaagcccgg ccgcgtgttc
7140cgtccacacg ttgcggacgt actcaagttc tgccggcgag ccgatggcgg aaagcagaaa
7200gacgacctgg tagaaacctg cattcggtta aacaccacgc acgttgccat gcagcgtacg
7260aagaaggcca agaacggccg cctggtgacg gtatccgagg gtgaagcctt gattagccgc
7320tacaagatcg taaagagcga aaccgggcgg ccggagtaca tcgagatcga gctagctgat
7380tggatgtacc gcgagatcac agaaggcaag aacccggacg tgctgacggt tcaccccgat
7440tactttttga tcgatcccgg catcggccgt tttctctacc gcctggcacg ccgcgccgca
7500ggcaaggcag aagccagatg gttgttcaag acgatctacg aacgcagtgg cagcgccgga
7560gagttcaaga agttctgttt caccgtgcgc aagctgatcg ggtcaaatga cctgccggag
7620tacgatttga aggaggaggc ggggcaggct ggcccgatcc tagtcatgcg ctaccgcaac
7680ctgatcgagg gcgaagcatc cgccggttcc taatgtacgg agcagatgct agggcaaatt
7740gccctagcag gggaaaaagg tcgaaaaggt ctctttcctg tggatagcac gtacattggg
7800aacccaaagc cgtacattgg gaaccggaac ccgtacattg ggaacccaaa gccgtacatt
7860gggaaccggt cacacatgta agtgactgat ataaaagaga aaaaaggcga tttttccgcc
7920taaaactctt taaaacttat taaaactctt aaaacccgcc tggcctgtgc ataactgtct
7980ggccagcgca cagccgaaga gctgcaaaaa gcgcctaccc ttcggtcgct gcgctcccta
8040cgccccgccg cttcgcgtcg gcctatcgcg gccgctggcc gctcaaaaat ggctggccta
8100cggccaggca atctaccagg gcgcggacaa gccgcgccgt cgccactcga ccgccggcgc
8160tgaggtctgc ctcgtgaaga aggtgttgct gactcatacc aggcctgaat cgccccatca
8220tccagccaga aagtgaggga gccacggttg atgagagctt tgttgtaggt ggaccagttg
8280gtgattttga acttttgctt tgccacggaa cggtctgcgt tgtcgggaag atgcgtgatc
8340tgatccttca actcagcaaa agttcgattt attcaacaaa gccgccgtcc cgtcaagtca
8400gcgtaatgct ctgccagtgt tacaaccaat taaccaattc tgattagaaa aactcatcga
8460gcatcaaatg aaactgcaat ttattcatat caggattatc aataccatat ttttgaaaaa
8520gccgtttctg taatgaagga gaaaactcac cgaggcagtt ccataggatg gcaagatcct
8580ggtatcggtc tgcgattccg actcgtccaa catcaataca acctattaat ttcccctcgt
8640caaaaataag gttatcaagt gagaaatcac catgagtgac gactgaatcc ggtgagaatg
8700gcaaaagctc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg
8760cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg
8820gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga
8880aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg
8940gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag
9000aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc
9060gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg
9120ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt
9180cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc
9240ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc
9300actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg
9360tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca
9420gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc
9480ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat
9540cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt
9600ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttgatccg g
96518121593DNAArtificial sequenceZmABP-948-binary 81ttcctgtggt tggcatgcac
atacaaatgg acgaacggat aaaccttttc acgccctttt 60aaatatccga ttattctaat
aaacgctctt ttctcttagg tttacccgcc aatatatcct 120gtcaaacact gatagtttaa
actgaaggcg ggaaacgaca atctgatcat gagcggagaa 180ttaagggagt cacgttatga
cccccgccga tgacgcggga caagccgttt tacgtttgga 240actgacagaa ccgcaacgct
gcaggaattg gccgcagcgg ccatttaaat caattgggcg 300cgccagctgc ttgtggggac
cagacaaaaa aggaatggtg cagaattgtt aggcgcacct 360accaaaagca tctttgcctt
tattgcaaag ataaagcaga ttcctctagt acaagtgggg 420aacaaaataa cgtggaaaag
agctgtcctg acagcccact cactaatgcg tatgacgaac 480gcagtgacga ccacaaaact
cgagactttt caacaaaggg taatatccgg aaacctcctc 540ggattccatt gcccagctat
ctgtcacttt attgtgaaga tagtggaaaa ggaaggtggc 600tcctacaaat gccatcattg
cgataaagga aaggctatcg ttgaagatgc ctctgccgac 660agtggtccca aagatggacc
cccacccacg aggagcatcg tggaaaaaga agacgttcca 720accacgtctt caaagcaagt
ggattgatgt gatatctcca ctgacgtaag ggatgacgaa 780caatcccact atccttcggt
accggaccct atagaatagc tcactatcct atttattata 840gtttaagtat atagccaata
ttttaaattt actatttatt aaattctagg gaagatagtc 900tcaattcata actttattat
aatacgtttg aaattttaaa tctttaggaa attttcttaa 960ttcacctaga tacgattctg
gagtgttaca agctgcgaat atactggtgc cattgagtat 1020acataaatgg atttaggtgg
tgctcaatag gtgaaaatga gatactaatc acttaaattt 1080caaaatttct atggtgccac
tgtactcgga taggtctatc tagggctgga caaaatgctc 1140gtggctcgct ggctcgctcg
tttcgtggtc agctcggctc ggctcggatc ggctcatttg 1200aattttgtca cgagctgagc
tgacattcta gctcggttcg ttaacgagcc agctcgcgag 1260ctaaacgagc taccatattc
tagtaaaacg aaattatatt catatcattt atagaataat 1320tgatgaacat gttatatata
tgtgagatgt ctatggccta tgaattaaac taatgattaa 1380tgaactatgc ctatgtgtta
atttggtcta tgcaaatata attatgggtt aaactgatga 1440acatgcatgt gaattgtgaa
ttaatgagtg atgaattgtg ctaatttggt gttatattga 1500catggtttgt gaaactatga
gtataattac tattttctat tgttaaatta gtttgaaatt 1560aactaaaaaa taattattat
atacatttta ttttttttct gctctggctc gcgagctaaa 1620cgagccagct cgacctcgta
aacgagccga gccgagctga ctctgtggct cgttacctta 1680acgagccgag ccgagctggc
tcgttagctt aacgagccag ctcgaactcg gacgagccga 1740gccgagctgg ctcgttatcc
acccctaggt ctatctagct tctgatgttt gcaaacctta 1800gagttggagt gttcagccag
ctactccttt gctttgctga ataaccatac caaacacgcc 1860catattaata cccgctcggc
ggtggttctg caatcaaacg caggccgcag tcgcgtgcgg 1920aactagaggt ccttcagaga
agtgccgtgc cagtgccacc gccggccgca tcatcgttcc 1980gcccccctgg tacgagcact
tcgcagagct gcaacctaca tcccttttac ataaatctat 2040tgtctcgtat tgccgttgac
gccggaatag tcttcgcatc ccttttacat aaatccgatg 2100ttttctttct ccgattcctt
tgaggaatca tcacgggtca gggcaggtgt tctgccgttt 2160gcccttttct ttatattctc
cttagaagaa atatttagtt ggaggctgga catagccgga 2220ggagctaact aatcgagcgg
tgtactggca aaacaaaagg agcggagcaa gaaaggggag 2280aaaaaactag ccactgccgg
agcgctattg gccgtgttgg gcctggaagc ttgcatcaat 2340acttccctcg ccccgatttg
gttccaaaat catacaagtc ccaaagttgt caagatattg 2400gaggtatgca agcgacttgg
atctcaaaat agaagaaatt tcggatctga gcacaaatct 2460gagttgaaaa aactgcaact
caaaatcatc aaaaaaagaa gaagaaagaa acgaatatat 2520tcgctcctct tctcagccga
acccaaagga attgaatcca aaccctgggt aggcagacag 2580tgagatatgg aggagagcag
gaggcgaaca agagaggctg cggccacgaa tatctcacga 2640acaagcacat catgggtcca
cggagcgggc agggtgacgg gctcccgacg gcgagctaca 2700tctcggaaga gcaccagggc
agcatgtcgt gttgggcagg ttggccgtct ggcggacggc 2760ggacggtgac tcgtggtcag
ggtgcacctg ctcgattaag gcgcctgact actcatgtct 2820tcgtctcttt gcttgtgttt
gctatatgct gctcgtacct catgagcata ctaagttgac 2880tgctcagtct gctgagtctg
tttttctagg gtatagtgct gagcacaagg gatatcattg 2940ttgggatatg attgctcgtt
ggatgagggt ctcttgggat gttgtctttg atgaggctca 3000ttctttttat tcttgtcctt
ctttcgatgc tttgtcaaca tccttggttg atcccatctc 3060ttttctatat tttctagatg
cccgtgttac tattggacct gcctcacgct tggtgcgccc 3120acgatagtag ccttagctcc
ttctgacatg ttcatctctc tttcggtgcc ttcctttgtg 3180gtgccttcta tagtgttttc
tttggagcct gctgctttag cccctgacta cgctatgaac 3240acttgtctac acccgccggg
tcatcaattc ttttggtaca ccatcatcct ctcatgcgtt 3300gccctcttat gatgtgcgct
cttctgcaac tcattcattt tcttgcgatt tacctttgac 3360tgatgctccc tattcatctc
tggatccagc ttcctcagtt gactctttgc tggagccacc 3420tcttagacgg agtcatcgtt
ttcgtcagcc acctaatggg tactctcctt caggtttagt 3480cgctaccgtt ctttctgagc
tgacttctta tcatgatgct attcttcatc tgtaacgaca 3540acatgcgatt tctgaggaga
ttgctactct tgagcgcact agcacgttgg aacttgttcc 3600ttgtccatca cgtgtttgtc
ctatcaccag tatgtgggtc tataaggtca agacccgttc 3660tgatggttct cttgatcgct
ataaatctcg tctagttgcc caaggcttcc agtaggaaca 3720tggttgtggc tatgatgaga
tttttgcacc tgttgctcat atgaccactg ttcgcactct 3780tcttgctatg gcctctgttc
gtgcgtggtc catctctcat cttgatgtca agaatacctt 3840tcttgatggt aagctacttg
agttctatat gtagccatcg cctaggtatt ctatttctgc 3900ttgtatggtt tgttgtcttc
gccgttcccc ttatggcctc aagcaggctc cacattcttg 3960gtttcagctc tttgcttcta
tgataactgt tgttggtttt tctaccagta atcatggtcc 4020tgcactcttt gtgtactacc
tcctctcggg gtcggactct tctttatgtt gatgatataa 4080ttatcactgg agataacctt
gagtatgttg actttgttaa ggcacgtctt agttatcatt 4140ttctcatgtc tgatcttggt
cctctgtgtt actttcttgg gacaaaggtt tcttctttgt 4200ctcagggcct ttatctatct
caagaggagt acattcaaga ttttcttcat cgggcttctc 4260ttaccgatca ctagattgtt
gagactccca agcagctcaa tcttcacctt agtgccgatg 4320atggcgagtc ttttcccgac
catactcgtt atcgtcaaca tactgtagga agttttgttt 4380atctctgtgt cactcgtctt
gacatttcat atgttgtgtg tatcctgagt tagtttgctt 4440cagatcccat ccaggtacac
tatagtcact tgctttgtgt cctacaatat ctttgtggaa 4500ccatatctag atgtatgttc
tttccacatt ctagctcgtt gcaactgcaa tcttgttctg 4560atgctacttg ggctagtgat
tttttcgata gttggtctct ttctcaatat tgtgtttttc 4620ttggtggttc tctcattgct
cggaagacta agtagcaggt agcagtttct cgtttgagta 4680ccgaggctga gttgcgtgct
atggcccttg tgactgcaga ggttacttgg ttacgatagt 4740tgcttgagga ttttcatgtt
tctgtttcca tgacgactcc ttttgtctga cagtacaggt 4800gttatcagta ttgctcgtga
tgcggtgaag catgaggtca ccaagcatat tggagttgat 4860gtttcgtata cacgagctga
agtctaggat gatgttatct tgatttggta tgtgccttta 4920gagcttcagt tggctaattt
cttcacgagg gcacaggctc gcgctgagca taaatttttc 4980ctctcaaaac tcagtgttat
agatccacct tgagtttgag ggagtattag atagatatgg 5040gtttatttgt atttttccat
tttataaggg tattagatag ataggcaacg actgctatgc 5100aagtagtcat tctgtgcaag
cgtgcaagca aaccatctga tccattatat cgtgatccaa 5160ccgtgggtca catttaacac
ttaaaccctt ccaccaccaa ctcaataatc tttataaaaa 5220aacccctaac aaacaatggt
tatatctgtg gttggatcgt aatctaatag atcagatggt 5280ttgcttgtac gcttgcacag
aatgactgct tgcatagcag ttgttgccta gatagatatg 5340ggtttatttg tatttttctc
ttaagggttt ttgtgtatat ttgtactcat gtacctatat 5400atttgtgcta gttgacccca
taatgaatag acctgctatt cataatattt gcaaaccatg 5460aaaatttgat tattacgaac
tatccaaata ctcgaacaca tgggcattat agctcacaaa 5520aatggaaggt tgagctgctg
cttgaagaac ctcaacatct ttgaacaaca acctcaacga 5580aacttgtata tgaaccaact
tccaaacaat cccttgtgga aggatagtaa tgacttcagg 5640gcattgatca cacatatccg
acggtggaac tactgtaaca accctctttt ctgtggaata 5700tagttgaaac tctacaactt
gaccaaaacc aagatgacga catatggtgg aactaacaaa 5760acaagaggac tacactacct
cattagctta ttaagcacaa tctcttggca ccacaacaac 5820gaacaacaaa accatcattt
ggatgctctg tgggcgacta aatgcaaatt ctttgcatgg 5880ttgatcatcc caaattggtg
gcacttagct ataggctagc agtgagagga tggccgaaca 5940acatgcattg tccactatgt
tggtgtagcc atgagaccaa ccaccacata aatgccaaac 6000gttcattcac caaaaaaatc
taggcaacaa tggcttggat ttcttacctg cagctccacc 6060aagctaactg gagttcaatt
aggtcaacgt atgggtggtg gtcgagtata gcagtcacaa 6120atgatgttct aaagatgggg
ttgtgttaac acatcttgct tgtagcacga gaacactgga 6180aggagtgaaa ccaaagaatc
tttcaacaca aggacctatc aacgctatcc atgattggga 6240aattcaagga cgaaactaga
atttgggtga acacatgcac aaggcaccta ggagagcctt 6300tcttttgtac tgttaatccc
tttttaaact ctctctgtcc ttaggagttc gtttcttccg 6360ctctattcaa tgaagttagg
cacaatcttg tgtgatttca ttagaaaaac acaagtaaat 6420tgcatggtca gtacttgaag
tattacagga atctcgtctg cccccaaact attaaacctt 6480atatttggct ccctaatgta
cttaactgat ctcattctgg tcaaactaaa catggtgatg 6540gcaaggagcc gatatggtcg
cccatgtgga tgtgatttaa gcaaaaaatc tcatggtcca 6600tagctgtgtc aacaagccaa
catgccatcg cttccttatg ccgagactgc ccatgtcgct 6660cgcttttact gtcatcatca
tcaaactgcc tgtcatgtct acggatgcca tgaccgctgt 6720cacacatgat gtggagatga
acctgtccat caacttccac gtgctgccac tatcgctagc 6780tgacaccgtc ttggtcattg
ctgtgtaggg ctaggctaag agtcgctgaa tgatcctttc 6840gctctccttt acaggaacat
gctgtttact ttgtgtcgcc aaggcgtgct agagtacctc 6900ttctacacct ccagcaccag
tagccttatt gttagcttgc acatcccaca taagcaggcc 6960gatgtgaatg ataacttcag
ggacgtcgac ggcatgtcac tgccaagagt catttggtgg 7020gaagcgttgt catgccatct
gtcgtgccat tttgtcctca gttcgaccgc cattaccgtg 7080agcacaacct ttgcgcatgg
ttggccgctt ccatcaccct tattccgttt cctcgtgttg 7140gtcttgcccc aaggctatgg
ttagcagacc gtgcatatgg ccggcaaaag actattttgc 7200actgtagatt gcactcttta
tatagtgaag tttaaaatag gagatgagat gaataaggct 7260gctggagata gcctaaaccc
ttgcagctcg tgcttgcatc gggggagcca aaaggcgtcc 7320acctccacca tcgccgaagc
actgagcact actctggctt gtgtttcagc accacaccgc 7380agagtgctta gggccaccaa
cctcctcttg cctctgtgcc cagagcacca tcagctctgc 7440tgcctccctc tgttccttgt
gcttgctagg caggcaattc cgagctgggg cccaacttgt 7500aacgctgatt tcaccatctt
gccactgccg ggcaccaagt ggacacattt gacttggcct 7560agtgggtttt ctgcataaat
cacatacatg tggatgccat atcaggctct ttggtgttgt 7620cgtgtctact ttcgacaagg
atgagatcac ttaaacatat tagggagcca agtatgtaat 7680ttcatagttt agggacctac
acaaaaatcg tataatactt tagaacagcc gtgcagttta 7740ctcaatcaac acatacaaag
tcagatctta agctctgata cttcaaagga atggttgagc 7800ccagttgaca aacaatcttg
cttcattcat tgaattgttt ataggagtgg ctatgtaact 7860actgggtggt tttgtttgac
ctgtcatcca aattgtgtag tcaaccataa acatacacgt 7920cacacaatac attttggatg
tgacagatag gatttaggcg agagaatgta caatgtcact 7980gaaaaattac cactgtatgg
aaaggacaat ctaagtgaaa agagaaccag ggcctaatgg 8040tttcaggact tcaaactccg
gccaaatgaa tttacagtgc ttaaattaac tcatgttaat 8100catgatagcc aaagcatggg
caaaagagaa actatgaata aatcgacaat gtattctata 8160tagcagtaat ataccatgtc
acgagctttt acactaatgg gctgtatttt tctgcagtta 8220ttttaactgg caatattcta
tgtcacagta atatttgtta aattttttcc agaatagcaa 8280ctgaactaga agtctagtat
ttcttaattg gataacaaaa ggaattagtg tgcatttggc 8340ttacgaacaa tcagtcaccc
aacattgaat ttgaagttct gtttcctctt tgttcagacg 8400acactctcca aatgaatgcc
ttatattttg tgttgctcct cttttctgca gagtgttcag 8460taacttcttc cgatgtaaac
catggtacgt cctgtagaaa ccccaacccg tgaaatcaaa 8520aaactcgacg gcctgtgggc
attcagtctg gatcgcgaaa actgtggaat tgatcagcgt 8580tggtgggaaa gcgcgttaca
agaaagccgg gcaattgctg tgccaggcag ttttaacgat 8640cagttcgccg atgcagatat
tcgtaattat gcgggcaacg tctggtatca gcgcgaagtc 8700tttataccga aaggttgggc
aggccagcgt atcgtgctgc gtttcgatgc ggtcactcat 8760tacggcaaag tgtgggtcaa
taatcaggaa gtgatggagc atcagggcgg ctatacgcca 8820tttgaagccg atgtcacgcc
gtatgttatt gccgggaaaa gtgtacgtat caccgtttgt 8880gtgaacaacg aactgaactg
gcagactatc ccgccgggaa tggtgattac cgacgaaaac 8940ggcaagaaaa agcagtctta
cttccatgat ttctttaact atgccggaat ccatcgcagc 9000gtaatgctct acaccacgcc
gaacacctgg gtggacgata tcaccgtggt gacgcatgtc 9060gcgcaagact gtaaccacgc
gtctgttgac tggcaggtac caagctgcga atcttcgttt 9120ttttaaggaa ttctcgatct
ttatggtgta taggctctgg gttttctgtt ttttgtatct 9180cttaggattt tgtaaattcc
agatctttct atggccactt agtagtatat ttcaaaaatt 9240ctccaatcga gttcttcatt
cgcattttca gtcattttct cttcgacgtt gtttttaagc 9300ctgggtatta ctcctattta
gttgaactct gcagcaatct tagaaaatta gggttttgag 9360gtttcgattt ctctaggtaa
ccgatctatt gcattcatct gaatttctgc atatatgtct 9420tagatttctg ataagcttac
gatacgttag gtgtaattga agtttatttt tcaagagtgt 9480tattttttgt ttctgaattt
ttcaggtggt ggccaatggt gatgtcagcg ttgaactgcg 9540tgatgcggat caacaggtgg
ttgcaactgg acaaggcact agcgggactt tgcaagtggt 9600gaatccgcac ctctggcaac
cgggtgaagg ttatctctat gaactgtgcg tcacagccaa 9660aagccagaca gagtgtgata
tctacccgct tcgcgtcggc atccggtcag tggcagtgaa 9720gggcgaacag ttcctgatta
accacaaacc gttctacttt actggctttg gtcgtcatga 9780agatgcggac ttgcgtggca
aaggattcga taacgtgctg atggtgcacg accacgcatt 9840aatggactgg attggggcca
actcctaccg tacctcgcat tacccttacg ctgaagagat 9900gctcgactgg gcagatgaac
atggcatcgt ggtgattgat gaaactgctg ctgtcggctt 9960taacctctct ttaggcattg
gtttcgaagc gggcaacaag ccgaaagaac tgtacagcga 10020agaggcagtc aacggggaaa
ctcagcaagc gcacttacag gcgattaaag agctgatagc 10080gcgtgacaaa aaccacccaa
gcgtggtgat gtggagtatt gccaacgaac cggatacccg 10140tccgcaaggt gcacgggaat
atttcgcgcc actggcggaa gcaacgcgta aactcgaccc 10200gacgcgtccg atcacctgcg
tcaatgtaat gttctgcgac gctcacaccg ataccatcag 10260cgatctcttt gatgtgctgt
gcctgaaccg ttattacgga tggtatgtcc aaagcggcga 10320tttggaaacg gcagagaagg
tactggaaaa agaacttctg gcctggcagg agaaactgca 10380tcagccgatt atcatcaccg
aatacggcgt ggatacgtta gccgggctgc actcaatgta 10440caccgacatg tggagtgaag
agtatcagtg tgcatggctg gatatgtatc accgcgtctt 10500tgatcgcgtc agcgccgtcg
tcggtgaaca ggtatggaat ttcgccgatt ttgcgacctc 10560gcaaggcata ttgcgcgttg
gcggtaacaa gaaagggatc ttcactcgcg accgcaaacc 10620gaagtcggcg gcttttctgc
tgcaaaaacg ctggactggc atgaacttcg gtgaaaaacc 10680gcagcaggga ggcaaacaat
gagagctcga ggtacaaatc tcatctgtgc cttgctctag 10740tttcccaaat ggaattaact
atgcatgatt tgtttggaaa ctcttattgc atccatccag 10800ataatgcatc caccataagg
taatatcttg atgacatctg tgcctgatgg tgtaccaaat 10860gtctctatct ctgcattgag
ccacgagtag gaggatagcc taggggtgcc ttgactccaa 10920agttgtattg aaaaagatgg
atgaagcagg caaatgctgc ctgaatccat gactcagggc 10980acagattttc cactcaaagg
aagataagat tgcattactt catgatcttt tgaactgcct 11040ctgcaagacg ggactcggat
agtggatgca aagatctaat actggcctca ggcaacgagt 11100tgtttcactc gaaagtctag
aaatgaccgg gctcaaattt tgcaccccaa ggaaagtgag 11160tttgcattac ttcatgacct
tttgaactgc ctctgcaaga ctggactcag attacgcttg 11220attggttgcc ggcctcacct
tcgcctggct tgcgcgagcc tgcgtctata gaaatgcgcc 11280ggactcacgt ctccgtcgat
gcaggcattc gactgaaaaa acatttaaac tgcacccatg 11340cgtgcgggct gagcttatgt
catacaagta accaatcaca ggcttaagtt cagtcaacgc 11400atgcgctaag cttggatgtg
gctgaccggg caaccaatca cacagatagt ggatgcacgg 11460atctaatatt ggctaatttg
gttaaacttg tctaacctta gacgtggcaa gtgagtcagc 11520ggatcaaatc tgctctaaaa
ttgtctgcct cctagatgtc cttggtgttc caagatttaa 11580tcatcactgc actatttctt
tgcgttgctt cgctgcagct tcgcgttact tgcattcgct 11640taatcaggat tactttgatc
aactaggttt ctaacttcta ctaccttcac ttgcacaggg 11700tgcccgtcct gctagccggt
gtgcttgctg tgcgatcgtt tggcatgtgc ttgttgaggg 11760gttgctaggg gattggagag
gattgaaggg attaaatctc ctcctattca attttgaata 11820ggaggggatt taatcccctt
caatccccct caaaccacta gtaaccgaac gtggcctgag 11880ggggcgggcg agtctttata
ttgaatgaaa ctacataaaa tagcatgccg tctctgtcac 11940tggcaatgga cggtggtgcc
tagcgcaact cagcgcacaa ctgtgtgtct tgatttttct 12000tctgtttatc acggcattag
tgccatgccg ttttatgtta cagtgttgtg tgctcgcaag 12060catccgaaaa tatgcgtctg
agtttagggt tgggtcaaac ttgtcgaatt tggggttctg 12120ttataatatg ttgagcatga
ataaagatgg atgctggtga ctctgtcgcc atcgccgtcc 12180atcatgagtg tcctgtaatt
caacttatat ctatcatgta tgtatgtatg tatgtatgta 12240tgtatgtata tgctgtctac
tatgcttctt tgttttaact gaaatgtgtg ttacagtgtt 12300acttctctgg ggtccattta
aaacggcatt tcgtttacga taggaaccag ccattataat 12360ctttaaccaa taatttcgct
aaccaatttc aactattgca atgcgaactt aatattatca 12420gatttataac cgaatgcgct
atcaaataat cataaggttg taatcataat aatataatat 12480aaaataaatg agtgctcgaa
gtgaaatttt agagagcgtt ataagaaaaa ttgatgtgat 12540ctccaagaat aatagcccct
cccggctccc ggtacaaaca tagggcttct ttagaatgca 12600ggattgtgag aacataggaa
taggaaaaat ataggaattc tataggaatg tatatggaaa 12660acagaggatt gaaaaacaca
gaaaaaatgt gaaagcaagt ctttggatga agcgtaggaa 12720acttatagga ataggaattc
ataacggacc gcgatcgctt aattaagctt gcatgcctgc 12780agtgcagcgt gacccggtcg
tgcccctctc tagagataat gagcattgca tgtctaagtt 12840ataaaaaatt accacatatt
ttttttgtca cacttgtttg aagtgcagtt tatctatctt 12900tatacatata tttaaacttt
actctacgaa taatataatc tatagtacta caataatatc 12960agtgttttag agaatcatat
aaatgaacag ttagacatgg tctaaaggac aattgagtat 13020tttgacaaca ggactctaca
gttttatctt tttagtgtgc atgtgttctc cttttttttt 13080gcaaatagct tcacctatat
aatacttcat ccattttatt agtacatcca tttagggttt 13140agggttaatg gtttttatag
actaattttt ttagtacatc tattttattc tattttagcc 13200tctaaattaa gaaaactaaa
actctatttt agttttttta tttaataatt tagatataaa 13260atagaataaa ataaagtgac
taaaaattaa acaaataccc tttaagaaat taaaaaaact 13320aaggaaacat ttttcttgtt
tcgagtagat aatgccagcc tgttaaacgc cgtcgacgag 13380tctaacggac accaaccagc
gaaccagcag cgtcgcgtcg ggccaagcga agcagacggc 13440acggcatctc tgtcgctgcc
tctggacccc tctcgagagt tccgctccac cgttggactt 13500gctccgctgt cggcatccag
aaattgcgtg gcggagcggc agacgtgagc cggcacggca 13560ggcggcctcc tcctcctctc
acggcaccgg cagctacggg ggattccttt cccaccgctc 13620cttcgctttc ccttcctcgc
ccgccgtaat aaatagacac cccctccaca ccctctttcc 13680ccaacctcgt gttgttcgga
gcgcacacac acacaaccag atctccccca aatccacccg 13740tcggcacctc cgcttcaagg
tacgccgctc gtcctccccc cccccccctc tctaccttct 13800ctagatcggc gttccggtcc
atggttaggg cccggtagtt ctacttctgt tcatgtttgt 13860gttagatccg tgtttgtgtt
agatccgtgc tgctagcgtt cgtacacgga tgcgacctgt 13920acgtcagaca cgttctgatt
gctaacttgc cagtgtttct ctttggggaa tcctgggatg 13980gctctagccg ttccgcagac
gggatcgatt tcatgatttt ttttgtttcg ttgcataggg 14040tttggtttgc ccttttcctt
tatttcaata tatgccgtgc acttgtttgt cgggtcatct 14100tttcatgctt ttttttgtct
tggttgtgat gatgtggtct ggttgggcgg tcgttctaga 14160tcggagtaga attctgtttc
aaactacctg gtggatttat taattttgga tctgtatgtg 14220tgtgccatac atattcatag
ttacgaattg aagatgatgg atggaaatat cgatctagga 14280taggtataca tgttgatgcg
ggttttactg atgcatatac agagatgctt tttgttcgct 14340tggttgtgat gatgtggtgt
ggttgggcgg tcgttcattc gttctagatc ggagtagaat 14400actgtttcaa actacctggt
gtatttatta attttggaac tgtatgtgtg tgtcatacat 14460cttcatagtt acgagtttaa
gatggatgga aatatcgatc taggataggt atacatgttg 14520atgtgggttt tactgatgca
tatacatgat ggcatatgca gcatctattc atatgctcta 14580accttgagta cctatctatt
ataataaaca agtatgtttt ataattattt tgatcttgat 14640atacttggat gatggcatat
gcagcagcta tatgtggatt tttttagccc tgccttcata 14700cgctatttat ttgcttggta
ctgtttcttt tgtcgatgct caccctgttg tttggtgtta 14760cttctgcagg gatccccgat
catgcaaaaa ctcattaact cagtgcaaaa ctatgcctgg 14820ggcagcaaaa cggcgttgac
tgaactttat ggtatggaaa atccgtccag ccagccgatg 14880gccgagctgt ggatgggcgc
acatccgaaa agcagttcac gagtgcagaa tgccgccgga 14940gatatcgttt cactgcgtga
tgtgattgag agtgataaat cgactctgct cggagaggcc 15000gttgccaaac gctttggcga
actgcctttc ctgttcaaag tattatgcgc agcacagcca 15060ctctccattc aggttcatcc
aaacaaacac aattctgaaa tcggttttgc caaagaaaat 15120gccgcaggta tcccgatgga
tgccgccgag cgtaactata aagatcctaa ccacaagccg 15180gagctggttt ttgcgctgac
gcctttcctt gcgatgaacg cgtttcgtga attttccgag 15240attgtctccc tactccagcc
ggtcgcaggt gcacatccgg cgattgctca ctttttacaa 15300cagcctgatg ccgaacgttt
aagcgaactg ttcgccagcc tgttgaatat gcagggtgaa 15360gaaaaatccc gcgcgctggc
gattttaaaa tcggccctcg atagccagca gggtgaaccg 15420tggcaaacga ttcgtttaat
ttctgaattt tacccggaag acagcggtct gttctccccg 15480ctattgctga atgtggtgaa
attgaaccct ggcgaagcga tgttcctgtt cgctgaaaca 15540ccgcacgctt acctgcaagg
cgtggcgctg gaagtgatgg caaactccga taacgtgctg 15600cgtgcgggtc tgacgcctaa
atacattgat attccggaac tggttgccaa tgtgaaattc 15660gaagccaaac cggctaacca
gttgttgacc cagccggtga aacaaggtgc agaactggac 15720ttcccgattc cagtggatga
ttttgccttc tcgctgcatg accttagtga taaagaaacc 15780accattagcc agcagagtgc
cgccattttg ttctgcgtcg aaggcgatgc aacgttgtgg 15840aaaggttctc agcagttaca
gcttaaaccg ggtgaatcag cgtttattgc cgccaacgaa 15900tcaccggtga ctgtcaaagg
ccacggccgt ttagcgcgtg tttacaacaa gctgtaagag 15960cttactgaaa aaattaacat
ctcttgctaa gctgggagct cgatccgtcg acctgcagat 16020cgttcaaaca tttggcaata
aagtttctta agattgaatc ctgttgccgg tcttgcgatg 16080attatcatat aatttctgtt
gaattacgtt aagcatgtaa taattaacat gtaatgcatg 16140acgttattta tgagatgggt
ttttatgatt agagtcccgc aattatacat ttaatacgcg 16200atagaaaaca aaatatagcg
cgcaaactag gataaattat cgcgcgcggt gtcatctatg 16260ttactagatc tgctagccct
gcaggaaatt taccggtgcc cgggcggcca gcatggccgt 16320atccgcaatg tgttattaag
ttgtctaagc gtcaatttgt ttacaccaca atatatcctg 16380ccaccagcca gccaacagct
ccccgaccgg cagctcggca caaaatcacc actcgataca 16440ggcagcccat cagaattaat
tctcatgttt gacagcttat catcgactgc acggtgcacc 16500aatgcttctg gcgtcaggca
gccatcggaa gctgtggtat ggctgtgcag gtcgtaaatc 16560actgcataat tcgtgtcgct
caaggcgcac tcccgttctg gataatgttt tttgcgccga 16620catcataacg gttctggcaa
atattctgaa atgagctgtt gacaattaat catccggctc 16680gtataatgtg tggaattgtg
agcggataac aatttcacac aggaaacaga ccatgaggga 16740agcgttgatc gccgaagtat
cgactcaact atcagaggta gttggcgtca tcgagcgcca 16800tctcgaaccg acgttgctgg
ccgtacattt gtacggctcc gcagtggatg gcggcctgaa 16860gccacacagt gatattgatt
tgctggttac ggtgaccgta aggcttgatg aaacaacgcg 16920gcgagctttg atcaacgacc
ttttggaaac ttcggcttcc cctggagaga gcgagattct 16980ccgcgctgta gaagtcacca
ttgttgtgca cgacgacatc attccgtggc gttatccagc 17040taagcgcgaa ctgcaatttg
gagaatggca gcgcaatgac attcttgcag gtatcttcga 17100gccagccacg atcgacattg
atctggctat cttgctgaca aaagcaagag aacatagcgt 17160tgccttggta ggtccagcgg
cggaggaact ctttgatccg gttcctgaac aggatctatt 17220tgaggcgcta aatgaaacct
taacgctatg gaactcgccg cccgactggg ctggcgatga 17280gcgaaatgta gtgcttacgt
tgtcccgcat ttggtacagc gcagtaaccg gcaaaatcgc 17340gccgaaggat gtcgctgccg
actgggcaat ggagcgcctg ccggcccagt atcagcccgt 17400catacttgaa gctaggcagg
cttatcttgg acaagaagat cgcttggcct cgcgcgcaga 17460tcagttggaa gaatttgttc
actacgtgaa aggcgagatc accaaagtag tcggcaaata 17520aagctctagt ggatctccgt
acccggggat ctggctcgcg gcggacgcac gacgccgggg 17580cgagaccata ggcgatctcc
taaatcaata gtagctgtaa cctcgaagcg tttcacttgt 17640aacaacgatt gagaattttt
gtcataaaat tgaaatactt ggttcgcatt tttgtcatcc 17700gcggtcagcc gcaattctga
cgaactgccc atttagctgg agatgattgt acatccttca 17760cgtgaaaatt tctcaagcgc
tgtgaacaag ggttcagatt ttagattgaa aggtgagccg 17820ttgaaacacg ttcttcttgt
cgatgacgac gtcgctatgc ggcatcttat tattgaatac 17880cttacgatcc acgccttcaa
agtgaccgcg gtagccgaca gcacccagtt cacaagagta 17940ctctcttccg cgacggtcga
tgtcgtggtt gttgatctag atttaggtcg tgaagatggg 18000ctcgagatcg ttcgtaatct
ggcggcaaag tctgatattc caatcataat tatcagtggc 18060gaccgccttg aggagacgga
taaagttgtt gcactcgagc taggagcaag tgattttatc 18120gctaagccgt tcagtatcag
agagtttcta gcacgcattc gggttgcctt gcgcgtgcgc 18180cccaacgttg tccgctccaa
agaccgacgg tctttttgtt ttactgactg gacacttaat 18240ctcaggcaac gtcgcttgat
gtccgaagct ggcggtgagg tgaaacttac ggcaggtgag 18300ttcaatcttc tcctcgcgtt
tttagagaaa ccccgcgacg ttctatcgcg cgagcaactt 18360ctcattgcca gtcgagtacg
cgacgaggag gtttatgaca ggagtataga tgttctcatt 18420ttgaggctgc gccgcaaact
tgaggcagat ccgtcaagcc ctcaactgat aaaaacagca 18480agaggtgccg gttatttctt
tgacgcggac gtgcaggttt cgcacggggg gacgatggca 18540gcctgagcca attcccagat
ccccgaggaa tcggcgtgag cggtcgcaaa ccatccggcc 18600cggtacaaat cggcgcggcg
ctgggtgatg acctggtgga gaagttgaag gccgcgcagg 18660ccgcccagcg gcaacgcatc
gaggcagaag cacgccccgg tgaatcgtgg caagcggccg 18720ctgatcgaat ccgcaaagaa
tcccggcaac cgccggcagc cggtgcgccg tcgattagga 18780agccgcccaa gggcgacgag
caaccagatt ttttcgttcc gatgctctat gacgtgggca 18840cccgcgatag tcgcagcatc
atggacgtgg ccgttttccg tctgtcgaag cgtgaccgac 18900gagctggcga ggtgatccgc
tacgagcttc cagacgggca cgtagaggtt tccgcagggc 18960cggccggcat ggccagtgtg
tgggattacg acctggtact gatggcggtt tcccatctaa 19020ccgaatccat gaaccgatac
cgggaaggga agggagacaa gcccggccgc gtgttccgtc 19080cacacgttgc ggacgtactc
aagttctgcc ggcgagccga tggcggaaag cagaaagacg 19140acctggtaga aacctgcatt
cggttaaaca ccacgcacgt tgccatgcag cgtacgaaga 19200aggccaagaa cggccgcctg
gtgacggtat ccgagggtga agccttgatt agccgctaca 19260agatcgtaaa gagcgaaacc
gggcggccgg agtacatcga gatcgagcta gctgattgga 19320tgtaccgcga gatcacagaa
ggcaagaacc cggacgtgct gacggttcac cccgattact 19380ttttgatcga tcccggcatc
ggccgttttc tctaccgcct ggcacgccgc gccgcaggca 19440aggcagaagc cagatggttg
ttcaagacga tctacgaacg cagtggcagc gccggagagt 19500tcaagaagtt ctgtttcacc
gtgcgcaagc tgatcgggtc aaatgacctg ccggagtacg 19560atttgaagga ggaggcgggg
caggctggcc cgatcctagt catgcgctac cgcaacctga 19620tcgagggcga agcatccgcc
ggttcctaat gtacggagca gatgctaggg caaattgccc 19680tagcagggga aaaaggtcga
aaaggtctct ttcctgtgga tagcacgtac attgggaacc 19740caaagccgta cattgggaac
cggaacccgt acattgggaa cccaaagccg tacattggga 19800accggtcaca catgtaagtg
actgatataa aagagaaaaa aggcgatttt tccgcctaaa 19860actctttaaa acttattaaa
actcttaaaa cccgcctggc ctgtgcataa ctgtctggcc 19920agcgcacagc cgaagagctg
caaaaagcgc ctacccttcg gtcgctgcgc tccctacgcc 19980ccgccgcttc gcgtcggcct
atcgcggccg ctggccgctc aaaaatggct ggcctacggc 20040caggcaatct accagggcgc
ggacaagccg cgccgtcgcc actcgaccgc cggcgctgag 20100gtctgcctcg tgaagaaggt
gttgctgact cataccaggc ctgaatcgcc ccatcatcca 20160gccagaaagt gagggagcca
cggttgatga gagctttgtt gtaggtggac cagttggtga 20220ttttgaactt ttgctttgcc
acggaacggt ctgcgttgtc gggaagatgc gtgatctgat 20280ccttcaactc agcaaaagtt
cgatttattc aacaaagccg ccgtcccgtc aagtcagcgt 20340aatgctctgc cagtgttaca
accaattaac caattctgat tagaaaaact catcgagcat 20400caaatgaaac tgcaatttat
tcatatcagg attatcaata ccatattttt gaaaaagccg 20460tttctgtaat gaaggagaaa
actcaccgag gcagttccat aggatggcaa gatcctggta 20520tcggtctgcg attccgactc
gtccaacatc aatacaacct attaatttcc cctcgtcaaa 20580aataaggtta tcaagtgaga
aatcaccatg agtgacgact gaatccggtg agaatggcaa 20640aagctctgca ttaatgaatc
ggccaacgcg cggggagagg cggtttgcgt attgggcgct 20700cttccgcttc ctcgctcact
gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat 20760cagctcactc aaaggcggta
atacggttat ccacagaatc aggggataac gcaggaaaga 20820acatgtgagc aaaaggccag
caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt 20880ttttccatag gctccgcccc
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt 20940ggcgaaaccc gacaggacta
taaagatacc aggcgtttcc ccctggaagc tccctcgtgc 21000gctctcctgt tccgaccctg
ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa 21060gcgtggcgct ttctcatagc
tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct 21120ccaagctggg ctgtgtgcac
gaaccccccg ttcagcccga ccgctgcgcc ttatccggta 21180actatcgtct tgagtccaac
ccggtaagac acgacttatc gccactggca gcagccactg 21240gtaacaggat tagcagagcg
aggtatgtag gcggtgctac agagttcttg aagtggtggc 21300ctaactacgg ctacactaga
agaacagtat ttggtatctg cgctctgctg aagccagtta 21360ccttcggaaa aagagttggt
agctcttgat ccggcaaaca aaccaccgct ggtagcggtg 21420gtttttttgt ttgcaagcag
cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 21480tgatcttttc tacggggtct
gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg 21540tcatgagatt atcaaaaagg
atcttcacct agatcctttt gatccggaat taa 215938215097DNAArtificial
sequenceZmABT-990-binary 82attcctgtgg ttggcatgca catacaaatg gacgaacgga
taaacctttt cacgcccttt 60taaatatccg attattctaa taaacgctct tttctcttag
gtttacccgc caatatatcc 120tgtcaaacac tgatagttta aactgaaggc gggaaacgac
aatctgatca tgagcggaga 180attaagggag tcacgttatg acccccgccg atgacgcggg
acaagccgtt ttacgtttgg 240aactgacaga accgcaacgc tgcaggaatt ggccgcagcg
gccatttaaa tcaattgggc 300gcgccagctg cttgtgggga ccagacaaaa aaggaatggt
gcagaattgt taggcgcacc 360taccaaaagc atctttgcct ttattgcaaa gataaagcag
attcctctag tacaagtggg 420gaacaaaata acgtggaaaa gagctgtcct gacagcccac
tcactaatgc gtatgacgaa 480cgcagtgacg accacaaaac tcgagacttt tcaacaaagg
gtaatatccg gaaacctcct 540cggattccat tgcccagcta tctgtcactt tattgtgaag
atagtggaaa aggaaggtgg 600ctcctacaaa tgccatcatt gcgataaagg aaaggctatc
gttgaagatg cctctgccga 660cagtggtccc aaagatggac ccccacccac gaggagcatc
gtggaaaaag aagacgttcc 720aaccacgtct tcaaagcaag tggattgatg tgatatctcc
actgacgtaa gggatgacga 780acaatcccac tatccttcgg taccggaccc ggtctgagtt
gttaggtgaa ttttactact 840atccagcgac aactaaaaaa gaaacagagt gagtactaag
gaagactata tattttgtat 900attaacgaga agagatagtt agttacagca catccattgg
agcgccggcc aaagcagata 960tatagtgtcg ttacgtttgt aatcatagtt ctggtttttc
tactatgtat aattaaacat 1020aatgcaacct tcttaagacg gatgtatcaa ttcgatgggc
tcattccctt ctttttttta 1080tttatcgcaa tttagtttaa aaaagatcta gcggacgata
aatatttaag aatgaagata 1140gtaattatct tcagtcaata caatagtttc tcaacaatat
ataatatata tttgcgcgcc 1200tgtggggtgt gtgtttttac aacacaaaca accgacaggg
aattctaacg caaatgcttc 1260cgtttgtact tgattatcaa gacataaaga cgaagatggt
tacgttacga tgcttctagt 1320tggcatctgc acataacatg catgcatgcg ccgggtttaa
tgcataatgc tgtgtacata 1380cattatttgc agcacacacg cgtattgctc atgtgacgtg
ccgcctgtct gtctatcctt 1440gaccggcact tggtaccaac cattatgttc gttgtattgc
gagctagcta gctgcctgta 1500ctatataact gcagaaaggt acactacaga atgcagatgc
tgcgccactg gttcgcatac 1560actattctat tccactggcc acctataaac atatgcatga
caattgacaa acaagctagc 1620gtctctagaa agttggtgcc ggccatagca attattcccg
actggagtga agaaaagaaa 1680ctaccatttc catgtgggtt tcctttgcat atcatagaat
caagatgtaa atatctatga 1740gataccatta tagaattttg ctgacgtggc tgcattgtat
gatatagtgt tgcggacagc 1800ctcagcagcc agctggagct gacaggggag ttcaaaagaa
acacacgtac accaaccagc 1860tagtatctcc tcaacgacat cggctaaatt atcttgtcgg
tatgcatact tttcttcgcg 1920cgcggggggc ctttcattag atgcttgcac ataaaactgc
gctagctgat gctgaatctc 1980agcctaacat atatactcct atatatatat attctcttgt
attttatgcc aattaatgta 2040acgcaattca gatgtgctgg ctggtcaaca cactgtgtgc
atatgctggc tttcggagac 2100taaacctgga ccaagtttgg cgcccgattt ggatggtttc
tggtccccta gcggcatgca 2160ggcatcagtg ggccctataa atatgcatgg agtagagcaa
cctctatgca caccacacaa 2220cacaacacaa taatacagca aaggaggcta gcagaagtgc
aggattaata agctaagcta 2280gtagaaatta agcaaagcat aggcacagcc ttggctacct
cctctggttc ttgccttatt 2340attagcctgt tggtggtggt ggtggcggcg gcgctgtcgg
cctcaacggc gtcggcacag 2400ctgtcgtcga cgttctacga cacgtcgtgc cccagcgcgt
tgtccaccat cagcagcggc 2460gtgaactccg ccgtggcgca gcaggctcgt gtgggggcgt
cgctgctccg gctccacttc 2520cacgactgct tcgtccaagc aagtctagct gtctcagatg
catctatcta tctacttata 2580tataagcatg atttcctttc tagctagcta gcatcgtcgt
gcattttaat ttgaagataa 2640aagattagca cgtcgtatat gcatgcgatt aattaaccag
gaggcatcaa ggtgaaattt 2700ctggtggtcc accagggctg cgacgcgtcc attctgctga
acgacacgtc cggggagcag 2760acccagccgc cgaacctaac tctgaacccg agggccttcg
acgtcgtcaa cagcatcaag 2820gcgcaggtgg aggcggcgtg cgcgggcgtc gtctcctgcg
ccgacatcct cgccgtcgcc 2880gcccgcgacg gagttgacgc ggtacgtagc tacatcaccg
tgcctattaa tttgctggct 2940agtagcttgt tggtttgcaa actaactaac taattccgat
cgtatgcgtg gtgcatatgc 3000agctcggcgg gccttcgtaa accatggtac gtcctgtaga
aaccccaacc cgtgaaatca 3060aaaaactcga cggcctgtgg gcattcagtc tggatcgcga
aaactgtgga attgatcagc 3120gttggtggga aagcgcgtta caagaaagcc gggcaattgc
tgtgccaggc agttttaacg 3180atcagttcgc cgatgcagat attcgtaatt atgcgggcaa
cgtctggtat cagcgcgaag 3240tctttatacc gaaaggttgg gcaggccagc gtatcgtgct
gcgtttcgat gcggtcactc 3300attacggcaa agtgtgggtc aataatcagg aagtgatgga
gcatcagggc ggctatacgc 3360catttgaagc cgatgtcacg ccgtatgtta ttgccgggaa
aagtgtacgt atcaccgttt 3420gtgtgaacaa cgaactgaac tggcagacta tcccgccggg
aatggtgatt accgacgaaa 3480acggcaagaa aaagcagtct tacttccatg atttctttaa
ctatgccgga atccatcgca 3540gcgtaatgct ctacaccacg ccgaacacct gggtggacga
tatcaccgtg gtgacgcatg 3600tcgcgcaaga ctgtaaccac gcgtctgttg actggcaggt
accaagctgc gaatcttcgt 3660ttttttaagg aattctcgat ctttatggtg tataggctct
gggttttctg ttttttgtat 3720ctcttaggat tttgtaaatt ccagatcttt ctatggccac
ttagtagtat atttcaaaaa 3780ttctccaatc gagttcttca ttcgcatttt cagtcatttt
ctcttcgacg ttgtttttaa 3840gcctgggtat tactcctatt tagttgaact ctgcagcaat
cttagaaaat tagggttttg 3900aggtttcgat ttctctaggt aaccgatcta ttgcattcat
ctgaatttct gcatatatgt 3960cttagatttc tgataagctt acgatacgtt aggtgtaatt
gaagtttatt tttcaagagt 4020gttatttttt gtttctgaat ttttcaggtg gtggccaatg
gtgatgtcag cgttgaactg 4080cgtgatgcgg atcaacaggt ggttgcaact ggacaaggca
ctagcgggac tttgcaagtg 4140gtgaatccgc acctctggca accgggtgaa ggttatctct
atgaactgtg cgtcacagcc 4200aaaagccaga cagagtgtga tatctacccg cttcgcgtcg
gcatccggtc agtggcagtg 4260aagggcgaac agttcctgat taaccacaaa ccgttctact
ttactggctt tggtcgtcat 4320gaagatgcgg acttgcgtgg caaaggattc gataacgtgc
tgatggtgca cgaccacgca 4380ttaatggact ggattggggc caactcctac cgtacctcgc
attaccctta cgctgaagag 4440atgctcgact gggcagatga acatggcatc gtggtgattg
atgaaactgc tgctgtcggc 4500tttaacctct ctttaggcat tggtttcgaa gcgggcaaca
agccgaaaga actgtacagc 4560gaagaggcag tcaacgggga aactcagcaa gcgcacttac
aggcgattaa agagctgata 4620gcgcgtgaca aaaaccaccc aagcgtggtg atgtggagta
ttgccaacga accggatacc 4680cgtccgcaag gtgcacggga atatttcgcg ccactggcgg
aagcaacgcg taaactcgac 4740ccgacgcgtc cgatcacctg cgtcaatgta atgttctgcg
acgctcacac cgataccatc 4800agcgatctct ttgatgtgct gtgcctgaac cgttattacg
gatggtatgt ccaaagcggc 4860gatttggaaa cggcagagaa ggtactggaa aaagaacttc
tggcctggca ggagaaactg 4920catcagccga ttatcatcac cgaatacggc gtggatacgt
tagccgggct gcactcaatg 4980tacaccgaca tgtggagtga agagtatcag tgtgcatggc
tggatatgta tcaccgcgtc 5040tttgatcgcg tcagcgccgt cgtcggtgaa caggtatgga
atttcgccga ttttgcgacc 5100tcgcaaggca tattgcgcgt tggcggtaac aagaaaggga
tcttcactcg cgaccgcaaa 5160ccgaagtcgg cggcttttct gctgcaaaaa cgctggactg
gcatgaactt cggtgaaaaa 5220ccgcagcagg gaggcaaaca atgagagctc ccgcgtacag
caagaagaac ctcgacgcga 5280ccgacatggt cgctctctca ggcgctcaca caatcgggca
ggcccagtgc tccagcttca 5340acggccacat ctacaacgac acgaacatca acgcggcctt
cgcgacgtcg ctcaaggcca 5400actgccccat gtccggcggc agcagcctgg cgccgctgga
caccatgacc ccgaccgtgt 5460tcgacaacga ctactacaag aacctgctgt cgcagaaggg
gctgctgcac tcggaccagg 5520agctgttcaa caacggcagc accgacagca cggtcagcaa
ctttgcgtcc agctcggccg 5580ccttcaccag cgccttcacg gcggccttgg tgaagatggg
gaacctcggc ccgctcaccg 5640ggaccagtgg gcagatcagg ctcacctgct ggaagctcaa
ctcgtcctaa taattaagga 5700cggacgtccg atagacgatc ctgcgcaatc gtatcgtacg
tgcatgatac gcatacatct 5760ggaaactact ataccaatgc aaacagagat ctatacgtac
gagtatgtat aacgacgagt 5820gatgtttgta tggatctacg tatgtaacaa ggacctctcg
tagcgcaaag gcgcgcgttg 5880ggagattaat taggtacaca agctattacc acattatata
tcactctcat tgtggctaca 5940tatctatatc tctgaggcca aatgcttggg tgtccagtac
taattaataa taattcagtg 6000cgtatgcaag atttgtgggc aaatattggt ttacgatttc
ggaaaaaaca aatttcggcc 6060cccggcgaaa aacaagaaat ttccgaattt tcggaaattc
taggtcaaaa tcaaatagat 6120tcaatacttt ttaaaacaaa gaatgatata atttatatta
aaaataccaa ttttggaagc 6180atatattttt tcggacccca ccaaaatcaa ggcaatttcg
gaaattttcg tccgaaattg 6240taaaccctgc ggaccgcgat cgcttaatta agcttgcatg
cctgcagtgc agcgtgaccc 6300ggtcgtgccc ctctctagag ataatgagca ttgcatgtct
aagttataaa aaattaccac 6360atattttttt tgtcacactt gtttgaagtg cagtttatct
atctttatac atatatttaa 6420actttactct acgaataata taatctatag tactacaata
atatcagtgt tttagagaat 6480catataaatg aacagttaga catggtctaa aggacaattg
agtattttga caacaggact 6540ctacagtttt atctttttag tgtgcatgtg ttctcctttt
tttttgcaaa tagcttcacc 6600tatataatac ttcatccatt ttattagtac atccatttag
ggtttagggt taatggtttt 6660tatagactaa tttttttagt acatctattt tattctattt
tagcctctaa attaagaaaa 6720ctaaaactct attttagttt ttttatttaa taatttagat
ataaaataga ataaaataaa 6780gtgactaaaa attaaacaaa taccctttaa gaaattaaaa
aaactaagga aacatttttc 6840ttgtttcgag tagataatgc cagcctgtta aacgccgtcg
acgagtctaa cggacaccaa 6900ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag
acggcacggc atctctgtcg 6960ctgcctctgg acccctctcg agagttccgc tccaccgttg
gacttgctcc gctgtcggca 7020tccagaaatt gcgtggcgga gcggcagacg tgagccggca
cggcaggcgg cctcctcctc 7080ctctcacggc accggcagct acgggggatt cctttcccac
cgctccttcg ctttcccttc 7140ctcgcccgcc gtaataaata gacaccccct ccacaccctc
tttccccaac ctcgtgttgt 7200tcggagcgca cacacacaca accagatctc ccccaaatcc
acccgtcggc acctccgctt 7260caaggtacgc cgctcgtcct cccccccccc ccctctctac
cttctctaga tcggcgttcc 7320ggtccatggt tagggcccgg tagttctact tctgttcatg
tttgtgttag atccgtgttt 7380gtgttagatc cgtgctgcta gcgttcgtac acggatgcga
cctgtacgtc agacacgttc 7440tgattgctaa cttgccagtg tttctctttg gggaatcctg
ggatggctct agccgttccg 7500cagacgggat cgatttcatg attttttttg tttcgttgca
tagggtttgg tttgcccttt 7560tcctttattt caatatatgc cgtgcacttg tttgtcgggt
catcttttca tgcttttttt 7620tgtcttggtt gtgatgatgt ggtctggttg ggcggtcgtt
ctagatcgga gtagaattct 7680gtttcaaact acctggtgga tttattaatt ttggatctgt
atgtgtgtgc catacatatt 7740catagttacg aattgaagat gatggatgga aatatcgatc
taggataggt atacatgttg 7800atgcgggttt tactgatgca tatacagaga tgctttttgt
tcgcttggtt gtgatgatgt 7860ggtgtggttg ggcggtcgtt cattcgttct agatcggagt
agaatactgt ttcaaactac 7920ctggtgtatt tattaatttt ggaactgtat gtgtgtgtca
tacatcttca tagttacgag 7980tttaagatgg atggaaatat cgatctagga taggtataca
tgttgatgtg ggttttactg 8040atgcatatac atgatggcat atgcagcatc tattcatatg
ctctaacctt gagtacctat 8100ctattataat aaacaagtat gttttataat tattttgatc
ttgatatact tggatgatgg 8160catatgcagc agctatatgt ggattttttt agccctgcct
tcatacgcta tttatttgct 8220tggtactgtt tcttttgtcg atgctcaccc tgttgtttgg
tgttacttct gcagggatcc 8280ccgatcatgc aaaaactcat taactcagtg caaaactatg
cctggggcag caaaacggcg 8340ttgactgaac tttatggtat ggaaaatccg tccagccagc
cgatggccga gctgtggatg 8400ggcgcacatc cgaaaagcag ttcacgagtg cagaatgccg
ccggagatat cgtttcactg 8460cgtgatgtga ttgagagtga taaatcgact ctgctcggag
aggccgttgc caaacgcttt 8520ggcgaactgc ctttcctgtt caaagtatta tgcgcagcac
agccactctc cattcaggtt 8580catccaaaca aacacaattc tgaaatcggt tttgccaaag
aaaatgccgc aggtatcccg 8640atggatgccg ccgagcgtaa ctataaagat cctaaccaca
agccggagct ggtttttgcg 8700ctgacgcctt tccttgcgat gaacgcgttt cgtgaatttt
ccgagattgt ctccctactc 8760cagccggtcg caggtgcaca tccggcgatt gctcactttt
tacaacagcc tgatgccgaa 8820cgtttaagcg aactgttcgc cagcctgttg aatatgcagg
gtgaagaaaa atcccgcgcg 8880ctggcgattt taaaatcggc cctcgatagc cagcagggtg
aaccgtggca aacgattcgt 8940ttaatttctg aattttaccc ggaagacagc ggtctgttct
ccccgctatt gctgaatgtg 9000gtgaaattga accctggcga agcgatgttc ctgttcgctg
aaacaccgca cgcttacctg 9060caaggcgtgg cgctggaagt gatggcaaac tccgataacg
tgctgcgtgc gggtctgacg 9120cctaaataca ttgatattcc ggaactggtt gccaatgtga
aattcgaagc caaaccggct 9180aaccagttgt tgacccagcc ggtgaaacaa ggtgcagaac
tggacttccc gattccagtg 9240gatgattttg ccttctcgct gcatgacctt agtgataaag
aaaccaccat tagccagcag 9300agtgccgcca ttttgttctg cgtcgaaggc gatgcaacgt
tgtggaaagg ttctcagcag 9360ttacagctta aaccgggtga atcagcgttt attgccgcca
acgaatcacc ggtgactgtc 9420aaaggccacg gccgtttagc gcgtgtttac aacaagctgt
aagagcttac tgaaaaaatt 9480aacatctctt gctaagctgg gagctcgatc cgtcgacctg
cagatcgttc aaacatttgg 9540caataaagtt tcttaagatt gaatcctgtt gccggtcttg
cgatgattat catataattt 9600ctgttgaatt acgttaagca tgtaataatt aacatgtaat
gcatgacgtt atttatgaga 9660tgggttttta tgattagagt cccgcaatta tacatttaat
acgcgataga aaacaaaata 9720tagcgcgcaa actaggataa attatcgcgc gcggtgtcat
ctatgttact agatctgcta 9780gccctgcagg aaatttaccg gtgcccgggc ggccagcatg
gccgtatccg caatgtgtta 9840ttaagttgtc taagcgtcaa tttgtttaca ccacaatata
tcctgccacc agccagccaa 9900cagctccccg accggcagct cggcacaaaa tcaccactcg
atacaggcag cccatcagaa 9960ttaattctca tgtttgacag cttatcatcg actgcacggt
gcaccaatgc ttctggcgtc 10020aggcagccat cggaagctgt ggtatggctg tgcaggtcgt
aaatcactgc ataattcgtg 10080tcgctcaagg cgcactcccg ttctggataa tgttttttgc
gccgacatca taacggttct 10140ggcaaatatt ctgaaatgag ctgttgacaa ttaatcatcc
ggctcgtata atgtgtggaa 10200ttgtgagcgg ataacaattt cacacaggaa acagaccatg
agggaagcgt tgatcgccga 10260agtatcgact caactatcag aggtagttgg cgtcatcgag
cgccatctcg aaccgacgtt 10320gctggccgta catttgtacg gctccgcagt ggatggcggc
ctgaagccac acagtgatat 10380tgatttgctg gttacggtga ccgtaaggct tgatgaaaca
acgcggcgag ctttgatcaa 10440cgaccttttg gaaacttcgg cttcccctgg agagagcgag
attctccgcg ctgtagaagt 10500caccattgtt gtgcacgacg acatcattcc gtggcgttat
ccagctaagc gcgaactgca 10560atttggagaa tggcagcgca atgacattct tgcaggtatc
ttcgagccag ccacgatcga 10620cattgatctg gctatcttgc tgacaaaagc aagagaacat
agcgttgcct tggtaggtcc 10680agcggcggag gaactctttg atccggttcc tgaacaggat
ctatttgagg cgctaaatga 10740aaccttaacg ctatggaact cgccgcccga ctgggctggc
gatgagcgaa atgtagtgct 10800tacgttgtcc cgcatttggt acagcgcagt aaccggcaaa
atcgcgccga aggatgtcgc 10860tgccgactgg gcaatggagc gcctgccggc ccagtatcag
cccgtcatac ttgaagctag 10920gcaggcttat cttggacaag aagatcgctt ggcctcgcgc
gcagatcagt tggaagaatt 10980tgttcactac gtgaaaggcg agatcaccaa agtagtcggc
aaataaagct ctagtggatc 11040tccgtacccg gggatctggc tcgcggcgga cgcacgacgc
cggggcgaga ccataggcga 11100tctcctaaat caatagtagc tgtaacctcg aagcgtttca
cttgtaacaa cgattgagaa 11160tttttgtcat aaaattgaaa tacttggttc gcatttttgt
catccgcggt cagccgcaat 11220tctgacgaac tgcccattta gctggagatg attgtacatc
cttcacgtga aaatttctca 11280agcgctgtga acaagggttc agattttaga ttgaaaggtg
agccgttgaa acacgttctt 11340cttgtcgatg acgacgtcgc tatgcggcat cttattattg
aataccttac gatccacgcc 11400ttcaaagtga ccgcggtagc cgacagcacc cagttcacaa
gagtactctc ttccgcgacg 11460gtcgatgtcg tggttgttga tctagattta ggtcgtgaag
atgggctcga gatcgttcgt 11520aatctggcgg caaagtctga tattccaatc ataattatca
gtggcgaccg ccttgaggag 11580acggataaag ttgttgcact cgagctagga gcaagtgatt
ttatcgctaa gccgttcagt 11640atcagagagt ttctagcacg cattcgggtt gccttgcgcg
tgcgccccaa cgttgtccgc 11700tccaaagacc gacggtcttt ttgttttact gactggacac
ttaatctcag gcaacgtcgc 11760ttgatgtccg aagctggcgg tgaggtgaaa cttacggcag
gtgagttcaa tcttctcctc 11820gcgtttttag agaaaccccg cgacgttcta tcgcgcgagc
aacttctcat tgccagtcga 11880gtacgcgacg aggaggttta tgacaggagt atagatgttc
tcattttgag gctgcgccgc 11940aaacttgagg cagatccgtc aagccctcaa ctgataaaaa
cagcaagagg tgccggttat 12000ttctttgacg cggacgtgca ggtttcgcac ggggggacga
tggcagcctg agccaattcc 12060cagatccccg aggaatcggc gtgagcggtc gcaaaccatc
cggcccggta caaatcggcg 12120cggcgctggg tgatgacctg gtggagaagt tgaaggccgc
gcaggccgcc cagcggcaac 12180gcatcgaggc agaagcacgc cccggtgaat cgtggcaagc
ggccgctgat cgaatccgca 12240aagaatcccg gcaaccgccg gcagccggtg cgccgtcgat
taggaagccg cccaagggcg 12300acgagcaacc agattttttc gttccgatgc tctatgacgt
gggcacccgc gatagtcgca 12360gcatcatgga cgtggccgtt ttccgtctgt cgaagcgtga
ccgacgagct ggcgaggtga 12420tccgctacga gcttccagac gggcacgtag aggtttccgc
agggccggcc ggcatggcca 12480gtgtgtggga ttacgacctg gtactgatgg cggtttccca
tctaaccgaa tccatgaacc 12540gataccggga agggaaggga gacaagcccg gccgcgtgtt
ccgtccacac gttgcggacg 12600tactcaagtt ctgccggcga gccgatggcg gaaagcagaa
agacgacctg gtagaaacct 12660gcattcggtt aaacaccacg cacgttgcca tgcagcgtac
gaagaaggcc aagaacggcc 12720gcctggtgac ggtatccgag ggtgaagcct tgattagccg
ctacaagatc gtaaagagcg 12780aaaccgggcg gccggagtac atcgagatcg agctagctga
ttggatgtac cgcgagatca 12840cagaaggcaa gaacccggac gtgctgacgg ttcaccccga
ttactttttg atcgatcccg 12900gcatcggccg ttttctctac cgcctggcac gccgcgccgc
aggcaaggca gaagccagat 12960ggttgttcaa gacgatctac gaacgcagtg gcagcgccgg
agagttcaag aagttctgtt 13020tcaccgtgcg caagctgatc gggtcaaatg acctgccgga
gtacgatttg aaggaggagg 13080cggggcaggc tggcccgatc ctagtcatgc gctaccgcaa
cctgatcgag ggcgaagcat 13140ccgccggttc ctaatgtacg gagcagatgc tagggcaaat
tgccctagca ggggaaaaag 13200gtcgaaaagg tctctttcct gtggatagca cgtacattgg
gaacccaaag ccgtacattg 13260ggaaccggaa cccgtacatt gggaacccaa agccgtacat
tgggaaccgg tcacacatgt 13320aagtgactga tataaaagag aaaaaaggcg atttttccgc
ctaaaactct ttaaaactta 13380ttaaaactct taaaacccgc ctggcctgtg cataactgtc
tggccagcgc acagccgaag 13440agctgcaaaa agcgcctacc cttcggtcgc tgcgctccct
acgccccgcc gcttcgcgtc 13500ggcctatcgc ggccgctggc cgctcaaaaa tggctggcct
acggccaggc aatctaccag 13560ggcgcggaca agccgcgccg tcgccactcg accgccggcg
ctgaggtctg cctcgtgaag 13620aaggtgttgc tgactcatac caggcctgaa tcgccccatc
atccagccag aaagtgaggg 13680agccacggtt gatgagagct ttgttgtagg tggaccagtt
ggtgattttg aacttttgct 13740ttgccacgga acggtctgcg ttgtcgggaa gatgcgtgat
ctgatccttc aactcagcaa 13800aagttcgatt tattcaacaa agccgccgtc ccgtcaagtc
agcgtaatgc tctgccagtg 13860ttacaaccaa ttaaccaatt ctgattagaa aaactcatcg
agcatcaaat gaaactgcaa 13920tttattcata tcaggattat caataccata tttttgaaaa
agccgtttct gtaatgaagg 13980agaaaactca ccgaggcagt tccataggat ggcaagatcc
tggtatcggt ctgcgattcc 14040gactcgtcca acatcaatac aacctattaa tttcccctcg
tcaaaaataa ggttatcaag 14100tgagaaatca ccatgagtga cgactgaatc cggtgagaat
ggcaaaagct ctgcattaat 14160gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg
gcgctcttcc gcttcctcgc 14220tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc
ggtatcagct cactcaaagg 14280cggtaatacg gttatccaca gaatcagggg ataacgcagg
aaagaacatg tgagcaaaag 14340gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct
ggcgtttttc cataggctcc 14400gcccccctga cgagcatcac aaaaatcgac gctcaagtca
gaggtggcga aacccgacag 14460gactataaag ataccaggcg tttccccctg gaagctccct
cgtgcgctct cctgttccga 14520ccctgccgct taccggatac ctgtccgcct ttctcccttc
gggaagcgtg gcgctttctc 14580atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt
tcgctccaag ctgggctgtg 14640tgcacgaacc ccccgttcag cccgaccgct gcgccttatc
cggtaactat cgtcttgagt 14700ccaacccggt aagacacgac ttatcgccac tggcagcagc
cactggtaac aggattagca 14760gagcgaggta tgtaggcggt gctacagagt tcttgaagtg
gtggcctaac tacggctaca 14820ctagaagaac agtatttggt atctgcgctc tgctgaagcc
agttaccttc ggaaaaagag 14880ttggtagctc ttgatccggc aaacaaacca ccgctggtag
cggtggtttt tttgtttgca 14940agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga
tcctttgatc ttttctacgg 15000ggtctgacgc tcagtggaac gaaaactcac gttaagggat
tttggtcatg agattatcaa 15060aaaggatctt cacctagatc cttttgatcc ggaatta
150978335DNAArtificial sequence5' Bfr1 primer
83cctggtggag tgcttaagcg acgagttctg cctgg
358436DNAArtificial sequence3'Xba1 primer 84gggcttctcc tccaggaact
ctagattgcc caggcg 368537DNAArtificial
sequence5'Gfix primer 85catcggcaag tgccaccaca gccaccactt cagcctg
378630DNAArtificial sequence3'Gfix primer
86gctgtggtgg cacttgccga tggggctggg
308733DNAArtificial sequence5'1Ab5XbaI primer 87gcccgcctgg gcaatctaga
gttcctggag gag 338837DNAArtificial
sequence3'1Ab3d6 primer 88gcgagctcct agatgcggcc ctcgagttcc tcgaaga
378929DNAArtificial sequenceoligonucleotide cy2'
89ccctgtacgg cacgatgggc aacgctgca
299030DNAArtificial sequenceoligonucleotide cy1 90atatatccac catggacaac
aaccccaaca 309131DNAArtificial
sequenceoligonucleotide cy2 91tatatagagc tcctagatgc ggccctcgag t
31
User Contributions:
Comment about this patent or add new information about this topic: