Patent application title: TRANSCRIPTIONAL ACTIVATORS INVOLVED IN ABIOTIC STRESS TOLERANCE
Inventors:
Shoba Sivasankar (Urbandale, IA, US)
Pioneer Hi-Bred International, Inc. (Johnston, IA, US)
David A. Selinger (Johnston, IA, US)
David A. Selinger (Johnston, IA, US)
Norbert Brugiere (Johnston, IA, US)
Assignees:
PIONEER HI-BRED INTERNATIONAL, INC.
IPC8 Class: AC12N1582FI
USPC Class:
800289
Class name: Multicellular living organisms and unmodified parts thereof and related processes method of introducing a polynucleotide molecule into or rearrangement of genetic material within a plant or plant part the polynucleotide confers resistance to heat or cold (e.g., chilling, etc.)
Publication date: 2013-05-23
Patent application number: 20130133110
Abstract:
The present invention provides compositions and methods for regulating
expression of nucleotide sequences in a plant. Compositions comprise
novel polypeptides involved in modulating gene expression in response to
abiotic stress such as cold or drought, and the polynucleotides encoding
the polypeptides. Methods for expressing the polynucleotides in a plant
and improving cold and/or drought tolerance of plants are also provided.Claims:
1. An isolated polynucleotide encoding a transcription factor which is
involved in modulation of gene expression in response to abiotic stress
and which comprises the full length amino acid sequence of SEQ ID NO: 13,
14, 16, 25 or 27.
2. The isolated polynucleotide of claim 1 wherein said abiotic stress is due to low temperature or dehydration.
3. An expression cassette comprising an isolated polynucleotide of claim 1 and a promoter sequence operably linked to said polynucleotide, wherein said promoter initiates transcription of said linked polynucleotide in a plant transformed with said expression cassette.
4. The expression cassette of claim 3 wherein said operably linked promoter drives expression in a stress-responsive or tissue-preferred manner.
5. A plant, or a part thereof, stably transformed with an expression cassette of claim 3.
6. The plant part of claim 5, wherein the plant part is selected from the group consisting of: cell, protoplast, cell tissue culture, callus, cell clump, embryo, pollen, ovule, seed, flower, kernel, ear, cob, leaf, husk, stalk, root, root tip, anther and silk.
7. A transgenic seed of the plant of claim 5.
8. The plant of claim 5, wherein said plant is a monocot.
9. The plant of claim 8, wherein said monocot is maize, barley, wheat, oat, rye, sorghum or rice.
10. The plant of claim 5, wherein said plant is a dicot.
11. The plant of claim 10, wherein said dicot is soybean, alfalfa, safflower, tobacco, sunflower, cotton or canola.
12. A method for increasing plant tolerance to abiotic stress, comprising transforming a plant with a transformation vector comprising an isolated polynucleotide encoding a transcription factor which is involved in modulation of gene expression and is at least 90% identical to the full length of SEQ ID NO: 13, 14, 16, 25 or 27, as determined by GAP analysis under default parameters.
13. The method of claim 12, wherein said abiotic stress is due to low temperature or dehydration.
14. The method of claim 12, wherein said polynucleotide is operably linked to a promoter which drives expression in a stress-responsive or tissue-preferred manner.
15. An isolated polynucleotide encoding a transcription factor which is involved in modulation of gene expression in response to abiotic stress and which is at least 85% identical to the full length of SEQ ID NO: 13, 14, 16, 25 or 27, as determined by GAP analysis under default parameters.
16. An isolated polynucleotide of SEQ ID NO: 30, 37, 39, 43 or 44.
17. An isolated polynucleotide at least 85% identical to the full length of the polynucleotide of claim 16 and which encodes a transcription factor involved in modulation of gene expression in response to abiotic stress.
Description:
CROSS REFERENCE
[0001] This application is a continuation of U.S. Utility patent application Ser. No. 12/358,698 filed Jan. 23, 2009 and claims priority to, and hereby incorporates by reference in its entirety, U.S. Provisional Patent Application Ser. No. 61/022,916 filed Jan. 23, 2008.
FIELD OF THE INVENTION
[0002] The present invention relates to the field of plant molecular biology, more particularly to regulation of gene expression in plants.
BACKGROUND OF THE INVENTION
[0003] Stresses to plants may be caused by both biotic and abiotic agents. For example, biotic causes of stress include infection with a pathogen, insect feeding, parasitism by another plant such as mistletoe and grazing by ruminant animals. Abiotic stresses include, for example, excessive or insufficient available water, temperature extremes, synthetic chemicals such as herbicides, and excessive wind. Yet plants survive and often flourish, even under unfavorable conditions, using a variety of internal and external mechanisms for avoiding or tolerating stress. Plants' physiological responses to stress reflect changes in gene expression.
[0004] Insufficient water for growth and development of crop plants is a major obstacle to consistent or increased food production worldwide. Population growth, climate change, irrigation-induced soil salinity, and loss of productive agricultural land to development are among the factors contributing to a need for crop plants which can tolerate drought.
[0005] Drought stress often results in reduced yield. In maize, this yield loss results in large part from plant failure to set and fill seed in the apical portion of the ear, a phenomenon known as tip kernel abortion.
[0006] Low temperatures can also reduce crop production. A sudden frost in spring or fall may cause premature tissue death.
[0007] Physiologically, the effects of drought and low-temperature stress may be similar, as both result in cellular dehydration. For example, ice formation in the intercellular spaces draws water across the plasma membrane, creating a water deficit within the cell. Thus, improvement of a plant's drought tolerance may improve its cold tolerance as well.
[0008] Plants adapt to environmental stresses such as cold, drought, and salinity through modulation of gene expression. Promoter regions of stress-inducible genes may comprise cis-acting elements, which are DNA fragments recognized by trans-acting factors. Transacting factors include, for example, proteins stimulated by abscisic acid (ABA) which bind to an ABA-responsive element (ABRE); see, for example, Yamaguchi-Shinozaki, et al., (2005) Trends in Plant Science 10(2):88-94. Transacting factors also include nuclear proteins capable of binding to regulatory DNA and interacting with other molecules, notably DNA Polymerase III, to initiate transcription of DNA operably linked to said regulatory DNA. Transcription factors may exist as families of related proteins that share a DNA-binding domain. The transcription factor genes may themselves be induced by stress. Furthermore, the downstream targets of cis-regulated genes may be transcription factors, creating a complex network of gene response cascades.
[0009] CBF genes (for C-repeat/DRE binding factor) encode proteins which may interact with a specific cis-acting element of certain plant promoters. (U.S. Pat. Nos. 5,296,462 and 5,356,816; Yamaguchi-Shinozaki, et al., (1994) The Plant Cell 6:251-264; Baker, et al., (1994) Plant Mol. Biol. 24:701-713; Jiang, et al., (1996) Plant Mol. Biol. 30:679-684) The cis-acting element is known as the C-repeat/DRE and typically comprises a 5-base-pair core sequence, CCGAC, present in one or more copies.
[0010] CBF proteins may comprise a CBF-specific domain and an AP2 domain and have been identified in various species, including Arabidopsis (Stockinger, et al., (1997) Proc. Natl. Acad. Sci. 94:1035-1040; Liu, et al., (1998) Plant Cell 10:1391-1406); Brassica napus, Lycopersicon esculentum, Secale cereale, and Triticum aestivum (Jaglo, et al., (2001) Plant Phys. 127:910-917) and Brassica juncea, Brassica oleracea, Brassica rapa, Raphanus sativus, Glycine max, and Zea mays (U.S. Pat. Nos. 6,417,428; 7,253,000 and 7,317,141).
[0011] DRE/CRT (Dehydration Response Element/C-Repeat) cis elements function in ABA-independent response to stress and have been identified in numerous plant species, including Arabidopsis, barley, Brassica, citrus, cotton, eucalyptus, grape, maize, melon, pepper, rice, soy, tobacco, tomato and wheat. The DRE/CRT elements comprise a core binding site, A/GCCGAC, recognized by the trans-activating factors known as DREB1 (DRE-Binding) and CBF (C-Repeat Binding Factor). Secondary structure in proximity to the cis element, and/or multiple cis factors appear to be additional components necessary for stress-inducible expression. (For reviews, see, Agarwal, et al., (2006) Plant Cell Rep 25:1263-1274; Yamaguchi-Shinozaki and Shinozaki, (2005) Trends in Plant Science 10(2):88-94). The promoter regions of the CBF/DREB genes may comprise cis-acting elements such as ICEr1 and ICEr2 (Zarka, et al., (2003) Plant Physiol. 133:910-918; Massari and Murre, (2000) Mol. Cell. Bio. 20:429-440).
[0012] Modification of complex agronomic traits requires the concurrent action of multiple genes belonging to multiple pathways. Use of single genes to modify complex agronomic traits may result in the realization of only part of the plant's potential to respond. In contrast, the CBF transcription factor presents an opportunity for overexpression of a single transcription factor to cause the simultaneous activation and overexpression of multiple downstream genes, to provide maximum possible modulation of the trait. The use of selected maize CBF genes based on expression analysis and association studies would enable informed targeting of transgenes or endogenous genes for transgenic modification, or marker-assisted breeding for abiotic stress tolerance.
[0013] Overexpression of CBF in plants has been shown to improve tolerance to drought, cold, and/or salt stress (Jaglo-Ottosen, et al., (1998) Science 280:104-106; Kasuga, et al., (1999) Nature Biotechnology 17:287-291; Hsieh, et al., (2002) Plant Phys. 129:1086-1094; Hsieh, et al., (2002) Plant Phys. 130:618-626; Dubouzet, et al., (2003) Plant J. 33:751-763). While CBF transcription factors may be useful in transgenic approaches to regulate plant response to stress, constitutive expression of CBF results in negative pleiotropic effects. Controlled expression of CBF in selected tissues and/or under stress conditions is of interest.
SUMMARY OF THE INVENTION
[0014] Compositions and methods for regulating gene expression in a plant are provided.
[0015] Compositions comprise isolated polypeptides involved in modulating gene expression in response to cold, salt, and/or drought, including SEQ ID NO: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28 and 29. Further compositions of the invention comprise each polynucleotide encoding a polypeptide of the sequence set forth in SEQ ID NO: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28 or 29, operable fragments of each, and sequences 85% identical to the full length coding sequence of each. The compositions of the invention further comprise polynucleotides set forth in SEQ ID NO: 30, 37, 38, 43 and 44, and full-length polynucleotides complementary thereto, as well as variants and fragments thereof. The sequences are referred to as CBF or CBF-like genes.
[0016] In one embodiment of the invention, a DNA construct comprises an isolated polynucleotide of the invention operably linked to a promoter sequence, wherein the promoter is capable of driving expression of the nucleotide sequence in a plant cell. The promoter sequence may be heterologous to the linked nucleotide sequence. In some embodiments, said promoter sequence is inducible by an exogenous agent or environmental condition. In some embodiments, said promoter initiates transcription preferentially in certain tissues or organs.
[0017] Also provided are expression cassettes comprising said DNA construct; vectors containing said expression cassette; transformed plant cells, transformed plants, and transformed seeds comprising the novel sequences of the invention.
[0018] Further embodiments comprise methods for expressing a polynucleotide or polypeptide of the invention in a plant. The methods comprise stably incorporating into the genome of a plant cell an expression cassette comprising a promoter sequence operably linked to a polynucleotide of the invention, wherein the promoter is capable of initiating transcription of said polynucleotide in a plant cell. Certain embodiments of the present invention comprise methods for modulating the development of a transformed plant under conditions of stress.
BRIEF DESCRIPTION OF THE DRAWINGS
[0019] FIG. 1A-1D provides an alignment of numerous CBF polypeptides from maize: ZmCBF7 (SEQ ID NO: 17), ZmCBF5 (SEQ ID NO: 15), ZmCBF8 (SEQ ID NO: 18), ZmCBF2 (SEQ ID NO: 2, also noted herein as 1084 SEQ 2), ZmCBF10 (SEQ ID NO: 20), ZmCBF4 (SEQ ID NO: 14), ZmCBF9 (SEQ ID NO: 19), ZmCBF11 (SEQ ID NO: 21), ZmCBF6 (SEQ ID NO: 16), ZmCBF1 (SEQ ID NO: 4, also noted herein as 1084 SEQ 4), ZmCBF3 (SEQ ID NO: 13), ZmCBF16 (SEQ ID NO: 26), ZmCBF15 (SEQ ID NO: 25),
[0020] ZmCBF17 (SEQ ID NO: 27), ZmCBF19 (SEQ ID NO: 29), ZmCBF12 (SEQ ID NO: 22), ZmCBF13 (SEQ ID NO: 23), ZmCBF14 (SEQ ID NO: 24), ZmCBF18 (SEQ ID NO: 28).
[0021] FIG. 2 provides a dendogram of the sequences aligned in FIG. 1. Both FIGS. 1 and 2 were created using PileUp software from Accelrys, Inc. at default settings (blosum 62 scoring matrix; gap creation penalty of 8; gap extension penalty of 2; maximum input sequence range, 5000; maximum number of gap characters added, 2000). Note that ZmCBF2 (SEQ ID NO: 2) is shown as 1084 SEQ 2; ZmCBF1 (SEQ ID NO: 4) is shown as 1084 SEQ 4.
[0022] FIG. 3 is a portion of the alignment of FIG. 1 wherein the AP2 domain is underlined and the CBF-specific domain is in bold font, for ZmCBF1, ZmCBF2, and ZmCBF3.
[0023] FIG. 4A-4F is a table of expression profiling results for ZmCBF3 through ZmCBF9 and ZmCBF11.
BRIEF DESCRIPTION OF THE SEQUENCES
TABLE-US-00001
[0024] SEQ ID NO: SEQ ID NO: SEQ ID NO: SEQ ID NO: in U.S. in U.S. in this in Pat. No. Pat. No. application 61/022,916 7,253,000 7,317,141 ZmCBF2 1 & 2 1 & 2 1 & 2 1 & 2 ZmCBF1 3 & 4 3 & 4 3 & 4 3 & 4 Zm Rab17 5 5 5 5 promoter Arabidopsis 6 6 6 6 rd29a promoter Zm RIP2 7 & 8 7 & 8 7 & 8 7 & 8 promoter Zm mLIP15 9 9 9 9 promoter Rye CBF31 10 10 10 10 Arabidopsis 11 & 12 11 & 12 11 & 12 11 & 12 CBF3 ZmCBF3 13 & 30 13 & 30 N/A N/A ZmCBF4 14, 43 & 45 14 N/A N/A ZmCBF5 15 & 31 15 & 31 N/A N/A ZmCBF6 16, 44 & 46 16 N/A N/A ZmCBF7 17 17 N/A N/A ZmCBF8 18 18 N/A N/A ZmCBF9 19 19 N/A N/A ZmCBF10 20 20 N/A N/A ZmCBF11 21 21 N/A N/A ZmCBF12 22, 32 & 33 22, 32 & 33 N/A N/A ZmCBF13 23, 34 & 35 23, 34 & 35 N/A N/A ZmCBF14 24 & 36 24 & 36 N/A N/A ZmCBF15 25 & 37 25 & 37 N/A N/A ZmCBF16 26 & 38 26 & 38 N/A N/A ZmCBF17 27 & 39 27 & 39 N/A N/A ZmCBF18 28 & 40 28 & 40 N/A N/A ZmCBF19 29 & 41 29 & 41 N/A N/A RyeCBF31 42 42 N/A N/A promoter
DETAILED DESCRIPTION OF THE INVENTION
[0025] The invention provides isolated polypeptides active as transcription initiation factors involved in stress-induced gene expression, particularly drought or cold stress.
[0026] By "recombinant expression cassette" or "expression cassette" is meant a nucleic acid construct, generated recombinantly or synthetically, comprising a series of specified nucleic acid elements which permit transcription of a particular nucleic acid in a host cell. The recombinant expression cassette can be incorporated into a plasmid, chromosome, mitochondrial DNA, plastid DNA, virus or nucleic acid fragment. Typically, the expression cassette portion of an expression vector includes, among other sequences, a promoter and a nucleic acid to be transcribed. A polynucleotide sequence encoding ZmCBF3 is provided at SEQ ID NO: 30. A polynucleotide sequence encoding ZmCBF4 is provided at SEQ ID NO: 43. A polynucleotide sequence encoding ZmCBF6 is provided at SEQ ID NO: 44. A polynucleotide sequence encoding ZmCBF15 is provided at SEQ ID NO: 37. A polynucleotide sequence encoding ZmCBF17 is provided at SEQ ID NO: 39. Other polynucleotide coding sequences can be derived by a person of skill in the art from the amino acid sequences provided.
[0027] By "heterologous nucleotide sequence" is intended a sequence that is not naturally occurring with another sequence. For example, a nucleotide sequence encoding a transcription factor may be heterologous to the promoter sequence to which it is operably linked. Further, the coding sequence and/or the promoter sequence may be native or foreign to the plant host.
[0028] By "operable fragment" is meant a truncated or altered form of a particular polynucleotide or polypeptide which is sufficient to perform or provide the relevant function. For example, where the goal is to interfere with gene function, a truncated form of a polynucleotide may be sufficient for purposes of co-suppression or anti-sense regulation. Where the goal is to initiate transcription, a promoter or transcription factor which is less than the full length known, or which comprises minimal internal deletions or alterations, may still function appropriately. Promoter sequences provided, or one or more fragments thereof, may be used either alone or in combination with other sequences to create synthetic promoters. In such embodiments, the fragments (also called "cis-acting elements" or "subsequences") confer desired properties on the synthetic promoter.
[0029] By "promoter" is intended a region of DNA upstream from the start of transcription and involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. A promoter usually comprises a TATA box capable of directing RNA polymerase II to initiate RNA synthesis at the appropriate transcription initiation site for a particular coding sequence. A promoter can additionally comprise other recognition sequences generally positioned upstream or 5' to the TATA box, referred to as upstream promoter elements, which influence the transcription initiation rate. Thus a promoter region may be further defined by comprising upstream regulatory elements such as those responsible for tissue and temporal expression of the coding sequence, enhancers, and the like. In the same manner, the promoter elements which enable expression in the desired tissue can be identified, isolated, and used with other core promoters.
[0030] A "plant promoter" is a promoter capable of initiating transcription in plant cells whether or not its origin is a plant cell. Exemplary plant promoters include, but are not limited to, those that are obtained from plants, plant viruses, and bacteria which comprise genes expressed in plant cells, such as Agrobacterium or Rhizobium. Examples of promoters under developmental control include tissue-preferred promoters, which preferentially initiate transcription in certain tissues, such as leaves, roots, or seeds, and those promoters driving expression when a certain physiological stage of development is reached, such as senescence. Promoters which initiate transcription only in certain tissue are referred to as "tissue-specific." A "cell-type-preferred" promoter primarily drives expression in certain cell types in one or more organs, for example, vascular tissue in roots or leaves. An "inducible" or "repressible" promoter is a promoter which is under environmental control. Examples of environmental conditions that may effect transcription by inducible promoters include anaerobic conditions or the presence of light. Certain promoters are induced by unfavorable environmental conditions, for example, rab17 (exemplified by SEQ ID NO: 5; see also, Busk, et al., (1997) Plant J 11:1285-1295), rd29A (exemplified by SEQ ID NO: 6; see also, GenBank D13044 and Plant Cell 6:251-264, (1994)), rip2 (exemplified by SEQ ID NOS: 7 and 8; see also, GenBank L26305 and Plant Phys. 107(2):661-662 (1995)), mlip15 (exemplified by SEQ ID NO: 9; see also, GenBank D63956; Mol. Gen. Gen. 248(5):507-517 (1995); and ryeCBF31 (U.S. Patent Application Ser. No. 60/981,861 filed Oct. 23, 2007). Tissue-specific, tissue-preferred, cell-type-preferred and inducible promoters are members of the class of "non-constitutive" promoters. A "constitutive" promoter is a promoter which is active in all or nearly all tissues, at all or nearly all developmental stages, under most environmental conditions.
[0031] It is recognized that to increase transcription levels, enhancers can be utilized in combination with promoter regions to increase expression. Enhancers are known in the art and include the SV40 enhancer region, the 35S enhancer element, and the like.
[0032] A "subject plant" or "subject plant cell" is one in which genetic alteration, such as transformation, has been affected as to a gene of interest, or is a plant or plant cell which is descended from a plant or plant cell so altered and which comprises the alteration. A "control" or "control plant" or "control plant cell" provides a reference point for measuring changes in the subject plant or plant cell.
[0033] A control plant or control plant cell may comprise, for example: (a) a wild-type plant or plant cell, i.e., of the same genotype as the starting material for the genetic alteration which resulted in the subject plant or subject plant cell; (b) a plant or plant cell of the same genotype as the starting material but which has been transformed with a null construct (i.e., with a construct which has no known effect on the trait of interest, such as a construct comprising a marker gene); (c) a plant or plant cell which is a non-transformed segregant among progeny of a subject plant or subject plant cell; (d) a plant or plant cell genetically identical to the subject plant or subject plant cell but which is not exposed to conditions or stimuli that would induce expression of the gene of interest; or (e) the subject plant or subject plant cell itself, under conditions in which the gene of interest is not expressed.
[0034] The term "isolated" refers to material, such as a nucleic acid or a protein, which is: (1) substantially or essentially free from components which normally accompany or interact with it as found in its natural environment. The isolated material optionally comprises material not found with the material in its natural environment; or (2) if the material is in its natural environment, the material has been synthetically altered or synthetically produced by deliberate human intervention and/or placed at a different location within the cell. The synthetic alteration or creation of the material can be performed on the material within or apart from its natural state. For example, a naturally-occurring nucleic acid becomes an isolated nucleic acid if it is altered or produced by non-natural, synthetic methods, or if it is transcribed from DNA which has been altered or produced by non-natural, synthetic methods. The isolated nucleic acid may also be produced by the synthetic re-arrangement ("shuffling") of a part or parts of one or more allelic forms of the gene of interest. Likewise, a naturally-occurring nucleic acid (e.g., a promoter) becomes isolated if it is introduced to a different locus of the genome.
[0035] A polynucleotide may be single- or double-stranded, depending on the context, and one of skill in the art would recognize which construction of the term is appropriate.
[0036] The Zea mays sequences of the invention can be used to isolate corresponding sequences from other organisms, particularly from other plants, more particularly from other monocotyledonous plants. Methods such as PCR, hybridization, and the like can be used to identify such sequences based on their similarity to a sequence set forth herein. In hybridization techniques, all or part of a known nucleotide sequence is used as a probe that selectively hybridizes to other corresponding nucleotide sequences present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., genomic or cDNA libraries) from a chosen organism. The hybridization probes may be genomic DNA fragments, cDNA fragments, RNA fragments, or other oligonucleotides, and may be labeled with a detectable group such as 32P, or any other detectable marker. Thus, for example, probes for hybridization can be made by labeling synthetic oligonucleotides based on the sequences of the invention. For example, an entire sequence disclosed herein, or one or more portions thereof, may be used as a probe capable of specifically hybridizing to corresponding sequences. To achieve specific hybridization under a variety of conditions, such probes include sequences that are distinctive and are at least about 10 nucleotides in length. The well-known process of polymerase chain reaction (PCR) may be used to isolate or amplify additional sequences from a chosen organism or as a diagnostic assay to determine the presence of corresponding sequences in an organism. Hybridization techniques include hybridization screening of plated DNA libraries (either plaques or colonies; see, for example, Sambrook, et al., supra; see also, Innis, et al., eds., (1990) PCR Protocols, A Guide to Methods and Applications, Academic Press). Methods for preparation of probes for hybridization and for construction of cDNA and genomic libraries are generally known in the art and are disclosed in Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.) and Ausubel, et al., eds. (1995) Current Protocols in Molecular Biology, Chapter 2 (Greene Publishing and Wiley-Interscience, New York).
[0037] Hybridization of such sequences may be carried out under stringent conditions. By "stringent conditions" or "stringent hybridization conditions" is intended conditions under which a probe will hybridize to its target sequence to a detectably greater degree than to other sequences (e.g., at least 2-fold over background). Stringent conditions are target-sequence-dependent and will differ depending on the structure of the polynucleotide. By controlling the stringency of the hybridization and/or washing conditions, target sequences that are 100% complementary to the probe can be identified (homologous probing). Alternatively, stringency conditions can be adjusted to allow some mismatching in sequences so that lower degrees of similarity are detected (heterologous probing).
[0038] Typically, stringent conditions will be those in which the salt concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C. for short probes (e.g., 10 to 50 nucleotides) and at least about 60° C. for long probes (e.g., greater than 50 nucleotides). Stringency may also be adjusted with the addition of destabilizing agents such as formamide. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCI, 1% SDS (sodium dodecyl sulphate) at 37° C., and a wash in 1× to 2×SSC (20×SSC=3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55° C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCl, 1% SDS at 37° C., and a wash in 0.5× to 1×SSC at 55 to 60° C. Exemplary high stringency conditions include hybridization in 50% formamide, 1 M NaCI, 1% SDS at 37° C., and a wash in 0.1×SSC at 60 to 65° C. The duration of hybridization is generally less than about 24 hours, usually about 4 to about 12 hours.
[0039] Specificity is typically the function of post-hybridization washes, the critical factors being the ionic strength and temperature of the final wash solution. For DNA-DNA hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl (1984) Anal. Biochem. 138:267-284: Tm=81.5° C.+16.6 (log M)+0.41 (% GC)-0.61 (% form)-500/L; where M is the molarity of monovalent cations, % GC is the percentage of guanine and cytosine nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, and L is the length of the hybrid in base pairs. The Tm is the temperature (under defined ionic strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly matched probe. Tm is reduced by about 1° C. for each 1% of mismatching; thus, Tm, hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the desired identity. For example, if sequences with >90% identity are sought, the Tm can be decreased 10° C. Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence and its complement at a defined ionic strength and pH. However, severely stringent conditions can utilize a hybridization and/or wash at 1, 2, 3 or 4° C. lower than the thermal melting point (Tm); moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9 or 10° C. lower than the thermal melting point (Tm); low stringency conditions can utilize a hybridization and/or wash at 11, 12, 13, 14, 15 or 20° C. lower than the thermal melting point (Tm). Using the equation, hybridization and wash compositions, and desired Tm, those of ordinary skill will understand that variations in the stringency of hybridization and/or wash solutions are inherently described. If the desired degree of mismatching results in a Tm of less than 45° C. (aqueous solution) or 32° C. (formamide solution), it is preferred to increase the SSC concentration so that a higher temperature can be used. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, Part I, Chapter 2 (Elsevier, N.Y.) and Ausubel, et al., eds. (1995) Current Protocols in Molecular Biology, Chapter 2 (Greene Publishing and Wiley-Interscience, New York). See also, Sambrook, et al., (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.). Thus, isolated sequences that retain the function of the invention and hybridize under stringent conditions to the sequences disclosed herein, or to their complements, or to fragments of either, are encompassed by the present invention. Such a sequence will usually be at least about 85% identical to a disclosed sequence. That is, the identity of sequences may range, sharing at least about 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity.
[0040] Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, (1981) Adv. Appl. Math. 2:482; by the homology alignment algorithm of Needleman and Wunsch, (1970) J. Mol. Biol. 48:443; by the search for similarity method of Pearson and Lipman, (1988) Proc. Natl. Acad. Sci. 85:2444; by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, Calif.; PileUp, GAP, BESTFIT, BLAST, FASTA and TFASTA in the GCG® Wisconsin Package® from Accelrys, Inc., San Diego, Calif.
[0041] The CLUSTAL program is well described by Higgins and Sharp, (1988) Gene 73:237-244; Higgins and Sharp, (1989) CABIOS 5:151-153; Corpet, et al., (1988) Nucleic Acids Research 16:10881-90; Huang, et al., (1992) Computer Applications in the Biosciences 8:155-65, and Pearson, et al., (1994) Methods in Molecular Biology 24:307-331. A description of BLAST (Basic Local Alignment Search Tool) is provided by Altschul, et al., (1993) J. Mol. Biol. 215:403-410.
[0042] Identity to the sequence of the present invention would mean a polypeptide sequence having at least 85% sequence identity, wherein the percent sequence identity is based on the entire length of SEQ ID NO: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28 or 29.
[0043] The AP2 domain is highly conserved among CBF genes, and some species share an additional conserved region bracketing the AP2 domains. (Jaglo, et al., (2001) Plant Phys. 127:910-917). For example, in FIG. 3, the AP2 domain of ZmCBF1, ZmCBF2 and ZmCBF3 is underlined. The CBF-specific domain of the same sequences is in bold font. Thus one of skill in the art would recognize that variants most likely to retain function are those in which at least one domain is undisturbed.
[0044] The invention encompasses isolated or substantially purified polynucleotide or protein compositions. An "isolated" or "purified" polynucleotide or protein, or biologically active portion thereof, is substantially or essentially free from components that normally accompany or interact with the polynucleotide or protein as found in its naturally occurring environment. Thus, an isolated or purified polynucleotide or protein is substantially free of other cellular material or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Optimally, an "isolated" polynucleotide is free of sequences (optimally protein encoding sequences) that naturally flank the polynucleotide (i.e., sequences located at the 5' and 3' ends of the polynucleotide) in the genomic DNA of the organism from which the polynucleotide is derived. For example, in various embodiments, the isolated polynucleotide can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequence that naturally flank the polynucleotide in genomic DNA of the cell from which the polynucleotide is derived. A protein that is substantially free of cellular material includes preparations of protein having less than about 30%, 20%, 10%, 5% or 1% (by dry weight) of contaminating protein. When the protein of the invention or biologically active portion thereof is recombinantly produced, optimally culture medium represents less than about 30%, 20%, 10%, 5% or 1% (by dry weight) of chemical precursors or non-protein-of-interest chemicals.
[0045] Fragments and variants of ZmCBF polynucleotides and proteins are also encompassed by the methods and compositions of the present invention. By "fragment" is intended a portion of the polynucleotide or a portion of the amino acid sequence. Fragments of a polynucleotide may encode protein fragments that retain the biological activity of the native protein and hence regulate transcription. For example, polypeptide fragments may comprise the CBF-specific domain or the AP2 domain. In some embodiments, the polypeptide fragment will comprise both the CBF-specific domain and the AP2 domain. Alternatively, fragments that are used for suppressing or silencing (i.e., decreasing the level of expression) of a CBF sequence need not encode a protein fragment, but will retain the ability to suppress expression of the target sequence. In addition, fragments that are useful as hybridization probes generally do not encode fragment proteins retaining biological activity. Thus, fragments of a nucleotide sequence may range from at least about 11 nucleotides, about 20 nucleotides, about 50 nucleotides, about 100 nucleotides and up to the full-length polynucleotide encoding a protein of the invention.
[0046] A fragment of a polynucleotide encoding a CBF-specific or AP2 domain or a CBF polypeptide will encode at least 14, 25, 30, 50, 60, 70, 100, 150, 200, 250 or 300 contiguous amino acids, or up to the total number of amino acids present in a full-length CBF-specific or AP2 domain, or CBF or CBF-like protein. Fragments of an AP2 or CBF-specific domain, or a CBF or CBF-like polynucleotide that are useful as hybridization probes, PCR primers, or as suppression constructs generally need not encode a biologically active portion of a CBF protein.
[0047] A biologically active portion of a polypeptide comprising an AP2 or CBF-specific domain, or a CBF or CBF-like protein, can be prepared by isolating a portion of a CBF-like polynucleotide, expressing the encoded portion of the CBF-like protein (e.g., by recombinant expression in vitro), and assessing the activity of the encoded portion of the CBF-like protein. A polynucleotide that is a fragment of a CBF-like nucleotide sequence, or a polynucleotide sequence comprising an AP2 or CBF-specific domain, comprises at least 42, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 800, 900, 1000, 1100, 1200, 1300, 1400 or 1,500 contiguous nucleotides, or up to the number of nucleotides present in a full-length AP2 or CBF-specific domain or in a CBF-like polynucleotide.
[0048] "Variants" is intended to mean substantially similar sequences. For polynucleotides, a variant comprises a deletion and/or addition of one or more nucleotides at one or more sites within the native polynucleotide and/or a substitution of one or more nucleotides at one or more sites in the native polynucleotide. As used herein, a "native" polynucleotide or polypeptide comprises a naturally occurring nucleotide sequence or amino acid sequence, respectively. For polynucleotides, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the CBF-like polypeptides or of an AP2 or a CBF-specific domain. Naturally occurring allelic variants such as these can be identified with the use of well-known molecular biology techniques, as, for example, with polymerase chain reaction (PCR) and hybridization techniques as outlined elsewhere herein. Variant polynucleotides also include synthetically derived polynucleotides, such as those generated, for example, by using site-directed mutagenesis but which still encode a polypeptide comprising an AP2 or a CBF-specific domain (or both), or a CBF-like polypeptide that is capable of regulating transcription or that is capable of reducing the level of expression (i.e., suppressing or silencing) of a CBF-like polynucleotide. Generally, variants of a particular polynucleotide of the invention will have at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to that particular polynucleotide as determined by sequence alignment programs and parameters described elsewhere herein.
[0049] Variants of a particular polynucleotide of the invention (i.e., the reference polynucleotide) can also be evaluated by comparison of the percent sequence identity between the polypeptide encoded by a variant polynucleotide and the polypeptide encoded by the reference polynucleotide. Thus, for example, an isolated polynucleotide that encodes a polypeptide with a given percent sequence identity to the polypeptide of SEQ ID NO: 13 is disclosed. Percent sequence identity between any two polypeptides can be calculated using sequence alignment programs and parameters described elsewhere herein. Where any given pair of polynucleotides of the invention is evaluated by comparison of the percent sequence identity shared by the two polypeptides they encode, the percent sequence identity between the two encoded polypeptides is at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity.
[0050] "Variant" protein is intended to mean a protein derived from the native protein by deletion or addition of one or more amino acids at one or more sites in the native protein and/or substitution of one or more amino acids at one or more sites in the native protein. Variant proteins encompassed by the present invention are biologically active, that is they continue to possess the desired biological activity of the native protein, that is, regulate transcription as described herein. Such variants may result from, for example, genetic polymorphism or human manipulation. Biologically active variants of a CBF-like protein of the invention or of an AP2 or CBF-specific domain will have at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%,85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to the amino acid sequence for the CBF-like protein or the consensus AP2 or CBF-like domain as determined by sequence alignment programs and parameters described elsewhere herein. A biologically active variant of a CBF-like protein of the invention or of an AP2 or CBF domain may differ from that protein by as few as 1-15 amino acid residues, as few as 1-10, such as 6-10, as few as 5, as few as 4, 3, 2 or even by one amino acid residue.
[0051] The polypeptides of the invention may be altered in various ways including amino acid substitutions, deletions, truncations, and insertions. Methods for such manipulations are generally known in the art. For example, amino acid sequence variants and fragments of the CBF-like proteins or AP2 or CBF-like domains can be prepared by mutations in the encoding DNA. Methods for mutagenesis and polynucleotide alterations are well known in the art. See, for example, Kunkel, (1985) Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel, et al., (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192; Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan Publishing Company, New York) and the references cited therein. Guidance as to appropriate amino acid substitutions that do not affect biological activity of the protein of interest may be found in the model of Dayhoff, et al., (1978) Atlas of Protein Sequence and Structure (Natl. Biomed. Res. Found., Washington, D.C.), herein incorporated by reference. Conservative substitutions, such as exchanging one amino acid with another having similar properties, may be optimal.
[0052] Thus, the genes and polynucleotides of the invention include both the naturally occurring sequences as well as mutant forms. Likewise, the proteins of the invention encompass both naturally occurring proteins as well as variations and modified forms thereof. Such variants will continue to possess the desired activity (i.e., the ability to regulate transcription). In specific embodiments, the mutations that will be made in the DNA encoding the variant do not place the sequence out of reading frame and do not create complementary regions that could produce secondary mRNA structure. See, EP Patent Publication Number 0075444.
[0053] The deletions, insertions and substitutions of the protein sequences encompassed herein are not expected to produce radical changes in the characteristics of the protein. However, when it is difficult to predict the exact effect of the substitution, deletion, or insertion, one skilled in the art will appreciate that the effect will be evaluated by routine screening assays. For example, the activity of a CBF-like polypeptide can be evaluated by assaying for the ability of the polypeptide to regulate transcription. Various methods can be used to assay for this activity, including, directly monitoring the level of expression of a target gene at the nucleotide or polypeptide level. Methods for such an analysis are known and include, for example, Northern blots, S1 protection assays, Western blots, enzymatic or colorimetric assays. In specific embodiments, determining if a sequence has CBF-like activity can be assayed by monitoring for an increase or decrease in the level or activity of a target gene. Alternatively, methods to assay for a modulation of transcriptional activity can include monitoring for an alteration in the phenotype of the plant. For example, as discussed in further detail elsewhere herein, modulating the level of a CBF-like polypeptide can result in altered plant tolerance to abiotic stress. Methods to assay for these changes are discussed in further detail elsewhere herein.
[0054] Variant polynucleotides and proteins also encompass sequences and proteins derived from a mutagenic and recombinogenic procedure such as DNA shuffling. With such a procedure, one or more different CBF-like coding sequences can be manipulated to create a new CBF-like sequence or AP2 or CBF-specific domain possessing the desired properties. In this manner, libraries of recombinant polynucleotides are generated from a population of related sequence polynucleotides comprising sequence regions that have substantial sequence identity and can be homologously recombined in vitro or in vivo. For example, using this approach, sequence motifs encoding a domain of interest may be shuffled between the CBF-like gene of the invention and other known CBF-like genes to obtain a new gene coding for a protein with an improved property of interest, such as an increased Km in the case of an enzyme. Strategies for such DNA shuffling are known in the art. See, for example, Stemmer, (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer, (1994) Nature 370:389-391; Crameri, et al., (1997) Nature Biotech. 15:436-438; Moore, et al., (1997) J. Mol. Biol. 272:336-347; Zhang, et al., (1997) Proc. Natl. Acad. Sci. USA 94:4504-4509; Crameri, et al., (1998) Nature 391:288-291; and U.S. Pat. Nos. 5,605,793 and 5,837,458.
[0055] The expression cassette may also include, at the 3' terminus of the heterologous nucleotide sequence of interest, a transcriptional and translational termination region functional in plants. The termination region can be native with the promoter nucleotide sequence present in the expression cassette, can be native with the DNA sequence of interest, or can be derived from another source. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also, Guerineau, et al., (1991) Mol. Gen. Genet. 262:141-144; Proudfoot, (1991) Cell 64:671-674; Sanfacon, et al., (1991) Genes Dev. 5:141-149; Mogen, et al., (1990) Plant Cell 2:1261-1272; Munroe, et al., (1990) Gene 91:151-158; Ballas, et al., 1989) Nucleic Acids Res. 17:7891-7903; Joshi, et al., (1987) Nucleic Acid Res. 15:9627-9639.
[0056] The expression cassettes can additionally contain 5' leader sequences. Such leader sequences can act to enhance translation. Translation leaders are known in the art and include: picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5' noncoding region), Elroy-Stein, et al., (1989) Proc. Nat. Acad. Sci. USA 86:6126-6130; potyvirus leaders, for example, TEV leader (Tobacco Etch Virus), Allison, et al., (1986); MDMV leader (Maize Dwarf Mosaic Virus), Virology 154:9-20; human immunoglobulin heavy-chain binding protein (BiP), Macejak, et al., (1991) Nature 353:90-94; untranslated leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4), Jobling, et al., (1987) Nature 325:622-625); tobacco mosaic virus leader (TMV), Gallie, et al., (1989) Molecular Biology of RNA, pages 237-256; and maize chlorotic mottle virus leader (MCMV) Lommel, et al., (1991) Virology 81:382-385. See also, Della-Cioppa, et al., (1987) Plant Physiology 84:965-968. The cassette can also contain sequences that enhance translation and/or mRNA stability such as introns.
[0057] In those instances where it is desirable to have the expressed product of the heterologous nucleotide sequence directed to a particular organelle, particularly the plastid, amyloplast, or to the endoplasmic reticulum, or secreted at the cell's surface or extracellularly, the expression cassette can further comprise a coding sequence for a transit peptide. Such transit peptides are well known in the art and include, but are not limited to, the transit peptide for the acyl carrier protein, the small subunit of RUBISCO, plant EPSP synthase, and the like.
[0058] In preparing the expression cassette, the various DNA fragments can be manipulated so as to provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this end, adapters or linkers can be employed to join the DNA fragments, or other manipulations can be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction digests, annealing, and resubstitutions, such as transitions and transversions, can be involved.
[0059] As noted herein, the present invention provides vectors capable of expressing the claimed sequences under the control of an operably linked promoter. In general, the vectors should be functional in plant cells. At times, it may be preferable to have vectors that are functional in E. coli (e.g., production of protein for raising antibodies, DNA sequence analysis, construction of inserts, obtaining quantities of nucleic acids). Vectors and procedures for cloning and expression in E. coli are discussed in Sambrook, et al., (supra).
[0060] The transformation vector, comprising a sequence of the present invention operably linked to a promoter in an expression cassette, can also contain at least one additional nucleotide sequence for a gene to be cotransformed into the organism. Alternatively, the additional sequence(s) can be provided on another transformation vector.
[0061] Vectors that are functional in plants can be binary plasmids derived from Agrobacterium. Such vectors are capable of transforming plant cells. These vectors contain left and right border sequences that are required for integration into the host (plant) chromosome. At a minimum, between these border sequences is the gene to be expressed under control of an operably-linked promoter. In preferred embodiments, a selectable marker and a reporter gene are also included. For ease of obtaining sufficient quantities of vector, a bacterial origin that allows replication in E. coli is preferred.
[0062] Reporter genes can be included in the transformation vectors. Examples of suitable reporter genes known in the art can be found in, for example, Jefferson, et al., (1991) in Plant Molecular Biology Manual, ed. Gelvin, et al., (Kluwer Academic Publishers), pp. 1-33; DeWet, et al., (1987) Mol. Cell. Biol. 7:725-737; Goff, et al., (1990) EMBO J. 9:2517-2522; Kain, et al., (1995) BioTechniques 19:650-655; and Chiu, et al., (1996) Current Biology 6:325-330.
[0063] Selectable marker genes for selection of transformed cells or tissues can be included in the transformation vectors. These can include genes that confer antibiotic resistance or resistance to herbicides. Examples of suitable selectable marker genes include, but are not limited to, genes encoding resistance to chloramphenicol, Herrera Estrella, et al., (1983) EMBO J. 2:987-992; methotrexate, Herrera Estrella, et al., (1983) Nature 303:209-213; Meijer, et al., (1991) Plant Mol. Biol. 16:807-820; hygromycin, Waldron, et al., (1985) Plant Mol. Biol. 5:103-108; Zhijian, et al., (1995) Plant Science 108:219-227; streptomycin, Jones, et al., (1987) Mol. Gen. Genet. 210:86-91; spectinomycin, Bretagne-Sagnard, et al., (1996) Transgenic Res. 5:131-137; bleomycin, Hille, et al., (1990) Plant Mol. Biol. 7:171-176; sulfonamide, Guerineau, et al., (1990) Plant Mol. Biol. 15:127-136; bromoxynil, Stalker, et al., (1988) Science 242:419-423; glyphosate, Shaw, et al., (1986) Science 233:478-481; phosphinothricin, DeBlock, et al., (1987) EMBO J. 6:2513-2518.
[0064] Other genes that could serve utility in the recovery of transgenic events but might not be required in the final product would include, but are not limited to, examples such as GUS (β-glucuronidase), Jefferson (1987) Plant Mol. Biol. Rep. 5:387); GFP (green fluorescence protein), Chalfie, et al., (1994) Science 263:802, and Gerdes (1996) FEBS Lett. 389:44-47; DSred (Dietrich, et al., (2002) Biotechniques 2(2):286-293); luciferase, Teeri, et al., (1989) EMBO J. 8:343; KN1 (Smith, et al., (1995) Dev. Genetics 16(4):344-348); Sugaryl, Rahman, et al., (1998) Plant Physiol. 117:425-435; James, et al., (1995) Plant Cell 7:417-429 and GenBank Accession Number U18908; and systems utilizing the maize genes encoding enzymes for anthocyanin production, including CRC, P (Bruce, et al., (2000) Plant Cell 12(1):65-79, and R (Ludwig, et al., (1990) Science 247:449).
[0065] The transformation vector comprising an isolated polynucleotide encoding a polypeptide of the present invention, operably linked to a promoter sequence in an expression cassette, can be used to transform any plant. In this manner, genetically modified plants, plant cells, plant tissue, seed, and the like can be obtained. Transformation protocols can vary depending on the type of plant or plant cell targeted for transformation, e.g., monocot or dicot. Suitable methods of transforming plant cells include microinjection, Crossway, et al., (1986) Biotechniques 4:320-334; electroporation, Riggs, et al., (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606; Agrobacterium-mediated transformation, see for example, Townsend, et al., U.S. Pat. No. 5,563,055; direct gene transfer, Paszkowski, et al., (1984) EMBO J. 3:2717-2722; and ballistic particle acceleration, see for example, Sanford, et al., U.S. Pat. No. 4,945,050; Tomes, et al., (1995) in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips, (Springer-Verlag, Berlin); and McCabe, et al., (1988) Biotechnology 6:923-926. Also see, Weissinger, et al., (1988) Annual Rev. Genet. 22:421-477; Sanford, et al., (1987) Particulate Science and Technology 5:27-37 (onion); Christou, et al., (1988) Plant Physiol. 87:671-674 (soybean); McCabe, et al., (1988) Bio/Technology 6:923-926 (soybean); Datta, et al., (1990) Biotechnology 8:736-740 (rice); Klein, et al., (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein, et al., (1988) Biotechnology 6:559-563 (maize); Klein, et al., (1988) Plant Physiol. 91:440-444 (maize); Fromm, et al., (1990) Biotechnology 8:833-839; Hooydaas-Van Slogteren, et al., (1984) Nature (London) 311:763-764; Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet, et al., (1985) in The Experimental Manipulation of Ovule Tissues, ed. Chapman, et al., (Longman, N.Y.), pp. 197-209 (pollen); Kaeppler, et al., (1990) Plant Cell Reports 9:415-418; and Kaeppler, et al., (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D. Halluin, et al., (1992) Plant Cell 4:1495-1505 (electroporation); Li, et al., (1993) Plant Cell Reports 12:250-255 and Christou, et al., (1995) Annals of Botany 75:407-413 (rice); Osjoda, et al., (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference.
[0066] The cells that have been transformed can be grown into plants in accordance with conventional ways. See, for example, McCormick, et al., (1986) Plant Cell Reports 5:81-84. These plants can then be pollinated with the same transformed strain or different strains. The resulting plants having expression of the desired characteristic can then be identified. Two or more generations can be grown to ensure that the desired phenotypic characteristic is stably maintained and inherited under conditions of interest.
[0067] In certain embodiments the nucleic acid sequences of the present invention can be used in combination ("stacked") with other polynucleotide sequences of interest in order to create plants with a desired phenotype. The polynucleotides of the present invention may be stacked with any gene or combination of genes, and the combinations generated can include multiple copies of any one or more of the polynucleotides of interest. The desired combination may affect one or more traits; that is, certain combinations may be created for modulation of gene expression involved in plant response to stress. Other combinations may be designed to produce plants with a variety of desired traits, including but not limited to traits desirable for animal feed such as high oil genes (e.g., U.S. Pat. No. 6,232,529); balanced amino acids (e.g., hordothionins (U.S. Pat. Nos. 5,990,389; 5,885,801; 5,885,802 and 5,703,409); barley high lysine (Williamson, et al., (1987) Eur. J. Biochem. 165:99-106; and WO 1998/20122); and high methionine proteins (Pedersen, et al., (1986) J. Biol. Chem. 261:6279; Kirihara, et al., (1988) Gene 71:359; and Musumura, et al., (1989) Plant Mol. Biol. 12: 123)); increased digestibility (e.g., modified storage proteins (U.S. patent application Ser. No. 10/053,410, filed Nov. 7, 2001); and thioredoxins (U.S. patent application Ser. No. 10/005,429, filed Dec. 3, 2001)), the disclosures of which are herein incorporated by reference. The polynucleotides of the present invention can also be stacked with traits desirable for insect, disease or herbicide resistance (e.g., Bacillus thuringiensis toxic proteins (U.S. Pat. Nos. 5,366,892; 5,747,450; 5,737,514; 5723,756; 5,593,881; Geiser, et al., (1986) Gene 48:109); lectins (Van Damme, et al., (1994) Plant Mol. Biol. 24:825); fumonisin detoxification genes (U.S. Pat. No. 5,792,931); avirulence and disease resistance genes (Jones, et al., (1994) Science 266:789; Martin, et al., (1993) Science 262:1432; Mindrinos, et al., (1994) Cell 78:1089); acetolactate synthase (ALS) mutants that lead to herbicide resistance such as the S4 and/or Hra mutations; inhibitors of glutamine synthase such as phosphinothricin or basta (e.g., bar gene); and glyphosate resistance (EPSPS gene)); and traits desirable for processing or process products such as high oil (e.g., U.S. Pat. No. 6,232,529); modified oils (e.g., fatty acid desaturase genes (U.S. Pat. No. 5,952,544; WO 94/11516)); modified starches (e.g., ADPG pyrophosphorylases (AGPase), starch synthases (SS), starch branching enzymes (SBE) and starch debranching enzymes (SDBE)); and polymers or bioplastics (e.g., U.S. Pat. No. 5,602,321; beta-ketothiolase, polyhydroxybutyrate synthase, and acetoacetyl-CoA reductase (Schubert, et al., (1988) J. Bacteriol. 170:5837-5847) facilitate expression of polyhydroxyalkanoates (PHAs)), the disclosures of which are herein incorporated by reference. One could also combine the polynucleotides of the present invention with polynucleotides affecting agronomic traits such as male sterility (e.g., see, U.S. Pat. No. 5.583,210), stalk strength, flowering time, or transformation technology traits such as cell cycle regulation or gene targeting (e.g., WO 1999/61619; WO 2000/17364; WO 1999/25821), the disclosures of which are herein incorporated by reference.
[0068] These stacked combinations can be created by any method, including but not limited to cross breeding plants by any conventional or TopCross methodology, or genetic transformation. If the traits are stacked by genetically transforming the plants, the polynucleotide sequences of interest can be combined at any time and in any order. For example, a transgenic plant comprising one or more desired traits can be used as the target to introduce further traits by subsequent transformation. The traits can be introduced simultaneously in a co-transformation protocol with the polynucleotides of interest provided by any combination of transformation cassettes. For example, if two sequences will be introduced, the two sequences can be contained in separate transformation cassettes (trans) or contained on the same transformation cassette (cis). Expression of the sequences of interest can be driven by the same promoter or by different promoters. In certain cases, it may be desirable to introduce a transformation cassette that will suppress the expression of a polynucleotide of interest. This may be accompanied by any combination of other suppression cassettes or over-expression cassettes to generate the desired combination of traits in the plant.
[0069] The transformed plants of the invention may be used in a plant breeding program. The goal of plant breeding is to combine, in a single variety or hybrid, various desirable traits. For field crops, these traits may include, for example, resistance to diseases and insects, tolerance to heat, cold, and/or drought, reduced time to crop maturity, greater yield, and better agronomic quality. With mechanical harvesting of many crops, uniformity of plant characteristics such as germination and stand establishment, growth rate, maturity, and plant and ear height, is desirable. Traditional plant breeding is an important tool in developing new and improved commercial crops. This invention encompasses methods for producing a maize plant by crossing a first parent maize plant with a second parent maize plant wherein one or both of the parent maize plants is a transformed plant, as described herein.
[0070] Plant breeding techniques known in the art and used in a maize plant breeding program include, but are not limited to, recurrent selection, bulk selection, mass selection, backcrossing, pedigree breeding, open pollination breeding, restriction fragment length polymorphism enhanced selection, genetic marker enhanced selection, doubled haploids, and transformation. Often combinations of these techniques are used.
[0071] The development of maize hybrids in a maize plant breeding program requires, in general, the development of homozygous inbred lines, the crossing of these lines, and the evaluation of the crosses. There are many analytical methods available to evaluate the result of a cross. The oldest and most traditional method of analysis is the observation of phenotypic traits. Alternatively, the genotype of a plant can be examined.
[0072] A genetic trait which has been engineered into a particular maize plant using transformation techniques, could be moved into another line using traditional breeding techniques that are well known in the plant breeding arts. For example, a backcrossing approach is commonly used to move a transgene from a transformed maize plant to an elite inbred line, and the resulting progeny would then comprise the transgene(s). Also, if an inbred line was used for the transformation then the transgenic plants could be crossed to a different inbred in order to produce a transgenic hybrid maize plant. As used herein, "crossing" can refer to a simple X by Y cross, or the process of backcrossing, depending on the context.
[0073] The development of a maize hybrid in a maize plant breeding program involves three steps: (1) the selection of plants from various germplasm pools for initial breeding crosses; (2) the selfing of the selected plants from the breeding crosses for several generations to produce a series of inbred lines, which, while different from each other, breed true and are highly uniform; and (3) crossing the selected inbred lines with different inbred lines to produce the hybrids. During the inbreeding process in maize, the vigor of the lines decreases. Vigor is restored when two different inbred lines are crossed to produce the hybrid. An important consequence of the homozygosity and homogeneity of the inbred lines is that the hybrid created by crossing a defined pair of inbreds will always be the same. Once the inbreds that give a superior hybrid have been identified, the hybrid seed can be reproduced indefinitely as long as the homogeneity of the inbred parents is maintained.
[0074] Transgenic plants of the present invention may be used to produce a single cross hybrid, a three-way hybrid or a double cross hybrid. A single cross hybrid is produced when two inbred lines are crossed to produce the F1 progeny. A double cross hybrid is produced from four inbred lines crossed in pairs (A×B and C×D) and then the two F1 hybrids are crossed again (A×B)×(C×D). A three-way cross hybrid is produced from three inbred lines where two of the inbred lines are crossed (A×B) and then the resulting F1 hybrid is crossed with the third inbred (A×B)×C. Much of the hybrid vigor and uniformity exhibited by F1 hybrids is lost in the next generation (F2). Consequently, seed produced by hybrids is consumed rather than planted.
[0075] The following examples are offered by way of illustration and not by way of limitation.
EXAMPLES
Example 1
Expression of Transgenes in Monocot Cells
[0076] A plasmid vector is constructed comprising a polynucleotide encoding the full-length polypeptide of SEQ ID NO: 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28 or 29, operably linked to a heterologous promoter, such as a constitutive promoter or a stress-responsive promoter, for example rab17, rd29A, rip2, mlip15 or ryeCBF31. This construct can then be introduced into maize cells by the following procedure.
[0077] Immature maize embryos are dissected from developing caryopses. The embryos are isolated 10 to 11 days after pollination when they are 1.0 to 1.5 mm long. The embryos are then placed with the axis-side facing down and in contact with agarose-solidified N6 medium (Chu, et al., (1975) Sci. Sin. Peking 18:659-668). The embryos are kept in the dark at 27° C. Friable embryogenic callus, consisting of undifferentiated masses of cells with somatic proembryoids and embryoids borne on suspensor structures, proliferates from the scutellum of these immature embryos. The embryogenic callus isolated from the primary explant can be cultured on N6 medium and sub-cultured on this medium every 2 to 3 weeks.
[0078] The plasmid p35S/Ac (Hoechst Ag, Frankfurt, Germany) or equivalent may be used in transformation experiments in order to provide for a selectable marker. This plasmid contains the Pat gene (see, European Patent Publication Number 0 242 236) which encodes phosphinothricin acetyl transferase (PAT). The enzyme PAT confers resistance to herbicidal glutamine synthetase inhibitors such as phosphinothricin. The pat gene in p35S/Ac is under the control of the 35S promoter from Cauliflower Mosaic Virus (Odell, et al., (1985) Nature 313:810-812) and the 3' region of the nopaline synthase gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens.
[0079] The particle bombardment method (Klein, et al., (1987) Nature 327:70-73) may be used to transfer genes to the callus culture cells. According to this method, gold particles (1 μm in diameter) are coated with DNA using the following technique. Ten μg of plasmid DNA are added to 50 μL of a suspension of gold particles (60 mg per mL). Calcium chloride (50 μL of a 2.5 M solution) and spermidine free base (20 μL of a 1.0 M solution) are added to the particles. The suspension is vortexed during the addition of these solutions. After 10 minutes, the tubes are briefly centrifuged (5 sec at 15,000 rpm) and the supernatant removed. The particles are resuspended in 200 μL of absolute ethanol, centrifuged again and the supernatant removed. The ethanol rinse is performed again and the particles resuspended in a final volume of 30 μL of ethanol. An aliquot (5 μL) of the DNA-coated gold particles can be placed in the center of a Kapton flying disc (Bio-Rad Labs). The particles are then accelerated into the corn tissue with a Biolistic PDS-1000/He (Bio-Rad Instruments, Hercules Calif.), using a helium pressure of 1000 psi, a gap distance of 0.5 cm and a flying distance of 1.0 cm.
[0080] For bombardment, the embryogenic tissue is placed on filter paper over agarose-solidified N6 medium. The tissue is arranged as a thin lawn and covers a circular area of about 5 cm in diameter. The petri dish containing the tissue can be placed in the chamber of the PDS-1000/He approximately 8 cm from the stopping screen. The air in the chamber is then evacuated to a vacuum of 28 inches of Hg. The macrocarrier is accelerated with a helium shock wave using a rupture membrane that bursts when the He pressure in the shock tube reaches 1000 psi.
[0081] Seven days after bombardment the tissue can be transferred to N6 medium that contains gluphosinate (2 mg per liter) and lacks casein or proline. The tissue continues to grow slowly on this medium. After an additional 2 weeks the tissue can be transferred to fresh N6 medium containing gluphosinate. After 6 weeks, areas of actively growing callus about 1 cm in diameter can be identified on some of the plates containing the glufosinate-supplemented medium. These calli may continue to grow when sub-cultured on the selective medium.
[0082] Plants can be regenerated from the transgenic callus by first transferring clusters of tissue to N6 medium supplemented with 0.2 mg per liter of 2,4-D. After two weeks the tissue can be transferred to regeneration medium (Fromm, et al., (1990) Bio/Technology 8:833-839).
Example 2
Expression of Transgenes in Dicot Cells
[0083] Soybean embryos are bombarded with a plasmid comprising a CBF polynucleotide operably linked to a promoter, as follows. To induce somatic embryos, cotyledons of 3-5 mm in length are dissected from surface-sterilized, immature seeds of the soybean cultivar A2872, then cultured in the light or dark at 26° C. on an appropriate agar medium for six to ten weeks. Somatic embryos producing secondary embryos are then excised and placed into a suitable liquid medium. After repeated selection for clusters of somatic embryos that multiplied as early, globular-staged embryos, the suspensions are maintained as described below.
[0084] Soybean embryogenic suspension cultures can be maintained in 35 ml liquid media on a rotary shaker, 150 rpm, at 26° C. with fluorescent lights on a 16:8 hour day/night schedule. Cultures are subcultured every two weeks by inoculating approximately 35 mg of tissue into 35 ml of liquid medium.
[0085] Soybean embryogenic suspension cultures may then be transformed by the method of particle gun bombardment (Klein, et al., (1987) Nature (London) 327:70-73, U.S. Pat. No. 4,945,050). A DuPont Biolistic PDS1000/HE instrument (helium retrofit) can be used for these transformations.
[0086] A selectable marker gene that can be used to facilitate soybean transformation is a transgene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell, et al., (1985) Nature 313:810-812), the hygromycin phosphotransferase gene from plasmid pJR225 (from E. coli; Gritz, et al., (1983) Gene 25:179-188), and the 3' region of the nopaline synthase gene from the T-DNA of the Ti plasmid of Agrobacterium tumefaciens. The expression cassette comprising the sequence of interest operably linked to a promoter can be isolated as a restriction fragment. This fragment can then be inserted into a unique restriction site of the vector carrying the marker gene.
[0087] To 50 μl of a 60 mg/ml 1 μm gold particle suspension is added (in order): 5 μl DNA (1 μg/μl), 20 μl spermidine (0.1 M), and 50 μl CaCl2 (2.5 M). The particle preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and the supernatant removed. The DNA-coated particles are then washed once in 400 μl 70% ethanol and resuspended in 40 μl of anhydrous ethanol. The DNA/particle suspension can be sonicated three times for one second each. Five microliters of the DNA-coated gold particles are then loaded on each macro carrier disk.
[0088] Approximately 300-400 mg of a two-week-old suspension culture is placed in an empty 60×15 mm petri dish and the residual liquid removed from the tissue with a pipette. For each transformation experiment, approximately 5-10 plates of tissue are normally bombarded. Membrane rupture pressure is set at 1100 psi, and the chamber is evacuated to a vacuum of 28 inches mercury. The tissue is placed approximately 3.5 inches away from the retaining screen and bombarded three times. Following bombardment, the tissue can be divided in half and placed back into liquid and cultured as described above.
[0089] Five to seven days post bombardment, the liquid media may be exchanged with fresh media, and eleven to twelve days post-bombardment with fresh media containing 50 mg/ml hygromycin. This selective media can be refreshed weekly. Seven to eight weeks post-bombardment, green, transformed tissue may be observed growing from untransformed, necrotic embryogenic clusters. Isolated green tissue is removed and inoculated into individual flasks to generate new, clonally propagated, transformed embryogenic suspension cultures. Each new line may be treated as an independent transformation event. These suspensions can then be subcultured and maintained as clusters of immature embryos or regenerated into whole plants by maturation and germination of individual somatic embryos.
Example 3
Identification of the Gene from a Computer Homology Search
[0090] Gene identities can be determined by conducting BLAST (Basic Local Alignment Search Tool; Altschul, et al., (1993) J. Mol. Biol. 215:403-410; see also, information available from NCBI (National Center for Biotechnology Information, US National Library of Medicine, 8600 Rockville Pike, Bethesda, Md. 20894)) searches under default parameters for similarity to sequences contained in the BLAST "nr" database (comprising all non-redundant GenBank CDS translations, sequences derived from the 3-dimensional structure Brookhaven Protein Data Bank, the last major release of the SWISS-PROT protein sequence database, EMBL, and DDBJ databases). The cDNA sequences are analyzed for similarity to all publicly available DNA sequences contained in the "nr" database using the BLASTN program. The DNA sequences are translated in all reading frames and compared for similarity to all publicly available protein sequences contained in the "nr" database using the BLASTX program (Gish and States, (1993) Nature Genetics 3:266-272) provided by the NCBI. In some cases, the sequencing data from two or more clones containing overlapping segments of DNA are used to construct contiguous DNA sequences.
[0091] Sequence alignments and percent identity calculations can be performed using software such as GAP, BestFit, PileUp or Pretty, available as part of the GCG® Wisconsin Package® from Accelrys, Inc., San Diego, Calif. Default parameters for pairwise alignments of polynucleotide sequences using GAP and BestFit are Gap Creation Penalty=50, Gap Extension Penalty=3; nwsgapdna.cmp is the scoring matrix. Default parameters for pairwise alignments for polypeptide sequences using GAP and BestFit are Gap Creation Penalty=8, Gap Extension Penalty=2; BLOSUM62 is the scoring matrix. There is no penalty for gaps at ends of polynucleotide or polypeptide alignments.
[0092] Default parameters for polynucleotide sequence comparison using PileUp and Pretty are: Gap Creation Penalty=5, Gap Extension Penalty=1. Default parameters for polypeptide sequence comparison using PileUp or Pretty are Gap Creation Penalty=8, Gap Extension Penalty=2; BLOSUM62 is the scoring matrix.
[0093] Sequence alignments can also be accomplished with the Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., Madison, Wis.). Multiple alignment of the sequences can be performed using the Clustal method of alignment (Higgins and Sharp, (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwise alignments using the Clustal method are KTUPLE 1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5.
[0094] Other pairwise comparison tools are also available and known to those of skill in the art.
Example 4
Standard Agro Transformation Protocol
[0095] For Agrobacterium-mediated transformation of maize, the method of Zhao is employed (U.S. Pat. No. 5,981,840, and PCT Patent Publication Number WO 1998/32326, the contents of which are hereby incorporated by reference). Briefly, immature embryos are isolated from maize and the embryos immersed in an Agrobacterium suspension, where the bacteria are capable of transferring the gene of interest to at least one cell of at least one of the immature embryos (step 1: the infection step). The embryos are then co-cultured for a time with the Agrobacterium on solid medium (step 2: the co-cultivation step). During the co-cultivation step infected embryos are cultured at 20° C. for 3 days, and then at 26° C. for 4 days. Following this co-cultivation period an optional "resting" step is contemplated in which the embryos are incubated in the presence of at least one antibiotic known to inhibit the growth of Agrobacterium, without the addition of a selective agent for plant transformants (step 3: resting step). Transient expression based on a color marker can be monitored during the co-cultivation and the resting steps. Next, inoculated embryos are cultured on medium containing a selective agent and growing transformed callus is recovered (step 4: the selection step). Finally, calli grown on selective medium are cultured on solid medium to regenerate transformed plants (step 5: the regeneration step).
Example 5
Identification and Phylogeny of Multiple Maize CBF Polypeptides
[0096] As described in Example 3, bioinformatics search tools can be used to identify polynucleotides or polypeptides with common sequences or sequence elements. Using ZmCBF1 and ZmCBF2 sequences (SEQ ID NOS: 1-4), such searches of the TIGR GSS assembly 4.0 were conducted. Seventeen maize CBF or CBF-like sequences were identified in this way.
[0097] Maize CBF protein sequences were aligned with Arabidopsis and rye CBF sequences. From the alignment, 1000 half-delete jackknife permuted datasets were generated and used to produce 1000 neighbor-joining phylogenetic trees. The consensus tree from among these was then run through the Maximum-Likelihood program of Phylip to produce a tree with branch lengths scaled to amino acid substitution distance. Based on this tree, all of the corn sequences are in a separate clade from the Arabidopsis sequences. However, the corn sequence clade forms a 100% supported grouping with the Arabidopsis CBF and At5g51990 clade. This grouping suggests that there are four Arabidopsis CBF type proteins and ten corn CBF type proteins.
Example 6
Expression Analysis of ZmCBF Genes
[0098] For genes ZmCBF3, CBF4 through CBF9, and CBF11, expression profiling was conducted using massively parallel sequencing technology (MPSS, Illumina®, Hayward, Calif.; formerly Solexa). No appropriate signature tags were available for ZmCBF1, ZmCBF2 and CBF 10.
[0099] Results are shown in FIG. 4. CBF-like7 is specifically higher in expression in the chilled seedling versus the control; see Page 5 of FIG. 4, csdl1lm-chil versus csd1lm-ctr.
[0100] CBF5 and CBF7 are specifically higher in the drought stressed pedicels versus the controls; see Page 4 of FIG. 4, cpd1-drg v. cpd1-ctr.
Example 7
ZmCBF12 Expression Data
[0101] Analysis of proprietary tissue libraries indicated that ZmCBF12 is expressed in all tissues, namely, vegetative, reproductive, and root, and it was found to be induced by biotic and abiotic stresses. The expression of this gene was highest at 550 ppm in maize whole kernels as reported in the proprietary MPSS libraries. Its expression was four-fold higher in drought-stressed maize pedicels relative to control, almost three-fold higher in ABA-treated leaves and cytokinin-treated leaf discs relative to control, and two-fold higher in seedling tissues that were recovering from freeze-treatment relative to control seedlings at optimum temperatures. This indicates potential significance of this gene in stress tolerance.
[0102] The above examples are provided to illustrate the invention but not to limit its scope. Other variants of the invention will be readily apparent to one of ordinary skill in the art and are encompassed by the appended claims.
[0103] All publications and patent applications cited in the specification are indicative of the level of skill of those in the art to which this invention pertains. All publications, patents, patent applications, and computer programs cited herein are incorporated by reference to the same extent as if specifically and individually indicated to be incorporated by reference.
Sequence CWU
1
1
461696DNAZea maysCDS(1)..(693)ZmCBF2 1atg tgc cca acc aag aag ggg atg acc
gga gag ccg agc tcg cca tgc 48Met Cys Pro Thr Lys Lys Gly Met Thr
Gly Glu Pro Ser Ser Pro Cys1 5 10
15 agc tcg gca tca gcc tcg acc tta ccg gag cac cac cag acg gtg
tgg 96Ser Ser Ala Ser Ala Ser Thr Leu Pro Glu His His Gln Thr Val
Trp 20 25 30
acg tcg ccg ccg aag cgg cca gcg ggg cgg acc aag ttc cgg gag acg
144Thr Ser Pro Pro Lys Arg Pro Ala Gly Arg Thr Lys Phe Arg Glu Thr
35 40 45
cgg cac ccg gtg ttc cgc ggc gtc cgg cgc cgg ggc agc gcc ggg cgg
192Arg His Pro Val Phe Arg Gly Val Arg Arg Arg Gly Ser Ala Gly Arg
50 55 60
tgg gtg tgc gag gtg cgc gtg ccg ggg agg cgc ggc tgc agg ctc tgg
240Trp Val Cys Glu Val Arg Val Pro Gly Arg Arg Gly Cys Arg Leu Trp
65 70 75 80
ctc ggc acc ttc gac acg gcc gag gcg gcg gcc cgc gcg cac gac gcc
288Leu Gly Thr Phe Asp Thr Ala Glu Ala Ala Ala Arg Ala His Asp Ala
85 90 95
gcc atg ctc gcc ctc gcc ggc gcg ggc gcc tgc tgc ctc aac ttc gcc
336Ala Met Leu Ala Leu Ala Gly Ala Gly Ala Cys Cys Leu Asn Phe Ala
100 105 110
gac tcg gcc tgg ctc ctc gcg gtc ccg gcc tcg tgc gcc agc ctc gcc
384Asp Ser Ala Trp Leu Leu Ala Val Pro Ala Ser Cys Ala Ser Leu Ala
115 120 125
gag gtc cgc cac gcg gtc gcg gac gcc gtg gag gac ttc ctc cgc cat
432Glu Val Arg His Ala Val Ala Asp Ala Val Glu Asp Phe Leu Arg His
130 135 140
cag gtg gtc ccg gag gac gac gcc ctc gcg gcc acg ccg tcg tcg cct
480Gln Val Val Pro Glu Asp Asp Ala Leu Ala Ala Thr Pro Ser Ser Pro
145 150 155 160
tcc agc gaa gac ggc agc acc tct gat ggc ggg gag tcc tcc tct gat
528Ser Ser Glu Asp Gly Ser Thr Ser Asp Gly Gly Glu Ser Ser Ser Asp
165 170 175
tcc tct ccg ccc acc ggg gcc tcg ccg ttc gaa ttg gat gtg ttc aac
576Ser Ser Pro Pro Thr Gly Ala Ser Pro Phe Glu Leu Asp Val Phe Asn
180 185 190
gac atg agc tgg gac ctg cac tac gcg agc ttg gcg cag gga ttg ctc
624Asp Met Ser Trp Asp Leu His Tyr Ala Ser Leu Ala Gln Gly Leu Leu
195 200 205
gtg gag cca ccg tcc gcg gtc acg gcg ctc atg gac gaa ggc ttc gcc
672Val Glu Pro Pro Ser Ala Val Thr Ala Leu Met Asp Glu Gly Phe Ala
210 215 220
gat gtg ccg ctc tgg agc tac tag
696Asp Val Pro Leu Trp Ser Tyr
225 230
2231PRTZea mays 2Met Cys Pro Thr Lys Lys Gly Met Thr Gly Glu Pro Ser Ser
Pro Cys 1 5 10 15
Ser Ser Ala Ser Ala Ser Thr Leu Pro Glu His His Gln Thr Val Trp
20 25 30 Thr Ser Pro Pro Lys
Arg Pro Ala Gly Arg Thr Lys Phe Arg Glu Thr 35
40 45 Arg His Pro Val Phe Arg Gly Val Arg
Arg Arg Gly Ser Ala Gly Arg 50 55
60 Trp Val Cys Glu Val Arg Val Pro Gly Arg Arg Gly Cys
Arg Leu Trp 65 70 75
80 Leu Gly Thr Phe Asp Thr Ala Glu Ala Ala Ala Arg Ala His Asp Ala
85 90 95 Ala Met Leu Ala
Leu Ala Gly Ala Gly Ala Cys Cys Leu Asn Phe Ala 100
105 110 Asp Ser Ala Trp Leu Leu Ala Val Pro
Ala Ser Cys Ala Ser Leu Ala 115 120
125 Glu Val Arg His Ala Val Ala Asp Ala Val Glu Asp Phe Leu
Arg His 130 135 140
Gln Val Val Pro Glu Asp Asp Ala Leu Ala Ala Thr Pro Ser Ser Pro 145
150 155 160 Ser Ser Glu Asp Gly
Ser Thr Ser Asp Gly Gly Glu Ser Ser Ser Asp 165
170 175 Ser Ser Pro Pro Thr Gly Ala Ser Pro Phe
Glu Leu Asp Val Phe Asn 180 185
190 Asp Met Ser Trp Asp Leu His Tyr Ala Ser Leu Ala Gln Gly Leu
Leu 195 200 205 Val
Glu Pro Pro Ser Ala Val Thr Ala Leu Met Asp Glu Gly Phe Ala 210
215 220 Asp Val Pro Leu Trp Ser
Tyr 225 230 3699DNAZea maysCDS(1)..(696)ZmCBF1 3atg
gag tac gcc gcc gtc ggc tac ggc tac ggg tac ggg tac gac gag 48Met
Glu Tyr Ala Ala Val Gly Tyr Gly Tyr Gly Tyr Gly Tyr Asp Glu 1
5 10 15 cgc cag
gag ccg gcg gag tcc gcg gac ggc ggc ggc ggc ggc gac gac 96Arg Gln
Glu Pro Ala Glu Ser Ala Asp Gly Gly Gly Gly Gly Asp Asp
20 25 30 gag tac gcg
acg gtg ctg tcg gcg cca ccc aag cgg ccg gcg ggg cgg 144Glu Tyr Ala
Thr Val Leu Ser Ala Pro Pro Lys Arg Pro Ala Gly Arg 35
40 45 acc aag ttc cgg
gag acg cgg cac ccc gtg tac cgc ggc gtg cgg cgg 192Thr Lys Phe Arg
Glu Thr Arg His Pro Val Tyr Arg Gly Val Arg Arg 50
55 60 cgc ggg ccc gcg ggg
cgc tgg gtg tgc gag gtc cgc gag ccc aac aag 240Arg Gly Pro Ala Gly
Arg Trp Val Cys Glu Val Arg Glu Pro Asn Lys 65
70 75 80 aag tcg cgc atc tgg
ctc ggc acc ttc gcc acc ccc gag gcc gcc gcg 288Lys Ser Arg Ile Trp
Leu Gly Thr Phe Ala Thr Pro Glu Ala Ala Ala 85
90 95 cgc gcg cac gac gtg gcc
gcg ctg gcc ctg cgg ggc cgc gcc gcg tgc 336Arg Ala His Asp Val Ala
Ala Leu Ala Leu Arg Gly Arg Ala Ala Cys 100
105 110 ctc aac ttc gcc gac tcg gcg
cgc ctg ctc cag gtc gac ccc gcc acg 384Leu Asn Phe Ala Asp Ser Ala
Arg Leu Leu Gln Val Asp Pro Ala Thr 115
120 125 ctc gcc acc ccc gac gac atc
cgc cgc gcc gcc atc cag ctc gcc gac 432Leu Ala Thr Pro Asp Asp Ile
Arg Arg Ala Ala Ile Gln Leu Ala Asp 130 135
140 gcc gcc tcg cag cag gat gag act
gcc gcc gtt gcc gct gac gtg gtc 480Ala Ala Ser Gln Gln Asp Glu Thr
Ala Ala Val Ala Ala Asp Val Val 145 150
155 160 gcg ccc tcg cag gcg gac gac gtc gcc
gcc gcc gcc gcc gcc gcg gcg 528Ala Pro Ser Gln Ala Asp Asp Val Ala
Ala Ala Ala Ala Ala Ala Ala 165
170 175 gcg atg tac ggc ggc ggc atg gag ttc
gac cac tcg tat tgc tac gac 576Ala Met Tyr Gly Gly Gly Met Glu Phe
Asp His Ser Tyr Cys Tyr Asp 180 185
190 gac ggg atg gtg agc ggg agc agc gac tgc
tgg caa agc ggc gcc ggc 624Asp Gly Met Val Ser Gly Ser Ser Asp Cys
Trp Gln Ser Gly Ala Gly 195 200
205 gcc ggt gga tgg cat agc atc gtg gac ggc gac
tac gac gac ggc gcc 672Ala Gly Gly Trp His Ser Ile Val Asp Gly Asp
Tyr Asp Asp Gly Ala 210 215
220 agc gac atg acg ctc tgg agc tac tga
699Ser Asp Met Thr Leu Trp Ser Tyr
225 230
4232PRTZea mays 4Met Glu Tyr Ala Ala Val Gly Tyr
Gly Tyr Gly Tyr Gly Tyr Asp Glu 1 5 10
15 Arg Gln Glu Pro Ala Glu Ser Ala Asp Gly Gly Gly Gly
Gly Asp Asp 20 25 30
Glu Tyr Ala Thr Val Leu Ser Ala Pro Pro Lys Arg Pro Ala Gly Arg
35 40 45 Thr Lys Phe Arg
Glu Thr Arg His Pro Val Tyr Arg Gly Val Arg Arg 50
55 60 Arg Gly Pro Ala Gly Arg Trp Val
Cys Glu Val Arg Glu Pro Asn Lys 65 70
75 80 Lys Ser Arg Ile Trp Leu Gly Thr Phe Ala Thr Pro
Glu Ala Ala Ala 85 90
95 Arg Ala His Asp Val Ala Ala Leu Ala Leu Arg Gly Arg Ala Ala Cys
100 105 110 Leu Asn Phe
Ala Asp Ser Ala Arg Leu Leu Gln Val Asp Pro Ala Thr 115
120 125 Leu Ala Thr Pro Asp Asp Ile Arg
Arg Ala Ala Ile Gln Leu Ala Asp 130 135
140 Ala Ala Ser Gln Gln Asp Glu Thr Ala Ala Val Ala Ala
Asp Val Val 145 150 155
160 Ala Pro Ser Gln Ala Asp Asp Val Ala Ala Ala Ala Ala Ala Ala Ala
165 170 175 Ala Met Tyr Gly
Gly Gly Met Glu Phe Asp His Ser Tyr Cys Tyr Asp 180
185 190 Asp Gly Met Val Ser Gly Ser Ser Asp
Cys Trp Gln Ser Gly Ala Gly 195 200
205 Ala Gly Gly Trp His Ser Ile Val Asp Gly Asp Tyr Asp Asp
Gly Ala 210 215 220
Ser Asp Met Thr Leu Trp Ser Tyr 225 230
5615DNAZea mayspromoter(1)..(615)rab17 5ctatagtatt ttaaaattgc attaacaaac
atgtcctaat tggtactcct gagatactat 60accctcctgt tttaaaatag ttggcattat
cgaattatca ttttactttt taatgttttc 120tcttctttta atatatttta tgaattttaa
tgtattttaa aatgttatgc agttcgctct 180ggacttttct gctgcgccta cacttgggtg
tactgggcct aaattcagcc tgaccgaccg 240cctgcattga ataatggatg agcaccggta
aaatccgcgt acccaacttt cgagaagaac 300cgagacgtgg cgggccgggc caccgacgca
cggcaccagc gactgcacac gtcccgccgg 360cgtacgtgta cgtgctgttc cctcactggc
cgcccaatcc actcatgcat gcccacgtac 420acccctgccg tggcgcgccc agatcctaat
cctttcgccg ttctgcactt ctgctgccta 480taaatggcgg catcgaccgt cacctgcttc
accaccggcg agccacatcg agaacacgat 540cgagcacaca agcacgaaga ctcgtttagg
agaaaccaca aaccaccaag ccgtgcaagc 600accatggacg ccgcc
61561625DNAArabidopsispromoter(1)..(1625)rd29a 6agcttggttg ctatggtagg
gactatgggg ttttcggatt ccggtggaag tgagtgggga 60ggcagtggcg gaggtaaggg
agttcaagat tctggaactg aagatttggg gttttgcttt 120tgaatgtttg cgtttttgta
tgatgcctct gtttgtgaac tttgatgtat tttatctttg 180tgtgaaaaag agattgggtt
aataaaatat ttgctttttt ggataagaaa ctcttttagc 240ggcccattaa taaaggttac
aaatgcaaaa tcatgttagc gtcagatatt taattattcg 300aagatgattg tgatagattt
aaaattatcc tagtcaaaaa gaaagagtag gttgagcaga 360aacagtgaca tctgttgttt
gtaccataca aattagttta gattattggt taacatgtta 420aatggctatg catgtgacat
ttagacctta tcggaattaa tttgtagaat tattaattaa 480gatgttgatt agttcaaaca
aaaattttat attaaaaaat gtaaacgaat attttgtatg 540ttcagtgaaa gtaaaacaaa
ttaaattaac aagaaactta tagaagaaaa tttttactat 600ttaagagaaa gaaaaaaatc
tatcatttaa tctgagtcct aaaaactgtt atacttaaca 660gttaacgcat gatttgatgg
aggagccata gatgcaattc aatcaaactg aaatttctgc 720aagaatctca aacacggaga
tctcaaagtt tgaaagaaaa tttatttctt cgactcaaaa 780caaacttacg aaatttaggt
agaacttata tacattatat tgtaattttt tgtaacaaaa 840tgtttttatt attattatag
aattttactg gttaaattaa aaatgaatag aaaaggtgaa 900ttaagaggag agaggaggta
aacattttct tctatttttt catattttca ggataaatta 960ttgtaaaagt ttacaagatt
tccatttgac tagtgtaaat gaggaatatt ctctagtaag 1020atcattattt catctacttc
ttttatcttc taccagtaga ggaataaaca atatttagct 1080cctttgtaaa tacaaattaa
ttttccttct tgacatcatt caattttaat tttacgtata 1140aaataaaaga tcatacctat
tagaacgatt aaggagaaat acaattcgaa tgagaaggat 1200gtgccgtttg ttataataaa
cagccacacg acgtaaacgt aaaatgacca catgatgggc 1260caatagacat ggaccgacta
ctaataatag taagttacat tttaggatgg aataaatatc 1320ataccgacat cagttttgaa
agaaaaggga aaaaaagaaa aaataaataa aagatatact 1380accgacatga gttccaaaaa
gcaaaaaaaa agatcaagcc gacacagaca cgcgtagaga 1440gcaaaatgac tttgacgtca
caccacgaaa acagacgctt catacgtgtc cctttatctc 1500tctcagtctc tctataaact
tagtgagacc ctcctctgtt ttactcacaa atatgcaaac 1560tagaaaacaa tcatcaggaa
taaagggttt gattacttct attggaaaga aaaaaatctt 1620tggac
162571824DNAZea
mayspromoter(1)..(1824)RIP2 7taatcattac ttaggtttta ttttaccaca tttttatttt
gttttcccct gttccttttc 60tcttcatttc cattcaatta atgggatgtt tgatacctta
cgattgcacc aacctgttca 120attgtacttc agatatcatc ttctttgatt gtctctcacc
tctgctttgc ttcagtacct 180gtattttttc ccatgaccct gaattctatt tgcccacatc
acaacacttg cttcttctcg 240aacaaataaa taaacaaact tcacagaacc gtagttttta
tttctatcca tacattgtca 300gtttgatgat ccagacgagg tagatgaaga gaaagaagtt
gagtatgaag aaatcgagga 360ggaggttgag tatgaagaga tagaggagga ttaagaaatt
gatggtgtgt gtgaatttga 420tgctaatgat gaaagtaaaa tggtcgatgt tgatgcgaat
gatgagaatg aaaaacggaa 480gcatgctgag cttcttgctc ttactcatgg agctgaagtt
tatgttgggg catatcttct 540aatgtatctt ctgaaaatct caaacaacta ttctgaagat
ctcaaacaac tatttgaatc 600tgttgggagc tgaagtttat gttgggcata tcttctgatg
tatcttctct actttagctt 660ttgcatttct attctctgca aatttagagt ccctttttct
gcaggttgta tatccttatt 720gtgtcgcatg ttttggccga tgctacccga attgggcaac
aatgatctca gaatgtcatg 780acacacattt gacattgtcc atctactatt gatcgtgcct
gcaagattga acagatcaag 840ctttgaaaga aggatgtcaa aaggcattgg tgattgaaca
aaggcagtca agagccattg 900aaagaaagtt gtatgttgag agcactaaga caacggtctt
acagtgtaca aaatatatca 960ctgaatagtt atatcttact tttttagcac ttgagcaatt
aaacttttag ttgttcattg 1020ttatagtcga tacccagata tcatacagtg tctaatatga
acatttaatt ttcatgtaat 1080cattatgctc taacattttt taaaaaataa tgtgctgttg
caacgcacgg gcatcgtact 1140agtaaagtat atatatatat atatatatat agacttttac
cattcaaaaa aatttgaggg 1200cctcaatttt ttgtttcgcc ccgggtccat gaaacctagg
gaccggccgt gtatatatat 1260ggtcttccct tcactaacta tatagagaca gatcacatcg
gaataaaaga aatttataga 1320ccaaatcgga aacctaaaaa ccaaaaaccg agcaattcgg
tctattcggt tttagttagc 1380aggttcaaaa tgtccggtcc tactaatact caacaatgat
taagaaccga tctgccatat 1440tttaaaaaat tatggaccgg aataacacat agtgaaaagt
ttaaggagcg aaaatatttt 1500tttttccttg gcaatttgga cggcacgcgg agactggcag
accgcatcct cgtgaagcac 1560gttgtccatg cctgaagaga gtattctgta ttcgcagtat
tcctgcattt aaaagtttgg 1620tgagcgaatc aataattggc ataaataatg ctaccgacgc
atcaccacat agtacgtacc 1680atagtcatcc ttatcctatc gaattaccta catgcccaac
cctcccacta catatatctg 1740caacgagcgc atcgccaatt cacaatgcca attgccagca
acccatccat actttcagct 1800gttgatacaa aaagagaaga gaga
182481099DNAZea mayspromoter(1)..(1099)RIP2
8gcccttgttt tggccgatgc tacccgaatt gggcaacaat gatctcagaa tgtcatgaca
60cacatttgac attgtccatc tactattgat cgtgcctgca agattgaaca gatcaagctt
120tgaaagaagg atgtcaaaag gcattggtga ttgaacaaag gcagtcaaga gccattgaaa
180gaaagttgta tgttgagagc actaagacaa cggtcttaca gtgtacaaaa tatatcactg
240aatagttata tcttactttt ttagcacttg agcaattaaa cttttagttg ttcattgtta
300tagtcgatac ccagatatca tacagtgtct aatatgaaca tttaattttc atgtaatcat
360tatgctctaa cattttttaa aaaataatgt gctgttgcaa cgcacgggca tcgtactagt
420aaagtatata tatatatata tatatataga cttttaccat tcaaaaaaat ttgagggcct
480caattttttg tttcgccccg ggtccatgaa acctagggac cggccgtgta tatatatggt
540cttcccttca ctaactatat agagacagat cacatcggaa taaaagaaat ttatagacca
600aatcggaaac ctaaaaacca aaaaccgagc aattcggtct attcggtttt agttagcagg
660ttcaaaatgt ccggtcctac taatactcaa caatgattaa gaaccgatct gccatatttt
720aaaaaattat ggaccggaat aacacatagt gaaaagttta aggagcgaaa atattttttt
780ttccttggca atttggacgg cacgcggaga ctggcagacc gcatcctcgt gaagcacgtt
840gtccatgcct gaagagagta ttctgtattc gcagtattcc tgcatttaaa agtttggtga
900gcgaatcaat aattggcata aataatgcta ccgacgcatc accacatagt acgtaccata
960gtcatcctta tcctatcgaa ttacctacat gcccaaccct cccactacat atatctgcaa
1020cgagcgcatc gccaattcac aatgccaatt gccagcaacc catccatact ttcagctgtt
1080gatacaaaaa gagaagaga
109991446DNAZea mayspromoter(1)..(1443)mLIP15 9aagcttggta ccgagctcgg
atccactagt aacggccgcc agtgtgctgg aattcgccct 60tctgggcaag ctgtcactag
gactggacaa aatactcgtg gctcgataac tcgctcgact 120cgtctcgtta gtagctcagc
tcgactcggc tcgttttaat tttgtagcga gccaagctag 180cattctagct cgattctcta
atgagccagc tcgggttagc tcgtgagcta gctcgcgagc 240caaacgagct aagccacaac
acaaatttgt ctagtcattg atgtcgtctc atctctcata 300gtcttgtttt ctcgtagtta
tgatctgtga tatggacatg tgtggatgtg ccatgtactt 360aaatatttat attattgcat
ggctacatgt ttgtagtgtt aaatacttaa aatataattt 420ttcggttata aatatattta
tgtacataga tatttatatt tagttgtgtg gctcacgagc 480ctaacgagct ggctcgagct
tcctaacgag ccgagccgag ccagctgttt agcccgttag 540tataacgagc cgagccgagc
tggctcgtta tagtaacgag tcataacgag ccgagccata 600acgagccaag ctggttcgat
atccacccct agctgtcacc gtcgcccagt ccgcttcgtt 660cggtcagcgg gccccgcctc
atctgcattc ttccattctc gtcctccgac ctcatctgca 720ttttcccagc caagtagtag
gtaaactagt ggcggtcccg tggccgtggc atcaggaaaa 780gaatatgccg tcccagccca
ccatcccccc accgtcccga aattcaagag ttaccttggg 840ttcaagttat aataggctgc
ccccggtaga cgttggaaac tttccccttc tcgggataaa 900agataaggag tgtgtgtcct
ttttttagga taagtccgtg ccccttctgt ttttcttaca 960ttcaggtctt cgcagctcct
ctattttttg ttgtttcttt ctttcgatct gcgagccgtg 1020caggtccagt actctccttt
ctgtgaagga actcttgcag ccggcccctc tggtttcgtc 1080gaattcttgt tccccggtcc
ctcctcctgt ccccgcgtag atccgtccgt ccgaggagca 1140caccgtcccc acccccatgt
ttacccacca gttcctctga cggccgccgt gctccgatga 1200agatgagcgt gctccgtatc
cgccgctccc actccttctc cgtcgccttc ctctactggt 1260tctacgtctt ctcatgaacg
catcgcccct ctccacctgc tgatccttcg ccgtctctct 1320ctctctctct ctctctctct
cttagatagt cttttgaatc catctntagg gctnttgttt 1380ctccccatcc tccccccacc
ccacccccca ccaaacagat tcaatccgac aagacaagca 1440tccatg
144610781DNASecale cereale
10catggacgcc gccgacgccg gctccccccg ttttgggcac aggacggtgt gctcggagcc
60gcccaagagg ccggcagggc ggaccaagtt taaggagacc cgccacccgc tgtaccgcgg
120cgtgcggcgg cggggtcggc tcgggcagtg ggtgtgcgag gtgcgcgtgc gcggcgcgca
180agggtacagg ctctggctcg gcacattcac caccgccgag atggcggcgc gcgcgcacga
240ctccgccgtg ctcgcgctcc tcgaccgcgc cgcttgcctc aacttcgccg actccgcctg
300gcggatgctg cccgtcctcg cggcaggctc gtcccgcttc agcagcgcgc gggaaatcaa
360ggacgccgtc gccgtcgccg tcgtggagtt ccagcggcag cgccccttcg tgtccacgtc
420ggagacggcc gacggcgaga aggacgtcca aggctcgccg aggccgagcg agctgtccac
480gtccagcgac ttgttggacg agcactggtt tagcggcatg gacgccggct cttactacgc
540gagcttggcg caggggatgc tcatggagcc gccggccgcc agagcgtgga gcgaggatgg
600cggcgaatac agcggcgtcc acacgccgct ttggaactag tactagtaca cttatccgac
660taattaagcc atgtacagtt ttagaaacta gactactagt ggttgtgttc ttccaaatat
720gggaagatac agagtaagca taaggagcaa ttttcccccg taaaaaaaaa aaaaaaaggg
780c
78111902DNAArabidopsis thalianaCDS(113)..(763) 11ctagaacaga aagagagaga
aactattatt tcagcaaacc ataccaacaa aaaagacaga 60gatcttttag ttaccttatc
cagtttcttg aaacagagta ctcttctgat ca atg aac 118
Met Asn
1 tca ttt tct gct ttt tct gaa
atg ttt ggc tcc gat tac gag tct tcg 166Ser Phe Ser Ala Phe Ser Glu
Met Phe Gly Ser Asp Tyr Glu Ser Ser 5
10 15 gtt tcc tca ggc ggt gat tat
att ccg acg ctt gcg agc agc tgc ccc 214Val Ser Ser Gly Gly Asp Tyr
Ile Pro Thr Leu Ala Ser Ser Cys Pro 20 25
30 aag aaa ccg gcg ggt cgt aag aag
ttt cgt gag act cgt cac cca ata 262Lys Lys Pro Ala Gly Arg Lys Lys
Phe Arg Glu Thr Arg His Pro Ile 35 40
45 50 tac aga gga gtt cgt cgg aga aac tcc
ggt aag tgg gtt tgt gag gtt 310Tyr Arg Gly Val Arg Arg Arg Asn Ser
Gly Lys Trp Val Cys Glu Val 55
60 65 aga gaa cca aac aag aaa aca agg att
tgg ctc gga aca ttt caa acc 358Arg Glu Pro Asn Lys Lys Thr Arg Ile
Trp Leu Gly Thr Phe Gln Thr 70 75
80 gct gag atg gca gct cga gct cac gac gtt
gcc gct tta gcc ctt cgt 406Ala Glu Met Ala Ala Arg Ala His Asp Val
Ala Ala Leu Ala Leu Arg 85 90
95 ggc cga tca gcc tgt ctc aat ttc gct gac tcg
gct tgg aga ctc cga 454Gly Arg Ser Ala Cys Leu Asn Phe Ala Asp Ser
Ala Trp Arg Leu Arg 100 105
110 atc ccg gaa tca act tgc gct aag gac atc caa
aag gcg gcg gct gaa 502Ile Pro Glu Ser Thr Cys Ala Lys Asp Ile Gln
Lys Ala Ala Ala Glu 115 120 125
130 gct gcg ttg gcg ttt cag gat gag atg tgt gat gcg
acg acg gat cat 550Ala Ala Leu Ala Phe Gln Asp Glu Met Cys Asp Ala
Thr Thr Asp His 135 140
145 ggc ttc gac atg gag gag acg ttg gtg gag gct att tac
acg gcg gaa 598Gly Phe Asp Met Glu Glu Thr Leu Val Glu Ala Ile Tyr
Thr Ala Glu 150 155
160 cag agc gaa aat gcg ttt tat atg cac gat gag gcg atg
ttt gag atg 646Gln Ser Glu Asn Ala Phe Tyr Met His Asp Glu Ala Met
Phe Glu Met 165 170 175
ccg agt ttg ttg gct aat atg gca gaa ggg atg ctt ttg ccg
ctt ccg 694Pro Ser Leu Leu Ala Asn Met Ala Glu Gly Met Leu Leu Pro
Leu Pro 180 185 190
tcc gta cag tgg aat cat aat cat gaa gtc gac ggc gat gat gac
gac 742Ser Val Gln Trp Asn His Asn His Glu Val Asp Gly Asp Asp Asp
Asp 195 200 205
210 gta tcg tta tgg agt tat taa aactcagatt attatttcca tttttagtac
793Val Ser Leu Trp Ser Tyr
215
gatacttttt attttattat tatttttaga tcctttttta gaatggaatc ttcattatgt
853ttgtaaaact gagaaacgag tgtaaattaa attgattcag tttcagtat
90212216PRTArabidopsis thaliana 12Met Asn Ser Phe Ser Ala Phe Ser Glu Met
Phe Gly Ser Asp Tyr Glu 1 5 10
15 Ser Ser Val Ser Ser Gly Gly Asp Tyr Ile Pro Thr Leu Ala Ser
Ser 20 25 30 Cys
Pro Lys Lys Pro Ala Gly Arg Lys Lys Phe Arg Glu Thr Arg His 35
40 45 Pro Ile Tyr Arg Gly Val
Arg Arg Arg Asn Ser Gly Lys Trp Val Cys 50 55
60 Glu Val Arg Glu Pro Asn Lys Lys Thr Arg Ile
Trp Leu Gly Thr Phe 65 70 75
80 Gln Thr Ala Glu Met Ala Ala Arg Ala His Asp Val Ala Ala Leu Ala
85 90 95 Leu Arg
Gly Arg Ser Ala Cys Leu Asn Phe Ala Asp Ser Ala Trp Arg 100
105 110 Leu Arg Ile Pro Glu Ser Thr
Cys Ala Lys Asp Ile Gln Lys Ala Ala 115 120
125 Ala Glu Ala Ala Leu Ala Phe Gln Asp Glu Met Cys
Asp Ala Thr Thr 130 135 140
Asp His Gly Phe Asp Met Glu Glu Thr Leu Val Glu Ala Ile Tyr Thr 145
150 155 160 Ala Glu Gln
Ser Glu Asn Ala Phe Tyr Met His Asp Glu Ala Met Phe 165
170 175 Glu Met Pro Ser Leu Leu Ala Asn
Met Ala Glu Gly Met Leu Leu Pro 180 185
190 Leu Pro Ser Val Gln Trp Asn His Asn His Glu Val Asp
Gly Asp Asp 195 200 205
Asp Asp Val Ser Leu Trp Ser Tyr 210 215
13245PRTZea maysPEPTIDE(1)..(245)ZmCBF3 13Met Asp Ala Asp Asp Ser Ser Tyr
Ala Ser Ser Ser Ser Phe Ser Pro 1 5 10
15 Pro Pro Ser Pro Ala Asp His Leu Arg Leu Pro Pro Lys
Arg Arg Ala 20 25 30
Gly Arg Lys Lys Phe Arg Glu Thr Arg His Pro Val Tyr Arg Gly Val
35 40 45 Arg Ala Arg Ala
Gly Gly Thr Arg Trp Val Cys Glu Val Arg Glu Pro 50
55 60 Gln Ala Gln Ala Arg Ile Trp Leu
Gly Thr Tyr Pro Thr Pro Glu Met 65 70
75 80 Ala Ala Arg Ala His Asp Val Ala Ala Ile Ala Leu
Arg Gly Ala Thr 85 90
95 Ala Ala Asp Leu Asn Phe Pro Asp Ser Ala His Ala Leu Pro Arg Ala
100 105 110 Arg Thr Ala
Ala Pro Asp Asp Ile Arg Arg Ala Ala Ala Gln Ala Ala 115
120 125 Glu Leu Tyr Arg Pro Ser Pro Ser
Ser Ser Ser Ala Ser Gly Leu Leu 130 135
140 Leu His His Gly Arg Arg Thr Ile Ala Ala Pro Pro Pro
Pro Leu Ser 145 150 155
160 Leu Pro Pro Pro Glu Ala Ser Ser Thr Cys Cys Trp Thr Thr Ser Thr
165 170 175 Gly Thr Arg Gly
Ala Gly Thr Thr Pro Thr Cys Cys Asp Gly Phe Leu 180
185 190 Asp Glu Asp Ser Ile Phe Asp Met Pro
Gly Leu Ile His Asp Met Ala 195 200
205 Trp Gly Met Leu Leu Thr Pro Pro Ala Met Gly Arg Gly Leu
Asp Trp 210 215 220
Gly Ala Leu Asp Asp Asp Asp Asp His Ser His Val Asp Cys Thr Leu 225
230 235 240 Trp Thr Leu Asp Gly
245 14222PRTZea maysPEPTIDE(1)..(222)ZmCBF4 14Met Pro Pro
Pro Pro Ser Ser Ser Ser Ser Pro Ser Gln Asp Ala Gly 1 5
10 15 Ser Pro Lys Arg Ala Ala Gly Arg
Asn Lys Phe Arg Glu Thr Arg His 20 25
30 Pro Val Phe Arg Gly Val Arg Arg Arg Gly Arg Ala Gly
Gly Arg Trp 35 40 45
Arg Trp Val Cys Glu Val Arg Val Pro Gly Arg Arg Gly Cys Arg Leu 50
55 60 Trp Leu Gly Thr
Phe Ala Ala Ala Glu Ala Ala Ala Arg Ala His Asp 65 70
75 80 Ala Ala Met Leu Ala Leu Arg Gly Gly
Ala Ala Arg Ala Arg Cys Leu 85 90
95 Asn Phe Pro Asp Ser Ala Trp Leu Leu Asp Val Pro Val Leu
Pro Leu 100 105 110
Pro His Gly Ala Ala Pro Trp Ala Asp Val Arg Arg Ala Val Ala Ile
115 120 125 Ala Val Glu Gly
Phe Phe Arg Ala Arg Pro Ala Ala Glu Asp Ala Met 130
135 140 Ser Ala Thr Ser Glu Pro Ser Ser
Ala Ala Thr Glu Ala Glu Ala Glu 145 150
155 160 Ala Ser Ser Ser Ser Gly Thr Asp Gly Gly Ala Pro
Glu Ala Ser Pro 165 170
175 Phe Glu Leu Asp Met Leu Ser Asp Met Gly Ala Gly Leu Tyr Tyr Ala
180 185 190 Cys Leu Ala
Gln Gly Leu Leu Val Glu Pro Pro Pro Leu Asp Ala Pro 195
200 205 Cys Pro Asp Asp Ser Asp Cys Gly
Leu Ala Leu Trp Ser Tyr 210 215 220
15264PRTZea maysPEPTIDE(1)..(264)ZmCBF5 15Met Asp Thr Ala Gly Leu
Val Gln His Ala Thr Ser Ser Ser Ser Thr 1 5
10 15 Ser Thr Ser Ala Ser Ser Ser Ser Ser Ser Ser
Glu Gln Gln Ser Lys 20 25
30 Ala Ala Trp Pro Pro Ser Pro Ala Ser Ser Pro Gln Gln Pro Pro
Lys 35 40 45 Lys
Arg Pro Ala Gly Arg Thr Lys Phe Arg Glu Thr Arg His Pro Val 50
55 60 Phe Arg Gly Val Arg Arg
Arg Gly Ala Ala Gly Arg Trp Val Cys Glu 65 70
75 80 Val Arg Val Pro Gly Arg Arg Gly Ala Arg Leu
Trp Leu Gly Thr Tyr 85 90
95 Leu Ala Ala Glu Ala Ala Ala Arg Ala His Asp Ala Ala Met Leu Ala
100 105 110 Leu Gln
Gly Arg Gly Ala Gly Arg Leu Asn Phe Pro Asp Ser Ala Arg 115
120 125 Leu Leu Ala Val Pro Pro Pro
Ser Ala Leu Pro Gly Leu Asp Asp Ala 130 135
140 Arg Arg Ala Ala Leu Glu Ala Val Ala Glu Phe Gln
Arg Arg Ser Gly 145 150 155
160 Ala Ala Asp Glu Ala Thr Ser Gly Ala Ser Pro Pro Ser Ser Ser Pro
165 170 175 Ser Leu Pro
Asp Val Ser Ala Ala Gly Ser Pro Ala Ala Ala Leu Glu 180
185 190 His Val Pro Val Lys Ala Asp Glu
Ala Val Ala Leu Asp Leu Asp Gly 195 200
205 Asp Val Phe Glu Pro Asp Trp Phe Gly Asp Met Asp Leu
Glu Leu Asp 210 215 220
Ala Tyr Tyr Ala Ser Leu Ala Glu Gly Leu Leu Val Glu Pro Pro Pro 225
230 235 240 Pro Ala Ala Ala
Trp Asp His Glu Asp Cys Cys Asp Ser Gly Ala Ala 245
250 255 Asp Val Ala Leu Trp Ser Tyr Tyr
260 16231PRTZea maysPEPTIDE(1)..(231)ZmCBF6 16Met
Asp Ala Ala Gly Ser Phe Ser Asp Tyr Ser Ser Gly Thr Pro Ser 1
5 10 15 Pro Val Ala Gly Gly Gly
Gly Gly Asp Asp Phe Gly Ser Ser Ser Tyr 20
25 30 Met Thr Val Ser Ser Ala Pro Pro Lys Arg
Arg Ala Gly Arg Thr Lys 35 40
45 Phe Lys Glu Thr Arg His Pro Val Tyr Lys Gly Val Arg Arg
Arg Asn 50 55 60
Pro Gly Arg Trp Val Cys Glu Val Arg Glu Pro His Gly Lys Gln Arg 65
70 75 80 Ile Trp Leu Gly Thr
Phe Glu Thr Ala Glu Met Ala Ala Arg Ala His 85
90 95 Asp Val Ala Ala Leu Ala Leu Arg Gly Arg
Ala Ala Cys Leu Asn Phe 100 105
110 Ala Asp Ser Pro Arg Leu Leu Arg Val Pro Pro Thr Gly Ser Gly
His 115 120 125 Asp
Glu Ile Arg Arg Ala Ala Ala Val Ala Ala Asp Gln Phe Arg Pro 130
135 140 Ala Pro Asp Gln Gly Asn
Val Ala Ala Glu Glu Glu Ala Ala Asp Thr 145 150
155 160 Pro Pro Pro Asp Ala Leu Pro Ser Val Thr Met
Gln Ser Val Asp Asp 165 170
175 Asp Pro Tyr Cys Ile Ile Asp Asp Arg Leu Asp Phe Gly Met Gln Gly
180 185 190 Tyr Leu
Asp Met Ala Gln Gly Met Leu Ile Asp Pro Pro Pro Met Ala 195
200 205 Gly Ser Ser Thr Ser Gly Gly
Gly Gly Asp Asp Asp Asp Asp Asp Gly 210 215
220 Glu Val Lys Leu Trp Ser Tyr 225
230 17246PRTZea maysPEPTIDE(1)..(246)ZmCBF7 17Met Asp Met Gly Arg His
Gln Leu Gln Leu Gln His Ala Ala Ser Ser 1 5
10 15 Ser Ser Thr Ser Ala Ser Ser Ser Ser Glu Gln
Asp Lys Pro Leu Cys 20 25
30 Cys Ser Gly Pro Lys Lys Arg Pro Ala Gly Arg Thr Lys Phe Arg
Glu 35 40 45 Thr
Arg His Pro Val Phe Arg Gly Val Arg Arg Arg Gly Ala Ala Gly 50
55 60 Arg Trp Val Cys Glu Val
Arg Val Pro Gly Arg Arg Gly Ala Arg Leu 65 70
75 80 Trp Leu Gly Thr Tyr Leu Gly Ala Glu Ala Ala
Ala Arg Ala His Asp 85 90
95 Ala Ala Met Leu Ala Leu Gly Arg Gly Ala Ala Cys Leu Asn Phe Pro
100 105 110 Asp Ser
Ala Trp Leu Leu Ala Val Pro Pro Pro Pro Ala Leu Ser Gly 115
120 125 Gly Leu Asp Gly Ala Arg Arg
Ala Ala Leu Glu Ala Val Ala Glu Phe 130 135
140 Gln Arg Arg Arg Phe Gly Ala Ala Ala Ala Asp Glu
Ala Thr Ser Gly 145 150 155
160 Thr Ser Pro Pro Ser Ser Ser Ser Ser Ala Thr Lys Pro Ala Pro Ala
165 170 175 Ile Glu Arg
Val Pro Val Glu Ala Ser Glu Thr Val Ala Leu Asp Gly 180
185 190 Ala Val Phe Glu Pro Asp Trp Phe
Gly Asp Met Asp Leu Asp Leu Tyr 195 200
205 Tyr Ala Ser Leu Ala Glu Gly Leu Leu Val Glu Pro Pro
Pro Pro Pro 210 215 220
Pro Pro Ala Ala Trp Asp His Gly Asp Cys Cys Asp Ser Gly Ala Asp 225
230 235 240 Val Ala Leu Trp
Ser Tyr 245 18238PRTZea maysPEPTIDE(1)..(238)ZmCBF8
18Met Cys Pro Thr Lys Lys Glu Met Ser Ala Glu Ser Ser Gly Ser Ala 1
5 10 15 Ser Ser Trp Thr
Ser Ala Ser Ala Ser Ala Ser Thr Ser Thr Ser Pro 20
25 30 Glu His Gln Thr Val Trp Thr Ser Pro
Pro Lys Arg Pro Ala Gly Arg 35 40
45 Thr Lys Phe Arg Glu Thr Arg His Pro Val Phe Arg Gly Val
Arg Arg 50 55 60
Arg Gly Ser Ala Gly Arg Trp Val Cys Glu Val Arg Val Pro Gly Arg 65
70 75 80 Arg Gly Cys Arg Leu
Trp Leu Gly Thr Phe Asp Ala Ala Glu Ala Ala 85
90 95 Ala Arg Ala His Asp Ala Ala Met Leu Ala
Ile Ala Gly Ala Ser Ala 100 105
110 Cys Leu Asn Phe Ala Asp Ser Ala Trp Leu Leu Ala Val Pro Ala
Ser 115 120 125 Tyr
Ala Ser Leu Ala Asp Val Arg Arg Ala Val Ala Glu Ala Val Glu 130
135 140 Asp Phe Gln Arg Arg Glu
Ala Ala Ala Gly Asp Asp Ala Arg Ser Ala 145 150
155 160 Thr Ser Pro Thr Pro Ser Thr Ser Gly Thr Asp
Asp Asp Ala Ala Xaa 165 170
175 Asp Gly Glu Glu Ser Ser Pro Ala Thr Glu Val Ser Ser Phe Gln Leu
180 185 190 Asp Val
Phe Asp Asn Met Ser Trp Asp Leu Tyr Tyr Ala Ser Met Ala 195
200 205 Gln Gly Met Leu Met Glu Leu
Pro Ser Ala Val Pro Ala Phe Gly Asp 210 215
220 Asp Gly Tyr Thr Asn Val Ala Asp Val Pro Leu Trp
Ser Tyr 225 230 235
19227PRTZea maysPEPTIDE(1)..(227)ZmCBF9 19Met Asp Ser Thr Asp Pro Pro Pro
Ala Ser Pro Ser Ser Pro Gly Pro 1 5 10
15 Pro Gly Gly Gln Pro Ser Pro Pro Lys Arg Pro Ala Gly
Arg Thr Lys 20 25 30
Phe Gln Glu Thr Arg His Pro Val Phe Arg Gly Val Arg Arg Arg Gly
35 40 45 Arg Ala Gly Arg
Trp Val Cys Glu Val Arg Val Pro Gly Ser Arg Gly 50
55 60 Asp Arg Leu Trp Val Gly Thr Phe
Asp Thr Ala Glu Ala Ala Ala Arg 65 70
75 80 Ala His Asp Ala Ala Met Leu Ala Leu Cys Gly Ala
Ala Ala Ser Leu 85 90
95 Asn Phe Ala Asp Ser Ala Trp Leu Leu His Val Pro Arg Ala Pro Ala
100 105 110 Gly Leu Pro
Gly Val Gln Arg Ala Ala Thr Asp Ala Val Ala Ala Phe 115
120 125 Leu Arg Thr Gln Gln Pro Arg Gly
Gly Gly Asp Ala Pro Ala Ala Ala 130 135
140 Ser Gln Gly Gln Arg Ala Asn Ala Ala Thr Glu Thr Glu
Leu Asp His 145 150 155
160 Ala Gly Ala Ser Ala Ala Ala Val Asp Gly Gly Gly Gly Ser Val Val
165 170 175 Glu Val Asp Val
Phe Gly Gly Met Asp Asp Ala Gly Ser Tyr Tyr Ala 180
185 190 Ser Leu Ala Gln Gly Leu Leu Ile Asp
Pro Pro Pro Pro Ala Val Glu 195 200
205 Cys Pro Glu Glu Glu Asp Asp Asp Cys Gly Gly Ala Gly Glu
Met Glu 210 215 220
Leu Trp Asp 225 20230PRTZea maysPEPTIDE(1)..(230)ZmCBF10 20Met
Cys Pro Ile Lys Arg Glu Thr Thr Ser Ala Glu Ser Gly Ser Pro 1
5 10 15 Cys Ser Ser Ser Thr Ser
Thr Ser Ser Ser Ser Glu His His Gln Thr 20
25 30 Ala Trp Ala Ser Pro Pro Lys Lys Arg Pro
Ala Gly Arg Thr Lys Phe 35 40
45 Arg Glu Thr Arg His Pro Val Phe Arg Gly Val Arg Arg Arg
Gly Arg 50 55 60
Ala Gly Arg Trp Val Cys Glu Val Arg Val Pro Gly Arg Arg Gly Cys 65
70 75 80 Arg Leu Trp Leu Gly
Thr Phe Asp Thr Ala Glu Ala Ala Ala Arg Ala 85
90 95 His Asp Ala Ala Met Leu Ala Val Ala Ala
Gly Ala Ala Arg Leu Asn 100 105
110 Phe Ala Asp Ser Ala Trp Leu Leu Ala Val Pro Thr Ala Ser Tyr
Ala 115 120 125 Ser
Leu Ala Asp Val Arg Arg Ala Val Ala Glu Ala Val Glu Ser Phe 130
135 140 Leu Arg Arg Arg Glu Glu
Glu Glu Gly Asp Ala Leu Ser Ala Ala Ser 145 150
155 160 Ser Thr Ser Pro Asn Asp Lys Asp Gly Asp Glu
Ser Ser Ser Ala Thr 165 170
175 Thr Thr Asp Asp Asp Ser Pro Phe Glu Leu Asp Met Phe Gly Gly Met
180 185 190 Ser Trp
Asp Leu Tyr Tyr Ala Asn Leu Ala Gln Ala Met Leu Val Glu 195
200 205 Pro Pro Pro Ile Val Pro Ala
Leu Cys Asp Asp Gly Val Ala Ser Glu 210 215
220 Leu Pro Leu Trp Ser Tyr 225 230
21222PRTZea maysPEPTIDE(1)..(222)ZmCBF11 21Met Asp Trp Ala Tyr Tyr Gly
Ser Gly Tyr Ser Thr Pro Ala Ser Gly 1 5
10 15 Gly Gly Gly Gly Asp Glu Asp Ala Tyr Met Thr
Val Ser Ser Ala Pro 20 25
30 Pro Lys Arg Arg Ala Gly Arg Thr Lys Phe Lys Glu Thr Arg His
Pro 35 40 45 Val
Tyr Lys Gly Val Arg Ser Arg Asn Pro Gly Arg Trp Val Cys Glu 50
55 60 Val Arg Glu Pro His Gly
Arg Gln Arg Ile Trp Leu Gly Thr Phe Glu 65 70
75 80 Thr Ala Glu Met Ala Ala Arg Ala His Asp Val
Ala Ala Leu Ala Leu 85 90
95 Arg Gly Arg Ala Ala Cys Leu Asn Phe Ala Asp Ser Pro Arg Arg Leu
100 105 110 Arg Val
Pro Ala Gln Gly Ala Gly His Asp Glu Ile Arg Arg Ala Ala 115
120 125 Val Glu Ala Ala Glu Leu Phe
Arg Pro Gln Pro Gly Glu Arg Asn Val 130 135
140 Gly Gly Ser Glu Ala Ala Ala Glu Ala Ala Ala Ala
Ala Pro Cys Ala 145 150 155
160 Met Gly Ser Gly Asp Leu Gly Gly Gly Glu Phe Pro Tyr Tyr Pro Val
165 170 175 Asp Asp Gly
Leu Glu Phe Glu Met Arg Gly Tyr Leu Asp Met Ala Gln 180
185 190 Gly Met Leu Ile Asp Pro Pro Gln
Pro Ala Ala Gly Gln Ser Ala Trp 195 200
205 Ile Glu Asp Glu Tyr Glu Cys Glu Val Ser Leu Trp Ser
Tyr 210 215 220 22340PRTZea
maysPEPTIDE(1)..(340)ZmCBF12 22Met Ala Ala Ala Ile Asp Met Tyr Lys Tyr
Tyr Asn Thr Ser Ala His 1 5 10
15 Gln Ile Ala Ser Ser Ser Ser Pro Ser Asp Gln Glu Leu Ala Lys
Ala 20 25 30 Leu
Glu Pro Phe Ile Thr Ser Ala Ser Ser Ser Ser Ser Ser Ser Pro 35
40 45 Tyr His Gly Tyr Ser Ser
Ser Pro Ser Met Ser Gln Asp Ser Tyr Met 50 55
60 Pro Thr Pro Ser Tyr Thr Ser Tyr Ala Thr Ser
Pro Leu Pro Thr Pro 65 70 75
80 Ala Ala Ala Ser Ser Gln Leu Pro Pro Leu Tyr Ser Ser Pro Tyr Ala
85 90 95 Ala Pro
Cys Met Thr Gly Gln Met Gly Leu Asn Gln Leu Gly Pro Ala 100
105 110 Gln Ile Gln Gln Ile Gln Ala
Gln Phe Met Phe Gln Gln Gln Gln Gln 115 120
125 Gln Arg Gly Leu His Ala Ala Phe Leu Gly Pro Arg
Ala Gln Pro Met 130 135 140
Lys Gln Ser Gly Ser Pro Pro Pro Leu Ala Pro Ala Pro Ala Gln Ser 145
150 155 160 Lys Leu Tyr
Arg Gly Val Arg Gln Arg His Trp Gly Lys Trp Val Ala 165
170 175 Glu Ile Arg Leu Pro Lys Asn Arg
Thr Arg Leu Trp Leu Gly Thr Phe 180 185
190 Asp Thr Ala Glu Asp Ala Ala Leu Ala Tyr Asp Lys Ala
Ala Phe Arg 195 200 205
Leu Arg Gly Asp Thr Ala Arg Leu Asn Phe Pro Ala Leu Arg Arg Gly 210
215 220 Gly Ala His Leu
Ala Gly Pro Leu His Ala Ser Val Asp Ala Lys Leu 225 230
235 240 Thr Ala Ile Cys Gln Ser Leu Ser Glu
Ser Lys Ser Lys Ser Gly Ser 245 250
255 Ser Gly Asp Glu Ser Ala Ala Ser Pro Pro Asp Ser Pro Lys
Cys Ser 260 265 270
Ala Ser Thr Thr Glu Gly Glu Gly Glu Glu Glu Ser Gly Ser Ala Gly
275 280 285 Ser Pro Pro Pro
Pro Pro Pro Pro Thr Leu Ala Pro Pro Val Pro Val 290
295 300 Pro Glu Met Ala Lys Leu Asp Phe
Thr Glu Ala Pro Trp Asp Glu Thr 305 310
315 320 Glu Ala Phe His Leu Arg Lys Tyr Pro Ser Trp Glu
Ile Asp Trp Asp 325 330
335 Ser Ile Leu Ser 340 23279PRTZea
maysPEPTIDE(1)..(279)ZmCBF13 23Met Ala Ala Ala Ile Asn Leu Pro Gly Pro
Ser Glu Asp Leu Met Arg 1 5 10
15 Ala Met Glu Ser Phe Met Gln Asp Asp Ala Pro Ser Pro Leu Ala
Met 20 25 30 Pro
Pro Ala Pro Ser Phe Pro Ala Ala Ala Ala His Gly Ala Gln Tyr 35
40 45 Pro Ala Thr His Leu Ser
Pro Ala Gln Met Gln Phe Ile Gln Ala Gln 50 55
60 Leu His Leu Gln Arg Asn Pro Gly Leu Gly Pro
Arg Ala Gln Pro Met 65 70 75
80 Lys Pro Ala Val Pro Val Pro Pro Ala Pro Ala Pro Gln Arg Pro Val
85 90 95 Lys Leu
Tyr Arg Gly Val Arg Gln Arg His Trp Gly Lys Trp Val Ala 100
105 110 Glu Ile Arg Leu Pro Arg Asn
Arg Thr Arg Leu Trp Leu Gly Thr Phe 115 120
125 Asp Thr Ala Glu Gln Ala Ala Leu Ala Tyr Asp Gln
Ala Ala Tyr Arg 130 135 140
Leu Arg Gly Asp Ala Ala Arg Leu Asn Phe Pro Asp Asn Ala Glu Ser 145
150 155 160 Arg Ala Pro
Leu Asp Pro Ala Val Asp Ala Lys Leu Gln Ala Ile Cys 165
170 175 Ala Thr Ile Ala Ala Ala Ser Ser
Ser Ser Lys Asn Ser Lys Ala Lys 180 185
190 Ser Lys Ala Met Pro Ile Asn Ala Ser Val Leu Glu Ala
Ala Ala Ala 195 200 205
Ser Pro Ser Asn Ser Ser Ser Asp Glu Gly Ser Gly Ser Gly Phe Gly 210
215 220 Ser Asp Asp Glu
Met Ser Ser Ser Ser Pro Thr Pro Val Val Ala Pro 225 230
235 240 Pro Val Ala Asp Met Gly Gln Leu Asp
Phe Ser Glu Val Pro Trp Asp 245 250
255 Glu Asp Glu Ser Phe Val Leu Arg Lys Tyr Pro Ser Tyr Glu
Ile Asp 260 265 270
Trp Asp Ala Leu Leu Ser Asn 275 24297PRTZea
maysPEPTIDE(1)..(297)ZmCBF14 24Met Ala Ala Thr Ile Asp Leu Ser Gly Glu
Glu Leu Met Arg Ala Leu 1 5 10
15 Glu Pro Phe Ile Arg Asp Ala Ser Ala His Gly Ser Ser Pro Leu
Leu 20 25 30 His
Pro His Gln Pro Leu Ser Pro Ser Ser Pro Phe Ser Phe His Gln 35
40 45 Ala Val Ala Ala Ala Ser
Ser Tyr Gly Gly Asn Tyr Pro Phe Ala Ala 50 55
60 Ala Asp Glu Ala Gly Gln Leu Ser Pro Ser Gln
Met Gln Tyr Ile Gln 65 70 75
80 Ala Arg Leu His Leu Gln Arg Arg Gln Ala Gln Thr Ser Val Leu Gly
85 90 95 Pro Arg
Ala Gln Pro Met Lys Ala Ser Ala Ser Ala Ala Pro Ala Pro 100
105 110 Ala Arg Pro Gln Lys Leu Tyr
Arg Gly Val Arg Gln Arg His Trp Gly 115 120
125 Lys Trp Val Ala Glu Ile Arg Leu Pro Arg Asn Arg
Thr Arg Leu Trp 130 135 140
Leu Gly Thr Phe Asp Thr Ala Glu Glu Ala Ala Leu Ala Tyr Asp Gln 145
150 155 160 Ala Ala Tyr
Arg Leu Arg Gly Asp Ala Ala Arg Leu Asn Phe Pro Asp 165
170 175 Asn Ala Ala Ser Arg Gly Pro Leu
His Ala Ser Val Asp Ala Lys Leu 180 185
190 Gln Ser Leu Cys Gln Ser Ile Ala Ala Ser Lys Lys Gly
Ala Lys Lys 195 200 205
Pro Ala Ser Ala Ala Ala Ala Ala Ser Ser Ser Ala Pro Thr Ser Asn 210
215 220 Cys Cys Ser Ser
Pro Ser Ser Asp Asp Ala Thr Ser Ser Cys Leu Glu 225 230
235 240 Ser Ala Thr Glu Ser Ser Cys Pro Ser
Pro Ser Pro Ser Ala Ser Pro 245 250
255 Gly Pro Thr Val Pro Glu Met Gln Gln Leu Asp Phe Ser Glu
Ala Pro 260 265 270
Trp Asp Glu Ala Ala Gly Phe Ala Leu Thr Lys Tyr Pro Ser Tyr Glu
275 280 285 Ile Asp Trp Asp
Ser Leu Leu Ala Asn 290 295 25244PRTZea
maysPEPTIDE(1)..(244)ZmCBF15 25Met Ala Met Asp Ser Ser Ser Ser Gly Ser
Glu Pro Thr Ser Ser Ser 1 5 10
15 Ser Ala Glu Ala Pro Ala Ser Pro Thr Ala Ser Ser Ser Glu Ser
Ser 20 25 30 Ala
Ala Gly Ser Lys Lys Arg Arg Arg Ser Lys Asp Gly His His Pro 35
40 45 Thr Tyr Arg Gly Val Arg
Met Arg Ala Trp Gly Lys Trp Val Ser Glu 50 55
60 Ile Arg Glu Pro Arg Lys Lys Ser Arg Ile Trp
Leu Gly Thr Phe Pro 65 70 75
80 Thr Ala Glu Met Ala Ala Arg Ala His Asp Ala Ala Ala Leu Ala Ile
85 90 95 Lys Gly
Arg Ala Thr Gln Leu Asn Phe Pro Val Leu Ala Gly Val Leu 100
105 110 Pro Arg Ala Ala Ser Ala Ala
Pro Lys Asp Val Gln Ala Ala Ala Met 115 120
125 Leu Ala Ala Ala Phe Thr Ser Pro Ser Ser Ala Pro
Ser Glu Leu Asp 130 135 140
Ala Gly Ala Pro Ala Pro Arg Glu Val Pro Ala Thr Lys Asn Gly Ser 145
150 155 160 Pro Ser Glu
Asp Glu Ala Gly Ala Glu Ala Pro Val Pro Pro Ala Ala 165
170 175 Ala Ser Gln Pro Gly Thr Pro Ser
Ser Gly Val Asp Glu Glu Arg Gln 180 185
190 Leu Phe Asp Leu Pro Asp Leu Leu Leu Asp Val Arg Asp
Gly Phe Gly 195 200 205
Ala Leu Pro Ala Asp Val Gly Pro Val Pro Arg Val Gly Gly Gln Cys 210
215 220 Gly Gly Gly Ala
Ala Ala Leu Gly Ile Ala Val Leu Gln Leu Gln Arg 225 230
235 240 Gly Ser Lys Gln 26281PRTZea
maysPEPTIDE(1)..(281)ZmCBF16 26Met Ala Gln Glu Leu His Glu Thr Ser Ser
Cys Ser Ala Thr Thr Thr 1 5 10
15 Ser Ser Cys Thr Thr Ser Cys Cys Ser Ser Thr Val Thr Asp Ser
Ser 20 25 30 Ser
Ser Pro Pro Ser Pro Ala Ala Ala Asn Ala Ala Pro Ala Thr Arg 35
40 45 Lys Arg Gln Ala Leu Glu
Ala Glu Ala Glu Ala Glu Ala Gly Gly Glu 50 55
60 Glu Glu Glu Glu Glu Glu Glu Gly Cys Ala Gly
Asn Lys Ala Ala Pro 65 70 75
80 Ala Lys Lys Arg Pro Arg Gly Ser Glu Gly Lys His Pro Thr Phe Arg
85 90 95 Gly Val
Arg Met Arg Ala Trp Gly Lys Trp Val Ser Glu Ile Arg Glu 100
105 110 Pro Arg Lys Lys Ser Arg Ile
Trp Leu Gly Thr Phe Pro Thr Ala Glu 115 120
125 Met Ala Ala Arg Ala His Asp Val Ala Ala Leu Ala
Ile Lys Gly Arg 130 135 140
Ala Ala His Leu Asn Phe Pro Asp Leu Ala Gly Ala Leu Pro Arg Ala 145
150 155 160 Ala Ser Ala
Ala Pro Lys Asp Val Gln Ala Ala Ala Ala Leu Ala Ala 165
170 175 Ala Phe Thr Ser Pro Ser Ser Glu
Pro Gly Ala Gly Ala His Glu Glu 180 185
190 Pro Ala Ala Lys Asp Gly Ala Ala Pro Glu Glu Ala Ala
Ala Asp Ala 195 200 205
Gln Ala Pro Val Pro Val Ala Leu Pro Pro Pro Ala Ala Ser Arg Pro 210
215 220 Gly Thr Pro Ser
Ser Gly Val Glu Asp Glu Arg Gln Leu Phe Asp Leu 225 230
235 240 Pro Asp Leu Leu Leu Asp Ile Arg Asp
Gly Phe Gly Arg Phe Pro Pro 245 250
255 Met Trp Ala Pro Leu Thr Asp Val Glu Asp Val Val Asn Ala
Glu Leu 260 265 270
Arg Leu Glu Glu Pro Leu Leu Trp Glu 275 280
27262PRTZea maysPEPTIDE(1)..(262)ZmCBF17 27Met Val Lys Thr Ala Ala Ser
Ser Ser Ser Asp Asp Ala Ala Ala Ala 1 5
10 15 Lys Arg Arg Thr Tyr Lys Gly Val Arg Met Arg
Ser Trp Gly Ser Trp 20 25
30 Val Ser Glu Val Arg Ala Pro Gly Gln Lys Thr Arg Ile Trp Leu
Gly 35 40 45 Ser
His Ala Thr Ala Glu Ala Ala Ala Arg Ala His Asp Ala Ala Leu 50
55 60 Leu Cys Leu Arg Gly Ser
Ala Ala Asp Leu Asn Phe Pro Leu Arg Leu 65 70
75 80 Pro Phe Asp Leu Pro Pro Ala Ala Thr Met Ser
Pro Lys Ala Ile Gln 85 90
95 Arg Val Ala Ala Ala Ala Ala Ala Ala Ala Ser Gly Ser Ala Ser Ser
100 105 110 Cys Gly
Gly Leu Arg Ala Pro Pro Phe Ala Pro Pro Asp Glu Asn Ser 115
120 125 Gly Thr Ser Ala Cys Ser Asp
Gly Asp Ala Thr Pro Ala Ser Ser Thr 130 135
140 Thr Ser Ser Pro Thr Ser Asp Ser Ala Pro Ala Trp
Ser Thr Thr Ser 145 150 155
160 His Ala Asp Asp Val Ser Phe Pro Gly Val His Arg Gln Gln Arg Arg
165 170 175 Arg Arg Arg
Leu Leu Arg Arg Ala Arg Gly His Arg Val Leu Leu Pro 180
185 190 Val Ala Gln Val His Gly Val Arg
Tyr Asp Gly Pro Met Gln His Val 195 200
205 Leu Arg Ala Ser Ala His Gly Asp Gly Arg Gly Gln Arg
Val Gly Val 210 215 220
Gly Gly Gly Arg Arg Asp Arg Pro Leu Glu Leu Leu Val Pro Gln Leu 225
230 235 240 Ile Ala Arg His
Gln Phe Gly Gly Gln Ala Gly Leu Lys Arg Ser Pro 245
250 255 Phe Val Ser Asn Gln Arg
260 28250PRTZea maysPEPTIDE(1)..(250)ZmCBF18 28Met Lys Gly Lys
Gly Gly Pro Asp Asn Thr Gln Cys Gly Tyr Arg Gly 1 5
10 15 Val Arg Gln Arg Thr Trp Gly Lys Trp
Val Ala Glu Ile Arg Glu Pro 20 25
30 Asn Arg Val Asp Arg Leu Trp Leu Gly Thr Phe Pro Thr Ala
Glu Asp 35 40 45
Ala Ala Arg Ala Tyr Asp Glu Ala Ala Arg Ala Met Tyr Gly Asp Leu 50
55 60 Ala Arg Thr Asn Phe
Pro Gly Gln Asp Ala Thr Thr Ser Ala Gln Ala 65 70
75 80 Ala Leu Ala Ser Thr Ser Ala Gln Ala Asp
Pro Thr Ala Val Glu Ala 85 90
95 Leu Gln Thr Gly Thr Ser Cys Glu Ser Thr Thr Thr Ser Asn Tyr
Ser 100 105 110 Asp
Ile Ala Ser Thr Ser His Lys Pro Glu Pro Glu Ala Ser Asp Ile 115
120 125 Ser Ser Ser Leu Lys Ala
Lys Cys Pro Ala Gly Ser Cys Gly Ile Gln 130 135
140 Asp Gly Thr Pro Ser Val Ala Asp Lys Glu Val
Phe Gly Pro Leu Glu 145 150 155
160 Pro Ile Thr Asn Leu Pro Asp Gly Gly Asp Gly Phe Asp Ile Gly Glu
165 170 175 Met Leu
Arg Met Met Glu Ser Asp Pro His Asn Ala Gly Gly Ala Asp 180
185 190 Ala Gly Met Gly Gln Pro Trp
Tyr Leu Asp Glu Leu Asp Ser Ser Val 195 200
205 Leu Glu Ser Met Leu Gln Pro Glu Pro Glu Pro Glu
Pro Glu Pro Phe 210 215 220
Leu Met Ser Glu Glu Pro Asp Met Phe Leu Ala Gly Phe Glu Ser Ala 225
230 235 240 Gly Phe Val
Glu Gly Leu Glu Arg Leu Asn 245 250
29316PRTZea maysPEPTIDE(1)..(316)ZmCBF19 29Met Ala Ala Ala Ile Asp Met
Tyr Lys Tyr Cys Asn Thr Ser Ala His 1 5
10 15 Leu Ile Ala Ser Ser Ser Pro Ser Asp Gln Glu
Leu Ala Lys Ala Leu 20 25
30 Glu Pro Phe Ile Thr Ser Ala Ser Ser Pro Tyr His Arg Tyr Ser
Leu 35 40 45 Ala
Pro Asp Ser Tyr Met Pro Thr Pro Ser Ser Tyr Thr Thr Ser Pro 50
55 60 Leu Pro Thr Pro Thr Ser
Ser Pro Phe Ser Gln Leu Pro Pro Leu Tyr 65 70
75 80 Ser Ser Pro Tyr Ala Ala Ser Thr Ala Ser Gly
Val Ala Gly Pro Met 85 90
95 Gly Leu Asn Gln Leu Gly Pro Ala Gln Ile Gln Gln Ile Gln Ala Gln
100 105 110 Leu Met
Phe Gln His Gln Gln Gln Arg Gly Leu His Ala Ala Phe Leu 115
120 125 Gly Pro Arg Ala Gln Pro Met
Lys Gln Ser Gly Ser Pro Pro Ala Gln 130 135
140 Ser Lys Leu Tyr Arg Gly Val Arg Gln Arg His Trp
Gly Lys Trp Val 145 150 155
160 Ala Glu Ile Arg Leu Pro Lys Asn Arg Thr Arg Leu Trp Leu Gly Thr
165 170 175 Phe Asp Thr
Ala Glu Gly Ala Ala Leu Ala Tyr Asp Glu Ala Ala Phe 180
185 190 Arg Leu Arg Gly Asp Thr Ala Arg
Leu Asn Phe Pro Ser Leu Arg Arg 195 200
205 Gly Gly Gly Ala Arg Leu Ala Gly Pro Leu His Ala Ser
Val Asp Ala 210 215 220
Lys Leu Thr Ala Ile Cys Gln Ser Leu Ala Gly Ser Lys Asn Ser Ser 225
230 235 240 Ser Ser Asp Glu
Ser Ala Ala Ser Leu Pro Asp Ser Pro Lys Cys Ser 245
250 255 Ala Ser Thr Glu Gly Asp Glu Asp Ser
Ala Ser Ala Gly Ser Pro Pro 260 265
270 Ser Pro Thr Gln Ala Pro Pro Val Pro Glu Met Ala Lys Leu
Asp Phe 275 280 285
Thr Glu Ala Pro Trp Asp Glu Thr Glu Ala Phe His Leu Arg Lys Tyr 290
295 300 Pro Ser Trp Glu Ile
Asp Trp Asp Ser Ile Leu Ser 305 310 315
30738DNAZea maysCDS(1)..(738)misc_feature(82)..(123)CBF-specific domain
30atg gac gcc gac gac tcc tcg tac gca tcg tcc tcc tcg ttc tct ccg
48Met Asp Ala Asp Asp Ser Ser Tyr Ala Ser Ser Ser Ser Phe Ser Pro
1 5 10 15
ccg ccg tcc ccc gcc gat cat ctc cgc ctg ccc ccg aag cgg cgt gcg
96Pro Pro Ser Pro Ala Asp His Leu Arg Leu Pro Pro Lys Arg Arg Ala
20 25 30
ggc cgc aag aag ttc cgg gag acg cgg cac ccg gtg tac cgc ggc gtg
144Gly Arg Lys Lys Phe Arg Glu Thr Arg His Pro Val Tyr Arg Gly Val
35 40 45
cgc gcg cgc gcc ggt ggc acc cgc tgg gtg tgc gag gtg cgg gag ccg
192Arg Ala Arg Ala Gly Gly Thr Arg Trp Val Cys Glu Val Arg Glu Pro
50 55 60
cag gcg cag gcg cgc atc tgg ctc ggc acc tac ccg acc ccg gag atg
240Gln Ala Gln Ala Arg Ile Trp Leu Gly Thr Tyr Pro Thr Pro Glu Met
65 70 75 80
gcc gcg cgc gcg cac gac gtc gcc gcc atc gcg ctc cgc ggc gcc acc
288Ala Ala Arg Ala His Asp Val Ala Ala Ile Ala Leu Arg Gly Ala Thr
85 90 95
gcc gcc gac ctc aac ttc ccg gac tct gcc cac gcg ctc ccg cgc gcg
336Ala Ala Asp Leu Asn Phe Pro Asp Ser Ala His Ala Leu Pro Arg Ala
100 105 110
cgc acc gcc gcg ccc gac gac ata cga cgc gcc gcc gcg cag gcc gcc
384Arg Thr Ala Ala Pro Asp Asp Ile Arg Arg Ala Ala Ala Gln Ala Ala
115 120 125
gag ctc tac cgc ccg tct cct tct tcc tct tcc gcc tcc ggc ctg ctg
432Glu Leu Tyr Arg Pro Ser Pro Ser Ser Ser Ser Ala Ser Gly Leu Leu
130 135 140
ctg cac cac ggc cgc cgg acg atc gct gcc ccg cca ccg ccg ctc tca
480Leu His His Gly Arg Arg Thr Ile Ala Ala Pro Pro Pro Pro Leu Ser
145 150 155 160
ctg ccg ccg ccg gag gct tct tct acc tgc tgc tgg acg acc agt aca
528Leu Pro Pro Pro Glu Ala Ser Ser Thr Cys Cys Trp Thr Thr Ser Thr
165 170 175
gga acc cga gga gca ggt acc acg ccg acc tgc tgc gac ggc ttc ctg
576Gly Thr Arg Gly Ala Gly Thr Thr Pro Thr Cys Cys Asp Gly Phe Leu
180 185 190
gac gag gat tcc atc ttt gac atg cca ggg ctc atc cac gac atg gcc
624Asp Glu Asp Ser Ile Phe Asp Met Pro Gly Leu Ile His Asp Met Ala
195 200 205
tgg ggg atg ctg ctc acg cca cca gcc atg ggc cgg ggc ctc gac tgg
672Trp Gly Met Leu Leu Thr Pro Pro Ala Met Gly Arg Gly Leu Asp Trp
210 215 220
ggc gcc ctt gac gac gac gac gac cat agc cac gtc gac tgc acg ctc
720Gly Ala Leu Asp Asp Asp Asp Asp His Ser His Val Asp Cys Thr Leu
225 230 235 240
tgg acg ctg gac gga tga
738Trp Thr Leu Asp Gly
245
31561DNAZea maysmisc_feature(538)..(561)n = A, T, C or G 31ccacgcgtcc
gccagaacca gctcactcct cactccactt ccactcccaa cagcaagctc 60aagcagtcag
tcaccggcag gggtcagggt cacagtcaca gcagcagcca tggacacggc 120cggcctcgtc
cagcacgcga cctcctcgtc ttccacctcc acctcggcgt cgtcgtcctc 180gtcctcgtcc
gagcagcaga gcaaggcggc gtggccgccg tcgcccgctt cctccccgca 240gcagccgccc
aagaagcgcc ccgcggggcg cacgaagttc cgggagacgc ggcacccggt 300gttccgcggc
gtgcggcggc ggggcgccgc gggccggtgg gtgtgcgagg tgcgcgtccc 360ggggaggcgc
ggcgcgcggc tgtggctcgg cacctacctc gccgccgagg cggcggcgcg 420cgcgcacgac
gccgcgatgc tcgccctgca gggccgcggc gcggggcgcc tcaacttccc 480ggactccgcg
cggctgctcg ccgtgccgcc cccgtccgcg ctcccgggcc tggacgannn 540ccgccgggcg
gcgctcgnnn n 561321430DNAZea
maysmisc_feature(928)..(934)n = A, T, C or G 32ccaaaaaaaa aatctacttt
gattttccat aaagaaaacc tctcagcctc tttcttctag 60ctccatccaa gctttagacc
taatggccgc agccatcgac atgtacaagt actacaatac 120cagcgcacac cagatcgcct
cctcctcctc cccctcggat caggagctcg cgaaagcact 180cgagcctttt ataacgagtg
cttcctcctc ttcgtcctcc tccccctacc atggctactc 240gtcctctcca tccatgtccc
aagattctta catgcctaca ccttcttaca ccagctacgc 300cacctcgcct cttcccactc
ccgccgctgc ctcctcgcag cttccgccac tctactcgtc 360gccttatgcg gcgccgtgca
tgaccggcca gatgggcctg aaccagctcg gcccggccca 420gatccagcag atccaggccc
agttcatgtt ccagcagcag cagcagcaga ggggcctgca 480cgcggcgttc ctgggcccgc
gggcgcagcc gatgaagcag tccgggtcgc cgccgccgct 540ggcgccggcg ccggcgcagt
cgaagctgta ccgcggcgtg cggcagcgcc actggggcaa 600gtgggtggcg gagatccgtc
tcccgaagaa ccgcacgcgg ctgtggctcg gcaccttcga 660caccgccgag gacgcggcgc
tcgcctacga caaggcggcc ttccgcctcc gcggcgacac 720ggcgcgcctc aacttcccgg
ccctccggcg cggcggcgcg cacctcgccg gcccgctgca 780cgcgtccgtg gacgccaagc
tgaccgccat ctgccagtcc ctgtcggagt ccaagtccaa 840gagcggctcg tcgggcgacg
agtcggccgc gtccccgccg gactccccca agtgctcggc 900gtcgacgacg gagggggaag
gggagganga ntcnggctcc gccggctccc ctcctcctcc 960tcctcccccg acgctggcgc
cgcccgtgcc ggtgccggag atggcgaagc tggacttcac 1020ggaggcgccg tgggacgaga
cggaggcctt ccacctgcgc aagtacccgt cctgggagat 1080cgactgggat tccatcctgt
catgagcaat aatagctccg tgtaatttaa ttttctactg 1140tctgggtttt gcggctgcgg
tggcccgatg gcattttaga cgtcggccat ggcggctgca 1200agtagcaatg agtaactagc
tagctagtac atcgtcgtcg actcgtcgtc gtccagtgtt 1260gtgaagcagc gtcaagtaca
tgcgtgctag tctctcctgg ttgagctgcc ggttgttttt 1320ttttctcacg gcacggccag
tcgagaagag tcagtagtgt aatctcgtgg tgttatgatc 1380atcggttgca gcttatgtaa
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 143033475DNAZea
maysmisc_feature(445)..(445)n = A, T, C or G 33gaggcttgag cgacccgagc
aaaaaaaaat catccgcgaa gcagcgacgc aggtttgttc 60tctgatctct ctggttcttc
ttttagcagc agggacgcac agcacaagaa aaggtccggt 120tcaagaggct cttctgttgg
gttttttttt ttagaatcct caagagaaga gtgttcttgg 180gttaggattt ttttctcacc
tcatcacacc tctcctagga ggaacagaga tccagagaac 240cttttttctc ttctggatcc
ttgccgcctt ttggccttcg aaagggggtt gttttgcaat 300accctctctc tctgtttttt
ttttttttat cttggcacaa acccacctcg tttctctact 360tttattgagg aacagagctg
gttttccaaa aaaaaaatct actttgattt tccataaaaa 420aaacctctca gcctctttct
tctanctcca tccaagcttt aaacctaatg gccgc 475341241DNAZea mays
34gtcgtgcctg ttctttgatt ttagctaaga accagccgat ttagtcttct tcttccttac
60aaaaaaattc cccagaaaaa aaaaagttgt gagtagtata ttcttgtacc atttgatggc
120cgctgctata aatctccccg ggcccagcga ggatctgatg cgagcgatgg agtccttcat
180gcaagacgac gccccatccc ctctggccat gccgccggcg ccgtctttcc cagcggccgc
240agcccacggg gcccagtacc cggcgaccca cctgagcccg gcgcagatgc agttcatcca
300ggcccagctc cacctgcagc ggaacccggg gctgggcccg cgggcgcagc ccatgaagcc
360cgccgtccca gtgccgccgg cgccggcgcc gcagcggcct gtgaagctgt accgcggcgt
420gcggcagcgt cactggggca agtgggtggc cgagatccgg ctcccccgga accgcacccg
480cctgtggctc gggaccttcg acaccgccga gcaggcagcg ctggcctacg accaggcggc
540gtaccgcctc cgcggggacg cggcgcggct caacttcccc gacaacgcgg agtccagggc
600gccgctcgac cccgccgtgg acgccaagct gcaggccatc tgcgccacca tcgccgccgc
660gtcgtcgtca tccaagaatt ccaaggccaa gagcaaggcg atgccaatca acgcgtccgt
720tctggaagcg gcagcggcgt ctccgagcaa cagctcctcc gacgaaggtt ccggctccgg
780gttcgggtcg gacgacgaga tgtcctcgtc ctccccgacg ccggtggtgg cgccgccggt
840ggcggacatg ggacagctgg atttcagcga ggttccgtgg gacgaggacg agagcttcgt
900gctccgcaag tacccgtcct acgagatcga ctgggacgcg ctgctctcca actagtcgcc
960cttcgccgac agatgtgctg ttgtagttca gtagtggcag tgtctctggc cgccgcagat
1020gaggttttag gcaatctgca ggccgccggc ccatgtgtat taagtaggtt ttgctcagtt
1080gttggccctg gacttcgccg gcgtttttgt gaccggcgtc cccgagtgca ctgcattggt
1140gtactggtct gtctgtaaaa aaaatggatc tgtacttcta tagtgtgtat tcaaccattg
1200ttcttagttg tgatgttact cgtgctgcaa aaaaaaaaaa a
124135776DNAZea mays 35ccacaatccc cttcaacaaa cgcaccgcac tccacggcag
ccagaaaaca caatccacac 60agtctccgtg aggtctgagc gaggtaggac tagctctgag
cgagttcttc cgtcttccag 120gtataatcct tgcaactctc actatcctct tcgcatcgct
cgtctctttt tgagactagt 180tggatttctc ttctttttcc ttggatccta ctacctacta
gaacagagct ccggagcctt 240ttagtttctg gtatctgatc tactcatttt cttttcttcc
ccttttggtg acataggtag 300gtctagttaa gatctgtcgt gccagttctt tgattttagc
taagaaccag gcgatttagt 360cttcttcctt acaaaaaaaa aatccccaga aaaaaaagtt
gtgagtagta tattcttgta 420ccatttgatg gccgctgcta taaatctccc cgggcccagc
gaggatctga tgcgagcgat 480ggagtccttc atgcaagacg acgccccatc cccgctggcc
tacgaccagg cggcgtaccg 540cctccgcggg gacgcggcgc ggctcaactt ccccgacaac
gcggagtcca gggcgccgct 600cgaccccgcc gtggacgcca agctgcaggc catctgcgcc
accatcgccg ccgcgtcgtc 660gtcatccaag aattccaagg ccaagagcaa ggcgacgcca
atcaacgcgt ccgttctgga 720agcggcagcg gcgtctccga gcaacagctc ctccgacgaa
ggttccggct ccgggt 77636720DNAZea mays 36aattaacccc tcactaaagg
gaataagctt gcggccgcag aagctgtacc gcggcgtgcg 60gcagcggcac tggggcaagt
gggtggcgga gatccggctc ccgcgcaacc gcacccggct 120ctggctcggc accttcgaca
ccgccgagga ggcggcgctg gcctacgacc aggccgccta 180ccgcctccgc ggcgacgcgg
cgcgcctcaa cttccccgac aacgccgcct cccgcggccc 240gctccacgcc tccgtcgacg
ccaagctcca gagcctgtgc cagagcatcg ccgcgtccaa 300gaagggcgcc aagaagccag
cctccgccgc agctgccgcg tcgtcgtccg cccccaccag 360caactgctgc tcctcgccgt
cgtccgacga cgcgacctcg tcctgcctcg agtccgccac 420cgagtcctcg tgcccgtccc
cgtccacgtc gccgtccgcc tcgcccgggc cgacggtgcc 480ggagatgcag cagctggact
tcagcgaggc gccgtgggac gaggccgccg gcttcgcgct 540caccaagtac ccgtcgtacg
agatcgactg ggactccctc ctcgccaatt aatcgtcgcc 600ccctgttggc tggctggctg
gctcccttcc tacccacact gccgtcgtgc agctggagga 660gggttaaaga aggttaacgg
ccgtcgcgca gatgggattt tagacatcct gcgcgggcac 72037733DNAZea
maysmisc_feature(663)..(691)n = A, T, C or G 37tcctctgcca ccaccaccac
ctcgtcgtgt accacctctt gctgctcgtc cactgtcacg 60gactcgtcct cgtcgccccc
gtcaccggcg gcggccaatg ccgcgcccgc gccacggaag 120gcggaggccg agactggcgg
ggacgaggac ggttgtgctg gtaaggcggc gccggcgaag 180aagcgggagc ggagcagcga
ggggaagcac ccgacgtacc gcggcgtgcg gatgcgggcg 240tggggcaagt gggtgtcaga
gatccgcgag ccgcgcaaga agtcgcgcat atggctcggc 300acgttcccga ccgccgagat
ggccgcgcgc gcccacgacg ccgccgcgct cgccatcaag 360ggccgcgcca cgcaactcaa
ctttccggtc ctcgccggcg tgctcccgcg cgccgcgtcc 420gcggcgccca aggacgtcca
ggccgccgcc atgctggccg ccgcgttcac gtcgccgtcg 480tcagcaccat cggagctcga
cgccggcgcg ccggcgccac gcgaagtgcc cgccacaaag 540aacggctccc cgtccgagga
cgaggcaggc gctgaggcgc cggtaccacc ggcggcggcc 600tctcagccag ggacgccgtc
gagcggcgtg gacgaagagc ggcagctgtt cgacttgccg 660ganctgctcc tcgacgtccg
ggacgggttc ngggcgcttc ccgccgatgt gggccccgtt 720ccatgacgtg ttg
73338509DNAZea mays
38acttgatcct cctgcacacc agagtaccag acaccactgc gcatcagtgc aaagaggtag
60ctaccctatc tgccatggct caagagctcc acgaaacgtc ctcttgctct gccaccacca
120cctcgtcgtg caccacatcc tgctgctcgt ccactgtcac agactcgtcc tcttcgcccc
180cgtcaccggc ggcggccaat gccgcgcccg cgacacggaa gcggcaggcg ttggaggccg
240aggccgaggc cgaggcgggc ggtgaggagg aggaggagga ggaggaaggc tgtgctggta
300ataaggcggc gccggccaag aagcgaccgc ggggcagcga ggggaagcac ccgacgttcc
360gcggcgtgcg gatgcgggcg tggggcaagt gggtgtcgga gatccgcgag ccgcgcaaga
420agtcgcgcat atggctcggc acgttcccca ccgccgagat ggccgcgcgc gcccacgacg
480tcgcggcgct cgccatcaag ggccgcgcc
50939684DNAZea maysmisc_feature(655)..(679)n = A, T, C or G 39gataattcat
tatacattac ttgcacttgc agaaaaacac agcttcgtct acacatagcc 60tggcatgcaa
cggagaggag attgactcca tggaaagctg ccttaaaaca tgtagctcca 120tgagagcctc
cctccacatt gcataattag cctagcacta gtaatatata gtgcaataat 180taactagtgg
tatgaatcaa ataatccttc atagcatatt tatatatgat gcttatttat 240accgtgacca
attcttgatc acgtaggttg gttgcagttg taattaatca gtcagcgttg 300atttgagacg
aatggtgacc gcttgaggcc ggcttgacct ccgaactgat gtctggcgat 360cagttgaggg
acgagaagct ccagaggtcg atctcgcctt cctcctccca ctcccactcg 420ctggccgcgt
ccgtcgccat gggcgctggc gcgaagaacg tgctgcatgg gtccatcata 480gcgtactcca
tgcacttggg cgaccggaag aaggactcga tgtccgcgag cgcggcgagg 540tccgcgtcgc
ccgccaccat gtccacgtcg tcgtcgtgcg ccacgccgta gcacaggtca 600tcagtgttcc
cggccgtctc cggggagctc acgcccgacg acgcgtcgcc gtagntgcac 660ggaggggtgg
cgccnnnnnc ggcc 68440704DNAZea
maysmisc_feature(639)..(704)n = A, T, C or G 40ctagacatct cgcgggtctt
atcgactcca acaagaacac actacacacc agccagcgag 60atagcgaacg ctaggaaccc
agtggccatc tttggagcgg ccatgacgct ggatcagaac 120catgccatgc cgatgcagcc
cccggccctg cagcccggaa gagcatatgg agcagagggc 180agtgctgtgg tgcatggttc
catcagaaca gtaggaagaa gcgacctcgc agatcacgag 240atgggcctac gtcagtggca
gctgtcatcc agcggtgggc tgagcgcaac aagcatttgg 300agtatgagga atctgaggag
gcaaagcgac caagaaaagc acctgcaaag ggttcaaaga 360agggctgtat gaagggaaaa
ggggggcctg acaatactca atgtggatac cgtggagtga 420ggcagcgtac ttgggggaag
tgggttgctg aaataagaga gccaaatcgt gtcgacagac 480tctggctggg taccttccca
accgcggagg atgcagctag ggcctatgat gaggcagcca 540gagcgatgta tggagacttg
gcacggacta acttccccgg acaggatgca acaacctctg 600cccaagctgc tctatcatcg
acctctgccc aggctgctnn nncagctgtt gaagctcttc 660nnnnnggcac gtcatgcgag
tcgannanga cntnnnntca ctcn 70441722DNAZea mays
41gccacatccc atcccaactc cacggtgagg tttgagcgat ccgagcaaaa aaaatcaacc
60gaggcgcagc aacgcaggtt cgttctctga actctctggt tcttttagca gcagggacgc
120gcagcacaag aaaggtccgg ttcaagaggc tcttcttttg ggtttattag aatcctcaaa
180ggaagcgtgt tcttgggtta ggattttttt ttctcacctc atcacacctc tcctaggagg
240agcagagatc cagagaacct tttttctctt ctggattctc accgcttctt ttttttttct
300ctgcacaaac acaccctcgt ttctctactt ttattgagga acagggctga tttccaaact
360ttttcctacc cctgttttca taaacaagct ccttttattt cttcccgttt taagttctgc
420ccagcctctt cttctagctc catccaagct ttctccatct agtagttcca atggccgcag
480ccatagacat gtacaagtac tgcaatacca gcgcacacct tatcgcctcc tcgtccccct
540cggatcagga gctcgcgaaa gcactcgagc cttttataac gagtgcttcc tccccctacc
600atcgctactc gttggcccca gattcttaca tgcctacacc ctcctcctac accacctcgc
660ctcttcccac ccccacctcc tcgcctttct cgcagcttcc gccactctac tcgtcgcctt
720ac
722421949DNASecale cerealepromoter(1)..(1949)Rye CBF31 promoter
42ataagcatga ccatgagcca tttcgcatca ccttcaagaa gggaactcgt gcccaaaggc
60gtcattgttg cgagtgttga agcaaggaca aaatttctcc ttgagaaaga gtagagccac
120atcttatata tgcgaaaaca acacacacac acacacacac acacatggca cgcgacaaca
180tcgacgacag cacataacca aatggcaaca gatgaaagaa tggcgtaacc attgaggcca
240gacggggcgc gataaagcta tgtcaaacaa gggctacatg gatcttgtgc ggaaccagca
300gaaggcggca gatcaatgaa agatctatcc aacttagata ttgtccatcc atggcatgag
360ctagtggatt ctagcgtggg ggcctccgga atcgcgagag cgacccagga ggcaggggac
420acttttcaca aagtttgagt tgggggagga gggcagcaag tacttgtaat cacatatatt
480gtgattgatt agttactaaa catatgtttg tctcgtttgt ctccacgtct agagtagagg
540cacaatccca accaccacct ccaaaattct cccacacgcc gccgcaacgg tccccttcca
600ctttctcgtc tcgcggcaaa aggagtgaga accctttgta tgttaccttt ttttattctt
660aggttttgtt ttcctgacga catcaccaag gcgatggcgg tctcttccta cctcaacaac
720atccgataca gcactaccga agggcgcgtg tgagtttttg tccctggatg taatggctct
780cttcagatct tggtctttgt tttgtttttg tccccggatc taccggttct cctcagatat
840tggtcttcca tgggcagcta ttagacaatt tagacatgac ctgtgggagt atgtgataat
900ttgctagtta gtgaaattga tttctaccga caaaaacata aaaaccattg gaaattattg
960gagtgttgat ggttatggta tagtccagtt tttcttttgg agaaagagag gtctgtctag
1020agcacatcta tcttagttat tgtacatcta agtgactcag tcaaactaaa aagaaaaaga
1080aaaaagatta aaaaatgctt acacgaatct tagcgtaaga ttaagaacaa agttggaagt
1140acacttttca aaggacggag ggagtagcac ttagatgtca atacttagga cacatcttta
1200tgtgttttag gaaaactgga gaaaaaagat atgctccttt tcaagaatta aagtaagaaa
1260acaacgacgt gcccttaatt tgttagtcga tcaagaatca gttcccgtcg ctcacgcgtc
1320tggaaggcca gcgtatgcag ccgcaaatcc ctccccgata tacccaagta ctccgtacga
1380tatacaaaaa ggttgttctc gcacgattac aacttttgat tagataaaaa tgatgtgcag
1440ctccccgaaa agaaaagtag ggaacaaaga tatagtgtgc tcatccgtat ggaatctatt
1500atggcgtcga aacgtctaga agggcgccac agccttcaaa gcccttccga gatgaacaat
1560ctcggggtga acaagcagac accagtgcat ctcatggcaa accagaaaaa atgtaacaaa
1620agtagcaccg tggtggtacg tccaagcgag agttacctcg atgaagctgc ctactgctcg
1680ctagtgtaag tgagagaaag aagaaccggg attttccatt agaaaccaat ctgccgtgag
1740agagtccatt tccacccgag cgtccacgtc gtggcgggta cccaacccgt tgccagtagc
1800cccaaactac tcacctgctt gattccccgc ttctagttct catcggagct acaatccatc
1860gaccctcact acaacggctt aacgcgcacc acaccccgcc ccgctacgct gcacactccg
1920gtccggtgtt atacgccccc ccgctacag
194943669DNAZea maysCDS(1)..(669)ZmCBF4 43atg ccg ccg ccg ccg tcc tcg tcg
tct tcg ccc tcg cag gac gcc ggc 48Met Pro Pro Pro Pro Ser Ser Ser
Ser Ser Pro Ser Gln Asp Ala Gly 1 5
10 15 agc ccc aag cga gcc gcg ggg cgc aac
aag ttc cgg gag acg cgg cac 96Ser Pro Lys Arg Ala Ala Gly Arg Asn
Lys Phe Arg Glu Thr Arg His 20 25
30 ccg gtg ttc cgc ggc gtg cgc cgg cga ggc
cgc gcg ggg ggc cgg tgg 144Pro Val Phe Arg Gly Val Arg Arg Arg Gly
Arg Ala Gly Gly Arg Trp 35 40
45 cgg tgg gtg tgc gag gtc cgc gtt cca ggc cgc
cgc ggc tgc agg ctc 192Arg Trp Val Cys Glu Val Arg Val Pro Gly Arg
Arg Gly Cys Arg Leu 50 55
60 tgg ctc ggc acc ttc gcg gcc gcg gag gcc gcc
gcg cgc gcg cac gac 240Trp Leu Gly Thr Phe Ala Ala Ala Glu Ala Ala
Ala Arg Ala His Asp 65 70 75
80 gcc gcc atg ctc gcg ctc cgc ggc ggc gcc gcg cgc
gcg cgg tgc ctc 288Ala Ala Met Leu Ala Leu Arg Gly Gly Ala Ala Arg
Ala Arg Cys Leu 85 90
95 aac ttc ccg gac tcg gcc tgg ctg ctg gac gtg ccg gtg
ctg ccg ctg 336Asn Phe Pro Asp Ser Ala Trp Leu Leu Asp Val Pro Val
Leu Pro Leu 100 105
110 ccc cac ggc gca gcg ccc tgg gcc gac gtc cgc cgc gcc
gtc gcg ata 384Pro His Gly Ala Ala Pro Trp Ala Asp Val Arg Arg Ala
Val Ala Ile 115 120 125
gcc gtc gag ggg ttc ttc cgg gcg cgg cca gcc gcc gag gac
gcc atg 432Ala Val Glu Gly Phe Phe Arg Ala Arg Pro Ala Ala Glu Asp
Ala Met 130 135 140
tcc gcc acc tcg gag ccg tcg tca gcg gcc acg gag gcg gag gcg
gag 480Ser Ala Thr Ser Glu Pro Ser Ser Ala Ala Thr Glu Ala Glu Ala
Glu 145 150 155
160 gcg tcc tcc tcc tcc ggg acc gac ggc ggc gcg ccg gag gcc tcg
ccg 528Ala Ser Ser Ser Ser Gly Thr Asp Gly Gly Ala Pro Glu Ala Ser
Pro 165 170 175
ttc gag ctg gac atg ctg agc gat atg ggc gcc ggc ttg tac tac gcg
576Phe Glu Leu Asp Met Leu Ser Asp Met Gly Ala Gly Leu Tyr Tyr Ala
180 185 190
tgc tta gcg cag ggg ctg ctc gtg gag cct cca ccg tta gac gcg ccg
624Cys Leu Ala Gln Gly Leu Leu Val Glu Pro Pro Pro Leu Asp Ala Pro
195 200 205
tgc ccc gac gac agc gac tgt ggc ctc gcg ctc tgg tcc tac tga
669Cys Pro Asp Asp Ser Asp Cys Gly Leu Ala Leu Trp Ser Tyr
210 215 220
44696DNAZea maysCDS(1)..(696)ZmCBF6 44atg gac gcc gcc ggc tcc ttc agc gac
tac tcc tct gga acc ccg tcc 48Met Asp Ala Ala Gly Ser Phe Ser Asp
Tyr Ser Ser Gly Thr Pro Ser 1 5
10 15 cct gtc gcc ggc ggc ggc ggc ggc gac
gac ttc ggc tcc tcc tcc tac 96Pro Val Ala Gly Gly Gly Gly Gly Asp
Asp Phe Gly Ser Ser Ser Tyr 20 25
30 atg aca gtg tca tcg gcg ccg ccc aag cgc
cga gcc ggg cgg acc aag 144Met Thr Val Ser Ser Ala Pro Pro Lys Arg
Arg Ala Gly Arg Thr Lys 35 40
45 ttc aag gag acg cgg cac ccc gtg tac aag ggc
gtg cgg cgg agg aac 192Phe Lys Glu Thr Arg His Pro Val Tyr Lys Gly
Val Arg Arg Arg Asn 50 55
60 ccc ggg agg tgg gtc tgc gag gtg cgg gag ccg
cac ggc aag cag cgg 240Pro Gly Arg Trp Val Cys Glu Val Arg Glu Pro
His Gly Lys Gln Arg 65 70 75
80 ata tgg ctc ggg acc ttc gag acc gcc gag atg gcg
gcg cgc gcg cac 288Ile Trp Leu Gly Thr Phe Glu Thr Ala Glu Met Ala
Ala Arg Ala His 85 90
95 gac gtc gcc gcg ctc gcg ctg cgc ggc cgc gcc gcc tgc
ctc aac ttc 336Asp Val Ala Ala Leu Ala Leu Arg Gly Arg Ala Ala Cys
Leu Asn Phe 100 105
110 gcc gac tcg ccg cgg ctc ctc agg gtg ccc ccg acg ggc
tcc ggg cac 384Ala Asp Ser Pro Arg Leu Leu Arg Val Pro Pro Thr Gly
Ser Gly His 115 120 125
gac gag ata cgc cgc gcg gcc gcc gtg gcg gcg gac cag ttc
cgc ccg 432Asp Glu Ile Arg Arg Ala Ala Ala Val Ala Ala Asp Gln Phe
Arg Pro 130 135 140
gcg ccc gat cag ggc aat gtg gcc gcc gag gag gag gcg gcc gat
aca 480Ala Pro Asp Gln Gly Asn Val Ala Ala Glu Glu Glu Ala Ala Asp
Thr 145 150 155
160 cca cca ccg gat gcc ttg ccc agc gtg acg atg cag agc gtc gac
gac 528Pro Pro Pro Asp Ala Leu Pro Ser Val Thr Met Gln Ser Val Asp
Asp 165 170 175
gac ccg tac tgc att atc gac gac agg ctc gac ttc ggg atg cag ggg
576Asp Pro Tyr Cys Ile Ile Asp Asp Arg Leu Asp Phe Gly Met Gln Gly
180 185 190
tac ctc gac atg gcg caa ggg atg ctc att gat ccg cca ccg atg gcc
624Tyr Leu Asp Met Ala Gln Gly Met Leu Ile Asp Pro Pro Pro Met Ala
195 200 205
ggt tcc tcc acc agt ggc ggc ggc ggc gac gac gat gac gac gac ggt
672Gly Ser Ser Thr Ser Gly Gly Gly Gly Asp Asp Asp Asp Asp Asp Gly
210 215 220
gag gtc aag ctc tgg agc tac tga
696Glu Val Lys Leu Trp Ser Tyr
225 230
451993DNAZea maysCDS(136)..(804)ZmCBF4 genomic 45ccgtagcaca gcatttgttt
tcttacaatc cttcctcgca agtcgcatgc cacacttaca 60aatacgcgcc ttccaaaccg
ccatgattca ctccgattgc gcgccagatc aagctgcgag 120tcggcggcct cgatg atg
ccg ccg ccg ccg tcc tcg tcg tct tcg ccc tcg 171 Met
Pro Pro Pro Pro Ser Ser Ser Ser Ser Pro Ser 1
5 10 cag gac gcc ggc agc ccc
aag cga gcc gcg ggg cgc aac aag ttc cgg 219Gln Asp Ala Gly Ser Pro
Lys Arg Ala Ala Gly Arg Asn Lys Phe Arg 15
20 25 gag acg cgg cac ccg gtg ttc
cgc ggc gtg cgc cgg cga ggc cgc gcg 267Glu Thr Arg His Pro Val Phe
Arg Gly Val Arg Arg Arg Gly Arg Ala 30 35
40 ggg ggc cgg tgg cgg tgg gtg tgc
gag gtc cgc gtt cca ggc cgc cgc 315Gly Gly Arg Trp Arg Trp Val Cys
Glu Val Arg Val Pro Gly Arg Arg 45 50
55 60 ggc tgc agg ctc tgg ctc ggc acc ttc
gcg gcc gcg gag gcc gcc gcg 363Gly Cys Arg Leu Trp Leu Gly Thr Phe
Ala Ala Ala Glu Ala Ala Ala 65
70 75 cgc gcg cac gac gcc gcc atg ctc gcg
ctc cgc ggc ggc gcc gcg cgc 411Arg Ala His Asp Ala Ala Met Leu Ala
Leu Arg Gly Gly Ala Ala Arg 80 85
90 gcg cgg tgc ctc aac ttc ccg gac tcg gcc
tgg ctg ctg gac gtg ccg 459Ala Arg Cys Leu Asn Phe Pro Asp Ser Ala
Trp Leu Leu Asp Val Pro 95 100
105 gtg ctg ccg ctg ccc cac ggc gca gcg ccc tgg
gcc gac gtc cgc cgc 507Val Leu Pro Leu Pro His Gly Ala Ala Pro Trp
Ala Asp Val Arg Arg 110 115
120 gcc gtc gcg ata gcc gtc gag ggg ttc ttc cgg
gcg cgg cca gcc gcc 555Ala Val Ala Ile Ala Val Glu Gly Phe Phe Arg
Ala Arg Pro Ala Ala 125 130 135
140 gag gac gcc atg tcc gcc acc tcg gag ccg tcg tca
gcg gcc acg gag 603Glu Asp Ala Met Ser Ala Thr Ser Glu Pro Ser Ser
Ala Ala Thr Glu 145 150
155 gcg gag gcg gag gcg tcc tcc tcc tcc ggg acc gac ggc
ggc gcg ccg 651Ala Glu Ala Glu Ala Ser Ser Ser Ser Gly Thr Asp Gly
Gly Ala Pro 160 165
170 gag gcc tcg ccg ttc gag ctg gac atg ctg agc gat atg
ggc gcc ggc 699Glu Ala Ser Pro Phe Glu Leu Asp Met Leu Ser Asp Met
Gly Ala Gly 175 180 185
ttg tac tac gcg tgc tta gcg cag ggg ctg ctc gtg gag cct
cca ccg 747Leu Tyr Tyr Ala Cys Leu Ala Gln Gly Leu Leu Val Glu Pro
Pro Pro 190 195 200
tta gac gcg ccg tgc ccc gac gac agc gac tgt ggc ctc gcg ctc
tgg 795Leu Asp Ala Pro Cys Pro Asp Asp Ser Asp Cys Gly Leu Ala Leu
Trp 205 210 215
220 tcc tac tga aacccatgtc gaccaacgct tgtagattat ctattctttc
844Ser Tyr
cttttgggaa atgcggatat tataaattcc acgaactagt acaatatttc tctgttcact
904tcggaaatac tagaaagtta ccgaaatact gtacgatgcg catgcgactg gcagcaagcc
964ttttttcctt ttcttttctg gaatgatgca aatagaacgt gtagatgttt ttagtccagg
1024aatgttaagc gctaataatt aatttcagtc acacttttca acataaccaa actaaatggt
1084acagtgctga gggctaacat ttatttattt atttattaaa ctgatgcggc tcgctcgcca
1144gacgtagctt ccacgtcgac ctaaaatcgt gtgtctgctg cagatataca tcttggccgc
1204acacgtgtct tctaccgcta tacattgccc ttgtcttgcc tagctgctgc tttaccgtgg
1264gtccggaccg tgccgctcgg aatttcccgc gaaaaaggaa aacaaaagcg tcagctggct
1324cacccaagtt aaaaagaaaa aaaaaacaat cacttcgatc ttcaagatgc tgctctgctt
1384ttggcaagct cgtagtgtaa atgtgtaatt acctgattgg catagctaat gcctttgtgc
1444aaaaacatat tgcacaacct aagaacgttt actatatacc tgtcttagcc ttttgctact
1504ccccacacgg ccacacctcc aaccaggtgt tcgtacactc ccaatgctac cttaaagata
1564tgttttatgt ttgcgtcaat catacctcta caaatgagtt tgggtataat attatcctaa
1624actatgggag atccattctc cgagtttagg gggtacaata gtttcctaaa aaaaataaga
1684gttcttgcta acctttctca tcttggtctt ctgcagatgt caacttctta taggctcgtc
1744cacctcggtg acgaccatga accggtgtag agcaagtggc gcgcaacgaa taagcttgaa
1804acgcgaccca aaccaaattc acgatgattt ataggctcgt ccgttatcca caccgacgta
1864agtatcaatc tcaacttcgt attcgaatca caacgcacag tagatgccct catgtcgagg
1924tccctgatga tcgtgctagc atgtttcaga agtatgaaac tgaaaccttt agcccccccc
1984ccccccccc
1993464786DNAZea maysCDS(1700)..(2395)ZmCBF6 genomic 46cgacgggtgc
gggtatgagt gaagatttta acccgcgggc agccaacccg aagtttcgcg 60ggtgcgtgtc
tctatttcaa cccccgggtg acccacaacc gacccaaaat ttggtgtgtt 120cgttattttg
tgcaataatg attaagatac aatattaatt atttgtcaac aagtgacttg 180gtcaatgatt
caagaagtgt tttgttgctg ctcaagatga gaagacttaa aatacataga 240tatgactcaa
ttattcgtgg tatttgattg cttatagctg aattattgct taaaaatgta 300cgatgagtaa
aactcgatgg tgtcccaaaa cccacccgaa acctgatggg tttgggtgcg 360ggtttagaat
tgcacccgcg ggcgggttta gattaaactg gtttttagga gttatttttt 420gggagctctt
ggatatatgc ttatggataa gcatggaaaa acatatgata tgacgaccgt 480gagacagtaa
gtcctttgtt gcagcatcaa ccatcattcc tgaagtactc acaagttttg 540cagccatcat
tcaacgagag accacaattc atgctagtcc ggtccataat tgactctaaa 600ataatttaac
agaacatatc tgagatctat atcattccaa ttagatttta ttttatgaat 660tgtatagttt
tacttgtttt tacgtgatgt aattttattt tttatgaatt acgaaatttt 720atttacttaa
ttaagtactc ggatgaagtt ttatgtcaac taaacgacgt gttccatgca 780attaaaagtt
mgtraaagaa aaaaatgatg tggcgttcag gtaactgtag cttgagggta 840aaggctgtgt
actagagcac gatattagat taggactatg aagcttgata acagtgattg 900ataaatgata
tctaatataa cggtcgataa aggctaaata aaaggttaac aataatcaat 960agcaacattg
atagtatagt cgaagttgac gtgacggtcg ctaaagatat cagcagaagc 1020aatagaatca
gagtaggagt ggattgattt ctgcagcagc aagcacggcg gcgacgctct 1080cgtagcacgg
caggcgcgca aattgcgggc gtacactgcc aagccagcgg tcgccattta 1140tccatcaggc
cgtcgaacga acgaagagcg ccccgagacc aagtccaata taactgccac 1200ctgaccgtgg
agtgggatcc tctgcgtggc tccatccgat cacttgcact ggtccagcgc 1260gtacggcgta
cagagaatcc cacgctagct atccgtctcg cacgagtgcg catcagacta 1320ccagagtttc
gctctctcaa aggcgtcacg ccttttcgtt ttcgacacac gtcgccgtca 1380gtgaaagaac
cgtaccaccc aacagcagcg agggaggcca gccggctgcg cgttccgcga 1440gctccagctg
gttccgatcc ttcgagcgcg aagagacctc tcgatctcca gagtccagag 1500tccatcagtc
gcacaacctg gctggcacct cccgagccgc accgcgtagg aatagtgcac 1560gcggccacgt
ccctctccac gcgctaacct atatagcccg ccacctttcc ctccaaattc 1620aaatcaaatg
ctccgactcg ttcctccccc tgctcgccac gcccacggcc actaccactc 1680acacacacag
gcagcagcc atg gac gcc gcc ggc tcc ttc agc gac tac tcc 1732
Met Asp Ala Ala Gly Ser Phe Ser Asp Tyr Ser
1 5 10 tct gga acc ccg
tcc cct gtc gcc ggc ggc ggc ggc ggc gac gac ttc 1780Ser Gly Thr Pro
Ser Pro Val Ala Gly Gly Gly Gly Gly Asp Asp Phe 15
20 25 ggc tcc tcc tcc tac
atg aca gtg tca tcg gcg ccg ccc aag cgc cga 1828Gly Ser Ser Ser Tyr
Met Thr Val Ser Ser Ala Pro Pro Lys Arg Arg 30
35 40 gcc ggg cgg acc aag ttc
aag gag acg cgg cac ccc gtg tac aag ggc 1876Ala Gly Arg Thr Lys Phe
Lys Glu Thr Arg His Pro Val Tyr Lys Gly 45
50 55 gtg cgg cgg agg aac ccc
ggg agg tgg gtc tgc gag gtg cgg gag ccg 1924Val Arg Arg Arg Asn Pro
Gly Arg Trp Val Cys Glu Val Arg Glu Pro 60 65
70 75 cac ggc aag cag cgg ata tgg
ctc ggg acc ttc gag acc gcc gag atg 1972His Gly Lys Gln Arg Ile Trp
Leu Gly Thr Phe Glu Thr Ala Glu Met 80
85 90 gcg gcg cgc gcg cac gac gtc gcc
gcg ctc gcg ctg cgc ggc cgc gcc 2020Ala Ala Arg Ala His Asp Val Ala
Ala Leu Ala Leu Arg Gly Arg Ala 95
100 105 gcc tgc ctc aac ttc gcc gac tcg
ccg cgg ctc ctc agg gtg ccc ccg 2068Ala Cys Leu Asn Phe Ala Asp Ser
Pro Arg Leu Leu Arg Val Pro Pro 110 115
120 acg ggc tcc ggg cac gac gag ata cgc
cgc gcg gcc gcc gtg gcg gcg 2116Thr Gly Ser Gly His Asp Glu Ile Arg
Arg Ala Ala Ala Val Ala Ala 125 130
135 gac cag ttc cgc ccg gcg ccc gat cag ggc
aat gtg gcc gcc gag gag 2164Asp Gln Phe Arg Pro Ala Pro Asp Gln Gly
Asn Val Ala Ala Glu Glu 140 145
150 155 gag gcg gcc gat aca cca cca ccg gat gcc
ttg ccc agc gtg acg atg 2212Glu Ala Ala Asp Thr Pro Pro Pro Asp Ala
Leu Pro Ser Val Thr Met 160 165
170 cag agc gtc gac gac gac ccg tac tgc att atc
gac gac agg ctc gac 2260Gln Ser Val Asp Asp Asp Pro Tyr Cys Ile Ile
Asp Asp Arg Leu Asp 175 180
185 ttc ggg atg cag ggg tac ctc gac atg gcg caa ggg
atg ctc att gat 2308Phe Gly Met Gln Gly Tyr Leu Asp Met Ala Gln Gly
Met Leu Ile Asp 190 195
200 ccg cca ccg atg gcc ggt tcc tcc acc agt ggc ggc
ggc ggc gac gac 2356Pro Pro Pro Met Ala Gly Ser Ser Thr Ser Gly Gly
Gly Gly Asp Asp 205 210 215
gat gac gac gac ggt gag gtc aag ctc tgg agc tac tga
tggccgatcg 2405Asp Asp Asp Asp Gly Glu Val Lys Leu Trp Ser Tyr
220 225 230
cacgtgtgtc tgaaggaaga caggttgcat tgggcaaata acttccctgt
acagccttgg 2465gaagaaccgg taccggtgaa atgtactggc cgtggccctt tcccttcggt
tcgtctatgc 2525tatgtaatgt tatgtatcct gctcttctga tgattaaggg tattcaggag
aagcagaaga 2585ctgggtttac tcggtttgat cgtttaattt aatttggagc tagagatgta
cctgcatgca 2645tatataatgc atgcacggta actgtgatat aatattcaga gtgccatcaa
cacctgtaac 2705gagccataaa aagcatggtg tatgcattta ggaacgaccg tgggcatgat
ggtatactat 2765tgatgttgca ttgagaattt ggctcttgtc gcaggaatca gaaactagaa
tccttctaaa 2825aaaaactggt acccacacgc acctgatcgg ctgatcatga actccaatac
tctaagttca 2885tgtgcttcaa atccttgctg tactgtaatg ctacacggta aatcataacg
tggtcgctgt 2945attttgatat gtatacttgg agttattttt tatatcacct aaagctatat
ctcgtgtttt 3005cttaagaaat aaaataaggg gtatgcatga ggtaactgtt cgcttaaata
atattctctc 3065tatcccaaaa tataatttgt tctaaactat tcatatatat atttattaag
taacatatga 3125atatagttca cgtgtatgtc tatattcatt atcatttgaa tgaacatgga
tggaaaaaaa 3185agtaagctaa aaaaactata tttagaacgg atggaatact gtatattggg
tagacaatat 3245gacatctcaa atataagagg atgagaggat agtttaagat accactaagg
ctctgttcgg 3305tttgtaggga ttggagcccc ggattgattc atagccggat tacttctcta
atttatatag 3365attttgatga gctggaacga atcctggttg attctcgtag aagcgaacgg
gccctaaata 3425taggccagga tgaattgtag acgttttgac tcttggacac aaagtttttg
ctatatgtct 3485agacatatat catgtatgta agtgaataac aaaatgcatt ttcttggcaa
acttcatata 3545tatttttttc tgcggtctat caaatgattg gtcattcaaa aatttctgtc
ttttcaatcc 3605tctatttttt tacaactgta ttaatacaga atttttttat tttttctgtt
catatgtctt 3665tgcattattc taaagaggct ccaccggtct ttttttaacg aatagaacat
tctgtctcgc 3725attagcgcga gctcgtggtt tgcgttgcag cttatgaagg ctacggacta
cctgataacg 3785gaaaaggctt tgacgatgtc aatgaaacgg cgtagtgtag tgtagtgggc
tcagctggca 3845agccgcggga aggaatactc ggtccatagc atatagcgag gtactgaggt
ctcagcgtca 3905ctttcagagc tcggagtggc cgacgggtga tggtcactct cgcgcgctca
ctgggtcggg 3965ccgcaagtgc cattccaaag ccaacaaaag tagtgcccct gccaaccgcc
agctcacgga 4025gtcgcagcgt gtttctctct tttttaccca gtcctcccgc gggttcgcgc
cgccgctgca 4085tagcaaacta gaagagaagt tcctgtctgc gacttcagga gtcggaaacg
ctggcaagaa 4145ccacacgaac agcgggcagc aagttttttt ttttttttat gttttcattt
gattgtattg 4205gctcttgtca tgtaatgtac tccctcactc cgtctcaaaa ttaactagtg
cgtcatctga 4265attctgaacc gaccgtatga gacggatcag atgatatgaa cgctcaatcc
aaatgactat 4325ctggttaacg gacatcatgg tcctattaat tcatgtttca taataatata
taataatata 4385gagatagtgt tcaaataaag tcccaaaaaa agtgtctcgt cgagctagct
atataggtca 4445catgcctata gcaatatatt gtaccatccc tcataccaca aacatataca
accttattat 4505tttacggttt catgtatcgt gttaggatat acatcaacaa attctcgatg
tgtgtcaagc 4565tctataagtg gtggcgatgg aaatctttgt gtttcatgga tgacgatcaa
atgcaagact 4625tgagtgtgcc tagtgttttc catagccatg ttatttggtt tcgtcgagtt
agtttgattt 4685ttcaaagaac tattcattaa gtcaatattt gacatgtaat ggatgtacgc
atctaatacg 4745aaaagccgga actcttgttt ccataataaa aaaatgcatc g
4786
User Contributions:
Comment about this patent or add new information about this topic: