Patent application title: Plants having increased tolerance to heat stress
Inventors:
Jiangxin Wan (Bath, CA)
Yafan Huang (Bath, CA)
Monika M. Kuzma (Battersea, CA)
IPC8 Class: AC12N1582FI
USPC Class:
800289
Class name: Multicellular living organisms and unmodified parts thereof and related processes method of introducing a polynucleotide molecule into or rearrangement of genetic material within a plant or plant part the polynucleotide confers resistance to heat or cold (e.g., chilling, etc.)
Publication date: 2009-06-18
Patent application number: 20090158466
Claims:
1. A method of producing a heat stress tolerant plant comprising:a)
providing a nucleic acid construct that increases the expression of a
MYBsubgroup-14 polypeptide;b) inserting said nucleic construct into a
vector;c) transforming a plant, tissue culture, or a plant cell with the
vector to obtain a plant, tissue culture or a plant cell with increased
expression of said MYB subgroup-14 polypeptide;d) growing said plant or
regenerating a plant from said tissue culture or plant cell, wherein a
heat stress tolerant plant is produced.
2. A heat stress tolerant transgenic plant produced by the method claim of 1.
3. A transgenic seed produced by the transgenic plant of claim 2, wherein said seed produces a heat stress tolerant plant.
4. A method of identifying a heat stress tolerant plant, wherein the plant has reduced flower abortion and increased yield as compared to a control plant comprisinga. exposing a population of flowering plants to a heat stress treatment; andb. selecting a plant from said population of plants having reduced flower abortion thereby identifying a heat stress tolerant plant.
5. A plant identified by the method of claim 4.
6. A seed produced by the plant identified by the method of claim 4, wherein said seed produces a heat stress tolerant plant.
Description:
REFERENCE TO RELATED APPLICATIONS
[0001]This non-provisional patent application claims priority under 35 U.S.C. §119(e) to U.S. Ser. No. 60/925,312 filed Apr. 18, 2007, and U.S. Ser. No. 60/965,582 filed Aug. 20, 2007, the contents of which are herein incorporated by reference in their entireties.
FIELD OF THE INVENTION
[0002]The invention is in the field of plant molecular biology and relates to transgenic plants having novel phenotypes, methods of producing such plants and polynucleotides and polypeptides useful in such methods. More specifically, the invention relates to the use of MYB polynucleotides and transgenic plants expressing these polynucleotides and polypeptides.
BACKGROUND OF THE INVENTION
[0003]Environmental stresses are responsible for significant yield reduction in agricultural crops. In addition to many reports published previously, the relation between climate variation and production of corn and soybean throughout the United States for the period 1982-1998 was studied in recent years (Lobell and Asner, 2003). Gradual temperature changes have made a measurable impact on crop yield. In corn and soybean it has been estimated that yield is reduced by 17% per degree as the growth temperature rises above the season optimum. With a predicted temperature increase of 1.4° C. to 5.8° C. between the years 1990 and 2010 (IPCC Working Group I, 2001), improvement of high temperature tolerance in crop plants has become one of the major focuses of agricultural biotechnology development.
[0004]Both monocots and dicots are particularly sensitive to heat stress during flowering and seed development and therefore heat stress has a significant impact on seed yield (Young et al., 2004; Sato et al., 2002; Angadi et al., 2000; Carlson, 1990; Wahid, A., Gelani, S., Ashraf, M., and Foolad, M. R. (2007)). It has been suggested that plants possess an inherent ability for basal and acquired thermotolerance and that a common heat response mechanisms is present in diverse plant species (Kapoor et al., 1990; Vierling, 1991; Flahaut et al., 1996; Burke et al., 2000; Hong and Vieling, 2000; Massie et al., 2003; Larkindale et al., 2005). Basal thermotolerance allows plants survive from exposure to temperature above optimal for growth, whereas acquired thermotolerance is induced by a short acclimation period at a sub-lethal heat stress which enables a plant to survive a subsequent heat stress that would be otherwise lethal. A number of studies have been conducted to identify and characterize genes and pathways that are involved in plant thermotolerance. For example, heat shock transcription factors (HSF) and heat shock proteins (HSP) have received much attention to elucidate the roles and effects of these genes in response to heat stress as have plant growth hormones such as abscisic acid and ethylene.
[0005]Transcription factors are DNA binding proteins that interact with specific promoter or enhancer sequences and alter the gene expression of the associated gene. Where the specific sequence that binds the transcription factor is associated with a suite of genes whole pathways can be coordinately regulated with various component genes being simultaneously up-regulated or down-regulated. A transcription factors may coordinately alter a suite of genes in response to a stimulus such as an environmental stress, nutritional status or pathogen attack, for example, or can be a component of a signaling pathway, such as a hormone signaling pathway for example. Transcription factors posses a modular structure and are classified primarily on the basis of the DNA binding domain.
[0006]The MYB family of transcription factors is composed of at least 198 genes (Yanhui et al. 2006) and has been proposed to have regulatory functions in a wide array of processes ranging from growth and development to defense responses. Plant MYB proteins are classified based on the presence and number of imperfect MYB repeats each composed of about 52 amino acids. The MYB domain forms a helix-turn-helix conformation and represents the DNA binding domain. Three major groups of MYB proteins have been classified as R1R2R3-MYB, R2R3-MYB and MYB-related proteins.
[0007]The R2R3-MYB family of proteins in Arabidopsis consists of 125 proteins and is characterized by having a R2R3DNA binding domain at their N-terminus (Kranz et al., 1998, and Stracke et al., 2001). These genes are involved in a number of biological processes including mediating hormone actions, secondary metabolism (Paz-Ares et al., 1987), control of cell morphogenesis (Oppenheimer et al., 1991), meristem, floral and seed development (Kirik et al., 1998, Schmitz et al., 2002) and response to various environmental factors (Kranz et al., 1998; Jin and Martin, 1999; Meissner et al, 1999).
[0008]MYB sequences have been further classified into a number of subgroups based on sequence (Krantz et al. 1998, Stracke et al 2001). MYB68 falls within subgroup14 as does MYB36 and MYB84 as identified by Krantz et al 1998. However, Stracke et al 2001, have additionally include the MYB37, MYB38 and MYB87 in subgroup14. Stracke further notes that there are several cases of functional conservation of genes that cluster together in the dendrogram.
[0009]Classification of the R2R3-MYB family has identified 125 MYB proteins in Arabidopsis thaliana (At). A R2R3 MYB gene is characterized by a MYB domain containing two imperfect repeats of 53 aa (R2, and R3). Each repeat contains three helix-turn-helix structures. The R2 and R3 domains are located near the N-terminus of the proteins. The last two helices on each repeat with a loop between them form a DNA-binding motif structure similar to HLH proteins. The third helix directly binds to DNA, and the first and second helices contribute to the conformation of the HLH motif that appears to be important in recognition of a specific gene target (Ogata et al., 1994; William and Grotewold, 1997; Jia et al., 2004). The R2R3-MYB proteins were further characterized into 22 subgroups according to their phylogenetic relationship based on at least one of the shared amino acid motifs in addition to the MYB domain (Kranz et al., 1998). AtMYB68, AtMYB84, and AtMYB36 were categorized as subgroup14 based on two shared motifs: S1: SFSQLLLDPN SEQ ID NO:266 and S2: TSTSADQSTISWEDI SEQ ID NO:267, at the C-terminus of the proteins. The homology at these motifs was limited, for example, Arabidopsis MYB36 has only 20% identity. Subsequently, AtMYB87, AtMYB37 and AtMYB38 were also included in subgroup14, on the basis of sequence conservation in the MYB DNA domain: R2 and R3 helix-turn-helix repeats (Stracke et al., 2001).
[0010]The R2R3 domains may be indicative of specific DNA binding through the unique amino acid sequence of the third helix of the R3 domain and minor conformational changes associated with the structural interaction between the first two helices. It suggests that subgroup14 members may be functionally redundant orthologous. For example, lateral meristem initiation in Arabidopsis was studied with respect to MYB-subgroup14 (Muller et al., 2006). All members of MYB-subgroup14 showed high similarity to the tomato Blind (Bl) gene, a regulator of axillary meristems. Transcripts of four members: AtMYB37, AtMYB38, AtMYB84 and AtMYB87 were detected by RT-PCR in tissues including shoot tip, internode, leaf, flower bud, open flower, and root, whereas AtMYB36 and AtMYB68 expression was expressed in root tissue. Phenotypic analysis using knockouts of AtMYB37, AtMYB38 and AtMYB84 indicated that these members of MYB-subgroup14 at least partially redundant for regulating axillary bud formation.
[0011]MYB68 is a R2R3 type MYB gene, and a member of MYB-subgroup14, that has been identified in a transposon gene trapping study (Feng et al., 2004). Expression of this gene has been demonstrated to be specific to root pericycle cells. In the null mutant, no MYB68 mRNA was detectable; however, no mutant phenotype was exhibited when plants were grown under standard conditions. In the evaluation of MYB68 under a variety of growth conditions the only phenotype discerned was reduced plant leaf area when plants were grown under hot greenhouse conditions (30-40° C.). This phenotype was rescued by transformation of the myb68 mutant background with a wild-type MYB68 gene. Examination root tissue of the myb68 mutant grown in root cultures indicated increased biomass and lignin levels. The authors conclude that MYB68 is involved in root development (Feng et al., 2004).
[0012]Transcriptional activation is primarily mediated through transcription factors that interact with enhancer and promoter elements. Binding of transcription factors to such DNA elements constitutes a crucial step in transcriptional initiation. Each transcription factor binds to its specific binding sequence in a promoter and activates expression of the linked coding region through interactions with coactivators and/or proteins that are a part of the transcription complex.
SUMMARY OF THE INVENTION
[0013]This invention relates to a method for enhancing the heat stress tolerance of plants by means of increasing the expression of a MYB subgroup-14 polypeptide. Enhanced heat stress tolerance includes improved seed set during and following conditions of heat stress. Improved seed set results in increased yield. A MYB-subgroup-14 polypeptide includes for example a MYB68, a MYB36, a MYB84, a MYB37, aMYB38 or a MYB87 polypeptide. Preferably, the MYB-subgroup-14 polypeptide is a MYB68, a MYB36 or a MYB84 polypeptide. The MYB subgroup-14 polypeptide expression is ectopic, or constitutive. Alternatively, expression of the MYB subgroup-14 polypeptide in its typical place of expression, e.g. root tissue.
[0014]A heat stress heat stress tolerant plant is produced by providing a nucleic acid construct that increases the expression of a Myb subgroup-14 polypeptide, inserting the nucleic construct into a vector, transforming a plant, tissue culture, or a plant cell with the vector to obtain a plant, tissue culture or a plant cell with increased expression of the Myb subgroup-14 polypeptide and growing said plant or regenerating a plant from the tissue culture or plant cell. A nucleic acid construct that increases the expression of a Myb subgroup-14 polypeptide includes for example an enhancer element. An enhancer is a sequence found in eukaryotes and certain eukaryotic viruses which can increase transcription of a gene when located, in either orientation, up to several kilobases from the gene concerned. These sequences act as enhancers when on the 5' side (upstream) of the gene in question. However, some enhancers are active when placed on the 3' side (downstream) of the gene. The enhancer elements can activate transcription of a gene and alter the normal expression pattern of the endogenous gene. Enhancer elements are known to those skilled in the art. For example the enhancer element is a 35S enhancer element.
[0015]Additionally, a nucleic acid construct that increases the expression of a Myb subgroup-14 polypeptide includes for example a nucleic acid encoding a Myb subgroup-14 polypeptide. Exemplary, MYB polypeptides and nucleic acids include those of SEQ ID NO: 1-265. The nucleic acid encoding a Myb subgroup-14 polypeptide is operably linked to a promoter. The promoter is a heterologous promoter or a homologous promoter. Additionally, the promoter is a constitutive or an inducible promoter.
[0016]By increasing the expression of a MYB subgroup-14 polypeptide is meant that the amount produced by the cell transformed with the nucleic acid construct is greater than a cell, e.g. control cell that is not transformed with the nucleic acid construct. A control cell includes for example a cell that endogenously expresses a MYB subgroup-14 polypeptide such a plant root cell, alternatively a control cell is a non transformed cell of the same cell-type as the transformed cell, be it a leaf cell a meristem cell or a flower or seed cell. An increase is a 1-fold, 2-fold, 3 fold or greater increase. An increase of expression is also meant to include expression of a MYB subgroup-14 polypeptide in a cell that does not typically produced by a cell.
[0017]Also included in the invention is a method of identifying a heat stress tolerant plant. The plants identified by these methods have reduced flower abortion and increased yield as compared to a control plant. Heat stress tolerant plants are identified by exposing a population of flowering plants to a heat stress treatment and selecting a plant from the population of plants that has reduced flower abortion. Heat stress treatment includes for example exposing the plant to a temperature that is hot enough for a sufficient amount of time such that damage to plant functions or development results. By reduced flower abortion is meant that a plant does not loss as many flowers, due to flower abortion, or has a greater seed yield compared to another plant that is exposed to a similar level of heat stress. Plants with a reduced flower abortion have a 5, 10, 20, 25, 30% or more increase in seed yield as compared to a control plant.
[0018]The invention further includes the plants produced by the methods of the invention and the seed produced by the plants which produce a plant that has an increase tolerance to heat stress.
[0019]Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
[0020]Other features and advantages of the invention will be apparent from the following detailed description, and from the claims.
DETAILED DESCRIPTION
[0021]The invention is based upon the surprising discovery of plants that have an increased tolerance to heat stress which results in an increased yield relative to a wild-type control. More specifically, the invention is based upon the discovery that increasing the expression of a MYB-subgroup14 polypeptide (e.g., MYB68) results in a plant having an increased resistance to heat stress.
[0022]Expression of a MYB-subgroup 14 polypeptide can be accomplished for example by increasing the expression of an endogenous MYB-subgroup 14 polypeptide (e.g., activation tag insertion) or by expression of an exogenous gene construct encoding for a MYB-subgroup 14 polypeptide. The gene encoding for the MYB-subgroup 14 polypeptide may be endogenous or exogenous to the transformed species. As shown in the EXAMPLES plants having an increases resistance to heat stress were produced not only transforming a plant with its native MYB-subgroup 14 polypeptide but also with a MYB-subgroup 14 polypeptide from another plant species.
[0023]Accordingly the invention provides methods of enhancing (e.g., increasing) the heat stress tolerance of plants by increasing the expression of a MYB subgroup-14 polypeptide. Also included in the invention is a method of identifying a heat stress tolerant plant. The plants identified by these methods have reduced flower abortion and increased yield as compared to a control plant. Heat stress tolerant plants are identified by exposing the population of flowering plants to a heat stress treatment and selecting a plant from the population of plants that has reduced flower abortion. The invention also includes the transgenic plants produced by the methods of the invention and the seeds produced by the transgenic plants that produce a heat stress tolerant plant.
[0024]Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In the case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
[0025]For convenience, before further description of the present invention, certain terms employed in the specification, examples and appended claims are defined herein. These definitions should be read in light of the remainder of the disclosure and as understood by a person of ordinary skill in the art.
[0026]The term "constitutive expression" means expression of a gene in any cell at constant levels in a non-regulated manner.
[0027]The terms "cMYB" and "MYB" refer to a cDNA clone of MYB and are used interchangeably. Where a genomic sequence has been used or referred to, it is identified and differentiated by the term "gMYB" or "genomic MYB" thereby referring to a genomic MYB sequence.
[0028]The term "ectopic expression" means expression of a gene in an abnormal place in an organism relative to the endogenous gene expression. Ectopic expression may include constitutively expressed genes depending on the native expression patterns of a given gene.
[0029]The term "expression cassette" means a vector construct wherein a gene is transcribed. Additionally, the expressed mRNA may be translated into a polypeptide.
[0030]The terms "expression" or "over-expression" are used interchangeably and means the expression of a gene such that the transgene is expressed. The total level of expression in a cell may be elevated relative to a wild-type cell.
[0031]"Flower abortion" means a flower that fails to develop and produce a fruit or seed. In addition to premature senescence of a flower, flower abortion may refer to loss of pollen production, altered pollination or fertilization and subsequent seed development. Altered growth and development of meristem tissue, a flower meristem in particular, is further included within the meaning of flower abortion.
[0032]The term "heat tolerance" is defined as a phenotype where a first plant, or plant line, has increased capacity to withstand elevated temperature and produce a yield that is in excess of a second plant or plant line, the second plant line being a control plant such as a wild-type control plant line.
[0033]A "promoter sequence", or "promoter", means a nucleic acid sequence capable of inducing transcription of an operably linked gene sequence in a plant cell.
[0034]The term "seed set" is seed formation as a result of flower pollination followed by egg cell fertilization and zygote development. Reductions in seed set which can occur due to interruption in any of the above processes will produce a net reduction in seed number produced.
[0035]The term "substantially similar" refers to nucleic acids where a change in one or more nucleotides does not alter the functional properties of the nucleic acid or the encoded polypeptide. Due to the degeneracy of the genetic code, a base pair change can result in no change in the encoded amino acid sequence. For example, the codons ACT, ACC, ACA and ACG all encode a threonine amino acid. Alternatively one or more base pair changes may alter the encoded amino acid however if the substituted amino acid has similar chemical properties functionality of the encoded protein is likely to be unaffected. For example, threonine codons ACT and ACC when changed to AGT or AGC respectively encode for serine, a chemically and biologically similar amino acid. Additionally, certain amino acids within a polypeptide are non essential and alterations may be made in these locations without an effect on the functionality of the polypeptide. Substantially similar also refers to sequences having changes at one or more nucleotide bases wherein the changes do not affect the ability of the sequence to alter gene expression by various gene silencing methodologies such as antisense, RNAi or co-suppression. The term "substantially similar" refers to polypeptides wherein a change in one or more amino acids does not alter the functional properties of the polypeptide as discussed above.
[0036]The term "yield" refers to seed number, seed weight, seed size, total plant biomass, increased biomass of a plant organ, such as stems or leaves or roots, fruit production, and flower production,
[0037]The term "yield protection" is defined as the positive difference, expressed as a % value, between the yield of the transgenic or mutant and the control, where the yield is expressed as a % of optimal, following an imposed stress. The calculation is done by comparing the optimal yield with that after the stress treatment (stress yield/optimal yield×100).
[0038]The MYB gene family is classified based on sequence homology and the presence of defined domains and motifs such as an R2R3 domain. The classification in all cases is not absolute and varies depending on the criteria selected for the analysis (Krantz et al 1998, Stracke et al 2001).
[0039]Herein we define the MYB-subgroup14 to include at least the following members, MYB68, MYB36, MYB84, MYB37, MYB38 and MYB87. The Arabidopsis MYB68, MYB36, MYB84, MYB37, MYB38 and MYB87 sequences are used to identify homologues from other species according to the methods herein, examples of which are included in Table 1
[0040]The term "MYB sequence" refers to a polynucleotide sequence or a polypeptide sequence as contextually appropriate.
Sequences
[0041]The following sequences from the MYB-subgroup14 family, and corresponding sequence identifiers, are employed throughout the specification, examples and appended claims:
TABLE-US-00001 TABLE 1 SEQ ID Accession Number MYB NO: SPECIES Reference Identification 1 ARABIDOPSIS THALIANA NM_125976.2 MYB68 NT 2 ARABIDOPSIS THALIANA NP_201380.1 MYB68 AA 3 ARABIDOPSIS THALIANA NM_114829.3 MYB84 NT 4 ARABIDOPSIS THALIANA NP_190538.1 MYB84 AA 5 ARABIDOPSIS THALIANA NM_125143.3 MYB36 NT 6 ARABIDOPSIS THALIANA NP_200570.1 MYB36 AA 7 BRASSICA RAPA MYB68 NT 8 BRASSICA RAPA MYB68 AA 9 ORYZA SATIVA NM_001057941.1 MYB36 NT 10 ORYZA SATIVA AAT85046.1 MYB36 AA 11 GOSSYPIUM TC34239 MYB68 NT 12 GOSSYPIUM TC34239_ORF MYB68 AA 13 GLYCINE MAX DQ822965.1 MYB84 NT 14 GLYCINE MAX ABH02906.1 MYB84 AA 15 GLYCINE MAX MYB84 NT 16 GLYCINE MAX MYB84 AA 17 ZEA MAYS TC370133 MYB84 NT 18 ZEA MAYS TC370133_ORF MYB84 AA 19 SORGHUM BICOLOR AF474127 MYB36 NT 20 SORGHUM BICOLOR AAL84760.1 MYB36 AA 21 TRITICUM AESTIVUM BQ483726 MYB84 NT 22 TRITICUM AESTIVUM BQ483726_ORF MYB84 AA 23 POPULUS TC54478 MYB84 NT 24 POPULUS TC54478_ORF MYB84 AA 25 MEDICAGO TC97441 MYB68 NT TRUNCATULA 26 MEDICAGO TC97441_ORF MYB68 AA TRUNCATULA 27 SOLANUM AF426174.1 MYB36 NT LYCOPERSICUM 28 SOLANUM AAL69334.1 MYB36 AA LYCOPERSICUM 29 SOLANUM BG134669 MYB36 NT LYCOPERSICUM 30 SOLANUM BG134669_ORF MYB36 AA LYCOPERSICUM 31 ARABIDOPSIS THALIANA NM_119940.3 MYB87 NT 32 ARABIDOPSIS THALIANA NP_195492.2 MYB87 AA 33 ARABIDOPSIS THALIANA NM_122206.3 MYB37 NT 34 ARABIDOPSIS THALIANA NP_197691.1 MYB37 AA 35 ARABIDOPSIS THALIANA NM_129245.2 MYB38 NT 36 ARABIDOPSIS THALIANA NP_181226.1 MYB38 AA 37 AEGILOPS SPELTOIDES BQ841600.1 MYB36 NT 38 ANTIRRHINUM MAJUS AJ794728.1 MYB68 NT 39 ANTIRRHTNUM MAJUS AJ794728.1_ORF MYB68 AA 40 AQUILEGIA TC13008 MYB84 NT 41 AQUILEGIA TC13008_ORF MYB84 AA 42 AQUILEGIA TC11167 MYB36 NT 43 AQUILEGIA TC11167_ORF MYB36 AA 44 ARACHIS HYPOGAEA CD038321.1 MYB68 NT 45 ARACHIS HYPOGAEA CD038321.1_ORF MYB68 AA 46 ARACHIS HYPOGAEA ES761155.1 MYB68 NT 47 ARACHIS HYPOGAEA ES761155.1_ORF MYB68 AA 48 ARACHIS STENOSPERMA EH046152.1 MYB36 NT 49 ARACHIS STENOSPERMA EH046152.1_ORF MYB36 AA 50 BRACHYPODIUM DV486330.1 MYB38 NT DISTACHYON 51 BRACHYPODIUM DV486330.1_ORF MYB38 AA DISTACHYON 52 BRACHYPODIUM DV488965.1 MYB38 NT DISTACHYON 53 BRACHYPODIUM DV488965.1_ORF MYB38 AA DISTACHYON 54 BRASSICA NAPUS (bud) MYB68 NT 55 BRASSICA NAPUS (bud) MYB68 AA 56 BRASSICA NAPUS (root) MYB68 NT 57 BRASSICA NAPUS (root) MYB68 AA 58 BRASSICA NAPUS TC40384 MYB68 NT 59 BRASSICA NAPUS ES900275.1 MYB68 NT 60 BRASSICA NAPUS ES900275.1_ORF MYB68 AA 61 BRASSICA NAPUS TC55899 MYB38 NT 62 BRASSICA NAPUS TC55899_ORF MYB38 AA 63 BRASSICA RAPA EX134980.1 MYB68 NT 64 BRASSICA RAPA EX134980.1_ORF MYB68 AA 65 BRASSICA RAPA EX137439.1 MYB68 NT 66 BRASSICA RAPA EX137439.1_ORF MYB68 AA 67 CARTHAMUS EL384492.1 MYB36 NT TINCTORIUS 68 CARTHAMUS EL384492.1_ORF MYB36 AA TINCTORIUS 69 CARTHAMUS EL392277.1 MYB36 NT TINCTORIUS 70 CARTHAMUS EL392277.1_ORF MYB36 AA TINCTORIUS 71 CENTAUREA MACULOSA EH724496.1 MYB36 NT 72 CENTAUREA MACULOSA EH724496.1_ORF MYB36 AA 73 CENTAUREA MACULOSA EH719165.1 MYB36 NT 74 CENTAUREA MACULOSA EH719165.1_ORF MYB36 AA 75 CENTAUREA MACULOSA EH724438.1 MYB68 NT 76 CENTAUREA MACULOSA EH724438.1_ORF MYB68 AA 77 CENTAUREA EH774519.1 MYB68 NT SOLSTITIALIS 78 CENTAUREA EH774519.1_ORF MYB68 AA SOLSTITIALIS 79 CENTAUREA EH771972.1 MYB68 NT SOLSTITIALIS 80 CENTAUREA EH771972.1_ORF MYB68 AA SOLSTITIALIS 81 CENTAUREA EH768792.1 MYB84 NT SOLSTITIALIS 82 CENTAUREA EH768792.1_ORF MYB84 AA SOLSTITIALIS 83 CICHORIUM ENDIVIA EL361859.1 MYB84 NT 84 CICHORIUM ENDIVIA EL361859.1_ORF MYB84 AA 85 CICHORIUM INTYBUS EH681135.1 MYB38 NT 86 CICHORIUM INTYBUS EH681135.1_ORF MYB38 AA 87 CICHORIUM INTYBUS EH694860.1 MYB68 NT 88 CITRUS SINENSIS CK936024.1 MYB68 NT 89 CITRUS SINENSIS CK936024.1_ORF MYB68 AA 90 COFFEA CANEPHORA DV692261.1 MYB37 NT 91 COFFEA CANEPHORA DV691112.1 MYB37 NT 92 CUCUMIS MELO AM727197.2 MYB36 NT 93 CUCUMIS MELO AM727197.2_ORF MYB36 AA 94 CUCUMIS MELO AM716075.2 MYB36 NT 95 CUCUMIS MELO AM716075.2_ORF MYB36 AA 96 DAUCUS CAROTA AB298508.1 MYB68 NT 97 DAUCUS CAROTA BAF49444.1 MYB68 AA 98 ELAEIS GUINEENSIS EL690464.1 MYB84 NT 99 ELAEIS GUINEENSIS EL690464.1_ORF MYB84 AA 100 ELAEIS OLEIFERA ES370938.1 MYB84 NT 101 ELAEIS OLEIFERA ES370938.1_ORF MYB84 AA 102 ESCHSCHOLZIA CD480801.1 MYB68 NT CALIFORNICA 103 ESCHSCHOLZIA CD480801.1_ORF MYB68 AA CALIFORNICA 104 EUPHORBIA ESULA DV138530.1 MYB84 NT 105 EUPHORBIA ESULA DV138530.1_ORF MYB84 AA 106 EUPHORBIA ESULA DV126436.1 MYB36 NT 107 EUPHORBIA ESULA DV126436.1_ORF MYB36 AA 108 EUPHORBIA TIRUCALLI BP958179.1 MYB84 NT 109 GINKGO BILOBA EX940876.1 MYB68 NT 110 GLYCINE MAX MYB84 NT 111 GLYCINE MAX MYB84 AA 112 GLYCINE MAX TC213651 MYB84 NT 113 GLYCINE MAX TC213651_ORF MYB84 AA 114 GLYCINE MAX DQ822971.1 MYB36 NT 115 GLYCINE MAX ABH02912.1 MYB36 AA 116 GLYCINE MAX TC211227 MYB36 NT 117 GLYCINE MAX TC211227_ORF MYB36 AA 118 GOSSYPIUM TC62721 MYB68 NT 119 GOSSYPIUM TC62721_ORF MYB68 AA 120 GOSSYPIUM DW491290.1 MYB36 NT 121 GOSSYPIUM DW491290.1_ORF MYB36 AA 122 HEDYOTIS TERMINALIS CB077617.1 MYB84 NT 123 HEDYOTIS TERMINALIS CB077617.1_ORF MYB84 AA 124 HELIANTHUS ANNUUS BQ967558 MYB36 NT 125 HELIANTHUS EE621630.1 MYB36 NT ARGOPHYLLUS 126 HELIANTHUS EE621630.1_ORF MYB36 AA ARGOPHYLLUS 127 HELIANTHUS EE619500.1 MYB36 NT ARGOPHYLLUS 128 HELIANTHUS EE619500.1_ORF MYB36 AA ARGOPHYLLUS 129 HELIANTHUS CILIARIS EL422629.1 MYB68 NT 130 HELIANTHUS EXILIS EE645503.1 MYB68 NT 131 HELIANTHUS EXILIS EE645503.1_ORF MYB68 AA 132 HELIANTHUS EXILIS EE646813.1 MYB36 NT 133 HELIANTHUS EXILIS EE646813.1_ORF MYB36 AA 134 HELIANTHUS EL474327.1 MYB84 NT PARADOXUS 135 HELIANTHUS PETIOLARIS DY942970.1 MYB84 NT 136 HELIANTHUS PETIOLARIS DY942970.1_ORF MYB84 AA 137 HELIANTHUS PETIOLARIS DY953493.1 MYB68 NT 138 HELIANTHUS PETIOLARIS DY953493.1_ORF MYB68 AA 139 HELIANTHUS EL445341.1 MYB36 NT TUBEROSUS 140 HELIANTHUS EL445341.1_ORF MYB36 AA TUBEROSUS 141 HORDEUM VULGARE BY845215.1 MYB38 NT 142 HORDEUM VULGARE BY845215.1_ORF MYB38 AA 143 HUMULUS LUPULUS AJ876882.1 MYB36 NT 144 HUMULUS LUPULUS CAI46244.1 MYB36 AA 145 LACTUCA PERENNIS DW092247.1 MYB84 NT 146 LACTUCA PERENNIS DW092247.1_ORF MYB84 AA 147 LACTUCA SALIGNA DW065247.1 MYB68 NT 148 LACTUCA SALIGNA DW065247.1_ORF MYB68 AA 149 LACTUCA SATIVA DY960463.1 MYB38 NT 150 LACTUCA SATIVA DY960463.1_ORF MYB38 AA 151 LACTUCA SATIVA DY969483.1 MYB38 NT 152 LACTUCA SATIVA DY969483.1_ORF MYB38 AA 153 LACTUCA SATIVA DY980672.1 MYB38 NT 154 LACTUCA SATIVA DY980672.1_ORF MYB38 AA 155 LACTUCA VIROSA DW108054.1 MYB38 NT 156 LACTUCA VIROSA DW160139.1 MYB84 NT 157 LACTUCA VIROSA DW160139.1_ORF MYB84 AA 158 LACTUCA VIROSA DW160891.1 MYB38 NT 159 LACTUCA VIROSA DW160891.1_ORF MYB38 AA 160 LIRIODENDRON CO998829.1 MYB38 NT TULIPIFERA 161 LIRIODENDRON CO998829.1_ORF MYB38 AA TULIPIFERA 162 MALUS DOMESTICA DT002401.1 MYB36 NT 163 MALUS DOMESTICA DT002401.1_ORF MYB36 AA 164 MALUS DOMESTICA DQ074472.1 MYB38 NT 165 MALUS DOMESTICA AAZ20440.1 MYB38 AA 166 MANIHOT ESCULENTA DB936694.1 MYB68 NT 167 MANIHOT ESCULENTA DB936694.1_ORF MYB68 AA 168 MARCHANTIA BJ846153.1 MYB84 NT POLYMORPHA 169 MARCHANTIA BJ846153.1_ORF MYB84 AA POLYMORPHA 170 MEDICAGO TC110497 MYB36 NT TRUNCATULA 171 MEDICAGO TC110497_ORF MYB36 AA TRUNCATULA 172 MEDICAGO BF634640 MYB84 NT TRUNCATULA 173 MEDICAGO BF634640_ORF MYB84 AA TRUNCATULA 174 NUPHAR ADVENA CD472544.1 MYB36 NT 175 NUPHAR ADVENA CD472544.1_ORF MYB36 AA 176 ORYZA SATIVA LOC_Os01g09590.1_cds MYB38 NT 177 ORYZA SATIVA LOC_Os01g09590.1 MYB38 AA 178 ORYZA SATIVA LOC_Os01g49160.1_cds MYB36 NT 179 ORYZA SATIVA LOC_Os01g49160.1 MYB36 AA 180 ORYZA SATIVA LOC_Os01g52410.1_cds MYB38 NT 181 ORYZA SATIVA LOC_Os01g52410.1 MYB38 AA 182 ORYZA SATIVA LOC_Os02g54520.1_cds MYB36 NT 183 ORYZA SATIVA LOC_Os02g54520.1 MYB36 AA 184 ORYZA SATIVA LOC_Os05g48010.1_cds MYB36 NT 185 ORYZA SATIVA LOC_Os05g48010.1 MYB36 AA 186 ORYZA SATIVA LOC_Os08g15020.1_cds MYB36 NT 187 ORYZA SATIVA LOC_Os08g15020.1 MYB36 AA 188 ORYZA SATIVA LOC_Os09g26170.1_cds MYB36 NT 189 ORYZA SATIVA LOC_Os09g26170.1 MYB36 AA 190 ORYZA SATIVA LOC_Os10g35660.1_cds MYB36 NT 191 ORYZA SATIVA LOC_Os10g35660.1 MYB36 AA 192 PICEA EX361512.1 MYB68 NT 193 PICEA EX361512.1_ORF MYB68 AA 194 PICEA TC20498 MYB68 NT 195 PICEA TC20498_ORF MYB68 AA 196 PINUS DR015810 MYB84 NT 197 PINUS DR015810_ORF MYB84 AA 198 PINUS TC66643 MYB68 NT 199 PINUS TC66643_ORF MYB68 AA 200 PONCIRUS TRIFOLIATA CD575120.1 MYB38 NT 201 PONCIRUS TRIFOLIATA CD575120.1_ORF MYB38 AA 202 POPULUS Gw1.II.96.1 MYB68 NT 203 POPULUS Gw1.II.96.1_ORF MYB68 AA 204 POPULUS DB879439.1 MYB84 NT 205 POPULUS DB879439.1_ORF MYB84 AA 206 POPULUS TC74579 MYB36 NT 207 POPULUS TC74579_ORF MYB36 AA
208 QUERCUS PETRAEA CU639795.1 MYB36 NT 209 QUERCUS PETRAEA CU639795.1_ORF MYB36 AA 210 QUERCUS SUBER EE743680.1 MYB84 NT 211 RAPHANUS FD544184.1 MYB68 NT RAPHANISTRUM 212 RAPHANUS FD544184.1_ORF MYB68 AA RAPHANISTRUM 213 RAPHANUS EY915531.1 MYB68 NT RAPHANISTRUM 214 RAPHANUS FD540311.1 MYB68 NT RAPHANISTRUM 215 RAPHANUS FD540311.1_ORF MYB68 AA RAPHANISTRUM 216 RAPHANUS EV548164.1 MYB38 NT RAPHANISTRUM 217 RAPHANUS SATIVUS FD580369.1 MYB68 NT 218 ROSA HYBRID EC587279.1 MYB68 NT 219 ROSA HYBRID EC587279.1_ORF MYB68 AA 220 SACCHARUM CA150911 MYB36 NT OFFICINARUM 221 SACCHARUM CA150911_ORF MYB36 AA OFFICINARUM 222 SACCHARUM CA258665 MYB84 NT OFFICINARUM 223 SACCHARUM CA258665_ORF MYB84 AA OFFICINARUM 224 SACCHARUM TC44677 MYB36 NT OFFICINARUM 225 SACCHARUM TC44677_ORF MYB36 AA OFFICINARUM 226 SECALE CEREALE BE495537 MYB38 NT 227 SECALE CEREALE BE495537_ORF MYB38 AA 228 SOLANUM TC182203 MYB36 NT LYCOPERSICUM 229 SOLANUM TC182203_ORF MYB36 AA LYCOPERSICUM 230 SOLANUM TUBEROSUM AM907873.1 MYB36 NT 231 SOLANUM TUBEROSUM AM907873.1_ORF MYB36 AA 232 SORGHUM BICOLOR TC98185 MYB36 NT 233 SORGHUM BICOLOR TC98185_ORF MYB36 AA 234 SORGHUM BICOLOR TC101637 MYB36 NT 235 SORGHUM BICOLOR TC101637_ORF MYB36 AA 236 SORGHUM PROPINQUUM BG560270.1 MYB36 NT 237 SORGHUM PROPINQUUM BG560270.1_ORF MYB36 AA 238 TARAXACUM DY830100.1 MYB68 NT OFFICINALE 239 TARAXACUM DY830100.1_ORF MYB68 AA OFFICINALE 240 TRIPHYSARIA PUSILLA EY172046.1 MYB68 NT 241 TRIPHYSARIA PUSILLA EY172046.1_ORF MYB68 AA 242 TRIPHYSARIA PUSILLA EY179359.1 MYB36 NT 243 TRIPHYSARIA PUSILLA EY179359.1_ORF MYB36 AA 244 TRIPHYSARIA PUSILLA EY174724.1 MYB38 NT 245 TRIPHYSARIA PUSILLA EY174724.1_ORF MYB38 AA 246 TRIPHYSARIA EX989121.1 MYB38 NT VERSICOLOR 247 TRIPHYSARIA EY018825.1 MYB38 NT VERSICOLOR 248 TRIPHYSARIA EY018825.1_ORF MYB38 AA VERSICOLOR 249 TRITICUM AESTIVUM MYB84 NT 250 TRITICUM AESTIVUM MYB84 AA 251 VACCINIUM CV090776.1 MYB36 NT CORYMBOSUM 252 VACCINIUM CV090776.1_ORF MYB36 AA CORYMBOSUM 253 VITIS VINIFERA CAO70108.1_cds MYB84 NT 254 VITIS VINIFERA CAO70108.1 MYB84 AA 255 VITIS VINIFERA CAO43296.1_cds MYB36 NT 256 VITIS VINIFERA CAO43296.1 MYB36 AA 257 VITIS VINIFERA CAO61524.1_cds MYB84 NT 258 VITIS VINIFERA CAO61524.1 MYB84 AA 259 VITIS VINIFERA DT006424 MYB36 NT 260 VITIS VINIFERA DT006424_ORF MYB36 AA 261 ZEA MAYS MYB36 NT 262 ZEA MAYS MYB36 AA 263 ZEA MAYS TC320820 MYB36 NT 264 ZEA MAYS TC320820_ORF MYB36 AA 265 ORYZA SATIVA MYB36 NT
Determining Homology Between Two or More Sequences
[0042]To determine the percent homology of two amino acid sequences or of two nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in either of the sequences being compared for optimal alignment between the sequences). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are homologous at that position (i.e., as used herein amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity").
[0043]The nucleic acid sequence homology may be determined as the degree of identity between two sequences. The homology may be determined using computer programs known in the art, such as GAP software provided in the GCG program package. See, Needleman and Wunsch 1970 J Mol Biol 48: 443-453. Using GCG GAP software with the following settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the coding sequence (encoding) part of the DNA sequence shown in Table 1.
[0044]The term "sequence identity" refers to the degree to which two polynucleotide or polypeptide sequences are identical on a residue-by-residue basis over a particular region of comparison. The term "percentage of sequence identity" is calculated by comparing two optimally aligned sequences over that region of comparison, determining the number of positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the region of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. The term "substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually at least 99 percent sequence identity as compared to a reference sequence over a comparison region. The term "percentage of positive residues" is calculated by comparing two optimally aligned sequences over that region of comparison, determining the number of positions at which the identical and conservative amino acid substitutions, as defined above, occur in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the region of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of positive residues.
Recombinant Expression Vectors and Host Cells
[0045]Another aspect of the invention pertains to vectors, preferably expression vectors, containing a nucleic acid encoding a MYB-subgroup14 protein, gene, analogs or homologs thereof. The sequence encoding a MYB-subgroup14 polypeptide may be a genomic sequence or a cDNA sequence. As used herein the term expression vector includes vectors which are designed to provide transcription of the nucleic acid sequence. The transcribed nucleic acid may be translated into a polypeptide or protein product. As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication). Other vectors are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as "expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors, such as viral vectors or plant transformation vectors, binary or otherwise, which serve equivalent functions.
[0046]The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably-linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell).
[0047]The term "regulatory sequence" is intended to include promoters, enhancers and other expression control elements (e.g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences) or inducible promoters (e.g., induced in response to abiotic factors such as environmental conditions, heat, drought, nutrient status or physiological status of the cell or biotic such as pathogen responsive). Examples of suitable promoters include for example constitutive promoters, ABA inducible promoters, tissue specific promoters and abiotic or biotic inducible promoters. It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired as well as timing and location of expression, etc. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., MYB-subgroup14 proteins such as MYB68 proteins, mutant forms of MYB68 proteins, fusion proteins, etc.).
[0048]The recombinant expression vectors of the invention can be designed for expression of a MYB-subgroup14 gene or a MYB-subgroup14 protein in prokaryotic or eukaryotic cells. For example, a MYB-subgroup14 gene or a MYB-subgroup14 protein can be expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression vectors) yeast cells, plant cells or mammalian cells. Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.
[0049]In one embodiment, a nucleic acid of the invention is expressed in plants cells using a plant expression vector. Examples of plant expression vectors systems include tumor inducing (Ti) plasmid or portion thereof found in Agrobacterium, cauliflower mosaic virus (CAMV) DNA and vectors such as pBI121, a pCAMBA series vector or one of preferred choice to a person skilled in the art.
[0050]For expression in plants, the recombinant expression cassette will contain in addition to a MYB-subgroup14 polynucleotide, a promoter region functional in a plant cell, a transcription initiation site (if the coding sequence to transcribed lacks one), and a transcription termination/polyadenylation sequence. The termination/polyadenylation region may be obtained from the same gene as the promoter sequence or may be obtained from different genes. Unique restriction enzyme sites at the 5' and 3' ends of the cassette are typically included to allow for easy insertion into a pre-existing vector.
[0051]Examples of suitable plant expressible promoters include promoters from plant viruses such as the 35S promoter from cauliflower mosaic virus (CaMV) (Odell, et al., Nature, 313: 810-812 (1985)), promoters from genes such as rice actin (McElroy, et al., Plant Cell, 163-171 (1990)), ubiquitin (Christensen, et al., Plant Mol. Biol., 12: 619-632 (1992); and Christensen, et al., Plant Mol. Biol., 18: 675-689 (1992)), pEMU (Last, et al., Theor. Appl. Genet., 81: 581-588 (1991)), MAS (Velten, et al., EMBO J., 3: 2723-2730 (1984)), maize H3 histone (Lepetit, et al., Mol. Gen. Genet., 231: 276-285 (1992); and Atanassvoa, et al., Plant Journal, 2(3): 291-300 (1992)), the 5'- or 3'-promoter derived from T-DNA of Agrobacterium tumefaciens, the Smas promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Pat. No. 5,683,439), the Nos promoter, the rubisco promoter, the GRP1-8 promoter, ALS promoter, (WO 96/30530), a synthetic promoter, such as Rsyn7, SCP and UCP promoters, ribulose-1,3-diphosphate carboxylase, fruit-specific promoters, heat shock promoters, seed-specific promoters and other transcription initiation regions from various plant genes, for example, including the various opine initiation regions, such as for example, octopine, mannopine, and nopaline. Useful promoters also include heat inducible promoters such as the HSP18.2 or HSP81.1 promoters (Takahashi et al. 1992, Plant J. 2, 751-761; Yoshida et al., 1995, Appl. Microbiol. Biotechnol. 44, 466-472; Ueda et al., 1996, Mol Gen Genet. 250, 533-539). Cryptic promoters are also useful for chimeric constructs useful in the invention. Cryptic gene regulatory elements are inactive at their native locations in the genome but are fully functional when positioned adjacent to genes in transgenic plants.
[0052]In addition to chimeric promoter-gene constructs the use of a native MYB-subgroup14 promoter is contemplated. Expression characteristics of a native promoter may be modified by inclusion of regulatory elements such that expression levels are elevated and or expressed ectopically and or constitutively. For example, a 4×35S enhancer sequence (Wiegel et al., 2000) may be included in a construct to enhance expression. Alternatively a population of plants may be produced by transformation with a construct having a 4×35S enhancer sequence, such as, a pSKI15 vector as per Wiegel et al., 2000. The transformed population can be screened for plants having increased expression of a MYB-subgroup14 sequence, or screened for plants having increased heat tolerance and reduced flower abortion, or a combination of such screens to identify a plant of interest.
[0053]Additional regulatory elements that may be connected to a MYB-subgroup14 encoding nucleic acid sequence for expression in plant cells include terminators, polyadenylation sequences, and nucleic acid sequences encoding signal peptides that permit localization within a plant cell or secretion of the protein from the cell. Such regulatory elements and methods for adding or exchanging these elements with the regulatory elements of a MYB-subgroup14 gene are known, and include, but are not limited to, 3' termination and/or polyadenylation regions such as those of the Agrobacterium tumefaciens nopaline synthase (nos) gene (Bevan, et al., Nucl. Acids Res., 12: 369-385 (1983)); the potato proteinase inhibitor II (PINII) gene (Keil, et al., Nucl. Acids Res., 14: 5641-5650 (1986) and hereby incorporated by reference); and An, et al., Plant Cell, 1: 115-122 (1989)); and the CaMV 19S gene (Mogen, et al., Plant Cell, 2: 1261-1272 (1990)).
[0054]Plant signal sequences, including, but not limited to, signal-peptide encoding DNA/RNA sequences which target proteins to the extracellular matrix of the plant cell (Dratewka-Kos, et al., J. Biol. Chem., 264: 4896-4900 (1989)) and the Nicotiana plumbaginifolia extension gene (DeLoose, et al., Gene, 99: 95-100 (1991)), or signal peptides which target proteins to the vacuole like the sweet potato sporamin gene (Matsuka, et al., Proc. Nat'l Acad. Sci. (USA), 88: 834 (1991)) and the barley lectin gene (Wilkins, et al., Plant Cell, 2: 301-313 (1990)), or signals which cause proteins to be secreted such as that of PRIb (Lind, et al., Plant Mol. Biol., 18: 47-53 (1992)), or those which target proteins to the plastids such as that of rapeseed enoyl-ACP reductase (Verwaert, et al., Plant Mol. Biol., 26: 189-202 (1994)) are useful in the invention.
[0055]In another embodiment, the recombinant expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Especially useful in connection with the nucleic acids of the present invention are expression systems which are operable in plants. These include systems which are under control of a tissue-specific promoter, as well as those which involve promoters that are operable in all plant tissues.
[0056]Organ-specific promoters are also well known. For example, the chalcone synthase-A gene (van der Meer et al., 1990, Plant Molecular Biology 15(1):95-109) or the dihydroflavonol-4-reductase (dfr) promoter (Elomaa et al., The Plant Journal, 16(1) 93-99) direct expression in specific floral tissues. Also available are the patatin class I promoter is transcriptionally activated only in the potato tuber and can be used to target gene expression in the tuber (Bevan, M., 1986, Nucleic Acids Research 14:4625-4636). Another potato-specific promoter is the granule-bound starch synthase (GBSS) promoter (Visser, R. G. R, et al., 1991, Plant Molecular Biology 17:691-699).
[0057]Other organ-specific promoters appropriate for a desired target organ can be isolated using known procedures. These control sequences are generally associated with genes uniquely expressed in the desired organ. In a typical higher plant, each organ has thousands of mRNAs that are absent from other organ systems (reviewed in Goldberg, P., 1986, Trans. R. Soc. London B314:343).
[0058]The resulting expression system or cassette is ligated into or otherwise constructed to be included in a recombinant vector which is appropriate for plant transformation. The vector may also contain a selectable marker gene by which transformed plant cells can be identified in culture. The marker gene may encode antibiotic resistance. These markers include resistance to G418, hygromycin, bleomycin, kanamycin, and gentamicin. Alternatively the marker gene may encode a herbicide tolerance gene that provides tolerance to glufosinate or glyphosate type herbicides. After transforming the plant cells, those cells having the vector will be identified by their ability to grow on a medium containing the particular antibiotic or herbicide. Replication sequences, of bacterial or viral origin, are generally also included to allow the vector to be cloned in a bacterial or phage host, preferably a broad host range prokaryotic origin of replication is included: A selectable marker for bacteria should also be included to allow selection of bacterial cells bearing the desired construct. Suitable prokaryotic selectable markers also include resistance to antibiotics such as kanamycin or tetracycline.
[0059]Other DNA sequences encoding additional functions may also be present in the vector, as is known in the art. For instance, in the case of Agrobacterium transformations, T-DNA sequences will also be included for subsequent transfer to plant chromosomes.
[0060]Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein. Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell.
[0061]A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i.e., express) a polypeptide of the invention encoded in an open reading frame of a polynucleotide of the invention. Accordingly, the invention further provides methods for producing a polypeptide using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding a polypeptide of the invention has been introduced) in a suitable medium such that the polypeptide is produced. In another embodiment, the method further comprises isolating the polypeptide from the medium or the host cell.
[0062]A number of cell types may act as suitable host cell for expression of a polypeptide encoded by an open reading frame in a polynucleotide of the invention. Plant host cells include, for example, plant cells that could function as suitable hosts for the expression of a polynucleotide of the invention include epidermal cells, mesophyll and other ground tissues, and vascular tissues in leaves, stems, floral organs, and roots from a variety of plant species, for example Arabidopsis, Brassica, Oryza, Zea, Sorghum, Gossypium, Triticum, Glycine, Pisum, Phaseolus, Lycopersicon, Trifolium, Cannabis, Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Lolium, Avena, Hordeum, Secale, Picea, Caco, and Populus.
Conservative Mutations
[0063]In addition to naturally-occurring allelic variants of a MYB-subgroup14 or a MYB68 sequence that may exist in the population, the skilled artisan will further appreciate that changes can be introduced by mutation into the nucleotide sequence of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 thereby leading to changes in the amino acid sequence of the encoded MYB-subgroup14 or a MYB68 protein, without altering the functional ability of the MYB-subgroup14 or a MYB68 protein. For example, nucleotide substitutions leading to amino acid substitutions at "non-essential" amino acid residues can be made in the sequence of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264. A "non-essential" amino acid residue is a residue that can be altered from the wild-type sequence of a MYB-subgroup14 or a MYB68 without altering the biological activity, whereas an "essential" amino acid residue is required for biological activity. For example, amino acid residues that are conserved among MYB-subgroup14 or MYB68 proteins of the present invention are predicted to be poor candidates for alteration. Alignments and identification of conserved regions are described herein and provide further guidance as to identification of essential amino acids and conserved amino acids.
[0064]Another aspect of the invention pertains to nucleic acid molecules encoding a MYB-subgroup14 or MYB68 protein that contain changes in amino acid residues that are not essential for activity. Such MYB-subgroup14 or MYB68 proteins differ in amino acid sequence from SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264 yet retain biological activity. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino acid sequence at least about 75% homologous to the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264. Preferably, the protein encoded by the nucleic acid is at least about 80% homologous to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264 more preferably at least about 90%, 95%, 98%, and most preferably at least about 99% homologous to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264.
[0065]An isolated nucleic acid molecule encoding a MYB-subgroup14 or a MYB68 protein homologous to the protein of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 89, 93, 95, 97, 99, 101, 103, 105, 107, 111, 113, 115, 117, 119, 121, 123, 126, 128, 131, 133, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 212, 215, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239, 241, 243, 245, 248, 250, 252, 254, 256, 258, 260, 262 and 264 can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein.
[0066]Mutations can be introduced into the nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in MYB68 is replaced with another amino acid residue from the same side chain family. Alternatively, in another embodiment, mutations can be introduced randomly along all or part of a MYB-subgroup14 or a MYB68 coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for biological activity to identify mutants that retain activity and the desired phenotypes. Following mutagenesis of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 the encoded protein can be expressed by any recombinant technology known in the art and the activity of the protein can be determined.
Transformed Plants Cells and Transgenic Plants
[0067]The invention includes a protoplast, plants cell, plant tissue and plant (e.g., monocot or dicot) transformed with a MYB-subgroup14 nucleic acid, a vector containing a MYB-subgroup14 nucleic acid or an expression vector containing a MYB-subgroup14 nucleic acid. As used herein, "plant" is meant to include not only a whole plant but also a portion thereof (i.e., cells, and tissues, including for example, leaves, stems, shoots, roots, flowers, fruits and seeds).
[0068]The plant can be any plant type including, for example, species from the genera Arabidopsis, Brassica, Oryza, Zea, Sorghum, Gossypium, Triticum, Glycine, Pisum, Phaseolus, Lycopersicon, Trifolium, Cannabis, Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Lolium, Avena, Hordeum, Secale, Picea, Caco, and Populus.
[0069]The invention also includes cells, tissues, including for example, leaves, stems, shoots, roots, flowers, fruits and seeds and the progeny derived from the transformed plant.
[0070]Numerous methods for introducing foreign genes into plants are known and can be used to insert a gene into a plant host, including biological and physical plant transformation protocols (See, for example, Miki et al., (1993) "Procedure for Introducing Foreign DNA into Plants", In: Methods in Plant Molecular Biology and Biotechnology, Glick and Thompson, eds., CRC Press, Inc., Boca Raton, pages 67-88; and Andrew Bent in, Clough S J and Bent A F, (1998) "Floral dipping: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana"). The methods chosen vary with the host plant, and include chemical transfection methods such as calcium phosphate, polyethylene glycol (PEG) transformation, microorganism-mediated gene transfer such as Agrobacterium (Horsch, et al., Science, 227: 1229-31 (1985)), electroporation, protoplast transformation, micro-injection, flower dipping and biolistic bombardment.
Agrobacterium-Mediated Transformation
[0071]The most widely utilized method for introducing an expression vector into plants is based on the natural transformation system of Agrobacterium tumefaciens and A. rhizogenes which are plant pathogenic bacteria which genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectfully, carry genes responsible for genetic transformation of plants (See, for example, Kado, Crit. Rev. Plant Sci., 10:1-32 (1991)). Descriptions of the Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer are provided in Gruber et al., supra; and Moloney, et al, Plant Cell Reports, 8: 238-242 (1989).
[0072]Transgenic Arabidopsis plants can be produced easily by the method of dipping flowering plants into an Agrobacterium culture, based on the method of Andrew Bent in, Clough S J and Bent A F, 1998. Floral dipping: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Wild type plants are grown until the plant has both developing flowers and open flowers. The plants are inverted for 1 minute into a solution of Agrobacterium culture carrying the appropriate gene construct. Plants are then left horizontal in a tray and kept covered for two days to maintain humidity and then righted and bagged to continue growth and seed development. Mature seed is bulk harvested.
Direct Gene Transfer
[0073]A generally applicable method of plant transformation is microprojectile-mediated transformation, where DNA is carried on the surface of microprojectiles measuring about 1 to 4 μm. The expression vector is introduced into plant tissues with a biolistic device that accelerates the microprojectiles to speeds of 300 to 600 m/s which is sufficient to penetrate the plant cell walls and membranes. (Sanford, et al., Part. Sci. Technol., 5: 27-37 (1987); Sanford, Trends Biotech, 6: 299-302 (1988); Sanford, Physiol. Plant, 79: 206-209 (1990); Klein, et al., Biotechnology, 10: 286-291 (1992)).
[0074]Plant transformation can also be achieved by the Aerosol Beam Injector (ABI) method described in U.S. Pat. No. 5,240,842, U.S. Pat. No. 6,809,232. Aerosol beam technology is used to accelerate wet or dry particles to speeds enabling the particles to penetrate living cells Aerosol beam technology employs the jet expansion of an inert gas as it passes from a region of higher gas pressure to a region of lower gas pressure through a small orifice. The expanding gas accelerates aerosol droplets, containing nucleic acid molecules to be introduced into a cell or tissue. The accelerated particles are positioned to impact a preferred target, for example a plant cell. The particles are constructed as droplets of a sufficiently small size so that the cell survives the penetration. The transformed cell or tissue is grown to produce a plant by standard techniques known to those in the applicable art.
Regeneration of Transformants
[0075]The development or regeneration of plants from either single plant protoplasts or various explants is well known in the art (Weissbach and Weissbach, 1988). This regeneration and growth process typically includes the steps of selection of transformed cells, culturing those individualized cells through the usual stages of embryonic development through the rooted plantlet stage. Transgenic embryos and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter planted in an appropriate plant growth medium such as soil.
[0076]The development or regeneration of plants containing the foreign, exogenous gene that encodes a polypeptide of interest introduced by Agrobacterium from leaf explants can be achieved by methods well known in the art such as described (Horsch et al., 1985). In this procedure, transformants are cultured in the presence of a selection agent and in a medium that induces the regeneration of shoots in the plant strain being transformed as described (Fraley et al., 1983). In particular, U.S. Pat. No. 5,349,124 (specification incorporated herein by reference) details the creation of genetically transformed lettuce cells and plants resulting therefrom which express hybrid crystal proteins conferring insecticidal activity against Lepidopteran larvae to such plants.
[0077]This procedure typically produces shoots within two to four months and those shoots are then transferred to an appropriate root-inducing medium containing the selective agent and an antibiotic to prevent bacterial growth. Shoots that rooted in the presence of the selective agent to form plantlets are then transplanted to soil or other media to allow the production of roots. These procedures vary depending upon the particular plant strain employed, such variations being well known in the art.
[0078]Preferably, the regenerated plants are self-pollinated to provide homozygous transgenic plants, or pollen obtained from the regenerated plants is crossed to seed-grown plants of agronomically important, preferably inbred lines. Conversely, pollen from plants of those important lines is used to pollinate regenerated plants. A transgenic plant of the present invention containing a desired polypeptide is cultivated using methods well known to one skilled in the art.
[0079]A preferred transgenic plant is an independent segregant and can transmit the MYB68 gene and its activity to its progeny. A more preferred transgenic plant is homozygous for the gene, and transmits that gene to all offspring on sexual mating. Seed from a transgenic plant may be grown in the field or greenhouse, and resulting sexually mature transgenic plants are self-pollinated to generate true breeding plants. The progeny from these plants become true breeding lines that are evaluated for increased expression of the MYB68 transgene.
Method of Producing Transgenic Plants
[0080]Included in the invention are methods of producing a transgenic plant. The method includes introducing into one or more plant cells a compound that alters expression or activity of a MYB-subgroup14 in the plant to generate a transgenic plant cell and regenerating a transgenic plant from the transgenic cell. The compound increases MYB-subgroup14 expression or activity. The increased expression and or activity can additionally be directed to occur ectopically or constitutively or in a tissue specific manner. The compound can be, e.g., (i) a MYB-subgroup14 polypeptide; (ii) a MYB-subgroup14 nucleic acid and analogs and homologs thereof; (iii) a nucleic acid that increases expression of a MYB-subgroup14 nucleic acid. A nucleic acid that increases expression of a MYB-subgroup14 nucleic acid may include promoters or enhancer elements. The promoter is a heterologous promoter or a homologous promoter. Additionally, the promoter is a constitutive or an inducible promoter. Promoters include for example, organ specif promoter or tissue specific promoter. Promoter suitable for directing gene expression are know in the art and are described herein. Enhancer elements are known to those skilled in the art. For example the enhancer element is a 35S enhancer element.
[0081]By increasing the expression of a MYB subgroup-14 polypeptide is meant that the amount produced by the cell transformed with the nucleic acid construct is greater than a cell, e.g. control cell that is not transformed with the nucleic acid construct. A control cell includes for example a cell that endogenously expresses a MYB subgroup-14 polypeptide such as a plant root cell, alternatively a control cell is a non transformed cell of the same cell-type as the transformed cell, be it a leaf cell a meristem cell or a flower or seed cell. An increase is a 1-fold, 2-fold, 3 fold or greater increase. An increase of expression is also meant to include expression of a MYB subgroup-14 polypeptide in a cell that does not typically express a MYB subgroup-14 polypeptide.
[0082]The nucleic acid can be either endogenous or exogenous. Preferably, the compound is a MYB-subgroup14 polypeptide or a MYB-subgroup14 nucleic acid encoding a MYB-subgroup14 polypeptide. For example the compound comprises the nucleic acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265. Preferably, the compound is a MYB-subgroup14 nucleic acid sequence from an endogenous source to the species being transformed. Alternatively, the compound is a MYB-subgroup14 nucleic acid sequence from an exogenous source to the species being transformed.
[0083]Also included in the invention are methods of producing a transgenic plant. The method includes introducing into one or more plant cells a compound that alters a MYB-subgroup 14 nucleic acid expression or activity in the plant to generate a transgenic plant cell and regenerating a transgenic plant from the transgenic cell. The compound increases a MYB-subgroup14 sequence expression or activity. The compound can be, e.g., (i) a MYB-subgroup14 polypeptide; (ii) a MYB-subgroup14 nucleic acid and analogs and homologs thereof; (iii) a nucleic acid that increases expression of a MYB-subgroup14 nucleic acid. A nucleic acid that increases expression of a MYB-subgroup14 nucleic acid may include promoters or enhancer elements. The nucleic acid can be either endogenous or exogenous. Preferably, the compound is a MYB-subgroup14 polypeptide or a MYB-subgroup14 nucleic acid. For example the compound comprises the nucleic acid sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265. Preferably, the compound is a MYB-subgroup14 nucleic acid sequence endogenous to the species being transformed. Alternatively, the compound is a MYB-subgroup14 nucleic acid sequence exogenous to the species being transformed.
[0084]An exogenous MYB-subgroup14 sequence expressed in a host species need not be identical to the endogenous MYB-subgroup14 sequence. For example, sequences of three Arabidopsis GAMYB-like genes were obtained on the basis of sequence similarity to GAMYB genes from barley, rice, and L. temulentum. These three Arabidopsis genes were determined to encode transcription factors (AtMYB33, AtMYB65, and AtMYB101) and could substitute for a barley GAMYB and control alpha-amylase expression (Gocal et al. (2001) Plant Physiol. 127: 1682 1693).
[0085]Maize, petunia and Arabidopsis MYB transcription factors that regulate flavonoid biosynthesis are very genetically similar and affect the same trait in their native species, therefore sequence and function of these MYB transcription factors correlate with each other in these diverse species (Borevitz et al. (2000) Plant Cell 12: 2383-2394).
[0086]Therefore an expressed MYB-subgroup14 need only be functionally recognized in the host cell. Expression of MYB-subgroup14 encoding nucleic acids in Arabidopsis provides the basis of a functionally equivalent assay. For example expression of a MYB-subgroup14 from a Brassica, soybean, cotton or corn source in Arabidopsis and assessment of the heat tolerance demonstrates functional equivalence and provides a sound basis for prediction that the exogenous sequence is a MYB-subgroup14 gene and functions accordingly.
[0087]Disclosed herein is a description of expression of MYB-subgroup14 sequences from Arabidopsis, Brassica and soybean that have been demonstrated to be functional in Arabidopsis in that the resulting plants have increased heat tolerance as indicated by reduced flower abortion and increased seed set under heat stress conditions during flowering.
[0088]In various aspects the transgenic plant has an altered phenotype as compared to a wild type plant (i.e., untransformed). By altered phenotype is meant that the plant has a one or more characteristic that is different from the wild type plant. For example, when the transgenic plant has been contacted with a compound that increases the expression or activity of a MYB-subgroup14 nucleic acid, the plant has a phenotype such as increased heat tolerance as compared to a wild type plant and manifests this trait in phenotypes such as decreased flower abortion, increased seed set and development, increased yield protection and protection of pollen development and protection of meristems, particularly flower meristems, from heat damage, drought tolerance and salt tolerance for example. Plants with a reduced flower abortion have a 5, 10, 20, 25, 30% or more increase in seed yield as compared to a control plant.
[0089]The plant can be any plant type including, for example, species from the genera Arabidopsis, Brassica, Oryza, Zea, Sorghum, Gossypium, Triticum, Glycine, Pisum, Phaseolus, Lycopersicon, Trifolium, Cannabis, Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Nicotiana, Solanum, Petunia, Digitalis, Majorana, Ciahorium, Helianthus, Lactuca, Bromus, Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Lolium, Avena, Hordeum, Secale, Picea, Caco, and Populus.
Method of Identifying a Heat Stress Tolerant Plant
[0090]Also included in the invention is a method of identifying a heat stress tolerant plant. The plants identified by these methods have reduced flower abortion and increased yield as compared to a control plant. Heat stress tolerant plants are identified by exposing a population of flowering plants to a heat stress treatment and selecting a plant from the population of plants that has reduced flower abortion. Heat stress treatment includes for example exposing the plant to a temperature that is hot enough for a sufficient amount of time such that damage to plant functions or development results. By reduced flower abortion is meant that a plant does not loss as many flowers, due to flower abortion, or has a greater seed yield compared to another plant that is exposed to a similar level of heat stress. Plants with a reduced flower abortion have a 5, 10, 20, 25, 30% or more increase in seed yield as compared to a control plant.
EXAMPLES
[0091]The invention will be further illustrated in the following non-limiting examples.
Example 1
Identification of Heat Tolerant Mutant
[0092]Arabidopsis thaliana var. Columbia was transformed with pSKI15 vector containing a 4×35S enhancer sequence (Wiegel et al., 2000). A T3 population of Arabidopsis seed was obtained from ABRC and used to produce a T4-generation that was used in genetic screen experiments. The Arabidopsis h138 mutant was identified as having reduced or no flower abortion, relative to a wild type control, when exposed to a heat stress during flowering of about 45° C. for about 30 to 60 minutes. Initial isolates were retested by having flowering plants subjected to a 1 hour temperature ramp-up from 22° C. to 45° C. followed by a 2 hour heat stress of 45° C., flower production, seed set and seed development was monitored and heat tolerant lines selected.
Example 2
Identification of the Heat Tolerant MYB68 Gene
[0093]Genome walking to localize the T-DNA activation tag insertion was performed as follows. Genomic DNA was purified by phenol:chloroform extraction using 10-day-old seedlings of mutant h138. The isolated DNA was subsequently digested by the restriction enzymes such as EcoRV, PvuII, NruI, or Stul to generate DNA fragments with blunt ends. The resulting fragments from each digestion were ligated to an adaptor that was formed by the annealing of two oligos: Adaptor 1 and Adaptor 2. The addition of the adaptor to the DNA fragments enables PCR amplification using primers specific to the adaptor and a T-DNA insertion site.
[0094]Two rounds of PCR were used to generate DNA fragments for further sequencing analysis. Primer HeatL1 (SEQ) that is specific to the T-DNA left border, and primer CAP1 (SEQ) that is specific to the adaptor, were used for the 1st PCR. The resulted PCR products were diluted 50 folds to serve as templates for 2nd PCR. A confirmed DNA fragment was then amplified by two nested primers HeatL2 (SEQ) and CAP2 (SEQ). PCR programs TOUCH1 (6 cycles of 94° C., 25 sec; 72° C., 7 min; 32 cycles of 94° C., 25 sec; 67° C., 7 min and 1 cycle of 67° C., 10 min) and TOUCH2 (4 cycles of 94° C., 25 sec; 72° C., 7 min; 20 cycles of 94° C., 25 sec; 67° C., 7 min and 1 cycle of 67° C., 10 min) were used for the two rounds of PCR. All PCR was carried out using Ex-Taq as DNA polymerase and a Biometra® thermocycler. The PCR products were sequenced, and the flanking genomic sequences identified. The 4×35S enhancers were inserted into an intergenic region that is 5 kb down stream of 3' end of genomic AtMYB68 (AT5G65790) on chromosome 5. Northern analysis and real-time PCR showed that the expression of MYB68 in h138 was induced to more than 2 fold relative to wild type.
Example 3
Physiological Characterization of the h138 Mutant (myb68)
[0095]Plants were assessed for heat tolerance during flowering and scored based on the number of aborted flowers or pods and final seed yield. Plants were grown in controlled environment chambers where optimal growth conditions were 16 hr light 200 uE and 8 hr dark, 22° C. and 70% relative humidity. Three groups of plants were used in the experimental design; 1) A control group grown under optimal conditions; 2) a 3-hour heat treatment group and; 4) a 4-hour heat treatment group. Heat treatment was performed 6 days after first open flower and the temperature was ramped from 22° C. to 44° C. over a 1-hour period. Each group of plants contained the myb68 mutant and its wild type control (myb68-null) with 10 replicate pots per entry per treatment with each pot containing 5 plants. Plants were assessed for flower abortion a week following the heat stress treatments then left to grow under optimal conditions until maturity. Final seed yield per pot was determined for all 3 groups of plants.
[0096]Following heat stress the seed yield of myb68 was lower than that of the myb68-null control in both the 3-hour (25%) and 4-hour (17%) stress treatments however the difference was only statistically significant for the 3-hour treatment. The 3-hour treatment resulted in 32% fewer aborted pods relative to myb68-null and the final seed yield was increased by 16% relative to myb68 plants grown in optimal conditions. The 4-hour treatment also resulted in a 16% increase in seed yield relative to optimally grown myb68 plants. In contrast, the myb68-null showed 15% and 23% reductions in seed yield relative to optimally grown plants. The overall yield protection provided by the myb68 mutation was 31% and 39%, relative to the wild-type. Additional experiments have shown results of yield protection ranging from 5% to 44% depending on the experimental conditions.
Example 4
Constructs Useful for Expression of MYB-Subgroup14 Sequences Including MYB68
[0097]According to the methods described below, expression vector constructs can be produced using appropriate promoters and a MYB gene of the invention. For example any of the gene sequences described by the SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265. Such vector constructs are useful to produce a MYB68 gene, operably linked to a sequence that functions as a promoter in a plant cell and to operably express said gene and protein encoded by the gene.
[0098]Vectors to over-express MYB68 under regulatory control of either constitutive or conditional promoters may be constructed, as described below. The sequence encoding a MYB68 open reading frame has been operably linked to the promoter sequences of the 35S CaMV constitutive promoter, the P18.2 or P81.1 heat inducible promoters and its endogenous PMYB68 promoter. Additionally the genomic sequence of MYB68 has been cloned behind the 35S CaMV constitutive promoter in a pEGAD vector backbone.
35S-Genonic AtMYB68 (in pEGAD Vector (35S-gAtMYB68)
[0099]A 1.4 kb of MYB68 genomic DNA including 83 bps of 3' UTR was amplified by PCR using primers: MYB68FW-BamH3 (5'-AAAGGATCCATGGGAAGAGCACCGTGTTG-3') (SEQ ID NO:300) and MYB68RV-BamH4 (5'-AAAGGATCCCCACTCCCTAAAGACACAGATTT-3') (SEQ ID NO:301), and subsequently digested with BamHI. The resulting DNA fragment was ligated into pBluescript II SK (+/-), and then subcloned into pEGAD at the same site to obtain 35S-gemonicAtMYB68 (35S-gAtMYB68) in pEGAD.
35S-AtMYB68, 35S-AtMYB84, 35S-AtMYB36, 35S-AtMYB37, 35S-AtMYB38, 35S-AtMYB87
[0100]AtMYB84 (At3g49690), AtMYB36 (At5g57620), AtMYB37 (At5g23000), AtMYB38 (At2g36890) and AtMYB87 (At4g37780) are classified as members of the MYB-subgroup14 family along with AtMYB68 (Stracke et al., 2001), therefore it is possible that their functions are redundant. These MYB genes are over-expressed in Arabidopsis to test their functionality as an AtMYB68 orthologue with respect to heat tolerance. The cDNA sequences are amplified by RT-PCR, and cloned into pBI121 without GUS to generate constructs of 35S-AtMYB84, 35S-AtMYB36, 35S-AtMYB37, 35S-AtMYB38 and 35S-AtMYB87.
[0101]A 1.1 kb of AtMYB68 cDNA was produced by RT-PCR using primers HG2F (5'-AAATCTAGAATGGGAAGAGCACCGTGTT-3') (SEQ ID NO:302) and HG2R (5'-AAAGGATCCTTACACATGATTTGGCGCAT-3') (SEQ ID NO:303), and digested with XbaI and BamHI. The resulting DNA fragment was cloned into pBluescript II SK (+/-), and then into pBI121 without GUS to generate 35S-MYB68. pBI121 without GUS was obtained by SmaI and EcolcR1 double digestion and followed by self-ligation of the remaining vector.
[0102]The coding sequence of AtMYB84 (933 bp, AtMYB84, At3g49690) was amplified by RT-PCR using forward primer 690M84-Xba-FW containing an XbaI site (5'-acgt TCTAGA ATG GGA AGA GCA CCG TGT TG-3') (SEQ ID NO:273) and reverse primer 690M84-Bam-Re containing a BamHI site (5'-atcg GGATCC TTA AAA AAA TTG CTT TGA ATC AGA ATA-3') (SEQ ID NO:274). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB84.
[0103]The coding sequence of AtMYB36 (1002 bp, AtMYB36, At5g57620) was amplified by RT-PCR using forward primer M36-Xb-FW containing an XbaI site (5'-actg TCTAGA ATG GGA AGA GCT CCA TGC TG-3') (SEQ ID NO:304) and reverse primer M36-Bm-Re containing a BamHI site (5'-cagt GGATCC TTA AAC ACT GTG GTA GCT CAT C-3') (SEQ ID NO:305). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB36.
[0104]The coding sequence of AtMYB37 (990 bp, AtMYB37, At5g23000) was amplified by RT-PCR using forward primer AM37-Xb-FW containing an XbaI site (5'-actg TCTAGA ATG GGA AGA GCT CCG TGT TG-3') (SEQ ID NO:306) and reverse primer AM37-Bm-Re containing a BamH I site (5'-acgt GGATC CTA GGA GTA GAA ATA GGG CAA G-3') (SEQ ID NO:307). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB37.
[0105]The coding sequence of AtMYB38 (897 bp, AtMYB38, At2g36890) was amplified by RT-PCR using forward primer AM38-Xb-FW containing an XbaI site (5'-actg TCTAGA ATG GOT AGG GCT CCA TGT TGT-3') (SEQ ID NO:308) and reverse primer AM38-Bm-Re containing a BamH I site (5'-acgt GGATCC TCA GTA GTA CAA CAT GAA CTT ATC-3') (SEQ ID NO:309). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct of 35 S-AtMYB38.
[0106]The coding sequence of AtMYB87 (918 bp, AtMYB87, At4g37780) will be amplified by RT-PCR using forward primer M87-Xb-FW containing an XbaI site (5'-aaaa TCTAGA ATG GGA AGA GCA CCG TGC-5') (SEQ ID NO:310) and reverse primer M87-Bg-Re containing a Bg12 site (5'-aaaa AGATCT CTA CTC ATT ATC GTA TAG AGG-3') (SEQ ID NO:311). The PCR product will be cloned at the XbaI-BamHI sites in pBI121, generating construct of 35S-AtMYB87.
P18.2-MYB68, and P81.1-MYB68
[0107]The construction involved 4-steps; 1) a 869 bp of Hsp18.2 promoter, and a 406 bp of Hsp81.1 promoter were amplified by PCR using primer sets: HP1F (SEQ ID NO: 277) and HP1R (SEQ ID NO:278), and HP2F (SEQ ID NO:279) and HP2R (SEQ ID NO:280), respectively, and digested with SalI and XbaI. The resulting DNA fragments were cloned into pBI101 at the same sites to generate the new vectors: P18.2pBI101 and P81.1pBI101; 2) a MCS2-oligo (including restriction sites of XbaI, HpaI, Agel, KpnI, XhoI, ScaI, SpeI, SalI, BamHI and SmaI) was cloned into the new vectors at XbaI and SmaI sites. The resulting vectors were named P18.2pBI101MCS and P81.1pBI101MCS; 3) the GUS gene was removed by SmaI and EcolcR1 double digestion and followed by self-ligation of the remaining vector to give vectors P18.2pBI101MCS without GUS and P81.1pBI101MCS without GUS; 4) the 1.1 kb of MYB68 cDNA fragment was ligated into the two newer vectors at XbaI and BamHI sites to complete the construction of P18.2pBI101 MCS without GUS for P18.2-MYB68, and P81.1 MCSpBI121 without GUS for P81.1-MYB68.
pHSP81.1-AtMYB68
[0108]The coding sequence of AtMYB68 was isolated by restriction digestion with XbaI and BamHI from plasmid pHSP18.2-AtMYB68, and cloned at the XbaI-BamHI sites in pHSP81.1.
pHPR-AtMYB68
[0109]The promoter sequence (-1 to -506 bp, relative to ATG start codon) of the Arabidopsis hydroxy pyruvate reductase gene (HPR, At1g68010) was amplified by PCR from Arabidopsis genomic DNA using a forward primer containing a Sal I site (HPR-Sal-FW, acgt gtcgac GAAGCAGCAGAAGCCTTGAT) (SEQ ID NO:312) and a reverse primer containing an Xba I site (HPR-Xb-R2, acgt tctaga GGT AGA GAA AAG AGA aag cct c) (SEQ ID NO:313). The digested fragment was cloned into the vector pHSP81.1-AtMYB68 that was pre-digested with SalI and XbaI to remove the HSP81.1 promoter. This generates a recombinant plasmid with the HPR promoter placed in front of AtMYB68.
PMYB68-AtMYB68
[0110]The AtMYB68 promoter (-1 through -1034 with respect to the MYB68 ATG start codon) was amplified by PCR using primers: Pm68-H3-FW (SEQ ID NO:275) and Pm68-Av-Xh-Re (SEQ ID NO:276), and digested by restriction enzymes: HindIII and XhoI. The resulting promoter fragment was cloned into P81.1MCSpBI121 without GUS at the same sites, replacing the Hsp81.1 promoter. This vector is then named PMYB68pBI121, and used for further cloning of AtMYB68 cDNA (1.1 kb) at Avr II and BamHI sites to obtain PMYB68-AtMYB68. The AvrII-BamHI fragment of MYB cDNA was recovered from the plasmid 18.2-MYB68.
pM68-AtMYB84
[0111]The coding sequence of AtMYB84 (933 bp, AtMYB84, At3g49690) was amplified by RT-PCR using forward primer 690M84-Xba-FW containing an XbaI site (5'-acgt TCTAGA ATG GGA AGA GCA CCG TGT TG-3') (SEQ ID NO:273) and reverse primer 690M84-Bam-Re containing a BamHI site (5'-atcg GGATCC TTA AAA AAA TTG CTT TGA ATC AGA ATA-3') (SEQ ID NO:274). The PCR product was cloned at the AvRII-BamHI sites in pB-Pm68, generating construct of AtMYB84 under control of the AtMYB68 promoter.
pM68-AtMYB36
[0112]The coding sequence of AtMYB36 (1002 bp, At5g57620) was amplified from RNA isolated from young Arabidopsis seedlings (leaves and roots) by RT-PCR using forward primer M36-Xb-FW containing an XbaI site (5'-actg TCTAGA ATG GGA AGA GCT CCA TGC TG-3') (SEQ ID NO:305) and reverse primer M36-Bm-Re containing a BamHI site (5'-cagt GGATCC TTA AAC ACT GTG GTA GCT CAT C-3') (SEQ ID NO:306). The PCR product was cloned at the Avr II and BamHI sites in pBI-Pm68 described above. This generated a construct of AtMYB36 under control of the AtMYB68 promoter.
35S-OsMYB36
[0113]Rice MYB36 cDNA homolog: The coding sequence (966 bp) of a rice MYB36 gene (SEQ ID NO:9), encoding a protein identified as SEQ ID NO:10 was amplified by RT-PCR from rice root RNA using forward primer rM-Xb-FW2 containing an XbaI site (5'-acgt TCTAGA ATG GGG AGA GCG CCG TGC TG-3') (SEQ ID NO:314) and reverse primer rM-Bm-Re2 containing a BamH I site (5'-tgca GGATCC CTA CTG CAT CCC GAG GTC AG CT-3') (SEQ ID NO:315). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct 35S-Os MYB36.
35S-gOs MYB36
[0114]Rice MYB genomic homolog clone: Using the same primers described above forward primer rM-Xb-FW2 (SEQ ID NO:314) and reverse primer rM-Bm-Re2 (SEQ ID NO:315), the genomic sequence of the rice MYB36 gene (SEQ ID NO:265) was amplified (1259 bp). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct 35S-gOsMYB36.
35S-GmMYB84
[0115]The soybean MYB161 is a homolog of Arabidopsis MYB84. Herein the term `soybean MYB84` is used interchangeably with Soybean MYB161. The 1068 bp coding sequence of a soybean MYB161 was cloned by RT-PCR from soybean root RNA using forward primer soybM-Xba-FW2 containing an XbaI site (5'-acgt TCTAGA ATG GGG AGG GCA CCT TGC T-3') (SEQ ID NO:316) and reverse primer soybM-Bm-Re containing a BamHI site (5'-acgt GGATC CTA TTG CGC CCC CGG GTA G-3') (SEQ ID NO:317). The PCR product was cloned at the XbaI-BamHI sites in pBI121, generating construct 35S-GmMYB84.
35S-ZmMYB36
[0116]The corn MYB36 cDNA (SEQ ID NO:261) was amplified by PCR using primers: ZmYYBFW-XbaI (5'-aaatctagaATGGGGAGAGCTCCGTGCTGCGACA-3') (SEQ ID NO:318) and ZmMYBRV-BamHI2 (5'-aaaggatccCTACTTCATCCCAAGGTTTCCTGGC-3') (SEQ ID NO:319). The DNA fragment was digested by XbaI and BamHI and subsequently ligated to the same sites at pBluescript II SK (+), and then subcloned into the same sites of pBI121 replacing GUS.
35S-GhMYB68 or 35SS-CotMYB68
[0117]The cotton MYB68 cDNA was amplified by PCR using primers: CotM-Xb-Fw (5'-acgt TCTAGA ATG GGG AGA GCT CCT TGT TG-3') (SEQ ID NO:320) and CotM-Bm-Re (5'-acgt GGATCC CTA TTG CGC TCC TCC TGG G-3') (SEQ ID NO:321). The DNA fragment was digested by XbaI and BamHI. It was ligated to the same sites at pBluescript II SK (+), and then subloned into the same sites in pBI121 replacing GUS.
35S-BnMYB68r
[0118]The canola root MYB cDNA was amplified by PCR using primers: Bn68root-FW-XbaI (5'-aaatctagaATGGGAAGAGCACCGTGTTGTGATAAGGCC-3') (SEQ ID NO:322) and Bn68root-RV-BamHI (5'-aaaggatccTTACACATTATTTGGCCCATTGAAGTATCTTGC-3') (SEQ ID NO:323). The DNA fragment was digested by XbaI and BamHI. It was ligated to the same sites at pBluescript II SK (+). The same fragment was then subcloned to the same sites in pBI121 replacing GUS.
35S-BnMYB68r
[0119]The canola bud MYB cDNA was amplified by PCR using primers: Bn68Bud-FW-XbaI (5'-aaatctagaATGGGAAGAGCACCGTGTTGTGACAAGGCT-3') (SEQ ID NO:324) and Bn68Bud-RV-BamHI (5'-aaaggatccTTACAAATGATTTGCCCCATTGAAGTAACTTGC-3') (SEQ ID NO:325). The DNA fragment was digested by XbaI and BamHI. It was ligated to the same sites at pBluescript II SK (+). The same fragment was then subcloned to the same sites in pBI121 replacing GUS.
[0120]Table 2 below describes oligonucleotide primers used to make the vector constructs described above, and additional primers useful for cloning AtMYB homologues.
TABLE-US-00002 TABLE 2 Oligonucleotide primers synthesized for cloning AtMYB68 homologues SEQ ID Restriction NO Primer name site Sequence (5'-3') Remark 271 790M68-Xba- XbaI ACGT TCTAGA ATG GGA AGA AtMYB68 FW GCA CCG TGT TG (at5g65790) 272 790M68-Bam- BamHI ATCG GGATCC TTA CAC ATG ATT AtMYB68 Re TGG CGC ATT G (at5g65790) 273 690M84-Xba- Xba I ACGT TCTAGA ATG GGA AGA AtMYB84 FW GCA CCG TGT TG (at3g49690) 274 690M84-Bam- Bam HI ATCG GGATCC TTA AAA AAA TTG AtMYB84 Re CTT TGA ATC AGA ATA (at3g49690) 275 Pm68-H3-FW Hind III ACGT AAGCTT TCG TAA AAT CTC AtMYB68 TCA TG Promoter 276 Pm68-Av-Xh- Avr II and GTCA CTCGAG CCTAGG TTT CTT AtMYB68 Re Xho I GAT TCT TGA TTC TTG ATC Promoter 277 HP1f AAAGTCGACGCATCTTTACAATGT AAAGCTTTTCT 278 HP1R AAATCTAGATGTTCGTTGCTTTTC GGG 279 HP2F AAAGTCGACAGAAGACAAATGAG AGTTGGTTTATATTT 280 HP2R AAATCTAGACGCAACGAACTTTG ATTCAA 281 BnMYB68FW2 ATGGGAAGAGCACCGTGTTGTGA Canola MYB68 TAAGGCC (AC189266.1) 282 BnMYB68RV2 TTAATTTGGCGCATTGAAGTAACT Canola MYB68 TGCATCTTCGG (AC189266.1) 283 rM-Xb-FW Xba I ACGT TCTAGA ATG GGG AGA Rice MYB GCG CCG TGC (AAT85046) 284 rM-Bm-Re BamH I TGCA GGATC CTA CTG CAT CCC Rice MYB GAG GTC AG (AAT85046) 285 cotM-Xb-FW Xba I ACGT TCTAGA ATG GGG AGA Cotton MYB GCT CCT TGT TG (TC34239) 286 cotM-Bm-Re BamH I ACGT GGATCC CTA TTG CGC TCC TCC TGG G 287 soybM-Xba- XbaI ACGT TCTAGA ATG GGG AGG Soybean MYB FW2 GCA CCT TGC T (ABH02906) 288 cornM-Xba- XbaI I ACGT TCTAGA ATG GGG AGA Corn MYB FW2 GCT CCG TGC T (TC370133) 289 wheatM-Xba- XbaI ACTG TCTAGA ATG GGG AGG Wheat MYB FW GCG CCG TGC (BQ483726) 290 MedtM-Xba- XbaI ACTG TCTAGA ATG GGA AGA M. truncatula FW GCT CCT TGC TGT MYB (TC97441) 291 sorgM-Xb-FW XbaI ACGTTCTAGA ATGGGGAGAG sorghum MYB CTCCGTGCT (AAL84760) 292 toM-Xb-FW XbaI ACTGTCTAGAATGGGAAGAGCTC Tomato blind CATGTTGT (AAL69334) 293 toM-Bm-Re BamHI GACT GGATCC TTA GTA ATA AAA CAT CCC TAT CTC A 294 popM-Xb-FW Xba I ACGT TCTAGA ATG GGG AGA Poplar MYB GCT CCT TGC TG (TC54478) 295 popM-Bm-Re BamHI GACT GGATCC TCA TTG TGG CCC Poplar MYB AAA GAA GCT (TC54478) 296 HSP18.2 HP1F AAAGTCGACGCATCTTTACAATGT AAAGCTTTTCT 297 HSP18.2 HP1R AAATCTAGATGTTCGTTGCTTTTC GGG 298 HSP81.1 HP2F AAAGTCGACAGAAGACAAATGAG AGTTGGTTTATATTT 299 HSP81.1 HP2R AAATCTAGACGCAACGAACTTTG ATTCAA Note: 1. FW: Forward primer with gene specific sequence starting from the ATG start codon. 2. Re: Reverse primer with gene specific sequence starting at the stop codon. 3. Restriction sites at the 5' end are underlined.
[0121]The expression vector constructs of the invention can be introduced into Arabidopsis, the plant of origin or any species of choice. For example an Arabidopsis MYB gene may be over expressed in a Brassica species or alternatively a soybean, maize, rice or cotton species.
Example 5
Amino Acid Sequence Analysis of MYB68
[0122]The tables below provide a comparison of amino acid sequences from the MYB gene family across different plant species, under different settings for multiple sequence alignment and amino acid sequence analysis. Note: Different MYB naming and numbering systems are used in the literature and databases for different plant species. In the tables below, (*) indicates the predicted ORF genomic DNA sequence was edited according to peptide sequence alignments to generate a putative coding sequence sequence. The (P) designation indicates a partial sequence. Sequence homology and multiple sequence alignments were compared by ClustalW at http://www.ebi.ac.uk/clustalw/.
TABLE-US-00003 TABLE 3 Protein Multi-alignment scores to Sequence size AtMYBs (%, Clustal) Species Name file (a.a.) MYB68 MYB84 MYB36 AtMYB68 100 AtMYB84 At3g49690 310 64 100 SEQ ID NO: 4 AtMYB36 At5g57650 333 35 39 100 SEQ ID NO: 6 Canola AC189266 364 (*) 88 59 33 SEQ ID NO: 8 Rice Rice MYB s8137 360 34 36 Rice MYB AAT85046 321 39 40 39 s3656 SEQ ID NO: 10 Soybean Soybean ABH02831 259 21 20 MYB84 Soybean ABH02839 317 22 20 MYB84 Soybean ABH02906 198 (P) 63 65 58 MYB161 SEQ ID NO: 14 Soybean ABH02912 209 (P) 58 56 MYB71 Corn TC32080 361 33 38 TC32080 131 79 80 ZmMYB- AF099429 43 (P) 86 86 IP30 ZmMYB- TC370133 131 (P) 82 83 83 IP30 SEQ ID NO: 18 ZmMYB- AF099383 43 (P) 81 81 HX43 Corn CAJ42201 226 33 35 MYB8 Corn CAJ42202 275 26 26 MYB31 Corn P20025 255 30 30 MYB38 Cotton Cotton TC34239 356 40 41 34 MYB SEQ ID NO: 12 Cotton AAK19616 309 32 27 GHMYB25 Cotton AAK19619 264 28 25 GHMYB9 Cotton GHMYB30 AAZ83352 307 31 29 Sorghum Sorghum AAL90639 87 (P) 60 62 MYB68 Sorghum AAQ54875 157 (P) 69 70 MYB86 Sorghum AAL84760 157 (P) 69 70 75 MYB20 SEQ ID NO: 20 Sorghum AAL84761 203 46 40 MYB34 Sbi_042749 318 38 37 203 55 53 157 63 63 Medicago ABE78637 336 28 31 truncatula ABE90877 319 24 24 TC97441 178 (P) 70 70 66 SEQ ID NO: 26 Tomato Blind AAL69334 315 39 SEQ ID NO: 28 EST467561 237 51 51 SEQ ID NO: 30 TC182203 185 (P) 60 54 64 AAL69334 185 62 58 62 EST467561 185 61 58 59 Wheat BQ483726 175 (P) 66 65 70 SEQ ID NO: 22 Poplar TC54478 345 39 44 38 SEQ ID NO: 24
TABLE-US-00004 TABLE 4 ATMYB68 Homologues from Different Crops/Species Protein Multi-alignment scores to AtMYBs Sequence size (%, ClustalW) Name file (a.a.) AtMYB68 AtMYB84 AtMYB36 AtMYB68 At5g65790 374 100 SEQ ID NO: 2 AtMYB84 At3g49690 310 64 100 AtMYB36 At5g57620 333 35 39 100 Canola AC189266 364 ((*)) 88 59 33 Soybean ABH02906 198 (P) 63 65 58 Cotton TC34239 356 40 41 34 Tomato Blind 315 39 AAL69334 Medicago TC97441 178 (P) 70 70 66 truncatula Rice AAT85046 321 39 40 38 Corn TC370133 131 (P) 82 83 83 Wheat BQ483726 175 (P) 66 65 70 Sorghum AAL84760 157 (P) 69 70 75 Poplar TC54478 345 39 44 38
[0123]In Table 4 above, sequence homology and multiple sequence alignments were compared by ClustalW at http://www.ebi.ac.uk/clustalw/ with the following default settings:
Matrix: Gonnet 250
[0124]GAP OPEN: 10.0
[0125]END GAPS: -1
[0126]GAP EXTENSION: 0.2
[0127]GAP DISTANCES: 4
TABLE-US-00005 TABLE 5 ATMYB68 Homologues from Different Crops/Species Sequence homology Protein size (%) to AtMYB68 Species Sequence file (a.a.) Protein DNA AtMYB68 At5g65790 374 100 100 SEQ ID NO: 2 AtMYB84 At3g49690 310 64 68 AtMYB36 At5g57620 333 37 29 Canola AC189266 364 ((*)) 88 90 Soybean ABH02906 198 (P) 63 53 Cotton TC34239 356 40 30 Tomato Blind 315 39 28 (AAL69334) Medicago TC97441 178 (P) 70 58 truncatula Rice AAT85046 321 39 28 Corn TC370133 131 (P) 82 67 Wheat BQ483726 175 (P) 66 53 Sorghum AAL84760 157 (P) 69 59 Poplar TC54478 345 39 30
[0128]In Table 5 above, sequence homology and multiple sequence alignments were compared by ClustalW at http://www.ebi.ac.uk/clustalw/ with the following default settings:
[0129]Matrix: Protein: Gonnet 250
[0130]GAP OPEN: DNA: 15.0 Protein: 10.0
[0131]END GAPS: -1
[0132]GAP EXTENSION: DNA: 6.66 Protein: 0.2
[0133]GAP DISTANCES: 4
Example 6
Physiological Characterization of the 35S-MYB68 Expression Lines
[0134]Within a population of transgenic lines a gradation of expression levels and physiological response will exist. In part, the gradation of variation is a result of the site of integration of the gene construct and the local environment for gene expression at that locus. Therefore, lines must be screened and evaluated in order to select the best performing lines. This process is one of routine to one skilled in the art.
[0135]Homozygous lines expressing a 35S-MYB68 expression construct have been evaluated in a heat and flower abortion experiment. The experimental set up included 8 replicate plants per line with 1 plant per 3'' pot and grown under optimal conditions in a controlled environment chamber (18 hr light at 200 uE, 6 hr dark, 22° C., 70% relative humidity). Three days after the appearance of the first flower, plants were exposed to a heat shock of 1 hour at 42° C. and returned to optimal conditions for a further 7 days. Plants were assessed for flower abortion on the main stem. Lines were identified that demonstrated reduced flower abortion rates from 34% to 60% relative to wild type controls.
[0136]Seed yields and yield protection, expressed as a percent relative to wild type, was determined for nine independent 35S-MYB68 transgenic lines. The experimental set up included 3 plants per 3'' pot which were grown as above with 22 replicates per line. Three days after the appearance of the first flower, 12 replicates per line were exposed to a 3-hour heat stress at 45° C. with a 1 hour ramp up from 22° C. Three days later the heat stress was applied again. The remaining 10 replicates the plants per line were maintained under optimal conditions throughout their life cycle. The final seed yield was determined for all the plants. Wild type plants showed a 40% reduction in yield due to the applied heat stress whereas transgenic lines, while still having a reduced yield due to heat stress the reduction was less severe resulting in a 10% to 12% yield protection.
[0137]Selected lines were re-evaluated and stressed as follows. Plants were exposed to a 1-hour ramp up period from 22° C. to 45° C. and a heat stress of 45° C. for 1.5 to 1.8 hours was maintained. Heat stress was applied daily for five consecutive days followed by a five day recovery period and then a sixth heat treatment. The heat treatment resulted in 11% reduction in yield in WT plants and in four of the transgenic lines a yield increase was observed ranging from 2% to 17%. As shown in Table 6, six transgenic lines (68, 80, 93, 73, 83, 30) showed yield protection relative to WT following the heat stress. This protection ranged from 3 to 31%. Two of these transgenic lines (30 and 73) have been shown to have yield protection in the previous experiment and two lines (93 and 83) showed reduction in flower abortion.
TABLE-US-00006 TABLE 6 Yield protection Line Heat seed yield relative to WT 68 0.816 31% 80 0.714 26% 93 0.781 17% 73 0.820 14% 83 0.719 5% 30 0.801 3% WT 0.784 -- myb68 0.577 15% WT (myb68- 0.779 -- null)
Characterization of Arabidopsis Lines Expressing the PMYB68-AtMYB68 Construct
[0138]Fourteen homozygous transgenic lines at T3 were evaluated in a flower abortion experiment. Plants (fourteen replicates per entry) were grown under optimal conditions (18 hr light, 200 uE, 22 C, 60% RH) in 2.25'' pots until three days from first open flower. At that point plants were exposed to 1 hr treatment at 43 C. Following the heat treatment plants were returned to optimal conditions for 7 more days. The impact of heat treatment on the pod formation was assessed at that point by counting the aborted and damaged (short) pods in the region that was exposed to the heat stress. Six transgenic lines showed a reduction in the total number of damaged pods as compared to controls (wild type-Colombia and the null-n11-8) (see Table 7 below). The best line showed only 70% of the heat damage as the control. Two of the transgenic lines examined here also showed reduced flower abortion at the T2 stage of screening. The myb68 mutant, included as a positive control also showed a reduction in the total number of damaged pods as compared to its segregating null control (myb68-null).
TABLE-US-00007 TABLE 7 # of damaged pods total damaged Entry Mean Std Err pods as % of col 15-8 2.3 0.4 70% 3-3 2.4 0.4 72% 4-4 2.9 0.4 87% 8-1 2.9 0.4 89% 27-1 3.1 0.4 93% 20-4 3.2 0.4 98% n-11-8 4.1 0.4 126% col 3.3 0.4 100% myb68 1.9 0.5 60% myb68-null 3.2 0.4 100%
Example 7
Physiological Assessment of Transgenic Plants Expressing MYB68
[0139]Arabidopsis myb68 Mutant Shows Drought Tolerance
[0140]Five plants per 3'' pot with 6 replicates per entry were grown under optimal conditions of 16 hr light (180 uE), 22 C, 70% RH in a growth chamber until first open flower. A drought treatment was started equalizing the amount of water in all pots and cessation of watering. Water loss was measured daily by weighing the pots. Soil water content (SWC) was calculated as a % of initial. Plants were harvested on days 0, 2 and 4 of the drought treatment. Water lost relative to shoot dry biomass of the plants was calculated. The myb68 plants show trends towards greater SWC than that of control throughout the drought treatment. Shoot biomass was not different or slightly larger than that of control. The ratio of water lost in 2 days over shoot dry weight on day 2 was lower for the myb68 mutant than control indicating trends towards drought tolerance.
TABLE-US-00008 TABLE 8 Soil water content (% of initial) on days of the drought treatment and water lost in 2 days relative to shoot dry weight on day 2. SWC-d2 SWC-d4 Water lost in 2 d/ (% Initial) (% Initial) Shoot DW-d2 Entry Mean S.E. Mean S.E. Mean S.E. myb68 36.3 1.0 9.1 0.2 161.6 4.9 WT 32.3 0.8 8.5 0.2 185.3 8.6 myb68 mutant and 35S-gMYB68 and 35S-MYB68 Arabidopsis transgenic lines show salt tolerance at seedling stage
[0141]Seeds were sterilized and placed on agar plates with 1/2 MS growth media containing salt (200 mM NaCl) or no salt (optimal plates) with 6 plates per entry and 30 seeds per plate. After 3 days at 4 C plates were placed in the growth room at 22 C, and 18 hr lights (100 uE) for 16 days. After 7 days plates were scored for germination and after additional 9 days seedlings were scored for bleaching (% white seedling indicative of stress). No differences were found between controls and transgenic lines or the mutant in germination on optimal plates and on salt plates. But after 16 days of salt exposure seedlings showed signs of stress by becoming bleached. Results indicated that the myb68 mutant and the transgenic 35S-Myb68 expressing lines had fewer bleached seedlings which are indicative of lower sensitivity to salt stress.
TABLE-US-00009 TABLE 9 Bleached seedlings (% of white seedling) scored after 16 days on 200 mM NaCl containing agar and 1/2 MS plates. % White Seedlings construct Entry Mean Std Err myb68 (mutant) myb68 51.3 6.5 myb68- 60.1 7.0 null 35S-gMYB68 30 29.3 5.0 73 29.8 4.7 control 41.6 4.6 35S-MYB68 104-3 56.7 4.9 22-7 62.1 5.4 control 75.6 5.6 Constitutive expression of MYB68 in Arabidopsis results in yield protection following heat stress.
[0142]The 35S-Myb68 Arabidopsis plants were grown in 3'' pots with 3 plants per pot and 10 replicates per optimal treatment and 12 replicates per heat treatment. All plants were grown under optimal conditions until 2 days into flowering. At that point optimal plants remained in optimal conditions (22 C, 18 hr light of 200 uE, 70% RH) and the test group had a daily heat treatment applied by increasing temperature from 22° C. to 45° C. over a 1 hr ramp period and maintaining that temperature for 1.5 to 2.5 hr for five consecutive days. Plants recovered for a two day period without applied stress after which stress was applied again for an additional three days (total of eight days of heat treatment). Following the heat treatments plants were maintained in optimal conditions till maturity together with the optimal group and final seed yield of both groups was determined. The results indicate that four 35S-Myb68 transgenic lines (22-7, 20-11, 35-1 and 8-6) showed yield protection following the heat stress treatments that ranged from 8 to 21% relative to controls.
TABLE-US-00010 TABLE 10 Seed yield per pot from plants grown under optimal and heat stress conditions as described above. Yield protection was calculated as the difference between the seed yield following the heat stress and expressed as % of optimal in transgenic lines and col. yield optimal heat Protection seed yield (g) seed yield (g) % of relative to Entry n Mean Std Err n Mean Std Err opt col 22-7 8 0.449 0.034 12 0.470 0.018 105% 21% 20-11 10 0.194 0.025 11 0.201 0.014 104% 20% 35-1 8 0.500 0.033 12 0.465 0.023 93% 10% 8-6 9 0.517 0.028 12 0.473 0.011 91% 8% 28-12 10 0.572 0.025 12 0.503 0.012 88% 4% 33-3 8 0.155 0.016 11 0.133 0.012 86% 2% 25-6 10 0.395 0.027 12 0.335 0.013 85% 1% 20-7 10 0.571 0.021 12 0.484 0.010 85% (null) 33-10 10 0.546 0.018 12 0.456 0.016 84% (null) Col 10 0.548 0.029 12 0.457 0.020 83%
Example 8
Functional Confirmation of Arabidopsis MYB-Subgroup14 Sequences And Homologues from Other Species Produce the Desired Phenotypes Such as Heat Tolerance
Constitutive Expression of BnMYB68 in Arabidopsis Results in Reduced Flower Abortion Following Heat Stress.
[0143]Two closely related Myb68 sequences were identified from Brassica. Their expression patterns differ in that one is expressed predominately in the roots (SEQ ID NO:54), the other in flower buds (SEQ ID NO:56). Plants having constructs expressing either of the BnMyb68 were produced and evaluated. Plants were grown in 2.25'' pots (1/pot) under optimal conditions (22 C, 50% RH, 17 hr light of 200 uE) until 3 days from first open flower. Plants were transferred from 22 C to 43 C for 2-2.5 hr (see tables below). Following this heat stress plants were returned to optimal conditions at 22 C for a week. One week following the heat stress number of aborted flowers was counted. Transgenic lines of 35S-BnMyb68(root) construct showed fewer aborted pods than its control (null). Two transgenic lines of 35S-BnMyb68(bud) construct showed reduced flower abortion following heat stress than their control (null line).
TABLE-US-00011 TABLE 11 Number of aborted flowers following 2 hr heat stress at 43 C. - 35S-BnMYB68 (root) # # Aborted pods Entry Reps Mean Std Err % Null 36-1 10 6.6 0.9 79% 49-1 12 6.6 0.9 79% 61-4 11 6.8 0.7 82% 70-2 11 7.5 1.1 89% 73-3 12 7.5 0.7 90% 75-7 11 5.9 1.0 71% 78-8 11 6.5 0.9 77% 80-8 12 6.5 0.9 78% null 75-5 11 8.4 0.9
TABLE-US-00012 TABLE 12 Number of aborted flowers following 2.5 hr heat stress at 43 C. - 35S-BnMYB68 (bud) # # Aborted pods Entry Reps Mean Std Err % Null 11-6 11 4.6 0.4 73% 61-12 12 5.8 0.8 92% Null 97-1 12 6.3 0.6 Constitutive expression of soybean MYB84 or Arabidopsis MYB84 or Arabidopsis MYB36 in Arabidopsis results in reduced flower abortion following heat stress.
[0144]Over-expression constructs of soybean-Myb84 (GmMyb84) or Arabidopsis Myb84 (AtMyb84) or Arabidopsis Myb36 (AtMyb36) were made and transformed into Arabidopsis plants functionally confirm the Myb-subgroup14 homologues resulted in heat tolerance as demonstrated by reduced flower abortion under heat stress. Transgenic plants were produced and the T2 generation was used for an initial screen of heat tolerance. Subsequently, T3 homozygous transgenic plants are used for detailed physiological assessment and confirmation of initial results.
[0145]The T2 seeds were plated on 0.5×MS agar plates with vitamins (1 plate/flat). Each test group included a positive control, the original heat tolerant mutant myb68, and a corresponding wild type. Seedlings were transplanted to soil on day ten post germination. At the early flowering stage, plants were placed into a heating chamber at 45° C., 65% humidity) for 24-30 minutes. The plants were then placed back into the growing chamber under normal growth conditions (17 h light/7 h dark, 200 μE, 22° C., 70% humidity). Plants were examined on day thirty-two and scored for aborted siliques, partially aborted siliques, dead meristems or normal siliques. A heat stress was deemed effective if a majority of wild type plants had significant flower abortion Transgenic lines were assessed for gene expression by RT-PCR and demonstrated to have elevated expression levels.
[0146]Constructs and transgenic plants are produced using homologous of Myb-subgroup14 sequences from desired crop species, for example, rice, corn, wheat, soybean and cotton and evaluated for heat tolerance.
TABLE-US-00013 TABLE 13 flower abortion as % Construct of control 35S-Bn MYB68-bud 84 35S-Bn-MYB68-root 82 35S-GmMYB84 58 35S-AtMYB84 77 35S-AtMYB36 81
[0147]This example demonstrates that Arabidopsis can be used as a model system to assess and provide conformation that a Mub-subgroup14 sequence can provide heat tolerance and that sequences identified from other plant species are functional in Arabidopsis.
Characterization of Arabidopsis Lines Expressing a 35S-AtMYB36 Construct
[0148]Fifteen homozygous transgenic lines at T3 were evaluated in a flower abortion experiment. Plants (fourteen replicates per entry) were grown under optimal conditions (18 hr light, 200 uE, 22 C, 60% RH) in 2.25'' pots until four days from first open flower. At that point plants were exposed to 1 hr treatment at 43 C. Following the heat treatment plants were returned to optimal conditions for 7 more days. The impact of heat treatment on the pod formation was assessed at that point by counting the aborted and damaged (short) pods in the region that was exposed to the heat stress. Nine transgenic lines showed reduction in the total number of damaged pods as compared to controls (wild type-Columbia and the null-n77-4) (see table below). Three of the lines examined here also showed reduced flower abortion at the T2 stage of screening. The myb68 mutant, included as a positive control also showed a reduction in the total number of damaged pods as compared to its segregating null control (myb68-null).
TABLE-US-00014 TABLE 14 total # of damaged pods total damaged entry Mean Std Err pods as % of col 108-2 3.2 0.6 65% 14-3 3.7 0.5 76% 36-2 3.8 0.4 78% 103-7 3.8 0.4 78% 43-6 3.8 0.4 78% 23-4 3.9 0.4 81% 67-7 4.0 0.4 82% 82-7 4.4 0.4 90% 40-4 4.5 0.6 93% n77-4 4.1 0.5 85% col 4.9 0.6 100% myb68 3.1 0.4 72% myb68- 4.3 0.5 100% null Constitutive or inducible expression in Brassica napus of an Arabidopsis MYB68 shows reduced flower abortion following heat stress
[0149]Five transgenic lines having an Arabidopsis Myb68 gene sequence under the control of a constitutive promoter or a heat inducible promoter were evaluated for heat stress tolerance under growth chamber conditions. These lines were at the T2 stage and were heterozygous, with the exception of the 01-105G-1-E line. Analysis of heterozygous lines typically produces greater variation than the same analysis performed on homozygous lines. However, early analysis allows for screening and subsequent analysis of the derived homozygous lines. Segregating nulls and the parent DH12075 were included as controls. The experiment was arranged in a split-plot design with temperature as main factor and transgenic line as subfactor. The plants were grown in 15 cm plastic pots filled with "Sunshine Mix #3" under approximately 500 μmol m-2 s-1 photosynthetically active radiation at the top of the crop canopy. Molecular analysis was performed to confirm transgene presence. Two groups of plants were included in the test; one group was grown under optimum conditions (22/18° C. day/night temperature, 16-h photoperiod) throughout the growing period, while the second group was subjected to heat stress at 31° C. for 5 hr. per day (ramped-up from 18 to 31° C., then back down to 18° C.). Heat stress conditions were initiated on the third day following the first flower opening and for seven days thereafter. The heat-stressed plants were then returned to optimum conditions until maturity. All racemes on the stressed plants were marked at the beginning and end of the heat stress period, to indicate which flowers had been subjected to heat stress. Viable and aborted pods on the marked racemes were counted and the pod abortion rate calculated as the ratio of aborted pods to aborted plus viable pods under heat-stress conditions, expressed as percent. Seed yield was determined after harvest.
[0150]There were significant differences in flower abortion among different lines, with two 35S-Myb68 lines showing significantly reduced abortion rate under heat stress compared to the parental line. Line 02-104G-4-A had a significantly lower abortion rate (33%) than its segregating null (48%) and DH12075 (61%). The abortion rate for line 02-104G-3-K (47%) was also significantly lower than that of the segregating null (66%) and DH12075. The homozygous line (01-105G-1-E) had a slightly lower abortion rate (53%) than its segregating null (58%) and the parental control. Within a population of transgenic lines a gradation of expression levels and physiological response will exist. In part, the gradation of variation is a result of the site of integration of the gene construct and the local environment for gene expression at that locus. This gradation is expected and therefore, lines must be screened and evaluated in order to select the best performing lines. This process is one of routine to one skilled in the art.
[0151]Seed yield differed among the tested lines. The yield of three transgenic lines, 02-104G-3-K, 02-104G-4-A and 01-105G-1-E, was similar to that of the parental line, however compared to it's own null the three lines showed between 10% and 22% increase in seed yield. Two transgenic lines (02-33H-1-V and 01-105G-3-G) appeared to under perform compared to the DH12075 parent however, compared to appropriate nulls, one line was significantly lower. Due to the zygosity of these lines such variability is not unexpected.
[0152]A similar trend was found for seed number per raceme under heat stress conditions. Lines 02-104G-3-K, 02-104G-4-A and 01-105G-1-E had a slightly higher seed number per raceme but the other transgenic lines (02-33H-1-V and 01-105G-3-G) had significantly lower seed number per raceme compared the parent line. There were no significant differences in 100 seed weight among the tested lines.
[0153]In general, two transgenic lines expressing the Arabidopsis Myb68 gene demonstrated significant protection against flower abortion during heat stress imposed at flowering. Additionally, three transgenic lines indicated a trend of increased seed yield.
TABLE-US-00015 TABLE 15 abortion Yield Construct Line rate % S.E. Ab % null Yield S.E. % Null 35S-MYB68 02-104G-3-K 47 4.8 71 0.71 0.188 122 02-104G-3-G null 66 3.7 0.58 0.091 02-104G-4-A 33 7.6 69 0.75 0.153 110 02-104G-4-F null 48 2.8 0.68 0.090 02-33H-1-V 79 1.9 116 0.31 0.159 44 02-33H-1-F null 68 4.3 0.71 0.090 P18.2-MYB68 01-105G-1-E 53 4.1 91 0.73 0.097 116 01-105G-1-B null 58 3.0 0.63 0.090 01-105G-3-G 75 4.9 94 0.31 0.099 96 01-105G-3-J null 80 3.6 0.32 0.097 Control DH12075 parent 61 2.4 0.62 0.106
Example 9
Identification of MYB-Subgroup14 Sequences and Homologues
[0154]Methods for identification of Arabidopsis MYB sequences, classification of MYB sequences into designated subgroups and identification of MYB-subgroup14 sequences are further described in Stracke et al., 2001 and Kranz et al., 1998. MYB-subgroup14 sequences have been is defined as a nucleotide or protein sequence comprising an Arabidopsis the characteristics described herein. The MYB-subgroup14 is a R2R3 MYB sequence that additionally comprises a conserved motif or motifs as described by the following patterns.
[0155]The general pattern (SEQ ID NO:266) provides a sequence that can be used to identify a candidate MYB-subgroup14 sequence. At some positions multiple amino acid residues are permitted at a given location. Where multiple amino acids are permitted, the optional residues are indicated within square brackets. Where a R2R3MYB protein sequence fits the general pattern it is likely to be a MYB-subgroup14 sequence. A MYB-subgroup14 sequence may be less than identical to the general pattern, for example it may be 90%, 95% or 99% identical.
[0156]If a candidate MYB-subgroup14 sequence, matching the general pattern further includes a match to the exclusive pattern (SEQ ID NO:267) then the sequence is a strong candidate for inclusion as a MYB-subgroup14 sequence. The exclusive sequence (SEQ ID NO:267) defines the pattern of amino acids that are present in MYB-subgroup14 sequences but may differ in other R2R3 MYB proteins. Presence of the exclusive pattern within a MYB protein is a strong indicator that the MYB is a member of the MYB-subgroup14 family.
[0157]If a candidate MYB-subgroup14 sequence, matching the general pattern further includes a match to the absolute pattern (SEQ ID NO:268) then the sequence is a strong candidate for inclusion as a MYB-subgroup14 sequence. The Absolute pattern (SEQ ID NO:268) represents sequence residues present in all MYB-subgroup14 sequences analyzed to date.
[0158]For the general, exclusive, and absolute patterns (SEQ ID NOs:266-268) listed below, "X" denotes any amino acid, "X(N)", where N is any number, denotes a string of the indicated number of "X"s, (i.e., X(23) denotes a string of 23 "X"s), where X is any amino acid. At some positions multiple amino acid residues are permitted at a given location. Where multiple amino acids are permitted, the optional residues are indicated within square brackets.
TABLE-US-00016 General Pattern (SEQ ID NO:266) M-G-R-X-P-C-C-D-[KR]-X-X-[MV]-K-[RK]-G-X-W-[SA]-X- [DQE]-E-D-X-X-[IL]-[RK]-X-[FY]-X-X-X-X-G-X-X-X- [SN]-W-I-X-X-P-X-[RK]-X-G-[IL]-X-R-C-G-[KR]-S-C-R- L-R-W-[IL]-N-Y-L-R-P-X-[IL]-[RK]-H-G-X-[FY]-[ST]- X-X-E-[DE]-X-X-[IV]-X-X-X-[FY]-X-X-X-G-S-[KR]-W-S- X-[MI]-A-X-X-[ML]-X-X-R-T-D-N-D-[ILV]-K-N-[HY]-W- [DN]-[ST]-[RK]-L-[RK]-[RK]-[RK] Exclusive Pattern (SEQ ID NO:267) [RK]-X(9)G-X-X-I-X(28)-H-X(14)-[YF]-X(4)-S-X(23)- [RK] Absolute Pattern (SEQ ID NO:268) G-X-W-X-X-X-E-D-X-X-[IL]-[RK]-X-X-X-X-X-X-G-X(23)- R-W-[IL]-N-Y-L-R-P-X-[IL]-[RK]-H-G-X-[FY]-X-X-X-E- [DE]-X(13)-W-X-X-X-A-X-X-X-X-X-R-T
[0159]MYB-subgroup14 sequences can be defined by the consensus sequence of SEQ ID NO:266. Positions have been identified in which the amino acid residues are found exclusively or predominately in the MYB-subgroup14 sequences. In the following description all position numbers are in reference to Arabidopsis MYB68 protein (SEQ ID NO:2) which correlates to the general consensus sequence above, SEQ ID NO:266.
[0160]Within the R2 domain at position 26, a positively charged (K or R) residue is conserved in MYB-subgroup14. Although this amino acid is not exclusive to this subgroup at this position, the tendency for the rest of the Arabidopsis MYB proteins is for a hydrophobic residue, with over 50% of all MYB proteins having a hydrophobic isoleucine or valine residue (Stracke et al., 2001).
[0161]At position 36, all members of subgroup 14 contain an insertion, resulting in an extra amino acid residue. This appears to be exclusive to MYB-subgroup14. Glycine (G), a small hydrophobic residue, is the extra residue in all cases except in MYB87 (SEQ ID NO:32), which contains an asparagine residue at this position and a rice homologue (SEQ ID NO:194) which contains an argentine at this position.
[0162]At position 39, members of the MYB-subgroup14 predominantly contain a hydrophobic isoleucine residue. One exception to this is a rice homologue identified as SEQ ID NO:191, which possesses a glutamine at this position. Although other hydrophobic residues are generally found in MYB proteins at this position, isoleucine appears to be exclusive to MYB-subgroup14. Additionally, positively charged residues (39% R), and polar residues (23% N) are most prevalent at this position.
[0163]Within the R3 domain at position 68, all MYB-subgroup14 members contain the positively charged histadine residue. A positively charged residue appears at this position in all MYB proteins; however in contrast to the MYB-subgroup14, in other MYB proteins 73% contain an arginine (R) and 17% contain a lysine residue.
[0164]At position 83, most members of MYB-subgroup14 contain an aromatic hydrophobic residue (tyrosine Y, or phenylalanine F). The appearance of an aromatic hydrophobic residue at this position seems to be exclusive to MYB-subgroup14. The majority of MYB proteins have a histidine residue (87%). This histidine has been suggested to be a crucial residue in the hydrophobic core of the helix (Ogata et al., 1992). Through NMR analysis, it appears to be in contact with two of the critical tryptophan residues. The absence of a histidine at position 83 does not necessarily exclude it as a member of MYB-subgroup14 as members have been identified that do possess a histidine residue, for example SEQ ID NO:169 and SEQ ID NO:225.
[0165]At position 88, members of MYB-subgroup14 contain the polar serine residue, while almost all other MYB proteins (91%) contain the polar asparagine residue. A serine residue appears to be exclusive to this subgroup. The absence of a serine at position 88 does not necessarily exclude it as a member of MYB-subgroup14 as at least one MYB-subgroup14 member has been identified that possesses a phenylalanine residue in position 88, for example SEQ ID NO:256.
[0166]At position 112, members of MYB-subgroup14 contain a positively charged arginine or lysine residue. The majority of MYB proteins also contain positively charged residues at this position, however, histadine (48%) is the most prevalent residue found. The absence of a arginine at position 112 does not necessarily exclude it as a member of MYB-subgroup14 as at least one MYB-subgroup14 member has been identified that possesses a glutamic acid residue in position 112, for example SEQ ID NO:68 contains a glutamic acid residue and SEQ ID NO:191 contains a threonine.
[0167]Variation within a R2R3 domain is permissible as shown in the identified consensus sequences. A R2R3 domain of a MYB-subgroup14 sequence may be 90% homologous, preferably 95% homologous or more preferably 99% homologous to the consensus sequence presented.
The Addition of S1 and S2 Motifs
[0168]Within MYB-subgroup14 the MYB68 and MYB84 sequences contain two further conserved motifs identified as S1 (SFSQLLLDPN) (SEQ ID NO:269) and S2 (TSTSADQSTISWEDI) (SEQ ID NO:270). These motifs are found in both Arabidopsis and Brassica MYB68, MYB36 and MYB84 sequences and show at least 70% homology within the amino acid sequence. Additionally, homology to the S1 or S2 motifs was found to exist in orthologs in Brassica napus (Canola), Brassica rapa (Cabbage), Brassica oleracea, Raphanus raphanistrum (Radish), and homology to the S2 region in Poncirus trifoliate (Orange), and weak homology in a Medicago trunculata homologue and Vitis Vinifera (Grape) homologue within the S2 region.
[0169]For inclusion as a S1 or S2 motif target sequences show homology of at least 70%, more preferably 80% and most preferably 95%.
[0170]The MYB68 and MYB84 sequences from species other than Arabidopsis and Brassica may not contain a S1 and S2 motif but may still be classified as a MYB subgroup-14 sequence based on sequence analysis and inclusion of other criteria.
Identification of MYB-Subgroup14 Members, Including MYB68, Homologues
[0171]Homologues of an Arabidopsis MYB-subgroup14 sequence (Table 1) or a desired MYB68 (SEQ ID NO:1, SEQ ID NO:2), can be found using a variety of public or commercial software that is known to those skilled in the art. Blast alignments can be performed and putative sequences identified. Searches can be performed as outlined herein. The top homologues are determined using programs such as tblastn, tblastp, searches against available databases in NCBI, such as the EST, GSS, HTG and chromosomal databases, as well as other genomic databases, such as the TIGR unigene database, Cucurbit genomics database (http://www.icugi.org/), Sunflower and Lettuce (http://compgenomics.ucdavis.edu/compositae_index.php), Medicago truncatula (International Medicago Genome Annotation Group), SGN (http://www.sgn.cornell.edu/), and Orange (http://harvest.ucr.edu/). In instances where species were more highly divergent, the alignment parameters such as Gap costs, matrix values can be appropriately changed.
[0172]To confirm the top hit to AtMYB68 in each species is in fact a MYB68 homologue, a reciprocal blast can be performed, in which the homologue is blasted against all Arabidopsis proteins. In many cases the homologue's closest Arabidopsis hit is to one of MYB68's gene family members (a MYB subgroup-14), instead of MYB68 itself. In cases where the homologue is closest to an Arabidopsis protein outside of the MYB68 gene family, the homologue is assessed not to be a MYB-subgroup14 member.
[0173]Open reading frames may be determined using programs such as "getorf" from the EMBOSS program, or ESTScan.
[0174]Methods for identification of MYB sequences, classification of MYB sequences into designated subgroups and identification of MYB-subgroup14 sequences are further described in Stracke et al., 2001 and Kranz et al., 1998.
[0175]Weakly conserved homologues between highly divergent species are often not found using traditional blast methods. In such cases, conserved motifs, domains and fingerprints will exist between homologues, and are good predictors of functional homology. Many programs exist that are proficient at finding conserved domains across species using hidden markov models, position-specific-scoring matrices, and patterns. PSI-Blast, PRATT, PHI-Blast, and HMMBuild/HMMSearch.
[0176]PRATT is a tool provided by the PROSITE database. It generates conserved patterns from a group of conserved proteins http://www.expasy.ch/prosite/. PRATT was used to determine a conserved pattern between MYB68 and its closest homologues. ScanProsite and PHI-BLAST was then used to look for the conserved pattern in the Swiss-Prot and NCBI protein databases respectively. The search results are limited to alignments that also contain the pattern.
[0177]HMMBuild was used to build a hidden markov model using the MYB68 and its homologues. HMMSearch was used to scan NCBI's protein database using the hidden markov model. Similar proteins to the basic blastp search were found.
[0178]Utilizing the above methods, MYB68 homologues were found in over 50 different plant species. Homology was restricted in most cases to the N-terminal MYB DNA binding domain. Homology in the less conserved C-terminal region existed in genes found in Brassica napus (Canola), Brassica rapa (Cabbage), Brassica oleracea, Raphanus raphanistrum (Radish), Poncirus trifoliate (Orange), and weak homology in genes found in Medicago trunculata homologue and Vitis Vinifera (Grape) within the S2 region.
[0179]The Poncirus trifoliate and Brassica rapa genes were identified by downloading strong EST hits, and assembling them using CAP3. The resulting contigs did not code for a complete protein. The contig sequence from Orange coded for a partial protein spanning only the S2 region, and in Brassica rapa, a partial 202 aa protein spanning from 1-202 in AtMYB68 was found.
[0180]A variety of programs have characterized the MYB domain with profiles, patterns, and hidden markov models. To confirm the presence of the MYB domain in a unknown sequence, a sequence can be searched against these profiles. InterProScan is particularly useful as it provides an interface to query 13 programs simultaneously. The databases incorporated in InterPro include
a) Propom: a database of protein domain families. Built by clustering homologous segments from Swiss-Prot/Trembl database, followed by recursive PSI-BLAST searches.b) HMMTIGR: Protein families represented by Hidden Markov Models.c) TMHMM: Prediction of transmembrane helices in proteins.d) FPrintScan: Searches the PRINTS database of fingerprints. Fingerprints are protein families represented by multiple motifs.e) ProfileScan: Profiles from family related sequences.f) HMMPanther: A database of hidden markov modelsg) HMMPIR: Hidden markov models based on evolutionary relationship of whole proteins.h) ScanRegExp: Scans the prosite database of patterns and profilesi) Gene3D: a database of proteins containing functional informationj) HMMPfam: Protein domain families represented by hidden markov models.k) Superfamily: a database of structural and functional protein annotations for all completely sequenced organisms.l) HMMSmart: Allows the identification of mobile domains based on hidden markov models.m) SignalIP: predicts the presence and location of signal peptide cleavage sites in amino acid sequences.
[0181]Blocks are multiply aligned ungapped segments corresponding to the most highly conserved regions of proteins. The MYB domain is represented by three blocks. As expected AtMYB68 contained each of these three blocks, as well as 1 of the 5 Wos2 blocks.
[0182]The invention is based in part on the discovery of plants that are heat stress tolerant. The gene responsible for the heat tolerant phenotype has been determined and shown to be a MYB68 gene. Methods of producing a heat tolerant transgenic plant are disclosed herein. Specifically the invention identifies a transcription factor gene family, specifically the MYB gene family, and in particular a MYB-subgroup14 that when expressed in plants results in plants that are heat stress tolerant and have an increased yield following a heat stress or display tolerance to drought stress or salt stress.
Example 10
Identification of MYB68 Homologues
[0183]Homologues from the same plant, different plant species or other organisms were identified using database sequence search tools, such as the Basic Local Alignment Search Tool (BLAST) (Altschul et al. (1990) J. Mol. Biol. 215:403-410; and Altschul et al. (1997) Nucl. Acid Res. 25: 3389-3402). The tblastn or blastn sequence analysis programs were employed using the BLOSUM-62 scoring matrix (Henikoff, S, and Henikoff, J. G. (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919). The output of a BLAST report provides a score that takes into account the alignment of similar or identical residues and any gaps needed in order to align the sequences. The scoring matrix assigns a score for aligning any possible pair of sequences. The P values reflect how many times one expects to see a score occur by chance. Higher scores are preferred and a low threshold P value threshold is preferred. These are the sequence identity criteria. The tblastn sequence analysis program was used to query a polypeptide sequence against six-way translations of sequences in a nucleotide database. Hits with a P value less than -25, preferably less than -70, and more preferably less than -100, were identified as homologous sequences (exemplary selected sequence criteria). The blastn sequence analysis program was used to query a nucleotide sequence against a nucleotide sequence database. In this case too, higher scores were preferred and a preferred threshold P value was less than -13, preferably less than -50, and more preferably less than -100.
[0184]Alternatively, a fragment of a sequence from SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 is 32P-radiolabeled by random priming (Sambrook et al., (1989) Molecular Cloning. A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, New York) and used to screen a plant genomic library (the exemplary test polynucleotides) As an example, total plant DNA from Arabidopsis thaliana, Nicotiana tabacum, Lycopersicon pimpinellifolium, Prunus avium, Prunus cerasus, Cucumis sativus, or Oryza sativa are isolated according to Stockinger al (Stockinger, E. J., et al., (1996), J. Heredity, 87:214-218). Approximately 2 to 10 μg of each DNA sample are restriction digested, transferred to nylon membrane (Micron Separations, Westboro, Mass.) and hybridized. Hybridization conditions are: 42° C. in 50% formamide, 5×SSC, 20 mM phosphate buffer 1×Denhardt's, 10% dextran sulfate, and 100 μg/ml herring sperm DNA. Four low stringency washes at RT in 2×SSC, 0.05% sodium sarcosyl and 0.02% sodium pyrophosphate are performed prior to high stringency washes at 55° C. in 0.2×SSC, 0.05% sodium sarcosyl and 0.01% sodium pyrophosphate. High stringency washes are performed until no counts are detected in the washout according to Walling et al. (Walling, L. L., et al., (1988) Nucl. Acids Res. 16:10477-10492).
Example 11
Identification of MYB68 Technical Features
[0185]A MYB68 gene can be identified by identifying genes that have high homology to an Arabidopsis MYB68 (SEQ ID NO:1). In addition to having homology of the nucleotide or amino acid sequence, one may identify candidate genes that share conformational protein structure. Such structural motifs may assist in identification of related proteins and their structure and function relationships.
Example 12
Functional Confirmation of Homologues
[0186]Candidate homologues are introduced into Arabidopsis and assessed for heat tolerance. Genes disclosed as SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 88, 90, 91, 92, 94, 96, 98, 100, 102, 104, 106, 108, 109, 110, 112, 114, 116, 118, 120, 122, 124, 125, 127, 129, 130, 132, 134, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 211, 213, 214, 216, 217, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240, 242, 244, 246, 247, 249, 251, 253, 255, 257, 259, 261, 263 and 265 are expressed in Arabidopsis plants and heat tolerance assessed as described herein. Optionally, the expression of If a candidate MYB-subgroup14 sequence, matching the general pattern further includes a match to the exclusive pattern then the sequence is a strong candidate for inclusion as a MYB-subgroup14 genes or MYB68 genes can be evaluated in any transformable species, for example, Brassica, maize, cotton, soybean or rice. Examples of such functional testing have been provided in this disclosure.
[0187]It is noted that some of the sequences disclosed herein are not full length. Full length sequences can be obtained by standard molecular methods known to those of skill in the art. The full length sequences can then be expressed as described herein.
Sequence CWU
1
32511125DNAArabidopsis thaliana 1atgggaagag caccgtgttg tgataaggcc
aacgtgaaga aagggccttg gtctcctgag 60gaagacgcca aactcaaaga ttacatcgag
aatagtggca caggaggcaa ctggattgct 120ttgcctcaga aaattggttt aaggagatgt
gggaagagtt gcaggctaag gtggctcaac 180tatttgagac caaacatcaa acatggtggc
ttctccgagg aagaagacaa catcatttgt 240aacctctatg ttactattgg tagcaggtgg
tctataattg ctgcacaatt gccgggaaga 300accgacaacg atatcaaaaa ctattggaac
acgaggctga agaagaagct tctgaacaaa 360caaaggaaag agttccaaga agcgcgaatg
aagcaagaga tggtgatgat gaaaaggcaa 420caacaaggac aaggacaagg tcaaagtaat
ggtagtacgg atctttatct taacaacatg 480tttggatcat caccatggcc attactacca
caacttcctc ctccacatca tcaaatacct 540cttggaatga tggaaccaac aagctgtaac
tactaccaaa cgacaccgtc ttgtaaccta 600gaacaaaagc cattgatcac actcaagaac
atggtcaaga ttgaagaaga acaggaaagg 660acaaaccctg atcatcatca tcaagattct
gtcacaaacc cttttgattt ctctttctct 720cagcttttgt tagatcccaa ttactatctg
ggatcaggag ggggaggaga aggagatttt 780gctatcatga gcagcagcac aaactcacca
ttaccaaaca caagtagtga tcaacatcca 840agtcaacagc aagagattct tcaatggttt
gggagcagta actttcagac agaagcaatc 900aacgatatgt tcataaacaa caacaacaac
atagtgaatc ttgagaccat cgagaacaca 960aaagtctatg gagacgcctc agtagccgga
gccgctgtcc gagcagcttt gggcggaggg 1020acaacgagta catcggcgga tcaaagtaca
ataagttggg aggatataac ttctctagtt 1080aattccgaag atgcaagtta cttcaatgcg
ccaaatcatg tgtaa 11252374PRTArabidopsis thaliana 2Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Asp Tyr Ile Glu Asn Ser20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Asn Leu Tyr Val Thr Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Asn Lys Gln
Arg Lys Glu Phe Gln Glu Ala115 120 125Arg
Met Lys Gln Glu Met Val Met Met Lys Arg Gln Gln Gln Gly Gln130
135 140Gly Gln Gly Gln Ser Asn Gly Ser Thr Asp Leu
Tyr Leu Asn Asn Met145 150 155
160Phe Gly Ser Ser Pro Trp Pro Leu Leu Pro Gln Leu Pro Pro Pro
His165 170 175His Gln Ile Pro Leu Gly Met
Met Glu Pro Thr Ser Cys Asn Tyr Tyr180 185
190Gln Thr Thr Pro Ser Cys Asn Leu Glu Gln Lys Pro Leu Ile Thr Leu195
200 205Lys Asn Met Val Lys Ile Glu Glu Glu
Gln Glu Arg Thr Asn Pro Asp210 215 220His
His His Gln Asp Ser Val Thr Asn Pro Phe Asp Phe Ser Phe Ser225
230 235 240Gln Leu Leu Leu Asp Pro
Asn Tyr Tyr Leu Gly Ser Gly Gly Gly Gly245 250
255Glu Gly Asp Phe Ala Ile Met Ser Ser Ser Thr Asn Ser Pro Leu
Pro260 265 270Asn Thr Ser Ser Asp Gln His
Pro Ser Gln Gln Gln Glu Ile Leu Gln275 280
285Trp Phe Gly Ser Ser Asn Phe Gln Thr Glu Ala Ile Asn Asp Met Phe290
295 300Ile Asn Asn Asn Asn Asn Ile Val Asn
Leu Glu Thr Ile Glu Asn Thr305 310 315
320Lys Val Tyr Gly Asp Ala Ser Val Ala Gly Ala Ala Val Arg
Ala Ala325 330 335Leu Gly Gly Gly Thr Thr
Ser Thr Ser Ala Asp Gln Ser Thr Ile Ser340 345
350Trp Glu Asp Ile Thr Ser Leu Val Asn Ser Glu Asp Ala Ser Tyr
Phe355 360 365Asn Ala Pro Asn His
Val3703933DNAArabidopsis thaliana 3atgggaagag caccgtgttg tgacaaagca
aacgtgaaga aagggccttg gtctcctgag 60gaagatgcaa aactcaaatc ttacattgaa
aatagtggca ccggaggcaa ttggatcgct 120ttgcctcaaa agattggttt aaagagatgt
ggaaagagtt gcaggctgag gtggcttaac 180tatcttagac caaacatcaa acatggtggc
ttctctgagg aagaagaaaa catcatttgt 240agcctttacc ttacaattgg tagcaggtgg
tctataatcg ctgctcaatt gccgggacga 300acagacaacg atataaaaaa ctattggaac
acgaggctca agaagaaact cattaacaaa 360caacgcaagg agcttcaaga agcttgtatg
gagcagcaag agatgatggt gatgatgaag 420agacaacacc aacaacaaca aatccaaact
tcttttatga tgagacaaga ccaaacaatg 480ttcacatggc cactacatca tcataatgtt
caagttccag ctcttttcat gaatcaaacc 540aactcgtttt gcgaccaaga agatgttaag
ccagtgctca tcaagaacat ggtcaagatc 600gaagatcaag aactggagaa aacaaaccct
catcatcatc aagattcaat gacaaacgct 660tttgatcatc tctctttctc tcaactcttg
ttagatccta atcataacca cttaggatca 720ggcgagggtt tctccatgaa ctctatcttg
agcgccaaca caaactctcc attgcttaat 780acaagtaatg ataatcagtg gttcgggaat
ttccaggccg aaaccgtaaa cttgttctca 840ggagcctcca caagtacttc ggcagatcaa
agcactataa gttgggaaga cataagctct 900cttgtttatt ctgattcaaa gcaatttttt
taa 9334310PRTArabidopsis thaliana 4Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Ser Tyr Ile Glu Asn Ser20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Glu Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Leu Thr Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Ile Asn Lys Gln
Arg Lys Glu Leu Gln Glu Ala115 120 125Cys
Met Glu Gln Gln Glu Met Met Val Met Met Lys Arg Gln His Gln130
135 140Gln Gln Gln Ile Gln Thr Ser Phe Met Met Arg
Gln Asp Gln Thr Met145 150 155
160Phe Thr Trp Pro Leu His His His Asn Val Gln Val Pro Ala Leu
Phe165 170 175Met Asn Gln Thr Asn Ser Phe
Cys Asp Gln Glu Asp Val Lys Pro Val180 185
190Leu Ile Lys Asn Met Val Lys Ile Glu Asp Gln Glu Leu Glu Lys Thr195
200 205Asn Pro His His His Gln Asp Ser Met
Thr Asn Ala Phe Asp His Leu210 215 220Ser
Phe Ser Gln Leu Leu Leu Asp Pro Asn His Asn His Leu Gly Ser225
230 235 240Gly Glu Gly Phe Ser Met
Asn Ser Ile Leu Ser Ala Asn Thr Asn Ser245 250
255Pro Leu Leu Asn Thr Ser Asn Asp Asn Gln Trp Phe Gly Asn Phe
Gln260 265 270Ala Glu Thr Val Asn Leu Phe
Ser Gly Ala Ser Thr Ser Thr Ser Ala275 280
285Asp Gln Ser Thr Ile Ser Trp Glu Asp Ile Ser Ser Leu Val Tyr Ser290
295 300Asp Ser Lys Gln Phe Phe305
31051002DNAArabidopsis thaliana 5atgggaagag ctccatgctg cgacaaggca
aacgtgaaga aaggaccatg gtcaccggaa 60gaagatgtga agctcaagga ttacatcgac
aaatatggca ctggtggcaa ctggatcgca 120ctgcctcaga aaattgggct gaagagatgt
ggtaagagtt gcagactgag atggcttaat 180tacttaagac caaacatcaa acatggtggt
ttttctgagg aagaagatag aatcatcttg 240agtctctaca ttagcattgg aagccggtgg
tccataattg cagctcagct tcctggaagg 300actgacaatg atatcaagaa ttattggaac
acaaaactga agaagaaact tctaggaaga 360cagaaacaaa tgaatcgtca agactccata
accgattcta ctgagaacaa cctcagcaac 420aataacaaca ataagagtcc tcagaatctg
agcaattcgg cactggagag gctccagctt 480cacatgcagc ttcagaatct acagagccct
ttctctagtt tctacaacaa ccctatcttg 540tggcccaagc ttcatccatt gctccagagc
actacaacta atcaaaaccc taagcttgca 600tctcaagaaa gcttccaccc tttaggagtt
aacgttgatc atcagcacaa caataccaag 660ctagctcaga taaacaatgg agcctcttct
ctctattcgg agaacgtaga gcaatcccaa 720aaccctgctc atgaatttca acctaatttc
ggtttttcac aggaccttcg attagataat 780cataacatgg actttatgaa cagaggggtt
tctaaagaac tgtttcaagt gggcaacgag 840tttgagctaa cgaacggttc gagttggtgg
tcagaggaag tggaactaga gaggaaaacg 900acgtcgtcga gttcttgggg gtcagcttct
gtgcttgatc agacaactga gggaatggtt 960atgcttcaag attacgctca gatgagctac
cacagtgttt aa 10026333PRTArabidopsis thaliana 6Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Val
Lys Leu Lys Asp Tyr Ile Asp Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Arg Ile Ile Leu65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Gly Arg Gln
Lys Gln Met Asn Arg Gln Asp115 120 125Ser
Ile Thr Asp Ser Thr Glu Asn Asn Leu Ser Asn Asn Asn Asn Asn130
135 140Lys Ser Pro Gln Asn Leu Ser Asn Ser Ala Leu
Glu Arg Leu Gln Leu145 150 155
160His Met Gln Leu Gln Asn Leu Gln Ser Pro Phe Ser Ser Phe Tyr
Asn165 170 175Asn Pro Ile Leu Trp Pro Lys
Leu His Pro Leu Leu Gln Ser Thr Thr180 185
190Thr Asn Gln Asn Pro Lys Leu Ala Ser Gln Glu Ser Phe His Pro Leu195
200 205Gly Val Asn Val Asp His Gln His Asn
Asn Thr Lys Leu Ala Gln Ile210 215 220Asn
Asn Gly Ala Ser Ser Leu Tyr Ser Glu Asn Val Glu Gln Ser Gln225
230 235 240Asn Pro Ala His Glu Phe
Gln Pro Asn Phe Gly Phe Ser Gln Asp Leu245 250
255Arg Leu Asp Asn His Asn Met Asp Phe Met Asn Arg Gly Val Ser
Lys260 265 270Glu Leu Phe Gln Val Gly Asn
Glu Phe Glu Leu Thr Asn Gly Ser Ser275 280
285Trp Trp Ser Glu Glu Val Glu Leu Glu Arg Lys Thr Thr Ser Ser Ser290
295 300Ser Trp Gly Ser Ala Ser Val Leu Asp
Gln Thr Thr Glu Gly Met Val305 310 315
320Met Leu Gln Asp Tyr Ala Gln Met Ser Tyr His Ser Val325
33071092DNABrassica rapa 7atgggaagag caccgtgttg tgacaaggct
aacgtgaaga aagggccgtg gtctcctgaa 60gaagacgcaa aactcaaaga ttacatcgag
aataatggca caggcggcaa ctggattgcg 120ttgcctcaga agattggtct aagaagatgt
gggaagagtt gcagactaag gtggctcaac 180tatttgagac caaacatcaa acatggtggc
ttctctgagg aagaggacaa catcatttgt 240aatctctatg ttaccattgg tagcaggtgg
tctataattg ctgcacaatt gcctggaaga 300acagacaatg atatcaagaa ctattggaac
acgaggctga agaagaagct tcttaacaaa 360caaagaaaag agtaccaaga agctcggatg
aagcaagata tggtgatgat aaagcgacaa 420gaacaaggga caggccaaag taatgctagt
agggatcttt attcgaacaa catgtttgga 480tcatcaccat ggccattact acaacagctt
cctcctcatc atcaagtacc tcttgtgatg 540atggaaccaa caagttgtaa ctactaccaa
acgtcaccct cttgtaacct agaacaaaag 600ccactgatca ctttcaagaa catggtcaag
attgaagaag aaccggagag aacaaaccct 660tataatcctc agcatcaaaa ttctatcaca
aacccttttg atgtctcctt ctcccagctc 720ttgttagatc ctaattacta cttaggatca
ggaggaggag cagaagggga ttttgctatc 780atgagtagca gcacaaattc tccattacca
aacacaagtg gtgatcaaaa tgaacatcag 840cagcaagaga ttcttcaatg gtttgggagt
agtaatcttc agacagaagc aagcagtgat 900atgttcttaa acaacatagg gaatcttgag
accaacgagg acacaagatt ctactcatca 960ttagccggtg ttggagcggc tttggccgga
ggaacgacga gtacatcggc agatcaaagc 1020acaataagtt gggaggacat aacatctctt
gttaattccg aagatgcaag ttacttcaat 1080gggccaaatt aa
10928363PRTBrassica rapa 8Met Gly Arg
Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu
Lys Asp Tyr Ile Glu Asn Asn20 25 30Gly
Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu
Asn Tyr Leu Arg Pro50 55 60Asn Ile Lys
His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65 70
75 80Asn Leu Tyr Val Thr Ile Gly Ser
Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Asn Lys Gln Arg
Lys Glu Tyr Gln Glu Ala115 120 125Arg Met
Lys Gln Asp Met Val Met Ile Lys Arg Gln Glu Gln Gly Thr130
135 140Gly Gln Ser Asn Ala Ser Arg Asp Leu Tyr Ser Asn
Asn Met Phe Gly145 150 155
160Ser Ser Pro Trp Pro Leu Leu Gln Gln Leu Pro Pro His His Gln Val165
170 175Pro Leu Val Met Met Glu Pro Thr Ser
Cys Asn Tyr Tyr Gln Thr Ser180 185 190Pro
Ser Cys Asn Leu Glu Gln Lys Pro Leu Ile Thr Phe Lys Asn Met195
200 205Val Lys Ile Glu Glu Glu Pro Glu Arg Thr Asn
Pro Tyr Asn Pro Gln210 215 220His Gln Asn
Ser Ile Thr Asn Pro Phe Asp Val Ser Phe Ser Gln Leu225
230 235 240Leu Leu Asp Pro Asn Tyr Tyr
Leu Gly Ser Gly Gly Gly Ala Glu Gly245 250
255Asp Phe Ala Ile Met Ser Ser Ser Thr Asn Ser Pro Leu Pro Asn Thr260
265 270Ser Gly Asp Gln Asn Glu His Gln Gln
Gln Glu Ile Leu Gln Trp Phe275 280 285Gly
Ser Ser Asn Leu Gln Thr Glu Ala Ser Ser Asp Met Phe Leu Asn290
295 300Asn Ile Gly Asn Leu Glu Thr Asn Glu Asp Thr
Arg Phe Tyr Ser Ser305 310 315
320Leu Ala Gly Val Gly Ala Ala Leu Ala Gly Gly Thr Thr Ser Thr
Ser325 330 335Ala Asp Gln Ser Thr Ile Ser
Trp Glu Asp Ile Thr Ser Leu Val Asn340 345
350Ser Glu Asp Ala Ser Tyr Phe Asn Gly Pro Asn355
3609966DNAOryza sativa 9atggggagag cgccgtgctg cgacaaggcg agcgtgaaga
aggggccatg gtcaccggag 60gaggacgcga agctcaaatc ctacatcgag cagaacggca
ccggcggcaa ctggatcgcc 120ctgccccaga agatcggttt gaaaaggtgc ggcaagagct
gccgcctccg gtggctgaac 180tacctccggc cgaacatcaa gcacggcggg ttctcggagg
aggaagacag gatcatcctc 240agcctctaca tcagcatagg aagcaggtgg tcgataatag
cagcgcagct gccggggcgg 300acggacaatg acatcaagaa ctattggaac acgaggctca
agaagaagct ctttggcaag 360cagtcgcgca aggatcagag gcagcagcag cacctggcgc
gccaggcggc agcagctgcc 420agcgacttgc agatcaaaca agaagcgagc aggggtgcaa
acgaagccga tggcttggct 480gccggtgcca attacacttg gcatcaccac cacgccatgg
ccgtgcctgt gcacccgatg 540tcggcaccaa tggtggtgga aggaggccgt gtgggagacg
atgtcgatga gtcgatccgg 600aagcttctgt tcaagctcgg agggaaccca ttcgcggcct
cgccggcacc gccatgcata 660cctccaccac caatgtacga ggaagcccca agcttcgtgc
caccattggc gcacggcgtg 720ccgctcaacg aaggcggcat gcagtgctcc agcgtgctgc
cggcgctgga gctggacgag 780aacttccact tcaaccatgt caagctggac gggctcgagt
gcctcttcgg gatgggagat 840caccaaaaca tgagatggaa tgaggtgagc ccgttggttt
gccctaataa cgctgtggcg 900tccagctccc aagggatgca gcagtactgc ctagttgaag
aaccagctga cctcgggatg 960cagtag
96610321PRTOryza sativa 10Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Ser Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ser Tyr
Ile Glu Gln Asn20 25 30Gly Thr Gly Gly
Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asn Ile Lys His Gly Gly
Phe Ser Glu Glu Glu Asp Arg Ile Ile Leu65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ala Gln85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Phe Gly Lys Gln Ser Arg Lys
Asp Gln Arg Gln115 120 125Gln Gln His Leu
Ala Arg Gln Ala Ala Ala Ala Ala Ser Asp Leu Gln130 135
140Ile Lys Gln Glu Ala Ser Arg Gly Ala Asn Glu Ala Asp Gly
Leu Ala145 150 155 160Ala
Gly Ala Asn Tyr Thr Trp His His His His Ala Met Ala Val Pro165
170 175Val His Pro Met Ser Ala Pro Met Val Val Glu
Gly Gly Arg Val Gly180 185 190Asp Asp Val
Asp Glu Ser Ile Arg Lys Leu Leu Phe Lys Leu Gly Gly195
200 205Asn Pro Phe Ala Ala Ser Pro Ala Pro Pro Cys Ile
Pro Pro Pro Pro210 215 220Met Tyr Glu Glu
Ala Pro Ser Phe Val Pro Pro Leu Ala His Gly Val225 230
235 240Pro Leu Asn Glu Gly Gly Met Gln Cys
Ser Ser Val Leu Pro Ala Leu245 250 255Glu
Leu Asp Glu Asn Phe His Phe Asn His Val Lys Leu Asp Gly Leu260
265 270Glu Cys Leu Phe Gly Met Gly Asp His Gln Asn
Met Arg Trp Asn Glu275 280 285Val Ser Pro
Leu Val Cys Pro Asn Asn Ala Val Ala Ser Ser Ser Gln290
295 300Gly Met Gln Gln Tyr Cys Leu Val Glu Glu Pro Ala
Asp Leu Gly Met305 310 315
320Gln111071DNAGossypium 11atggggagag ctccttgttg tgacaaagct aacgtcaaga
aaggcccatg gtcacctgaa 60gaagacacca agctcaaggc atacatcgag cagcatggca
ctggcggaaa ctggatcgct 120ttgcctcaca aaattggcct taagagatgt gggaagagct
gccgcctcag atggttaaac 180tatctccgcc caaatattaa gcatggagga ttctccgaag
aagaagataa aattatttgc 240agcctctata tcagtattgg gagcaggtgg tctattattg
ctgcacaatt accggggagg 300actgataacg atataaagaa ctattggaac acaaggctta
agaagaagct tctgggcaag 360caacgcaaag agcatcagtc tcgaagaggc aacagcctaa
agcaagatat gaagagatca 420tcagctagtg tgggggattc catggttcct gcagataaca
tcaatcaaat cccttactgg 480ccagagctgc ctgtgctggc tgcggcggcg gctcccatac
cgcactcaag tcaagaacat 540cgcattgaca gccaagcctc gatgagaaga ttactaatca
agcttggggg aagattttct 600gaggatgatc atgtggttaa tgatgggaca actcttcatc
agtttcctaa tgatttatcc 660actactgatc aggatcttta cgagcagact gtctatgtgc
cctcttcttc ttcttcttct 720cccatggatg ccttgagctt aagcaataac atcggttctc
agtttgtgaa ctctcagttc 780gccatagatg gagggaatct gcccattctg caaggacaaa
gcactacttt ttcatcagag 840ctccaggaaa tgggatatag cagcaaccca cagagattag
atggaatgga gttcttatat 900ggggaaggca tggtcgataa tagaggcgtg aatccttgtg
aaagcattgg ctggggtgac 960actagctctc tggttggccc tccttgtgct tcggaatatg
gagtcatgca acaaggaatg 1020cttcaagaat atggttttag tgagatgagg tacccaggag
gagcgcaata g 107112356PRTGossypium 12Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Thr Lys Leu Lys Ala Tyr
Ile Glu Gln His20 25 30Gly Thr Gly Gly
Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asn Ile Lys His Gly Gly
Phe Ser Glu Glu Glu Asp Lys Ile Ile Cys65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ala Gln85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Glu
His Gln Ser Arg115 120 125Arg Gly Asn Ser
Leu Lys Gln Asp Met Lys Arg Ser Ser Ala Ser Val130 135
140Gly Asp Ser Met Val Pro Ala Asp Asn Ile Asn Gln Ile Pro
Tyr Trp145 150 155 160Pro
Glu Leu Pro Val Leu Ala Ala Ala Ala Ala Pro Ile Pro His Ser165
170 175Ser Gln Glu His Arg Ile Asp Ser Gln Ala Ser
Met Arg Arg Leu Leu180 185 190Ile Lys Leu
Gly Gly Arg Phe Ser Glu Asp Asp His Val Val Asn Asp195
200 205Gly Thr Thr Leu His Gln Phe Pro Asn Asp Leu Ser
Thr Thr Asp Gln210 215 220Asp Leu Tyr Glu
Gln Thr Val Tyr Val Pro Ser Ser Ser Ser Ser Ser225 230
235 240Pro Met Asp Ala Leu Ser Leu Ser Asn
Asn Ile Gly Ser Gln Phe Val245 250 255Asn
Ser Gln Phe Ala Ile Asp Gly Gly Asn Leu Pro Ile Leu Gln Gly260
265 270Gln Ser Thr Thr Phe Ser Ser Glu Leu Gln Glu
Met Gly Tyr Ser Ser275 280 285Asn Pro Gln
Arg Leu Asp Gly Met Glu Phe Leu Tyr Gly Glu Gly Met290
295 300Val Asp Asn Arg Gly Val Asn Pro Cys Glu Ser Ile
Gly Trp Gly Asp305 310 315
320Thr Ser Ser Leu Val Gly Pro Pro Cys Ala Ser Glu Tyr Gly Val Met325
330 335Gln Gln Gly Met Leu Gln Glu Tyr Gly
Phe Ser Glu Met Arg Tyr Pro340 345 350Gly
Gly Ala Gln35513596DNAGlycine max 13atggggaggg caccttgctg tgacaaagca
aacgtgaaga aaggtccttg gtcgccagaa 60gaagatacca aactcaaatc ctacatcgaa
cagcacggaa ccggaggaaa ctggatcgct 120ttgccacaaa agattggcct taaacgatgt
ggcaagagtt gccgtctcag gtggctaaac 180tacctccgcc ctaacatcag acacggtggt
ttctccgaag aagaagacaa catcatttgc 240agcctctacg ttagcattgg aagcaggtgg
tcggtcattg cagcacaatt gccgggaaga 300actgataatg acataaagaa ctattggaac
acgaggctga agaagaagct tctagggaaa 360caccgtaagg aactacaggc acgcaacaaa
ggaaacggtg gtatccttaa acaggagaac 420agttcctcgc tcttgcttca gcaaaacagt
gctcaacaat tacatccatg ttggccacag 480attccagtgc tgccactttc atcaccgtac
acaaaccaaa gtccaagttt caacgaccaa 540gattccatta gaaagcttct aatcaagctt
ggagggagat tctccgatga ttatca 59614198PRTGlycine max 14Met Gly Arg
Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Thr Lys Leu
Lys Ser Tyr Ile Glu Gln His20 25 30Gly
Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu
Asn Tyr Leu Arg Pro50 55 60Asn Ile Arg
His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65 70
75 80Ser Leu Tyr Val Ser Ile Gly Ser
Arg Trp Ser Val Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys His Arg
Lys Glu Leu Gln Ala Arg115 120 125Asn Lys
Gly Asn Gly Gly Ile Leu Lys Gln Glu Asn Ser Ser Ser Leu130
135 140Leu Leu Gln Gln Asn Ser Ala Gln Gln Leu His Pro
Cys Trp Pro Gln145 150 155
160Ile Pro Val Leu Pro Leu Ser Ser Pro Tyr Thr Asn Gln Ser Pro Ser165
170 175Phe Asn Asp Gln Asp Ser Ile Arg Lys
Leu Leu Ile Lys Leu Gly Gly180 185 190Arg
Phe Ser Asp Asp Tyr195151463DNAGlycine max 15aaataacact ctctctctct
caattatttc tgccgtaaat agacaagggt tatctcaaat 60aatcaatcta agctagctga
aagcaacaac aacaacatgg ggagggcacc ttgctgtgac 120aaatcaaacg tgaagaaagg
cccttggtcg ccagatgaag atgccaaact caaatcctac 180atcgaacagc acggaaccgg
agggaactgg attgctttgc cacaaaagat tggcctcaaa 240cgatgtggca agagttgccg
tctcaggtgg ctaaactacc tccgccctaa catcagacac 300ggtggtttct ccgaagaaga
agacaacatc atttgcagcc tctacgttag cattggaagc 360aggtggtcgg tcattgcagc
acaattgccg ggaagaactg ataatgacat aaagaactat 420tggaacacga ggctgaagaa
gaagcttcta gggaaacacc gtaaggaact acaggcacgc 480aacaaaggaa acggtggtat
ccttaaacag gagaacagtt cctcgctctt gcttcagcaa 540aacagtgctc aacaattaca
tccatgttgg ccacagattc cagtgctgcc actttcatca 600ccgtacacaa accaaagtcc
aagtttcaac gaccaagatt ccattagaaa gcttctaatc 660aagcttggag ggagattctc
cgatgattat caacccatcc tagatgggtt gaatcttcag 720ttcccacatg gttcaaattc
tctctcatca acacaacaaa ttcaagagga gcaagtccat 780gttggttctt ctgcactagg
gtgcatgaac tctattggcc acaatcaagt acaatttggt 840cagagtaatg agtactgtgc
tgagttggtg caaggacaag ggagtttcat taccacagca 900attggggaaa tggtttcttc
taatgattat tctcgaaggt tattaggtgg gtcgttggag 960ttcttctatg gggaggaaat
gattactgat aataagataa tgggtgcttg tgcttcttca 1020agttgtgggc aaagtactaa
ttggggtgaa accagtagtt ctctgatgta tccttctctt 1080gttgcttcga attacgaggg
tgtgcggcaa gagatgccac gagaatgtgc ttttcaagag 1140ttgagctacc cgggggcgca
atagcatgtg tgtacattat ttgttatcaa ttggtggagg 1200ttgggattat atgtatccaa
attgaagcta gcaaaaggga attaactttg tatttgattc 1260ccatatagat aataaaaatc
agcttgattt atatataact agagacttgt ccgctatgtg 1320agactttctc atgtgctgat
cccttagtgg cagtgcagtt taattatctg atttgtttgt 1380gtattatttg gttgtaatct
tgagagtcat ttaaatatat ccttttatgt tccaagctaa 1440aaaaaaaaaa aaagaaaacc
ccc 146316355PRTGlycine max
16Met Gly Arg Ala Pro Cys Cys Asp Lys Ser Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Asp Glu Asp
Ala Lys Leu Lys Ser Tyr Ile Glu Gln His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Arg His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Val Ser Ile
Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys
His Arg Lys Glu Leu Gln Ala Arg115 120
125Asn Lys Gly Asn Gly Gly Ile Leu Lys Gln Glu Asn Ser Ser Ser Leu130
135 140Leu Leu Gln Gln Asn Ser Ala Gln Gln
Leu His Pro Cys Trp Pro Gln145 150 155
160Ile Pro Val Leu Pro Leu Ser Ser Pro Tyr Thr Asn Gln Ser
Pro Ser165 170 175Phe Asn Asp Gln Asp Ser
Ile Arg Lys Leu Leu Ile Lys Leu Gly Gly180 185
190Arg Phe Ser Asp Asp Tyr Gln Pro Ile Leu Asp Gly Leu Asn Leu
Gln195 200 205Phe Pro His Gly Ser Asn Ser
Leu Ser Ser Thr Gln Gln Ile Gln Glu210 215
220Glu Gln Val His Val Gly Ser Ser Ala Leu Gly Cys Met Asn Ser Ile225
230 235 240Gly His Asn Gln
Val Gln Phe Gly Gln Ser Asn Glu Tyr Cys Ala Glu245 250
255Leu Val Gln Gly Gln Gly Ser Phe Ile Thr Thr Ala Ile Gly
Glu Met260 265 270Val Ser Ser Asn Asp Tyr
Ser Arg Arg Leu Leu Gly Gly Ser Leu Glu275 280
285Phe Phe Tyr Gly Glu Glu Met Ile Thr Asp Asn Lys Ile Met Gly
Ala290 295 300Cys Ala Ser Ser Ser Cys Gly
Gln Ser Thr Asn Trp Gly Glu Thr Ser305 310
315 320Ser Ser Leu Met Tyr Pro Ser Leu Val Ala Ser Asn
Tyr Glu Gly Val325 330 335Arg Gln Glu Met
Pro Arg Glu Cys Ala Phe Gln Glu Leu Ser Tyr Pro340 345
350Gly Ala Gln35517395DNAZea mays 17atggggagag ctccgtgctg
cgacaaggct actgtgaaga agggcccgtg gtcaccggag 60gaggacgcca agctcaagtc
ctacatcgag cagaacggca ccggcggtaa ctggatagcc 120ctgccgcaga agataggttt
gaagaggtgc ggcaagagct gccgcctccg gtggctcaac 180tacctccggc caaacatcaa
gcacggcggg ttctcggatg aggaggacat gataatcctt 240agcctctaca tcagcatagg
cagcaggtgg tcgataatag cggcgcagct gccgggaagg 300acggacaacg acataaagaa
ctactggaac acgaggctca agaagaagct cttcggcaag 360ccgtttaaaa aaggtattcg
gcagggttcc ttttt 39518131PRTZea mays 18Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Thr Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Ser Tyr Ile Glu Gln Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Asp Glu Glu Asp Met Ile Ile Leu65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Phe Gly Lys Pro
Phe Lys Lys Gly Ile Arg Gln115 120 125Gly
Ser Phe13019471DNASorghum bicolor 19atggggagag ctccgtgctg cgacaaggct
actgtgaaga agggtccgtg gtcgccggag 60gaggacgcca agctcaagtc ctacatcgag
cagaacggca ccggcggtaa ctggatagcc 120ctgcctcaga agataggtct gaagaggtgt
ggcaagagct gccgcctccg gtggctcaac 180tacctccggc caaacatcaa gcacggtggg
ttctccgagg aggaagacag aataatcctt 240agcctctaca tcagcatagg cagcaggtgg
tcgataatag cggcgcagct gccggggagg 300acggacaatg acataaagaa ctactggaac
acgaggctca agaagaagct cttcggcaag 360caatcgcgca aggatcaacg gcagcatcag
ttcatgcgcc agcaggcggc agcggcaaac 420gatgggatga tgaagcaaga agcagcacac
cccgggatgc agcccggagc a 47120157PRTSorghum bicolor 20Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Thr Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys
Leu Lys Ser Tyr Ile Glu Gln Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Arg Ile Ile Leu65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Phe Gly Lys Gln
Ser Arg Lys Asp Gln Arg Gln115 120 125His
Gln Phe Met Arg Gln Gln Ala Ala Ala Ala Asn Asp Gly Met Met130
135 140Lys Gln Glu Ala Ala His Pro Gly Met Gln Pro
Gly Ala145 150 15521525DNATriticum
aestivum 21atggggaggg cgccgtgctg cgacaaggcg acggtgaaga agggcccctg
gtcgccggag 60gaggacgcca agctcaaggc ctacatcgac gagaatggca ccggcggcaa
ctggatcgcc 120ctgccgcaga agatcgggct gaagaggtgc ggcaaaagct gtaggctcag
atggctcaac 180tatctgaggc caaacatcaa gcacggcgac ttcacagagg aagaggaaca
catcatttgc 240agcctctaca ttagcatcgg cagcaggtgg tcgatcatcg cggcgcagct
gccgggcaga 300acggacaacg acatcaagaa ctactggaac accaagctca agaagaagct
cctcggcaag 360cgcgcgccgt cccgccgcct gcagcgcgcc aaccaagacg cgccgatgcc
ctactcctac 420ctggcggcgg gcggcggcag cgccagcagc agcgggaacg ctagcggcac
caccgcggca 480ctcagctcgt cagcgctgga gcggatccag ctccacatgc gcctg
52522175PRTTriticum aestivum 22Met Gly Arg Ala Pro Cys Cys
Asp Lys Ala Thr Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ala Tyr Ile
Asp Glu Asn20 25 30Gly Thr Gly Gly Asn
Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asn Ile Lys His Gly Asp Phe
Thr Glu Glu Glu Glu His Ile Ile Cys65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ala Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Arg Ala Pro Ser Arg Arg
Leu Gln115 120 125Arg Ala Asn Gln Asp Ala
Pro Met Pro Tyr Ser Tyr Leu Ala Ala Gly130 135
140Gly Gly Ser Ala Ser Ser Ser Gly Asn Ala Ser Gly Thr Thr Ala
Ala145 150 155 160Leu Ser
Ser Ser Ala Leu Glu Arg Ile Gln Leu His Met Arg Leu165
170 175231035DNAPopulus 23atggggagag ctccttgctg
tgacaaagcc aacgtgaaga aaggtccatg gtctcccgag 60gaagatgcca tactcaaggc
ctatattgag cagcatggaa ccggtggcaa tcggattgcc 120ttgccacaga agataggcct
gaaaaggtgt ggcaagagtt gtcgacttag atggttgaat 180tatctccgtc caaacattaa
acatggaggg ttttcagagg aagaagataa cattatttgc 240agtctctata taagtattgg
aagcaggtgg tctatcattg ctgctcagtt acctggaagg 300accgataatg atataaaaaa
ctactggaac acaaggctta agaagaagct cttaggaaaa 360cagcgcaagg aacaagcagc
ccggcgagct agtctcaagc aagaaatcat gacaaagaga 420gaaataaacg agagcttcat
ggttcctggg gctatccctc atcaacaaag cccttactgg 480ccagaggtac cagctctagt
catgaatcaa aaccaagatt ctcatttgat ggatcaagaa 540tccattagga acttgctgat
caaacttggt gggagatttt ctgacaataa tcaagagtca 600gccctattca ctacagtcaa
caattaccct ctcgacggtt caagcagaca agatcaagta 660ccatacacaa actccataga
tgtgctttct tcttcggcac ccatgcgatc aatggatagt 720actaccagtt gttctcaatt
tcctaatagc aactacaacg gtccaaatat gtgtcaagct 780ggacttgaaa acttgttggt
cgagctaggt ggattggtct gtagcaaccc acagcgtgta 840gaaggtttgg atagtttcta
cgggatggac atggctgcca gtggcagcac gggggctagt 900tctgcagaaa gtaacagctg
gggacatata agctctcttg gttatcctca gcttgtttct 960gactatgaaa cttgcttgca
aagcatgcca caagattcat cttttgaaga ttcaagcttc 1020tttgggccac aatga
103524344PRTPopulus 24Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Ile
Leu Lys Ala Tyr Ile Glu Gln His20 25
30Gly Thr Gly Gly Asn Arg Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln
Arg Lys Glu Gln Ala Ala Arg115 120 125Arg
Ala Ser Leu Lys Gln Glu Ile Met Thr Lys Arg Glu Ile Asn Glu130
135 140Ser Phe Met Val Pro Gly Ala Ile Pro His Gln
Gln Ser Pro Tyr Trp145 150 155
160Pro Glu Val Pro Ala Leu Val Met Asn Gln Asn Gln Asp Ser His
Leu165 170 175Met Asp Gln Glu Ser Ile Arg
Asn Leu Leu Ile Lys Leu Gly Gly Arg180 185
190Phe Ser Asp Asn Asn Gln Glu Ser Ala Leu Phe Thr Thr Val Asn Asn195
200 205Tyr Pro Leu Asp Gly Ser Ser Arg Gln
Asp Gln Val Pro Tyr Thr Asn210 215 220Ser
Ile Asp Val Leu Ser Ser Ser Ala Pro Met Arg Ser Met Asp Ser225
230 235 240Thr Thr Ser Cys Ser Gln
Phe Pro Asn Ser Asn Tyr Asn Gly Pro Asn245 250
255Met Cys Gln Ala Gly Leu Glu Asn Leu Leu Val Glu Leu Gly Gly
Leu260 265 270Val Cys Ser Asn Pro Gln Arg
Val Glu Gly Leu Asp Ser Phe Tyr Gly275 280
285Met Asp Met Ala Ala Ser Gly Ser Thr Gly Ala Ser Ser Ala Glu Ser290
295 300Asn Ser Trp Gly His Ile Ser Ser Leu
Gly Tyr Pro Gln Leu Val Ser305 310 315
320Asp Tyr Glu Thr Cys Leu Gln Ser Met Pro Gln Asp Ser Ser
Phe Glu325 330 335Asp Ser Ser Phe Phe Gly
Pro Gln34025537DNAMedicago truncatula 25atgggaagag ctccttgctg tgacaaagcc
aatgtcaaaa aaggtccatg gtcacctgaa 60gaagatgcta aactcaagtc ttacatagaa
caaaatggta ctggtggaaa ttggattgct 120ctacctcaga agataggact taagagatgt
gggaaaagct gtcgtcttag atggttaaat 180tatcttagac caaatattaa acatggtggt
ttctccgaag aagaagacga cattatttgc 240agcctctatg ttagtatcgg aagcaggtgg
tctatcatag cagcacaatt accaggacga 300acggataatg atataaagaa ctactggaac
actaggctga agaagaagct ccttgggaaa 360caaaggaagg agcaacaagc tcgccgagtt
agcaacatga aacaagagat gaaaagagaa 420acaaatcaga atttgatgat ggcggctttg
ggtgtcaata gtatgagttc agcaccatat 480tggcctgcag agtattctgt tcatcctatg
ccagtttcaa attcatccat aatttga 53726178PRTMedicago truncatula 26Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Ser Tyr Ile Glu Gln Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asp Ile Ile Cys65
70 75 80Ser Leu Tyr Val Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln
Arg Lys Glu Gln Gln Ala Arg115 120 125Arg
Val Ser Asn Met Lys Gln Glu Met Lys Arg Glu Thr Asn Gln Asn130
135 140Leu Met Met Ala Ala Leu Gly Val Asn Ser Met
Ser Ser Ala Pro Tyr145 150 155
160Trp Pro Ala Glu Tyr Ser Val His Pro Met Pro Val Ser Asn Ser
Ser165 170 175Ile Ile27948DNASolanum
laciniatum 27atgggaagag ctccatgttg tgataaagca aatgtaaaaa gaggtccatg
gtctccagaa 60gaagatgcaa aacttaaaga ttttattcat aaatttggta ctgctggtaa
ttggattgca 120cttcctcaaa aagcaggact aaggagatgt ggaaaaagtt gtagattgag
atggctaaat 180tatttaaggc ctaatataaa gcatggtgat ttttctgatg atgaagatag
agttatttgt 240aacttatatg ccaatattgg aagcaggtgg tcaattatag cggcgcaatt
acctggaaga 300acagacaatg atatcaaaaa ttattggaac acgaagctca agaaaaagct
catgggatta 360ataataccta ataacaattc ttcttcttct tataataata ataataataa
ttaccaaaaa 420aaattaccat attttccatc aacttctctt catcaagccc aacaaaatct
tgggcctttt 480attacaggcc cagaacagcc cattattatt ccagcagccc aacaaaacaa
tttgttcacc 540aataataaca acaacttgat gatgatagcc aataatttgc aaaattatcc
aaattttggt 600gctacaaatt acaatttgca aaattatcca aattttgggg aagttgcaag
ttgttcttca 660tcagatggaa gccaaatgag ttttggtact aataaagata ttattaatat
taaaagagaa 720gaaattatga gttttggtgg tcatcatgat ggtgctattt atgaagaaat
taataataac 780caacaattta attttggtta ttttggaaat actatgcagg agattgctac
aagttgttgt 840accaatggca atagtggtag tactaccaat agtagtaata attttttgtt
gtacaataat 900gatgaaaatt gtaacaagtc aaatgagata gggatgtttt attactaa
94828315PRTSolanum lycopersicum 28Met Gly Arg Ala Pro Cys Cys
Asp Lys Ala Asn Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp Phe Ile
His Lys Phe20 25 30Gly Thr Ala Gly Asn
Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Arg35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asn Ile Lys His Gly Asp Phe
Ser Asp Asp Glu Asp Arg Val Ile Cys65 70
75 80Asn Leu Tyr Ala Asn Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ala Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Met Gly Leu Ile Ile Pro Asn Asn Asn
Ser Ser115 120 125Ser Ser Tyr Asn Asn Asn
Asn Asn Asn Tyr Gln Lys Lys Leu Pro Tyr130 135
140Phe Pro Ser Thr Ser Leu His Gln Ala Gln Gln Asn Leu Gly Pro
Phe145 150 155 160Ile Thr
Gly Pro Glu Gln Pro Ile Ile Ile Pro Ala Ala Gln Gln Asn165
170 175Asn Leu Phe Thr Asn Asn Asn Asn Asn Leu Met Met
Ile Ala Asn Asn180 185 190Leu Gln Asn Tyr
Pro Asn Phe Gly Ala Thr Asn Tyr Asn Leu Gln Asn195 200
205Tyr Pro Asn Phe Gly Glu Val Ala Ser Cys Ser Ser Ser Asp
Gly Ser210 215 220Gln Met Ser Phe Gly Thr
Asn Lys Asp Ile Ile Asn Ile Lys Arg Glu225 230
235 240Glu Ile Met Ser Phe Gly Gly His His Asp Gly
Ala Ile Tyr Glu Glu245 250 255Ile Asn Asn
Asn Gln Gln Phe Asn Phe Gly Tyr Phe Gly Asn Thr Met260
265 270Gln Glu Ile Ala Thr Ser Cys Cys Thr Asn Gly Asn
Ser Gly Ser Thr275 280 285Thr Asn Ser Ser
Asn Asn Phe Leu Leu Tyr Asn Asn Asp Glu Asn Cys290 295
300Asn Lys Ser Asn Glu Ile Gly Met Phe Tyr Tyr305
310 31529714DNASolanum lycopersicum 29atgggaagag
ctccatgttg tgataaagca aatgtgaaga aagggccatg gtcaccagaa 60gaagatgcaa
aattaaaaga atatattgac aaatttggca ctggtggaaa ttggattgct 120cttccacaaa
aagctgggct aagaagatgt ggaaaaagct gtcgattaag atggttaaat 180tatcttaggc
caaatattaa acacggagag ttttcagacg aagaagacag aatcatttgc 240agcctttatg
ctaacattgg aagcaggtgg tcaatcatag cagctcaatt accaggcagg 300acagataatg
atatcaaaaa ctattggaac acgaagctga agaagaaatt aatgggattt 360gtctcttcat
ctcacaagat taggcctctt aatcaccatg attatcacca ccaaattccc 420actaattgtt
acaataatta ttcctcactt gttcaagctt catctttatt aatctcatca 480aattatccca
acaacacaac tttcccatgc tatgaaacaa atattcctag tacaacccca 540tcaagtacaa
gtttcttaag cgcgggtgca tctactagtt gtacctcagg cattactgct 600agtactttcg
cgggtcgtac tacctcttct gatgagaggt atgacatttc gaattttaat 660tttcatagct
atatgtataa taacaatggt ggttttagtg aaggagaaaa gtga
71430237PRTSolanum lycopersicum 30Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Ile Asp Lys Phe20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Arg35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Glu Glu
Asp Arg Ile Ile Cys65 70 75
80Ser Leu Tyr Ala Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Met Gly Phe Val Ser Ser Ser His Lys Ile Arg115
120 125Pro Leu Asn His His Asp Tyr His His Gln Ile
Pro Thr Asn Cys Tyr130 135 140Asn Asn Tyr
Ser Ser Leu Val Gln Ala Ser Ser Leu Leu Ile Ser Ser145
150 155 160Asn Tyr Pro Asn Asn Thr Thr
Phe Pro Cys Tyr Glu Thr Asn Ile Pro165 170
175Ser Thr Thr Pro Ser Ser Thr Ser Phe Leu Ser Ala Gly Ala Ser Thr180
185 190Ser Cys Thr Ser Gly Ile Thr Ala Ser
Thr Phe Ala Gly Arg Thr Thr195 200 205Ser
Ser Asp Glu Arg Tyr Asp Ile Ser Asn Phe Asn Phe His Ser Tyr210
215 220Met Tyr Asn Asn Asn Gly Gly Phe Ser Glu Gly
Glu Lys225 230 23531918DNAArabidopsis
thaliana 31atgggaagag caccgtgctg cgacaagatg gcggtgaaga aagggccatg
gtcgacggag 60gaagatgccg tgcttaagtc ttacatcgaa aaacacggca ccggcaacaa
ctggatttcc 120ctccctcaga gaattgggat aaagaggtgc gggaagagtt gtcgtttgag
atggcttaat 180tacttgaggc ctaacttaaa gcatggaggc ttcaccgatg aagaagatta
catcatttgc 240agcctttaca ttactattgg aagcaggtgg tctatcattg cttcacaatt
accgggaaga 300acagacaacg acatcaaaaa ctattggaac acgaggctga agaagaagct
attgagcaag 360caagggaagg catttcatca acaacttaat gtcaaatttg agcgtggaac
aacatcatca 420tcatcgagtc agaaccagat ccaaatcttt catgatgaga acaccaaatc
gaaccaaaca 480ttatataatc aagtggtgga tccatcaatg agagcttttg ccatggaaga
acaaagcatg 540atcaagaatc agatattgga accattttct tgggaaccaa acaaggtttt
gtttgatgtt 600gattatgatg cagctgcttc atcttatcat catcatgcat ccccatcatt
gaactccatg 660agcagtacta gtagtattgg tactaataat tcatctttac aaatgtctca
ctacaccgtc 720aatcacaatg atcatgatca accagatatg ttctttatgg acgggtttga
gaatttccag 780gccgagctat ttgatgagat agccaacaac aacacggtag aaaatggctt
tgacggaacc 840gagatcctga tcaataacaa ctacttggat cacgatatta gctccttcat
tgattatcct 900ctatacgata atgagtag
91832305PRTArabidopsis thaliana 32Met Gly Arg Ala Pro Cys Cys
Asp Lys Met Ala Val Lys Lys Gly Pro1 5 10
15Trp Ser Thr Glu Glu Asp Ala Val Leu Lys Ser Tyr Ile
Glu Lys His20 25 30Gly Thr Gly Asn Asn
Trp Ile Ser Leu Pro Gln Arg Ile Gly Ile Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asn Leu Lys His Gly Gly Phe
Thr Asp Glu Glu Asp Tyr Ile Ile Cys65 70
75 80Ser Leu Tyr Ile Thr Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ser Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Ser Lys Gln Gly Lys Ala Phe His
Gln Gln115 120 125Leu Asn Val Lys Phe Glu
Arg Gly Thr Thr Ser Ser Ser Ser Ser Gln130 135
140Asn Gln Ile Gln Ile Phe His Asp Glu Asn Thr Lys Ser Asn Gln
Thr145 150 155 160Leu Tyr
Asn Gln Val Val Asp Pro Ser Met Arg Ala Phe Ala Met Glu165
170 175Glu Gln Ser Met Ile Lys Asn Gln Ile Leu Glu Pro
Phe Ser Trp Glu180 185 190Pro Asn Lys Val
Leu Phe Asp Val Asp Tyr Asp Ala Ala Ala Ser Ser195 200
205Tyr His His His Ala Ser Pro Ser Leu Asn Ser Met Ser Ser
Thr Ser210 215 220Ser Ile Gly Thr Asn Asn
Ser Ser Leu Gln Met Ser His Tyr Thr Val225 230
235 240Asn His Asn Asp His Asp Gln Pro Asp Met Phe
Phe Met Asp Gly Phe245 250 255Glu Asn Phe
Gln Ala Glu Leu Phe Asp Glu Ile Ala Asn Asn Asn Thr260
265 270Val Glu Asn Gly Phe Asp Gly Thr Glu Ile Leu Ile
Asn Asn Asn Tyr275 280 285Leu Asp His Asp
Ile Ser Ser Phe Ile Asp Tyr Pro Leu Tyr Asp Asn290 295
300Glu30533990DNAArabidopsis thaliana 33atgggaagag
ctccgtgttg cgacaagaca aaagtgaagc gagggccttg gtcgcctgaa 60gaagactcta
aacttagaga ttacattgaa aagtatggta atggtggaaa ttggatctct 120ttccccctca
aagccggttt gaggagatgt gggaagagtt gtagactgag gtggctaaac 180tatttgagac
caaacataaa gcatggtgac ttctctgagg aagaagacag gatcattttt 240agtctcttcg
ctgccatagg aagcaggtgg tcaataatag cagctcatct accgggacga 300acagacaacg
acataaaaaa ctattggaac acaaagctaa ggaagaaact cttgtcttct 360tcctctgatt
catcatcatc agccatggct tctccttatc taaaccctat ttctcaggat 420gtgaaaagac
caacctcacc aacaacaatc ccatcttctt cttacaatcc gtatgctgaa 480aaccctaatc
aatacccaac aaaatccctc atctccagca tcaatggctt cgaagctggt 540gacaaacaga
taatttccta tattaaccct aattatcctc aagatctcta tctctcggac 600agcaacaaca
acacctcgaa cgcaaatggt ttcttgctca accacaatat gtgtgatcag 660tacaagaacc
acaccagttt ttcttcagac gtcaatggga taagatcaga gattatgatg 720aagcaagaag
agataatgat gatgatgatg atagaccacc acattgacca gaggacaaaa 780gggtacaatg
gggaattcac acaagggtat tataattact acaatgggca tggggatttg 840aagcaaatga
ttagtggaac aggcactaat tctaacataa acatgggtgg ttcaggttca 900tcttctagtt
cgataagcaa cctagctgag aacaaaagca gtggtagcct cctactagaa 960tacaaatgct
tgccctattt ctactcctag
99034329PRTArabidopsis thaliana 34Met Gly Arg Ala Pro Cys Cys Asp Lys Thr
Lys Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ser Lys Leu Arg Asp Tyr Ile Glu Lys Tyr20
25 30Gly Asn Gly Gly Asn Trp Ile Ser Phe
Pro Leu Lys Ala Gly Leu Arg35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Asp Phe Ser Glu Glu Glu
Asp Arg Ile Ile Phe65 70 75
80Ser Leu Phe Ala Ala Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala His85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Arg Lys Lys Leu Leu Ser Ser Ser Ser Asp Ser Ser Ser Ser Ala115
120 125Met Ala Ser Pro Tyr Leu Asn Pro Ile Ser Gln
Asp Val Lys Arg Pro130 135 140Thr Ser Pro
Thr Thr Ile Pro Ser Ser Ser Tyr Asn Pro Tyr Ala Glu145
150 155 160Asn Pro Asn Gln Tyr Pro Thr
Lys Ser Leu Ile Ser Ser Ile Asn Gly165 170
175Phe Glu Ala Gly Asp Lys Gln Ile Ile Ser Tyr Ile Asn Pro Asn Tyr180
185 190Pro Gln Asp Leu Tyr Leu Ser Asp Ser
Asn Asn Asn Thr Ser Asn Ala195 200 205Asn
Gly Phe Leu Leu Asn His Asn Met Cys Asp Gln Tyr Lys Asn His210
215 220Thr Ser Phe Ser Ser Asp Val Asn Gly Ile Arg
Ser Glu Ile Met Met225 230 235
240Lys Gln Glu Glu Ile Met Met Met Met Met Ile Asp His His Ile
Asp245 250 255Gln Arg Thr Lys Gly Tyr Asn
Gly Glu Phe Thr Gln Gly Tyr Tyr Asn260 265
270Tyr Tyr Asn Gly His Gly Asp Leu Lys Gln Met Ile Ser Gly Thr Gly275
280 285Thr Asn Ser Asn Ile Asn Met Gly Gly
Ser Gly Ser Ser Ser Ser Ser290 295 300Ile
Ser Asn Leu Ala Glu Asn Lys Ser Ser Gly Ser Leu Leu Leu Glu305
310 315 320Tyr Lys Cys Leu Pro Tyr
Phe Tyr Ser32535897DNAArabidopsis thaliana 35atgggtaggg ctccatgttg
tgacaaggca aatgtgaaga gaggtccatg gtctcctgaa 60gaagacgcaa agcttaaaga
ttacatcgag aaacaaggaa ctggtggcaa ttggattgct 120ctccctcaca aagctggttt
aaggagatgt gggaagagtt gcagactgag atggttaaat 180tatttgagac caaacataag
acatggagat ttcactgaag aagaagacaa tattatctac 240agcctctttg cctccattgg
aagcaggtgg tcagtaatag cagctcactt gcaaggtaga 300actgataatg acatcaagaa
ctattggaac actaagctca agaagaagct catagccacc 360atggctcctc ctccacatca
ccacttagcc attgctacat catcatcatc agcatcccca 420tcatcatcat cacattacaa
catgatcaat agtcttcttc cgtataaccc atcaacaaac 480caacttctca cacctcatca
gggtatcatg atgacaatga tgggccaaca acaacaacta 540ttttatcaag aagacatggg
caatttggta aattctccaa acagaaacaa tctcataatg 600agccatcaag aagacaacca
agagcaaagt acaaacaagg gaataatgtt gttgagtgat 660gtaagaagtg ggtcgagtac
aacaagtaca gtaacaagag tgaagatgga acatcgtgat 720catgatgatc atcatcatca
tcatgaagaa gatgagagat caatgacctc ggtagtgatg 780gaagattatg gaatggagga
gatcaagcaa ttaataagta gtagttgtac gagtagtaac 840aatagcttgt ggtttgacga
aaacaagacg gaggataagt tcatgttgta ctactga 89736298PRTArabidopsis
thaliana 36Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Lys Gln20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ala
Gly Leu Arg35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Arg His Gly Asp Phe Thr Glu Glu Glu Asp Asn Ile Ile
Tyr65 70 75 80Ser Leu
Phe Ala Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala His85
90 95Leu Gln Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Lys100 105 110Leu Lys Lys Lys
Leu Ile Ala Thr Met Ala Pro Pro Pro His His His115 120
125Leu Ala Ile Ala Thr Ser Ser Ser Ser Ala Ser Pro Ser Ser
Ser Ser130 135 140His Tyr Asn Met Ile Asn
Ser Leu Leu Pro Tyr Asn Pro Ser Thr Asn145 150
155 160Gln Leu Leu Thr Pro His Gln Gly Ile Met Met
Thr Met Met Gly Gln165 170 175Gln Gln Gln
Leu Phe Tyr Gln Glu Asp Met Gly Asn Leu Val Asn Ser180
185 190Pro Asn Arg Asn Asn Leu Ile Met Ser His Gln Glu
Asp Asn Gln Glu195 200 205Gln Ser Thr Asn
Lys Gly Ile Met Leu Leu Ser Asp Val Arg Ser Gly210 215
220Ser Ser Thr Thr Ser Thr Val Thr Arg Val Lys Met Glu His
Arg Asp225 230 235 240His
Asp Asp His His His His His Glu Glu Asp Glu Arg Ser Met Thr245
250 255Ser Val Val Met Glu Asp Tyr Gly Met Glu Glu
Ile Lys Gln Leu Ile260 265 270Ser Ser Ser
Cys Thr Ser Ser Asn Asn Ser Leu Trp Phe Asp Glu Asn275
280 285Lys Thr Glu Asp Lys Phe Met Leu Tyr Tyr290
29537622DNAAegilops speltoides 37ggcgccgagc tgcaacaagg cgaccgtgaa
gagggcgcca tgctcaccgg aggaggacgc 60gatgctcaag gcctacattg aggagcgtgg
caccggcaac aactggattg cactgccaca 120caagattggg ctgaagagat gcggcaagag
ctgcaggctg aggtggctca actacctgag 180gcccaacata aagcactggg acttcacccc
agaggaggac agcaccatct gcaagctcta 240cattagcatc gggagcaggt ggtcaatcat
cgccgcacag ctgccaggaa ggaccgacaa 300cgacgtcaag aactactgga acaccaagct
caagaagcgg ctccttggcg gccgccgcaa 360ggaccgcggc gccggcacgc agcagcaccg
ccagggagag ctggacggcg gcaacaacga 420aggggagcag cagccgctga gcgcgtccgc
gatggagagg atccagctct gcatgcagct 480gcaggaaatg cagaaccccc tgagcagcat
cggcaaccac aacaacccct tgcacctgtg 540gcagcctgga agccatcatc aggtggccgc
cactcacagt aacaacaggc acaacaacag 600caacagcagc cgcagcagca gc
62238693DNAAntirrhinum majus
38atgggaagag ctccatgttg tgataaggca aatgtaaaaa aagggccatg gtcacctgaa
60gaagatgcaa agcttaaaga gtacatagag caacaaggca gtgttggaaa ttggattgct
120cttccacaaa aagctggttt gagaagatgt ggaaagagct gcagattgag atggttgaac
180tatcttagac caaacattaa gcatggagag ttttcagatg atgaagatag aatcatttgc
240agcctctttg ctaacattgg aagcaggtgg tcaataatag cagctcaatt accaggaaga
300actgataatg acatcaagaa ctattggaac acaaaactca agaaaaaact catgggagca
360atgtcccctt catatcagaa aaaacctcat caagctacaa cttttccttt accatctact
420cataattttc tctcacaaga ctctccatat tcatctcttc tctcacaaat ctcatcagca
480tatgaaggta acaacaacaa ctattacaac acctcaatta aggccttctc aagagcctat
540gaacccataa tttctccaaa tcctaatgcc cctaattcag tcctacaaac acaagattca
600tatgtgggtc ccatgcatgg cgattacaat aataataatc tcctaatctt tggaggggaa
660ccaagctgca gttcatctga tgggagtcaa att
69339231PRTAntirrhinum majus 39Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Ile Glu Gln Gln20
25 30Gly Ser Val Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Arg35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Asp Glu
Asp Arg Ile Ile Cys65 70 75
80Ser Leu Phe Ala Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Met Gly Ala Met Ser Pro Ser Tyr Gln Lys Lys115
120 125Pro His Gln Ala Thr Thr Phe Pro Leu Pro Ser
Thr His Asn Phe Leu130 135 140Ser Gln Asp
Ser Pro Tyr Ser Ser Leu Leu Ser Gln Ile Ser Ser Ala145
150 155 160Tyr Glu Gly Asn Asn Asn Asn
Tyr Tyr Asn Thr Ser Ile Lys Ala Phe165 170
175Ser Arg Ala Tyr Glu Pro Ile Ile Ser Pro Asn Pro Asn Ala Pro Asn180
185 190Ser Val Leu Gln Thr Gln Asp Ser Tyr
Val Gly Pro Met His Gly Asp195 200 205Tyr
Asn Asn Asn Asn Leu Leu Ile Phe Gly Gly Glu Pro Ser Cys Ser210
215 220Ser Ser Asp Gly Ser Gln Ile225
230401050DNAAquilegia 40atgggaagag ctccttgttg tgacaaagcc aatgttaaga
aaggtccatg gtcacctgaa 60gaagatgcaa aactcaaaga atacattgaa caacatggta
caggagggaa ttggatcgcc 120ttgccacaaa agattggcct taagagatgt gggaagagct
gtcgtctcag atggttgaat 180tatctccgtc caaatcttaa gcatggaggt ttctctgaag
aagaagatca catgatttgc 240agcctctata tgagtattgg aagccgatgg tccatcattg
cagcacaatt acctggtcgg 300actgacaacg acataaaaaa ctattggaac actaagctga
agaagaagct cttgggaaaa 360caaaggaagg aacaccaagc tcgtagagct agctacctaa
agcaagacag catgaagaaa 420agcaattcta ttccactggt aattgcagat catgggatgc
aaacgccgta ctggccacca 480gagccactga tgcctatatc aacaccgtat ccaaaccaag
gcaatcggca taatgatcat 540gcatctctta gaaaattact gatcaaactt ggggggaagt
tttctgatga ttatcaacaa 600gaacccaaca acgatgcaaa gaataatctt cacctcccca
ttgaatgttc ctcaacatat 660caacaattca atgatcagaa atccattaat ctattctctt
cttcgtcttc cataagttcc 720ttggatattc cctattctaa attgttgacc aacgagtaca
atatcgaggg agcaagcatg 780cacatgttgc aaggccttaa cggatttccg gttgagtttg
atgagatgat atgtagcaat 840ccagagagtt tagatgaatt ggaatgccat tatggagttg
gtatggacaa tggagtaagc 900agtagtggtc caatatcaac agaaagcact acctgggatg
acatgggatc tttcatcaac 960cctcctaata tcaccaacta tgaagcttta caacaagaag
tgatacaaga atgtgcatta 1020ttcgaccagt cgaggtacct tggatttcag
105041350PRTAquilegia 41Met Gly Arg Ala Pro Cys Cys
Asp Lys Ala Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Ile
Glu Gln His20 25 30Gly Thr Gly Gly Asn
Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asn Leu Lys His Gly Gly Phe
Ser Glu Glu Glu Asp His Met Ile Cys65 70
75 80Ser Leu Tyr Met Ser Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ala Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Glu His Gln
Ala Arg115 120 125Arg Ala Ser Tyr Leu Lys
Gln Asp Ser Met Lys Lys Ser Asn Ser Ile130 135
140Pro Leu Val Ile Ala Asp His Gly Met Gln Thr Pro Tyr Trp Pro
Pro145 150 155 160Glu Pro
Leu Met Pro Ile Ser Thr Pro Tyr Pro Asn Gln Gly Asn Arg165
170 175His Asn Asp His Ala Ser Leu Arg Lys Leu Leu Ile
Lys Leu Gly Gly180 185 190Lys Phe Ser Asp
Asp Tyr Gln Gln Glu Pro Asn Asn Asp Ala Lys Asn195 200
205Asn Leu His Leu Pro Ile Glu Cys Ser Ser Thr Tyr Gln Gln
Phe Asn210 215 220Asp Gln Lys Ser Ile Asn
Leu Phe Ser Ser Ser Ser Ser Ile Ser Ser225 230
235 240Leu Asp Ile Pro Tyr Ser Lys Leu Leu Thr Asn
Glu Tyr Asn Ile Glu245 250 255Gly Ala Ser
Met His Met Leu Gln Gly Leu Asn Gly Phe Pro Val Glu260
265 270Phe Asp Glu Met Ile Cys Ser Asn Pro Glu Ser Leu
Asp Glu Leu Glu275 280 285Cys His Tyr Gly
Val Gly Met Asp Asn Gly Val Ser Ser Ser Gly Pro290 295
300Ile Ser Thr Glu Ser Thr Thr Trp Asp Asp Met Gly Ser Phe
Ile Asn305 310 315 320Pro
Pro Asn Ile Thr Asn Tyr Glu Ala Leu Gln Gln Glu Val Ile Gln325
330 335Glu Cys Ala Leu Phe Asp Gln Ser Arg Tyr Leu
Gly Phe Gln340 345 350421005DNAAquilegia
42atgggaaggg caccttgttg tgacaaagca aatgtcaaga aaggaccatg gtctcctgaa
60gaagacacaa ggcttaagga gtacatagag aaatttggga ctggtgggaa ctggattgct
120cttccacaaa aagctggtct aaagagatgt gggaagagtt gtagattgag atggcttaat
180tatcttaggc caaacataaa acatggccaa ttctccgatg atgaagataa agtgatctgc
240agcctctttg ctagcatagg tagcaggtgg tcaataatag ctgcacagtt gccaggcagg
300actgacaatg atatcaaaaa ctattggaac accaagctca agaaaaaatt tatggggttt
360gttccttcat cacagattaa agcacttcca ccaatctttc catctccatt ccacactgga
420tcaacatatg attactaccc tttatcaaaa tctcttccag accttgaatc tctttcaatc
480ccatcaaatt ttttgaacaa cactaacact acaaccagta ttattagtac atctcttcat
540gaatcccaac aaaatttgga aggtttcatg caccattatc aagagaaaga caactttctt
600atctttggag gtgaacctag ttgcagttct tctgatggaa gctgtactaa tcaaataagc
660tacaacaaag atattgagta tgactatagt ggcaatggca atggtggtgg tggtggtgga
720aatggtccta atggtgtaag attaggccaa gagagttatt tttgcaatgg agttcaagat
780aatcagaaat tcatgcttgg taatggttcc attggtagta atggatggtc tgatcagagg
840ctaaatagtg gattaatgtg gggtgatgat cagacatcat taccattaga tcatcatcat
900tatgatggtc ttgaggatat taaactagtc aaaaacagta gtggttcttg tagtaatatt
960ttcaatggtg atgaaactaa agcacagagt agaatcatgt acttc
100543335PRTAquilegia 43Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val
Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Thr Arg Leu Lys Glu Tyr Ile Glu Lys Phe20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ala Gly Leu Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gln Phe Ser Asp Asp Glu Asp
Lys Val Ile Cys65 70 75
80Ser Leu Phe Ala Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Lys Phe Met Gly Phe Val Pro Ser Ser Gln Ile Lys Ala115
120 125Leu Pro Pro Ile Phe Pro Ser Pro Phe His Thr Gly
Ser Thr Tyr Asp130 135 140Tyr Tyr Pro Leu
Ser Lys Ser Leu Pro Asp Leu Glu Ser Leu Ser Ile145 150
155 160Pro Ser Asn Phe Leu Asn Asn Thr Asn
Thr Thr Thr Ser Ile Ile Ser165 170 175Thr
Ser Leu His Glu Ser Gln Gln Asn Leu Glu Gly Phe Met His His180
185 190Tyr Gln Glu Lys Asp Asn Phe Leu Ile Phe Gly
Gly Glu Pro Ser Cys195 200 205Ser Ser Ser
Asp Gly Ser Cys Thr Asn Gln Ile Ser Tyr Asn Lys Asp210
215 220Ile Glu Tyr Asp Tyr Ser Gly Asn Gly Asn Gly Gly
Gly Gly Gly Gly225 230 235
240Asn Gly Pro Asn Gly Val Arg Leu Gly Gln Glu Ser Tyr Phe Cys Asn245
250 255Gly Val Gln Asp Asn Gln Lys Phe Met
Leu Gly Asn Gly Ser Ile Gly260 265 270Ser
Asn Gly Trp Ser Asp Gln Arg Leu Asn Ser Gly Leu Met Trp Gly275
280 285Asp Asp Gln Thr Ser Leu Pro Leu Asp His His
His Tyr Asp Gly Leu290 295 300Glu Asp Ile
Lys Leu Val Lys Asn Ser Ser Gly Ser Cys Ser Asn Ile305
310 315 320Phe Asn Gly Asp Glu Thr Lys
Ala Gln Ser Arg Ile Met Tyr Phe325 330
33544435DNAArachis hypogaeamisc_feature(335)..(335)n can be any
nucleotide 44atgggaagag ctccttgttg tgacaaagca aatgtgaaga gaggaccatg
gtctcctgat 60gaggatgcaa cactcaagaa ctatcttcac actcatggca ctggaggaaa
ttggattgca 120ttgccaagaa aagctggttt aaggaggtgt gggaagagtt gccgtctaag
gtggctgaat 180tatctaaggc cagatataaa acatggagga tttaccgaac aagaggatca
aatcatttgc 240actctctata ctcaaatggg aagcagatgg tctgcaatag catctcaact
tcctggcaga 300acagacaatg atgtcaaaaa ctattggaac accangctcn agaagaagct
aatggcagga 360aaagtaattt cccctantaa taataatant aataaaacat tattgactac
tcancgaaac 420tcangatttt canga
43545145PRTArachis hypogaeaMISC_FEATURE(112)..(112)X can be
any amino acid 45Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg
Gly Pro1 5 10 15Trp Ser
Pro Asp Glu Asp Ala Thr Leu Lys Asn Tyr Leu His Thr His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Arg Lys
Ala Gly Leu Arg35 40 45Arg Cys Gly Lys
Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asp Ile Lys His Gly Gly Phe Thr Glu Gln Glu Asp Gln Ile
Ile Cys65 70 75 80Thr
Leu Tyr Thr Gln Met Gly Ser Arg Trp Ser Ala Ile Ala Ser Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn
Tyr Trp Asn Thr Xaa100 105 110Leu Xaa Lys
Lys Leu Met Ala Gly Lys Val Ile Ser Pro Xaa Asn Asn115
120 125Asn Xaa Asn Lys Thr Leu Leu Thr Thr Xaa Arg Asn
Ser Xaa Phe Ser130 135
140Xaa14546476DNAArachis hypogaea 46atgggaagag ctccttgttg tgacaaagca
aatgtgaaga gaggaccatg gtctcctgat 60gaggatgcaa cactcaagaa ctatcttcac
actcatggca ctggaggaaa ttggattgca 120ttgccaagaa aagctggttt aaggaggtgt
gggaagagtt gccgtctaag gtggctgaat 180tatctaaggc cagatataaa acatggagga
tttaccgaac aagaggatca aatcatttgc 240actctctata ctcaaatggg aagcagatgg
tctgcaatag catctcaact tcctggcaga 300acagacaatg atgtcaaaaa ctattggaac
accaagctca agaagaagct aatggcagga 360aaagtaattt cccctaataa taataataat
aataaaacat tattgactac tcaacaaaac 420tcagattttc aagataataa aaattgttca
ccaccctcat catcatcatc attagt 47647159PRTArachis hypogaea 47Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Asp Glu Asp Ala Thr
Leu Lys Asn Tyr Leu His Thr His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Arg Lys Ala Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asp Ile
Lys His Gly Gly Phe Thr Glu Gln Glu Asp Gln Ile Ile Cys65
70 75 80Thr Leu Tyr Thr Gln Met Gly
Ser Arg Trp Ser Ala Ile Ala Ser Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Met Ala Gly Lys
Val Ile Ser Pro Asn Asn Asn115 120 125Asn
Asn Asn Lys Thr Leu Leu Thr Thr Gln Gln Asn Ser Asp Phe Gln130
135 140Asp Asn Lys Asn Cys Ser Pro Pro Ser Ser Ser
Ser Ser Leu Val145 150 15548744DNAArachis
stenosperma 48atgggaagag ctccatgttg tgacaaggca aatgtgaaga gaggaccatg
gtcacctgaa 60gaagacttaa agctcaaaga atacatacac aagcatggca ctggtggcaa
ttggattgct 120cttcctcaaa aagctggtct taagagatgt gggaagagtt gcagactgag
atggcttaac 180tatctgaggc caaacattaa gcatggagaa ttttctgaag aggaagacag
aatcatttgc 240agcctctatg ttaacattgg aagcagatgg tcaattatag cagctcagtt
gccaggaagg 300actgacaatg atataaagaa ttattggaac actaagctca agaaaaaact
catctctggt 360ttcattccca attcttcttc tattcgacaa agaaaacatc atcatcatca
acaacaacaa 420caacctccat ttccatcatc atcaataatg tacatggatc aatgttatgg
atcttatgtc 480acaggccttg ttgatcctat ttcacttcct tcaactgaat actatgcaaa
cacaacaaca 540acaacctcag ttccatttta ccagcaccaa gtagattcca tagttagccc
cagcagcatg 600caacaaaatt atcacatgtt tggaagtgaa ggtagttgca gtttctctga
taatggaagc 660agtatcaagc aagaggaaat tgggtatcat catcaaggat catacaacaa
cattattgca 720tttgatgagt tcaacaacac acaa
74449248PRTArachis stenosperma 49Met Gly Arg Ala Pro Cys Cys
Asp Lys Ala Asn Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Leu Lys Leu Lys Glu Tyr Ile
His Lys His20 25 30Gly Thr Gly Gly Asn
Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asn Ile Lys His Gly Glu Phe
Ser Glu Glu Glu Asp Arg Ile Ile Cys65 70
75 80Ser Leu Tyr Val Asn Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ala Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Ile Ser Gly Phe Ile Pro Asn Ser Ser
Ser Ile115 120 125Arg Gln Arg Lys His His
His His Gln Gln Gln Gln Gln Pro Pro Phe130 135
140Pro Ser Ser Ser Ile Met Tyr Met Asp Gln Cys Tyr Gly Ser Tyr
Val145 150 155 160Thr Gly
Leu Val Asp Pro Ile Ser Leu Pro Ser Thr Glu Tyr Tyr Ala165
170 175Asn Thr Thr Thr Thr Thr Ser Val Pro Phe Tyr Gln
His Gln Val Asp180 185 190Ser Ile Val Ser
Pro Ser Ser Met Gln Gln Asn Tyr His Met Phe Gly195 200
205Ser Glu Gly Ser Cys Ser Phe Ser Asp Asn Gly Ser Ser Ile
Lys Gln210 215 220Glu Glu Ile Gly Tyr His
His Gln Gly Ser Tyr Asn Asn Ile Ile Ala225 230
235 240Phe Asp Glu Phe Asn Asn Thr
Gln24550477DNABrachypodium distachyon 50atggggcgcg cgccgtgctg cgacaaggcg
agcgtgaagc gcgggccgtg gtcgccggag 60gaggacgagc agctccggcg ctacgtccgc
gcccatggca tcggcggcaa ctggatcgcc 120ctgccgcaca aagccgggct gaagcggtgc
ggcaagagct gccggctgcg gtggctgaac 180tacctgcggc cggacatcag gcacggcggg
tacacggccg aggaggaccg ggtcatctgc 240tccctatacg ggtcaatcgg cagccggtgg
tcgattatcg cgtccaagct ccccggccgc 300acggacaacg acgtcaagaa ctactggaac
accaagctca agaagaaggc cgtcgccatg 360gggatgcagc atgccaccgg aggatcagcc
ttctccgctc cccatagcca gtgcgcgctc 420tcgccggcgg cctcgggctc ctcctcgtcg
accgtcgaca ccactagctc cagcgcc 47751159PRTBrachypodium distachyon
51Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Ser Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Glu Gln Leu Arg Arg Tyr Val Arg Ala His20 25
30Gly Ile Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asp
Ile Arg His Gly Gly Tyr Thr Ala Glu Glu Asp Arg Val Ile Cys65
70 75 80Ser Leu Tyr Gly Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ser Lys85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Ala Val Ala Met
Gly Met Gln His Ala Thr Gly Gly115 120
125Ser Ala Phe Ser Ala Pro His Ser Gln Cys Ala Leu Ser Pro Ala Ala130
135 140Ser Gly Ser Ser Ser Ser Thr Val Asp
Thr Thr Ser Ser Ser Ala145 150
15552717DNABrachypodium distachyon 52atggggaggg cgccgtgctg cgacagggcg
gcggtgaagc gtgggccgtg gtcgccggag 60gaggacgaca agctgcgcga ctacatccag
cgccacggca ccggcggcag ctggatcacc 120ttccccaaga aagccgggct gaggaggtgt
ggcaagagct gcaggctgcg gtggctcaac 180tacctccgcc cggacatccg gcacggcggc
ttcaccgacg aggaggacgc gctcatcttc 240tccctctaca gcaagctcgg cagcaagtgg
tcgctgatcg cgtcgcagct ggagaggagg 300acggacaacg acgtcaagaa ccactggaac
accaagctca agaagcgcct cgcagccttc 360tcctctcccc cgtcgtcttc ctcctccttc
ctgccggcgc ctgcgcccat ggccgtggcc 420gtcgcgcacc cgctcgccct ggccgtgccg
accgtcaagg ccgagacgta cgcctacgac 480gacttcatgg cgccgccggc cgcgctccac
gtcttcgacc acccgttcgg caacggcgcc 540gatcagcccg gctccacgac gtccgcctcc
gcagcgtcgt ccatgtccaa ctggtcgtcc 600gcggcggaca acgcgggggc cgccgacggg
ttcttcgcgg acttctgcaa cgccggcgcg 660gcggaccagt tcctcggcgg cttctactac
cctctcgatc ccaccttgtc gctagtc 71753239PRTBrachypodium distachyon
53Met Gly Arg Ala Pro Cys Cys Asp Arg Ala Ala Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Asp Lys Leu Arg Asp Tyr Ile Gln Arg His20 25
30Gly Thr Gly Gly Ser Trp Ile Thr Phe Pro Lys Lys Ala Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asp
Ile Arg His Gly Gly Phe Thr Asp Glu Glu Asp Ala Leu Ile Phe65
70 75 80Ser Leu Tyr Ser Lys Leu
Gly Ser Lys Trp Ser Leu Ile Ala Ser Gln85 90
95Leu Glu Arg Arg Thr Asp Asn Asp Val Lys Asn His Trp Asn Thr Lys100
105 110Leu Lys Lys Arg Leu Ala Ala Phe
Ser Ser Pro Pro Ser Ser Ser Ser115 120
125Ser Phe Leu Pro Ala Pro Ala Pro Met Ala Val Ala Val Ala His Pro130
135 140Leu Ala Leu Ala Val Pro Thr Val Lys
Ala Glu Thr Tyr Ala Tyr Asp145 150 155
160Asp Phe Met Ala Pro Pro Ala Ala Leu His Val Phe Asp His
Pro Phe165 170 175Gly Asn Gly Ala Asp Gln
Pro Gly Ser Thr Thr Ser Ala Ser Ala Ala180 185
190Ser Ser Met Ser Asn Trp Ser Ser Ala Ala Asp Asn Ala Gly Ala
Ala195 200 205Asp Gly Phe Phe Ala Asp Phe
Cys Asn Ala Gly Ala Ala Asp Gln Phe210 215
220Leu Gly Gly Phe Tyr Tyr Pro Leu Asp Pro Thr Leu Ser Leu Val225
230 235541092DNABrassica napus 54atgggaaggg
caccgtgttg tgacaaagcc aacgtgaaga aagggccttg gtctcctgag 60gaagatgcca
aactcaaaga ttacatcgag aatagtggta caggaggcaa ctggatcgct 120ttgcctcaga
agattggtct aaggagatgt gggaagagtt gcagactaag gtggctcaac 180tatttgagac
caaacatcaa acatggtggc ttctctgagg aggaagacac catcatttgt 240aacctttatg
ttactattgg tagcaggtgg tctataattg ctgcacaact gccgggaaga 300acggacaacg
atatcaagaa ctattggaac acgaggctaa agaagaagct tctaaacaaa 360caaaggacag
agttccaaga agctcggatg aagcaagaga tggtgatgat gaagagacaa 420caacaaggac
atgaccacat caatggtagt acggatcttt atctgaaaaa catgtttgga 480tcatcaccat
ggccattact acaacagctt cctcatcatc aagtacctct tgtgatgatg 540gaaccaacaa
gttgtaacta ctaccaaacg tcaccctctt gtaacctaga acaaaagcca 600cttatcactt
tcaataacat ggtcaagatt gaagaagaac cggagaaaac aaaccctgat 660catcctcagc
atcaaaattc tatcacaaac ccttttgatg tctccgtctc ccagctcttg 720ttagatcctg
attactactt aggatcagga ggaggagaag gggattttgc tatcatgagt 780agcagcacaa
attctccatt accaaacaca agtggtgatc aaaatgaaca tcagcagcaa 840gagattcttc
aatggtttgg gagtagtaat cttcagacag aagcaagcag tgatatgttc 900ttaaacaaca
tagggaatct tgagaccaac gaggacacaa gattctactc atcattagcc 960ggtgctggag
cggctccggc aggaggaacg acgagcacat cggcagatca aagcacaata 1020agttgggagg
acataacctc tcttgttaat tccgaagatg caagttactt caatggggca 1080aatcatttgt
aa
109255363PRTBrassica napus 55Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn
Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Asn Ser20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Arg35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp
Thr Ile Ile Cys65 70 75
80Asn Leu Tyr Val Thr Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Asn Lys Gln Arg Thr Glu Phe Gln Glu Ala115
120 125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln
Gln Gln Gly His130 135 140Asp His Ile Asn
Gly Ser Thr Asp Leu Tyr Leu Lys Asn Met Phe Gly145 150
155 160Ser Ser Pro Trp Pro Leu Leu Gln Gln
Leu Pro His His Gln Val Pro165 170 175Leu
Val Met Met Glu Pro Thr Ser Cys Asn Tyr Tyr Gln Thr Ser Pro180
185 190Ser Cys Asn Leu Glu Gln Lys Pro Leu Ile Thr
Phe Asn Asn Met Val195 200 205Lys Ile Glu
Glu Glu Pro Glu Lys Thr Asn Pro Asp His Pro Gln His210
215 220Gln Asn Ser Ile Thr Asn Pro Phe Asp Val Ser Val
Ser Gln Leu Leu225 230 235
240Leu Asp Pro Asp Tyr Tyr Leu Gly Ser Gly Gly Gly Glu Gly Asp Phe245
250 255Ala Ile Met Ser Ser Ser Thr Asn Ser
Pro Leu Pro Asn Thr Ser Gly260 265 270Asp
Gln Asn Glu His Gln Gln Gln Glu Ile Leu Gln Trp Phe Gly Ser275
280 285Ser Asn Leu Gln Thr Glu Ala Ser Ser Asp Met
Phe Leu Asn Asn Ile290 295 300Gly Asn Leu
Glu Thr Asn Glu Asp Thr Arg Phe Tyr Ser Ser Leu Ala305
310 315 320Gly Ala Gly Ala Ala Pro Ala
Gly Gly Thr Thr Ser Thr Ser Ala Asp325 330
335Gln Ser Thr Ile Ser Trp Glu Asp Ile Thr Ser Leu Val Asn Ser Glu340
345 350Asp Ala Ser Tyr Phe Asn Gly Ala Asn
His Leu355 360561116DNABrassica napus 56atgggaagag
caccgtgttg tgataaggct aacgtgaaga aagggccttg gtctcctgaa 60gaagatgcaa
agctcaaaga ttacatcgag agtagtggca caggaggcaa ctggatcgct 120ttgcctcaga
agattggtct aaggagatgt gggaagagtt gcagactaag gtggcttaac 180tatttgagac
ctaacatcaa acatggtggc ttctctgagg aagaggacaa catcatttgt 240aacctctatg
ttactattgg tagcaggtgg tctataattg ctgcacaatt gcctggaaga 300acagacaacg
atatcaagaa ctattggaac acgaggctga agaagaagct tcttaacaaa 360caaagaaaag
agttccaaga agctcggatg aagcaagaga tggtgatgat gaaacgacag 420caacaaggac
aaggacaatg ccaaagtaat ggtagtacgg atctttatct gaacaacatg 480tttagatcat
caccatggcc attactgcct catcttcctc ctcctcatag tcaagtacct 540cttgtgatga
tggaaccaac aagctgcaat tactaccaac cgacaccgtc ttgcgcatta 600gaacaaaagc
cattgatccc actcaagaac atggtcaaga ttgaagcaga accggagaga 660tcaaaccctg
atcatcatta tccggaagac tcaatgacaa actcttttga tctttccttc 720tctcagcttt
tgttagatcc taattactac ctggaatcag gaggaggaga aggagagttt 780gctctcatga
gtagaagtac gaactctcca ttaccaaaca caagtagtga tcaccatcaa 840catcaaaaag
agattactca atggtttggg agtagtaatt ttcagacaga agctatcaat 900gatgtgttct
taaacaacaa caacttagcg aattttgaga ccaacgacga gaacgcaaaa 960ctatatgcaa
actcatcagt agccggagct ggagcagcgt ttgccggagg aacgacgagt 1020acatcggctg
atcaaagcac aataagttgg gaggacataa cctctcttgt taactccgat 1080gatgcaagat
acttcaatgg gccaaataat gtgtaa
111657371PRTBrassica napus 57Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn
Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Ser Ser20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Arg35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp
Asn Ile Ile Cys65 70 75
80Asn Leu Tyr Val Thr Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Asn Lys Gln Arg Lys Glu Phe Gln Glu Ala115
120 125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln
Gln Gln Gly Gln130 135 140Gly Gln Cys Gln
Ser Asn Gly Ser Thr Asp Leu Tyr Leu Asn Asn Met145 150
155 160Phe Arg Ser Ser Pro Trp Pro Leu Leu
Pro His Leu Pro Pro Pro His165 170 175Ser
Gln Val Pro Leu Val Met Met Glu Pro Thr Ser Cys Asn Tyr Tyr180
185 190Gln Pro Thr Pro Ser Cys Ala Leu Glu Gln Lys
Pro Leu Ile Pro Leu195 200 205Lys Asn Met
Val Lys Ile Glu Ala Glu Pro Glu Arg Ser Asn Pro Asp210
215 220His His Tyr Pro Glu Asp Ser Met Thr Asn Ser Phe
Asp Leu Ser Phe225 230 235
240Ser Gln Leu Leu Leu Asp Pro Asn Tyr Tyr Leu Glu Ser Gly Gly Gly245
250 255Glu Gly Glu Phe Ala Leu Met Ser Arg
Ser Thr Asn Ser Pro Leu Pro260 265 270Asn
Thr Ser Ser Asp His His Gln His Gln Lys Glu Ile Thr Gln Trp275
280 285Phe Gly Ser Ser Asn Phe Gln Thr Glu Ala Ile
Asn Asp Val Phe Leu290 295 300Asn Asn Asn
Asn Leu Ala Asn Phe Glu Thr Asn Asp Glu Asn Ala Lys305
310 315 320Leu Tyr Ala Asn Ser Ser Val
Ala Gly Ala Gly Ala Ala Phe Ala Gly325 330
335Gly Thr Thr Ser Thr Ser Ala Asp Gln Ser Thr Ile Ser Trp Glu Asp340
345 350Ile Thr Ser Leu Val Asn Ser Asp Asp
Ala Arg Tyr Phe Asn Gly Pro355 360 365Asn
Asn Val370581454DNABrassica napus 58ctcctcttca tcatcaacat agttgagaaa
gataaaaaat aaagagggag agaaagagaa 60agaaacacat caagaacaag aataacgaat
caagaagatg ggaagagcac cgtgttgtga 120caaggctaac gtgaagaaag ggccgtggtc
tcctgaagaa gacgcaaaac tcaaagatta 180catcgagaat aatggcacag gaggcaactg
gatcgcgttg cctcagaaga ttggtctaag 240aagatgtggg aagagttgca gactaaggtg
gctcaactat ttgagaccaa acatcaaaca 300tggtggcttc tctgaggaag aggacaacat
catttgtaac ctctatgtta ccattggtag 360caggtggtct ataattgctg cacaattgcc
tggaagaaca gacaacgata tcaagaacta 420ttggaacacg aggctgaaga agaagcttct
taacaaacaa agaaaagagt accaagaagc 480tcggatgaag caagagatgg tgatgatgaa
agcgacaagc aacaagggac aatgccaaag 540taatgctagt acggatcttt attctgaaca
acatgtttgg atcatcacca tggccattac 600tgcctcagct tcctcctcct cattgtcaag
tacctcttgt gatgatggaa ccaacaagct 660gcaattacta ccaaccgaca ccgtcttgcg
cattagaaca aaagccattg atcccactca 720agaacatggt caagattgaa gaagaaccgg
agagatcaaa ccctgatcat cattatccgg 780aagactcaat gacaaactct tttgatcttt
ccttttctca gcttttgtta gatcctaatt 840actacctgga atcaggagga ggagaaggag
agtttgctct catgagtagc agtacgaact 900ctccattacc aaacacaagt gatgatcacc
atcaacatca aaaagagatt actcaatggt 960ttgggagtgg taattttcaa acagaagcta
tcaatgatgt gttcttaaac aacaacaact 1020tagcgaattt tgagaccaac gatgagaacg
caaaactata tgcaaactca tcagtaaccg 1080gagctggagc agcgtttgcc ggaggaacga
cgagtacatc ggctgatcaa agcacaataa 1140gttgggagga cataacctct cttcttaaat
ccgatgatgc aagttacttc aatgggccaa 1200atcatgtgta attttttctg caaaatttta
tttttactta tatatataaa gaggtttttg 1260tataagtata tatgacataa gagagtggta
aaaaaaatac atggattgag attaatattc 1320tcatatttgt gtattttagc ttcaacttag
ctttattcac gaagagcaat atatatgatc 1380ttttgagttt ttctttttga ttaatgcttt
tataaatttg aaagagaaaa aaaaaaaaaa 1440aaaaaaaaaa aaaa
145459753DNABrassica napus 59atgggaaggg
caccgtgttg tgacaaagcc aacgtgaaga aagggccttg gtctcctgag 60gaagatgcca
aactcaaaga ttacatcgag aatagtggta caggaggcaa ctggatcgct 120ttgcctcaga
agattggtct aaggagatgt gggaagagtt gcagactaag gtggctcaac 180tatttgagac
caaacatcaa acatggtggc ttctctgagg aggaagacac catcatttgt 240aacctttatg
ttactattgg tagcaggtgg tctataattg ctgcacaact gccgggaaga 300acggacaacg
atatcaagaa ctattggaac acgaggctaa agaagaagct tctaaacaaa 360caaaggacag
agttccaaga agctcggatg aagcaagaga tggtgatgat gaagagacaa 420caacaaggac
atgaccacat caatggtagt acggatcttt atctgaaaaa catgtttgga 480tcatcaccat
ggccattact acaacagctt cctcatcatc aagtacctct tgtgatgatg 540gaaccaacaa
gttgtaacta ctaccaaacg tcaccctctt gtaacctaga acaaaagcca 600cttatcactt
tcaataacat ggtcaagatt gaagaagaac cggagaaaac aaaccctgat 660catcctcagc
atcaaaattc tatcacaaac ccttttgatg tctccgtctc ccagctcttg 720ttagatcctg
attactactt aggatcagga gga
75360251PRTBrassica napus 60Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn
Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Asn Ser20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Arg35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp
Thr Ile Ile Cys65 70 75
80Asn Leu Tyr Val Thr Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Asn Lys Gln Arg Thr Glu Phe Gln Glu Ala115
120 125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln
Gln Gln Gly His130 135 140Asp His Ile Asn
Gly Ser Thr Asp Leu Tyr Leu Lys Asn Met Phe Gly145 150
155 160Ser Ser Pro Trp Pro Leu Leu Gln Gln
Leu Pro His His Gln Val Pro165 170 175Leu
Val Met Met Glu Pro Thr Ser Cys Asn Tyr Tyr Gln Thr Ser Pro180
185 190Ser Cys Asn Leu Glu Gln Lys Pro Leu Ile Thr
Phe Asn Asn Met Val195 200 205Lys Ile Glu
Glu Glu Pro Glu Lys Thr Asn Pro Asp His Pro Gln His210
215 220Gln Asn Ser Ile Thr Asn Pro Phe Asp Val Ser Val
Ser Gln Leu Leu225 230 235
240Leu Asp Pro Asp Tyr Tyr Leu Gly Ser Gly Gly245
25061864DNABrassica napus 61atgggaaggg ctccatgttg tgacaaagca aacgtcaaga
gaggtccatg gtctcctgaa 60gaagatgcaa agcttaaaga ttacatagag aaacaaggca
ctggtggcaa ctggattgct 120ctccctcaca aagctggttt aagaagatgt gggaagagtt
gtagactgag gtggttaaac 180tatttgaggc caaacataag acatggagat ttctctgagg
aagaagacaa gattatcttc 240agcctctttt cctccattgg aagcaggtgg tcagtaattg
cagctcacct gcatggtaga 300actgataacg acatcaagaa ctattggagc actaagctca
agaagaagct cattgccact 360atggctcctc caccacctca tcatctctta gccattgcct
catcatcatc atcaccatca 420tcatcacact acaacatgac caatagtctt cctccgtata
acccatcaat atctacaaat 480gagctgttaa cacctcatca ggagatgatg atgacaatga
tggaccaaca acaacaacaa 540ctattatacc aagaagccgt ggacagtttg gtaaattctc
caaatagcaa caagcttata 600atgagccatc aagaagacag ccgggagcaa agtacaaaca
aaggaataat gttgttgagt 660gatgtaagaa gtgggtcaag tacaacaagt acagtaacaa
gagtgaagat ggaacaacat 720gatcatcatc atgaagagag atcaatggaa gattatggaa
tggaggagat caatcactta 780ataaatagta gttgtacgag tagtagtagc aacagcttgt
ggtttgatga aaacaagacc 840gatgatacgt tcatgttgta ctat
86462288PRTBrassica napus 62Met Gly Arg Ala Pro
Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp
Tyr Ile Glu Lys Gln20 25 30Gly Thr Gly
Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Arg35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr
Leu Arg Pro50 55 60Asn Ile Arg His Gly
Asp Phe Ser Glu Glu Glu Asp Lys Ile Ile Phe65 70
75 80Ser Leu Phe Ser Ser Ile Gly Ser Arg Trp
Ser Val Ile Ala Ala His85 90 95Leu His
Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Ser Thr Lys100
105 110Leu Lys Lys Lys Leu Ile Ala Thr Met Ala Pro Pro
Pro Pro His His115 120 125Leu Leu Ala Ile
Ala Ser Ser Ser Ser Ser Pro Ser Ser Ser His Tyr130 135
140Asn Met Thr Asn Ser Leu Pro Pro Tyr Asn Pro Ser Ile Ser
Thr Asn145 150 155 160Glu
Leu Leu Thr Pro His Gln Glu Met Met Met Thr Met Met Asp Gln165
170 175Gln Gln Gln Gln Leu Leu Tyr Gln Glu Ala Val
Asp Ser Leu Val Asn180 185 190Ser Pro Asn
Ser Asn Lys Leu Ile Met Ser His Gln Glu Asp Ser Arg195
200 205Glu Gln Ser Thr Asn Lys Gly Ile Met Leu Leu Ser
Asp Val Arg Ser210 215 220Gly Ser Ser Thr
Thr Ser Thr Val Thr Arg Val Lys Met Glu Gln His225 230
235 240Asp His His His Glu Glu Arg Ser Met
Glu Asp Tyr Gly Met Glu Glu245 250 255Ile
Asn His Leu Ile Asn Ser Ser Cys Thr Ser Ser Ser Ser Asn Ser260
265 270Leu Trp Phe Asp Glu Asn Lys Thr Asp Asp Thr
Phe Met Leu Tyr Tyr275 280
28563792DNABrassica rapamisc_feature(760)..(760)n can be any nucleotide
63atgggaagag caccgtgttg tgacaaggct aacgtgaaga aagggccttg gtctcctgaa
60gaagatgcaa agctcaaaga ttacatcgag agtagtggca caggaggcaa ctggatcgct
120ttgcctcaga agattggtct aaggagatgt gggaagagtt gcagactaag gtggcttaac
180tatttgagac ctaacatcaa acatggtggc ttctctgagg aagaggacaa catcatttgt
240aacctctatg ttactattgg tagcaggtgg tctataattg ctgcacaatt gcctggaaga
300acagacaacg atatcaagaa ctattggaac acgaggctga agaagaagct tcttaacaaa
360caaagaaaag agttccaaga agctcggatg aagcaagaga tggtgatgat gaaacgacag
420caacaaggac aaggacaatg ccaaagtaat ggtagtacgg atctttatct gaacaacatg
480tttagatcat caccatggcc attactgcct catcttcctc ctcctcatag tcaagtacct
540cttgtgatga tggaaccaac aagctgcaat tactaccaac cgacaccgtc ttgcgcatta
600gaacaaaagc cattgatccc actcaagaac atggtcaaga ttgaagcaga accggagaga
660tcaaaccctg atcatcatta tccggaagac tcaatgacaa actcttttga tctttccttc
720tctcagcttt tgttagatcc taattactac ctggaatcan gaggaggaga aaggagagtt
780tgctctcatg ag
79264264PRTBrassica rapaMISC_FEATURE(254)..(254)X can be any amino acid
64Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Asp Tyr Ile Glu Ser Ser20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Asn Leu Tyr Val Thr Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Asn Lys
Gln Arg Lys Glu Phe Gln Glu Ala115 120
125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln Gln Gln Gly Gln130
135 140Gly Gln Cys Gln Ser Asn Gly Ser Thr
Asp Leu Tyr Leu Asn Asn Met145 150 155
160Phe Arg Ser Ser Pro Trp Pro Leu Leu Pro His Leu Pro Pro
Pro His165 170 175Ser Gln Val Pro Leu Val
Met Met Glu Pro Thr Ser Cys Asn Tyr Tyr180 185
190Gln Pro Thr Pro Ser Cys Ala Leu Glu Gln Lys Pro Leu Ile Pro
Leu195 200 205Lys Asn Met Val Lys Ile Glu
Ala Glu Pro Glu Arg Ser Asn Pro Asp210 215
220His His Tyr Pro Glu Asp Ser Met Thr Asn Ser Phe Asp Leu Ser Phe225
230 235 240Ser Gln Leu Leu
Leu Asp Pro Asn Tyr Tyr Leu Glu Ser Xaa Gly Gly245 250
255Glu Arg Arg Val Cys Ser His Glu26065558DNABrassica rapa
65atgggaaggg caccgtgttg tgacaaagcc aacgtgaaga aagggccttg gtctcctgag
60gaagatgcca aactcaaaga ttacatcgag aatagtggta caggaggcaa ctggatcgct
120ttgcctcaaa agattggtct aaggagatgt gggaagagtt gcagactaag gtggctcaac
180tatttgagac caaacatcaa acatggtggc ttctctgagg aagaagacaa catcatttgt
240aacctctatg ttactattgg tagcaggtgg tctataattg ctgcacaact gtcgggaaga
300acggacaacg atatcaagaa ctattggaac acgaggctaa agaagaagct tctaaacaaa
360caaaggaaag agtttcaaga agctcggatg aagcaagaga tggtgatgat gaagagacaa
420caagaaggac atgaccacat caatggtagt acggatcttt atctgaaaaa tatgtttgga
480tcatcaccat ggccattact acaacagctt cctctgttgg gattgcgaaa tcccgtgtcc
540aactctatct tatcttat
55866186PRTBrassica rapa 66Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn
Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Asn Ser20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Arg35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp
Asn Ile Ile Cys65 70 75
80Asn Leu Tyr Val Thr Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Ser Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Asn Lys Gln Arg Lys Glu Phe Gln Glu Ala115
120 125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln
Gln Glu Gly His130 135 140Asp His Ile Asn
Gly Ser Thr Asp Leu Tyr Leu Lys Asn Met Phe Gly145 150
155 160Ser Ser Pro Trp Pro Leu Leu Gln Gln
Leu Pro Leu Leu Gly Leu Arg165 170 175Asn
Pro Val Ser Asn Ser Ile Leu Ser Tyr180
18567501DNACarthamus tinctorius 67atgataaggg caccctgctg cgacaaaacc
aatgtgaaaa agggaccatg gtcttctgaa 60gaagatgcta agctcaaaga ttacatcgga
aagtatggta ccggcggtaa ctggatcgct 120cttccacaaa aaatcgagct caagatctgt
gggacaaggt gcagattgag atggttgaat 180tacctgagac ccaacatcaa gcatggggga
ttctctgaag aagaagacaa catcatctgc 240agcctctata ttagcatagg cagcaggtgg
tccataattg cggctcaatt gccaggcaga 300accgataacg acgtcaagaa ctattgcaac
accgagctga agaagatatt actcggaggg 360cgcaaacaat cccaggccaa taataagctc
tcgggtgatc cgaaagacag caggcggata 420gaaacgctaa gcaacttcgg gatcaaaagg
ctccaactgc acaggcgact cggaagcctt 480gaagacccta atttctgtca c
50168167PRTCarthamus tinctorius 68Met
Ile Arg Ala Pro Cys Cys Asp Lys Thr Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Ser Glu Glu Asp Ala
Lys Leu Lys Asp Tyr Ile Gly Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Glu Leu Lys35
40 45Ile Cys Gly Thr Arg Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Cys Asn Thr Glu100
105 110Leu Lys Lys Ile Leu Leu Gly Gly Arg
Lys Gln Ser Gln Ala Asn Asn115 120 125Lys
Leu Ser Gly Asp Pro Lys Asp Ser Arg Arg Ile Glu Thr Leu Ser130
135 140Asn Phe Gly Ile Lys Arg Leu Gln Leu His Arg
Arg Leu Gly Ser Leu145 150 155
160Glu Asp Pro Asn Phe Cys His16569749DNACarthamus tinctorius
69atggggagag ccccttgttg tgataaagca aatgtgaaga aaggtccatg gtctcaagaa
60gaagatacaa aactcaaaga gtttattgaa aaatttggta ctggtggtaa ctggattgtt
120ctccctcaaa aagctggtct gaaaagatgt gggaagagtt gcaggttgag atggcttaac
180tatctaagac ccaatatcag gcatggtgaa ttctctgatg atgaagacag gatcatctgc
240agcttgtatg ctacatttgg tagcagatgg tcagtaatag cagctcagct accaggaagg
300actgacaatg atatcaagaa ctattggaac accaaactca agaagaagct cttcactatg
360cttccttccc ttcaagaaaa acaaactttc tttccatctt tgcccttcca accaccacca
420ccatatgatc accaccacca tcatcatcac ccatcagacc cattcttcac cacttcatca
480tcatcatcat catcattcta caccagttac aaccctaact ttatgaatct taacaccact
540tccaattctc tcattgttca agatcatgat gttaatgttg ttcaacacaa tctctcacca
600ccaccgccac caccaccacc tgtgaaccat ttgatcaatt tgctgagcaa caacaccatt
660gatgttaatg tgattgatga tcataactcc tacagttatc atctagggtt tcaagatgac
720caccatgatc agagcatgtg caacagttc
74970250PRTCarthamus tinctorius 70Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Gln Glu Glu Asp Thr Lys Leu Lys Glu Phe Ile Glu Lys Phe20
25 30Gly Thr Gly Gly Asn Trp Ile Val Leu
Pro Gln Lys Ala Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Arg His Gly Glu Phe Ser Asp Asp Glu
Asp Arg Ile Ile Cys65 70 75
80Ser Leu Tyr Ala Thr Phe Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Phe Thr Met Leu Pro Ser Leu Gln Glu Lys Gln115
120 125Thr Phe Phe Pro Ser Leu Pro Phe Gln Pro Pro
Pro Pro Tyr Asp His130 135 140His His His
His His His Pro Ser Asp Pro Phe Phe Thr Thr Ser Ser145
150 155 160Ser Ser Ser Ser Ser Phe Tyr
Thr Ser Tyr Asn Pro Asn Phe Met Asn165 170
175Leu Asn Thr Thr Ser Asn Ser Leu Ile Val Gln Asp His Asp Val Asn180
185 190Val Val Gln His Asn Leu Ser Pro Pro
Pro Pro Pro Pro Pro Pro Val195 200 205Asn
His Leu Ile Asn Leu Leu Ser Asn Asn Thr Ile Asp Val Asn Val210
215 220Ile Asp Asp His Asn Ser Tyr Ser Tyr His Leu
Gly Phe Gln Asp Asp225 230 235
240His His Asp Gln Ser Met Cys Asn Ser Ser245
25071648DNACentaurea maculosa 71atgggaaggg caccttgttg tgacaaagca
aacgttaaga aagggccatg gtctcccgaa 60gaagatgcta aactcaaaga ttatatcgaa
aaatatggta ccgggggtaa ctggatcgct 120cttccgcaga agattggcat aaagagatgt
ggaaaaagtt gtagactaag gtggttgaat 180tacctaagac ctaatatcaa gcatggagga
ttctctgaag aagaagacaa catcatctgc 240agcctctata ttagtatcgg cagcaggtgg
tccataattg ctgcccaact gcctggaaga 300actgataatg atatcaagaa ctattggaac
actaggctaa agaagaaatt gcttggaagg 360cgaaaacagt ctcatgctag taggctcggc
accggttcaa accaggatcc caaagatggg 420aacgggctag aaacgttaag caattcggcc
atcgaaaggc ttcaacttca catgcaactt 480caaagccttg aaaaccctaa tatcgctaat
catggtgtta atctcgcgac gtggccttgt 540aagctgaacc cgattcaaga gaagatgatg
cagagtcttc aacttctaaa tgagtcgcca 600aaccctctaa tgatgcaacc tcattcccct
caaaaggttg aactttat 64872216PRTCentaurea maculosa 72Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Asp Tyr Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Ile Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Arg Arg
Lys Gln Ser His Ala Ser Arg115 120 125Leu
Gly Thr Gly Ser Asn Gln Asp Pro Lys Asp Gly Asn Gly Leu Glu130
135 140Thr Leu Ser Asn Ser Ala Ile Glu Arg Leu Gln
Leu His Met Gln Leu145 150 155
160Gln Ser Leu Glu Asn Pro Asn Ile Ala Asn His Gly Val Asn Leu
Ala165 170 175Thr Trp Pro Cys Lys Leu Asn
Pro Ile Gln Glu Lys Met Met Gln Ser180 185
190Leu Gln Leu Leu Asn Glu Ser Pro Asn Pro Leu Met Met Gln Pro His195
200 205Ser Pro Gln Lys Val Glu Leu Tyr210
21573693DNACentaurea maculosa 73atgggaaggg caccttgttg
tgacaaagca aacgttaaga aagggccatg gtctcccgaa 60gaagatgcta aactcaaaga
ttatatcgaa aaatatggta ccgggggtaa ctggatcgct 120cttccgcaga agattggcat
aaagagatgt ggaaaaagtt gtagactaag gtggttgaat 180tacctaagac ctaatatcaa
gcatggagga ttctctgaag aagaagacaa catcatctgc 240agcctctata ttagtatcgg
cagcaggtgg tccataattg ctgcccaact gcctggaaga 300actgataatg atatcaagaa
ctattggaac actaggctaa agaggaaatt gcttggaagg 360cgaaaacagt ctcatgctag
taggctcggc accggttcaa accaggatcc caaagatggg 420aacgggctag aaacgttaag
caattcggcc atcgaaaggc ttcaacttca catgcaactt 480caaagccttg aaaaccctaa
tatcgctaat catggtgtta atctcgcgac gtggccttgt 540aagctgaacc cgattcaaga
gaagatgatg cagagtcttc aacttctaaa tgagtcgcca 600aaccctctaa tgatgcaacc
tcattcccct caaaaggttg aactttatgc acaatcaaat 660aactttcttc aacaagtgta
ctctccatcg agc 69374231PRTCentaurea
maculosa 74Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile
Gly Ile Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile
Cys65 70 75 80Ser Leu
Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Arg Lys
Leu Leu Gly Arg Arg Lys Gln Ser His Ala Ser Arg115 120
125Leu Gly Thr Gly Ser Asn Gln Asp Pro Lys Asp Gly Asn Gly
Leu Glu130 135 140Thr Leu Ser Asn Ser Ala
Ile Glu Arg Leu Gln Leu His Met Gln Leu145 150
155 160Gln Ser Leu Glu Asn Pro Asn Ile Ala Asn His
Gly Val Asn Leu Ala165 170 175Thr Trp Pro
Cys Lys Leu Asn Pro Ile Gln Glu Lys Met Met Gln Ser180
185 190Leu Gln Leu Leu Asn Glu Ser Pro Asn Pro Leu Met
Met Gln Pro His195 200 205Ser Pro Gln Lys
Val Glu Leu Tyr Ala Gln Ser Asn Asn Phe Leu Gln210 215
220Gln Val Tyr Ser Pro Ser Ser225
23075651DNACentaurea maculosa 75atgggaagag ctccatgttg tgataaggca
aatgtgaaga aagggccatg gtcacctgaa 60gaagatagaa agctaaaaga gtacatagaa
actcatggta ctggtggcaa ctggattgct 120ctcccacaaa aagcaggcct tagaagatgt
ggaaaaagct gcagattaag atggttgaac 180tatctaagac caaacatcaa acatggtgaa
ttttctgatg atgaagataa agttatctgt 240gcactctttg ctagcattgg cagcaggtgg
tcaataatgg cagcacagtt accaggaagg 300acagacaatg atataaagaa ctactggaac
acaaagctga agaagaagat gatgagcagc 360ttaatctcca ttcctgaaat cagaaaacca
tttcatcatc aacatcttga atctttcaaa 420tcttcatcca actacatgag ctacccatca
tcactctcta tttttcaaaa cagtaataat 480atcagtgctc cattatcttc atcatcacca
ccttcatcat atctctataa tagtacaaca 540acaactactt ctactcatca tcatcatgat
catcatcatc aaaccctatc tatcccatca 600agccctactc ctgctcatgg atataatctg
ggtttgggtc ccatggtggt a 65176217PRTCentaurea maculosa 76Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Arg
Lys Leu Lys Glu Tyr Ile Glu Thr His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Glu Phe Ser Asp Asp Glu Asp Lys Val Ile Cys65
70 75 80Ala Leu Phe Ala Ser Ile Gly
Ser Arg Trp Ser Ile Met Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Met Met Ser Ser Leu
Ile Ser Ile Pro Glu Ile Arg115 120 125Lys
Pro Phe His His Gln His Leu Glu Ser Phe Lys Ser Ser Ser Asn130
135 140Tyr Met Ser Tyr Pro Ser Ser Leu Ser Ile Phe
Gln Asn Ser Asn Asn145 150 155
160Ile Ser Ala Pro Leu Ser Ser Ser Ser Pro Pro Ser Ser Tyr Leu
Tyr165 170 175Asn Ser Thr Thr Thr Thr Thr
Ser Thr His His His His Asp His His180 185
190His Gln Thr Leu Ser Ile Pro Ser Ser Pro Thr Pro Ala His Gly Tyr195
200 205Asn Leu Gly Leu Gly Pro Met Val
Val210 21577732DNACentaurea solstitialis 77atggggagag
caccttgctg tgataaagcc aatgtgaaga aaggtccatg ggctcctgaa 60gaagatgcta
ctcttaaagc ttatattgaa gaacatggta ctggtggtaa ctggattgct 120ttgcctcaga
aaatagggct taaaagatgc gggaagagtt gtcgcctgcg gtggctaaat 180tatctccgtc
cgaatatcaa gcatggaggt ttttcggaag aagaagatcg cataatttgc 240agcctctatg
ttagcatagg gagcaggtgg tctataatcg cagcacaatt acctggacga 300actgataacg
atataaagaa ctactggaat acaaggctga agaagaaact cttgggtaag 360caacgaaaag
aacaaatttc tcgtagaaag ggggagatgc taatgaagaa agggagatcg 420ccatcggaaa
ttccaccttc cgtaatcgtt agtggtaacg ataccaataa caattgcccg 480gatccttatt
ggccagagtt gccggttttg ccacctgcgc cctattcgat ccaagaacca 540tgctttgcca
acggtcatgc ctctatacga aaactactca tgaagcttgg aggaaggttt 600tcttgtgatg
acaatggaaa ccagtccatc aacatggtct cacactttcc cattgatcat 660cttccaacat
cattaatgca tgatgatcac aactttattg ctagtagtac tactccggct 720tcttcttcag
ct
73278244PRTCentaurea solstitialis 78Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Asn Val Lys Lys Gly Pro1 5 10
15Trp Ala Pro Glu Glu Asp Ala Thr Leu Lys Ala Tyr Ile Glu Glu
His20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu
Glu Asp Arg Ile Ile Cys65 70 75
80Ser Leu Tyr Val Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Glu Gln Ile Ser Arg115
120 125Arg Lys Gly Glu Met Leu Met Lys Lys
Gly Arg Ser Pro Ser Glu Ile130 135 140Pro
Pro Ser Val Ile Val Ser Gly Asn Asp Thr Asn Asn Asn Cys Pro145
150 155 160Asp Pro Tyr Trp Pro Glu
Leu Pro Val Leu Pro Pro Ala Pro Tyr Ser165 170
175Ile Gln Glu Pro Cys Phe Ala Asn Gly His Ala Ser Ile Arg Lys
Leu180 185 190Leu Met Lys Leu Gly Gly Arg
Phe Ser Cys Asp Asp Asn Gly Asn Gln195 200
205Ser Ile Asn Met Val Ser His Phe Pro Ile Asp His Leu Pro Thr Ser210
215 220Leu Met His Asp Asp His Asn Phe Ile
Ala Ser Ser Thr Thr Pro Ala225 230 235
240Ser Ser Ser Ala79675DNACentaurea solstitialis
79atggggagag caccttgctg tgataaagcc aatgtgaaga aaggtccatg ggctcctgaa
60gaagatgcta ctcttaaagc ttatattgaa gaacatggta ctggtggtaa ctggattgct
120ttgcctcaga aaatagggct taaaagatgc gggaagagtt gtcgcctgcg gtggctaaat
180tatctccgtc cgaatatcaa gcatggaggt ttttcggaag aagaagatcg cataatttgc
240agcctctatg ttagcatagg gagcaggtgg tctataatcg caccacaatt acctggacga
300actgataacg atataaagaa ctactggaat acaaggctga agaagaaact cttgggtaag
360caacgaaaag aacaaatttc tcgtaaaaag ggggagatgc taatgaagaa agggagatcg
420ccatcggaaa ttccatcttc cgtaatcgtt agtggtaacg ataccaataa caattgcccg
480gatccttatt ggccagagtt gccggttttg ccacctgcgc cctattcgat ccaagaacca
540tgctttgcca acggtcatgc ctctatacga aaactactca tgaagcttgc aggaaggttt
600tcttgtgatg acaatggaaa ccagtccatc aacatggtct cacacttacc cattgatcat
660cttacgacat catta
67580225PRTCentaurea solstitialis 80Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Asn Val Lys Lys Gly Pro1 5 10
15Trp Ala Pro Glu Glu Asp Ala Thr Leu Lys Ala Tyr Ile Glu Glu
His20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu
Glu Asp Arg Ile Ile Cys65 70 75
80Ser Leu Tyr Val Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Pro
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Glu Gln Ile Ser Arg115
120 125Lys Lys Gly Glu Met Leu Met Lys Lys
Gly Arg Ser Pro Ser Glu Ile130 135 140Pro
Ser Ser Val Ile Val Ser Gly Asn Asp Thr Asn Asn Asn Cys Pro145
150 155 160Asp Pro Tyr Trp Pro Glu
Leu Pro Val Leu Pro Pro Ala Pro Tyr Ser165 170
175Ile Gln Glu Pro Cys Phe Ala Asn Gly His Ala Ser Ile Arg Lys
Leu180 185 190Leu Met Lys Leu Ala Gly Arg
Phe Ser Cys Asp Asp Asn Gly Asn Gln195 200
205Ser Ile Asn Met Val Ser His Leu Pro Ile Asp His Leu Thr Thr Ser210
215 220Leu22581783DNACentaurea solstitialis
81atgggaagag caccttgctg tgacaaagcc aacgtcaaaa gaggaccatg gtcacctgaa
60gaagatgcca aactcaaatc gtatatcgaa gaacatggaa ccggaggcaa ctggatcgct
120ctacctcata agattggtct taagaggtgt ggcaagagtt gtcgccttcg atggttgaat
180tatctacgcc caaatatcaa gcatggatct ttctccgaag aagaagatca catcatttgc
240accctttatc tcagtattgg cagccgatgg tcaattatag ctgcacaact acctgggcga
300accgataatg acatcaagaa ctactggaac acgaggctaa agaaaaagct tttgggaaaa
360cagcgtaaag atcaacaatc ttcgaagaaa agtggtttga tcaagcaaga aatgaagaga
420gaagttcttg aagatttcaa ggcaccatct ttatgcatga gcatgaacct ctatcaatct
480tactggccta cagatttccc taatttgatc cttacaaacc ctgatcatga tcatcatcaa
540gagcaaccac ttcatgccaa aaaccaagat cccatcacaa gttttgatga tcaataccct
600tttgatacct ccccaaatca agcccaattg ttcgatcaaa accacgagaa acctatttca
660accattaatt atcatcccca attgcaaaac acaccatata atcttgtgat caatggagat
720caaggtgtga atctttttca agaattcaac aactacccat ttgggatcaa tgaattggtc
780tac
78382261PRTCentaurea solstitialis 82Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Asn Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ser Tyr Ile Glu Glu
His20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro His Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Ser Phe Ser Glu Glu
Glu Asp His Ile Ile Cys65 70 75
80Thr Leu Tyr Leu Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Asp Gln Gln Ser Ser115
120 125Lys Lys Ser Gly Leu Ile Lys Gln Glu
Met Lys Arg Glu Val Leu Glu130 135 140Asp
Phe Lys Ala Pro Ser Leu Cys Met Ser Met Asn Leu Tyr Gln Ser145
150 155 160Tyr Trp Pro Thr Asp Phe
Pro Asn Leu Ile Leu Thr Asn Pro Asp His165 170
175Asp His His Gln Glu Gln Pro Leu His Ala Lys Asn Gln Asp Pro
Ile180 185 190Thr Ser Phe Asp Asp Gln Tyr
Pro Phe Asp Thr Ser Pro Asn Gln Ala195 200
205Gln Leu Phe Asp Gln Asn His Glu Lys Pro Ile Ser Thr Ile Asn Tyr210
215 220His Pro Gln Leu Gln Asn Thr Pro Tyr
Asn Leu Val Ile Asn Gly Asp225 230 235
240Gln Gly Val Asn Leu Phe Gln Glu Phe Asn Asn Tyr Pro Phe
Gly Ile245 250 255Asn Glu Leu Val
Tyr26083768DNACichorium endivia 83atgggaagag caccttgctg tgacaaagcc
aacgtcaaaa gaggaccttg gtcacctgaa 60gaagatgcca aactcaagtc ttacattgaa
gaacatggaa cgggaggcaa ctggatcgct 120ctacctcata aaatcggact caagaggtgt
ggaaagagtt gtcgccttcg atggttaaat 180tatcttcgcc caaacatcaa gcacggatct
ttttccgaag aagaagatca cataatttgt 240accctatatc ttagtattgg cagccggtgg
tcaattatcg ctgcacaatt acccgggaga 300actgataatg atatcaagaa ctactggaac
acaaggctaa agaaaaagct tttgggaaag 360cagcgtaaag atcaacaatt gtcaaagaga
agtgcccaaa taaagcatga aatgaaggag 420agagtggttc ttgaggattt aaaggcacca
tctttatgca tgagcatgaa tctttatcaa 480tcatactggc ctgctgaatt cccttcgact
ctcacggacc atgatcagca tcatcaagaa 540catcatgtca aaacccaaga acccgttttc
aatgaatacc cttttgacat aatttcaaac 600caagcacaat tgttctacca gaacagcgag
aaacctatct cgactactgt ttaccatccc 660cagttggcaa acatacccta ttttggaatt
ggggatggca taaacttgtt tcaagaattc 720aacaattacc catttgaaat caatgaattg
gtttacaaca ccacacaa 76884256PRTCichorium endivia 84Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Ser Tyr Ile Glu Glu His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Ser Phe Ser Glu Glu Glu Asp His Ile Ile Cys65
70 75 80Thr Leu Tyr Leu Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln
Arg Lys Asp Gln Gln Leu Ser115 120 125Lys
Arg Ser Ala Gln Ile Lys His Glu Met Lys Glu Arg Val Val Leu130
135 140Glu Asp Leu Lys Ala Pro Ser Leu Cys Met Ser
Met Asn Leu Tyr Gln145 150 155
160Ser Tyr Trp Pro Ala Glu Phe Pro Ser Thr Leu Thr Asp His Asp
Gln165 170 175His His Gln Glu His His Val
Lys Thr Gln Glu Pro Val Phe Asn Glu180 185
190Tyr Pro Phe Asp Ile Ile Ser Asn Gln Ala Gln Leu Phe Tyr Gln Asn195
200 205Ser Glu Lys Pro Ile Ser Thr Thr Val
Tyr His Pro Gln Leu Ala Asn210 215 220Ile
Pro Tyr Phe Gly Ile Gly Asp Gly Ile Asn Leu Phe Gln Glu Phe225
230 235 240Asn Asn Tyr Pro Phe Glu
Ile Asn Glu Leu Val Tyr Asn Thr Thr Gln245 250
25585639DNACichorium intybus 85atgggaaggg ctccatgttg tgacaaagaa
aatgtaaaga gagggccatg gtctcctgaa 60gaagatgcaa aactcaaaag cttcattgac
aaaaacggca ctggtggtaa ctggattgct 120ctccctcaca aagctggtct aaaaagatgt
ggcaagagct gcagattgag atggctaaac 180tatctgaggc ccaatattaa gcatggtgaa
ttcactgatg atgaagacaa gatcatctgc 240agcttgtatg ctagcattgg tagcagatgg
tcaataatag cagctcagct accaggaagg 300actgataatg atatcaagaa ttactggaac
accaagctca agaagaagct cttggctatg 360cttccttcct ttcaaaagaa gtcatcttta
tttccatcta catccctcca atcaccatca 420ccatacagat caaatcagtt catgaccaat
aattcgtcac ctttttacgg ttataccact 480acccctaact tcatgaacat gaacaacaat
atgatcagtt ctctgccgag acctaccagc 540accaccacct gtgttcaaga tcatatcact
catctctcac caccaccacc gctagatcca 600aactctttga tcagtttaat gaccagcaac
gttgacaac 63986213PRTCichorium intybus 86Met
Gly Arg Ala Pro Cys Cys Asp Lys Glu Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Ser Phe Ile Asp Lys Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Glu Phe Thr Asp Asp Glu Asp Lys Ile Ile Cys65
70 75 80Ser Leu Tyr Ala Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Ala Met Leu
Pro Ser Phe Gln Lys Lys Ser115 120 125Ser
Leu Phe Pro Ser Thr Ser Leu Gln Ser Pro Ser Pro Tyr Arg Ser130
135 140Asn Gln Phe Met Thr Asn Asn Ser Ser Pro Phe
Tyr Gly Tyr Thr Thr145 150 155
160Thr Pro Asn Phe Met Asn Met Asn Asn Asn Met Ile Ser Ser Leu
Pro165 170 175Arg Pro Thr Ser Thr Thr Thr
Cys Val Gln Asp His Ile Thr His Leu180 185
190Ser Pro Pro Pro Pro Leu Asp Pro Asn Ser Leu Ile Ser Leu Met Thr195
200 205Ser Asn Val Asp
Asn21087832DNACichorium intybus 87tgggaagagc tccatgttgt gacaagtcaa
aggtgaagaa agggccatgg tcgcctgagg 60aagatacgaa gttgaaagac tatatacaca
aaaatggtac tggaggcaac tggattgctc 120ttccacataa agcaggcctt aaaagatgtg
gaaaaagctg cagattgcga tggttgaact 180accttagacc agacatcaaa cacggtgaat
tctccgatca tgaagataga ctcatctgta 240ctctcttttc tagcatcggt agcaggtggt
cagttatagc agcacagttg cctggaagaa 300cggataatga tatcaagaac tactggaaca
cgaagctcaa aaagaagctt atgaatttca 360tctccaccaa tcgtcaaatt gggaaaccgc
ttcgtcatct tgatcaatct atcaattgta 420cttcttcaaa ttggagctac ccaacaagct
cttctgctat aactgttgat aatcatccag 480ttctaccatc agatgatcat ggatacatta
atgtagaccc actggagacc tatcaagtaa 540aagacgattc tacccttatg tttgaaggcg
gtgatcaagc tgctggttgt acttctactt 600cggattggcg gtaccatcat gtgtatggtg
gtggtgtgtt tgatgctcat aataacatgg 660gagtcttgaa gaccggtgat tcttacaaga
gagatgatgg gaattgtgaa aaggcaagtg 720ggtattatgg ggattctaca ttggagtttg
gtcttgagga gttcaagaag ctcattagca 780ctaatctgtg cagcggtaac actaacaata
atctcaatgt ctttgttgat ga 83288435DNACitrus sinensis
88atgggaagag ctccatgttg tgacaaagct aacgttaaaa gaggaccttg gtctgctgaa
60gaagactcga ttctcaaaaa ttaccttgag caatttggaa atggcggcaa ctggattgct
120ttacccaaga aagcaggcct taatagatgt ggcaagagtt gtcggctcag atggctaaat
180tatctaaggc cagatattaa gcatggagga tttactaagg aagaagatac catcatatgc
240aatctttatt gtaccatggg aagtaggtgg tctgttatag cttctcagct gccaggaaga
300actgacaatg atgtaaagaa ttattggaac accaagttga agaaaaatgt tttggcagga
360aagctgtcgg acaacactca agtttcagtt tcaacaattc ctgaagaatt tggaaattca
420tcttactact tgagt
43589145PRTCitrus sinensis 89Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn
Val Lys Arg Gly Pro1 5 10
15Trp Ser Ala Glu Glu Asp Ser Ile Leu Lys Asn Tyr Leu Glu Gln Phe20
25 30Gly Asn Gly Gly Asn Trp Ile Ala Leu Pro
Lys Lys Ala Gly Leu Asn35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asp Ile Lys His Gly Gly Phe Thr Lys Glu Glu Asp
Thr Ile Ile Cys65 70 75
80Asn Leu Tyr Cys Thr Met Gly Ser Arg Trp Ser Val Ile Ala Ser Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Asn Val Leu Ala Gly Lys Leu Ser Asp Asn Thr Gln Val115
120 125Ser Val Ser Thr Ile Pro Glu Glu Phe Gly Asn Ser
Ser Tyr Tyr Leu130 135
140Ser14590685DNACoffea canephora 90gcacgcaggg aaattaatcc acacacacac
acacactcaa gcatttcaaa agcactcgaa 60agaagcctgg atagagagaa gcagaaacag
gcacagagtg taacaaccag agagagatgg 120gaagagctcc ttgctgtgac aaggcgaatg
tgaagaaagg accttggtcc cctgaagaag 180atgcaaagct gaaggagtac atagaaaagt
cagggactgg agggaattgg atagctcttc 240cacacaaggc tggtaagccc acttatgaat
ttccgtgtaa ttacaacact agtatttgtt 300atggtttgcc agcagttggt tggcataatt
atgatcaggg cttagaagat gcggaaagag 360ttgcagattg agatggctta actacctcag
gcccaacatt aaacatggag acttcacaga 420tgatgaggat aggataattt gcagcctctt
tgcgagcatt ggtagcaggt ggtcaataat 480agcagctcaa ttaccaggca ggacggacaa
tgacatcaag aactactgga acaccaagct 540gaaaaagaaa ataatgggat ctcacaaaaa
atgccatcaa cccagcccct acgcatcacc 600atattcgggg tttgagcctg catctttaac
atcatcacca ttttcagctc catcaccatc 660aatatcatcg gcgtatcaca attat
68591731DNACoffea canephora
91acacacactc aagcatttca aaagcactcg aaccctgcct ggatagagag aagcagaaac
60aggcacagag tgtaacaacc agagagagat gggaagagct ccttgctgtg acaaggcgaa
120tgtgaagaaa ggaccttggt cccctgaaga agatgcaaag ctgaaggagt acatagaaaa
180gtcagggact ggagggaatt ggatagctct tccacacaag gctggtaagc ccacttatga
240atttccgtgt aattacaaca ctagtatttg ttatggtttg ccagcagttg gttggcataa
300ttatgatcag ggcttagaag atgcggaaag agttgcagat tgagatggct taactacctc
360aggcccaaca ttaaacatgg agacttcaca gatgatgagg ataggataat ttgcagcctc
420tttgcgagca ttggtagcag gtggtcaata atagcagctc aattaccagg caggacggac
480aatgacatca agaactactg gaacaccaag ctgaaaaaga aaataatggg atctcacaaa
540aaatgccatc aacccagccc ctacgcatca ccatattcgg ggtttgagcc tgcatcttta
600acatcatcac cattttcagc tccatcacca tcaatatcat cggcgtatca caattatcac
660accaccaccc aatttaggtc cctctcacgc tatgaaactt tttcatcaac cccgccaagt
720ctgttgaacg c
73192711DNACucumis melo 92atggggagag ctccttgctg tgacaaagca aacgtgaaga
agggtccgtg gtcgccggag 60gaagacatga aactcaaatc ctatatcgag cagcacggta
caggtgggaa ctggattgca 120ctgccccaga aaattggtct gaagagatgt gggaaaagct
gccgactccg gtggttgaac 180taccttcggc ctaatattaa gcatggggat ttttctgaag
aagaagataa aataatttgc 240agcctttatg tcagcattgg aagcaggtgg tcgattattg
cagctcaatt accgggacga 300acagacaacg atataaagaa ttattggaac acaagattga
agaagaaatt atttggaaaa 360cattatgaga aacaacaaca acaattggtg agaagaggaa
gaaaaattaa atccggaata 420ggaaattcca tggctattgt ttttgataat caaggtattt
acaataataa taataacaac 480caatcgcctt tcttgccgaa attatcaccg gcgctcaatt
caccgccacc accgccgtac 540gctaattatc agccgcttca gttccattca aattcgttca
gcaacaccgc cggcgttatt 600gatgatctga ggagaaccga cagtgtactc cagtttggaa
tggatggggc ttcgtgtcag 660ccagtaacga gcacccacgt cggtgagttg ggagaaaggt
tgtttacagc a 71193237PRTCucumis melo 93Met Gly Arg Ala Pro
Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Met Lys Leu Lys Ser
Tyr Ile Glu Gln His20 25 30Gly Thr Gly
Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr
Leu Arg Pro50 55 60Asn Ile Lys His Gly
Asp Phe Ser Glu Glu Glu Asp Lys Ile Ile Cys65 70
75 80Ser Leu Tyr Val Ser Ile Gly Ser Arg Trp
Ser Ile Ile Ala Ala Gln85 90 95Leu Pro
Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Phe Gly Lys His Tyr Glu Lys
Gln Gln Gln Gln115 120 125Leu Val Arg Arg
Gly Arg Lys Ile Lys Ser Gly Ile Gly Asn Ser Met130 135
140Ala Ile Val Phe Asp Asn Gln Gly Ile Tyr Asn Asn Asn Asn
Asn Asn145 150 155 160Gln
Ser Pro Phe Leu Pro Lys Leu Ser Pro Ala Leu Asn Ser Pro Pro165
170 175Pro Pro Pro Tyr Ala Asn Tyr Gln Pro Leu Gln
Phe His Ser Asn Ser180 185 190Phe Ser Asn
Thr Ala Gly Val Ile Asp Asp Leu Arg Arg Thr Asp Ser195
200 205Val Leu Gln Phe Gly Met Asp Gly Ala Ser Cys Gln
Pro Val Thr Ser210 215 220Thr His Val Gly
Glu Leu Gly Glu Arg Leu Phe Thr Ala225 230
23594831DNACucumis melo 94atggggagag ctccttgctg tgacaaagca aacgtgaaga
agggtccgtg gtcgccggag 60gaagacatga aactcaaatc ctacatcgag cagcacggta
caggtgggaa ctggattgca 120ctgccccaga aaattggtct gaagagatgt gggaaaagct
gccgactccg gtggttgaac 180taccttcggc ctaatattaa gcatggggat ttttctgaag
aagaagataa aataatttgc 240agcctttatg tcagcattgg aagcaggtgg tcgattattg
cagctcaatt accgggacga 300acagacaacg atataaagaa ttattggaac acaagattga
agaagaaatt atttggaaaa 360cattatgaga aacaacaaca acaattggtg agaagaggaa
gaaaaattaa atccggaata 420ggaaattcca tggctattgt ttttgataat caaggtattt
acaataataa taataacaac 480caatcgcctt tcttgccgaa attatcaccg gcgctcaatt
caccgccacc accgccgtac 540gctaattatc agccgcttca gttccattca aattcgttca
gcaacaccgc cggcgttatt 600gatgatctga ggagaaccga cagtgtactc cagtttggaa
tggatggggc ttcgtgtcag 660ccagtaacga gcacccacgt cggtgagttg gagaaggttg
tttacagcaa tactccgtcg 720tttgacgatg ggctgcagtt ttcatgtgat aacaatgggt
tgaatttgat gaacgatttg 780gattgggggg aaatgagttc tttgatttct gctcctttat
atccttcgat g 83195277PRTCucumis melo 95Met Gly Arg Ala Pro
Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Met Lys Leu Lys Ser
Tyr Ile Glu Gln His20 25 30Gly Thr Gly
Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr
Leu Arg Pro50 55 60Asn Ile Lys His Gly
Asp Phe Ser Glu Glu Glu Asp Lys Ile Ile Cys65 70
75 80Ser Leu Tyr Val Ser Ile Gly Ser Arg Trp
Ser Ile Ile Ala Ala Gln85 90 95Leu Pro
Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Phe Gly Lys His Tyr Glu Lys
Gln Gln Gln Gln115 120 125Leu Val Arg Arg
Gly Arg Lys Ile Lys Ser Gly Ile Gly Asn Ser Met130 135
140Ala Ile Val Phe Asp Asn Gln Gly Ile Tyr Asn Asn Asn Asn
Asn Asn145 150 155 160Gln
Ser Pro Phe Leu Pro Lys Leu Ser Pro Ala Leu Asn Ser Pro Pro165
170 175Pro Pro Pro Tyr Ala Asn Tyr Gln Pro Leu Gln
Phe His Ser Asn Ser180 185 190Phe Ser Asn
Thr Ala Gly Val Ile Asp Asp Leu Arg Arg Thr Asp Ser195
200 205Val Leu Gln Phe Gly Met Asp Gly Ala Ser Cys Gln
Pro Val Thr Ser210 215 220Thr His Val Gly
Glu Leu Glu Lys Val Val Tyr Ser Asn Thr Pro Ser225 230
235 240Phe Asp Asp Gly Leu Gln Phe Ser Cys
Asp Asn Asn Gly Leu Asn Leu245 250 255Met
Asn Asp Leu Asp Trp Gly Glu Met Ser Ser Leu Ile Ser Ala Pro260
265 270Leu Tyr Pro Ser Met27596939DNADaucus carota
96atggggagag ctccttgctg tgacaaggca agtgtaaaga gagggccatg gtcacctgaa
60gaagatcaga agctaaaaga ctatatagag aagcatggga ctggaggaaa ctggattgct
120ctaccacaaa aggctggtct tagaagatgt gggaaaagct gcagattaag atggctcaac
180tatctcaggc ctaacattaa acatggagaa ttttcagatg atgaagatag gatcatatgc
240accctctttg ccaatattgg gagcaggtgg tcaataatag caggtcagtt accagggagg
300actgacaatg atatcaagaa ctactggaac accaagctca agaagaaact cctcatgtct
360tcattaatgc tccctaatcc ttttgttagg cctcctaata atattaataa ctaccaacaa
420tacctattat catcaccatc tccttcttca tattcctccc cattgcctcc aacattgaga
480ctttgtgaca acaattaccc aaccctaggt gcttctaggt cattcacaag tgagggcttc
540tcaaatattt caacaaacct ctttaatatc cagcctcaag atcagggttt gggtcccatg
600caaaattatc aagagaaaga atcacctctc attatgtttg gaggggcagg tgatgatcag
660caagctgcta gctctagttc ttatgatgaa catttcatga tgttctccaa ctgtgggatg
720aataatattt ctgatcaaca gaagcaagta aatcttggtg tgtctagtgg tgatcatgag
780attgttagca atgtactgga tcagtataac atgagctgtg ttgaggagat taagcaactg
840attagcacta ctaatctgtc atgtaattct aatctgagtt tctttgttga tgaaaacaag
900acagtgagaa gtgaggagga gaaagtgttg atgtactac
93997313PRTDaucus carota 97Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Ser
Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Gln Lys Leu Lys Asp Tyr Ile Glu Lys His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ala Gly Leu Arg35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Asp Glu Asp
Arg Ile Ile Cys65 70 75
80Thr Leu Phe Ala Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Gly Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Lys Leu Leu Met Ser Ser Leu Met Leu Pro Asn Pro Phe115
120 125Val Arg Pro Pro Asn Asn Ile Asn Asn Tyr Gln Gln
Tyr Leu Leu Ser130 135 140Ser Pro Ser Pro
Ser Ser Tyr Ser Ser Pro Leu Pro Pro Thr Leu Arg145 150
155 160Leu Cys Asp Asn Asn Tyr Pro Thr Leu
Gly Ala Ser Arg Ser Phe Thr165 170 175Ser
Glu Gly Phe Ser Asn Ile Ser Thr Asn Leu Phe Asn Ile Gln Pro180
185 190Gln Asp Gln Gly Leu Gly Pro Met Gln Asn Tyr
Gln Glu Lys Glu Ser195 200 205Pro Leu Ile
Met Phe Gly Gly Ala Gly Asp Asp Gln Gln Ala Ala Ser210
215 220Ser Ser Ser Tyr Asp Glu His Phe Met Met Phe Ser
Asn Cys Gly Met225 230 235
240Asn Asn Ile Ser Asp Gln Gln Lys Gln Val Asn Leu Gly Val Ser Ser245
250 255Gly Asp His Glu Ile Val Ser Asn Val
Leu Asp Gln Tyr Asn Met Ser260 265 270Cys
Val Glu Glu Ile Lys Gln Leu Ile Ser Thr Thr Asn Leu Ser Cys275
280 285Asn Ser Asn Leu Ser Phe Phe Val Asp Glu Asn
Lys Thr Val Arg Ser290 295 300Glu Glu Glu
Lys Val Leu Met Tyr Tyr305 31098744DNAElaeis guineensis
98atgggaagag ccccttgttg cgacaaagcc aacgtgaaga aaggcccatg gtcccccgag
60gaggacgcca agctaaaggc ctacatcgag cagtatggca ctggaggcaa ctggatcgcc
120ctccctcaaa agattggcct taagcgatgc ggtaagagct gccgcctccg atggctaaac
180tatcttcgac cgaatattaa gcatggagga ttctctgaag aagaagatca catcatttgt
240agcctgtaca tcagcatcgg tagcagatgg tctataattg ccgcccaatt gccgggaagg
300accgataacg acatcaagaa ctactggaac acaaggctga agaaaaagct tttaggcaag
360caacgcaagg atcaccagca acaggcgcgt cgaggtgggg ggccaaagca agaagatagg
420agtgatgaga gcggaaatcc ggcagttgcc ggcgagggtg gcagccggag cacgtattgg
480cctgagccag ccatgcctgt ttactcgacc ggccagcccg atcatcatct caacaacaac
540catgcttcga cgacagatag actgctgatg atgctcgggg gaaggtcctc caacgacagt
600aatgcaagcc atcagagccc tttcattcca cagtactcat cagttccact gccacagatc
660tataataatt cagctagctt aattaaccct tcggtcaaca ccacttcctt caaagaagcc
720atgcaagttc ctcatcggtt tgca
74499248PRTElaeis guineensis 99Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ala Tyr Ile Glu Gln Tyr20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ile Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu
Asp His Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Arg100 105 110Leu
Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Asp His Gln Gln Gln115
120 125Ala Arg Arg Gly Gly Gly Pro Lys Gln Glu Asp
Arg Ser Asp Glu Ser130 135 140Gly Asn Pro
Ala Val Ala Gly Glu Gly Gly Ser Arg Ser Thr Tyr Trp145
150 155 160Pro Glu Pro Ala Met Pro Val
Tyr Ser Thr Gly Gln Pro Asp His His165 170
175Leu Asn Asn Asn His Ala Ser Thr Thr Asp Arg Leu Leu Met Met Leu180
185 190Gly Gly Arg Ser Ser Asn Asp Ser Asn
Ala Ser His Gln Ser Pro Phe195 200 205Ile
Pro Gln Tyr Ser Ser Val Pro Leu Pro Gln Ile Tyr Asn Asn Ser210
215 220Ala Ser Leu Ile Asn Pro Ser Val Asn Thr Thr
Ser Phe Lys Glu Ala225 230 235
240Met Gln Val Pro His Arg Phe Ala245100369DNAElaeis oleifera
100atgggaagag caccgtgctg tgacaaggct aatgtgaaga gaggaccatg gtctccagag
60gaagatgcag tccttaaaaa ctatattgag aagcatggga gtgttggtaa ctggattgct
120ttgccccaaa aagcagggct caaacgttgc ggcaaaagct gccgcctccg atggctcaat
180tacctccggc cagacatcaa gcacggcggc ttcacggagg aagaagacat cgccatctgc
240actctctaca acaaaattgg aagcaggtgg tctgttattg cttccaagct gcgaggaaga
300actgacaatg atgtcaagaa ttattggaac actaagctca agaaaaagat gataaacgga
360caaattagt
369101123PRTElaeis oleifera 101Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Val Leu Lys Asn Tyr Ile Glu Lys His20
25 30Gly Ser Val Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asp Ile Lys His Gly Gly Phe Thr Glu Glu Glu
Asp Ile Ala Ile Cys65 70 75
80Thr Leu Tyr Asn Lys Ile Gly Ser Arg Trp Ser Val Ile Ala Ser Lys85
90 95Leu Arg Gly Arg Thr Asp Asn Asp Val
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Met Ile Asn Gly Gln Ile Ser115
120102303DNAEschscholzia californica 102atgggaagag ctccatgttg tgataaagca
aatgtaaaga gaggtccatg gtcacctgaa 60gaagatggaa agctcaaatc ctatattgaa
gaacatggta ctggtggtaa ttggatagcc 120ttgcctcaaa aaataggtct aaagagatgt
gggaagagtt gtcgtctccg gtggttgaac 180tatcttcgac caaacatcaa gcatggtggt
ttctccgaag aagaagataa cattatatgc 240aatctttata ttagtatcgg aagcaggtgg
tctataatag cagcacaact acctggaaga 300act
303103101PRTEschscholzia californica
103Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Gly Lys Leu Lys Ser Tyr Ile Glu Glu His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Asn Leu Tyr Ile Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr100104332DNAEuphorbia esula 104atggggagag
ctccatgctg tgataaagct aatgtgaaga aaggtccatg gtcacctgag 60gaagatgcta
ctctcaaatc ttatattgaa gaaaatggca ccggcggcaa ctggattgcc 120ttgcctcaaa
aaattgggct taagagatgt gggaagagtt gtagacttag atggttgaat 180tatctccggc
caaatattaa acatggaggc ttttctgagg aggaagataa cattatttgc 240agcctctata
taaatattgg aagcaggtgg tctataattg ctgcacaatt acctggaagg 300actgataatg
atataaagaa ctactggaac ac
332105111PRTEuphorbia esula 105Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Thr Leu Lys Ser Tyr Ile Glu Glu Asn20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ile Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu
Asp Asn Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr100 105
110106771DNAEuphorbia esulamisc_feature(747)..(747)n can be any
nucleotide 106atgggaagag ctccttgttg tgataaggct aatgtgaaga aagggccatg
gtctcctgag 60gaagatgctg ctcttaagga ctatattgag aaaaatggaa ctggtggcaa
ttggattgct 120cttcctcaaa aagctggtct taaaagatgt ggaaaaagct gcagattaag
atggctaaac 180tatctaaggc ctaacatcaa gcatggagat ttttctgatc atgaagatag
gatcatttgg 240actctctact ccaacattgg aagcagatgg tcaataattg cagcacaatt
accaggaaga 300acagacaatg atataaaaaa ttattggaac acaaagttga agaagaagct
aatgatgagc 360ataatcaaca ctaatcctac tcttaatcct caaccaaaat tactatctac
tgctggattt 420tcttctcttc ttcaatttag tccttcttcc ccaatttctt ccaccacatc
accatcatct 480tcttcttctt cttcatcaat attaaacact aattacaata atttgtcatt
agtagaaccc 540acaattcagc cattttcaaa caacccgtta atgggttctc tccaaaatta
tcctcatcaa 600gcaaattatc atatgggttt tgaagggtat aatggtaatt actataatgg
agagattaat 660aatcacagtg agtatggatt agaggagatt aaggaaatga ttagcacaac
taatacaatt 720agtagcaact ttttgtttga agaaaannng acagagagta ttatgtactt t
771107257PRTEuphorbia esulaMISC_FEATURE(249)..(249)X can be
any amino acid 107Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys
Gly Pro1 5 10 15Trp Ser
Pro Glu Glu Asp Ala Ala Leu Lys Asp Tyr Ile Glu Lys Asn20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys
Ala Gly Leu Lys35 40 45Arg Cys Gly Lys
Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Asp Phe Ser Asp His Glu Asp Arg Ile
Ile Trp65 70 75 80Thr
Leu Tyr Ser Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn
Tyr Trp Asn Thr Lys100 105 110Leu Lys Lys
Lys Leu Met Met Ser Ile Ile Asn Thr Asn Pro Thr Leu115
120 125Asn Pro Gln Pro Lys Leu Leu Ser Thr Ala Gly Phe
Ser Ser Leu Leu130 135 140Gln Phe Ser Pro
Ser Ser Pro Ile Ser Ser Thr Thr Ser Pro Ser Ser145 150
155 160Ser Ser Ser Ser Ser Ser Ile Leu Asn
Thr Asn Tyr Asn Asn Leu Ser165 170 175Leu
Val Glu Pro Thr Ile Gln Pro Phe Ser Asn Asn Pro Leu Met Gly180
185 190Ser Leu Gln Asn Tyr Pro His Gln Ala Asn Tyr
His Met Gly Phe Glu195 200 205Gly Tyr Asn
Gly Asn Tyr Tyr Asn Gly Glu Ile Asn Asn His Ser Glu210
215 220Tyr Gly Leu Glu Glu Ile Lys Glu Met Ile Ser Thr
Thr Asn Thr Ile225 230 235
240Ser Ser Asn Phe Leu Phe Glu Glu Xaa Xaa Thr Glu Ser Ile Met Tyr245
250 255Phe108649DNAEuphorbia tirucalli
108aattattagt tcagcaagat caatacaagt atgggaagag ctccttgctg cgacaaagcc
60aacgtcaaaa aaggcccttg gtcccctgaa gaagatagct aagcttaaat cttatattga
120gaaacatggc accggtggta actggattgc tttgcctcaa aaaattggcc ttaaaagatg
180tggaaagagt tgccgtctta gatagttgaa ctatcttcgc ccaaacatca aacatggcgg
240cttctctgag gaagaagata acatcatttg cagcctttat attagtattg ggagcagata
300gtctataatt gcagcccaat tgccaggaag aactgataat gacataaaaa attactggaa
360cacaaggctg aaaaagaagc ttcttggaaa gcaaagaaaa gagcaacatt ctagcagaag
420aggaaatggc ctcaagcaag atattatcaa gagagcaaat cggaccaata atattaattc
480atctgatcaa aattactctc attggcctga gctgcctttg cttgctccta caccattcat
540ctccacccaa aatcaagcct tcaatgatca tgcttctatc aggaaattgc ttgtgaagct
600tggagggaaa ttttctgatg atcatcaaga taatcccatt tttaattct
649109671DNAGinkgo biloba 109gattggattg ccttacccca taaagcaggt ctgaagcgat
gcggtaagag ctgcagacta 60agatggttga actatttgag gcctgatatc aagcatggag
gcttctctga agaagaagac 120actatcattt gtagcctcta cggcagcatt ggaagcaggt
ggtctatcat tgctgcgcag 180ttaccgggta gaacagacaa cgatatcaag aattactgga
acacaagact gaagaaaaaa 240ttgcttggga aatgcaagga tcatcagcaa actcgtcgtc
tgtcaagaga agcagagaag 300gaagtgagag cctttgcttc agaagccgtg tccgtggcca
ccactccaac ctcgtctaca 360tctcttgcaa atccagtatc ttatcacatg actactgatc
cctctaatta tgccagtttg 420gatgccggaa tggtctctca ggcgttggaa cactcgagtt
ctcccataag aaccatattg 480gctcatgatc agtctccatt atctccaagt acagctcatg
ggtttgggaa tattcatgat 540gatcacgaag ctttcagaga aatgtcagat tatgcctgcc
ttacaagcct gctcatgcgg 600cttaatgaag gcagcgattc tttcacaaac atttcttgcg
gaatatcaag tacctctcaa 660gttatatccc c
6711101068DNAGlycine max 110atggggaggg caccttgctg
tgacaaagca aacgtgaaga aaggtccttg gtcgccagaa 60gaagatacca aactcaaatc
ctacatcgaa cagcacggaa ccggaggaaa ctggatcgct 120ttgccacaaa agattggcct
taaacgatgt ggcaagagtt gccgtctcag gtggctaaac 180tacctccgcc ctaacatcag
acacggtggt ttctccgaag aagaagacaa catcatttgc 240agcctctacg ttagcattgg
aagcaggtgg tcggtcattg cagcacaatt gccgggaaga 300actgataatg acataaagaa
ctattggaac acgaggctga agaagaagct tctagggaaa 360caccgtaagg aactacaggc
acgcaacaaa ggaaacggtg gtatccttaa acaggagaac 420agttcctcgc tcttgcttca
gcaaaacagt gctcaacaat tacatccatg ttggccacag 480attccagtgc tgccactttc
atcaccgtac acaaaccaaa gtccaagttt caacgaccaa 540gattccatta gaaagcttct
aatcaagctt ggagggagat tctccgatga ttatcaaccc 600atcctagatg ggttgaatct
tcagttccca catggttcaa attctctctc atcaacacaa 660caaattcaag aggagcaagt
ccatgttggt tcttctgcac tagggtgcat gaactctatt 720ggccacaatc aagtacaatt
tggtcagagt aatgagtact gtgctgagtt ggtgcaagga 780caagggagtt tcattaccac
agcaattggg gaaatggttt cttctaatga ttattctcga 840aggttattag gtgggtcgtt
ggagttcttc tatggggagg aaatgattac tgataataag 900ataatgggtg cttgtgcttc
ttcaagttgt gggcaaagta ctaattgggg tgaaaccagt 960agttctctga tgtatccttc
tcttgttgct tcgaattacg agggtgtgcg gcaagagatg 1020ccacgagaat gtgcttttca
agagttgagc tacccggggg cgcaatag 1068111355PRTGlycine max
111Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Thr Lys Leu Lys Ser Tyr Ile Glu Gln His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Arg His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Val Ser Ile
Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys
His Arg Lys Glu Leu Gln Ala Arg115 120
125Asn Lys Gly Asn Gly Gly Ile Leu Lys Gln Glu Asn Ser Ser Ser Leu130
135 140Leu Leu Gln Gln Asn Ser Ala Gln Gln
Leu His Pro Cys Trp Pro Gln145 150 155
160Ile Pro Val Leu Pro Leu Ser Ser Pro Tyr Thr Asn Gln Ser
Pro Ser165 170 175Phe Asn Asp Gln Asp Ser
Ile Arg Lys Leu Leu Ile Lys Leu Gly Gly180 185
190Arg Phe Ser Asp Asp Tyr Gln Pro Ile Leu Asp Gly Leu Asn Leu
Gln195 200 205Phe Pro His Gly Ser Asn Ser
Leu Ser Ser Thr Gln Gln Ile Gln Glu210 215
220Glu Gln Val His Val Gly Ser Ser Ala Leu Gly Cys Met Asn Ser Ile225
230 235 240Gly His Asn Gln
Val Gln Phe Gly Gln Ser Asn Glu Tyr Cys Ala Glu245 250
255Leu Val Gln Gly Gln Gly Ser Phe Ile Thr Thr Ala Ile Gly
Glu Met260 265 270Val Ser Ser Asn Asp Tyr
Ser Arg Arg Leu Leu Gly Gly Ser Leu Glu275 280
285Phe Phe Tyr Gly Glu Glu Met Ile Thr Asp Asn Lys Ile Met Gly
Ala290 295 300Cys Ala Ser Ser Ser Cys Gly
Gln Ser Thr Asn Trp Gly Glu Thr Ser305 310
315 320Ser Ser Leu Met Tyr Pro Ser Leu Val Ala Ser Asn
Tyr Glu Gly Val325 330 335Arg Gln Glu Met
Pro Arg Glu Cys Ala Phe Gln Glu Leu Ser Tyr Pro340 345
350Gly Ala Gln355112444DNAGlycine max 112atggggaggg
caccttgctg tgacaaagca aacgtgaaga aaggtccttg gtcgccagaa 60gaagatacca
aactcaaatc ctacatcgaa cagcacggaa ccggaggaaa ctggatcgct 120ttgccacaaa
agattggcct taaacgatgt ggcaagagtt gccgtctcag gtggctaaac 180tacctccgcc
ctaacatcag acacggtggt ttctccgaag aagaagacaa catcatttgc 240agcctctacg
ttagcattgg aagcaggtgg tcggtcattg cagcacaatt gccgggaaga 300actgataatg
acataaagaa ctattggaac acgaggctga agaagaagct tctagggaaa 360caccgtaagg
aactacaggc acgcaacaaa ggaaacggtg gtatccttaa acaggagaac 420agttcctcgc
tcttgcttca gcaa
444113148PRTGlycine max 113Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn
Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Thr Lys Leu Lys Ser Tyr Ile Glu Gln His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Arg His Gly Gly Phe Ser Glu Glu Glu Asp
Asn Ile Ile Cys65 70 75
80Ser Leu Tyr Val Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Gly Lys His Arg Lys Glu Leu Gln Ala Arg115
120 125Asn Lys Gly Asn Gly Gly Ile Leu Lys Gln Glu Asn
Ser Ser Ser Leu130 135 140Leu Leu Gln
Gln145114627DNAGlycine max 114atgggtagag ctccatgctg tgacaaggcc aacgtgaaga
aaggaccatg gtctcctgaa 60gaggatgctg cactcaaagc ctacattgaa aagaatggaa
ctggtggcaa ttggattgct 120cttcctcaga aaattggttt ggaaaggtgt ggaaagagtt
gcagacttag gtggttaaat 180tacttgaggc ctaatatcaa acatggtgga tttactgaag
aagaagacaa catcatctgc 240agcctgtaca ttagcattgg aagcaggtgg tccataattg
ctgctcagtt acctggaagg 300acagataacg acatcaagaa ctattggaac actagattga
agaagaaact gctcgggagg 360cgcaaacaat ctaacttaag cgcaaaagac acaaacaatg
gaatagagga gaattcttat 420tcaaatgccc taagctcttc agctcttgaa agactccaac
tgcacatgca acttcaaagc 480cttcaaaacc ctttctcttt ctacaacaac ccggcgctgt
ggcccaagtt gcatcctttc 540caagaaaaga tgatccagag cctgcagtct ttgaatgatg
gctctagccc tgtgatgcaa 600aatgctttgc ctaattccca catgtcg
627115209PRTGlycine max 115Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Ala Leu Lys Ala Tyr
Ile Glu Lys Asn20 25 30Gly Thr Gly Gly
Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Glu35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asn Ile Lys His Gly Gly
Phe Thr Glu Glu Glu Asp Asn Ile Ile Cys65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ala Gln85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Arg Arg Lys Gln Ser
Asn Leu Ser Ala115 120 125Lys Asp Thr Asn
Asn Gly Ile Glu Glu Asn Ser Tyr Ser Asn Ala Leu130 135
140Ser Ser Ser Ala Leu Glu Arg Leu Gln Leu His Met Gln Leu
Gln Ser145 150 155 160Leu
Gln Asn Pro Phe Ser Phe Tyr Asn Asn Pro Ala Leu Trp Pro Lys165
170 175Leu His Pro Phe Gln Glu Lys Met Ile Gln Ser
Leu Gln Ser Leu Asn180 185 190Asp Gly Ser
Ser Pro Val Met Gln Asn Ala Leu Pro Asn Ser His Met195
200 205Ser116684DNAGlycine max 116atgggtagag ctccttgttg
tgacaaggca aatgtaaaga aagggccatg gtcaccagaa 60gaagatgcaa agctaaagga
ctacatagag caacatggca ctggaggcaa ttggattgcg 120cttccacaaa aagttggttt
gaaaagatgt ggaaagagtt gtagactaag gtggcttaat 180tatcttagac ccaacattaa
gcatggtcaa ttctctgaag cagaagataa aataatctgt 240agcctctttg ctagcattgg
aagcaggtgg tccataatag catctcagtt gccggggagg 300actgacaatg atatcaagaa
ctattggaac accaagctta agaagaaaat gatggccatg 360aatccttcac tccaaaggaa
gcctcaacaa attacccttt tatccatcct tcaaagttca 420gcatattcat caccatcgtc
attcagagac agtaacaaca tctcatacta ccaccctcac 480ggttcctttt cctactcgtc
aaatcttctg agtggtaaca acacttctgc ttctgctaat 540tccttctttc aagcacaaga
gagtttcatt gaccccactc agaagtgcca attcaaagat 600agcagcaaca atagcatgtt
tttgtttggt ggtgaagcca cagccacagc cactagttgc 660agctcttctg atgggagctg
caac 684117228PRTGlycine max
117Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Asp Tyr Ile Glu Gln His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Val Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Gln Phe Ser Glu Ala Glu Asp Lys Ile Ile Cys65
70 75 80Ser Leu Phe Ala Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ser Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Met Met Ala Met
Asn Pro Ser Leu Gln Arg Lys Pro115 120
125Gln Gln Ile Thr Leu Leu Ser Ile Leu Gln Ser Ser Ala Tyr Ser Ser130
135 140Pro Ser Ser Phe Arg Asp Ser Asn Asn
Ile Ser Tyr Tyr His Pro His145 150 155
160Gly Ser Phe Ser Tyr Ser Ser Asn Leu Leu Ser Gly Asn Asn
Thr Ser165 170 175Ala Ser Ala Asn Ser Phe
Phe Gln Ala Gln Glu Ser Phe Ile Asp Pro180 185
190Thr Gln Lys Cys Gln Phe Lys Asp Ser Ser Asn Asn Ser Met Phe
Leu195 200 205Phe Gly Gly Glu Ala Thr Ala
Thr Ala Thr Ser Cys Ser Ser Ser Asp210 215
220Gly Ser Cys Asn2251181068DNAGossypium 118atggggagag ctccttgttg
tgacaaagct aacgtcaaga aaggcccatg gtcacctgaa 60gaagacacca agctcaaggc
atacatcgag cagcatggca ctggcggaaa ctggatcgct 120ttgcctcaca aaattggcct
taagagatgt gggaagagct gccgcctcag atggttaaac 180tatctccgcc caaatattaa
gcatggagga ttctccgaag aagaagataa aattatttgc 240agcctctata tcagtattgg
gagcaggtgg tctattattg ctgcacaatt accggggagg 300actgataacg atataaagaa
ctattggaac acaaggctta agaagaagct tctgggcaag 360caacgcaaag agcatcagtc
tcgaagaggc aacagcctaa agcaagatat gaagagatca 420tcagctagtg tgggggattc
catggttcct gcagataaca tcaatcaaat cccttactgg 480ccagagctgc ctgtgctggc
tgcggcggcg gctcccatac cgcactcaag tcaagaacat 540cgcattgaca gccaagcctc
gatgagaaga ttactaatca agcttggggg aagattttct 600gaggatgatc atgtggttaa
tgatgggaca actcttcatc agtttcctaa tgatttatcc 660actactgatc aggatcttta
cgagcagact gtctatgtgc cctcttcttc ttcttcttct 720cccatggatg ccttgagctt
aagcaataac atcggttctc agtttgtgaa ctctcagttc 780gccatagatg gagggaatct
gcccattctg caaggacaaa gcactacttt ttcatcagag 840ctccaggaaa tgggatatag
cagcaaccca cagagattag atggaatgga gttcttatat 900ggggaaggca tggtcgataa
tagaggcgtg aatccttgtg aaagcattgg ctggggtgac 960actagctctc tggttggccc
tccttgtgct tcggaatatg gagtcatgca acaaggaatg 1020cttcaagaat atggttttag
tgagatgagg tacccaggag gagcgcaa 1068119356PRTGossypium
119Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Thr Lys Leu Lys Ala Tyr Ile Glu Gln His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Lys Ile Ile Cys65
70 75 80Ser Leu Tyr Ile Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys
Gln Arg Lys Glu His Gln Ser Arg115 120
125Arg Gly Asn Ser Leu Lys Gln Asp Met Lys Arg Ser Ser Ala Ser Val130
135 140Gly Asp Ser Met Val Pro Ala Asp Asn
Ile Asn Gln Ile Pro Tyr Trp145 150 155
160Pro Glu Leu Pro Val Leu Ala Ala Ala Ala Ala Pro Ile Pro
His Ser165 170 175Ser Gln Glu His Arg Ile
Asp Ser Gln Ala Ser Met Arg Arg Leu Leu180 185
190Ile Lys Leu Gly Gly Arg Phe Ser Glu Asp Asp His Val Val Asn
Asp195 200 205Gly Thr Thr Leu His Gln Phe
Pro Asn Asp Leu Ser Thr Thr Asp Gln210 215
220Asp Leu Tyr Glu Gln Thr Val Tyr Val Pro Ser Ser Ser Ser Ser Ser225
230 235 240Pro Met Asp Ala
Leu Ser Leu Ser Asn Asn Ile Gly Ser Gln Phe Val245 250
255Asn Ser Gln Phe Ala Ile Asp Gly Gly Asn Leu Pro Ile Leu
Gln Gly260 265 270Gln Ser Thr Thr Phe Ser
Ser Glu Leu Gln Glu Met Gly Tyr Ser Ser275 280
285Asn Pro Gln Arg Leu Asp Gly Met Glu Phe Leu Tyr Gly Glu Gly
Met290 295 300Val Asp Asn Arg Gly Val Asn
Pro Cys Glu Ser Ile Gly Trp Gly Asp305 310
315 320Thr Ser Ser Leu Val Gly Pro Pro Cys Ala Ser Glu
Tyr Gly Val Met325 330 335Gln Gln Gly Met
Leu Gln Glu Tyr Gly Phe Ser Glu Met Arg Tyr Pro340 345
350Gly Gly Ala Gln355120633DNAGossypium 120atgggtcgag
ctccttgctg tgacaaagcc aacgtgaaga aaggaccatg gtcacctgaa 60gaagatgcta
agcttaaggc ttatatcgaa cagtatggca ctggtggcaa ctggattgct 120cttcctcaga
aaattggtct taagcgatgt gggaagagtt gcaggttgag atggctgaat 180tatttgaggc
caaacattaa gcatggtgga ttctctgagg aagaagacaa catcatatgc 240agtctctata
taagtatagg aagcaggtgg tccataattg ctgcacaact gcctggaaga 300actgataatg
atatcaaaaa ctactggaac accaaattaa agaagaaact acttgggagg 360cgcaaacaac
ctagtaacat ccataggtta tcgaatcaag accctaatga tcctcatcaa 420cctactggtt
cagatgataa tcaattctct cagggtttga gtaacttagc catggaacgt 480cttcagcttc
atatgcagct ccaaactctt cagaaccctt tctctttcta taataaccct 540gctttatggc
ctaagattca ccccttgcaa gaaaagatga tccaaaacat gcaggcttct 600aatggaaacc
cttgtctcgt cctctgcctg atg
633121211PRTGossypium 121Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val
Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ala Tyr Ile Glu Gln Tyr20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp
Asn Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Lys Leu Leu Gly Arg Arg Lys Gln Pro Ser Asn Ile His115
120 125Arg Leu Ser Asn Gln Asp Pro Asn Asp Pro His Gln
Pro Thr Gly Ser130 135 140Asp Asp Asn Gln
Phe Ser Gln Gly Leu Ser Asn Leu Ala Met Glu Arg145 150
155 160Leu Gln Leu His Met Gln Leu Gln Thr
Leu Gln Asn Pro Phe Ser Phe165 170 175Tyr
Asn Asn Pro Ala Leu Trp Pro Lys Ile His Pro Leu Gln Glu Lys180
185 190Met Ile Gln Asn Met Gln Ala Ser Asn Gly Asn
Pro Cys Leu Val Leu195 200 205Cys Leu
Met210122542DNAHedyotis terminalis 122atgggaagag caccttgctg cgacaaagcc
aatgtgaagc gaggtccatg gtcacctgaa 60gaagatgcta aactcaaatc ctacattgag
gaatttggca ccggaggaaa ttggatttcc 120ttaccccaga aaataggtct caagagatgc
ggaaaaagtt gtcggctgag atggttgaat 180tatctgcggc ctaacatcaa gcatggcgga
ttttcagaag aagaagatac cataatctgc 240agcctctata tgagcatagg aagcaggtgg
tctattatag cgggacagct gcccggaaga 300actgataatg acataaagaa ctactggaat
acgaagctga agaagaagtt actagggaaa 360caacgaaagg atccaaagcc acgttcatca
agctcagtct tgactaaaag aaaacagaat 420tcagtaattt ctgaagacat aagtttatac
aacaccataa cacatccaca gatggcttta 480ccagcagttc atgttccaaa gcttgaacaa
caagtccatt tccagtatca gccatctttt 540gc
542123181PRTHedyotis terminalis 123Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Ser Tyr Ile Glu Glu Phe20 25
30Gly Thr Gly Gly Asn Trp Ile Ser Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Thr Ile Ile Cys65
70 75 80Ser Leu Tyr Met Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Gly Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln
Arg Lys Asp Pro Lys Pro Arg115 120 125Ser
Ser Ser Ser Val Leu Thr Lys Arg Lys Gln Asn Ser Val Ile Ser130
135 140Glu Asp Ile Ser Leu Tyr Asn Thr Ile Thr His
Pro Gln Met Ala Leu145 150 155
160Pro Ala Val His Val Pro Lys Leu Glu Gln Gln Val His Phe Gln
Tyr165 170 175Gln Pro Ser Phe
Ala180124616DNAHelianthus annuus 124gatgcaaaac ttaaagaatt tattgagaaa
tacggaactg gtggtaactg gattgctctc 60cctcaaaaag ctggtcttaa aagatgtgga
aagagttgca ggttaagatg gctaaactat 120ctgagaccaa atataagaca tggtgaattc
tctgaagagg aagataggat cattttcagc 180ttgtatgcta gcattggtag cagatggtcg
gtaatagcag ctcagctacc aggaaggact 240gacaatgaca tcaagaatta ctggaacact
aaactgaaga agaagctttt aactatgctt 300ccttattttc aagaaaaatc atcttttttt
ccacctatac cctttcaacc acaaccacca 360tcaaaggatc atcttacatc tcaccagttc
ttgaacaatt catcatcatt ctacactttt 420agtaccccta acttcatgaa tgttgacaac
acttctaact ctctctttgt tcatgaagag 480catactaatg ttactgatct tcagtcatca
gtaccacccc cacccccacc actacaactt 540catgcaaaca gtttgatgtg caacaacaat
actgatgtta atgataactg ttatagtagt 600tatagcatgt gcaaca
616125645DNAHelianthus argophyllus
125atgggtagag caccatgttg tgacaaaaca aatgtgaaaa agggaccttg gtcttctgaa
60gaagatgcta agctcaaaga ttacatcgaa aagtatggta ctggtggtaa ctggattgca
120cttccccaga agatcgggct aaagaggtgt ggaaaaagtt gcagattgag atggttaaat
180taccttcgac ccaacatcaa gcatggagga ttctctgaag aagaagacaa catcatctgc
240agcctctata ttagtatagg cagcaggtgg tctataatag cggctcaact tcctggaaga
300acagataacg acatcaagaa ctactggaac acaaggctga agaagaagtt actcggaagg
360cgcaaacaat cgcaggcgaa taagctctcg ggttcaagcc agtacccgaa agacggcagc
420ggaatagaaa ccctaagcaa ctcagctatt gaaaggcttc aactccacat gcaactccaa
480agccttgaaa accctaattt ccctcactat gataatcccc ccatgtggcc ttgcaaattg
540aatcctatac aagaaaaaat gatgcaaact ctccaacttg caaatgaatc ctctaatcct
600ctcatgatgc aaaatttctc acctgctact cctcaaaagg ttgaa
645126215PRTHelianthus argophyllus 126Met Gly Arg Ala Pro Cys Cys Asp Lys
Thr Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Ser Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Lys
Tyr20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu
Glu Asp Asn Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Arg Arg Lys Gln Ser Gln Ala Asn Lys115
120 125Leu Ser Gly Ser Ser Gln Tyr Pro Lys
Asp Gly Ser Gly Ile Glu Thr130 135 140Leu
Ser Asn Ser Ala Ile Glu Arg Leu Gln Leu His Met Gln Leu Gln145
150 155 160Ser Leu Glu Asn Pro Asn
Phe Pro His Tyr Asp Asn Pro Pro Met Trp165 170
175Pro Cys Lys Leu Asn Pro Ile Gln Glu Lys Met Met Gln Thr Leu
Gln180 185 190Leu Ala Asn Glu Ser Ser Asn
Pro Leu Met Met Gln Asn Phe Ser Pro195 200
205Ala Thr Pro Gln Lys Val Glu210 215127716DNAHelianthus
argophyllus 127atggggagag ccccttgttg tgacaaagca aatgtgaaga aggggccatg
gtcttctgaa 60gaagatgcaa aactcaaaga atttattggg aaatatggaa ctggtgggaa
ctggattgct 120ctccctcaaa aagctggcct taaaagatgt ggaaagagtt gcagattaag
gtggctaaac 180tatctaagac ccaacattag acatggtgaa tactctgatg aagaagacag
gatcatatgc 240aacttgtatg ctagcattgg tagcagatgg tctgttatag ctgctcaact
accaggaagg 300actgataatg atatcaagaa ctactggaac accaaactga agaagaaact
tttaactatg 360ccttcttacc ttcaagaaaa acaaacattt ttttcatcta tgtcctttca
accaccacca 420ccatcatata atattacatc acaccaattg ttaaacaatt catcatcatg
ttactctttt 480agtaccccta acttcatgaa tgttgacaac atgtccaact ctctctttgt
taatgatgat 540attcataaca atcttactaa tcttcaatca ccaacacctc ttgcaaacaa
tttggtgagt 600aacaacagtg ttgatgttgt taatgataac tgtttgagtt atctagggtt
tcaagatggg 660cattatggca tgtgcaatag tccaatgctc atgtatggag gtacatgttc
ttcttc 716128239PRTHelianthus argophyllus 128Met Gly Arg Ala Pro
Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Ser Glu Glu Asp Ala Lys Leu Lys Glu
Phe Ile Gly Lys Tyr20 25 30Gly Thr Gly
Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr
Leu Arg Pro50 55 60Asn Ile Arg His Gly
Glu Tyr Ser Asp Glu Glu Asp Arg Ile Ile Cys65 70
75 80Asn Leu Tyr Ala Ser Ile Gly Ser Arg Trp
Ser Val Ile Ala Ala Gln85 90 95Leu Pro
Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Thr Met Pro Ser Tyr Leu
Gln Glu Lys Gln115 120 125Thr Phe Phe Ser
Ser Met Ser Phe Gln Pro Pro Pro Pro Ser Tyr Asn130 135
140Ile Thr Ser His Gln Leu Leu Asn Asn Ser Ser Ser Cys Tyr
Ser Phe145 150 155 160Ser
Thr Pro Asn Phe Met Asn Val Asp Asn Met Ser Asn Ser Leu Phe165
170 175Val Asn Asp Asp Ile His Asn Asn Leu Thr Asn
Leu Gln Ser Pro Thr180 185 190Pro Leu Ala
Asn Asn Leu Val Ser Asn Asn Ser Val Asp Val Val Asn195
200 205Asp Asn Cys Leu Ser Tyr Leu Gly Phe Gln Asp Gly
His Tyr Gly Met210 215 220Cys Asn Ser Pro
Met Leu Met Tyr Gly Gly Thr Cys Ser Ser Ser225 230
235129803DNAHelianthus ciliaris 129agactacata caaactcatg
gtactggtgg caactggatt actctcccac aaaaagcagg 60ccttagaaga tgtggtaaga
gttgcaggtt gagatggctc aactatctca gaccaaacat 120caaacatggt gaattttctg
atgatgaaga caaacttatt tgtacccttt atgctagcat 180tggtagcagg tggtcaataa
tggcagcaca gttaccagga agaacagata atgatatcaa 240gaactactgg aacacaaaac
tgaagaagaa gaagatgatg agcaacttaa ttaccatgcc 300tgaaattcaa aaacctcttc
atcatcttca ttctttcatt tcctcgacaa actacaacta 360cccatcactg tcaaatttcc
atcctagtga tatcaatatt aatgctccat tatcatcacc 420accaccacca cagtcatcat
atatctatag ctcacacact gctagatatc atgatcaaac 480cctatctatc ccatcaagtc
ctacaataac aacaactgct tctgctgatg ctgctgctgt 540tgtttctgat catcatcacc
atccagttct acaaccacaa gatcatggga atcttggtta 600ccaaagggta aaagacagtt
ctacccttgt gatgtttgga ggtgatcaag ctagttccag 660tacttctaac tctgagggga
gttgtcatta tgagtatggt actggtatag gtgtgtatga 720tcacaacaag tacaacatgg
gacatatgag gagcaatcct ttcaaggaat gtgggatgat 780gacacatgat aaggattttg
ggt 803130684DNAHelianthus
exilismisc_feature(631)..(631)n is any nucleotide 130atgggaagag
cactttgttg tgacaaagct aatgtcaaaa gaggaccatg gtcacctgaa 60gaagatgcca
aactcaagtc atatatcgat gaatatggca ccggcggcaa ctggattgcg 120ctaccgcata
aaataggact caggagatgt ggcaagagtt gtcgccttcg atggttaaac 180tatcttcgtc
cgaatatcaa gcatggatct ttctcggaag aagaagacca tatcatttcc 240accctctatc
tcaatatcgg tagccggtgg tccattatag cttcacaatt acccgggaga 300actgataatg
atatcaagaa ctactggaac acaagactaa agaaaaaact catgggaatg 360cagcgaaaag
atcaacaatt gtcaaagaga ggtgggataa tcaagcaaga aatgaagaga 420gaggttcttg
aagatttaaa ggtaccatct ttatccacaa gcatgaatct ttatcaatct 480ttctggccta
cagaattccc tttgatagtc acaaacccta accatcatgg tcaagaactt 540catgtgaaac
cacaagatcc cacctttaat cactaccctt ttgacacaac ttcaatccaa 600ccagaattgt
tggatcaaaa caacaagagt ncattgccaa acacacccta ttttgctaat 660ggggatgttg
taaacctatt tcaa
684131228PRTHelianthus exilisMISC_FEATURE(211)..(211)X is any amino acid
131Met Gly Arg Ala Leu Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Ser Tyr Ile Asp Glu Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Ser Phe Ser Glu Glu Glu Asp His Ile Ile Ser65
70 75 80Thr Leu Tyr Leu Asn Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ser Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Met Gly Met
Gln Arg Lys Asp Gln Gln Leu Ser115 120
125Lys Arg Gly Gly Ile Ile Lys Gln Glu Met Lys Arg Glu Val Leu Glu130
135 140Asp Leu Lys Val Pro Ser Leu Ser Thr
Ser Met Asn Leu Tyr Gln Ser145 150 155
160Phe Trp Pro Thr Glu Phe Pro Leu Ile Val Thr Asn Pro Asn
His His165 170 175Gly Gln Glu Leu His Val
Lys Pro Gln Asp Pro Thr Phe Asn His Tyr180 185
190Pro Phe Asp Thr Thr Ser Ile Gln Pro Glu Leu Leu Asp Gln Asn
Asn195 200 205Lys Ser Xaa Leu Pro Asn Thr
Pro Tyr Phe Ala Asn Gly Asp Val Val210 215
220Asn Leu Phe Gln225132654DNAHelianthus exilis 132atggggagag ccccttgttg
tgacaaagca aatgtgaaga aagggccatg gtcttctgaa 60gaagatgcaa aactcaaaga
atttattggg aaatatggaa ctggtgggaa ctggattgct 120ctccctcaaa aagctggcct
taaaagatgt ggaaagagtt gcagattaag gtggctaaac 180tatctaagac ccaacattag
acatggtgaa tactctgatg aagaagacag gatcatatgc 240aacttgtatg ctagcattgg
tagcagatgg tctgttatag ctgctcaact accaggaagg 300actgataatg atatcaagaa
ttactggaac actaaactga agaagaaact tttaactatg 360ccttcttacc ttcaagaaaa
gcaaacattt ttttcatcca tgcccttcca accaccacca 420ccatcatata atattacatc
acaccagatc ttgaacaatt catcatcatg ttactctttt 480agtaccccta acttcatgaa
tgttgacaac atgtccaact cactctttgt tcatgatgat 540attcataaca atcttactaa
tcttcaatca ccaacacctc ttgcagacaa tttgatgagt 600aacaacagtg ttgatgttgt
taatgataag tgtttgagtt atctagggtt tcaa 654133218PRTHelianthus
exilis 133Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly
Pro1 5 10 15Trp Ser Ser
Glu Glu Asp Ala Lys Leu Lys Glu Phe Ile Gly Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Arg His Gly Glu Tyr Ser Asp Glu Glu Asp Arg Ile Ile
Cys65 70 75 80Asn Leu
Tyr Ala Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Lys100 105 110Leu Lys Lys Lys
Leu Leu Thr Met Pro Ser Tyr Leu Gln Glu Lys Gln115 120
125Thr Phe Phe Ser Ser Met Pro Phe Gln Pro Pro Pro Pro Ser
Tyr Asn130 135 140Ile Thr Ser His Gln Ile
Leu Asn Asn Ser Ser Ser Cys Tyr Ser Phe145 150
155 160Ser Thr Pro Asn Phe Met Asn Val Asp Asn Met
Ser Asn Ser Leu Phe165 170 175Val His Asp
Asp Ile His Asn Asn Leu Thr Asn Leu Gln Ser Pro Thr180
185 190Pro Leu Ala Asp Asn Leu Met Ser Asn Asn Ser Val
Asp Val Val Asn195 200 205Asp Lys Cys Leu
Ser Tyr Leu Gly Phe Gln210 215134840DNAHelianthus
paradoxus 134tgccaactaa ggcttatatc gaagaacatg gcaccggagg caactggatt
gctctacctc 60ataaaattgg cctcaagaga tgtggcaaga gttgtcgcct tcgatggcta
aactatcttc 120gcccaaacat caagcatgga tctttctccg aggaagaaga tcacatcatt
tgcaccctct 180atctcagtat tggcagccgg tggtctatca tagcggcaca attgcctggg
cgaacggata 240atgatatcaa gaactactgg aacacgagac taaagaaaaa actaatggga
aaacaacgga 300aagatcaaca atcgtcgaaa cgaggttgct taatcaagca agaaatgaag
agagaggttc 360ttgaagattt aaaggcacca tctttatgca tgagcatgaa tctttatcaa
tcttactggc 420caacagaatt ccctttgatt ctcacaaacc ctaatcagca tcatcatcaa
gaacttgcca 480aaacccaaga tcccactttt gagccatacc aatttgacac aatctcaaat
caacccgaat 540tgttcaatca aaacaacgat aaaccgatat caactgcttc ttaccatcct
cagttgccaa 600acacacccta ttttctaaat ggagatggta taaatctgtt tcaagaattc
aacaattacc 660catttgggat cgatgaattt ggctgcaaca caacacagtt agagcagttt
gatgggttgg 720ttgggggttt ggataacctt gtcaatggaa gcaattgcac aagctcatcg
gcagaaagta 780caactagttg gggtgatcct ctagcatgcc ctcctccccc tccgatggcg
tatgatgatt 840135429DNAHelianthus petiolaris 135atgggaagag caccttgctg
tgacaaagcc aacgtcaaaa gaggaccatg gtcacctgaa 60gaagatgcca aactaaaggc
ttatatcgaa gaacatggca ccggaggcaa ctggattgct 120ctacctcata aaattggcct
caagagatgt ggcaagagtt gtcgccttcg atggctaaac 180tatcttcgcc caaacatcaa
gcatggatct ttctccgagg aagaagatca catcatttgc 240accctctatc tcagtattgg
cagccggtgg tctatcatag cggcacaatt gcctgggcga 300acggataatg atatcaagaa
ctactggaac acgagactaa agaaaaaaac taatgggaaa 360acaacggaaa gatcaacaat
cgtcgaaacg aggtggctta atcaagcaag aaatgaagag 420agaggttct
429136143PRTHelianthus
petiolaris 136Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Ala Lys Leu Lys Ala Tyr Ile Glu Glu His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Ser Phe Ser Glu Glu Glu Asp His Ile Ile
Cys65 70 75 80Thr Leu
Tyr Leu Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Lys Lys
Thr Asn Gly Lys Thr Thr Glu Arg Ser Thr Ile Val115 120
125Glu Thr Arg Trp Leu Asn Gln Ala Arg Asn Glu Glu Arg Gly
Ser130 135 140137723DNAHelianthus
petiolaris 137atgggaagag caccttgttg tgacaaagct aatgtaaaaa gaggaccatg
gtcacctgaa 60gaagatgcca aactcaagtc atatattgat gaatatggca ccggcggcaa
ctggatcgcg 120ctaccgcata aaataggact caggagatgt ggcaagagtt gccgccttcg
atggttaaac 180tatcttcgtc cgaatatcaa gcatggatct ttctcggaag aagaagatca
tatcatttcc 240actctctatc tcaatatcgg tagccggtgg tcaattatag cttcacattt
acccggccgg 300actgataatg atatcaagaa ctactggaac acaagactaa agaaaaaact
catgggaatg 360cagcgaaaag atcaacaatt gtcaaagaga ggggggatag tcaagcaaga
aatgaagaga 420gaggttcttg aagaattaaa gggaccatct ttagctacaa gcatgaatct
ttatcaatct 480ttttggccta cagaatccgc tttgatagtc acaaacccta atcatcatgg
tcaagaactt 540catgtgaaac cacaagatcc cacctttaat cactaccctt ttgacacaac
ctcaatccaa 600ccagaattgt tggatcaaaa caacaagaat cagttgctaa acacacccta
ttttgctaat 660ggggatgttg taaacctatt tcaagaattc acaattaccc atttgagatc
aatgagttta 720act
723138241PRTHelianthus petiolaris 138Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ser Tyr
Ile Asp Glu Tyr20 25 30Gly Thr Gly Gly
Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Arg35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asn Ile Lys His Gly Ser
Phe Ser Glu Glu Glu Asp His Ile Ile Ser65 70
75 80Thr Leu Tyr Leu Asn Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ser His85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Met Gly Met Gln Arg Lys Asp
Gln Gln Leu Ser115 120 125Lys Arg Gly Gly
Ile Val Lys Gln Glu Met Lys Arg Glu Val Leu Glu130 135
140Glu Leu Lys Gly Pro Ser Leu Ala Thr Ser Met Asn Leu Tyr
Gln Ser145 150 155 160Phe
Trp Pro Thr Glu Ser Ala Leu Ile Val Thr Asn Pro Asn His His165
170 175Gly Gln Glu Leu His Val Lys Pro Gln Asp Pro
Thr Phe Asn His Tyr180 185 190Pro Phe Asp
Thr Thr Ser Ile Gln Pro Glu Leu Leu Asp Gln Asn Asn195
200 205Lys Asn Gln Leu Leu Asn Thr Pro Tyr Phe Ala Asn
Gly Asp Val Val210 215 220Asn Leu Phe Gln
Glu Phe Thr Ile Thr His Leu Arg Ser Met Ser Leu225 230
235 240Thr139758DNAHelianthus
tuberosusmisc_feature(612)..(612)n is any nucleotide 139atgggtagag
caccatgttg tgacaaaaca aatgtgaaaa agggaccttg gtcttctgaa 60gaagatgcta
agctcaaaga ttatatcgaa aagtatggta ctggtggtaa ctggattgca 120cttccccaga
agatcgggct aaagaggtgc ggaaaaagtt gcagattgag atggttaaat 180taccttcgac
ccaacatcaa gcatggagga ttctctgaag aagaagacaa catcatctgc 240agcctctata
ttagtatagg cagcaggtgg tctataatag cggctcaact tcctggaaga 300acagataacg
acatcaagaa ctactggaac acaaggttga agaagaagtt actcggaagg 360cgcaaacaat
cacaggcgaa taagctttcg ggttcaagcc aggacccgaa agacggcagc 420ggaatagaaa
ccctaagcaa ctcagccatt gaaaggcttc aacttcacat gcaactccaa 480agccttgaaa
accctaattt ccctcactat gataatcccc ccatgtggcc ttgcaagttg 540aatcctatac
aagaaaaaat gatgcaaact ctccaacttg caaatgaagc ctctaaccct 600ctcatgatgc
anaatttctc acctgctact cctcaaaaag ttgaatacta tgcacaatcc 660aataaccctc
tttctctaca agaaaactgg gtcgtttccg gcaatatgat gattaacgga 720atggagaatt
ccatgggaat caacattcct gagtcatc
758140253PRTHelianthus tuberosusMISC_FEATURE(204)..(204)X is any amino
acid 140Met Gly Arg Ala Pro Cys Cys Asp Lys Thr Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Ser Glu
Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly
Leu Lys35 40 45Arg Cys Gly Lys Ser Cys
Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Ile
Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn
Thr Arg100 105 110Leu Lys Lys Lys Leu Leu
Gly Arg Arg Lys Gln Ser Gln Ala Asn Lys115 120
125Leu Ser Gly Ser Ser Gln Asp Pro Lys Asp Gly Ser Gly Ile Glu
Thr130 135 140Leu Ser Asn Ser Ala Ile Glu
Arg Leu Gln Leu His Met Gln Leu Gln145 150
155 160Ser Leu Glu Asn Pro Asn Phe Pro His Tyr Asp Asn
Pro Pro Met Trp165 170 175Pro Cys Lys Leu
Asn Pro Ile Gln Glu Lys Met Met Gln Thr Leu Gln180 185
190Leu Ala Asn Glu Ala Ser Asn Pro Leu Met Met Xaa Asn Phe
Ser Pro195 200 205Ala Thr Pro Gln Lys Val
Glu Tyr Tyr Ala Gln Ser Asn Asn Pro Leu210 215
220Ser Leu Gln Glu Asn Trp Val Val Ser Gly Asn Met Met Ile Asn
Gly225 230 235 240Met Glu
Asn Ser Met Gly Ile Asn Ile Pro Glu Ser Ser245
250141473DNAHordeum vulgare 141atggggaggg cgccgtgctg cgacagggcg
gcggtgaacc ggggcccgtg gtcgccggag 60gaggacgacg cgctgcgcga ctacatgcag
cgccatggca acaccggcag ctggatcacc 120ctccccgcca aagccgggct caagaggtgc
ggcaagagct gcaggctgcg gtggctcaac 180tacctgcgcc cggacatccg ccacggcggc
ttcaccgacg aggaggacgc catcatctac 240tccctctaca gccagctcgg cagcaagtgg
tcgctgatag cgtcgcagct ggagaggagg 300acggacaacg acgtcaagaa ccactggaac
accaagctca agaagcgcct cgtcgccgcg 360gccgccgcct tcccggccgc cccctcctcc
cgccgccccc cgtccttcac tccggccgca 420gcgcacgcgc acgcgcaccc gtcgccgctg
ctcccgctcc ccgcgccgac cgt 473142158PRTHordeum vulgare 142Met
Gly Arg Ala Pro Cys Cys Asp Arg Ala Ala Val Asn Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Asp
Ala Leu Arg Asp Tyr Met Gln Arg His20 25
30Gly Asn Thr Gly Ser Trp Ile Thr Leu Pro Ala Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asp Ile
Arg His Gly Gly Phe Thr Asp Glu Glu Asp Ala Ile Ile Tyr65
70 75 80Ser Leu Tyr Ser Gln Leu Gly
Ser Lys Trp Ser Leu Ile Ala Ser Gln85 90
95Leu Glu Arg Arg Thr Asp Asn Asp Val Lys Asn His Trp Asn Thr Lys100
105 110Leu Lys Lys Arg Leu Val Ala Ala Ala
Ala Ala Phe Pro Ala Ala Pro115 120 125Ser
Ser Arg Arg Pro Pro Ser Phe Thr Pro Ala Ala Ala His Ala His130
135 140Ala His Pro Ser Pro Leu Leu Pro Leu Pro Ala
Pro Thr Val145 150 155143792DNAHumulus
lupulus 143atgggaagag ccccttgttg cgacaaaacc aaagtgaaaa gaggaccttg
gtctcctgaa 60gaagacgccg ccctcaagca ctacatgcac aacaatggaa ctgggggtaa
ttggattgct 120ctacctcaca aagcaggtct taaccggtgc ggcaaaagtt gtcgtttgag
atggctgaat 180tatctcagac cagatatcaa gcacgggggt tttaccgagg aagaagacaa
tgttgtttgg 240accctttaca gtaacatcgg aagtaggtgg tctgtcatag catcccaact
acctggaaga 300acagacaacg atgtgaaaaa ccactggaac accaagttga agaaaaaact
attggcaaga 360agcaccaact gcaatgagac tactaacact accgatcacc tcaacttggc
tcggttctca 420ccattgattc cgaaaactga gaattctgat caagggaact cagcttattg
ttatactaat 480tcagatacat taccatatct gatgaatggg aactgtgtgc aaaactttga
tctcaagaag 540ccaagtaaac cttcttactc atcaaaccct gttcaagttt ctgattttgg
aacaaatggg 600aacaccagtt ctagcatttc atcaactaga gaggcttcga gtctttcaac
ttcatcatct 660actttggcta tggagaacta tactaatttt gcttcatggt ctggcactgg
cattgaaagt 720actgaagatg ggattcttgt ggacttggga tttgactctt ctaatgatgt
tttcctgagt 780ggctttggat tt
792144264PRTHumulus lupulus 144Met Gly Arg Ala Pro Cys Cys
Asp Lys Thr Lys Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Ala Leu Lys His Tyr Met
His Asn Asn20 25 30Gly Thr Gly Gly Asn
Trp Ile Ala Leu Pro His Lys Ala Gly Leu Asn35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asp Ile Lys His Gly Gly Phe
Thr Glu Glu Glu Asp Asn Val Val Trp65 70
75 80Thr Leu Tyr Ser Asn Ile Gly Ser Arg Trp Ser Val
Ile Ala Ser Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Val Lys Asn His Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Leu Ala Arg Ser Thr Asn Cys Asn Glu
Thr Thr115 120 125Asn Thr Thr Asp His Leu
Asn Leu Ala Arg Phe Ser Pro Leu Ile Pro130 135
140Lys Thr Glu Asn Ser Asp Gln Gly Asn Ser Ala Tyr Cys Tyr Thr
Asn145 150 155 160Ser Asp
Thr Leu Pro Tyr Leu Met Asn Gly Asn Cys Val Gln Asn Phe165
170 175Asp Leu Lys Lys Pro Ser Lys Pro Ser Tyr Ser Ser
Asn Pro Val Gln180 185 190Val Ser Asp Phe
Gly Thr Asn Gly Asn Thr Ser Ser Ser Ile Ser Ser195 200
205Thr Arg Glu Ala Ser Ser Leu Ser Thr Ser Ser Ser Thr Leu
Ala Met210 215 220Glu Asn Tyr Thr Asn Phe
Ala Ser Trp Ser Gly Thr Gly Ile Glu Ser225 230
235 240Thr Glu Asp Gly Ile Leu Val Asp Leu Gly Phe
Asp Ser Ser Asn Asp245 250 255Val Phe Leu
Ser Gly Phe Gly Phe260145783DNALactuca perennismisc_feature(709)..(709)n
is any nucleotide 145atgggaagag caccttgctg tgacaaagcc aacgtcaaaa
gaggaccttg gtcacctgaa 60gaagatgcca aactcaagtc ttacattgaa gaacatggca
ccggaggcaa ctggatcgct 120ctacctcata aaatcggact caaaaggtgt ggcaagagtt
gtcgccttag atggttaaat 180tatcttcgcc caaacataaa gcatggatct ttctccgaag
aagaagatca catcatttgt 240accctatatc ttagtattgg cagcaggtgg tctatcatcg
cagcacaatt acctggaaga 300actgataatg atataaaaaa ctactggaac acgaggctaa
agaaaaagct tttgggaaag 360cagcgtaaag atcaacaatc gtcaaagaga ggtgggcaag
tcaagcaaga aatgaagaca 420gaggtgtttg aggacttaaa ggcaccatct ttatgcatga
ccatgaatct ttatcaatca 480tactggcctt cagaattccc tttgattctc acaaaccctg
atcaacatca tcaggaactt 540cacgtcaaaa accaagatcc caatttcaac ccatacccat
ttgacataac ctcaaaccaa 600gcccaaatgt tccatcagaa ctgtgaaaaa cctatctcta
ctacctctta ccatccccag 660ttggcaacca caccctatta tccaaatggg gatggagtaa
acctgtttnc agaatttaaa 720cattacccat ttgagatcaa tgaattggtt tacaacacca
cacaattaga ccagtttgat 780ggg
783146261PRTLactuca
perennisMISC_FEATURE(237)..(237)X is any amino acid 146Met Gly Arg Ala
Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys
Ser Tyr Ile Glu Glu His20 25 30Gly Thr
Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn
Tyr Leu Arg Pro50 55 60Asn Ile Lys His
Gly Ser Phe Ser Glu Glu Glu Asp His Ile Ile Cys65 70
75 80Thr Leu Tyr Leu Ser Ile Gly Ser Arg
Trp Ser Ile Ile Ala Ala Gln85 90 95Leu
Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys
Asp Gln Gln Ser Ser115 120 125Lys Arg Gly
Gly Gln Val Lys Gln Glu Met Lys Thr Glu Val Phe Glu130
135 140Asp Leu Lys Ala Pro Ser Leu Cys Met Thr Met Asn
Leu Tyr Gln Ser145 150 155
160Tyr Trp Pro Ser Glu Phe Pro Leu Ile Leu Thr Asn Pro Asp Gln His165
170 175His Gln Glu Leu His Val Lys Asn Gln
Asp Pro Asn Phe Asn Pro Tyr180 185 190Pro
Phe Asp Ile Thr Ser Asn Gln Ala Gln Met Phe His Gln Asn Cys195
200 205Glu Lys Pro Ile Ser Thr Thr Ser Tyr His Pro
Gln Leu Ala Thr Thr210 215 220Pro Tyr Tyr
Pro Asn Gly Asp Gly Val Asn Leu Phe Xaa Glu Phe Lys225
230 235 240His Tyr Pro Phe Glu Ile Asn
Glu Leu Val Tyr Asn Thr Thr Gln Leu245 250
255Asp Gln Phe Asp Gly260147756DNALactuca saligna 147atgggaaggg
ctccatgttg tgacaaggca aacgtgaaga aagggccatg gtcacctgaa 60gaagatagaa
agcttaaaga ctacatagaa acgcatggca ctggtggcaa ctggatcgct 120ctcccacaaa
aagcaggcct tagacgatgt ggaaagagct gcagattgag atggttgaac 180tatctgagac
caaacattaa acatggcgaa ttttctgatg atgaagataa agttatctgt 240gccctctatg
ctagcattgg tagcaggtgg tcaataatgg cagcacagtt accaggaaga 300acagacaatg
atatcaagaa ctactggaac acaaagctga agaagaagat gatgaacagc 360ttaatcaccc
tacccgaact tagaaaacct cttcaacatc ttcaatcttt cagttcttca 420acaaactaca
gctacccatc aaaatcgatt tttcaaaatt gtaatatcaa tgttaatgct 480ccattatcat
catcaatatc atcgtctcca tcatatttgt acagtacaaa tacttctagc 540taccatgatc
aaaccttatc tatcccatct agccctagaa ttaatgctgc taatcgtcaa 600catccagctc
tccaatcaca agatcatgga tttctcggtt tggtttccgc ggagacttac 660cagcaggggg
taaaagacag ttctaccctt gtcttctttg gaggtgatca agctagttgc 720agttctaatt
ccgatgggag ctgccattat gagtat
756148252PRTLactuca saligna 148Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Arg Lys Leu Lys Asp Tyr Ile Glu Thr His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Arg35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Asp Glu
Asp Lys Val Ile Cys65 70 75
80Ala Leu Tyr Ala Ser Ile Gly Ser Arg Trp Ser Ile Met Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Met Met Asn Ser Leu Ile Thr Leu Pro Glu Leu Arg115
120 125Lys Pro Leu Gln His Leu Gln Ser Phe Ser Ser
Ser Thr Asn Tyr Ser130 135 140Tyr Pro Ser
Lys Ser Ile Phe Gln Asn Cys Asn Ile Asn Val Asn Ala145
150 155 160Pro Leu Ser Ser Ser Ile Ser
Ser Ser Pro Ser Tyr Leu Tyr Ser Thr165 170
175Asn Thr Ser Ser Tyr His Asp Gln Thr Leu Ser Ile Pro Ser Ser Pro180
185 190Arg Ile Asn Ala Ala Asn Arg Gln His
Pro Ala Leu Gln Ser Gln Asp195 200 205His
Gly Phe Leu Gly Leu Val Ser Ala Glu Thr Tyr Gln Gln Gly Val210
215 220Lys Asp Ser Ser Thr Leu Val Phe Phe Gly Gly
Asp Gln Ala Ser Cys225 230 235
240Ser Ser Asn Ser Asp Gly Ser Cys His Tyr Glu Tyr245
250149774DNALactuca sativa 149atgggaaggg ctccgtgttg tgacaaagaa
aatgtaaaga gaggcccatg gtctcctgaa 60gaagacgcaa aactcaaaag cttcatcgac
aaatatggca ctggtggtaa ctggattgct 120ctccctcaca aagctggtct aaagaggtgt
ggaaagagct gcagattgcg atggttaaac 180tatctgaggc ccaatattaa gcatggtgaa
ttcactgatg atgaagacaa gatcatctgc 240agcttgtatg ctagcattgg tagcagatgg
tcaataatag cagctcagct accaggaagg 300actgataatg atataaagaa ttactggaac
accaaactca agaagaagct cttggctatg 360cttccttcct ttcaaaagaa ggcatctttt
tttccatcta tatcccttca atcaccatca 420ccatacagat catcagatca gttcatgacc
gataattctt cccttttcta cggttacact 480aatttcatga acatgaacaa caatatcaac
tctctgctga cacctaccac caacaccacc 540accggtgttc aagatcatct gatctcacca
tcaccaccac caccaccagg agccgatcta 600aactctttaa tcggtttgat gaccaacaac
attgatcaca atgagaattc tttctattta 660gggtatcagg agaatcatca gagcatgtac
aacaccccca tggaatacca ctaccttaca 720tcagatgtga aagaaccgat gctcattttt
ggagtggtgg tgagggtcat gaag 774150258PRTLactuca sativa 150Met Gly
Arg Ala Pro Cys Cys Asp Lys Glu Asn Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys
Leu Lys Ser Phe Ile Asp Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Glu Phe Thr Asp Asp Glu Asp Lys Ile Ile Cys65
70 75 80Ser Leu Tyr Ala Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Ala Met Leu
Pro Ser Phe Gln Lys Lys Ala115 120 125Ser
Phe Phe Pro Ser Ile Ser Leu Gln Ser Pro Ser Pro Tyr Arg Ser130
135 140Ser Asp Gln Phe Met Thr Asp Asn Ser Ser Leu
Phe Tyr Gly Tyr Thr145 150 155
160Asn Phe Met Asn Met Asn Asn Asn Ile Asn Ser Leu Leu Thr Pro
Thr165 170 175Thr Asn Thr Thr Thr Gly Val
Gln Asp His Leu Ile Ser Pro Ser Pro180 185
190Pro Pro Pro Pro Gly Ala Asp Leu Asn Ser Leu Ile Gly Leu Met Thr195
200 205Asn Asn Ile Asp His Asn Glu Asn Ser
Phe Tyr Leu Gly Tyr Gln Glu210 215 220Asn
His Gln Ser Met Tyr Asn Thr Pro Met Glu Tyr His Tyr Leu Thr225
230 235 240Ser Asp Val Lys Glu Pro
Met Leu Ile Phe Gly Val Val Val Arg Val245 250
255Met Lys151816DNALactuca sativa 151atgggaaggg ctccgtgttg
tgacaaagaa aatgtaaaga gaggcccatg gtctcctgaa 60gaagacgcaa aactcaaaag
cttcatcgac aaatatggca ctggtggtaa ctggattgct 120ctccctcaca aagctggtct
aaaaaggtgt ggaaagagct gcagattgcg atggttaaac 180tatctgaggc ccaatattaa
gcatggtgaa ttcactgatg atgaagacaa gatcatctgc 240agcttgtatg ctagcattgg
tagcagatgg tcaataatag cagctcagct accaggaagg 300actgataatg atataaagaa
ttactggaac accaaactca agaagaagct cttggctatg 360cttccttcct ttcaaaagaa
ggcatctttt tttccatcta tatcccttca atcaccatca 420ccatacagat catcagatca
gttcatgacc gataattctt cccttttcta cggttacact 480aatttcatga acatgaacaa
caatatcaac tctctgctga cacctaccac caacaccacc 540accggtgttc aagatcatct
gatctcacca tcaccaccac caccaccagg agccgatcta 600aactctttaa tcggtttgat
gaccaacaac attgacaaca atgagaattc tttctattta 660gggtatcagg agaatcatca
gagcatgtac aacaccccca tggaatacca ctaccttaca 720tcagatgtga aagaaccgat
gctcattttt ggaagtggtg gtgagggtca tgaagtgagc 780accactggtt cttcctctga
agctgggagt tctttg 816152272PRTLactuca sativa
152Met Gly Arg Ala Pro Cys Cys Asp Lys Glu Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Ser Phe Ile Asp Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Glu Phe Thr Asp Asp Glu Asp Lys Ile Ile Cys65
70 75 80Ser Leu Tyr Ala Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Ala Met
Leu Pro Ser Phe Gln Lys Lys Ala115 120
125Ser Phe Phe Pro Ser Ile Ser Leu Gln Ser Pro Ser Pro Tyr Arg Ser130
135 140Ser Asp Gln Phe Met Thr Asp Asn Ser
Ser Leu Phe Tyr Gly Tyr Thr145 150 155
160Asn Phe Met Asn Met Asn Asn Asn Ile Asn Ser Leu Leu Thr
Pro Thr165 170 175Thr Asn Thr Thr Thr Gly
Val Gln Asp His Leu Ile Ser Pro Ser Pro180 185
190Pro Pro Pro Pro Gly Ala Asp Leu Asn Ser Leu Ile Gly Leu Met
Thr195 200 205Asn Asn Ile Asp Asn Asn Glu
Asn Ser Phe Tyr Leu Gly Tyr Gln Glu210 215
220Asn His Gln Ser Met Tyr Asn Thr Pro Met Glu Tyr His Tyr Leu Thr225
230 235 240Ser Asp Val Lys
Glu Pro Met Leu Ile Phe Gly Ser Gly Gly Glu Gly245 250
255His Glu Val Ser Thr Thr Gly Ser Ser Ser Glu Ala Gly Ser
Ser Leu260 265 270153750DNALactuca sativa
153atgggaagag ctccatgttg tgacaagtca aaggtgaaga aagggccatg gtcacctcaa
60gaagatacaa agttgaaaga ctacatacac aaaaatggta ctggaggcaa ctggattgct
120cttccacata aagcaggcct taaaagatgt gggaaaagct gccgattaag atggttgaac
180taccttagac cagacatcaa acatggtgaa ttctccgacc atgaagatag actcatctat
240actctctttt ctagcatcgg tagcaggtgg tcagtaatag cagcacagtt accaggaaga
300acagataatg atatcaagaa ctactggaac acgaagctca agaagaagat tatgaatttc
360atctccacca atcatcaaac tgagaaacca cttcatcatc ttgagtttgt caattgtcct
420tcttcaaact atagctaccc gacaagctct tcggttacaa ctgttgataa tcatccagtt
480ctaccatcag atgatcatgg atacataaat gtggacatgg aaacctaccg agtaaaagac
540aattctaccc tttttatgct tgaaggtgat gctcaagctg ctgattgcag ttctaattct
600gatggaaggt ggcatcatgt gtatggtagt ggtgcgtttg atgcgcgtaa taacatggga
660ggcttgaaga caagtaattc ctacatggga gttgatggga attattgtga aaaggcaaga
720gggtgttatg gggattctac catggagttt
750154250PRTLactuca sativa 154Met Gly Arg Ala Pro Cys Cys Asp Lys Ser Lys
Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Gln Glu Asp Thr Lys Leu Lys Asp Tyr Ile His Lys Asn20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
His Lys Ala Gly Leu Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asp Ile Lys His Gly Glu Phe Ser Asp His Glu Asp
Arg Leu Ile Tyr65 70 75
80Thr Leu Phe Ser Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Lys Ile Met Asn Phe Ile Ser Thr Asn His Gln Thr Glu115
120 125Lys Pro Leu His His Leu Glu Phe Val Asn Cys Pro
Ser Ser Asn Tyr130 135 140Ser Tyr Pro Thr
Ser Ser Ser Val Thr Thr Val Asp Asn His Pro Val145 150
155 160Leu Pro Ser Asp Asp His Gly Tyr Ile
Asn Val Asp Met Glu Thr Tyr165 170 175Arg
Val Lys Asp Asn Ser Thr Leu Phe Met Leu Glu Gly Asp Ala Gln180
185 190Ala Ala Asp Cys Ser Ser Asn Ser Asp Gly Arg
Trp His His Val Tyr195 200 205Gly Ser Gly
Ala Phe Asp Ala Arg Asn Asn Met Gly Gly Leu Lys Thr210
215 220Ser Asn Ser Tyr Met Gly Val Asp Gly Asn Tyr Cys
Glu Lys Ala Arg225 230 235
240Gly Cys Tyr Gly Asp Ser Thr Met Glu Phe245
250155893DNALactuca serriola 155ggaagagctc catgttgtga ccagtcaaag
gtgaagaaag ggccatggtc acctcaagaa 60gatacaaagt tgaaagacta catacacaaa
aatggtactg gaggcaactg gattgctctt 120ccacataaag caggccttaa aagatgtggg
aaaagctgcc gattaagatg gttgaactac 180cttagaccag acatcaaaca tggtgaattc
tccgaccatg aagatagact catctatact 240ctcttttcta gcatcggtag caggtggtca
gtaatagcag cacagttacc aggaagaaca 300gataatgata tcaagaacta ctggaacacg
aagctcaaga agaagattat gaatttcatc 360tccaccaatc atcaaactga gaaaccactt
catcatcttg agtttgtcaa ttgtccttct 420tcaaactata gctacccgac aagctcttcg
gttacaactg ttgataatca tccagttcta 480ccatcagatg atcatggata cataaatgtg
gacatggaaa cctaccgagt aaaagacaat 540tctacccttt ttatgcttga aggtgatgct
caagctgctg attgcagttc taattctgat 600ggaaggtggc atcatgtgta tggtagtggt
gcgtttgatg cgcgtaataa catgggaggc 660ttgaagacaa gtaattccta catgggagtt
gatgggaatt attgtgaaaa ggcaagaggg 720tgttatgggg attctactat ggagtttagt
cttgaggagt tcaagaagct cattagcact 780aatctttgca acagtaacac taacaataat
ctcaatgtct ttgttgatga actcaaggca 840gaagagaaca ttatgtacta ctaaactatt
ttgatttatt ttggtgtaaa aaa 893156702DNALactuca
virosamisc_feature(566)..(566)n is any nucleotide 156atgggaagag
caccttgctg tgacaaagcc aacgtcaaaa gaggaccttg gtcacctgaa 60gaagatgcca
aactcaagtc ttacattgaa gaacatggca ccggaggcaa ctggattgct 120ctacctcaca
aaatcggact tagatggtta aattatcttc gcccaaacat aaagcatgga 180tctttctccg
aagaagaaga tcacatcatt tgcaccccat atcttagtat tggaagccgg 240tggtctatca
tcgcagcaca attacctgga agaactgata atgatataaa aaactactgg 300aacacgaggc
taaagaaaaa gcttttggga aagcagcgta aagatcaaca atcgtcaaag 360agaggtgcgc
aagtcaagca agaaatgaag agagaggtgt ttgaggagtt aaaggcacca 420tctttatgca
tgagcatgaa tctttatcaa tcatactggc cttcagaatt ccctttgatt 480ctcacaaacc
ctgatcaact tcatcaagaa cttcatgtca aaaaccaagt tcccaatttc 540aaccaatacc
cttttgacac aacctncaac caagccgaaa tgttccacca gaactgtgaa 600aaacctatct
ctactaccgc ttatcatccc cagttggcaa acacacacta ttatccaaat 660ggggatggaa
taaacctgtt tcaagaatta acattacccg ta
702157234PRTLactuca virosaMISC_FEATURE(189)..(189)X is any amino acid
157Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Ser Tyr Ile Glu Glu His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Arg35
40 45Trp Leu Asn Tyr Leu Arg Pro Asn Ile
Lys His Gly Ser Phe Ser Glu50 55 60Glu
Glu Asp His Ile Ile Cys Thr Pro Tyr Leu Ser Ile Gly Ser Arg65
70 75 80Trp Ser Ile Ile Ala Ala
Gln Leu Pro Gly Arg Thr Asp Asn Asp Ile85 90
95Lys Asn Tyr Trp Asn Thr Arg Leu Lys Lys Lys Leu Leu Gly Lys Gln100
105 110Arg Lys Asp Gln Gln Ser Ser Lys
Arg Gly Ala Gln Val Lys Gln Glu115 120
125Met Lys Arg Glu Val Phe Glu Glu Leu Lys Ala Pro Ser Leu Cys Met130
135 140Ser Met Asn Leu Tyr Gln Ser Tyr Trp
Pro Ser Glu Phe Pro Leu Ile145 150 155
160Leu Thr Asn Pro Asp Gln Leu His Gln Glu Leu His Val Lys
Asn Gln165 170 175Val Pro Asn Phe Asn Gln
Tyr Pro Phe Asp Thr Thr Xaa Asn Gln Ala180 185
190Glu Met Phe His Gln Asn Cys Glu Lys Pro Ile Ser Thr Thr Ala
Tyr195 200 205His Pro Gln Leu Ala Asn Thr
His Tyr Tyr Pro Asn Gly Asp Gly Ile210 215
220Asn Leu Phe Gln Glu Leu Thr Leu Pro Val225
230158867DNALactuca virosa 158atgggaagag ctccatgttg tgacaagtca aaggtgaaga
aagggccatg gtcacctcaa 60gaagatacaa agttgaaaga ctacatacac aaaaatggta
ccggaggcaa ctggattgct 120cttccacata aagcaggtct taaaagatgt gggaaaagct
gccgattaag atggttgaac 180tacctaagac cagacatcaa acatggtgaa ttctccgacc
atgaagatag actcatctat 240actctctttt ctagcatcgg tagcaggtgg tcagtaatag
cagcacagtt accaggaaga 300acagataatg atatcaagaa ctactggaac acgaagctca
agaagaagat tatgaatttc 360atctccacca atcatcaaac tgagaaaccg cttcatcatc
ttgagtctgt caattgtcct 420tcttcaaact atagctaccc aacaagctct tcggttacaa
ctgttgataa tcatccagtt 480ctaccatcag atgatcatgg atacataaat gtggacatgg
aaacctaccg agtaaaagac 540aattctaccc tttttatgct tgaaggtgat gctcaagctg
ctgatgattg cagttctaat 600tctgatggaa ggtggcatca tgtgtatggt agtggtgcgt
ttgatgctca taataacatg 660ggaggcttga agacaagtaa ttcctacatg ggagttgatg
ggaattattg tgaaaaagca 720agagggtgtt atggggattc tactatggag tttagtcttg
aggagttcaa gaagctcatt 780agcactaatc tttgcagcag taacactaac aataaatctc
atgtctttgt tgatgaaact 840caggcagaag agaacattat gtactac
867159289PRTLactuca virosa 159Met Gly Arg Ala Pro
Cys Cys Asp Lys Ser Lys Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Gln Glu Asp Thr Lys Leu Lys Asp
Tyr Ile His Lys Asn20 25 30Gly Thr Gly
Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr
Leu Arg Pro50 55 60Asp Ile Lys His Gly
Glu Phe Ser Asp His Glu Asp Arg Leu Ile Tyr65 70
75 80Thr Leu Phe Ser Ser Ile Gly Ser Arg Trp
Ser Val Ile Ala Ala Gln85 90 95Leu Pro
Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Ile Met Asn Phe Ile Ser Thr Asn
His Gln Thr Glu115 120 125Lys Pro Leu His
His Leu Glu Ser Val Asn Cys Pro Ser Ser Asn Tyr130 135
140Ser Tyr Pro Thr Ser Ser Ser Val Thr Thr Val Asp Asn His
Pro Val145 150 155 160Leu
Pro Ser Asp Asp His Gly Tyr Ile Asn Val Asp Met Glu Thr Tyr165
170 175Arg Val Lys Asp Asn Ser Thr Leu Phe Met Leu
Glu Gly Asp Ala Gln180 185 190Ala Ala Asp
Asp Cys Ser Ser Asn Ser Asp Gly Arg Trp His His Val195
200 205Tyr Gly Ser Gly Ala Phe Asp Ala His Asn Asn Met
Gly Gly Leu Lys210 215 220Thr Ser Asn Ser
Tyr Met Gly Val Asp Gly Asn Tyr Cys Glu Lys Ala225 230
235 240Arg Gly Cys Tyr Gly Asp Ser Thr Met
Glu Phe Ser Leu Glu Glu Phe245 250 255Lys
Lys Leu Ile Ser Thr Asn Leu Cys Ser Ser Asn Thr Asn Asn Lys260
265 270Ser His Val Phe Val Asp Glu Thr Gln Ala Glu
Glu Asn Ile Met Tyr275 280
285Tyr160392DNALiriodendron tulipifera 160atggggagag ctccttgctg
cgacaaggcc aacgtgaaaa gagggccttg gtctcccgaa 60gaagacacag ccctcaaaaa
ctacgtcgag aaacatggaa gtggtgggaa ttggattgct 120ttaccccaca aagcaggcct
caaacgttgc ggcaagagtt gccgcctgcg gtggcttaat 180tatcttagac cagacatcaa
acatggaggt tttaccgacg aagaagataa catcatatgc 240gctctctaca aaaacattgg
gagcaaatgg tctatcatag cttctcatct gccagggaga 300acagacaatg atgtgaagaa
ctactggaat actaagctga aaaagaagct attggcagga 360aatacaaatc tcacaaggga
tgcttctatt ac 392161131PRTLiriodendron
tulipifera 161Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Arg Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Thr Ala Leu Lys Asn Tyr Val Glu Lys His20 25
30Gly Ser Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ala
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asp Ile Lys His Gly Gly Phe Thr Asp Glu Glu Asp Asn Ile Ile
Cys65 70 75 80Ala Leu
Tyr Lys Asn Ile Gly Ser Lys Trp Ser Ile Ile Ala Ser His85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr
Trp Asn Thr Lys100 105 110Leu Lys Lys Lys
Leu Leu Ala Gly Asn Thr Asn Leu Thr Arg Asp Ala115 120
125Ser Ile Thr130162563DNAMalus domestica 162atggggagag
ctccttgctg tgacaaggca aatgtaaaga aaggaccatg gtcacctgaa 60gaagatgcaa
agctgaaaga gtacatagaa aaatatggga ctggagggaa ttggattgct 120cttccacaga
aagctggtct taggagatgt gggaagagct gcagactaag atggcttaac 180tatctgaggc
ccaacattaa acatggtgaa ttttctgatg aagaagatag aattatctgc 240aacctcttta
ctaacattgg aagcaggtgg tcaataatag cagctcagtt gccaggcagg 300actgacaatg
atatcaaaaa ctactggaac accaagcaaa aaaagaagct catgggcata 360agcattctcc
catcccagct gctaaaatct cacccatctc ttctccttca aacttcatcc 420acatcctctt
cgccgttatc ataccgagga agcaacacca gcactacatt ttacacacaa 480accaggtctt
tcaccggcac tttggagccc atttcttttt cacaaagtct aatgagcagc 540agttccatta
attctgctcc ttc
563163188PRTMalus domestica 163Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Ile Glu Lys Tyr20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Arg35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Glu Glu
Asp Arg Ile Ile Cys65 70 75
80Asn Leu Phe Thr Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Gln
Lys Lys Lys Leu Met Gly Ile Ser Ile Leu Pro Ser Gln Leu Leu115
120 125Lys Ser His Pro Ser Leu Leu Leu Gln Thr Ser
Ser Thr Ser Ser Ser130 135 140Pro Leu Ser
Tyr Arg Gly Ser Asn Thr Ser Thr Thr Phe Tyr Thr Gln145
150 155 160Thr Arg Ser Phe Thr Gly Thr
Leu Glu Pro Ile Ser Phe Ser Gln Ser165 170
175Leu Met Ser Ser Ser Ser Ile Asn Ser Ala Pro Ser180
1851641044DNAMalus domestica 164atggggaggg ctccttgttg tgacaaagca
aacgtgaaga agggaccatg gtcaccagaa 60gaagattcaa agctaaaaga gtacatagag
aagtatggga ctggtgggaa ttggattgct 120ctcccacaga aagctggtct gaagagatgt
gggaaaagct gcagattgag atggcttaac 180tatctgaggc caaacatcaa acatggagaa
ttctctgatg aggaagacag gataatatgc 240agcctctttg ctagtattgg aagcaggtgg
tcagttatag ctgctcagct gccaggcagg 300actgacaacg atatcaagaa ctactggaac
accaagctca agaagaagct catggggatg 360catatgggtc ctcctcgacc tcacaaccac
cataaaaatt tactaaagcc tcccccattt 420ccttcttcct ctcatcacaa ttaccaaaac
caaccattaa ttccatctga acctctatcg 480tcgctgtaca aagacctaag caattacaac
agatctttct tagggtttga agcgccagtg 540ccattgccac cacaagtttc gctgacgtcc
aataatttct caaacatttc caccaactct 600tctatttttc aaaccctaaa ttacccagct
ggagtgaagg aaaataacaa taataatacc 660ctcctcgtgt ttggaagtga agggagttgc
agtacttcgt ctgatggaag ctgtaataat 720cagatcagct atgactactg cagcagatca
gagattaata acatcaacca agaagaaatg 780ggttttgatc atcagggctt catgatgctt
aactatggca accaatggac cgaaaggcca 840aatgggtttt attttggatc agaaaacaac
acattagaat ttactgatct ggacgaagat 900gttaagcagc agctgattag tactagaagt
aaaaataatt ataataataa tactattata 960aatgagtcct ctaagtctaa cagcttttta
ttcaatgtta atgaaagcaa gacagaagat 1020gatgagaagg tcatgtattt ctac
1044165348PRTMalus domestica 165Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ser Lys
Leu Lys Glu Tyr Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Glu Phe Ser Asp Glu Glu Asp Arg Ile Ile Cys65
70 75 80Ser Leu Phe Ala Ser Ile Gly
Ser Arg Trp Ser Val Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Met Gly Met His
Met Gly Pro Pro Arg Pro His115 120 125Asn
His His Lys Asn Leu Leu Lys Pro Pro Pro Phe Pro Ser Ser Ser130
135 140His His Asn Tyr Gln Asn Gln Pro Leu Ile Pro
Ser Glu Pro Leu Ser145 150 155
160Ser Leu Tyr Lys Asp Leu Ser Asn Tyr Asn Arg Ser Phe Leu Gly
Phe165 170 175Glu Ala Pro Val Pro Leu Pro
Pro Gln Val Ser Leu Thr Ser Asn Asn180 185
190Phe Ser Asn Ile Ser Thr Asn Ser Ser Ile Phe Gln Thr Leu Asn Tyr195
200 205Pro Ala Gly Val Lys Glu Asn Asn Asn
Asn Asn Thr Leu Leu Val Phe210 215 220Gly
Ser Glu Gly Ser Cys Ser Thr Ser Ser Asp Gly Ser Cys Asn Asn225
230 235 240Gln Ile Ser Tyr Asp Tyr
Cys Ser Arg Ser Glu Ile Asn Asn Ile Asn245 250
255Gln Glu Glu Met Gly Phe Asp His Gln Gly Phe Met Met Leu Asn
Tyr260 265 270Gly Asn Gln Trp Thr Glu Arg
Pro Asn Gly Phe Tyr Phe Gly Ser Glu275 280
285Asn Asn Thr Leu Glu Phe Thr Asp Leu Asp Glu Asp Val Lys Gln Gln290
295 300Leu Ile Ser Thr Arg Ser Lys Asn Asn
Tyr Asn Asn Asn Thr Ile Ile305 310 315
320Asn Glu Ser Ser Lys Ser Asn Ser Phe Leu Phe Asn Val Asn
Glu Ser325 330 335Lys Thr Glu Asp Asp Glu
Lys Val Met Tyr Phe Tyr340 345166596DNAManihot esculenta
166atgggaaggg ctccttgctg tgataaagct aatgtgaaga aaggcccatg gtcaccagaa
60gaagatgcaa agcttaaaga ctatatagag aaacaaggga ctgtaggaaa ttggattgct
120ctccctcaaa aggctggtct caaaagatgt gggaaaagct gcagattaag atggctaaat
180tatctgagac ccaacatcaa gcatggagat ttttctgatg atgaagataa aataatctgt
240aaactctatt ccaacattgg gagcaggtgg tcaataatag cagctcagtt gccaggcagg
300actgataatg atatcaaaaa ctattggaac acaaaactca agaagaagct tatggggatg
360atgatgattc atccttctca aacaaaatta ccccaccaat taactacaaa gtttgcttct
420cttctttgtc aggcttcatc atcatcatca tcaataccat catcaccatc tactgcaata
480tcttcaccat catcttcata tgccctagct aggtctttca ctgaacctat tccattttca
540agtaacaatt ccttcactgc tgctaataaa tccatcctcc catcccaaga aagctc
596167199PRTManihot esculenta 167Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Asp Tyr Ile Glu Lys Gln20
25 30Gly Thr Val Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Asp Phe Ser Asp Asp Glu
Asp Lys Ile Ile Cys65 70 75
80Lys Leu Tyr Ser Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Met Gly Met Met Met Ile His Pro Ser Gln Thr115
120 125Lys Leu Pro His Gln Leu Thr Thr Lys Phe Ala
Ser Leu Leu Cys Gln130 135 140Ala Ser Ser
Ser Ser Ser Ser Ile Pro Ser Ser Pro Ser Thr Ala Ile145
150 155 160Ser Ser Pro Ser Ser Ser Tyr
Ala Leu Ala Arg Ser Phe Thr Glu Pro165 170
175Ile Pro Phe Ser Ser Asn Asn Ser Phe Thr Ala Ala Asn Lys Ser Ile180
185 190Leu Pro Ser Gln Glu Ser
Ser195168543DNAMarchantia polymorphamisc_feature(421)..(421)n is any
nucleotide 168atgggtcgag ctccatgttg tgacaaggcc aatgtgaaaa agggtccttg
gtcgcccgaa 60gaggatgcca agctcaagtc tttcatcgaa cacaatggca caggtggcaa
ttggatcacg 120ctgccgagca aagcaggtct gaagcgctgc ggaaagagct gcagactccg
ctggatcaat 180tatttgcgtc ccgacatcaa acatggaagc ttcactgaag aagaagaaaa
gacaatttat 240cgcctccacg ctcaaattgg cagcagatgg tccttgatcg ctgctcagct
gcctgggaga 300accgataacg atatcaagaa ttactggaac actcggctga agaagaagct
cctggagaga 360gctataatat ggtgggtgcg gacgtccgca ccactctttc cagatttatc
ccatgaatca 420nccggccaac gatcanatgt caaatgccgt ggggagagaa ggccttctga
attctcantt 480tctgcatgct cangcttatt atcantacnt tcancaacaa cnnccgcant
attatcttcc 540nca
543169181PRTMarchantia polymorphaMISC_FEATURE(141)..(141)X is
any amino acid 169Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys
Gly Pro1 5 10 15Trp Ser
Pro Glu Glu Asp Ala Lys Leu Lys Ser Phe Ile Glu His Asn20
25 30Gly Thr Gly Gly Asn Trp Ile Thr Leu Pro Ser Lys
Ala Gly Leu Lys35 40 45Arg Cys Gly Lys
Ser Cys Arg Leu Arg Trp Ile Asn Tyr Leu Arg Pro50 55
60Asp Ile Lys His Gly Ser Phe Thr Glu Glu Glu Glu Lys Thr
Ile Tyr65 70 75 80Arg
Leu His Ala Gln Ile Gly Ser Arg Trp Ser Leu Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn
Tyr Trp Asn Thr Arg100 105 110Leu Lys Lys
Lys Leu Leu Glu Arg Ala Ile Ile Trp Trp Val Arg Thr115
120 125Ser Ala Pro Leu Phe Pro Asp Leu Ser His Glu Ser
Xaa Gly Gln Arg130 135 140Ser Xaa Val Lys
Cys Arg Gly Glu Arg Arg Pro Ser Glu Phe Ser Xaa145 150
155 160Ser Ala Cys Ser Xaa Leu Leu Ser Xaa
Xaa Ser Xaa Thr Thr Xaa Ala165 170 175Xaa
Leu Ser Ser Xaa1801701050DNAMedicago truncatula 170atgggtagag ctccatgctg
tgacaaggct aacgtgaaga aaggaccttg gtctcctgaa 60gaagatgcta cactcaaatc
ttacattgaa acaaatggaa ctggaggaaa ttggattgct 120cttcctcaaa aaattgggct
caagagatgt ggaaagagtt gcagacttag gtggttaaat 180tacttgagac ctaatatcaa
acatggtgga tttactgaag aagaagacaa catcatttgc 240agcctttaca taagcattgg
aagcaggtgg tccattattg ctgctcagtt acctggaaga 300acagacaatg acataaaaaa
ctattggaac acaagattga agaagaaatt actcggaaag 360cgaaaacagt cgaatataaa
caactgcttg atgaaccaaa aggacacaaa tggaatagat 420gagaattctt attcaaattc
attaagtagc tcagctcttg agagacttca acttcatatg 480caacttcaaa gccttcaaaa
ccctttgtct ttttacaata ataaccctgc tgcactagtt 540tggccaaagt tgcatccttc
tcaagaaaaa atgatccaaa ttagccttca aaactctaat 600aacaacccta tgatgcaaaa
tgctttctct tcaccacagg ttgatctttt ggagaatatt 660attcctttgg agaataataa
taacaattca gttaccttca atgcttctgg aaatagtagt 720aataataata attcaatcat
gcattcaagt gttgcaccaa gaggagaagc tgttgagaag 780agtactaaca atgaaggaat
tcaggaactg gaaagtgaac tagatgaaat tctcaacaac 840agaaatataa ttactatgga
agatgaatat cgtgtggctg aatttgattg tttcagagat 900atgaataata atggttcaaa
ggatcaaaac ttgatatggt ggtcaaatga ttcaggtgat 960actaaatcag gatcctcaaa
ctcatgggat tcaacaacta atcttatgca agaagggatg 1020ttccaagatt atgaactagg
ttatggtctg 1050171350PRTMedicago
truncatula 171Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Ala Thr Leu Lys Ser Tyr Ile Glu Thr Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Gly Phe Thr Glu Glu Glu Asp Asn Ile Ile
Cys65 70 75 80Ser Leu
Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Lys Lys
Leu Leu Gly Lys Arg Lys Gln Ser Asn Ile Asn Asn115 120
125Cys Leu Met Asn Gln Lys Asp Thr Asn Gly Ile Asp Glu Asn
Ser Tyr130 135 140Ser Asn Ser Leu Ser Ser
Ser Ala Leu Glu Arg Leu Gln Leu His Met145 150
155 160Gln Leu Gln Ser Leu Gln Asn Pro Leu Ser Phe
Tyr Asn Asn Asn Pro165 170 175Ala Ala Leu
Val Trp Pro Lys Leu His Pro Ser Gln Glu Lys Met Ile180
185 190Gln Ile Ser Leu Gln Asn Ser Asn Asn Asn Pro Met
Met Gln Asn Ala195 200 205Phe Ser Ser Pro
Gln Val Asp Leu Leu Glu Asn Ile Ile Pro Leu Glu210 215
220Asn Asn Asn Asn Asn Ser Val Thr Phe Asn Ala Ser Gly Asn
Ser Ser225 230 235 240Asn
Asn Asn Asn Ser Ile Met His Ser Ser Val Ala Pro Arg Gly Glu245
250 255Ala Val Glu Lys Ser Thr Asn Asn Glu Gly Ile
Gln Glu Leu Glu Ser260 265 270Glu Leu Asp
Glu Ile Leu Asn Asn Arg Asn Ile Ile Thr Met Glu Asp275
280 285Glu Tyr Arg Val Ala Glu Phe Asp Cys Phe Arg Asp
Met Asn Asn Asn290 295 300Gly Ser Lys Asp
Gln Asn Leu Ile Trp Trp Ser Asn Asp Ser Gly Asp305 310
315 320Thr Lys Ser Gly Ser Ser Asn Ser Trp
Asp Ser Thr Thr Asn Leu Met325 330 335Gln
Glu Gly Met Phe Gln Asp Tyr Glu Leu Gly Tyr Gly Leu340
345 350172576DNAMedicago truncatula 172atgggtagag
ctccttgttg tgacaaagca aacgtgaaga aaggtccatg gtctccagaa 60gaagattcta
aactcaaatc ttacatagaa caacatggta ctggtggtaa ctggattgct 120ctcccacaaa
aaattggctt gaagcgttgt ggaaagagct gtcgtctccg gtggctaaac 180tatcttcgcc
ctaatctcaa acatggtggt ttctccgaag aagaagataa cattatttgc 240agcctttaca
ttagtattgg aagcaggtgg tcgataattg cagctcaatt gccaggaaga 300actgataatg
acataaagaa ctattggaat acaaggttga aaaagaagct tttggggaaa 360caccgtaaag
atcaacaaca acaagcacgt aatagaggaa ataatggtgc tattgttaag 420caagaaagca
ataataatag agtgatgagt agtaatgaat tttccttatc aaatttggtt 480caagaacaac
catatttacc acatgtcatg caacctttgc tatcaacacc ccaacctcca 540ccaccatcaa
tgttgtcata cacaaaccaa caaggc
576173192PRTMedicago truncatula 173Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ser Lys Leu Lys Ser Tyr Ile Glu Gln
His20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Leu Lys His Gly Gly Phe Ser Glu Glu
Glu Asp Asn Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Lys His Arg Lys Asp Gln Gln Gln Gln115
120 125Ala Arg Asn Arg Gly Asn Asn Gly Ala
Ile Val Lys Gln Glu Ser Asn130 135 140Asn
Asn Arg Val Met Ser Ser Asn Glu Phe Ser Leu Ser Asn Leu Val145
150 155 160Gln Glu Gln Pro Tyr Leu
Pro His Val Met Gln Pro Leu Leu Ser Thr165 170
175Pro Gln Pro Pro Pro Pro Ser Met Leu Ser Tyr Thr Asn Gln Gln
Gly180 185 190174596DNANuphar advena
174atgggaaggt ctccgtgctg tgacaaggcc aacgttaaga gagggccatg gtcgcctgag
60gaggacgcca ccctcaagaa ctacgttgag aggtttggca ccggaggcaa ttggattgca
120ttgcctcaga aagcagggct caagcgttgt ggaaagagct gccgtttgcg ttggcttaac
180taccttagac ctgatattag gcacggtggt tttactgagg aagaggatac catcattttc
240tctctctatg agagcatggg cagcaggtgg tctgtcatag catcacagtt accaggaaga
300accgacaatg atgtgaagaa ttactggaac accaagttga agaagaaaat gcttgcagca
360aaagccaatc ttgacagtga tgcccatatt cacatgaatt cagtttcaat ctcatcatca
420tcttcacctt caccttcacc ttcctcttca tcaccatcca cttccactcc tacccattta
480accattgcat caaaaagtgt gggccctgcc tattacttct ccaaactcag ctgcaccaaa
540tcccaagatg aaggcatggt gatgcctcca atggaagttg gctttcagta cactct
596175199PRTNuphar advena 175Met Gly Arg Ser Pro Cys Cys Asp Lys Ala Asn
Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Thr Leu Lys Asn Tyr Val Glu Arg Phe20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ala Gly Leu Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asp Ile Arg His Gly Gly Phe Thr Glu Glu Glu Asp
Thr Ile Ile Phe65 70 75
80Ser Leu Tyr Glu Ser Met Gly Ser Arg Trp Ser Val Ile Ala Ser Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Lys Met Leu Ala Ala Lys Ala Asn Leu Asp Ser Asp Ala115
120 125His Ile His Met Asn Ser Val Ser Ile Ser Ser Ser
Ser Ser Pro Ser130 135 140Pro Ser Pro Ser
Ser Ser Ser Pro Ser Thr Ser Thr Pro Thr His Leu145 150
155 160Thr Ile Ala Ser Lys Ser Val Gly Pro
Ala Tyr Tyr Phe Ser Lys Leu165 170 175Ser
Cys Thr Lys Ser Gln Asp Glu Gly Met Val Met Pro Pro Met Glu180
185 190Val Gly Phe Gln Tyr Thr Leu195176795DNAOryza
sativa 176atggggagga cgccgtgctg cgacagggag gcggtgaaga ggggcccgtg
gtcgccggag 60gaggacgacg cgctgcgcga ctacatcaac cgccacggca ccgccggcaa
ctggatctcc 120ctccccaaca aggccgggtt gaggaggtgc ggcaagagct gcaggctgcg
gtggctcaac 180tacctccgcc ccgacatccg ccatggcgcc ttcaccgacg aggaggacgc
catcatcacc 240tccctctact ccaagctcgg cagcaagtgg tcgaccatcg cggcgcagct
ggagaggagg 300acggacaacg acgtcaagaa ccactggaac accaagctca agcgccgcct
cgccgccgcc 360gccgcctgca cgcccttact gccgctcccg gcgccgccgc ccctcgccgc
cacgcacacg 420tcgccgtcgt cgtcgctgct gctcctcccg ccgctcgccg taccgaccgt
caagaccgag 480gcgtacacct gcgacgactt cctgcagcag ctgctgccga ccgccaccgc
cgccacggcg 540ctccgggatc ccttcgccga cggcgccgcc acggacggcg gctcgacgtc
cgcctccgcc 600gcgtcgtcgg ggtccaactg gtcggcggac accggcgtcg tcgtcgtcgg
tggcggcggc 660ggcggcgggc tgttcccgga attctgcatg agctccgacg acctcgccgg
cgccgccacg 720gcggaggacg accacttcat cggcggcggc tactactacc ctctcgatcc
gagcttgtca 780tcatcactag tgtag
795177264PRTOryza sativa 177Met Gly Arg Thr Pro Cys Cys Asp
Arg Glu Ala Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Asp Ala Leu Arg Asp Tyr Ile Asn
Arg His20 25 30Gly Thr Ala Gly Asn Trp
Ile Ser Leu Pro Asn Lys Ala Gly Leu Arg35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asp Ile Arg His Gly Ala Phe Thr Asp
Glu Glu Asp Ala Ile Ile Thr65 70 75
80Ser Leu Tyr Ser Lys Leu Gly Ser Lys Trp Ser Thr Ile Ala
Ala Gln85 90 95Leu Glu Arg Arg Thr Asp
Asn Asp Val Lys Asn His Trp Asn Thr Lys100 105
110Leu Lys Arg Arg Leu Ala Ala Ala Ala Ala Cys Thr Pro Leu Leu
Pro115 120 125Leu Pro Ala Pro Pro Pro Leu
Ala Ala Thr His Thr Ser Pro Ser Ser130 135
140Ser Leu Leu Leu Leu Pro Pro Leu Ala Val Pro Thr Val Lys Thr Glu145
150 155 160Ala Tyr Thr Cys
Asp Asp Phe Leu Gln Gln Leu Leu Pro Thr Ala Thr165 170
175Ala Ala Thr Ala Leu Arg Asp Pro Phe Ala Asp Gly Ala Ala
Thr Asp180 185 190Gly Gly Ser Thr Ser Ala
Ser Ala Ala Ser Ser Gly Ser Asn Trp Ser195 200
205Ala Asp Thr Gly Val Val Val Val Gly Gly Gly Gly Gly Gly Gly
Leu210 215 220Phe Pro Glu Phe Cys Met Ser
Ser Asp Asp Leu Ala Gly Ala Ala Thr225 230
235 240Ala Glu Asp Asp His Phe Ile Gly Gly Gly Tyr Tyr
Tyr Pro Leu Asp245 250 255Pro Ser Leu Ser
Ser Ser Leu Val260178957DNAOryza sativa 178atggggaggg cgccgtgctg
cgacaaggcg agcgtgaaga gggggccgtg gtcgccggag 60gaggacgagc tgctgcggag
ctacgtccgc agccacggca ccggtggcaa ctggatcgcg 120ctcccgcaga aagcagggct
gaaccggtgc gggaagagct gtaggctgcg gtggctcaac 180tacctccgcc cggacatcaa
gcacggcggc tacaccgacc aggaggaccg gatcatctgt 240tccctctaca actccatcgg
aagcaggtgg tccatcatcg cgtcgaagct gcccggccgg 300acggacaacg acgtcaagaa
ttactggaat accaagctca agaagaaggc catggccatg 360catcatcatc atcagccgcc
gccgccgcag cagcaacact accaccacca ccaccaccac 420cgtgtcgccg gcggtggcgc
gcgcgtcacg ctcgtgtcgc ctccgcccgc cccgcagagc 480caatgcgcgt ccatgcagcc
gtcgccggcg tccgcctcct cgtccggcgg cgacgcgtgc 540agcttcggcg ccgccgccat
gtactccccc tccccgtcaa cccagcaggc gccacaggcg 600gcgacgctcg cggtcgcggg
gtacacctcc gtggcgacgg cggcggcggc ggcggcggtg 660gcggcgcagc gctcgccgct
cgacgagctg atctgccagg tgccaccacc tcccactact 720accgccgccg actgctgggc
cagcggcgtg accctcgacg acgtgttctt gcccgagctc 780gtcggagccg gcgagttccc
caacggcgac ctcttcggcg ggttcggccc gctgctccag 840gacaggtcgt ccatggagct
ctccgcgtgc tacttcccca acgccgcggc ggcggagatg 900tggccggcgg ccacggacat
cgtcaagccg gccgggctgt gccacagcct gacatga 957179318PRTOryza sativa
179Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Ser Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Glu Leu Leu Arg Ser Tyr Val Arg Ser His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Asn35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asp
Ile Lys His Gly Gly Tyr Thr Asp Gln Glu Asp Arg Ile Ile Cys65
70 75 80Ser Leu Tyr Asn Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ser Lys85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Ala Met Ala Met
His His His His Gln Pro Pro Pro115 120
125Pro Gln Gln Gln His Tyr His His His His His His Arg Val Ala Gly130
135 140Gly Gly Ala Arg Val Thr Leu Val Ser
Pro Pro Pro Ala Pro Gln Ser145 150 155
160Gln Cys Ala Ser Met Gln Pro Ser Pro Ala Ser Ala Ser Ser
Ser Gly165 170 175Gly Asp Ala Cys Ser Phe
Gly Ala Ala Ala Met Tyr Ser Pro Ser Pro180 185
190Ser Thr Gln Gln Ala Pro Gln Ala Ala Thr Leu Ala Val Ala Gly
Tyr195 200 205Thr Ser Val Ala Thr Ala Ala
Ala Ala Ala Ala Val Ala Ala Gln Arg210 215
220Ser Pro Leu Asp Glu Leu Ile Cys Gln Val Pro Pro Pro Pro Thr Thr225
230 235 240Thr Ala Ala Asp
Cys Trp Ala Ser Gly Val Thr Leu Asp Asp Val Phe245 250
255Leu Pro Glu Leu Val Gly Ala Gly Glu Phe Pro Asn Gly Asp
Leu Phe260 265 270Gly Gly Phe Gly Pro Leu
Leu Gln Asp Arg Ser Ser Met Glu Leu Ser275 280
285Ala Cys Tyr Phe Pro Asn Ala Ala Ala Ala Glu Met Trp Pro Ala
Ala290 295 300Thr Asp Ile Val Lys Pro Ala
Gly Leu Cys His Ser Leu Thr305 310
315180897DNAOryza sativa 180atggggaggt cgccgtgctg cgacaaggcg agcgtgaagc
gggggccgtg gtcggaggag 60gaggacgcca tactcaggag cttcgtcgag aggttcggca
atgccggcaa ctggatcgcg 120ctgccccaca aagcagggct taaacggtgc ggcaagagct
gccgcctccg gtggctcaac 180tacctccgcc cggcgatccg gcacggcggc ttcaccgacg
aggaggacaa cctcatcctg 240tcgctctacg gcgaaatggg aagcaagtgg tcggtgatcg
cgtccaagct ccccggccgg 300acggacaacg acgtcaagaa ctactggaac accaagctca
agaagaggta cttggccgcg 360gccgcaacag aagcaaccac tcctcctcct cctgccgccg
gcgacgatga caacaaccca 420acgacacaag cctccagcca acccgctcct cctactcctc
cggcgcccct cgtcaacctc 480gacgcggccg gcctcgacgg cgccgtgggc gacaacgacg
agctcctgct gcacaagtcg 540gagcagctgt acgccgagct gatgggcctc atcgagcagc
agcagtactc cacgatcacc 600gccgccgccg tcgacgcggc gacgacgacg acgtcgtggt
cgtcgccgtc gacgggaacg 660acaagtccga ccgctagcag cagtactgac ggcagcagca
gcagcagcaa cctgccgtgg 720ccggccgtgg acgtgcacga cagtacgatg atgccgccgt
tgtcggagtc cagcggcagc 780agcagcggct tgttcttcgg ctctcacgcg ttcggtagtg
gctcgttcca agacctgctc 840ggctctgcag cttccttcga cgatgtcatg ctgtcgcaag
agatgctgta ctactag 897181298PRTOryza sativa 181Met Gly Arg Ser Pro
Cys Cys Asp Lys Ala Ser Val Lys Arg Gly Pro1 5
10 15Trp Ser Glu Glu Glu Asp Ala Ile Leu Arg Ser
Phe Val Glu Arg Phe20 25 30Gly Asn Ala
Gly Asn Trp Ile Ala Leu Pro His Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr
Leu Arg Pro50 55 60Ala Ile Arg His Gly
Gly Phe Thr Asp Glu Glu Asp Asn Leu Ile Leu65 70
75 80Ser Leu Tyr Gly Glu Met Gly Ser Lys Trp
Ser Val Ile Ala Ser Lys85 90 95Leu Pro
Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Arg Tyr Leu Ala Ala Ala Ala Thr Glu
Ala Thr Thr Pro115 120 125Pro Pro Pro Ala
Ala Gly Asp Asp Asp Asn Asn Pro Thr Thr Gln Ala130 135
140Ser Ser Gln Pro Ala Pro Pro Thr Pro Pro Ala Pro Leu Val
Asn Leu145 150 155 160Asp
Ala Ala Gly Leu Asp Gly Ala Val Gly Asp Asn Asp Glu Leu Leu165
170 175Leu His Lys Ser Glu Gln Leu Tyr Ala Glu Leu
Met Gly Leu Ile Glu180 185 190Gln Gln Gln
Tyr Ser Thr Ile Thr Ala Ala Ala Val Asp Ala Ala Thr195
200 205Thr Thr Thr Ser Trp Ser Ser Pro Ser Thr Gly Thr
Thr Ser Pro Thr210 215 220Ala Ser Ser Ser
Thr Asp Gly Ser Ser Ser Ser Ser Asn Leu Pro Trp225 230
235 240Pro Ala Val Asp Val His Asp Ser Thr
Met Met Pro Pro Leu Ser Glu245 250 255Ser
Ser Gly Ser Ser Ser Gly Leu Phe Phe Gly Ser His Ala Phe Gly260
265 270Ser Gly Ser Phe Gln Asp Leu Leu Gly Ser Ala
Ala Ser Phe Asp Asp275 280 285Val Met Leu
Ser Gln Glu Met Leu Tyr Tyr290 2951821692DNAOryza sativa
182atggcggcgg cggcggctcg cgatgacgac ctcggcaacg acgacctcga cgatgaccgg
60tcaaatccac gccgccagtc tggaggtcgc acggccggcc caatccggca ggggttgcat
120ggtgacatag gcctccaaat gacggcagcc atggctggtg acgacgcaaa tgacggcggc
180catggctggc ggcggtggct gtcaacggcg gcggtcgcgg cggtgctcct cagcttaggg
240gatggtggtg acgatgacaa ggcaaccgag gggatacggg ggagctcagc agggatgtat
300actgataaaa gatcagtttc agaggacaca cacctctatc catctactaa acctactata
360aagccagccc ggccacaacc ccagatgcac ccgctcaccg ctgctgccaa acacgccatc
420accataccac caccaccacc accagccgca gcagctagct acacatcgtc gccgtcgtca
480ggcactagcg atccatcggt tctagatctc tcttctgctg agatagacga cgacggcgac
540ggcgacgacg atcgagctga gcagcaagaa atcaagaatt cgaaggagtt agtgatgggg
600agggcgccgt gctgcgacaa ggcgagcgtg aagaaagggc cgtggtcgcc ggaggaggac
660gccaagctca aggcctacat cgaggagaac ggcaccggcg gcaactggat cgcgctgccg
720cagaagatcg ggctgaagag atgtggcaag agttgcaggc tcagatggct caactacctg
780cggccaaaca ttaagcatgg tgatttcaca gaagaagagg agcacatcat ttgtagcctc
840tacattagca tcggtagcag gtggtcgatc atcgcggcgc agctgccggg gagaacggac
900aacgacatca agaactactg gaacaccaag ctgaagaaga agctcctcgg caagcgcgcg
960ccgtcgcgcc gcgcccgcgc aaaccaagac cactgcggtc tagcaggcag cgccgccgcc
1020gccatgtgcg gtggcgtcgg caccgcggcg gcggcggcgc cgccgcatca agccctaagc
1080tcgtcggccc tcgagcggat ccagctccac atgcgcctcc aaggcctcta caacagcgcg
1140ttcggctgca ccaccaccag cagcaacggc ggcggcgtcg gcgtcgcgcc gccgcagtgg
1200ccgaagctcg aggcgctgct gccgagcaga ccgctcccgg ccgtgcagcc gacggacgcc
1260gtcgtcgcca ccgtgcaaca cccccatcat ttggtcgtcg gcggccatac cctcgccacc
1320gccgccgccg ccgccgccac cacgtcggag gcgttccaag ccgccgagca cctcgaccct
1380gcggcggcga ccggttcgaa ctacatgccg ggagtcgccg gtgtagagat gacctcgtcg
1440tcgtcaatgg cgggtggtgg tgggttcgtc gccggctacg gtctccacga cgagctatac
1500gacttcctct tcaagtgcga gtcgatcggc ggagcgcaag gcgggatcat cccttcgtcg
1560ttgccggagc tgcagtgccc ggacggcagc gccatcatcg gcgccgacga gaagttctcg
1620acgtggacgt cgtcgtcttg cgactacggt tcgggcggcg ccggcgatta cgttctaggg
1680tatgatcaat aa
1692183563PRTOryza sativa 183Met Ala Ala Ala Ala Ala Arg Asp Asp Asp Leu
Gly Asn Asp Asp Leu1 5 10
15Asp Asp Asp Arg Ser Asn Pro Arg Arg Gln Ser Gly Gly Arg Thr Ala20
25 30Gly Pro Ile Arg Gln Gly Leu His Gly Asp
Ile Gly Leu Gln Met Thr35 40 45Ala Ala
Met Ala Gly Asp Asp Ala Asn Asp Gly Gly His Gly Trp Arg50
55 60Arg Trp Leu Ser Thr Ala Ala Val Ala Ala Val Leu
Leu Ser Leu Gly65 70 75
80Asp Gly Gly Asp Asp Asp Lys Ala Thr Glu Gly Ile Arg Gly Ser Ser85
90 95Ala Gly Met Tyr Thr Asp Lys Arg Ser Val
Ser Glu Asp Thr His Leu100 105 110Tyr Pro
Ser Thr Lys Pro Thr Ile Lys Pro Ala Arg Pro Gln Pro Gln115
120 125Met His Pro Leu Thr Ala Ala Ala Lys His Ala Ile
Thr Ile Pro Pro130 135 140Pro Pro Pro Pro
Ala Ala Ala Ala Ser Tyr Thr Ser Ser Pro Ser Ser145 150
155 160Gly Thr Ser Asp Pro Ser Val Leu Asp
Leu Ser Ser Ala Glu Ile Asp165 170 175Asp
Asp Gly Asp Gly Asp Asp Asp Arg Ala Glu Gln Gln Glu Ile Lys180
185 190Asn Ser Lys Glu Leu Val Met Gly Arg Ala Pro
Cys Cys Asp Lys Ala195 200 205Ser Val Lys
Lys Gly Pro Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys210
215 220Ala Tyr Ile Glu Glu Asn Gly Thr Gly Gly Asn Trp
Ile Ala Leu Pro225 230 235
240Gln Lys Ile Gly Leu Lys Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp245
250 255Leu Asn Tyr Leu Arg Pro Asn Ile Lys
His Gly Asp Phe Thr Glu Glu260 265 270Glu
Glu His Ile Ile Cys Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp275
280 285Ser Ile Ile Ala Ala Gln Leu Pro Gly Arg Thr
Asp Asn Asp Ile Lys290 295 300Asn Tyr Trp
Asn Thr Lys Leu Lys Lys Lys Leu Leu Gly Lys Arg Ala305
310 315 320Pro Ser Arg Arg Ala Arg Ala
Asn Gln Asp His Cys Gly Leu Ala Gly325 330
335Ser Ala Ala Ala Ala Met Cys Gly Gly Val Gly Thr Ala Ala Ala Ala340
345 350Ala Pro Pro His Gln Ala Leu Ser Ser
Ser Ala Leu Glu Arg Ile Gln355 360 365Leu
His Met Arg Leu Gln Gly Leu Tyr Asn Ser Ala Phe Gly Cys Thr370
375 380Thr Thr Ser Ser Asn Gly Gly Gly Val Gly Val
Ala Pro Pro Gln Trp385 390 395
400Pro Lys Leu Glu Ala Leu Leu Pro Ser Arg Pro Leu Pro Ala Val
Gln405 410 415Pro Thr Asp Ala Val Val Ala
Thr Val Gln His Pro His His Leu Val420 425
430Val Gly Gly His Thr Leu Ala Thr Ala Ala Ala Ala Ala Ala Thr Thr435
440 445Ser Glu Ala Phe Gln Ala Ala Glu His
Leu Asp Pro Ala Ala Ala Thr450 455 460Gly
Ser Asn Tyr Met Pro Gly Val Ala Gly Val Glu Met Thr Ser Ser465
470 475 480Ser Ser Met Ala Gly Gly
Gly Gly Phe Val Ala Gly Tyr Gly Leu His485 490
495Asp Glu Leu Tyr Asp Phe Leu Phe Lys Cys Glu Ser Ile Gly Gly
Ala500 505 510Gln Gly Gly Ile Ile Pro Ser
Ser Leu Pro Glu Leu Gln Cys Pro Asp515 520
525Gly Ser Ala Ile Ile Gly Ala Asp Glu Lys Phe Ser Thr Trp Thr Ser530
535 540Ser Ser Cys Asp Tyr Gly Ser Gly Gly
Ala Gly Asp Tyr Val Leu Gly545 550 555
560Tyr Asp Gln184870DNAOryza sativa 184atggggcgcg cgccgtgctg
cgacaaggcg agcgtgaagc gggggccgtg gtcgccggag 60gaggacgagc agctgcggag
ctacgtccag agccacggca tcggcggcaa ctggatcgcg 120ctcccgcaga aagcagggct
caaccgctgc ggcaagagct gcaggctccg gtggctcaac 180tacctgaggc cggacatcaa
gcacggcggc tacaccgagc aggaggacca catcatctgc 240tcgctctaca actcgattgg
aagcaggtgg tccatcatcg cgtcgaagct ccccggccgg 300actgacaacg acgtcaagaa
ctactggaac accaagctca agaagaaagc catgggcgcc 360gtgcagccgc gcgccgccgc
ctcggcgccg agccaatgca cgtcgtcagc gatggcgccg 420gcgctctcgc cggcctcctc
gtccgtcacc agctcgagcg gcgacgcctg cttcgccgcc 480gccgccacca ccaccaccac
catgtacccg ccaccgacga cgccgccgca gcagcagttc 540atccgcttcg acgcaccacc
cgcggcggcg gcggcggcat ccccgaccga cctcgcgccc 600gtgccaccac cggccaccgt
cacggcggac ggcgacggcg gctgggcgtc cgacgccttg 660tccctcgacg acgtgttcct
cggcgagctc acggccggcg agccgctgtt cccttacgcc 720gagctgttca gcggcttcgc
cggcgcggcg ccggacagca aggccacctt ggagctctcg 780gcgtgctact tcccgaacat
ggcggagatg tgggcggcct ccgaccacgc ctacgccaag 840ccacagggtc tctgcaacac
cctgacatag 870185289PRTOryza sativa
185Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Ser Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Glu Gln Leu Arg Ser Tyr Val Gln Ser His20 25
30Gly Ile Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Asn35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asp
Ile Lys His Gly Gly Tyr Thr Glu Gln Glu Asp His Ile Ile Cys65
70 75 80Ser Leu Tyr Asn Ser Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ser Lys85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Ala Met Gly Ala
Val Gln Pro Arg Ala Ala Ala Ser115 120
125Ala Pro Ser Gln Cys Thr Ser Ser Ala Met Ala Pro Ala Leu Ser Pro130
135 140Ala Ser Ser Ser Val Thr Ser Ser Ser
Gly Asp Ala Cys Phe Ala Ala145 150 155
160Ala Ala Thr Thr Thr Thr Thr Met Tyr Pro Pro Pro Thr Thr
Pro Pro165 170 175Gln Gln Gln Phe Ile Arg
Phe Asp Ala Pro Pro Ala Ala Ala Ala Ala180 185
190Ala Ser Pro Thr Asp Leu Ala Pro Val Pro Pro Pro Ala Thr Val
Thr195 200 205Ala Asp Gly Asp Gly Gly Trp
Ala Ser Asp Ala Leu Ser Leu Asp Asp210 215
220Val Phe Leu Gly Glu Leu Thr Ala Gly Glu Pro Leu Phe Pro Tyr Ala225
230 235 240Glu Leu Phe Ser
Gly Phe Ala Gly Ala Ala Pro Asp Ser Lys Ala Thr245 250
255Leu Glu Leu Ser Ala Cys Tyr Phe Pro Asn Met Ala Glu Met
Trp Ala260 265 270Ala Ser Asp His Ala Tyr
Ala Lys Pro Gln Gly Leu Cys Asn Thr Leu275 280
285Thr186951DNAOryza sativa 186atggggaggg caccttgctg tgacaaggca
acagtgaaga aggggccatg gtcacctgag 60gaggatgcaa tgctcaagaa ctacattgag
gagcatggca ccggtggcaa ctggattgca 120ctgcctcaca agattgggct gaagaggtgt
ggcaagagct gcaggctaag gtggctgaat 180tacctgaggc caaacataaa gcatggggac
ttcaccccag aggaggacag catcatctgc 240agcctctaca ttagcatagg gagcaggtgg
tcaatcatag cagcacagct gccagggagg 300acggacaacg atgtcaagaa ctactggaac
acaaagctga agaagagact ccttggccgg 360cgcaaggacc gcggcggcgg ccaccaccac
cgcagccaga gcaccgccga cgatcttccg 420gccggtggtg acggcggcat gaacgacggc
ggcggcggcg gcggagagcg gtcgctgagc 480gcgtcggcga tggagaggat ccagctctgc
atgcagctgc aggagctgca gaacccactg 540tccatccacc acaacccctt gctctctcat
cagtggccaa gcaaggccac cattgatgat 600cagaatcaca acaatgtcac tgtggctgaa
catggaatgt caagctctgt gagcgaccac 660caccgcctcg atgggcagca gctggagagc
ggcgccggcg ccgccgccat gcagcaggcg 720tcgccgtcga gcggcggcga gaactccaac
gtcgtcgtcg ccatcgaggc cgagctccag 780gagcttctct acgccggcgg cggcgcgatc
gtcgacggcg gcgcgccgcc gcagggggat 840gtggactggt ggagctatga tcagggaaag
cagtcacctg tgacttgctg ggatttcacc 900cctgaaacca gctccatctt ccaggattat
gcaacagttt atgacatctg a 951187316PRTOryza sativa 187Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Thr Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Met
Leu Lys Asn Tyr Ile Glu Glu His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Asp Phe Thr Pro Glu Glu Asp Ser Ile Ile Cys65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Arg Leu Leu Gly Arg Arg
Lys Asp Arg Gly Gly Gly His115 120 125His
His Arg Ser Gln Ser Thr Ala Asp Asp Leu Pro Ala Gly Gly Asp130
135 140Gly Gly Met Asn Asp Gly Gly Gly Gly Gly Gly
Glu Arg Ser Leu Ser145 150 155
160Ala Ser Ala Met Glu Arg Ile Gln Leu Cys Met Gln Leu Gln Glu
Leu165 170 175Gln Asn Pro Leu Ser Ile His
His Asn Pro Leu Leu Ser His Gln Trp180 185
190Pro Ser Lys Ala Thr Ile Asp Asp Gln Asn His Asn Asn Val Thr Val195
200 205Ala Glu His Gly Met Ser Ser Ser Val
Ser Asp His His Arg Leu Asp210 215 220Gly
Gln Gln Leu Glu Ser Gly Ala Gly Ala Ala Ala Met Gln Gln Ala225
230 235 240Ser Pro Ser Ser Gly Gly
Glu Asn Ser Asn Val Val Val Ala Ile Glu245 250
255Ala Glu Leu Gln Glu Leu Leu Tyr Ala Gly Gly Gly Ala Ile Val
Asp260 265 270Gly Gly Ala Pro Pro Gln Gly
Asp Val Asp Trp Trp Ser Tyr Asp Gln275 280
285Gly Lys Gln Ser Pro Val Thr Cys Trp Asp Phe Thr Pro Glu Thr Ser290
295 300Ser Ile Phe Gln Asp Tyr Ala Thr Val
Tyr Asp Ile305 310 3151881134DNAOryza
sativa 188atgggcaggg cgccgtgctg cgacaaggcg acggtgaaga agggcccgtg
ggcgccggag 60gaggacgccg cgctcaaggc ctacgtcgac gcccatggca ccggcggcaa
ctggatcgcc 120ctcccccaca agatcgggct gaacaggtgc ggcaagagct gccggctgcg
gtggctcaac 180tacctccggc cgaacatccg gcacggcggc ttcaccgagg acgaggaccg
cctcatctgc 240agcctctaca tcgccatcgg gagcaggtgg gcgaccatcg cggcgcagct
gccggggagg 300acggacaacg acatcaagaa ctactggaac agcaagctca agcgccgcct
gctcggcggc 360ggccgccggc cgcggggcgc gccaccgcgg ctcgtgctcg ccggcccggg
ccccgctgta 420accgccgccg ccacgtcgcg caacgccatg gccgcgtcgg cgatcgagcg
gatgcagctc 480agcgtgcggc tgcgccgcct ggaggccgcg gcgccgccgc cgccgcagcc
cttcaccttc 540tacggcagca acaacctcgc cgcgccgccg tggcagcagc ctatctcccc
ggccgcgggc 600ggcagctcgg agatgccacg gcggctgcat caccaccacc cctccggcgc
cgccgctacc 660tcgagctact ccggcctgat cagcagctgg ccgtcgtctc gctcccacat
catccacgac 720gcctggctcg acgcctcctc caccccgccg ctgtcgacga cgagcatggg
cgacgccgcc 780acgacgacga cgacggccgg cggggagagc tcgagctcga cgccgacggt
gagcacggcc 840accacgccgt tcatcggcgg cagcatcgac atggacgacg agatcgacat
gctgctccag 900cagatcaggt gcttcgatga gaacggcgac gacggcgacg acgacgccga
ccagcggctg 960atcgtcggcg acgaggccgc agccggagca gagaactacc tcagggcatt
gatagacgag 1020gcggcagcga acggtggcga tgtcggcgtt ggctcctgga gctcttgctc
tactccagga 1080gtggactccg tgttccatga gtatgctcag ctagactacg gacagtataa
ttaa 1134189377PRTOryza sativa 189Met Gly Arg Ala Pro Cys Cys Asp
Lys Ala Thr Val Lys Lys Gly Pro1 5 10
15Trp Ala Pro Glu Glu Asp Ala Ala Leu Lys Ala Tyr Val Asp
Ala His20 25 30Gly Thr Gly Gly Asn Trp
Ile Ala Leu Pro His Lys Ile Gly Leu Asn35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Arg His Gly Gly Phe Thr Glu
Asp Glu Asp Arg Leu Ile Cys65 70 75
80Ser Leu Tyr Ile Ala Ile Gly Ser Arg Trp Ala Thr Ile Ala
Ala Gln85 90 95Leu Pro Gly Arg Thr Asp
Asn Asp Ile Lys Asn Tyr Trp Asn Ser Lys100 105
110Leu Lys Arg Arg Leu Leu Gly Gly Gly Arg Arg Pro Arg Gly Ala
Pro115 120 125Pro Arg Leu Val Leu Ala Gly
Pro Gly Pro Ala Val Thr Ala Ala Ala130 135
140Thr Ser Arg Asn Ala Met Ala Ala Ser Ala Ile Glu Arg Met Gln Leu145
150 155 160Ser Val Arg Leu
Arg Arg Leu Glu Ala Ala Ala Pro Pro Pro Pro Gln165 170
175Pro Phe Thr Phe Tyr Gly Ser Asn Asn Leu Ala Ala Pro Pro
Trp Gln180 185 190Gln Pro Ile Ser Pro Ala
Ala Gly Gly Ser Ser Glu Met Pro Arg Arg195 200
205Leu His His His His Pro Ser Gly Ala Ala Ala Thr Ser Ser Tyr
Ser210 215 220Gly Leu Ile Ser Ser Trp Pro
Ser Ser Arg Ser His Ile Ile His Asp225 230
235 240Ala Trp Leu Asp Ala Ser Ser Thr Pro Pro Leu Ser
Thr Thr Ser Met245 250 255Gly Asp Ala Ala
Thr Thr Thr Thr Thr Ala Gly Gly Glu Ser Ser Ser260 265
270Ser Thr Pro Thr Val Ser Thr Ala Thr Thr Pro Phe Ile Gly
Gly Ser275 280 285Ile Asp Met Asp Asp Glu
Ile Asp Met Leu Leu Gln Gln Ile Arg Cys290 295
300Phe Asp Glu Asn Gly Asp Asp Gly Asp Asp Asp Ala Asp Gln Arg
Leu305 310 315 320Ile Val
Gly Asp Glu Ala Ala Ala Gly Ala Glu Asn Tyr Leu Arg Ala325
330 335Leu Ile Asp Glu Ala Ala Ala Asn Gly Gly Asp Val
Gly Val Gly Ser340 345 350Trp Ser Ser Cys
Ser Thr Pro Gly Val Asp Ser Val Phe His Glu Tyr355 360
365Ala Gln Leu Asp Tyr Gly Gln Tyr Asn370
375190885DNAOryza sativa 190atgaagaagg gaaaatggtc caaggaagag gacgatttga
tcaaaaacca catggagaag 60tatggcattg gccgtagctg gcaggcactg tctgatgctt
tagggctgca gaggtgtggc 120cggagctgcc gttcccggtg gctgaactac cttcggccgg
ggctgaagca cggcgacttc 180tcgccggcgg aggagaggat catctgcaag atgtacagca
agaagggaag cagctggtcg 240gccatcgccg cgcagctgcc ggggaggacg gacctcgccg
tcaagaacta ctggaacagc 300acgctcaaga agaggttccc ggcggcggcg gcggcgagga
gcaccgccgc cgcgcgccgc 360aggcaccgcc ccgccgccag cgccaccaca tcgtccgacg
acgacgacga cgtcgacgtc 420gacgacgcga ccccgcccgg cctcgcgctg gtcgtctaca
gcgaggggag caccgccgcc 480gcggcggcgg ccggcgagct cgctccgtac tccatctcat
ccccggccgc cactgccgac 540gcggcggaag aagaagagcc gatcgcggcc gttccgatca
gcacctgcat cctggcactg 600ccgccgccac caccaccgcc accgccgccg ccgagcgatg
ccaccggcgg cgaggtgagc 660atcccctgct tccccttctc gccgctgcca ttcatcgagc
cggacttgcc ggagctgacc 720tggacgaccg acctcgacga cattaccgcc accttcgatg
ccgccgcctg ccgctacccg 780aagcggccgc atccaatcac accggattcg cctcgccttc
cgaagcttcc cgccattgcc 840ggcttcgacg acgtgcagag cttcttgtct tggttcgatg
actga 885191294PRTOryza sativa 191Met Lys Lys Gly Lys
Trp Ser Lys Glu Glu Asp Asp Leu Ile Lys Asn1 5
10 15His Met Glu Lys Tyr Gly Ile Gly Arg Ser Trp
Gln Ala Leu Ser Asp20 25 30Ala Leu Gly
Leu Gln Arg Cys Gly Arg Ser Cys Arg Ser Arg Trp Leu35 40
45Asn Tyr Leu Arg Pro Gly Leu Lys His Gly Asp Phe Ser
Pro Ala Glu50 55 60Glu Arg Ile Ile Cys
Lys Met Tyr Ser Lys Lys Gly Ser Ser Trp Ser65 70
75 80Ala Ile Ala Ala Gln Leu Pro Gly Arg Thr
Asp Leu Ala Val Lys Asn85 90 95Tyr Trp
Asn Ser Thr Leu Lys Lys Arg Phe Pro Ala Ala Ala Ala Ala100
105 110Arg Ser Thr Ala Ala Ala Arg Arg Arg His Arg Pro
Ala Ala Ser Ala115 120 125Thr Thr Ser Ser
Asp Asp Asp Asp Asp Val Asp Val Asp Asp Ala Thr130 135
140Pro Pro Gly Leu Ala Leu Val Val Tyr Ser Glu Gly Ser Thr
Ala Ala145 150 155 160Ala
Ala Ala Ala Gly Glu Leu Ala Pro Tyr Ser Ile Ser Ser Pro Ala165
170 175Ala Thr Ala Asp Ala Ala Glu Glu Glu Glu Pro
Ile Ala Ala Val Pro180 185 190Ile Ser Thr
Cys Ile Leu Ala Leu Pro Pro Pro Pro Pro Pro Pro Pro195
200 205Pro Pro Pro Ser Asp Ala Thr Gly Gly Glu Val Ser
Ile Pro Cys Phe210 215 220Pro Phe Ser Pro
Leu Pro Phe Ile Glu Pro Asp Leu Pro Glu Leu Thr225 230
235 240Trp Thr Thr Asp Leu Asp Asp Ile Thr
Ala Thr Phe Asp Ala Ala Ala245 250 255Cys
Arg Tyr Pro Lys Arg Pro His Pro Ile Thr Pro Asp Ser Pro Arg260
265 270Leu Pro Lys Leu Pro Ala Ile Ala Gly Phe Asp
Asp Val Gln Ser Phe275 280 285Leu Ser Trp
Phe Asp Asp290192807DNAPicea 192atgggaagag cgccctgttg tgacaaggca
aatgtcaaaa agggaccatg gtcgccagaa 60gaagacgcaa aactcaaggc ctttatcgag
caacatggca ctggtggcaa ttggattgct 120cttccacaga aagctggtct gaaaaggtgt
ggaaaaagct gcaggcttag atggttgaac 180tatttgaggc cagatataag gcatggtggt
ttctcagagg atgaagatag catcatttgt 240ggcctctatg caagcattgg aagcaggtgg
tccataattg cagcccagtt accgggaaga 300acggacaatg acatcaaaaa ctactggaat
acaagactga agaaaaaact gcctgggaag 360cgtaaagagc agcaaacacg taggtttaaa
gaagcaaaga gcatgggtaa tggggcaggg 420ccttatgttt cagaaggtat gtctgctgca
tcatccacca taaatgcaat tagatctctg 480atgtcaaaca cagaggcact ttatctgtcc
gatcgaatgc ctcatatgga tatggatcca 540tcgttaggcc tacttaatcc tcagttcttg
cagcatactg tatcaaatat tgctaattat 600tgctcaagct cggagcatgt tagctttgga
actggtcctc cccaggattg gatcaggagc 660ttacatttga caatgcaagt cttagaaagc
tgctcttcag gactgaatgc gccttcactg 720cagacaataa caataatacc tctcaagcat
ctcccgcaat cattagttct cagatatatt 780ccccatctac tattttggat cccgggc
807193269PRTPicea 193Met Gly Arg Ala
Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys
Ala Phe Ile Glu Gln His20 25 30Gly Thr
Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn
Tyr Leu Arg Pro50 55 60Asp Ile Arg His
Gly Gly Phe Ser Glu Asp Glu Asp Ser Ile Ile Cys65 70
75 80Gly Leu Tyr Ala Ser Ile Gly Ser Arg
Trp Ser Ile Ile Ala Ala Gln85 90 95Leu
Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Pro Gly Lys Arg Lys Glu
Gln Gln Thr Arg Arg115 120 125Phe Lys Glu
Ala Lys Ser Met Gly Asn Gly Ala Gly Pro Tyr Val Ser130
135 140Glu Gly Met Ser Ala Ala Ser Ser Thr Ile Asn Ala
Ile Arg Ser Leu145 150 155
160Met Ser Asn Thr Glu Ala Leu Tyr Leu Ser Asp Arg Met Pro His Met165
170 175Asp Met Asp Pro Ser Leu Gly Leu Leu
Asn Pro Gln Phe Leu Gln His180 185 190Thr
Val Ser Asn Ile Ala Asn Tyr Cys Ser Ser Ser Glu His Val Ser195
200 205Phe Gly Thr Gly Pro Pro Gln Asp Trp Ile Arg
Ser Leu His Leu Thr210 215 220Met Gln Val
Leu Glu Ser Cys Ser Ser Gly Leu Asn Ala Pro Ser Leu225
230 235 240Gln Thr Ile Thr Ile Ile Pro
Leu Lys His Leu Pro Gln Ser Leu Val245 250
255Leu Arg Tyr Ile Pro His Leu Leu Phe Trp Ile Pro Gly260
265194495DNAPicea 194atgggccgag caccctgctg cgacaaagca aatgtcaaga
gagggccgtg gtctcctgaa 60gaggacacca tactcaaaaa tttcgtagag aagcatggca
ccggaggcaa ctggatcgct 120ttgcctcgca aagcaggttt gaaaaggtgc ggcaagagct
gtagactgag gtggttgaat 180tatttgaggc ctgatatcaa gcacggtgac ttctcagaag
aagaagacag catcatttgc 240agcctctata ccagcattgg aagcagatgg tctatcattg
cagcccagtt gcccgggaga 300acagacaacg acataaagaa ctactggaac acgcggctga
agaaaaaatt gcttggcaaa 360tgcaccaagg acaacgataa tcagcaaatc cggcgcttac
tggcaaaaga agcagccaaa 420ggtgcgggaa ttaggagacc ttacaacggt gatcatattt
cttcagaaac ggccgccatg 480tctgcgtcaa acact
495195165PRTPicea 195Met Gly Arg Ala Pro Cys Cys
Asp Lys Ala Asn Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Thr Ile Leu Lys Asn Phe Val
Glu Lys His20 25 30Gly Thr Gly Gly Asn
Trp Ile Ala Leu Pro Arg Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asp Ile Lys His Gly Asp Phe
Ser Glu Glu Glu Asp Ser Ile Ile Cys65 70
75 80Ser Leu Tyr Thr Ser Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ala Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Cys Thr Lys Asp Asn Asp
Asn Gln115 120 125Gln Ile Arg Arg Leu Leu
Ala Lys Glu Ala Ala Lys Gly Ala Gly Ile130 135
140Arg Arg Pro Tyr Asn Gly Asp His Ile Ser Ser Glu Thr Ala Ala
Met145 150 155 160Ser Ala
Ser Asn Thr165196827DNAPinus 196atgggaagag caccctgttg tgacaaggca
aatgtcaaaa aaggatcttg gtcaccagaa 60gaagacgcaa aactcaaggc gtttattgaa
cagcatggca ctggtggcaa ttggattgct 120cttccacaga aagctggtct gaaaaggtgt
ggaaagagct gcaggcttag atggttgaac 180tatttgaggc cagatataag gcatggtggt
ttctcagaag atgaagataa catcatttgt 240agcctctatg caagcattgg aagcaggtgg
tctataattg cagcccagtt acctggaaga 300acagacaatg acataaaaaa ttactggaac
acaaggctga agaaaaagtt gcttgggaaa 360cgtaaagatc agcaaacacg taggtttaaa
gaagcaaaga acatgagtaa tggcacaggg 420ccttatgttt cagaaggtat gtctactaca
tcatccacca tcaatgcaat tagatctctg 480atgtcaaaca ccgaggcact ttatctgtca
gatcgaatgc ctcatatgga tatcgatcca 540tcatcattgg gcctgcttaa tcctcaggtc
ttgcagcata ctgtatcaaa tattgctaat 600tattgctcaa gctcagagca tattagcttt
ggaactggtc ctccccaggg actggatcag 660gagcttacat ttgacaatgc aagtctgaga
aagctgctct ttaggactga gtgtggcttc 720actgcagaaa ataacaataa cagtgtctct
caagcatctc caccaatcat tagttctcag 780atatattccc catctactat tttggatccc
ggacttgtcc atgactc 827197276PRTPinus 197Met Gly Arg Ala
Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Ser1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys
Ala Phe Ile Glu Gln His20 25 30Gly Thr
Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn
Tyr Leu Arg Pro50 55 60Asp Ile Arg His
Gly Gly Phe Ser Glu Asp Glu Asp Asn Ile Ile Cys65 70
75 80Ser Leu Tyr Ala Ser Ile Gly Ser Arg
Trp Ser Ile Ile Ala Ala Gln85 90 95Leu
Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Arg Lys Asp
Gln Gln Thr Arg Arg115 120 125Phe Lys Glu
Ala Lys Asn Met Ser Asn Gly Thr Gly Pro Tyr Val Ser130
135 140Glu Gly Met Ser Thr Thr Ser Ser Thr Ile Asn Ala
Ile Arg Ser Leu145 150 155
160Met Ser Asn Thr Glu Ala Leu Tyr Leu Ser Asp Arg Met Pro His Met165
170 175Asp Ile Asp Pro Ser Ser Leu Gly Leu
Leu Asn Pro Gln Val Leu Gln180 185 190His
Thr Val Ser Asn Ile Ala Asn Tyr Cys Ser Ser Ser Glu His Ile195
200 205Ser Phe Gly Thr Gly Pro Pro Gln Gly Leu Asp
Gln Glu Leu Thr Phe210 215 220Asp Asn Ala
Ser Leu Arg Lys Leu Leu Phe Arg Thr Glu Cys Gly Phe225
230 235 240Thr Ala Glu Asn Asn Asn Asn
Ser Val Ser Gln Ala Ser Pro Pro Ile245 250
255Ile Ser Ser Gln Ile Tyr Ser Pro Ser Thr Ile Leu Asp Pro Gly Leu260
265 270Val His Asp Ser2751981245DNAPinus
198atgggtcgag caccctgctg tgacaaagca aatgtcaaga gagggccatg gtctcctgaa
60gaggacacca tactaaaaaa tttcgtggag aagcatggca ctggaggcaa ctggatcgct
120ttgcctcgca aagcaggttt aaaaaggtgt ggcaagagtt gcaggctgag atggctgaat
180tatttgaggc ctgatattaa gcatggtgac ttctcagaag aagaagacga catcatttgc
240accctgtata ccagcattgg aagcagatgg tctatcattg cagcccagtt gccagggcga
300acagacaacg acataaagaa ctactggaac acgcggctga agaaaaaatt gcttggcaaa
360tgcagcaagg acaacgataa tcagcaaatc cgacgcctac tggcaaaaga agcagctaaa
420ggcgcgggaa ttagaggact ttacaacggt gatcataata tttcttcaga aacggccgcc
480atttctgcat caaacactca aagttcgtca gaatctctga cagaaacagc cactgccgca
540aatgccatga atagccctta taatgcccta gaaacgggta aagtcccggg atcgaattcg
600tctgagtcaa aagcggccgg ttttaatgcc acccacagtc ctaataccca ttattatgaa
660caaggtttca gtcaaatcat gtctcaggca gatcagtatt cttccctgac acacatgctt
720ctgagactcg agaataacga gagcgattgc tcaacagaca atattcaatc tctgggcatc
780gatacaattc ccagtgaggt ccctttctac gcttccacgg ccatgaatgt aaaagctgaa
840gccatggagc gcccatctag tgatccccag ctcaatcaag cccgcaattc agtgcctgca
900ctctgggaaa cctgtaccag caattccaca tctgggggca acaattatca gttcagtacg
960ttgataaacg atgatacggg caatttcagc tatgggctct cagcggacat ggatgagttc
1020atagtgtatc gaaattatgg aggcaatggc tcaatatctc aggtgaaaga ggagcctgac
1080tactccacag ctgaagccta ctgggcttct caactagctg aacctgcaaa gtcctcaggc
1140ttaacaacaa catgtcccaa ttatgcatac attttgccat catctgaggg tggtatgggt
1200tctggaacta ccccacaggg gctctttcag gagggaatta tctat
1245199415PRTPinus 199Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys
Arg Gly Pro1 5 10 15Trp
Ser Pro Glu Glu Asp Thr Ile Leu Lys Asn Phe Val Glu Lys His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Arg
Lys Ala Gly Leu Lys35 40 45Arg Cys Gly
Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asp Ile Lys His Gly Asp Phe Ser Glu Glu Glu Asp Asp
Ile Ile Cys65 70 75
80Thr Leu Tyr Thr Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Gly Lys Cys Ser Lys Asp Asn Asp Asn Gln115
120 125Gln Ile Arg Arg Leu Leu Ala Lys Glu Ala Ala Lys
Gly Ala Gly Ile130 135 140Arg Gly Leu Tyr
Asn Gly Asp His Asn Ile Ser Ser Glu Thr Ala Ala145 150
155 160Ile Ser Ala Ser Asn Thr Gln Ser Ser
Ser Glu Ser Leu Thr Glu Thr165 170 175Ala
Thr Ala Ala Asn Ala Met Asn Ser Pro Tyr Asn Ala Leu Glu Thr180
185 190Gly Lys Val Pro Gly Ser Asn Ser Ser Glu Ser
Lys Ala Ala Gly Phe195 200 205Asn Ala Thr
His Ser Pro Asn Thr His Tyr Tyr Glu Gln Gly Phe Ser210
215 220Gln Ile Met Ser Gln Ala Asp Gln Tyr Ser Ser Leu
Thr His Met Leu225 230 235
240Leu Arg Leu Glu Asn Asn Glu Ser Asp Cys Ser Thr Asp Asn Ile Gln245
250 255Ser Leu Gly Ile Asp Thr Ile Pro Ser
Glu Val Pro Phe Tyr Ala Ser260 265 270Thr
Ala Met Asn Val Lys Ala Glu Ala Met Glu Arg Pro Ser Ser Asp275
280 285Pro Gln Leu Asn Gln Ala Arg Asn Ser Val Pro
Ala Leu Trp Glu Thr290 295 300Cys Thr Ser
Asn Ser Thr Ser Gly Gly Asn Asn Tyr Gln Phe Ser Thr305
310 315 320Leu Ile Asn Asp Asp Thr Gly
Asn Phe Ser Tyr Gly Leu Ser Ala Asp325 330
335Met Asp Glu Phe Ile Val Tyr Arg Asn Tyr Gly Gly Asn Gly Ser Ile340
345 350Ser Gln Val Lys Glu Glu Pro Asp Tyr
Ser Thr Ala Glu Ala Tyr Trp355 360 365Ala
Ser Gln Leu Ala Glu Pro Ala Lys Ser Ser Gly Leu Thr Thr Thr370
375 380Cys Pro Asn Tyr Ala Tyr Ile Leu Pro Ser Ser
Glu Gly Gly Met Gly385 390 395
400Ser Gly Thr Thr Pro Gln Gly Leu Phe Gln Glu Gly Ile Ile Tyr405
410 415200681DNAPoncirus trifoliata
200atgggtaggg ctccttgctg tgacaaggca aatgtgaaga aggggccatg gtcacctgaa
60gaagactcaa agcttaagga gtacatagag aaatatggca ctggtggcaa ctggattgct
120cttcctcaga aagctggtct aaagagatgt gggaaaagtt gcagactgag atggctcaac
180tatttgaggc caaatattaa acatggggaa tttactgatg aggaagatag gctgatctgt
240agcctctttg ctagcattgg aagcagatgg tcaattatag ctgctcagct accgggtagg
300actgataatg atatcaaaaa ctactggaac acaaagctca agaagaaact catggccatg
360gctcctccat tatcatcaca aaagaagagc actgctccac cactgatccc atcatctcat
420catcatcacc aagctttagt gtcactacta ccatctcaac cctattacac cccttctaat
480aaatctctta tcagtaactc ttttgattct tttgaaccca tttcatcaaa caccccatta
540catctttttg acaacaacaa cttgatccaa atccaagaaa accagctgtt tagtaataat
600aattccatgc agggttacta tccaatgaaa gacaatttct tgacttttgg tagcgagcaa
660agttgcagtt cttctgatgg t
681201227PRTPoncirus trifoliata 201Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ser Lys Leu Lys Glu Tyr Ile Glu Lys
Tyr20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro Gln Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Thr Asp Glu
Glu Asp Arg Leu Ile Cys65 70 75
80Ser Leu Phe Ala Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Met Ala Met Ala Pro Pro Leu Ser Ser Gln Lys115
120 125Lys Ser Thr Ala Pro Pro Leu Ile Pro
Ser Ser His His His His Gln130 135 140Ala
Leu Val Ser Leu Leu Pro Ser Gln Pro Tyr Tyr Thr Pro Ser Asn145
150 155 160Lys Ser Leu Ile Ser Asn
Ser Phe Asp Ser Phe Glu Pro Ile Ser Ser165 170
175Asn Thr Pro Leu His Leu Phe Asp Asn Asn Asn Leu Ile Gln Ile
Gln180 185 190Glu Asn Gln Leu Phe Ser Asn
Asn Asn Ser Met Gln Gly Tyr Tyr Pro195 200
205Met Lys Asp Asn Phe Leu Thr Phe Gly Ser Glu Gln Ser Cys Ser Ser210
215 220Ser Asp Gly225202423DNAPopulus
202atggggaggg ctccttgctg cgacaaagcc aacgtcaaga aaggcccatg gtcacctgaa
60gaagatgcga agctcaagtc atatatagag cagcatggca ctggtggtaa ctggattgct
120ttgcctcaaa aaattggtct taagagatgc ggcaagagct gccgccttag atggttaaat
180tatcttcgcc cgaatattaa gcacggaggc ttctctgaag aagaagataa cataatttgc
240agcctttaca ttagtatcgg aagcagatgg tccataattg cagcacaatt gccaggaaga
300accgataatg atataaagaa ctactggaac acaaggctga agaagaagct tcttggcaag
360cagcgaaaag agcaacaggc tcgcagaagt agtggcctaa agcaagaaat gaagagagga
420aat
423203345PRTPopulus 203Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val
Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ser Tyr Ile Glu Gln His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ile Gly Leu Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp
Asn Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Arg100 105 110Leu Lys
Lys Lys Leu Leu Gly Lys Gln Arg Lys Glu Gln Gln Ala Arg115
120 125Arg Ser Ser Gly Leu Lys Gln Glu Met Lys Arg Gly
Asn Gly Asn Pro130 135 140Val Val Pro Ala
Asp Asn Asn Asn Gln Asn Pro Tyr Trp Pro Glu Leu145 150
155 160Pro Ile Leu Ala Pro Ile Pro Tyr Ser
Asn His Glu Pro His Phe Asn165 170 175Asp
His Ala Ser Ile Arg Lys Leu Leu Ile Lys Leu Gly Gly Lys Phe180
185 190Ser Asp Asp Asp Gln Val Asn His Asn Ala Met
Asn Pro Gln Phe Pro195 200 205Ala Asp Val
Ser Tyr Thr Gln Gln Leu Tyr Asp Asp Gln Ser Ile Asn210
215 220Val Ser Ser Ser Ala Pro Met Glu Glu Thr Leu Asp
Asp Thr Asp Thr225 230 235
240Gln Phe Ala Gln Thr Gln Tyr Asp Ile Asp Gly Ala Ala Gly Leu Gln245
250 255Met Leu Gln Gly Gln Ser Ser Phe Pro
Ala Gly Leu Glu Gln Met Val260 265 270Ser
Ser Asn Pro Pro Arg Leu Asp Gly Leu Glu Phe Leu Leu Gly Asp275
280 285Asp Met Val Asn Asn Arg Ile Arg Thr Ala Tyr
Gly Thr Glu Ser Met290 295 300Ile Asp Cys
Gly Glu Met Thr Ser Leu Ile Phe Pro Pro Gly Ala Ser305
310 315 320Asn Cys Glu Gly Ile Ile Gln
Gln Arg Leu Leu Gln Glu Cys Ala Phe325 330
335Asp Glu Pro Arg Tyr Pro Gly Pro Leu340
345204572DNAPopulus 204atggggagag ctccttgctg tgacaaagcc aacgtgaaga
aaggtccatg gtctcccgag 60gaagatgcca tactcaaggc ctatattgag cagcatggaa
cgggtggcaa ttggattgcc 120ttgccacaga agataggcct gaaaaggtgt ggcaagagtt
gtcgacttag atggttgaat 180tatctccgtc caaacattaa acatggaggg ttttcagagg
aagaagataa cattatttgc 240agtctctata taagtattgg aagcaggtgg tctatcattg
ctgctcagtt acctggaaga 300accgataatg atataaaaaa ctactggaac acaaggctta
agaagaagct cttaggaaaa 360cagcgcaagg aacaagcagc ccggcgagct agtctcaagc
aagaaatcat gacaaagaga 420gaaataaatg agagcttcat ggttcctggg gctatccctc
atcaacaaag cccttactgg 480ccagaggtac cagctctagt catgaatcaa aaccaagatt
ctcatttgat ggatcaagaa 540tccattagga acttgctgat caaacttggt gg
572205191PRTPopulus 205Met Gly Arg Ala Pro Cys Cys
Asp Lys Ala Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Ile Leu Lys Ala Tyr Ile
Glu Gln His20 25 30Gly Thr Gly Gly Asn
Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg
Pro50 55 60Asn Ile Lys His Gly Gly Phe
Ser Glu Glu Glu Asp Asn Ile Ile Cys65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile
Ile Ala Ala Gln85 90 95Leu Pro Gly Arg
Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Lys Lys Lys Leu Leu Gly Lys Gln Arg Lys Glu Gln Ala
Ala Arg115 120 125Arg Ala Ser Leu Lys Gln
Glu Ile Met Thr Lys Arg Glu Ile Asn Glu130 135
140Ser Phe Met Val Pro Gly Ala Ile Pro His Gln Gln Ser Pro Tyr
Trp145 150 155 160Pro Glu
Val Pro Ala Leu Val Met Asn Gln Asn Gln Asp Ser His Leu165
170 175Met Asp Gln Glu Ser Ile Arg Asn Leu Leu Ile Lys
Leu Gly Gly180 185 190206999DNAPopulus
206atgggaaggg ctccttgttg tgacaaggct aatatgaaga aagggccctg gccgcctgaa
60gaagatgcaa agctgaagga gtacttggaa aaacaaggga ctggagggaa ttggattgct
120ctcccacaaa aggctggtca taaaagatgt gggaaaagct gcagattaag atggctaaac
180tatctcaggc ccaacattaa acatggagag ttctctgatg atgaagacag gataatctgc
240agcctatatg ccaacattgg aagccggtgg tcaataatag cagctcagtt gccaggcagg
300acggataatg acataaaaaa ctactggaac accaagctca agaagaaact aatgggtgtg
360attaatccta ttgctcagag aaaacctcag caagctgctc ttttttcatc tctccttcaa
420gctacatcat caccgtcttc accatccact ctcctgtcat caccatcatc atcattcaca
480tgcagcaaca acagttatta cagtaaccta actaggtctt tcactgatcc aatctcattt
540tcatcaagtc ctatgagcac tagctctttc gctactgctt ccatgctaca cccccaacaa
600acctttgtgg ggcctatgca aaatgatcaa gttaaagata gtcttataat gtttggaggt
660gaagccagct gcagctcgtc tgatgggagt tgcaacaacc agatgagcca tgtcaaagag
720gagtatgaat atagtggtgg tacaaataac aacaatgaac agatggggtt acaaaattat
780ccctacaatg gggttgaaga tggccaaaaa ttgatggttt caagtgaggc tgctgctcat
840ggtgtactta atgggtggat tgagaagcaa aatgggttat ggccgggaga taactcacta
900gactatggtc tagaggagat taagcaactt attagcacta gcagctgtaa tagctttttg
960tttgatgaaa acaagacagg aggaaaagtc atgtactac
999207333PRTPopulus 207Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Met
Lys Lys Gly Pro1 5 10
15Trp Pro Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Leu Glu Lys Gln20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro
Gln Lys Ala Gly His Lys35 40 45Arg Cys
Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Asp Glu Asp
Arg Ile Ile Cys65 70 75
80Ser Leu Tyr Ala Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys
Asn Tyr Trp Asn Thr Lys100 105 110Leu Lys
Lys Lys Leu Met Gly Val Ile Asn Pro Ile Ala Gln Arg Lys115
120 125Pro Gln Gln Ala Ala Leu Phe Ser Ser Leu Leu Gln
Ala Thr Ser Ser130 135 140Pro Ser Ser Pro
Ser Thr Leu Leu Ser Ser Pro Ser Ser Ser Phe Thr145 150
155 160Cys Ser Asn Asn Ser Tyr Tyr Ser Asn
Leu Thr Arg Ser Phe Thr Asp165 170 175Pro
Ile Ser Phe Ser Ser Ser Pro Met Ser Thr Ser Ser Phe Ala Thr180
185 190Ala Ser Met Leu His Pro Gln Gln Thr Phe Val
Gly Pro Met Gln Asn195 200 205Asp Gln Val
Lys Asp Ser Leu Ile Met Phe Gly Gly Glu Ala Ser Cys210
215 220Ser Ser Ser Asp Gly Ser Cys Asn Asn Gln Met Ser
His Val Lys Glu225 230 235
240Glu Tyr Glu Tyr Ser Gly Gly Thr Asn Asn Asn Asn Glu Gln Met Gly245
250 255Leu Gln Asn Tyr Pro Tyr Asn Gly Val
Glu Asp Gly Gln Lys Leu Met260 265 270Val
Ser Ser Glu Ala Ala Ala His Gly Val Leu Asn Gly Trp Ile Glu275
280 285Lys Gln Asn Gly Leu Trp Pro Gly Asp Asn Ser
Leu Asp Tyr Gly Leu290 295 300Glu Glu Ile
Lys Gln Leu Ile Ser Thr Ser Ser Cys Asn Ser Phe Leu305
310 315 320Phe Asp Glu Asn Lys Thr Gly
Gly Lys Val Met Tyr Tyr325 330208387DNAQuercus petraea
208atgggaagag ctccttgttg tgataaggcc aacgtgaaaa aagggccatg gtcacctgaa
60gaagatgcaa agctgaaaga gtatatagat aaatatggga ctggagggaa ttggattgct
120cttccacaga aagcaggtct caagagatgt ggaaaaagcc gcagattaag atggcttaac
180tatctcagac ccaacattaa acatggtgaa ttcactgaaa atgaagatag gataatatgc
240agcctctttg ctaacattgg aagcaggtgg tcaataatag cagctcagtt gccaggcagg
300acagacaatg accttaagaa ctattgtaac accaagctta agaagaaact catgggtata
360tttcctccat ctcagagaaa aggtcac
387209129PRTQuercus petraea 209Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Ile Asp Lys Tyr20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Arg Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Thr Glu Asn Glu
Asp Arg Ile Ile Cys65 70 75
80Ser Leu Phe Ala Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Leu
Lys Asn Tyr Cys Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Met Gly Ile Phe Pro Pro Ser Gln Arg Lys Gly115
120 125His210547DNAQuercus suber 210accttgctct
cctattctgg catgcttaat ttctggaatt ttctatacaa actttatgcc 60acaccctcct
taaaatatta attattggct gtttccctaa taaccatgta tatcttcaag 120cttttataat
cctctctctc tctctctctc tctctctctc tctctctcat attcttattt 180gggtttcaga
gggagaagct ataggctata gcatcaagct caaagcaaca ccatggggag 240agctccatgt
tgtgacaaag caaacgtcaa gaaaggccca tggtcacctg aagaagatgc 300caagctcaag
tcgtatatag agaagcatgg caccggtggt aactggatcg ctttaccaca 360aaaaaattgg
cctgaagaga tgcggcaaga gttgccgcct tagatggtta aactatctcc 420gtccaaatat
caagcatgga ggattttcag aagaggaaga taacataatt tgcagcctct 480atattagtat
tggaagcagg tggtctatta ttgcagcaca attgccagga agaacagata 540atgatat
547211648DNARaphanus raphanistrum 211atgggaaggg caccttgttg tgacaaagcc
aacgtgaaga aagggccttg gtctcctgag 60gaagatgcca aactcaaaga ttacatcgag
aatagtggta caggaggcaa ctggatcgct 120ttgcctcaga agattggtct aaggagatgt
gggaagagtt gcaggctaag gtggctcaac 180tatttgagac caaacatcaa acatggtggc
ttctctgagg aagaagacaa catcatttgt 240aacctctatg ttactattgg tagcaggtgg
tctataattg ctgcacaact gccgggaaga 300acggacaacg atatcaagaa ctactggaac
acgaggctga agaagaagct tctaaacaaa 360caaaggaaag agttccaaga agctcgaatg
aagcaagaga tggtgatgat gaagaggcaa 420caacaagtac aaaatggtag tacggatctt
tatctgaaaa acatgtttgg atcatcatca 480tggccattac tacaacagct tcctcatcat
caagtacctc ttgtgatgat ggaaccaaca 540aactgtaact actaccaaat gtcaccgtct
tgcaacctag aacaaaagca acttatcgct 600ctcaataaca tggtcaagat tgaagaagaa
ccggagagaa caaaccct 648212216PRTRaphanus raphanistrum
212Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Asp Tyr Ile Glu Asn Ser20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Asn Leu Tyr Val Thr Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Asn Lys
Gln Arg Lys Glu Phe Gln Glu Ala115 120
125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln Gln Gln Val Gln130
135 140Asn Gly Ser Thr Asp Leu Tyr Leu Lys
Asn Met Phe Gly Ser Ser Ser145 150 155
160Trp Pro Leu Leu Gln Gln Leu Pro His His Gln Val Pro Leu
Val Met165 170 175Met Glu Pro Thr Asn Cys
Asn Tyr Tyr Gln Met Ser Pro Ser Cys Asn180 185
190Leu Glu Gln Lys Gln Leu Ile Ala Leu Asn Asn Met Val Lys Ile
Glu195 200 205Glu Glu Pro Glu Arg Thr Asn
Pro210 215213846DNARaphanus raphanistrum 213ggatactttt
ctccattata cacattttca tgtatgtata tataaatcat cacatacacg 60caccataaca
ctaacatcct cctcctcttc atcatcaaca tagttgagaa agatcgaaaa 120aaagagagag
aaagagggat acatcaagaa caagaatcgc gaatcaagag gatgggaagg 180gcaccgtgtt
gtgataaggc taacgtgaag aaagggcctt ggtctcctga agaagatgca 240aaactcaaag
attacataga gagtagtggc acaggaggca actggatcgc tttgcctcag 300aagattggtc
taaggagatg tgggaagagt tgcagactaa ggtggctcaa ctatttaaga 360ccaaacatca
aacatggtgg cttctctgag gaagaggaca acatcatttg taacctctat 420gttactattg
gtagcaggtg gtctataata gctgcacaat tgcctggaag aacagacaat 480gacatcaaga
actattggaa cacgaggctg aagaagaagc ttcttaacaa acaaagaaaa 540gagttccaag
aagctcggat acagcagaga tggtgatgat gaaacgacaa caacaagaca 600aggacaattc
caaattaatg gtagtacgga tctttatctg aacaacatgt ttggatcatc 660agcatggcca
ttactacctc agcttcctcc tcctcatagt caagtacctc ttgtgatgat 720ggaaccaaca
agctgcaatt actaccaaac gacaccttct tgcacataga caaagcatga 780tacactcaga
cacgtcagat gagagacgag agaacaacct gatcatcatc accacgagac 840tctatg
846214522DNARaphanus raphanistrum 214atgggaaggg caccttgttg tgacaaagcc
aacgtgaaga aagggccttg gtctcctgag 60gaagatgcca aactcaaaga ttacatcgag
aatagtggta caggaggcaa ctggatcgct 120ttgcctcaga agattggtct aaggagatgt
gggaagagtt gcaggctaag gtggctcaac 180tatttgagac caaacatcaa acatggtggc
ttctctgagg aagaagacaa catcatttgt 240aacctctatg ttactattgg tagcaggtgg
tctataattg ctgcacaact gccgggaaga 300acggacaacg atatcaagaa ctactggaac
acgaggctga agaagaagct tctaaacaaa 360caaaggaaag agttccaaga agctcgaatg
aagcaagaga tggtgatgat gaagaggcaa 420caacaagtac aaaatggtag tacggatctt
tatctgaaaa acatgtttgg atcatcatca 480tggccattac tacaacagct tcctcatcat
caagtacctc tt 522215174PRTRaphanus raphanistrum
215Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Ala Lys Leu Lys Asp Tyr Ile Glu Asn Ser20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Arg35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asn
Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Asn Leu Tyr Val Thr Ile
Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Asn Lys
Gln Arg Lys Glu Phe Gln Glu Ala115 120
125Arg Met Lys Gln Glu Met Val Met Met Lys Arg Gln Gln Gln Val Gln130
135 140Asn Gly Ser Thr Asp Leu Tyr Leu Lys
Asn Met Phe Gly Ser Ser Ser145 150 155
160Trp Pro Leu Leu Gln Gln Leu Pro His His Gln Val Pro
Leu165 170216842DNARaphanus raphanistrum 216tcttcaaaaa
cgaaaccaag attcctcaaa cagaacacac agagtagcta taaggaccga 60gagagatcgt
agagagatgg ggagggctcc atgttgtgac aaagcaaatg tgaagagagg 120tccatggtct
ccggaagaag atgcaaagct taaagattac atagagaaac aaggcactgg 180tggcaactgg
attgctctcc ctcacaaagc tggtttaagg agatgtggga agagttgtag 240actgaggtgg
ttaaactatt tgaggccaaa cataagacat ggagatttct ctgaggaaga 300agacaatatt
atctgcagcc tctttgcctc cattggaaca ctaagctcaa gaagaagctc 360attgccacta
tggctcctcc accacctcat caactcgtag ccattgcctc atcatcatca 420tcaccatcat
catcacacta caacatgacc aataatcttc ctccgtataa cccaacaata 480tctacaaacc
agctgatctc acctcatctc tcacctcatc aggagatgat gatgacaatg 540atggaccaac
aacaacaact attatacaaa gaagccatgg acagtttggt aaattctcca 600aatagcaaca
agcttataat gagccatcaa gaagacagcc atgcgcaaag tacaaacaaa 660ggaataatgt
tgttgagtga tgtaagaagt gggtcaagca caacaagtac agtaacaaga 720gtgaagatgg
aacaacatga tcatcatcat gaagagagat caatgcaaga ttatggaatg 780gaggagatca
atcacttaat aagtagtagt tgtacgagta gtagtagcac agcttgtggt 840tt
842217788DNARaphanus sativus 217gctcctcatc aagtacctct tgtgatgatg
gaaccgacaa actgtaacta ctaccaaacg 60tcaccgtctt gcaacctaga acaaaagcaa
ccgatcgctc tcaataacat ggtcaagatt 120gaagaagaac cggagagaac aaaccctgat
catcatcagc atcaagattc tatcacaaac 180ccttttgatg tctccttctc tcagcttttg
ttagatcctt attactactt aggatcagga 240gaaggaggag gagaagggga ttttgctatc
atgagtagca gcacaaattc tccattacca 300aacacaagtg gtgatcaaaa tgaacatcag
cagcaagaga tttttcaatg gtttgggagt 360agtaatcttc agacagaagc aagcagcgat
atgttcttaa acaacaacat agcgaatctt 420gagaccaacg agggcacaaa attctactca
tcattagctg gcgctggggc ggctttggcc 480gaaggaacga cgagtacatc cgcagatcaa
agcacaataa gttgggagga cataacttct 540cttgttaatt cagaagatgc aagttacttc
aatgggccaa attagttatg tgcatttttc 600aataaacatt tgactttttg aggtgtttca
ttttgtataa ttatgtatca ttagcgggtc 660ggccgattaa ttatatggat tgagattaat
gttcatgcag agatttgttt agtttagttt 720caacttagct ttatgtacga accttttttt
taacaccatt cacgacatat atatgagtga 780agagatct
788218731DNARosa hybrida 218atgggaagag
ctccatgttg tgacaaagct agtgtcaaaa gggggcaatg gtctcctgag 60gaagatgaag
ctctcaagac ttaccttgag actcatggaa ctggtggcaa ttggattcat 120ctgcctcaaa
aagctgggct taggagatgt gggaagagct gccgtctgcg gtggttgaac 180tatctgaggc
cagacatcaa acatggaggc ttcactgaag aggaggacaa cattatctgt 240aacctctaca
ctcaaatggg aagccgatgg tcatacatag ctgctcaaat gcctggaaga 300acagacaatg
atgtgaagaa ctattggaac accaaattga agaagaggtt ttttagcaaa 360aacccaaatg
cctcctcact cacactgcct tgtgttccaa aacttgaaac caatgatgag 420tttcagaccc
aaatacctcc cacattgatg ttatctgatt ataattctgg attgaatgcc 480tatgatcaaa
gtttgagctt aaagccaaat ccattacaca actccaagct cacgggctta 540tcacaatttg
gtgcaagttc tgtttcaatg tctcaagatg gttctagtgt ttcagattcg 600tcttcaattg
ctgcagttga aaatggttta ctacatggaa atagcttcat aaatgaggat 660gctgctggga
tacccatgga ttttggattt ggaggattgc cttatgatgt tgtcaatcag 720atgtggtttg g
731219244PRTRosa
hybrida 219Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Ser Val Lys Arg Gly
Gln1 5 10 15Trp Ser Pro
Glu Glu Asp Glu Ala Leu Lys Thr Tyr Leu Glu Thr His20 25
30Gly Thr Gly Gly Asn Trp Ile His Leu Pro Gln Lys Ala
Gly Leu Arg35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asp Ile Lys His Gly Gly Phe Thr Glu Glu Glu Asp Asn Ile Ile
Cys65 70 75 80Asn Leu
Tyr Thr Gln Met Gly Ser Arg Trp Ser Tyr Ile Ala Ala Gln85
90 95Met Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr
Trp Asn Thr Lys100 105 110Leu Lys Lys Arg
Phe Phe Ser Lys Asn Pro Asn Ala Ser Ser Leu Thr115 120
125Leu Pro Cys Val Pro Lys Leu Glu Thr Asn Asp Glu Phe Gln
Thr Gln130 135 140Ile Pro Pro Thr Leu Met
Leu Ser Asp Tyr Asn Ser Gly Leu Asn Ala145 150
155 160Tyr Asp Gln Ser Leu Ser Leu Lys Pro Asn Pro
Leu His Asn Ser Lys165 170 175Leu Thr Gly
Leu Ser Gln Phe Gly Ala Ser Ser Val Ser Met Ser Gln180
185 190Asp Gly Ser Ser Val Ser Asp Ser Ser Ser Ile Ala
Ala Val Glu Asn195 200 205Gly Leu Leu His
Gly Asn Ser Phe Ile Asn Glu Asp Ala Ala Gly Ile210 215
220Pro Met Asp Phe Gly Phe Gly Gly Leu Pro Tyr Asp Val Val
Asn Gln225 230 235 240Met
Trp Phe Gly220362DNASaccharum
officinarummisc_feature(207)..(207)misc_feature(207)..(207)n is any
nucleotide 220atggggcgcg cgccgtgctg cgacaaggcg agcgtgaagc gggggccctg
gtcgcccgag 60gaggacgagc agctgcggag ctacgtccag cgcaacggca tcggcggcaa
ctggatcgcc 120ctgccgcaga aagcagggct gaaccggtgc ggcaagagct gccggctgcg
gtggctcaac 180tacctgcggc cggacatcaa gcacggnggc tacaccgagc aggaggaccg
gatcatctgg 240tcactctaca gctccatcgg aagcaggtgg tcgatcatcg cgtccaagct
ccccggccgg 300accgacaacg acgtcaagaa ctactggaac accaagctca agaagaaggc
catggcggcg 360gt
362221121PRTSaccharum officinarum 221Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Ser Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Glu Gln Leu Arg Ser Tyr
Val Gln Arg Asn20 25 30Gly Ile Gly Gly
Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Asn35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asp Ile Lys His Gly Gly
Tyr Thr Glu Gln Glu Asp Arg Ile Ile Trp65 70
75 80Ser Leu Tyr Ser Ser Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ser Lys85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Ala Met Ala Ala Val115
120222393DNASaccharum officinarummisc_feature(296)..(296)n is any
nucleotide 222atgggtcggg cgccgtgctg cgacaagacg agtgtgaagc gggggccgtg
gtcgcccgag 60gaggacgagc tgctgcggag ctacgtccac aaccacggca ccggcggcaa
ctggatcgcg 120ctcccgcaca aagcagggct gaaccggtgc ggcaagagct gccggctgcg
gtggctcaac 180tacctgcgcc cggacatcaa gcacggcggc tacacggagc aggaggaccg
gatcatctgc 240tccctctaca actccatcgg gagcaggtgg tccatcatcg cgtccaagct
gccggngcgg 300acggacaacg acgtcaagaa ctactgggac accaagctca agaagaaggc
natcgccatg 360caccaccacc agcagcagca gcagcaggag tac
393223131PRTSaccharum officinarumMISC_FEATURE(99)..(99)X is
any amino acid 223Met Gly Arg Ala Pro Cys Cys Asp Lys Thr Ser Val Lys Arg
Gly Pro1 5 10 15Trp Ser
Pro Glu Glu Asp Glu Leu Leu Arg Ser Tyr Val His Asn His20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro His Lys
Ala Gly Leu Asn35 40 45Arg Cys Gly Lys
Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asp Ile Lys His Gly Gly Tyr Thr Glu Gln Glu Asp Arg Ile
Ile Cys65 70 75 80Ser
Leu Tyr Asn Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ser Lys85
90 95Leu Pro Xaa Arg Thr Asp Asn Asp Val Lys Asn
Tyr Trp Asp Thr Lys100 105 110Leu Lys Lys
Lys Ala Ile Ala Met His His His Gln Gln Gln Gln Gln115
120 125Gln Glu Tyr130224726DNASaccharum officinarum
224atggggcgga cgccgtgctg cgacagggcg gccgtgaagc agggcccgtg gtcgccggag
60gaggacgagg cgctgcggag ctacgtccag cgccacggca gcggcggcaa ctggatcgcc
120atgcccaaga aagccgggct caagaggtgc ggcaagagct gcaggctgcg ctggctcaac
180tacctccgcc cggacatccg ccacggcggc ttcaccgccg aggaggacgc cgtcatcatg
240tccctccaca cccagctcgg gagcaagtgg tcgctgatag cctcgcagat ggaagggagg
300acggacaacg acgtcaagaa ccactggaac accaagctca agaagcgcct cctcgccgcc
360gccctgtccc cgttcccgcc acctcacgcc cggctgccgg cgcccgcgcc tgcttccagc
420accgcgcacg cgtcgtcgct attcccttcg ctcgacatac cgaccgtgaa gacggaggcg
480tacacctgcg acgacttcct ggcgccggcg ctccgcgacc cgttcgctgc cgccgccgac
540ggctccacct cggtcgcctc cgcggcgtcg tcggcctcca actggtcgac ggcggacaac
600ggcgccagcg aagcgtccct cttcctgctg gacttctgcg cgggccccga cctcggcgcc
660gccgctgacc agctccagct ccccggcggc tactactacc ctctcgatcc aagcttgtcg
720ccggta
726225242PRTSaccharum officinarum 225Met Gly Arg Thr Pro Cys Cys Asp Arg
Ala Ala Val Lys Gln Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Glu Ala Leu Arg Ser Tyr Val Gln Arg
His20 25 30Gly Ser Gly Gly Asn Trp Ile
Ala Met Pro Lys Lys Ala Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asp Ile Arg His Gly Gly Phe Thr Ala Glu
Glu Asp Ala Val Ile Met65 70 75
80Ser Leu His Thr Gln Leu Gly Ser Lys Trp Ser Leu Ile Ala Ser
Gln85 90 95Met Glu Gly Arg Thr Asp Asn
Asp Val Lys Asn His Trp Asn Thr Lys100 105
110Leu Lys Lys Arg Leu Leu Ala Ala Ala Leu Ser Pro Phe Pro Pro Pro115
120 125His Ala Arg Leu Pro Ala Pro Ala Pro
Ala Ser Ser Thr Ala His Ala130 135 140Ser
Ser Leu Phe Pro Ser Leu Asp Ile Pro Thr Val Lys Thr Glu Ala145
150 155 160Tyr Thr Cys Asp Asp Phe
Leu Ala Pro Ala Leu Arg Asp Pro Phe Ala165 170
175Ala Ala Ala Asp Gly Ser Thr Ser Val Ala Ser Ala Ala Ser Ser
Ala180 185 190Ser Asn Trp Ser Thr Ala Asp
Asn Gly Ala Ser Glu Ala Ser Leu Phe195 200
205Leu Leu Asp Phe Cys Ala Gly Pro Asp Leu Gly Ala Ala Ala Asp Gln210
215 220Leu Gln Leu Pro Gly Gly Tyr Tyr Tyr
Pro Leu Asp Pro Ser Leu Ser225 230 235
240Pro Val226413DNASecale cereale 226atggggaggg cgccgtgctg
cgacagggcg gcggtgaaga ggggcccgtg gtcgccggag 60gaggacgacg cgctgcgcga
ctacatgcag cgctacggaa acaccggcag ctggatcacg 120ctccccatga gaggtgggct
caagaggtgc ggcaagagct gcaggctgcg gtggctcaac 180tacctccgcc ccgacatccg
ccacggcggc ttcaccgacg aggaggacac catcatctac 240tccctctaca gccagctggg
cagcaagtgg tcgctgatag cgtcgcagct ggagaggagg 300acggacaacg acgtcaagaa
ccactggaac accaagctca agaagcggct cgccgccgcg 360gccgccgcct tctccacccg
ccgttccccg tccttcatgc cgctaccggc gcc 413227138PRTSecale cereale
227Met Gly Arg Ala Pro Cys Cys Asp Arg Ala Ala Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp
Asp Ala Leu Arg Asp Tyr Met Gln Arg Tyr20 25
30Gly Asn Thr Gly Ser Trp Ile Thr Leu Pro Met Arg Gly Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg
Trp Leu Asn Tyr Leu Arg Pro50 55 60Asp
Ile Arg His Gly Gly Phe Thr Asp Glu Glu Asp Thr Ile Ile Tyr65
70 75 80Ser Leu Tyr Ser Gln Leu
Gly Ser Lys Trp Ser Leu Ile Ala Ser Gln85 90
95Leu Glu Arg Arg Thr Asp Asn Asp Val Lys Asn His Trp Asn Thr Lys100
105 110Leu Lys Lys Arg Leu Ala Ala Ala
Ala Ala Ala Phe Ser Thr Arg Arg115 120
125Ser Pro Ser Phe Met Pro Leu Pro Ala Pro130
135228555DNASolanum lycopersicum 228atgggaagag ctccttgttg tgataagaat
aatgttaaaa gagggccatg gtcaccagaa 60gaagatgcta agcttaaaga attcattgaa
aaatatggaa ctggtggtaa ttggattgct 120cttcctctaa aagctggatt aaagagatgt
ggaaagagct gcagattaag atggctaaat 180tatctaaggc caaatataaa gcatggtgat
ttttctgatg aagaagatag ggtaatatgc 240agtttatatg ccagcattgg gagcaggtgg
tcaattatag cagctcagtt accagggagg 300accgacaatg atattaaaaa ctattggaac
acaaagctta agaagaagct catgggtttt 360attcagtcat catctaatat taaccagaga
actaaatcac ctaatttatt attccctcct 420acaagtactc ttcaaacaac ttttcaatcc
caatctcaag catcaatttc aaatctttta 480agagattcat atgtagagcc cattccacta
gtccaaccaa atttcatgta caacaacaat 540aacatgatga acttt
555229185PRTSolanum lycopersicum 229Met
Gly Arg Ala Pro Cys Cys Asp Lys Asn Asn Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Ala
Lys Leu Lys Glu Phe Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Leu Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Asp Phe Ser Asp Glu Glu Asp Arg Val Ile Cys65
70 75 80Ser Leu Tyr Ala Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Met Gly Phe Ile
Gln Ser Ser Ser Asn Ile Asn115 120 125Gln
Arg Thr Lys Ser Pro Asn Leu Leu Phe Pro Pro Thr Ser Thr Leu130
135 140Gln Thr Thr Phe Gln Ser Gln Ser Gln Ala Ser
Ile Ser Asn Leu Leu145 150 155
160Arg Asp Ser Tyr Val Glu Pro Ile Pro Leu Val Gln Pro Asn Phe
Met165 170 175Tyr Asn Asn Asn Asn Met Met
Asn Phe180 185230476DNASolanum tuberosum 230atgggaagag
ctccatgttg tgataaagca aatgtgaaga aagggccatg gtcacctgaa 60gaagatgcaa
aattaaaaga atatattaac aaatttggca ctggtggaaa ttggattgct 120cttccacaaa
aagctgggct aagaagatgt ggaaaaagct gcagattaag atggctaaat 180tatcttaggc
caaatattaa acacggagag ttttcagacg aagaagacag aatcatttgc 240agcctttatg
ctaacattgg aagcaggtgg tcaataatag cagctcaatt accaggcagg 300acagataatg
atatcaaaaa ctattggaac acaaagctca agaagaaatt aatgggattt 360gtctcttcat
ctcacaagat cattaggcct cttaatcatc atcattatca acaccaaatt 420cccactaatt
gttacaataa ttattcctca caaatttcac ttcttcaagc tccatc
476231159PRTSolanum tuberosum 231Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Asn Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr Ile Asn Lys Phe20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ala Gly Leu Arg35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Glu Phe Ser Asp Glu Glu
Asp Arg Ile Ile Cys65 70 75
80Ser Leu Tyr Ala Asn Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Met Gly Phe Val Ser Ser Ser His Lys Ile Ile115
120 125Arg Pro Leu Asn His His His Tyr Gln His Gln
Ile Pro Thr Asn Cys130 135 140Tyr Asn Asn
Tyr Ser Ser Gln Ile Ser Leu Leu Gln Ala Pro Ser145 150
155232471DNASorghum bicolor 232atggggagag ctccgtgctg
cgacaaggct actgtgaaga agggtccgtg gtcgccggag 60gaggacgcca agctcaagtc
ctacatcgag cagaacggca ccggcggtaa ctggatagcc 120ctgcctcaga agataggtct
gaagaggtgt ggcaagagct gccgcctccg gtggctcaac 180tacctccggc caaacatcaa
gcacggtggg ttctccgagg aggaagacag aataatcctt 240agcctctaca tcagcatagg
cagcaggtgg tcgataatag cggcgcagct gccggggagg 300acggacaatg acataaagaa
ctactggaac acgaggctca agaagaagct cttcggcaag 360caatcgcgca aggatcaacg
gcagcatcag ttcatgcgcc agcaggcggc agcggcaaac 420gatgggatga tgaagcaaga
agcagcacaa cccgggatgc acccggaaca a 471233157PRTSorghum
bicolor 233Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Thr Val Lys Lys Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Ala Lys Leu Lys Ser Tyr Ile Glu Gln Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Gly Phe Ser Glu Glu Glu Asp Arg Ile Ile
Leu65 70 75 80Ser Leu
Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Lys Lys
Leu Phe Gly Lys Gln Ser Arg Lys Asp Gln Arg Gln115 120
125His Gln Phe Met Arg Gln Gln Ala Ala Ala Ala Asn Asp Gly
Met Met130 135 140Lys Gln Glu Ala Ala Gln
Pro Gly Met His Pro Glu Gln145 150
155234609DNASorghum bicolor 234atgggacgga cgccgtgctg cgacagggcg
gccgtgaagc ggggcccgtg gtcgccggag 60gaggacgagg cgctgcggag ctacgtccag
cgccacggca gcggcggcaa ctggatcgcc 120atgcccaaga aagccgggct caagaggtgc
ggcaagagct gcaggctgcg ctggctcaac 180tacctccgcc ccgacatccg ccacggcggc
ttcaccgccg aggaggacgc cgtcatcttg 240tccctctaca cccagctcgg gagcaagtgg
tcgctgatag cctcgcagat ggaggggagg 300acggacaacg acgtcaagaa ccactggaac
accaagctca agaagcgcct cctcgccgcc 360gcgggcacct tctcgccggc accccacgcc
cgggtgccgg cacccgcgcc tgctttccgt 420accggacacg tttccccgtt gtcactggtc
ccatcggttg gcataccgac cgtgaagaac 480cgagcgtaca cctacgacga cttccttggc
gccggcgctc gtgacccgtt cgccgccgcc 540gacgggtcca cctcgggcgc cttccggggg
tgtcggcctc caatggtcca cgggggacaa 600cgggcccag
609235203PRTSorghum bicolor 235Met Gly
Arg Thr Pro Cys Cys Asp Arg Ala Ala Val Lys Arg Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Glu Ala
Leu Arg Ser Tyr Val Gln Arg His20 25
30Gly Ser Gly Gly Asn Trp Ile Ala Met Pro Lys Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asp Ile
Arg His Gly Gly Phe Thr Ala Glu Glu Asp Ala Val Ile Leu65
70 75 80Ser Leu Tyr Thr Gln Leu Gly
Ser Lys Trp Ser Leu Ile Ala Ser Gln85 90
95Met Glu Gly Arg Thr Asp Asn Asp Val Lys Asn His Trp Asn Thr Lys100
105 110Leu Lys Lys Arg Leu Leu Ala Ala Ala
Gly Thr Phe Ser Pro Ala Pro115 120 125His
Ala Arg Val Pro Ala Pro Ala Pro Ala Phe Arg Thr Gly His Val130
135 140Ser Pro Leu Ser Leu Val Pro Ser Val Gly Ile
Pro Thr Val Lys Asn145 150 155
160Arg Ala Tyr Thr Tyr Asp Asp Phe Leu Gly Ala Gly Ala Arg Asp
Pro165 170 175Phe Ala Ala Ala Asp Gly Ser
Thr Ser Gly Ala Phe Arg Gly Cys Arg180 185
190Pro Pro Met Val His Gly Gly Gln Arg Ala Gln195
200236396DNASorghum propinquum 236atggggcgcg cgccgtgctg cgacaaggcg
agcgtgaagc gggggccctg gtcgcccgag 60gaggacgagc agctgcggag ctacgtccag
cgcaacggca tcggcggcaa ctggatcgcc 120ctgccgcaga aagcagggct gaaccggtgc
ggcaagagct gccggctgcg gtggctcaac 180tacctgcggc cggacatcaa gcacgggggc
tacaccgagg aggaggaccg gatcatctgg 240tcgctctaca gctccatcgg cagcaggtgg
tccatcatcg cgtccaagct cccgggccgg 300accgacaacg acgtcaagaa ctactggaac
accaagctca agaagaaggc tatggccatg 360gcggcggcgg ccggcctcag cgccagtagc
agcgga 396237132PRTSorghum propinquum 237Met
Gly Arg Ala Pro Cys Cys Asp Lys Ala Ser Val Lys Arg Gly Pro1
5 10 15Trp Ser Pro Glu Glu Asp Glu
Gln Leu Arg Ser Tyr Val Gln Arg Asn20 25
30Gly Ile Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Asn35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asp Ile
Lys His Gly Gly Tyr Thr Glu Glu Glu Asp Arg Ile Ile Trp65
70 75 80Ser Leu Tyr Ser Ser Ile Gly
Ser Arg Trp Ser Ile Ile Ala Ser Lys85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Val Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Ala Met Ala Met Ala
Ala Ala Ala Gly Leu Ser Ala115 120 125Ser
Ser Ser Gly130238803DNATaraxacum officinale 238atggggagag caccttgctg
tgataaagac aatgtaaaga aaggtccatg ggctcctgaa 60gaagatgcta agcttaaagc
atatattgaa gaacatggca ccggtggtaa ctggattgct 120ttgcctcaga aaatcgggct
caagagatgt ggaaagagtt gtcgcctccg gtggctaaat 180tacctccggc caaatatcaa
gcatggaggt ttctccgatg aagaagatcg cataatttgc 240agcctctata ttagcatagg
gagcaggtgg tctataattg cggcacaatt accggggcga 300actgataacg atataaagaa
ctactggaac acaaggctga agaagaagct cttgggtgca 360ggtaaacaga gaaaagaaga
aatatatgga agaaaacgag agttactaat gaaaaaagca 420aggtcgacct catcggatat
taatattcca tcatcgatag ttgtttccgg caaccaagac 480ccagcttatt ggccggaaat
gccgttgatg ccacctgtac cctactccaa taatcaagaa 540ccgtgttttg ttaacgatca
tgcttccata cgaaaactac tcatcaagct tggaggacga 600tttactaatt ccgatcatga
taatggaagt caatccacca aacacatcgt cgactctcac 660tttccgatcg acactttcat
tgatcttcag ggatcatcac aagaccatga tcagaagtta 720attaatgcaa ttagtacgcc
tctttcgtct tcttcatcat caatgttgct aaatagccca 780tacaacatga tcatctttcc
atc 803239268PRTTaraxacum
officinale 239Met Gly Arg Ala Pro Cys Cys Asp Lys Asp Asn Val Lys Lys Gly
Pro1 5 10 15Trp Ala Pro
Glu Glu Asp Ala Lys Leu Lys Ala Tyr Ile Glu Glu His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Gly Phe Ser Asp Glu Glu Asp Arg Ile Ile
Cys65 70 75 80Ser Leu
Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Lys Lys
Leu Leu Gly Ala Gly Lys Gln Arg Lys Glu Glu Ile115 120
125Tyr Gly Arg Lys Arg Glu Leu Leu Met Lys Lys Ala Arg Ser
Thr Ser130 135 140Ser Asp Ile Asn Ile Pro
Ser Ser Ile Val Val Ser Gly Asn Gln Asp145 150
155 160Pro Ala Tyr Trp Pro Glu Met Pro Leu Met Pro
Pro Val Pro Tyr Ser165 170 175Asn Asn Gln
Glu Pro Cys Phe Val Asn Asp His Ala Ser Ile Arg Lys180
185 190Leu Leu Ile Lys Leu Gly Gly Arg Phe Thr Asn Ser
Asp His Asp Asn195 200 205Gly Ser Gln Ser
Thr Lys His Ile Val Asp Ser His Phe Pro Ile Asp210 215
220Thr Phe Ile Asp Leu Gln Gly Ser Ser Gln Asp His Asp Gln
Lys Leu225 230 235 240Ile
Asn Ala Ile Ser Thr Pro Leu Ser Ser Ser Ser Ser Ser Met Leu245
250 255Leu Asn Ser Pro Tyr Asn Met Ile Ile Phe Pro
Ser260 265240632DNATriphysaria pusilla 240atggggagag
cgccttgttg tgacaaagcc aatgtaaaga gaggtccatg gtcacctgat 60gaagacgatg
ctcttaaatc ctacattcat aagcatggta ctggtggcaa ctggattgcc 120ttgcctaaca
aaattgggtt gaagagatgt ggaaagagtt gtcggctgag atggcttaat 180tatctgcgcc
caaacatcaa gcatggtggt tttactgaag aagaagataa ctttatttgc 240aacctctata
ttagtattgg aagcagatgg tctatcatcg cagctcaatt gcctggaaga 300acagataacg
atatcaaaaa ctactggaac actaggctaa ggaagaaact cttaggcagg 360caacaaaggc
gagaacaacg atccactaaa aacaatatta ttagagtggc cagttctgat 420caaaatcatc
aaaattacac attagcccat gaatattatt cttcaaattt cagtttatac 480cctcagccgc
cgccgccacc accggcacat atggtggcgc cttcctttat ccaggcggca 540tcgaatccac
aaaccaatca catttccttc atttcaccat tagtcgaaga accaacgaca 600tttgcagctt
gttatgatga tcaatcacaa gt
632241211PRTTriphysaria pusilla 241Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Asn Val Lys Arg Gly Pro1 5 10
15Trp Ser Pro Asp Glu Asp Asp Ala Leu Lys Ser Tyr Ile His Lys
His20 25 30Gly Thr Gly Gly Asn Trp Ile
Ala Leu Pro Asn Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Gly Phe Thr Glu Glu
Glu Asp Asn Phe Ile Cys65 70 75
80Asn Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Arg100 105
110Leu Arg Lys Lys Leu Leu Gly Arg Gln Gln Arg Arg Glu Gln Arg Ser115
120 125Thr Lys Asn Asn Ile Ile Arg Val Ala
Ser Ser Asp Gln Asn His Gln130 135 140Asn
Tyr Thr Leu Ala His Glu Tyr Tyr Ser Ser Asn Phe Ser Leu Tyr145
150 155 160Pro Gln Pro Pro Pro Pro
Pro Pro Ala His Met Val Ala Pro Ser Phe165 170
175Ile Gln Ala Ala Ser Asn Pro Gln Thr Asn His Ile Ser Phe Ile
Ser180 185 190Pro Leu Val Glu Glu Pro Thr
Thr Phe Ala Ala Cys Tyr Asp Asp Gln195 200
205Ser Gln Val210242651DNATriphysaria pusilla 242atgggaagag caccgtgctg
tgacaaagcc gatgtgaaga agggaccctg gtcaactgaa 60gaagatgcaa cattgaaagc
ttatatcgag aaatatggaa ctggtggaaa ttggattgcc 120cttcctcaga aaatcgggct
taagagatgt gggaagagtt gcagactgag gtggttgaat 180tacttgaggc ctaatctcaa
gcatggtggt tttactgatg aagaagataa tgttatttgc 240agcctctata tcagcattgg
cagcaggtgg tctataatag ctgcccagtt accaggaaga 300acagataatg acatcgagaa
ttactggaac accaggctca agaaaaaatt actcggcagg 360cgcaaacaag ctaattataa
cactaaccga ctctcggctt taggcggcca ggatgagcca 420aaagaagatg cctataactt
gcaaaaccta agcagttcag ccctcgaaag actccaactc 480cacatgcatc ttcaaaccct
tcaaaaccat aattcctcat tttattccaa taatccggca 540atttggccca agatgatcga
aaccctaaac tccttaaacg ataaacaaaa catgaataat 600ctgatgcagc aaattatccc
ctccacccct catcaactct gtccgcctgt a 651243217PRTTriphysaria
pusilla 243Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asp Val Lys Lys Gly
Pro1 5 10 15Trp Ser Thr
Glu Glu Asp Ala Thr Leu Lys Ala Tyr Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Leu Lys His Gly Gly Phe Thr Asp Glu Glu Asp Asn Val Ile
Cys65 70 75 80Ser Leu
Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Glu Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Lys Lys
Leu Leu Gly Arg Arg Lys Gln Ala Asn Tyr Asn Thr115 120
125Asn Arg Leu Ser Ala Leu Gly Gly Gln Asp Glu Pro Lys Glu
Asp Ala130 135 140Tyr Asn Leu Gln Asn Leu
Ser Ser Ser Ala Leu Glu Arg Leu Gln Leu145 150
155 160His Met His Leu Gln Thr Leu Gln Asn His Asn
Ser Ser Phe Tyr Ser165 170 175Asn Asn Pro
Ala Ile Trp Pro Lys Met Ile Glu Thr Leu Asn Ser Leu180
185 190Asn Asp Lys Gln Asn Met Asn Asn Leu Met Gln Gln
Ile Ile Pro Ser195 200 205Thr Pro His Gln
Leu Cys Pro Pro Val210 215244636DNATriphysaria
pusillamisc_feature(609)..(609)n is any nucleotide 244atgggaagag
caccttgctg tgacaaggca aaagtgaaga aagggccatg gtctcctgat 60gaagattcta
aactaaaaga atatatacaa aaacatggga ctattggaaa ttggattgct 120ctcccacaca
aagttggtct taaaagatgt ggcaaaagct gcagattgag atggctcaat 180tatctgagac
caaatattaa gcatggagat ttttcagatg atgaggatac aattatttgc 240acacttttta
atagcattgg aagcaggtgg tcagtaatag cagcacaatt accaggcaga 300acagacaatg
atatcaagaa ctactggaac acaaagctca aaaagaaact catggccaat 360aatattttta
tgcctccaat ttctacccat taccaaatga ggcctcacca agctaccatg 420taccctacaa
caccaaataa ctcctaccca ataccatatg aatatgccat tagcaattgt 480tatactaatt
cgaccccaat taaaatttat cagcccatat cccctaactt caccacgcta 540ttttcttcaa
actcgatcgg cccggatggg ttaatgggcc cggcccggcc tggatgggtt 600acaaagatna
tgaagaaatc ttccctgtgt tggtgg
636245212PRTTriphysaria pusillaMISC_FEATURE(203)..(203)X is any amino
acid 245Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Lys Val Lys Lys Gly Pro1
5 10 15Trp Ser Pro Asp
Glu Asp Ser Lys Leu Lys Glu Tyr Ile Gln Lys His20 25
30Gly Thr Ile Gly Asn Trp Ile Ala Leu Pro His Lys Val Gly
Leu Lys35 40 45Arg Cys Gly Lys Ser Cys
Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Lys His Gly Asp Phe Ser Asp Asp Glu Asp Thr Ile Ile Cys65
70 75 80Thr Leu Phe Asn
Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn
Thr Lys100 105 110Leu Lys Lys Lys Leu Met
Ala Asn Asn Ile Phe Met Pro Pro Ile Ser115 120
125Thr His Tyr Gln Met Arg Pro His Gln Ala Thr Met Tyr Pro Thr
Thr130 135 140Pro Asn Asn Ser Tyr Pro Ile
Pro Tyr Glu Tyr Ala Ile Ser Asn Cys145 150
155 160Tyr Thr Asn Ser Thr Pro Ile Lys Ile Tyr Gln Pro
Ile Ser Pro Asn165 170 175Phe Thr Thr Leu
Phe Ser Ser Asn Ser Ile Gly Pro Asp Gly Leu Met180 185
190Gly Pro Ala Arg Pro Gly Trp Val Thr Lys Xaa Met Lys Lys
Ser Ser195 200 205Leu Cys Trp
Trp210246419DNATriphysaria versicolor 246ggaagagcac cttgctgtga caaggcaaaa
gtgaagaaag ggccatggtc tcctgatgaa 60gattctaaac taaaggaata tatacaaaaa
catgggacta ttggaaattg gattgctctc 120ccacacaaag ttggtcttaa aagatgtggc
aaaagctgca gattgagatg gctcaattat 180ctgagaccaa atattaagca tggagatttt
tcagatgatg aggatacaat tatttgcaca 240ctttttaata gcattggaag caggtggtca
gtaatagcag cacaattacc aggcagaaca 300gacaatgata tcaagaacta ctggaacaca
aagctcaaaa agaaactcat ggccaataat 360atttttatgc ctccaatttc tacccattac
caaatgaggc ctcaccaagc taccatgta 419247606DNATriphysaria versicolor
247atgggaagag caccttgctg tgacaaggca aaagtgaaga aagggccatg gtctcctgat
60gaagattcta aactaaaaga atatatacaa aaacatggga ctattggaaa ttggattgct
120ctcccacaca aagttggtct taaaagatgt ggcaaaagct gcagattgag atggctcaat
180tatctgagac caaatattaa gcatggagat ttttcagatg atgaggatac aattatttgc
240acacttttta atagcattgg aagcaggtgg tcagtaatag cagcacaatt accaggcaga
300acagacaatg atatcaagaa ctactggaac acaaagctca aaaagaaact catggccaat
360aatattttta tgcctccaat ttctacccat taccaaatga ggcctcacca agctaccatg
420taccatacaa caccaaataa ctcctaccca ataccatatg aatatgacat tagcaattgt
480tatactaatt cgaccccaat taaaatttat cagcccatat cccctaactt caccacgcta
540ttttcttcaa actcgatcgg cccggatggg ttaatgggcc cggcccggcc tggatgggtt
600aacaaa
606248202PRTTriphysaria versicolor 248Met Gly Arg Ala Pro Cys Cys Asp Lys
Ala Lys Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Asp Glu Asp Ser Lys Leu Lys Glu Tyr Ile Gln Lys
His20 25 30Gly Thr Ile Gly Asn Trp Ile
Ala Leu Pro His Lys Val Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Asp Phe Ser Asp Asp
Glu Asp Thr Ile Ile Cys65 70 75
80Thr Leu Phe Asn Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala
Gln85 90 95Leu Pro Gly Arg Thr Asp Asn
Asp Ile Lys Asn Tyr Trp Asn Thr Lys100 105
110Leu Lys Lys Lys Leu Met Ala Asn Asn Ile Phe Met Pro Pro Ile Ser115
120 125Thr His Tyr Gln Met Arg Pro His Gln
Ala Thr Met Tyr His Thr Thr130 135 140Pro
Asn Asn Ser Tyr Pro Ile Pro Tyr Glu Tyr Asp Ile Ser Asn Cys145
150 155 160Tyr Thr Asn Ser Thr Pro
Ile Lys Ile Tyr Gln Pro Ile Ser Pro Asn165 170
175Phe Thr Thr Leu Phe Ser Ser Asn Ser Ile Gly Pro Asp Gly Leu
Met180 185 190Gly Pro Ala Arg Pro Gly Trp
Val Asn Lys195 200249903DNATricticum aestivum
249atggggaggg cgccgtgctg cgacaaggcg acggtgaaga agggcccctg gtcgccggag
60gaggacgcca agctcaaggc ctacatcgac gagaatggca ccggcggcaa ctggatcgcc
120ctgccgcaga agatcgggct gaagaggtgc ggcaaaagct gtaggctcag atggctcaac
180tatctgaggc caaacatcaa gcacggcgac ttcacagagg aagaggaaca catcatttgc
240agcctctaca ttagcatcgg cagcaggtgg tcgatcatcg cggcgcagct gccgggcaga
300acggacaacg acatcaagaa ctactggaac accaagctca agaagaagct cctcggcaag
360cgcgcgccgt cccgccgcct gcagcgcgcc aaccaagacg cgccgatgcc ctactcctac
420ctggcggcgg gcggcggcag cgccagcagc agcgggaacg ctagcggacg tcgtggaagc
480cgagcaacag ctaaaccctt ccgctggagc gagctacgac atgccggtgc cgggcggttt
540cgaggagcga tccgcctcca agctagggtt tggctctgct gccggcgagg ccgctggagt
600ggccagtacc gtcgagatga gctcggcgtc aatggtaggc ggtggcttcg gctacggcca
660cgtcgacgag ctgtacgact tcctctacag caagcagctt gccgcggccg gagcctttca
720aggcggagtt cctcccctgc cggagctgca gtgcccggat ggcggcgccg tcgttggtgc
780cgacgagaag ttctccacgt ggatggcgtc ttgcgatcac tatgttccta cgaccggcgg
840ccagcagctc cagctccaag gtggaaactc aatgaatctg caggatttcg ttttagggta
900tga
903250300PRTTricticum aestivum 250Met Gly Arg Ala Pro Cys Cys Asp Lys Ala
Thr Val Lys Lys Gly Pro1 5 10
15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ala Tyr Ile Asp Glu Asn20
25 30Gly Thr Gly Gly Asn Trp Ile Ala Leu
Pro Gln Lys Ile Gly Leu Lys35 40 45Arg
Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50
55 60Asn Ile Lys His Gly Asp Phe Thr Glu Glu Glu
Glu His Ile Ile Cys65 70 75
80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser Ile Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile
Lys Asn Tyr Trp Asn Thr Lys100 105 110Leu
Lys Lys Lys Leu Leu Gly Lys Arg Ala Pro Ser Arg Arg Leu Gln115
120 125Arg Ala Asn Gln Asp Ala Pro Met Pro Tyr Ser
Tyr Leu Ala Ala Gly130 135 140Gly Gly Ser
Ala Ser Ser Ser Gly Asn Ala Ser Gly Arg Arg Gly Ser145
150 155 160Arg Ala Thr Ala Lys Pro Phe
Arg Trp Ser Glu Leu Arg His Ala Gly165 170
175Ala Gly Arg Phe Arg Gly Ala Ile Arg Leu Gln Ala Arg Val Trp Leu180
185 190Cys Cys Arg Arg Gly Arg Trp Ser Gly
Gln Tyr Arg Arg Asp Glu Leu195 200 205Gly
Val Asn Gly Arg Arg Trp Leu Arg Leu Arg Pro Arg Arg Arg Ala210
215 220Val Arg Leu Pro Leu Gln Gln Ala Ala Cys Arg
Gly Arg Ser Leu Ser225 230 235
240Arg Arg Ser Ser Ser Pro Ala Gly Ala Ala Val Pro Gly Trp Arg
Arg245 250 255Arg Arg Trp Cys Arg Arg Glu
Val Leu His Val Asp Gly Val Leu Arg260 265
270Ser Leu Cys Ser Tyr Asp Arg Arg Pro Ala Ala Pro Ala Pro Arg Trp275
280 285Lys Leu Asn Glu Ser Ala Gly Phe Arg
Phe Arg Val290 295 300251648DNAVaccinium
corymbosum 251atgggaagag ctccatgttg tgacaaggca aatgtgaaga aagggccatg
gtcccctgaa 60gaagatgcaa agctaaaaga gtacatagat aagcatggga ctggagggaa
ttggattgct 120ctccctcaaa aagcagccct caaaagatgt ggaaaaagct gcagattgag
atggctcaac 180tatcttagac caaatatcaa acatggagag ttctctgatg atgaagatca
taaaatctgc 240agcctcttca ctactattgg aagcaggtgg tccataatag cagctcaatt
gccaggaaga 300acagataacg atatcaaaaa ctattggaac actaagctca agaagaaatt
catgggaaaa 360ttattccctc atccatctca aaatcagaga aaatcccacc aatccccaaa
ttacccttca 420tttttcccat tatattcacc acaatcccct acccaaacta gccatttcac
aacctacgaa 480aaccccattt cactcccacc aagttttttg aattccagtg ctaatttttc
ctctgatgcc 540actccaacaa ctattctaca agcccaagag agttatgtgg gtcccatgtt
tggaggtgaa 600gcaagttcca gttctgatgg gagttgcagc aaccaggtga tcagaggg
648252216PRTVaccinium corymbosum 252Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Glu Tyr
Ile Asp Lys His20 25 30Gly Thr Gly Gly
Asn Trp Ile Ala Leu Pro Gln Lys Ala Ala Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asn Ile Lys His Gly Glu
Phe Ser Asp Asp Glu Asp His Lys Ile Cys65 70
75 80Ser Leu Phe Thr Thr Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ala Gln85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Phe Met Gly Lys Leu Phe Pro His
Pro Ser Gln Asn115 120 125Gln Arg Lys Ser
His Gln Ser Pro Asn Tyr Pro Ser Phe Phe Pro Leu130 135
140Tyr Ser Pro Gln Ser Pro Thr Gln Thr Ser His Phe Thr Thr
Tyr Glu145 150 155 160Asn
Pro Ile Ser Leu Pro Pro Ser Phe Leu Asn Ser Ser Ala Asn Phe165
170 175Ser Ser Asp Ala Thr Pro Thr Thr Ile Leu Gln
Ala Gln Glu Ser Tyr180 185 190Val Gly Pro
Met Phe Gly Gly Glu Ala Ser Ser Ser Ser Asp Gly Ser195
200 205Cys Ser Asn Gln Val Ile Arg Gly210
2152531020DNAVitis vinifera 253atggggagag ctccttgctg tgacaaagcc
aacgtcaaga aagggccatg gtcaccagag 60gaagatgcca agctcaagac ttatattgag
aaaaacggca ctgggggcaa ctggatagcc 120ttacctcaaa agattggcct taagcgatgt
gggaagagct gccgcctccg ctggttgaat 180tatctcaggc ccaatatcaa acatggagga
ttttcagagg aggaggataa catcatttgc 240agcctctata taagtattgg aagcaggtgg
tccgtcatcg cgactcaatt accgggaaga 300actgataatg atattaagaa ctactggaac
acaaggttga agaaaaagct actcggcaag 360cagcggaaag aacaacaggc tcgccgagct
aactacgtca agcaagagat gaagagagaa 420cctgggattt ttgtggttcc tgatcaggcc
gttggcagaa acccttattg ggcggagcta 480cctgcagtat ctgtgcatcc gaaccaagat
ccggatatca aggaccaagc atccattaag 540aaattgctga tcaagctggg aggaagattt
ggtgatgatg gtcaacaacc aggcagtaac 600agcataaact ttcagtaccc tcttgatatt
tccaccgcac aagatggtcc atatgagaca 660tccatgaaca tgtttgcctc atctacgtcg
accatgaatt caatcaacag tacgggttcc 720caattgccaa acacccacta tgacgtcaat
ggatcagctt caaatgttct tcaagggtat 780aacagcctcc taaatgaact ggtgtatggc
aacccacaac aattagatgg ctctgggagc 840ttttacggaa tagaagacat cgccaatggt
agcactggga ctagctccgc agaaagcagc 900agttggggag atataaactc tctggtttac
ccctctatca tctccaatta tggagttggc 960caacaagggc cattacaaga ttctggtttt
gaagagttga ggtacctggg ctcccaatag 1020254339PRTVitis vinifera 254Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys
Leu Lys Thr Tyr Ile Glu Lys Asn20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile Cys65
70 75 80Ser Leu Tyr Ile Ser Ile Gly
Ser Arg Trp Ser Val Ile Ala Thr Gln85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Arg100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Gln
Arg Lys Glu Gln Gln Ala Arg115 120 125Arg
Ala Asn Tyr Val Lys Gln Glu Met Lys Arg Glu Pro Gly Ile Phe130
135 140Val Val Pro Asp Gln Ala Val Gly Arg Asn Pro
Tyr Trp Ala Glu Leu145 150 155
160Pro Ala Val Ser Val His Pro Asn Gln Asp Pro Asp Ile Lys Asp
Gln165 170 175Ala Ser Ile Lys Lys Leu Leu
Ile Lys Leu Gly Gly Arg Phe Gly Asp180 185
190Asp Gly Gln Gln Pro Gly Ser Asn Ser Ile Asn Phe Gln Tyr Pro Leu195
200 205Asp Ile Ser Thr Ala Gln Asp Gly Pro
Tyr Glu Thr Ser Met Asn Met210 215 220Phe
Ala Ser Ser Thr Ser Thr Met Asn Ser Ile Asn Ser Thr Gly Ser225
230 235 240Gln Leu Pro Asn Thr His
Tyr Asp Val Asn Gly Ser Ala Ser Asn Val245 250
255Leu Gln Gly Tyr Asn Ser Leu Leu Asn Glu Leu Val Tyr Gly Asn
Pro260 265 270Gln Gln Leu Asp Gly Ser Gly
Ser Phe Tyr Gly Ile Glu Asp Ile Ala275 280
285Asn Gly Ser Thr Gly Thr Ser Ser Ala Glu Ser Ser Ser Trp Gly Asp290
295 300Ile Asn Ser Leu Val Tyr Pro Ser Ile
Ile Ser Asn Tyr Gly Val Gly305 310 315
320Gln Gln Gly Pro Leu Gln Asp Ser Gly Phe Glu Glu Leu Arg
Tyr Leu325 330 335Gly Ser
Gln2551101DNAVitis vinifera 255atgggtcgag ctccatgctg tgacaaagcc
aacgtgaaga agggaccatg gtcccctgag 60gaggatgcta agcttaaaac ttacattgag
caatatggta ctggaggcaa ctggattgct 120cttccccaga aaataggact taagagatgt
ggaaaaagtt gcagactgag atggttgaat 180tacttgagac ctaacatcaa gcatggtgga
ttctctgaag aggaagacag catcatctgc 240aacctctata taagtattgt tttttttttt
ttgtggtcgg taattgcagc ccaattgcct 300ggaagaacag ataatgacat taagaactac
tggaacacaa gactgaagaa gaagttgctt 360ggaaagcgta aacaatctca tttcaatcga
ctttcggttg caggtcagga ccctaaagat 420gcaactggtg tagaagataa tccatattca
caggccttaa gcaattctgc ccttgaaaga 480ctgcagctcc atatgcagct tcagagcctg
caacacccct ttgctttcta caataatcct 540gcactgtggc ccaagctgca tcccctccaa
gaaaagctga tccagaacct ccagaatgag 600ctccccagcc ctctgatgca acatacccca
cctagttctg acccgatggg acaagaacag 660aagtctgggt tctatgaacc gcccactgct
tccatcacac ttcaacaaga gtatccgaaa 720accaacaatc caaagggaga tgaatttgag
aaatctttaa atgttttgcc ctcctcagac 780agttcgatgg cctacaacag cgaaaatatg
gtgggtgcac ctattatgtc taaaccagat 840ggtgtagacc aatccaacat tgcacaacaa
gcagtatcaa cctttcaagc tgagcttgat 900gacttcctta ataacaaaat ggttggtttc
ctcccacagg aagagcagat gactgaaata 960gatggctcca aggacagctt ggcttggtgg
tctaatgact ttgacacaaa atcggcatcc 1020tcacactctt gggattccac ttctgttatt
cagtctgagg ggatgttcca agattatgta 1080ttaggttaca atctgcagtg a
1101256366PRTVitis vinifera 256Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys
Leu Lys Thr Tyr Ile Glu Gln Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Gly Phe Ser Glu Glu Glu Asp Ser Ile Ile Cys65
70 75 80Asn Leu Tyr Ile Ser Ile Val
Phe Phe Phe Leu Trp Ser Val Ile Ala85 90
95Ala Gln Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn100
105 110Thr Arg Leu Lys Lys Lys Leu Leu Gly
Lys Arg Lys Gln Ser His Phe115 120 125Asn
Arg Leu Ser Val Ala Gly Gln Asp Pro Lys Asp Ala Thr Gly Val130
135 140Glu Asp Asn Pro Tyr Ser Gln Ala Leu Ser Asn
Ser Ala Leu Glu Arg145 150 155
160Leu Gln Leu His Met Gln Leu Gln Ser Leu Gln His Pro Phe Ala
Phe165 170 175Tyr Asn Asn Pro Ala Leu Trp
Pro Lys Leu His Pro Leu Gln Glu Lys180 185
190Leu Ile Gln Asn Leu Gln Asn Glu Leu Pro Ser Pro Leu Met Gln His195
200 205Thr Pro Pro Ser Ser Asp Pro Met Gly
Gln Glu Gln Lys Ser Gly Phe210 215 220Tyr
Glu Pro Pro Thr Ala Ser Ile Thr Leu Gln Gln Glu Tyr Pro Lys225
230 235 240Thr Asn Asn Pro Lys Gly
Asp Glu Phe Glu Lys Ser Leu Asn Val Leu245 250
255Pro Ser Ser Asp Ser Ser Met Ala Tyr Asn Ser Glu Asn Met Val
Gly260 265 270Ala Pro Ile Met Ser Lys Pro
Asp Gly Val Asp Gln Ser Asn Ile Ala275 280
285Gln Gln Ala Val Ser Thr Phe Gln Ala Glu Leu Asp Asp Phe Leu Asn290
295 300Asn Lys Met Val Gly Phe Leu Pro Gln
Glu Glu Gln Met Thr Glu Ile305 310 315
320Asp Gly Ser Lys Asp Ser Leu Ala Trp Trp Ser Asn Asp Phe
Asp Thr325 330 335Lys Ser Ala Ser Ser His
Ser Trp Asp Ser Thr Ser Val Ile Gln Ser340 345
350Glu Gly Met Phe Gln Asp Tyr Val Leu Gly Tyr Asn Leu Gln355
360 3652571023DNAVitis vinifera 257atggggaggg
caccttgctg tgacaaagcc aacgtcaaga aaggcccatg gtcgcctgaa 60gaagatgcca
agctcaagtc atatattgag cagcatggca ctggtggtaa ttggatcgct 120ttgccacaaa
aaattggcct caagagatgt gggaagagct gccgccttag atggttaaac 180tatctccgcc
caaatatcag gcacggagga ttctctgaag aagaagataa catcatctgt 240agcctgtata
taagtattgg aagcaggtgg tctgttattg cagcacaatt accaggacga 300actgacaatg
atataaagaa ctactggaat acgagactga agaagaagct ccttggtaag 360cagcgcaaag
aacaacaggc tcgcagaggt agctgcccga agcaagagat gaaaagagca 420agtgggaacg
ccatggtttc tgataatgaa aacttaaacc cttactggcc tgagttgccc 480gtgccagcac
caataccgta cccaaacgaa gaaccacgct tcaacgatca ttcatccatc 540agaagattgt
tgatcaggct tgggggaaga ttttctgatg acggacaacc actccaaaac 600gggacaaatc
ttcagctccc gattgatatt tcatccgttc aacaaccttt tgaccactca 660gtaaactttc
tgtcttcttc tcctatgaat gccctaaaca accctcgttc tgaatttcca 720aacgctgagt
acaatatgga agggggaggg ctacacatgc tacaaggaca gaacagtttt 780ttagctgagc
tggaggagat ggcttgtagc aacccacaaa gattagatgg gctggagttc 840ttatatgcgc
aggacatggc caataacaaa cccggatcaa ctgcttatga gcaaagtctc 900agttggggtg
agacgagctc tctggtttat cctcctgttg cttccaacta tgaaggtctt 960caacagcaag
gggtgacgca agaacatgac tttgatgagt tgaggtaccc taccccgcga 1020taa
1023258340PRTVitis
vinifera 258Met Gly Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly
Pro1 5 10 15Trp Ser Pro
Glu Glu Asp Ala Lys Leu Lys Ser Tyr Ile Glu Gln His20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ile
Gly Leu Lys35 40 45Arg Cys Gly Lys Ser
Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro50 55
60Asn Ile Arg His Gly Gly Phe Ser Glu Glu Glu Asp Asn Ile Ile
Cys65 70 75 80Ser Leu
Tyr Ile Ser Ile Gly Ser Arg Trp Ser Val Ile Ala Ala Gln85
90 95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr
Trp Asn Thr Arg100 105 110Leu Lys Lys Lys
Leu Leu Gly Lys Gln Arg Lys Glu Gln Gln Ala Arg115 120
125Arg Gly Ser Cys Pro Lys Gln Glu Met Lys Arg Ala Ser Gly
Asn Ala130 135 140Met Val Ser Asp Asn Glu
Asn Leu Asn Pro Tyr Trp Pro Glu Leu Pro145 150
155 160Val Pro Ala Pro Ile Pro Tyr Pro Asn Glu Glu
Pro Arg Phe Asn Asp165 170 175His Ser Ser
Ile Arg Arg Leu Leu Ile Arg Leu Gly Gly Arg Phe Ser180
185 190Asp Asp Gly Gln Pro Leu Gln Asn Gly Thr Asn Leu
Gln Leu Pro Ile195 200 205Asp Ile Ser Ser
Val Gln Gln Pro Phe Asp His Ser Val Asn Phe Leu210 215
220Ser Ser Ser Pro Met Asn Ala Leu Asn Asn Pro Arg Ser Glu
Phe Pro225 230 235 240Asn
Ala Glu Tyr Asn Met Glu Gly Gly Gly Leu His Met Leu Gln Gly245
250 255Gln Asn Ser Phe Leu Ala Glu Leu Glu Glu Met
Ala Cys Ser Asn Pro260 265 270Gln Arg Leu
Asp Gly Leu Glu Phe Leu Tyr Ala Gln Asp Met Ala Asn275
280 285Asn Lys Pro Gly Ser Thr Ala Tyr Glu Gln Ser Leu
Ser Trp Gly Glu290 295 300Thr Ser Ser Leu
Val Tyr Pro Pro Val Ala Ser Asn Tyr Glu Gly Leu305 310
315 320Gln Gln Gln Gly Val Thr Gln Glu His
Asp Phe Asp Glu Leu Arg Tyr325 330 335Pro
Thr Pro Arg340259510DNAVitis vinifera 259atgggaagag ctccttgctg tgacaaggca
aatgtgaaga aaggcccatg gtcacctgaa 60gaagactcaa agcttaagga gtatattgag
aaatatggga ctgggggaaa ttggattgct 120ctcccacaaa aagctggtct taagagatgt
gggaaaagct gcagattgag atggttgaac 180tatctcagac ccaacattaa acatggagaa
ttctctgatg atgaagatag gatcatctgc 240agcgtctttg ctagcattgg aagcaagtgg
tcagtaatag caaattactt gccggggagg 300actgataacg atatcaagaa ctactggaac
accaagctga agaagaaact catggggatg 360gtgcctgtat ctcagagaaa accccatcaa
gctaccttct ctcatcatct tcaaacctat 420tcatcgttat catcaccttc acatgcggta
acaacaacag caacttcatc atcatatgaa 480agcaacagca gcaaccctta ttacactcca
510260170PRTVitis vinifera 260Met Gly
Arg Ala Pro Cys Cys Asp Lys Ala Asn Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ser Lys
Leu Lys Glu Tyr Ile Glu Lys Tyr20 25
30Gly Thr Gly Gly Asn Trp Ile Ala Leu Pro Gln Lys Ala Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp
Leu Asn Tyr Leu Arg Pro50 55 60Asn Ile
Lys His Gly Glu Phe Ser Asp Asp Glu Asp Arg Ile Ile Cys65
70 75 80Ser Val Phe Ala Ser Ile Gly
Ser Lys Trp Ser Val Ile Ala Asn Tyr85 90
95Leu Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Met Gly Met Val
Pro Val Ser Gln Arg Lys Pro115 120 125His
Gln Ala Thr Phe Ser His His Leu Gln Thr Tyr Ser Ser Leu Ser130
135 140Ser Pro Ser His Ala Val Thr Thr Thr Ala Thr
Ser Ser Ser Tyr Glu145 150 155
160Ser Asn Ser Ser Asn Pro Tyr Tyr Thr Pro165
170261944DNAZea mays 261atggggagag ctccgtgctg cgacaaggct actgtgaaga
agggcccgtg gtcaccggag 60gaggacgcca agctcaagtc ctacatcgag cagaacggca
ccggcggtaa ctggatagcc 120ctgccgcaga agataggttt gaagaggtgc ggcaagagct
gccgcctccg gtggctcaac 180tacctccggc caaacatcag gcacggcggg ttctcggagg
aggaggacag gataatcctt 240agcctctaca tcagcatagg cagcaggtgg tcgataatag
cggcgcagct gccggggagg 300acggacaatg acataaagaa ctactggaac acgaagctca
agaagaagct cttcggcaag 360cagtcccgca gggatcagcg gcagcatcag ttcatgcgcc
aggcagcggc ggcaagcgat 420gggatgatga agcaagaagc ggcaaccggg gaggcagcgg
gaagcagtag catggcggtg 480gccaactaca actggcacca ccaccatcaa gccatggcgg
ccgtgcctgt gcaaccaata 540ctaggctcga taatggaagg ccatcgcgca ggggatgagg
tagatgagtc gattcggaag 600cttctttata agctaggagg agcaggccct tccacagcac
tctcggttcc tcagtgtgtt 660cctccaatgt actgtggaag tccagccctc atgccgccgc
cgccgccgtc atgcccagca 720gtggacaccg gcacatcgct tgatgaaggc ggcgtgcagg
gctccggcgt gctgccggcg 780ctggagatgg accatagctt ccacttcaac caggttaagc
tagatggcct ggaaagcttc 840ttcggcatga gtactgatca gagcatgcgg tggagtgagg
tgagcccatt ggtttgccct 900agcagtctgg ttgttgatga gccaggaaac cttgggatga
agta 944262314PRTZea mays 262Met Gly Arg Ala Pro Cys
Cys Asp Lys Ala Thr Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Glu Glu Asp Ala Lys Leu Lys Ser Tyr
Ile Glu Gln Asn20 25 30Gly Thr Gly Gly
Asn Trp Ile Ala Leu Pro Gln Lys Ile Gly Leu Lys35 40
45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu
Arg Pro50 55 60Asn Ile Arg His Gly Gly
Phe Ser Glu Glu Glu Asp Arg Ile Ile Leu65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg Trp Ser
Ile Ile Ala Ala Gln85 90 95Leu Pro Gly
Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Phe Gly Lys Gln Ser Arg Arg
Asp Gln Arg Gln115 120 125His Gln Phe Met
Arg Gln Ala Ala Ala Ala Ser Asp Gly Met Met Lys130 135
140Gln Glu Ala Ala Thr Gly Glu Ala Ala Gly Ser Ser Ser Met
Ala Val145 150 155 160Ala
Asn Tyr Asn Trp His His His His Gln Ala Met Ala Ala Val Pro165
170 175Val Gln Pro Ile Leu Gly Ser Ile Met Glu Gly
His Arg Ala Gly Asp180 185 190Glu Val Asp
Glu Ser Ile Arg Lys Leu Leu Tyr Lys Leu Gly Gly Ala195
200 205Gly Pro Ser Thr Ala Leu Ser Val Pro Gln Cys Val
Pro Pro Met Tyr210 215 220Cys Gly Ser Pro
Ala Leu Met Pro Pro Pro Pro Pro Ser Cys Pro Ala225 230
235 240Val Asp Thr Gly Thr Ser Leu Asp Glu
Gly Gly Val Gln Gly Ser Gly245 250 255Val
Leu Pro Ala Leu Glu Met Asp His Ser Phe His Phe Asn Gln Val260
265 270Lys Leu Asp Gly Leu Glu Ser Phe Phe Gly Met
Ser Thr Asp Gln Ser275 280 285Met Arg Trp
Ser Glu Val Ser Pro Leu Val Cys Pro Ser Ser Leu Val290
295 300Val Asp Glu Pro Gly Asn Leu Gly Met Lys305
3102631083DNAZea mays 263atggggaggg cgccgtgctg cgacaaggcg
agcgtgaaga aggggccgtg gtcgccggac 60gaggacgcca agctcaaggc ctacatcgag
gagcacggca ccggcggcaa ctggatcgcg 120ctaccgcaca agattgggct gaagagatgc
gggaaaagct gcaggctgag atggctcaat 180tatctgaggc ccaatattaa gcatggtgac
ttcacagaag aagaggagca tatcatttgc 240agcctctaca ttagcattgg aagcaggtgg
tccatcattg cagcgcagct gccggggcga 300acggacaacg acatcaagaa ctactggaac
accaagctga agaagaagct cctcggaaag 360cgcgcgccgt ccaggcgcgc gaggacgaac
caggatccct gctacctcgc cgcgggagcc 420gccagcagca gcagcagcag cagcgccgcg
acggccacgc aagccctaag cgcgtcggct 480ctcgagcgga tccagctcca catgcggctc
caaggcatct acggcgcgtt agcttgcagc 540ggcggcaacg acgacagcaa cgctgccttc
gggtcggcgg cggcggcgcc gccgcagtgg 600ccgaagcttg aggcgctgtc gcaggccaac
aggctgctcc cggggtcgct gccggcggac 660gccatggcca caaccgtgag cgtgcagccg
cacccacagc atctggtcga ggtcgaccac 720caccagagcc tcgccgccgc cgcactcgag
ggcgagcaac agctcagctc tgccggtgaa 780ggaggctttt tcgagcggcc caaggtagat
ttttactctc catcagccga ggtcgccgcc 840gccgccgccg ccagcgtaga gatgaactcg
gtcgccccga tggttgttgg cggctacgcg 900ggcgccgccg gattcgggcc tcagcaccat
cacgacgagc tctacgactt cctgtacagc 960aagtacggat cagtgggcgg gctggcgcac
gacggcgggc atgttccaac gttgccagag 1020ctgcagtgcc cggacggcgc tgccgccgtc
gtcggagccg acgagaagtt ctcggcgtgg 1080acc
1083264361PRTZea mays 264Met Gly Arg Ala
Pro Cys Cys Asp Lys Ala Ser Val Lys Lys Gly Pro1 5
10 15Trp Ser Pro Asp Glu Asp Ala Lys Leu Lys
Ala Tyr Ile Glu Glu His20 25 30Gly Thr
Gly Gly Asn Trp Ile Ala Leu Pro His Lys Ile Gly Leu Lys35
40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Leu Asn
Tyr Leu Arg Pro50 55 60Asn Ile Lys His
Gly Asp Phe Thr Glu Glu Glu Glu His Ile Ile Cys65 70
75 80Ser Leu Tyr Ile Ser Ile Gly Ser Arg
Trp Ser Ile Ile Ala Ala Gln85 90 95Leu
Pro Gly Arg Thr Asp Asn Asp Ile Lys Asn Tyr Trp Asn Thr Lys100
105 110Leu Lys Lys Lys Leu Leu Gly Lys Arg Ala Pro
Ser Arg Arg Ala Arg115 120 125Thr Asn Gln
Asp Pro Cys Tyr Leu Ala Ala Gly Ala Ala Ser Ser Ser130
135 140Ser Ser Ser Ser Ala Ala Thr Ala Thr Gln Ala Leu
Ser Ala Ser Ala145 150 155
160Leu Glu Arg Ile Gln Leu His Met Arg Leu Gln Gly Ile Tyr Gly Ala165
170 175Leu Ala Cys Ser Gly Gly Asn Asp Asp
Ser Asn Ala Ala Phe Gly Ser180 185 190Ala
Ala Ala Ala Pro Pro Gln Trp Pro Lys Leu Glu Ala Leu Ser Gln195
200 205Ala Asn Arg Leu Leu Pro Gly Ser Leu Pro Ala
Asp Ala Met Ala Thr210 215 220Thr Val Ser
Val Gln Pro His Pro Gln His Leu Val Glu Val Asp His225
230 235 240His Gln Ser Leu Ala Ala Ala
Ala Leu Glu Gly Glu Gln Gln Leu Ser245 250
255Ser Ala Gly Glu Gly Gly Phe Phe Glu Arg Pro Lys Val Asp Phe Tyr260
265 270Ser Pro Ser Ala Glu Val Ala Ala Ala
Ala Ala Ala Ser Val Glu Met275 280 285Asn
Ser Val Ala Pro Met Val Val Gly Gly Tyr Ala Gly Ala Ala Gly290
295 300Phe Gly Pro Gln His His His Asp Glu Leu Tyr
Asp Phe Leu Tyr Ser305 310 315
320Lys Tyr Gly Ser Val Gly Gly Leu Ala His Asp Gly Gly His Val
Pro325 330 335Thr Leu Pro Glu Leu Gln Cys
Pro Asp Gly Ala Ala Ala Val Val Gly340 345
350Ala Asp Glu Lys Phe Ser Ala Trp Thr355
3602651259DNAOryza sativa 265atggggagag cgccgtgctg cgacaaggcg agcgtgaaga
aggggccatg gtcaccggag 60gaggacgcga agctcaaatc ctacatcgag cagaacggca
ccggcggcaa ctggatcgcc 120ctgccccaga agatcggtat gtgttggcaa ttctcatgag
tcatgagtcg atgtggtggt 180tgtggcagca gcagcagcga tagtggtaga agaaagctaa
tcaagtgcat gtgtgtcttt 240gttggtttgg gtttgtacat gtaggtttga aaaggtgcgg
caagagctgc cgcctccggt 300ggctgaacta cctccggccg aacatcaagc acggcgggtt
ctcggaggag gaagacagga 360tcatcctcag cctctacatc agcataggaa gcaggtatac
ttgctaattc agctgggcac 420acacaagtct agtcactcct atgatcaact caactgtaga
attagctgcg aaaccggtag 480gcaggcaggc aggttgcaca tggggcatga gagtgtagta
caatctatga tctgacatga 540ccatatatgc atggcacagg tggtcgataa tagcagcgca
gctgccgggg cggacggaca 600atgacatcaa gaactattgg aacacgaggc tcaagaagaa
gctctttggc aagcagtcgc 660gcaaggatca gaggcagcag cagcacctgg cgcgccaggc
ggcagcagct gccagcgact 720tgcagatcaa acaagaagcg agcaggggtg caaacgaagc
cgatggcttg gctgccggtg 780ccaattacac ttggcatcac caccacgcca tggccgtgcc
tgtgcacccg atgtcggcac 840caatggtggt ggaaggaggc cgtgtgggag acgatgtcga
tgagtcgatc cggaagcttc 900tgttcaagct cggagggaac ccattcgcgg cctcgccggc
accgccatgc atacctccac 960caccaatgta cgaggaagcc ccaagcttcg tgccaccatt
ggcgcacggc gtgccgctca 1020acgaaggcgg catgcagtgc tccagcgtgc tgccggcgct
ggagctggac gagaacttcc 1080acttcaacca tgtcaagctg gacgggctcg agtgcctctt
cgggatggga gatcaccaaa 1140acatgagatg gaatgaggtg agcccgttgg tttgccctaa
taacgctgtg gcgtccagct 1200cccaagggat gcagcagtac tgcctagttg aagaaccagc
tgacctcggg atgcagtag 1259266116PRTArabidopsis
ThalianaMISC_FEATURE(4)..(4)X at position 4 can be any amino acid 266Met
Gly Arg Xaa Pro Cys Cys Asp Xaa Xaa Xaa Xaa Lys Xaa Gly Xaa1
5 10 15Trp Xaa Xaa Xaa Glu Asp Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa20 25
30Gly Xaa Xaa Xaa Xaa Trp Ile Xaa Xaa Pro Xaa Xaa Xaa Gly Xaa Xaa35
40 45Arg Cys Gly Xaa Ser Cys Arg Leu Arg Trp
Xaa Asn Tyr Leu Arg Pro50 55 60Xaa Xaa
Xaa His Gly Xaa Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa Xaa Xaa65
70 75 80Xaa Xaa Xaa Xaa Xaa Xaa Gly
Ser Xaa Trp Ser Xaa Xaa Ala Xaa Xaa85 90
95Xaa Xaa Xaa Arg Thr Asp Asn Asp Xaa Lys Asn Xaa Trp Xaa Xaa Xaa100
105 110Leu Xaa Xaa Xaa11526787PRTArabidopsis
thalianaMISC_FEATURE(1)..(1)X at position 1 can be arginine or lysine
267Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Ile Xaa Xaa1
5 10 15Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa20 25
30Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa His Xaa Xaa Xaa Xaa Xaa35
40 45Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Ser Xaa50 55 60Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa65
70 75 80Xaa Xaa Xaa Xaa Xaa Xaa
Xaa8526887PRTArabidopsis thalianaMISC_FEATURE(2)..(2)X at position 2 can
be any amino acid 268Gly Xaa Trp Xaa Xaa Xaa Glu Asp Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa1 5 10 15Xaa
Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa20
25 30Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg
Trp Xaa Asn Tyr Leu35 40 45Arg Pro Xaa
Xaa Xaa His Gly Xaa Xaa Xaa Xaa Xaa Glu Xaa Xaa Xaa50 55
60Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Xaa
Xaa Xaa Ala65 70 75
80Xaa Xaa Xaa Xaa Xaa Arg Thr8526910PRTArabidopsis thaliana 269Ser Phe
Ser Gln Leu Leu Leu Asp Pro Asn1 5
1027015PRTArabidopsis thaliana 270Thr Ser Thr Ser Ala Asp Gln Ser Thr Ile
Ser Trp Glu Asp Ile1 5 10
1527130DNAartificial sequencesynthetic primer 271acgttctaga atgggaagag
caccgtgttg 3027232DNAartificial
sequencesynthetic primer 272atcgggatcc ttacacatga tttggcgcat tg
3227330DNAartificial sequencesynthetic primer
273acgttctaga atgggaagag caccgtgttg
3027437DNAartificial sequencesynthetic primer 274atcgggatcc ttaaaaaaat
tgctttgaat cagaata 3727527DNAartificial
sequencesynthetic primer 275acgtaagctt tcgtaaaatc tctcatg
2727640DNAartificial sequencesynthetic primer
276gtcactcgag cctaggtttc ttgattcttg attcttgatc
4027735DNAartificial sequencesynthetic primer 277aaagtcgacg catctttaca
atgtaaagct tttct 3527827DNAartificial
sequencesynthetic primer 278aaatctagat gttcgttgct tttcggg
2727938DNAartificial sequencesynthetic primer
279aaagtcgaca gaagacaaat gagagttggt ttatattt
3828029DNAartificial sequencesynthetic primer 280aaatctagac gcaacgaact
ttgattcaa 2928130DNAartificial
sequencesynthetic primer 281atgggaagag caccgtgttg tgataaggcc
3028235DNAartificial sequencesynthetic primer
282ttaatttggc gcattgaagt aacttgcatc ttcgg
3528328DNAartificial sequencesynthetic primer 283acgttctaga atggggagag
cgccgtgc 2828429DNAartificial
sequencesynthetic primer 284tgcaggatcc tactgcatcc cgaggtcag
2928530DNAartificial sequencesynthetic primer
285acgttctaga atggggagag ctccttgttg
3028629DNAartificial sequencesynthetic primer 286acgtggatcc ctattgcgct
cctcctggg 2928729DNAartificial
sequencesynthetic primer 287acgttctaga atggggaggg caccttgct
2928829DNAartificial sequencesynthetic primer
288acgttctaga atggggagag ctccgtgct
2928928DNAartificial sequencesynthetic primer 289actgtctaga atggggaggg
cgccgtgc 2829031DNAartificial
sequencesynthetic primer 290actgtctaga atgggaagag ctccttgctg t
3129129DNAartificial sequencesynthetic primer
291acgttctaga atggggagag ctccgtgct
2929231DNAartificial sequencesynthetic primer 292actgtctaga atgggaagag
ctccatgttg t 3129335DNAartificial
sequencesynthetic primer 293gactggatcc ttagtaataa aacatcccta tctca
3529430DNAartificial sequencesynthetic primier
294acgttctaga atggggagag ctccttgctg
3029531DNAartificial sequencesynthetic primer 295gactggatcc tcattgtggc
ccaaagaagc t 3129635DNAartificial
sequencesynthetic primer 296aaagtcgacg catctttaca atgtaaagct tttct
3529727DNAartificial sequencesynthetic primer
297aaatctagat gttcgttgct tttcggg
2729838DNAartificial sequencesynthetic primer 298aaagtcgaca gaagacaaat
gagagttggt ttatattt 3829929DNAartificial
sequencesynthetic primer 299aaatctagac gcaacgaact ttgattcaa
2930029DNAartificial sequencesynthetic primer
300aaaggatcca tgggaagagc accgtgttg
2930132DNAartificial sequencesynthetic primer 301aaaggatccc cactccctaa
agacacagat tt 3230228DNAartificial
sequencesynthetic primer 302aaatctagaa tgggaagagc accgtgtt
2830329DNAartificial sequencesynthetic primer
303aaaggatcct tacacatgat ttggcgcat
2930430DNAartificial sequencesynthetic primer 304actgtctaga atgggaagag
ctccatgctg 3030532DNAartificial
sequencesynthetic primer 305cagtggatcc ttaaacactg tggtagctca tc
3230630DNAartificial sequencesynthetic primer
306actgtctaga atgggaagag ctccgtgttg
3030731DNAartificial sequencesynthetic primer 307acgtggatcc taggagtaga
aatagggcaa g 3130831DNAartificial
sequencesynthetic primer 308actgtctaga atgggtaggg ctccatgttg t
3130934DNAartificial sequencesynthetic primer
309acgtggatcc tcagtagtac aacatgaact tatc
3431028DNAartificial sequencesynthetic primer 310aaaatctaga atgggaagag
caccgtgc 2831131DNAartificial
sequencesynthetic primer 311aaaaagatct ctactcatta tcgtatagag g
3131230DNAartificial sequencesynthetic primer
312acgtgtcgac gaagcagcag aagccttgat
3031332DNAartificial sequencesynthetic primer 313acgttctaga ggtagagaaa
agagaaagcc tc 3231430DNAartificial
sequencesynthetic primer 314acgttctaga atggggagag cgccgtgctg
3031532DNAartificial sequencesynthetic primer
315tgcaggatcc ctactgcatc ccgaggtcag ct
3231629DNAartificial sequencesynthetic primer 316acgttctaga atggggaggg
caccttgct 2931728DNAartificial
sequencesynthetic primer 317acgtggatcc tattgcgccc ccgggtag
2831834DNAartificial sequencesynthetic primer
318aaatctagaa tggggagagc tccgtgctgc gaca
3431934DNAartificial sequencesynthetic primer 319aaaggatccc tacttcatcc
caaggtttcc tggc 3432030DNAartificial
sequencesynthetic primer 320acgttctaga atggggagag ctccttgttg
3032129DNAartificial sequencesynthetic primer
321acgtggatcc ctattgcgct cctcctggg
2932239DNAartificial sequencesynthetic primer 322aaatctagaa tgggaagagc
accgtgttgt gataaggcc 3932342DNAartificial
sequencesynthetic primer 323aaaggatcct tacacattat ttggcccatt gaagtatctt
gc 4232439DNAartificial sequencesynthetic primer
324aaatctagaa tgggaagagc accgtgttgt gacaaggct
3932542DNAartificial sequencesynthetic primer 325aaaggatcct tacaaatgat
ttgccccatt gaagtaactt gc 42
User Contributions:
Comment about this patent or add new information about this topic: