Patent application title: COMPOSITIONS, METHODS, AND PLANT GENES FOR THE IMPROVED PRODUCTION OF FERMENTABLE SUGARS FOR BIOFUEL PRODUCTION
Inventors:
IPC8 Class: AC12N1582FI
USPC Class:
1 1
Class name:
Publication date: 2021-06-10
Patent application number: 20210171969
Abstract:
Described herein are compositions comprising at least one auxin transport
inhibitor for pre-treating a plant or seed to increase saccharification,
or saccharide release by hydrolysis, the at least one auxin transport
inhibitor being in an amount effective to increase sugar release from a
plant tissue by hydrolysis. Also described are plant mutations, and
methods to screen for such plant mutations, having an improved sugar
release phenotype. The described compositions, methods and plant
mutations are particularly useful for producing biofuel crops, such as
maize, to improve sugar extractability from lignocellulosic biomass and
hence, the efficiency of bioethanol production overall.Claims:
1-142. (canceled)
143. An isolated nucleic acid which encodes a mutant fpx 2-3 polypeptide comprising a mutation corresponding to G1009D in SEQ ID NO: 19, or a fragment thereof encoding said mutant fpx 2-3 polypeptide wherein said fragment comprises said G1009D mutation and is at least 80% identical to SEQ ID NO: 19.
144. The isolated nucleic acid of claim 143, comprising a nucleic acid sequence 80% identical to SEQ ID NO: 18 or encoding a polypeptide which is at least 80% identical to SEQ ID NO: 19.
145. The isolated nucleic acid of claim 143, comprising a nucleic acid sequence 85% identical to SEQ ID NO: 18 or encoding a polypeptide which is 85% identical to SEQ ID NO: 19.
146. The isolated nucleic acid of claim 143, comprising a nucleic acid sequence 90% identical to SEQ ID NO: 18 or encoding a polypeptide which is 90% identical to SEQ ID NO: 19.
147. The isolated nucleic acid of claim 143, comprising a nucleic acid sequence 99% identical to SEQ ID NO: 18 or encoding a polypeptide which is 99% identical to SEQ ID NO: 19.
148. The isolated nucleic acid of claim 143, having the nucleic acid sequence of SEQ ID NO: 18 or encoding a polypeptide having the sequence of SEQ ID NO: 19.
149. A vector comprising a nucleic acid as defined in claim 143.
150. A host cell comprising a nucleic acid as defined in claim 143.
151. A seed or plant, each comprising a nucleic acid as defined in claim 143.
152. A vector comprising a nucleic acid as defined in claim 146.
153. A host cell comprising a nucleic acid as defined in claim 146.
154. A seed or plant, each comprising a nucleic acid as defined in claim 146.
155. A vector comprising a nucleic acid as defined in claim 148.
156. A host cell comprising a nucleic acid as defined in claim 148.
157. A seed or plant, each comprising a nucleic acid as defined in claim 148.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a divisional that claims benefit under 35 U.S.C. .sctn. 121 of co-pending U.S. application Ser. No. 15/824,013 filed Nov. 28, 2017, which is a continuation that claims benefit under 35 U.S.C. .sctn. 120 of U.S. application Ser. No. 14/388,089 filed Sep. 25, 2014 now U.S. Pat. No. 9,856,512 issued Jan. 2, 2018, which is a 35 U.S.C. .sctn. 371 National Phase Entry Application of International Application No. PCT/CA13/00289 filed Mar. 26, 2013, and which claims benefit under 35 U.S.C. .sctn. 119(e) of U.S. Provisional Application Ser. No. 61/615,530 filed Mar. 26, 2012 the contents of which are incorporated herein by reference in their entireties.
SEQUENCE LISTING
[0002] The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Sep. 22, 2014, is named 924270WO_ST25.txt and is 257,246 bytes in size.
FIELD OF THE INVENTION
[0003] The present invention is directed to compositions and methods for improving saccharide extraction from biomass, as well as to methods for identifying mutations that affect saccharide extraction. More particularly, the invention relates to compositions comprising auxin transport inhibitors, methods relating thereto, mutant plant varieties, and methods of genetic screening for such mutations that affect saccharification in plant tissue.
BACKGROUND OF THE INVENTION
[0004] Plant biomass and in particular cellulosic ethanol has gained considerable interest as a stable, environmentally benign source of energy that could partially offset fossil fuels. However, the encapsulation of cellulose and branched polysaccharides collectively known as hemicellulose lignin, together with the crystalline nature of cellulose, make the biochemical conversion of lignocellulosic biomass to biofuels a costly and energy inefficient process. The recalcitrance of lignocellulose has led to the development of a variety of technologies that usually involve the deconstruction of plant cell walls through acid, thermochemical, or enzymatic hydrolysis. For example, hemicellulose can be hydrolyzed by dilute acid treatments, but these conditions are not severe enough for cellulose hydrolysis. Increasing acid concentrations or carrying out acid treatments at high temperature and pressure improves sugar yields from cellulose, but both processes are corrosive and increase costs. Unfortunately, enzymatic approaches of digesting lignocellulose are still in their infancy. Moreover, the protective nature of the cell wall to cellulases means digestion is slow and inefficient. As a consequence, acid hydrolysis pretreatments are often used to depolymerize and solubilize hemicelluloses.
[0005] The lack of energy efficient and environmentally friendly conversion of lignocellulosic polymers into fermentable sugars, or saccharification, has spurred interest in using genetic and genomic approaches that modify the cell wall for industrial processing. Often these approaches have involved manipulating known cell wall synthesis or degradation enzymes. Although these rational approaches are promising they depend on a prior molecular knowledge of the genes of interest, usually followed by reverse genetics to test functionality.
[0006] Most approaches to genetically improving conversion of lignocellulosic biomass into a fermentable sugar source take advantage of our understanding of cell wall polymer synthesis. This usually involves manipulating glycosyltransferases and glycan synthases that are involved in polymerizing polysaccharides or modulating levels of lignin. However, the rudimentary knowledge about the regulation of this complex matrix limits this approach. For example, estimates of over 1000 cell wall proteins in Arabidopsis alone make it difficult to know which ones will functionally influence saccharification. Furthermore, over 700 genes are annotated as encoding putative glycosyltransferases or glycosyl hydrolases.
[0007] By contrast, forward genetic screens, which inherently have no mechanistic bias have the potential to uncover novel processes that could improve saccharification. The limitation of forward screens, however, is designing specific high throughput assays, followed by efficient molecular identification of the genes involved. In this latter case, however, the recent development of next generation sequencing technologies to identify mutant alleles has greatly reduced this bottleneck.
SUMMARY OF THE INVENTION
[0008] The invention is directed to a use of an auxin transport inhibitor in the pretreatment of a plant tissue to increase the sugar released from the plant tissue through hydrolysis.
[0009] The invention is further directed to the use of a genetically modified plant that has disrupted auxin transport to increase the sugar released from the plant through hydrolysis.
[0010] The invention is further directed to the use of a genetically modified plant that contains cell wall defects to increase the sugar released from the plant through hydrolysis.
[0011] The invention is further directed to the use of genetically modified plant tissue with increased starch accumulation to increase the sugar released from the plant through hydrolysis.
[0012] The invention is further directed to the use of any of the forgoing in production of bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textile, chemical, biocosmetic, and feed stock production.
[0013] The invention is further directed to a method of identifying plant genotypes that show an improved sugar release under mild acid treatment comprising the following steps:
a) providing a plurality of mutated plant seeds; b) germinating the mutated plant seeds; c) retrieving samples from each mutated plant seed; d) submerging the samples in a weak acid; e) incubating the samples with a colorimetric reagent in a concentrated acid; and f) measuring the colour absorbance to determine the relative concentration of the sugar release.
[0014] The invention is further directed to a screening method to identify new plant cellulose synthase (CESA) alleles wherein mutagenized plants are screened with a cellulose biosynthetic inhibitor (CBI).
[0015] The invention is further directed to the use of an X-ray diffractometer to measure the proportion of crystalline cellulose relative to the proportion of amorphous cellulose in plant stem tissue.
[0016] The invention is further directed to the use of forward genetic screens for identifying mutants with improved saccharification from plant tissues.
[0017] The invention is further directed to the use of a forward genetic screen for identifying mutations that show increased sugar release from plant biomass as compared with wild types, under mild acid hydrolysis conditions.
[0018] The invention is further directed to a method of identifying genes involved with saccharification by means of a genetic screen.
[0019] According to an aspect of the invention, there is provided a composition for pre-treating a plant tissue to increase saccharide, or sugar, release from said plant tissue by hydrolysis, the composition comprising at least one auxin transport inhibitor in an amount effective to increase sugar release from said plant tissue by hydrolysis.
[0020] In a further aspect of the invention, there is also provided a method of pre-treating a plant tissue to increase saccharide release the said plant tissue by hydrolysis, the method comprising administering a composition as defined herein in an amount effective to increase sugar release from the plant, or tissues thereof, by hydrolysis.
[0021] Also provided is a method of screening for plants having an increased saccharide release phenotype, a reduced cellulose crystallinity phenotype, or both. The method comprises:
[0022] treating at least one plant or plant seed with at least one cellulose biosynthetic inhibitor (CBI) in an amount effective to select for CBI-resistance in the plant or plant seed;
[0023] germinating the plant seeds and/or incubating the plant and selecting for CBI-resistant mutant plants, or seeds thereof; and
[0024] measuring saccharide release, cellulose crystallinity, or both, in the CBI-resistant mutant plants to identify an increased saccharide release phenotype, a reduced cellulose crystallinity phenotype, or both.
[0025] Other details and aspects of the invention will be apparent from the following description of these compositions, uses and methods, as well as the mutant plants and genes described in detail throughout this application.
BRIEF DESCRIPTION OF THE FIGURES
[0026] This patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0027] These and other features of the invention will become more apparent from the description, in which reference is made to the following drawings wherein:
[0028] FIGS. 1A-1B illustrate methodology and results of screening for wall hydrolysis sensitive (whs) mutants. FIG. 1A(PRIOR ART) is a schematic of the production of ethanol from cellulosic biomass. For biomass pretreatment, dilute sulphuric acid is used to solubilize the hemicellulosic fraction and to disrupt the crystalline structure of cellulose so that hydrolyzing enzymes can easily access and convert cellulose to fermentable sugars.
[0029] FIG. 1B illustrates the results of measuring hexose content in known cell wall mutants subjected to acid hydrolysis using 1M H.sub.2SO.sub.4 at 21 days after germination (DAG). Of the 30 cell wall mutants tested, only mur11-1 showed a significant difference in cell wall accessibility relative to wild type. All experiments were repeated at least three times with similar results. Dotted line denotes wild type levels (Results are averages.+-.s.d. (n=4). *, P<0.05 using Student's t-test.) FIG. 1C shows the results of measuring hexose content in mur11-1 and sac9-3 (SALK_058870) relative to wild type. Leaf discs were assayed for increased saccharification using 1M H.sub.2SO.sub.4 at 21 days. (Results are averages.+-.s.d. (n=8-10).)
[0030] FIGS. 2A-2C illustrate the results of characterizing whs mutants. FIG. 2A shows three-week old Arabidopsis plants grown in 96-well flats at 22.degree. C. under a 16 h/8 h light/dark cycle (top panel). Leaf 3 or 4 was excised from 21 day-old plants using a hole punch and subjected to acid hydrolysis using 1 M H.sub.2SO.sub.4. c; cotyledon, leaf numbers indicated (middle panel). Results of colorimetric anthrone assay illustrating that whs mutants release more sugars and turn a blue/green colour. Yellow indicates baseline levels of sugar release (bottom panel). FIG. 2B shows the hierarchical cluster analysis of monosaccharide composition analysis by gas chromatography of whs mutants in 21 day-old seedlings. Values are shown as a percentage relative to wild type. Yellow indicates high expression and blue indicates low expression. FIG. 2C shows a clustered heatmap of hexose content from 63 whs mutants subjected to acid hydrolysis of fresh leaf tissue using 1M H.sub.2SO.sub.4, acid hydrolysis of senesced whole plant tissues using 0.2 M H.sub.2SO.sub.4, enzymatic assays using cellulase, cellulase+xylanase and cellulase+peroxidase and starch staining of 14 day-old seedlings. Values are shown as a percentage relative to wild type. Yellow indicates high expression and black indicates low expression.
[0031] FIGS. 3A-3C illustrate the starch analysis of whs mutants mur11, dpe2 and sex4. FIG. 3A shows the acid hydrolysis of fresh leaf disc tissue from known starch mutants using 1 M H.sub.2SO.sub.4. (Results are averages.+-.s.d. (n=4); all experiments were repeated at least three times with similar results.) FIG. 3B shows the treatment of senesced material from starch mutants with .alpha.-amylase and the quantification of the amount of starch released using the anthrone method. (Results are averages.+-.s.d. (n=4); all experiments were repeated two times with similar results.) FIG. 3C shows the assay of the tissue by acid hydrolysis for residual hexose release using 1 M H.sub.2SO.sub.4, post-amylase treatment. (Results are averages.+-.s.d. (n=3).)
[0032] FIGS. 4A-4D illustrate the analysis of pin-shaped inflorescence mutants and NPA treatment, resulting in increased saccharification in Arabidopsis and maize. FIG. 4A shows senesced tissue from Arabidopsis pin-shaped inflorescence mutants subjected to 0.2 M acid hydrolysis. (Results are averages.+-.s.d. (n=3); all experiments were repeated three times with similar results.) Inset shows representative pin-shaped inflorescence in Arabidopsis. FIG. 4B shows maize inflorescence mutants bif2 and ba1 subjected to 0.2 M H.sub.2SO.sub.4 acid hydrolysis. (Results are averages.+-.s.d. (n=3-4). N, phenotypically normal siblings.) Inset shows representative maize inflorescence mutant. FIG. 4C shows wild type (Col-0) Arabidopsis 28 day-old seedlings grown on MS media supplemented with 0, 1 or 5 .mu.M NPA and subjected to 0.2 M H.sub.2SO.sub.4 acid hydrolysis. (Results are averages.+-.s.d. (n=4). *, P<0.001 and **, P<0.005 using Student's t-test; all experiments were repeated two times with similar results.) FIG. 4D shows two maize cultivars treated with 120 .mu.M NPA for 2 weeks and subjected to 0.2 M H.sub.2SO.sub.4 acid hydrolysis. (Results are averages.+-.s.d. (n=6-9).)
[0033] FIG. 5 shows absorbance readings from anthrone acid hydrolysis as quantified against a glucose curve. Candidate whs mutants are considered as releasing a significant amount of sugars when readings measure 2 or more standard deviations above wild type (Abs.sub.660nm 0.12.+-.0.002).
[0034] FIG. 6 shows the map based cloning of cell wall accessible genes.
[0035] FIG. 7 shows the wall hydrolysis sensitivity of the SAC domain family in Arabidopsis using the following T-DNA insertions: sac1-1 (SALK_070875), sac1-2 (SALK_020109), sac2-1 (SALK_099031), sac2-2 (SALK_091926), sac3-1 (SALK_023548), sac3-2 (SALK_049623), sac4-1 (SALK_119184), sac4-2 (SALK_005871), sac4-3 (SALK_056500), sac5-1 (SALK_012372), sac5-2 (SALK_125856), sac6-1 (SALK_021488), sac6-2 (SALK_136049), sac7-1 (SALK_000558), sac7-2 (SALK_092575), sac8-1 (SALK_062145) and sac8-2 (SALK_115643). Leaf disc tissue from 21 day-old plants was assayed using 1 M H.sub.2SO.sub.4. (Results are averages.+-.s.d. (n=3-4).)
[0036] FIG. 8 shows the wall hydrolysis sensitivity of auxin response factor mutants. Leaf disc tissue from 21 day-old plants was assayed using 1 M H.sub.2SO.sub.4. (Results are averages.+-.s.d. (n=4-8).)
[0037] FIG. 9 shows the relative cellulose crystallinity of wt (Col, Ler) and mutant lines. "C" refers to Col-0; "L" refers to Ler; each instance of "f" denotes a fxr mutant line; and each instance of "ix" denotes an ixr mutant line.
[0038] FIG. 10 shows the percent total sugar releases following hydrolysis of wt (Col, Ler) and mutant stem tissue using different treatments.
DETAILED DESCRIPTION
[0039] Described herein are compositions, methods, mutant genes, cells, plants and other materials which are useful to increase carbohydrate availability for saccharification, in particular, through pre-treatment of a plant with an auxin transport inhibitor.
[0040] Saccharification is generally known as the process of breaking a complex carbohydrate (such as starch or cellulose) into its monosaccharide components. By increasing carbohydrate availability for saccharification, the compositions, methods, mutant genes, cells, plants and other materials described in this application can be used for a variety of industrial processes. For instance, they may be used to pretreat feedstock typically used in the biofuels industry for production of bioethanol. They may be employed in the production of biomass which is, for example, useful in producing biofuels, bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textile, chemical, biocosmetics, and in other feed stock production.
[0041] The compositions and methods described herein are applicable in a variety of plant species. Of interest are the monocotyledonous plants, e.g. corn (Zea mays), sugar cane (Saccharum sp.), switchgrass (Panicum virgatum) and other grass species (Miscanthus), and other species used in bioethanol production. However, the present invention is also applicable in dicotyledonous plants, e.g. Arabidopsis, . . . .
[0042] In certain embodiments of the invention, the auxin transport inhibitor may include at least one of the following: 1-N-Naphthylphthalmaic acid (NPA), 2-{(E)-1-[4-(3,5-difluorophenyl)semicarbazono]ethyl}nicotinic acid (diflufenzopyr), 2,3,5-triiodobenzoic acid (TIBA), 9-hydroxyfluorene-9-carboxylic acid (HFCA), p-chlorophenoxyisobutyric acid (PCIB), 2-carboxyphenyl-3-phenylpropane-1,2-dione (CPD), chlorflurenol, quimerac, tricyclopyr, CPIB, quercetin, genistein, including agriculturally acceptable salts, esters, or derivatives thereof.
[0043] Chemical structures for some of the above-listed compounds, and certain additional examples of auxin transport inhibitors, include the following:
##STR00001## ##STR00002##
[0044] In certain preferred embodiments of the invention, the auxin transport inhibitor may be of a phthalamate (e.g. 1-N-naphthylphthalmaic acid (NPA)) or semicarbazone (2-{(E)-1-[4-(3,5-difluorophenyl)semicarbazono]ethyl}nicotinic acid (diflufenzopyr)) class of auxin transport inhibitor.
[0045] In certain other embodiments of the invention, which are non-limiting, the auxin transport inhibitor may be of the following molecular class of auxin transport inhibitors:
##STR00003##
including agriculturally acceptable salts, esters, or derivatives thereof. The term "Ar" represents "aryl", and refers to a monovalent unsaturated aromatic carbocyclic group having a single ring (e.g. phenyl) or multiple condensed rings (e.g. naphthyl or anthryl), which can optionally be unsubstituted or substituted with, e.g., halogen (for instance F, Cl, Br, or I), alkyl (for instance, a lower alkyl group), alkoxy, alkylthio, trifluoromethyl, acyloxy, hydroxy, mercapto, carboxy, aryloxy, aryl, arylalkyl, heteroaryl, amino, alkylamino, dialkylamino, morpholino, piperidino, pyrrolidin-1-yl, piperazin-1-yl, or other functionality.
[0046] The term "alkyl" refers to a cyclic, branched, or straight chain alkyl group containing only carbon and hydrogen, and unless otherwise mentioned contains one to twelve carbon atoms. This term is further exemplified by groups such as methyl, ethyl, n-propyl, isobutyl, t-butyl, pentyl, pivalyl, heptyl, adamantyl, and cyclopentyl. Alkyl groups can either be unsubstituted or substituted with one or more substituents, e.g. halogen, alkyl, alkoxy, alkylthio, trifluoromethyl, acyloxy, hydroxy, mercapto, carboxy, aryloxy, aryloxy, aryl, arylalkyl, heteroaryl, amino, alkylamino, dialkylamino, morpholino, piperidino, pyrrolidin-1-yl, piperazin-1-yl, or other functionality.
[0047] The term "lower alkyl" refers to a cyclic, branched or straight chain monovalent alkyl radical of one to seven carbon atoms. This term is further exemplified by such radicals as methyl, ethyl, n-propyl, i-propyl, n-butyl, t-butyl, i-butyl (or 2-methylpropyl), cyclopropylmethyl, i-amyl, n-amyl, hexyl and heptyl. Lower alkyl groups can also be unsubstituted or substituted, where a specific example of a substituted alkyl is 1,1-dimethyl heptyl.
[0048] The auxin transport inhibitor may, in certain embodiments of the invention, be Naptalam, which is also known as N-1-naphthylphthalamic acid of the chemical formula:
##STR00004##
including agriculturally acceptable salts, esters, or derivatives thereof.
[0049] Certain auxin transport inhibitors, including NPA and diflufenzopyr, may have functional groups which can be ionized, and accordingly can also be used in the form of an agriculturally acceptable salt. In general, an "agriculturally acceptable" salt will be a salt form whose cation has no adverse effect on the action of the active compound. For example, agriculturally acceptable cations may include ions of the alkali metals, such as lithium, sodium and potassium; of the alkaline earth metals, such as calcium and magnesium; of the transition metals, such as manganese, copper, zinc and iron; ammonium; substituted ammonium (organoammonium) ions in which one to four hydrogen atoms are replaced by C.sub.1-C.sub.8-alkyl, C.sub.1-C.sub.4-alkyl, hydroxy-C.sub.1-C.sub.4-alkyl, in particular hydroxy-C.sub.2-C.sub.4-alkyl, C.sub.1-C.sub.4-alkoxy-C.sub.1-C.sub.4-alkyl, in particular C.sub.1-C.sub.4-alkoxy-C.sub.2-C.sub.4-alkyl, hydroxy-C.sub.1-C.sub.4-alkoxy-C.sub.1-C.sub.4-alkyl, in particular hydroxy-C.sub.2-C.sub.4-alkoxy-C.sub.2-C.sub.4-alkyl, phenyl or benzyl, preferably ammonium, methylammonium, isopropylammonium, dimethylammonium, diisopropylammonium, trimethylammonium, tetramethylammonium, tetraethylammonium, tetrabutylammonium, pentylammonium, hexylammonium, heptylammonium, 2-hydroxyethylammonium (olamine salt), 2-(2-hydroxyethoxy)eth-1-ylammonium (diglycolamine salt), di(2-hydroxyeth-1-yl)ammonium (=diethanolammonium salt or diolamine salt), tri(2-hydroxyethyl)ammonium (=triethanolammonium salt or trolamine salt), mono-, di- and tri(hydroxypropyl)ammonium (=mono-, di- and tripropanolammonium), benzyltrimethylammonium, benzyltriethylammonium; phosphonium ions; or sulfonium ions, preferably tri(C.sub.1-C.sub.4-alkyl)sulfonium such as trimethylsulfonium, and sulfoxonium ions, preferably tri (C.sub.1-C.sub.4-alkyl)sulfoxonium.
[0050] Auxin transport inhibitors, including N-1-naphthylphthalamic acid, may also carry a carboxyl group that can also be employed in the form of agriculturally acceptable derivatives, for example as amides such as mono- or di-C.sub.1-C.sub.6-alkylamides or arylamides, as esters, for example as allyl esters, propargyl esters, C.sub.1-C.sub.10-alkyl esters or alkoxyalkyl esters, and also as thioesters, for example as C.sub.1-C.sub.10-alkyl thioesters. Preferred mono- and di-C.sub.1-C.sub.6-alkylamides are the methyl- and the dimethylamides. Preferred arylamides are, for example, the anilidines and the 2-chloroanilides. Preferred alkyl esters are, for example, the methyl, ethyl, propyl, isopropyl, butyl, isobutyl, pentyl, mexyl (1-methylhexyl) or isooctyl (2-ethylhexyl) esters. Preferred C.sub.1-C.sub.4-alkoxy-C.sub.1-C.sub.4-alkyl esters are the straight-chain or branched C.sub.1-C.sub.4-alkoxyethyl esters, for example the methoxyethyl, ethoxyethyl or butoxyethyl (butoyl) esters. An example of the straight-chain or branched C.sub.1-C.sub.10-alkyl thioesters is the ethyl thioester. Preferred derivatives are the esters.
[0051] The compositions of the invention preferably comprise N-1-naphthylphthalamic acid, or a salt or ester thereof. Suitable salts of N-1-naphthylphthalamic acid include those salts where the counterion is an agriculturally acceptable cation. In certain non-limiting embodiments, suitable salts of N-1-naphthylphthalamic acid may include the alkali metal salts, in particular the sodium and the potassium salts, and the ammonium or substituted ammonium salts, in particular the ammonium salt, the diethanolammonium salt, the diglycolammonion salt, the isopropylammonium salt, the dimethylammonium salt or the triethanolammonium salt.
[0052] The above-described compositions may be applied using any number of techniques as would be customary to one of skill in the art. Without wishing to be limiting in any way, the compositions may be applied e.g. by spraying or foliar application. A variety of spray application techniques are known and would be apparent to those of skill in the art. For example, the composition may be applied with water as a carrier, and applied to the soil and/or the plants at desired spray rates. In other embodiments of the invention, the composition may be applied by foliar application using an appropriate spray mixture.
[0053] It is also envisioned that the auxin transport inhibitor described herein may be used in combination with other compounds or agents, for instance, herbicidal agents, compound synergistic, fertilizers and the like. Such combinations may be formulated into a single composition, or applied separately.
[0054] Also provided herein is a method of pre-treating a plant to increase saccharide release from a plant tissue by hydrolysis, the method comprising administering an auxin transport inhibitor, or a composition as described herein, in an amount effective to increase sugar release from the plant tissue by hydrolysis.
[0055] In an embodiment of the above method, the auxin transport inhibitor or composition is administered in an amount effective to increase saccharide release from cellulose, starch, or both, in said plant tissue.
[0056] In addition, the method may further comprise a step of hydrolyzing cellulose, starch, or both, from the plant tissue, to produce monosaccharides, disaccharides, polysaccharides, or a combination thereof.
[0057] In a further non-limiting embodiment, the auxin transport inhibitor or composition may be applied by spraying, foliar application, or a combination thereof.
[0058] Also provided herein is a method of screening for plants having an increased saccharide release phenotype, a reduced cellulose crystallinity phenotype, or both, the method comprising:
[0059] treating at least one plant or plant seed with at least one cellulose biosynthetic inhibitor (CBI) in an amount effective to select for CBI-resistance in said plant or plant seed;
[0060] germinating the plant seeds and/or incubating the plant and selecting for CBI-resistant mutant plants, or seeds thereof; and
[0061] measuring saccharide release, cellulose crystallinity, or both, in the CBI-resistant mutant plants to identify an increased saccharide release phenotype, a reduced cellulose crystallinity phenotype, or both.
[0062] In a non-limiting embodiment of the method, the cellulose crystallinity may be measured using an X-ray diffractometer, for example, to determine a proportion of crystalline cellulose relative to a proportion of amorphous cellulose in a tissue of said CBI-mutagenized plant.
[0063] In a further non-limiting embodiment of the method, the tissue may be a stem and/or leaf tissue.
[0064] Without wishing to be limiting, the cellulose biosynthetic inhibitor may be of a nitrile, benzamide, triazolocarboxamide, or quinoline carboxylic acid class of cellulose biosynthetic inhibitor. For example, the cellulose biosynthetic inhibitor may be one or more of dichlobenil, chlorthiamid, isoxaben, flupoxam, quinclorac, or a salt, ester, or derivative thereof. In particular embodiments, the cellulose biosynthetic inhibitor may preferably comprise isoxaben or flupoxam.
[0065] Also described are uses of the compositions described herein for pre-treating a plant or plant tissue to increase saccharide release from the plant tissue by hydrolysis. For example, the plant or plant tissue may comprise biomass, e.g. for production of biofuel (such as bioethanol), bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textiles, monosaccharides, disaccharides, polysaccharides, other chemicals, as well as biocosmetics.
[0066] Also described herein are plant mutations which result in improved saccharide release upon hydrolysis treatment. Without limitation, the mutations may include one or more of the following mutations in maize or Arabidopsis genes, or equivalent genes having corresponding gene products in other plant species:
[0067] barren inflorescence2 (bif2), comprising a mutation in the bif2 sequence corresponding to SEQ ID NO: 1 reducing or substantially inhibiting bif2 function;
[0068] barren stalk1 (BA1), comprising a mutation in the BA1 sequence corresponding to SEQ ID NO: 3, reducing or substantially inhibiting BA1 function;
[0069] mur11-1 comprising a mutation corresponding to R278H in SEQ ID NO: 5, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant mur11-1 polypeptide or fragment thereof;
[0070] pid-100 comprising a mutation corresponding to D223N in SEQ ID NO: 7, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant pid-100 polypeptide or fragment thereof;
[0071] dpe2-100, comprising a mutation in the dpe2-100 sequence which reduces or substantially inhibits dpe2-100 function, such as but not limited to the W323Stop mutation in SEQ ID NO: 9, including nucleotides encoding the mutant dpe2-100 sequence;
[0072] dpe2-101 comprising a mutation corresponding to R561K in SEQ ID NO: 11, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant dpe2-101 polypeptide or fragment thereof;
[0073] sex4-100, comprising a mutation in the sex4-100 sequence which reduces or substantially inhibits sex4-100 function, such as but not limited to the sex4-100 splice junction mutant corresponding to SEQ ID NO: 13, or a fragment thereof containing a mutation corresponding to G2194A in SEQ ID NO: 13, including nucleic acid sequences that are 80% identical (or 85%, more particularly 90%, even more particularly 99% identical) thereto;
[0074] fpx 2-1 comprising a mutation corresponding to G1013R in SEQ ID NO: 15, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant fpx 2-1 polypeptide or fragment thereof;
[0075] fpx 2-2 comprising a mutation corresponding to P1010L in SEQ ID NO: 17, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant fpx 2-2 polypeptide or fragment thereof;
[0076] fpx 2-3 comprising a mutation corresponding to G1009D in SEQ ID NO: 19, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant fpx 2-3 polypeptide or fragment thereof;
[0077] fpx 1-1 comprising a mutation corresponding to S1040L in SEQ ID NO: 21, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant fpx 1-1 polypeptide or fragment thereof;
[0078] fpx 1-2 comprising a mutation corresponding to S1037F in SEQ ID NO: 23, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant fpx 1-2 polypeptide or fragment thereof;
[0079] fpx 1-3 comprising a mutation corresponding to S983F in SEQ ID NO: 25, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant fpx 1-3 polypeptide or fragment thereof;
[0080] ixr1-3 comprising a mutation corresponding to G998S in SEQ ID NO: 27, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant ixr1-3 polypeptide or fragment thereof;
[0081] ixr1-4 comprising a mutation corresponding to R806K in SEQ ID NO: 29, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant ixr1-4 polypeptide or fragment thereof;
[0082] ixr1-5 comprising a mutation corresponding to L797F in SEQ ID NO: 31, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant ixr1-5 polypeptide or fragment thereof;
[0083] ixr1-6 comprising a mutation corresponding to S377F in SEQ ID NO: 33, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant ixr1-6 polypeptide or fragment thereof;
[0084] ixr1-7 comprising a mutation corresponding to R276H in SEQ ID NO: 35, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant ixr1-7 polypeptide or fragment thereof; and
[0085] ixr2-2 polypeptide comprising a mutation corresponding to 51002F in SEQ ID NO: 37, including polypeptides, polypeptide fragments, and nucleotides encoding the mutant ixr2-2 polypeptide or fragment thereof.
[0086] The above listed mutant nucleotide and polypeptide sequences may, in certain embodiments, be provided in isolated form, and may have 80% identity to their respective sequences listed, whereas in other embodiments the sequence identity may be higher, including 85%, 90%, or even 99% identical, including identity ranges intervening these integers. In addition, these same mutations may be made in corresponding sequences from other species, including both monocot and dicot species such as but not limited to corn (Zea mays), sugar cane (Saccharum sp.), switchgrass (Panicum virgatum) and other grass species (Miscanthus), other species used in bioethanol production, as well as Arabidopsis and other dicotyledonous plant species.
[0087] Each of the above-listed mutants may also be provided in the form, for example, of a plant or seed thereof having a phenotype characterized by increased saccharide release from plant tissue by hydrolysis. In one non-limiting example, which can be applied throughout the above list of mutations, the plant or seed thereof may comprise a mutant barren inflorescence2 (bif2) gene comprising a mutation in the bif2 sequence corresponding to SEQ ID NO: 1 which reduces or substantially inhibits bif2 function. The plant or seed thereof may accordingly be used to produce biomass for production of bioethanol, bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textiles, monosaccharides, disaccharides, polysaccharides, or biocosmetics, preferably for production of bioethanol. The plant or seed thereof may also be provided, in non-limiting embodiments, in a commercial package comprising the plant or seed thereof, wherein the commercial package is for producing biomass for production of bioethanol, bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textiles, monosaccharides, disaccharides, polysaccharides, or biocosmetics.
[0088] Also provided herein are vectors, such as but not limited to plasmids, which include a nucleic acid or encoding a polypeptide sequence of one or more of the mutants described herein. Host cells comprising such vectors, or a nucleic acid encoding a polypeptide sequence of one or more of the mutants described herein are also provided. Similarly, seeds and plants may be provided which comprise such vectors and/or nucleic acids.
[0089] The seeds or plants containing these mutant sequences, or which express the mutant polypeptides described herein, have a phenotype which is characterized by an increased saccharide release from the plant tissue by hydrolysis.
[0090] Thus, the nucleic acids or polypeptides, the vectors, the host cells, the seeds and plants described herein can be used to produce plant tissues with a phenotype characterized by increased saccharide release by hydrolysis. These nucleic acids, polypeptides, vectors, host cells, seeds and plants are especially useful in producing biomass for production of biofuels (such as bioethanol), as well as bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textiles, monosaccharides, disaccharides, polysaccharides, and biocosmetics.
Experiments:
[0091] A high-throughput strategy, using the model plant Arabidopsis, was used to identify mutants with improved sugar release from plant biomass. Molecular analysis showed a variety of processes, including starch degradation, cell wall composition and polar transport of the plant hormone auxin, can contribute to this improved saccharification. Genetic or chemical inhibition of polar auxin transport in maize is also shown to result in increased sugar release from plant tissues. This information not only uncovers new functions that contribute to cell wall integrity but also demonstrates that information gleaned from genetic approaches involving Arabidopsis can be directly translated to monocotyledonous biofuel crops, such as but not limited to maize, to improve sugar extractability from lignocellulosic biomass.
[0092] The high throughput strategy involved a forward genetic screen to identify genotypes that showed an improved sugar release under mild acid treatment, and identified a large collection of lines. The frequency of mutant identification (0.3%) and lack of many alleles within the collection suggested the screen was not saturated, and that more genetic variation remains to be discovered.
[0093] The identification of mutants that over-accumulate starch in vegetative tissues presents an unforeseen approach with respect to the improvement of fermentable sugars for biofuel production. Because starch is a simple easily accessible glycopolymer compared to lignocellulose, it is efficiently converted to sugar for ethanol production. However, unlike reproductive tissues such as corn kernels, starch levels in stems and leaves are limited, and therefore these vegetative tissues have not previously been considered a useful starch based feedstock.
[0094] The inventors have shown that genetically increasing vegetative starch levels can contribute to the overall fermentable sugar yields during acid pretreatment. Because this sugar source is not lignocellulosic, in principle its genetic manipulation should be a stackable trait with other lignocellulosic feedstock technologies. The observation that only some starch excess mutants were identified in the screens, however, suggests that the relationship between starch and acid-dependent sugar release is complex. Without wishing to be bound by theory, it is possible that certain mutants accumulate starch as a secondary consequence of a mutation. For example, not all sugar release from mur11 mutants is explained through starch accumulation, which is consistent with this mutant also having a defective cell wall. It is also possible that various starch accumulating mutants accumulate slightly different forms of starch, and that these forms may not be equally accessible to mild acid hydrolysis.
[0095] An association between cell walls and auxin has existed for some time with respect to the role of this hormone in cell expansion. More recently, the demonstration that mutating the cellulose synthase gene CESA results in mislocalization of PIN1 efflux carriers further suggests a close linkage between auxin transport and cell wall synthesis. As shown in the experiments below, pinoid and additional pin-shaped inflorescence mutants have increased cell wall accessibility, which reveals an important role for auxin in maintaining the integrity of the cell wall. Interestingly, this association is limited to auxin mutants that display a pin-shaped inflorescence phenotype, which may mean that altering cell wall integrity contributes to aberrant inflorescence development.
[0096] The acid hydrolysis screen only identified pinoid loss-of-function mutants. Presumably, additional Arabidopsis mutants that form pin-shaped inflorescences such as pin1 or mp were not found because, unlike pinoid, these mutants are completely penetrant and therefore infertile. Although this makes propagation of these lines problematic, the pin-shaped phenotype may have advantages with respect to preventing gene flow among commercially grown transgenic crops.
[0097] The inventors also show that treatment of wild type Arabidopsis and maize plants with the polar auxin transport inhibitor, 1-N-Naphthylphthalamic acid (NPA), also results in increased saccharification. In contrast to making transgenic plants, which can be costly, time-consuming and often involve constitutive phenotypes, chemically-induced phenotypes using compounds such as NPA allows for more tailored temporal and spatial control of the cell wall composition. Moreover, NPA, which is already an approved pre-emergence herbicide, can be applied broadly, for example, to bio-energy crops that have rudimentary genetics, or that are difficult to transform.
[0098] Finally, the ability to increase saccharification using NPA suggests chemical genetic screening using Arabidopsis can be applied to develop further chemical leads that may be useful in pretreatment lignocellulosic processing. The experiments presented here show that the results obtained in Arabidopsis can be successfully translated to maize, and thus other monocot species, such as but not limited to sugarcane (Saccharum sp.), Miscanthus or switchgrass, are expected to show similar results.
Example 1: Screening for Wall Hydrolysis Sensitive Mutants
[0099] A colorimetric assay was developed that allowed for the visualization of saccharification from plant tissue incubated in dilute acid at room temperature for one hour.
[0100] Using an anthrone reagent, which turns blue or green in the presence of sugars, (in this example, hexoses) an average sugar release (4.1.+-.0.1 .mu.g sugar/leaf disc) from 100 wild type leaf samples was determined (FIG. 5). With this baseline, the assay was applied against a collection of 30 known cell wall mutants as indexed by the Plant Cell Wall Biosynthesis Research Network (WallBioNet) (FIG. 1(b)).
[0101] Table 1 shows known cell wall mutants and their gene products. MUR11 was molecularly identified in this study and is shown in the table in bold.
TABLE-US-00001 TABLE 1 Mutant AGI GENE csld3-1 At3g03050 CELLULOSE SYNTHASE-LIKE 3 eld1-1 At3g08550 ELONGATION DEFECTIVE 1 fk At3g52940 FACKEL irx1 At4g18780 IRREGULAR XYLEM 1/CESA8 irx3 At5g17420 IRREGULAR XYLEM 3/CESA7/MUR10 irx4 At1g15950 IRREGULAR XYLEM 4/CINNAMOYL COA REDUCTASE 1 ixr1-1 At5g05170 ISOXABEN RESISTANT 1/CESA3 lxr1-2 At5g05170 ISOXABEN RESISTANT 1/CESA3 lxr2-1 At5g64740 ISOXABEN RESISTANT 2/ PROCUSTE1/CESA6 knf At1g67490 KNOPF mur1-1 At3g51160 GDP-D_MANNOSE-4,6-DEHYDRATASE mur2-1 At2g03220 FUCOSYLTRANSFERASE 1 mur3-2 At2g20370 XYLOGLUCAN GALACTOSYLTRANSFERASE mur4-2 At1g30620 UDP-D-XYLOSE 4-EPIMERASE mur5-1 MURUS 5 mur6-1 MURUS 6 mur7-1 MURUS 7 mur8-1 MURUS 8 mur9-1 MURUS 9 mur10-2 At5g17420 CESA7/IRX3 mur11-1 At3g59770 SUPPRESSOR OF ACTIN 9 pmr4-1 At4g03550 POWDERY MILDEW RESISTANT 4 pmr5 pmr6-3 At5g58600; POWDERY MILDEW RESISTANT 5; At3g54920 POWDERY MILDEW RESISTANT 6 pnt1-1 At5g22130 PEANUT 1 prc1-1 At5g64740 PROCUSTE1/CESA6/IXR2 rhd1-1 At1g64440 ROOT HAIR DEFECTIVE 1/ UDP0GLUCOSE 4-EPIMERASE rhd3-1 At3g13870 ROOT HAIR DEFECTIVE 3 rsw2-1 At5g49720 RADIAL SWELLING 2/IXR2 rsw3-1 At5g63840 RADIAL SWELLING 3 sos5-1 At3g46550 SALT OVERLY SENSITIVE 5 vtc1-1 At2g39770 VITAMIN C DEFECTIVE 1/ GDP-MANNOSE PYROPHOSPHORYLASE
[0102] Of the 30 mutants tested, only mur11-1 consistently showed increased saccharification relative to wild type. Map-based cloning of the mur11-1 allele identified a transition mutation (G.fwdarw.A) in a conserved domain of the previously characterized gene, SUPPRESSOR OF ACTIN9 (SAC9), which encodes a phosphoinositide phosphatase (FIG. 6). Table 2 shows the genotypes used in the study.
TABLE-US-00002 TABLE 2 Allele Lesion.sup.a Genomic position.sup.b Amino acid mur11-1 G .fwdarw. A 1157 bp R.sup.278 .fwdarw. H (SEQ ID NO: 6) (SEQ ID NO: 5) sac9-3 SALK_058870 pid-100 G .fwdarw. A 974 bp D.sup.223 .fwdarw. N (SEQ ID NO: 8) (SEQ ID NO: 7) pid-14 SALK_049736 pid-2 CS8063 pin1-1; ttg-1 CS8065 pin1 SALK_047613 arf5-2 SALK_021319 dpe2-100 G .fwdarw. A 1457 bp W.sup.323 .fwdarw. Stop (SEQ ID NO: 10) (SEQ ID NO: 9) dpe2-101 G .fwdarw. A 3201 bp R.sup.561 .fwdarw. K (SEQ ID NO: 12) (SEQ ID NO: 11) dpe2-5 SALK_073273 sex4-100 G .fwdarw. A 2194 bp Splice junction (SEQ ID NO: 13) sex4-5 SALK_126784 sex1-100 SALK_077211 isa3-3 CS88929 bam1 SALK_039895 bam2 SALK_020838 bam3 SALK_041214 bam4 SALK_037355 .sup.aType of lesion due to EMS mutagenesis or T-DNA insertion. .sup.bPosition of base pair change is given from the start codon of genes isolated from the whs primary screen.
[0103] This result was verified by demonstrating that other mur11 alleles also showed improved saccharification by acid hydrolysis (FIG. 1(c)). Because previous biochemical analysis of sac9 mutants suggests this phosphatase modulates phosphoinositide signaling during stress, the original MUR11 cell wall defects may be a secondary consequence of the mutation. With the finding that mutations in SAC9 gave increased sugar release it was decided to assay loss-of-function alleles of the complete SAC family of genes in Arabidopsis (sac1-sac9). However, no other SAC genes were found that contributed to lignocellulose sugar release, which is perhaps not surprising since SAC9 is only distantly related to the other SAC members of this family (FIG. 7)
[0104] The scarcity of improved sugar release from the cell wall mutant collection underscored the limited utility of a reverse genetic approach to identify increased saccharification mutants using weak acid hydrolysis. The mutational space was therefore expanded by applying the acid screen to a population of EMS-mutagenized Arabidopsis seedlings (FIG. 2(a)).
[0105] The screen was limited to plants that showed no obvious growth or developmental defects, since such defects would compromise the application value of the genes identified. From approximately 23,000 M2 plants representing 32 M1 parental groups, 63 mutants were identified that showed increased saccharification (Table 3). Designated wall hydrolysis sensitive (whs), the mutant lines were sub-categorized into four groups based on the amount of sugar they released per fresh leaf disc.
TABLE-US-00003 TABLE 3 Amount of hexose released (.mu.g/fresh leaf disc) 4.5-9 9.1-13 13.1-17 17.1-21 # of 30 21 10 3 mutants whs34 whs49 whs14 whs29 whs4 whs1 whs35 whs50 whs15 whs30 whs5 whs2 whs36 whs51 whs16 whs31 whs6 whs3 whs37 whs52 whs17 whs32 whs7 whs38 whs53 whs18 whs33 whs8 whs39 whs54 whs19 mur11-1 whs9 whs40 whs55 whs20 whs10 whs41 whs56 whs21 whs11 whs42 whs57 whs22 whs12 whs43 whs58 whs23 whs13 whs44 whs59 whs24 whs45 whs60 whs25 whs46 whs61 whs26 whs47 whs62 whs27 whs48 whs63 whs28
[0106] To determine if any of these mutants showed defects in cell wall sugars, gas chromatographic analysis of alditol acetates was performed to identify changes in monosaccharide composition of the cell wall (FIG. 2(b)). Interestingly many of the whs lines showed increases in rhamnose and fucose compared to wild type samples, which indicated that many of the mutations did perturb cell wall composition. Next, the mutant collection was further studied by enzymatic hydrolysis assays using cellulase and cellobiase, to assay cellulose hydrolysis, cellulase, cellobiase and xylanase, to monitor hemicellulose break down, and a cocktail of cellulase, cellobiase, xylanase and peroxidase which, in addition to cellulose and hemicellulose, degrades lignin (FIG. 2(c)). The presence of starch in the samples was also assayed, as this source of carbon could potentially contribute to an increased sugar release phenotype in these assays. Finally, in addition to the fresh leaf material, an assay was carried out on senesced whole plant tissue hydrolyzed with 0.2 M sulphuric acid, biomass that is more akin to field grown plant material and acid concentrations that are more similar to industrial standards.
[0107] Hierarchical clustering of the various assays broadly identified three subcategories. One category consisted of five mutant lines (whs27, whs6, whs4, whs20, whs36) that showed good sugar release in both fresh and senesced tissue acid hydrolysis. A second category consisted of twelve lines (mur11-1, whs1, whs43, whs53, whs14, whs2, whs5, whs21, whs3, whs60, whs9, whs22) which hyper-accumulated starch. Within this grouping, two lines (whs9 and whs22) were of particular interest as they also showed excess sugar release in all enzymatic assays. The remaining mutant lines did not show good saccharification in senesced tissues or in any enzymatic assay and therefore were not further studied.
Example 2: Specific Genes Involved in Starch Metabolism Improve Saccharification
[0108] To understand the molecular nature of the mutant category that showed both a high saccharification and increased starch accumulation, map-based cloning of the mutant alleles was performed on three lines (whs1, whs22 and whs9). The whs1 and whs22 lines contained allelic mutations in the DISPROPORTIONATING ENZYME 2 (DPE2) gene, which encodes a glucosyltransferase required for starch degradation, and these lines were subsequently re-designated dpe2-100 and dpe2-101 respectively (FIG. 6, Table 2). Subsequent molecular analysis of lines whs3, whs5, whs14, whs21 showed they were siblings of whs1. The whs9 line contained a new allele of STARCH EXCESS 4 (sex4-100), which encodes a glycan phosphatase involved in starch degradation (FIG. 6, Table 2).
[0109] The identification of these genes was validated by showing that T-DNA knockout insertion alleles in both DPE2 and SEX4 also showed improved sugar release by acid hydrolysis (FIG. 3(a)).
[0110] The identification of dpe2 and sex4 in the screens suggested that starch could be a source of acid-dependent sugar release. The contribution of starch to saccharification was determined by treating senesced whole plant tissue with .alpha.-amylase, which specifically converts starch to glucose and maltose (FIG. 3(b)). Once tissue was devoid of starch, it was subjected to acid hydrolysis to determine the residual hexose release (FIG. 3(b)). This analysis clearly showed that the improved sugar release observed in both dpe2 and sex4 mutants can be accounted for by their increased starch content. By contrast, the mur11-1 samples showed a higher sugar release than wild type even after a-amylase treatment, suggesting some of the increased saccharification is due to polymers other than starch.
[0111] The connection of starch over-accumulation and increased saccharification by acid hydrolysis was further explored by subjecting a collection of well characterized Arabidopsis starch mutants to the acid hydrolysis assay. The analysis included starch-excess 1 (sex1), which is defective in the regulation of starch degradation, isoamylase 3 (isa3), which is defective in a starch debranching enzyme 15, and b-amylase (barn) mutants, which are defective in the breakdown of starch (bam1 through 4) (FIG. 3(a)). Surprisingly, only alleles of mur11, dpe2 and sex4 mutants showed increased sugar release.
Example 3: Inhibiting Polar Auxin Transport Improves Saccharification
[0112] Among those lines which showed good sugar release in both fresh and senesced tissue, one line (whs20) in particular stood out because it showed an incompletely penetrant pin-shaped inflorescence phenotype that was reminiscent of mutations that perturb the polar transport of the plant hormone auxin. Subsequent molecular analysis of this line identified a mutation in the PINOID (PID) gene (FIG. 6; Table 2). PID encodes a serine threonine protein kinase that is thought to play a role in the cellular localization of the PIN efflux auxin carrier. Mutations in other genes that result in a pin-shaped phenotype, such as pin1 and mp (also known as arf5), also show an improved saccharification phenotype (FIG. 4(a)). By contrast, other auxin response factor mutants defective in auxin signalling (arf6, 7, 8 and 19), did not show increase sugar release, however, these mutants also do not have the pin inflorescence phenotype. Furthermore, none of the single, double or triple combination of arf mutants tested displayed an increase in cell wall accessibility (FIG. 8).
[0113] Finally, maize mutants with barren inflorescence phenotypes were tested. Barren inflorescence2 (bif2) is a co-ortholog of PID in Arabidopsis 20 and barren stalk1 (ba1), a basic helix-loop-helix transcription factor, has been shown to be a downstream target of BIF2 in maize. Consistent with the results from Arabidopsis, both bif2 (SEQ ID NOS: 1 and 2) and bat (SEQ ID NOS: 3 and 4) maize inflorescence mutants show an improved saccharification phenotype (FIG. 4(b)).
[0114] The connection between auxin transport and increased sugar release was further probed using a specific inhibitor of auxin transport N-1-naphthylphthalamic acid (NPA). Application of varying concentrations of NPA to wild type Arabidopsis seedlings resulted in a 1.5 to 2 fold increase in the release of sugars relative to untreated plants (FIG. 4(c)). More importantly, the ability to chemically perturb auxin transport allowed the expansion of the analysis to Zea mays (maize). Application of NPA to two different cultivars of maize also resulted in a significant increase in cell wall accessibility (FIG. 4(d)). Together, these results provide strong support that genetic or chemical manipulation of auxin transport increases sugar release. Moreover, it appears that genes and processes identified using Arabidopsis can be transferred to maize and potentially other monocot species dedicated to biofuel production.
Example 4: Screening for Novel CELLULOSE SYNTHASE (CESA) Alleles
[0115] Further genetic screens aimed at identifying resistance to cellulose biosynthetic inhibitors (CBIs) were also conducted. The aim of conducting resistance screens can be to identify potential inhibitor targets. In the case of some CBIs, like isoxaben, resistance screens have been carried out using high concentrations of the inhibitor with the aim of identifying the target protein. Indeed, high resistance to isoxaben is only possible if certain CELLULOSE SYNTHASE (CESA) genes are altered by mutation. An unforeseen consequence of some of the resistance alleles has been to reduce overall cellulose crystallinity, which ultimately leads to overall improved saccharification of starting cell wall material. With this information as a starting point, the inventors sought to identify novel CESA alleles by conducting additional resistance screens, but utilizing much lower CBI concentrations than in the original screens.
[0116] EMS mutagenized plants (M2) were screened on 20 nM of two different CBIs, isoxaben or flupoxam. Those plants that showed resistance at this concentration of either CBI were then retested in the M3 generation. In total, 2 million M2 seeds were screened and 12 new CESA alleles were isolated, 3 in CESA1, 8 in CESA3 and 1 in CESA6. All of the new mutant alleles led to single amino acid substitutions, which could not have been predicted a priori. Interestingly, one of these alleles led to an amino acid substitution in the proposed catalytic site of the enzyme (ixr1-4). Table 4 shows a summary of the identified mutant alleles.
TABLE-US-00004 TABLE 4 Concentration at which root Genetic length is Allele Background Gene Mutation 50% of wt wild-type Ler -- -- 5 nM wild-type Col-o -- -- 5 nM Isoxaben Resistant ixr1-1 (published) Col-0 CesA3 G(998)D >1 .mu.M ixr1-2 (published) Col-0 CesA3 T(942)I 500 nM ixr1-3 Ler CesA3 G(998)S 100 nM (SEQ ID NOS: 26 and 27) ixr1-4 Ler CesA3 R(806)K 50 nM (SEQ ID NOS: 28 and 29) ixr1-5 Ler CesA3 L(797)F 10 nM (SEQ ID NOS: 30 and 31) ixr1-6 Ler CesA3 S(377)F 50 nM (SEQ ID NOS: 32 and 33) ixr1-7 Ler CesA3 R(276)H 50 nM (SEQ ID NOS: 34 and 35) ixr2-1 (published) Col-0 CesA6 R(1064)W 50 nM ixr2-2 Ler CesA6 S(1002)F 10 nM (SEQ ID NOS: 36 and 37) Flupoxam resistant (Described in http://www.jstor.org/stable/4046145 with recent work in DOI: 10.1111/j.1365-313X.2011.04619.x) fpx 1-1 Col-o CesA3 S(1040)L 500 nM (SEQ ID NOS: 20 and 21) fpx 1-2 Ler CesA3 S(1037)F >1 .mu.M (SEQ ID NOS: 22 and 23) fpx 1-3 Ler CesA3 S(983)F 100 nM (SEQ ID NOS: 24 and 25) fpx 2-1 Ler CesA1 G(1013)R >1 .mu.M (SEQ ID NOS: 14 and 15) fpx 2-2 Ler CesA1 P(1010)L 100-500 nM (SEQ ID NOS: 16 and 17) fpx 2-3 Ler CesA1 G(1009)D 1 .mu.M (SEQ ID NOS: 18 and 19)
[0117] The mutants were further characterized by determining their relative cellulose crystallinity, as well as their saccharification profiles. This was accomplished by using an X-ray diffractometer to measure the proportion of crystalline cellulose relative to the proportion of amorphous cellulose in stem tissue (FIG. 9). To determine the saccharification properties of the mutant lines, commercial enzyme cocktails were used to digest cell wall preparations and determine the amount of sugar released (FIG. 10). It is significant that many of these alleles, to a greater or lesser extent, showed reduced cellulose crystallinity and in addition were also more amenable to enzyme hydrolysis (FIG. 9 and FIG. 10). However, some lines with apparently unaltered cellulose crystallinity did show improved hydrolysis (e.g. fpx1-1, fpx1-2, fpx 1-3) or some lines with reduced crystallinity did not show improved hydrolysis (e.g. ixr1-7). This indicates that there isn't a tight correlation between cellulose crystallinity and hydrolysis properties.
[0118] The value of screening for CESA alleles using this methodology is twofold. Novel CESA alleles can be easily identified, many of which cause cellulose hydrolysis to improve, in a high-throughput manner. The fact that no a priori assumptions about CESA function and structure are required makes this approach particularly useful. In addition, it should be possible to conduct similar screens on target plants to create modified biomass feedstocks directly without the need for generating transgenic plants. One potential limitation is that the CBI that is used may need to specifically target the CESA complex in that plant. For example, the sensitivity to isoxaben is lower in grasses than it is in broadleaf species, which might indicate that alternative CB's would be required for conducting resistance screens in grasses.
Examples 1-5: Materials and Methods
Plant Materials and Growth Conditions
[0119] Arabidopsis thaliana M2 ecotype Columbia seeds mutagenized by ethyl methane sulfonate (EMS) were purchased from Lehle Seeds (Round Rock, Tex.). EMS mutant alleles and T-DNA insertions were provided by the Arabidopsis Biological Resource Centre (Ohio State University, Columbus, USA). Seeds were surface sterilized in 50% bleach, 0.01% Tween.TM.-20 for 5 min, rinsed 5 times with sterile water and stored in the dark at 4.degree. C. for 4 days to synchronize germination. Seeds were plated on 0.5.times. strength Murashige and Skoog (MS) agar plates and sealed with surgical tape under continuous light at room temperature. The maize mutants, bif2-N2354 (stock #108A) and bal (stock #318B) in the W23/M14 genetic background, were obtained from the Maize Genetics Cooperation Stock Center.
Anthrone Mutant Screen
[0120] The M2 generation of EMS-mutagenized Arabidopsis (Col-0) seeds were chilled for 4 days and sowed onto 0.5.times.MS plates placed vertically under continuous light conditions at room temperature. After 7 days, the seedlings were transferred to soil in 96-well flats. Leaf 3 or 4 was excised from 21 day-old plants using a hole punch and placed abaxial side up in a 96-well plate corresponding to the same coordinates as the flat. Samples were submerged in 200 .mu.l of 1M H.sub.2SO.sub.4 and incubated at room temperature for 1 hour. A 50 .mu.l aliquot was removed and incubated with 100 .mu.l of 0.2% anthrone in concentrated H.sub.2SO.sub.4. The samples were incubated at 100.degree. C. for 5 minutes, cooled and the absorbance was read at 660 nm. Approximately 22,000 seedlings from 32 pools were screened from which 63 wall hydrolysis sensitive (whs) mutants were identified as having an absorbance reading greater than 2 standard deviations from wild type (FIG. 5). whs mutants were retested in the M3 generation.
Enzymatic Digestion
[0121] Approximately 0.1-0.2 g of senesced tissue was washed twice with water for 30 min at 80.degree. C. and washed with 70% ethanol at 80.degree. C. for 1 hour. The tissue was rinsed with acetone and oven dried at 60.degree. C. for 2 days. Cellulase from Trichoderma reesi ATCC 26921 and the Cellobiase (Novozyme 188) activities were empirically determined to be 111 FPU/mL and 500 U/mL, respectively. Glucose levels were determined via anthrone assay and cellobiase activity was determined by measuring p-nitro phenol (PNP) absorbance levels at 400 nm. 15 FPU/g of tissue of cellulase and 80 U/g of cellobiase were used on 5 mg of tissue/tube with a total volume of 200 .mu.L in triplicates. The samples were incubated with a final 10.times. dilution of cellulase and cellobiase at 50.degree. C. for 24 hours and heat inactivated at 100.degree. C. for 5 min. Once cooled on ice, the samples were centrifuged and the supernatant was analyzed for its glucose concentration by the Glucose (HK) Assay Kit (GAHK20-1KT) (Sigma) according to the manufacturer's instructions.
Gas-Liquid Chromatography
[0122] Hydrolysis of leaf material and quantification of monosaccharides by gas-liquid chromatography of alditol acetates was carried out as previously described by Reiter el al., 1993. At least 5-20 mg of fresh tissue from 5 plant lines were pooled and extracted three times with chloroform:methanol (1:1) for 30 min. Three technical replicates were performed for each whs mutant. The tissue was washed with 70% ethanol at 70.degree. C. for 1 hour, rinsed with acetone and left to air dry overnight and hydrolyzed in 1M H.sub.2SO.sub.4 at 120.degree. C. for 1 hour. The released monosaccharides were converted into alditol acetates and quantified by gas chromatography. Relative sugar composition values were calculated as a mol percentage.
Clustering and Heatmap Analysis
[0123] Monosaccharide composition of 62 whs mutants (whs35 not determined) and mur11-1 was determined by liquid gas chromatography and calculated as a percent difference relative to wild type (FIG. 2C). Cluster 3.0 using the C Clustering Library version 1.49 was used to cluster the values by Average Linkage and centered correlation. Java TreeView 1.1.5r2 was then used to display the data and colour-coded yellow (more than wild type) or blue (less than wild type). Glucose values quantified from the acid hydrolysis and enzymatic assays performed on the 63 whs mutants, excluding the starch staining, were calculated as a percent difference relative to wild type. Mutants with values equal to wild type were given color coded black and mutants with hexose values greater than wild type were color coded yellow. For starch staining, 14 day-old seedlings were stained with IKI and were visually analyzed for the presence of starch in their cotyledons and determined qualitatively.
Amylase Digestion
[0124] Five milligrams of tissue was weighed out in triplicate and re-suspended in 0.1 M sodium acetate, pH 5, and incubated at 80.degree. C. for 30 min to gelatinize the starch. The tubes were cooled on ice then 30 .mu.L of 0.1.times. .alpha.-amylase (Sigma A7595, activity: 250 U/mL for 1.times.) from Bacillus amyloliquefaciens was added. In addition, 15 .mu.L of pullulanase M1 from Klebsiella planticola (Megazyme 42 U/mg) and 15 .mu.L of pullulanase M2 from Bacillus licheniformis (Megazyme 26 U/mg) were added to bring the total liquid volume to 1 mL. The samples were vortexed then placed in an incubator at 37.degree. C. for 16 hours. The samples were spun down at 12,000 g for 10 min and the reducing sugar equivalents were quantified using 0.2% anthrone. It should be noted that the HK Assay did not detect the products of the amylase digestion.
NPA Treatment of Monocot Plants
[0125] Polar auxin transport inhibition was carried out as described by Wu & McSteen, 2007. The two maize cultivars, Syngenta hybrid N39-Q1 and Tuxedo Sweet Corn, were grown in a greenhouse at 24.degree. C. with a 12 hour day/night cycle. The plants were grown four weeks before NPA treatment followed by a two week watering regime using 120 .mu.M NPA (ChemService, West Chester, Pa., USA) or DMSO alone (solvent) applied every two days in a volume of 150 mL for each pot. Plants were fertilized once a week with 20-20-20 fertilizer. After 2 weeks of treatment, whole plants were collected and de-stained in chloroform:methanol (1:1 v/v). Acid hydrolysis was performed as described previously.
Genetic and Physical Mapping of Mutants
[0126] Genetic mapping was accomplished using an F2 population derived from a cross between the whs mutants (Columbia genotype, Col-0) and Landsberg erecta (Ler). F2 seedlings were scored for wall hydrolysis sensitivity by anthrone screening. Genomic DNA was isolated from individual F2 plants from a mapping population showing the mutant phenotype and assigned to a chromosome using published simple sequence length polymorphism (SSLP) markers. New molecular markers were developed using the Monsanto Col-0 and Ler polymorphism database. The cloned WHS genes were amplified by PCR using X-Taq DNA polymerase with proofreading activity (Takara). Sequencing reactions were performed by The Centre for the Analysis of Genome Evolution and Function (CAGEF) at the University of Toronto. F2 mutants from two independent crosses were used for sequencing and verifying lesions.
[0127] The compositions, methods, mutant genes, cells, plants and other materials described in this application may be employed in the production of biomass useful, for example, in production of biofuels such as bioethanol, as well as other materials such as bioplastic, biofoam, biorubber, biocomposite, forestry biofibre, agricultural textile, chemical, monosaccharide, disaccharide, polysaccharide, biocosmetics, and in other feed stock production.
[0128] The scope of the claims should not be limited by the preferred embodiments set forth in the examples, but should be given the broadest interpretation consistent with the description as a whole.
REFERENCES
[0129] 1. Himmel, M. E. et al. Biomass Recalcitrance: Engineering plants and enzymes for biofuels production. Science 315, 804-807 (2007).
[0130] 2. Carroll A. & Somerville C. Cellulosic biofuels. Annu Rev Plant Biol. 60, 165-82 (2009).
[0131] 3. Pauly, M. & Keegstra, K. Plant cell wall polymers as precursors for biofuels. Curr. Opin. Plant Sci. 13, 305-312 (2010).
[0132] 4. Pingali, S. V. et al. Breakdown of cell wall nanostructure in dilute acid pretreated biomass. Biomacromolecules 11, 2329-2335 (2010).
[0133] 5. Kumar, P. et al. Methods for pretreatment of lignocellulosic biomass for efficient hydrolysis and biofuel production. Ind. Eng. Chem. Res. 48, 3713-3729 (2009).
[0134] 6. Vanholme, R., Van Acker, R. & Boerjan, W. Potential of Arabidopsis systems biology to advance the biofuel field. Trends in Biotech. 28, 543-547 (2010).
[0135] 7. Austin et al. Next-Generation Mapping of Arabidopsis Genes. Plant J. April 23. doi: 10.1111/j.1365-313X.2011.04619.x. [Epub ahead of print] (2011).
[0136] 8. Fry, S. C. The Growing Plant Cell Wall: Chemical and Metabolic Analysis. (The Blackburn Press, Caldwell, N.J., USA, 1988).
[0137] 9. Reiter, W.-D., Chapple, C. & Somerville, C. R. Mutants of Arabidopsis thaliana with altered cell wall polysaccharide composition. Plant J. 12, 335-45 (1997).
[0138] 10. Williams, M. E. et al. Mutations in the Arabidopsis phosphoinositide phosphatase gene SAC9 lead to overaccumulation of Ptdlns(4,5)P2 and constitutive expression of the stress response pathway. Plant Phys. 138, 686-800 (2005).
[0139] 11. Reiter, W.-D., Chapple, C. C. S. & Somerville, C. R. Altered growth and cell walls in a fucose-deficient mutant of Arabidopsis. Science 261, 1032-1035 (1993).
[0140] 12. Chia, T. et al. A cytosolic glucosyltransferase is required for conversion of starch to sucrose in Arabidopsis leaves at night. Plant J. 37, 853-863 (2004).
[0141] 13. Kotting, O. et al. STARCH-EXCESS4 is a laforin-like phophoglucan phophatase required for starch degradation in Arabidopsis thaliana. Plant Cell 21, 334-46 (2009).
[0142] 14. Caspar, T. et al. Mutants of Arabidopsis with altered regulation of starch degradation. Plant Phys. 95, 1181-1188 (1991).
[0143] 15. Wattebled, F. et al. Mutants of Arabidopsis lacking a chloroplastic isoamylase accumulate phytoglycogen and an abnormal for anylopectin. Plant Phys. 138, 184-195 (2005).
[0144] 16. Fulton, D. C. et al. b-AMYLASE4, a noncatalytic protein required for starch breakdown, acts upstream of three active b-amylases in Arabidopsis chloroplasts. Plant Cell 20, 1040-1058 (2008).
[0145] 17. Okada, K. et al. Requirement of the auxin polar transport system in early stages of Arabidopsis floral bud formation. Plant Cell 3, 677-684 (1991).
[0146] 18. Christensen, S. K., Dagenais, N., Chory, J. & Weigel, D. Regulation of auxin response by the protein kinase PINOID. Cell 100, 469-78 (2000).
[0147] 19. Przemeck, G. K. H. et al. Studies on the role of the Arabidopsis gene MONOPTEROS in vascular development and plant cell axialization. Planta 200, 229-237 (1996).
[0148] 20. Wu, X. & McSteen, P. The role of auxin transport during inflorescence development in maize (Zea mays, Poaceae). Am. J. Bot. 11, 1745-1755 (2007).
[0149] 21. Skirpan, A., Wu, X. & McSteen, P. Genetic and physical interactions suggest that BARREN STALKI is a target of BARREN INFLORESCENCE2 in maize inflorescence development. Plant J. 55, 787-797 (2008).
[0150] 22. Reinhardt, D., Madel, T. & Kuhlemeier, C. Auxin regulates the initiation and radial position of plant lateral organs. Plant Cell 12, 507-518 (2000).
[0151] 23. Reinhardt, D. et al. Regulation of phyllotaxis by polar auxin transport. Nature 426, 255-260 (2003).
[0152] 24. Fu, C. et al. Genetic manipulation of lignin reduces recalcitrance and improves ethanol production from switchgrass. Proc. Natl. Acad. Sci. USA 108, 3803-8 (2011).
[0153] 25. Chen F, Dixon R A. Lignin modification improves fermentable sugar yields for biofuel production. Nat Biotech. 25, 759-61 (2007).
[0154] 26. Li, L et al. Combinatorial modification of multiple lignin traits in trees through multigene cotransformation. Proc. Natl. Acad. Sci. 100, 4939-44 (2003).
[0155] 27. Somerville C. R. et al. Toward a Systems Approach to Understanding Plant Cell Walls. Science 306, 2206-2211 (2004).
[0156] 28. Sanchez-Rodriguez C., Rubio-Somoza I., Sibout R., & Persson S. Phytohormones and the cell wall in Arabidopsis during seedling growth. Trends Plant Sci. 15, 291-301 (2010).
[0157] 29. Feraru, E. et al. PIN polarity maintenance by the cell wall in Arabidopsis. Curr. Biol. 4, 33-43 (2011).
[0158] 30. McCourt P. & Desveaux D. Plant chemical genetics. New Phytol. 185, 15-26 (2010).
[0159] 31. Scheible W R, Eshed R, Richmond T, Delmer D, Somerville C. Modifications of cellulose synthase confer resistance to isoxaben and thiazolidinone herbicides in Arabidopsis Ixr1 mutants. Proc Natl Acad Sci USA. 98(18):10079-84 (2001).
[0160] 32. Harris, D., Stork, J. and Debolt, S. Genetic modification in cellulose-synthase reduces crystallinity and improves biochemical conversion to fermentable sugar. GCB Bioenergy, 1: 51-61 (2009).
Sequence CWU
1
1
3711651DNAZea mays 1tcgtcctcgc tcggagacac ggcaagcgag tcctcctcac tcacgcaaac
acacgccgtg 60ccgcgagcgc acgagacaac cgagcggagc tgccgcctgc ccgccagtgc
cagccatgga 120cgccgcggtg cgcgtccccc cggcgctcgg gaacaagacg gtgaccgagg
tgacgccgcc 180gccgccacca ccggcggggg aggagcggct gtcggacgcc gacacgacgg
cgtcgtcgac 240ggcggcgccc aactcgagcc tcagctcggc cagcagcgcc gccagcctgc
cgcgctgctc 300cagcctgtcc cgcctctcct tcgactgctc tccgtccgcg gccctgtcct
cttcctcggc 360ggcggcggcg gccgcggccg cgtcatcgcc ggcgccagcg ccggcgcggc
cgcaccgggc 420aggggacgcg gcgtgggcgg cgatccgcgc ggcgtcggcg tcggccgcgg
cgccgctggg 480gccgcgggac ttcaggctgc tgcgccgcgt gggcggcggc gacgtcggca
ccgtgtacct 540gtgccgcctc agggcgccac ccgcgcccgc gcccgtctgc tgcctgtacg
cgatgaaggt 600ggtggaccgg cgcgtggcgg ccgcgaagaa gaagctggag cacgcggcgg
cggagcggcg 660gatcctgcgg gcgctggacc atccgttcct gcccacgctc ttcgccgact
tcgacgccgc 720gccgcacttc tcctgcgtcg tcacggagtt ctgccccggc ggggacctcc
actcgctccg 780ccaccgcatg cccaaccgcc gcttcccgct cccgtcagct cggttctacg
cggcggaggt 840gttgctggcg ctggagtacc tgcacatgat gggcatcgtg taccgcgacc
tcaagccgga 900gaacgtgctg atccgcgcgg acggccacat catgctcacg gacttcgacc
tgtcgctgca 960gtgcacgtcg acgccgtcgc tcgagccgtg cgccgccccc gaggcggcgg
cggcgtcctg 1020cttcccggac cacctgttcc gccgccggcg cgcgcgactc cgccgtgccg
cctcggcgcg 1080gcggccgcca acgaccctgg tggcggagcc ggtggaggcg cggtcgtgct
cgttcgtggg 1140cacgcacgag tacgtggcgc ccgaggtggc ccgcggcggg ccccacggcg
cggccgtcga 1200ctggtgggcg ctcggcgtgt tcctgtacga gctcctgcac gggcgcaccc
cgttcgcggg 1260cgccgacaac gaggccacgc tccgcaacat cgcgcgccgc ccgctgtcct
tccccgctgc 1320cggcgccggt gatgccgacg cgcgcgacct catcgcccgc ctcctcgcca
aggacccgcg 1380ccaccggttg gggtcccggc gcggcgccgc cgacgtgaag gcgcacccgt
tcttccgcgg 1440gctcaacttc gcgctgctcc ggtcctcccg cccgcccgtc gtccccgccg
cgtcgcgctc 1500cccgctgcac cgctcgcagt cctgcagcgc ggcgcgcacg agagcgtcga
agccgaagcc 1560gccgccggac acccggttcg acctgttctg acacgaccgt tgccggcgtc
acgcacgtgc 1620gtgttgacct agttgcatca ctcgccattg t
16512491PRTZea mays 2Met Asp Ala Ala Val Arg Val Pro Pro Ala
Leu Gly Asn Lys Thr Val1 5 10
15Thr Glu Val Thr Pro Pro Pro Pro Pro Pro Ala Gly Glu Glu Arg Leu
20 25 30Ser Asp Ala Asp Thr Thr
Ala Ser Ser Thr Ala Ala Pro Asn Ser Ser 35 40
45Leu Ser Ser Ala Ser Ser Ala Ala Ser Leu Pro Arg Cys Ser
Ser Leu 50 55 60Ser Arg Leu Ser Phe
Asp Cys Ser Pro Ser Ala Ala Leu Ser Ser Ser65 70
75 80Ser Ala Ala Ala Ala Ala Ala Ala Ala Ser
Ser Pro Ala Pro Ala Pro 85 90
95Ala Arg Pro His Arg Ala Gly Asp Ala Ala Trp Ala Ala Ile Arg Ala
100 105 110Ala Ser Ala Ser Ala
Ala Ala Pro Leu Gly Pro Arg Asp Phe Arg Leu 115
120 125Leu Arg Arg Val Gly Gly Gly Asp Val Gly Thr Val
Tyr Leu Cys Arg 130 135 140Leu Arg Ala
Pro Pro Ala Pro Ala Pro Val Cys Cys Leu Tyr Ala Met145
150 155 160Lys Val Val Asp Arg Arg Val
Ala Ala Ala Lys Lys Lys Leu Glu His 165
170 175Ala Ala Ala Glu Arg Arg Ile Leu Arg Ala Leu Asp
His Pro Phe Leu 180 185 190Pro
Thr Leu Phe Ala Asp Phe Asp Ala Ala Pro His Phe Ser Cys Val 195
200 205Val Thr Glu Phe Cys Pro Gly Gly Asp
Leu His Ser Leu Arg His Arg 210 215
220Met Pro Asn Arg Arg Phe Pro Leu Pro Ser Ala Arg Phe Tyr Ala Ala225
230 235 240Glu Val Leu Leu
Ala Leu Glu Tyr Leu His Met Met Gly Ile Val Tyr 245
250 255Arg Asp Leu Lys Pro Glu Asn Val Leu Ile
Arg Ala Asp Gly His Ile 260 265
270Met Leu Thr Asp Phe Asp Leu Ser Leu Gln Cys Thr Ser Thr Pro Ser
275 280 285Leu Glu Pro Cys Ala Ala Pro
Glu Ala Ala Ala Ala Ser Cys Phe Pro 290 295
300Asp His Leu Phe Arg Arg Arg Arg Ala Arg Leu Arg Arg Ala Ala
Ser305 310 315 320Ala Arg
Arg Pro Pro Thr Thr Leu Val Ala Glu Pro Val Glu Ala Arg
325 330 335Ser Cys Ser Phe Val Gly Thr
His Glu Tyr Val Ala Pro Glu Val Ala 340 345
350Arg Gly Gly Pro His Gly Ala Ala Val Asp Trp Trp Ala Leu
Gly Val 355 360 365Phe Leu Tyr Glu
Leu Leu His Gly Arg Thr Pro Phe Ala Gly Ala Asp 370
375 380Asn Glu Ala Thr Leu Arg Asn Ile Ala Arg Arg Pro
Leu Ser Phe Pro385 390 395
400Ala Ala Gly Ala Gly Asp Ala Asp Ala Arg Asp Leu Ile Ala Arg Leu
405 410 415Leu Ala Lys Asp Pro
Arg His Arg Leu Gly Ser Arg Arg Gly Ala Ala 420
425 430Asp Val Lys Ala His Pro Phe Phe Arg Gly Leu Asn
Phe Ala Leu Leu 435 440 445Arg Ser
Ser Arg Pro Pro Val Val Pro Ala Ala Ser Arg Ser Pro Leu 450
455 460His Arg Ser Gln Ser Cys Ser Ala Ala Arg Thr
Arg Ala Ser Lys Pro465 470 475
480Lys Pro Pro Pro Asp Thr Arg Phe Asp Leu Phe 485
4903958DNAZea mays 3caaagccaac agaactgcac agtgtagtag
ttgcacatag gcgtccgcgc gtcgtcctag 60ctatggatcc atatcactac caaaccatgt
atgacccacg cggcttcccc atcatccacc 120cgcagcctta cctccagcac ccggtggccg
gcgccctcgg tgacagcagg gtgcgcggcg 180gcggcagtgg cgcgcggcgg cgtcctggcg
ccaagctctc cacggacccg cagagcgttg 240cggcgcgcga gcggcggcac cgcatcagcg
accgcttccg cgtgctccgc agcctcgtcc 300ccggcggcag caagatggac actgtgtcca
tgctcgagca ggccatccac tacgtcaagt 360tcctcaagac gcagatcagc ctgcatcagg
ccgcgctgat gcagcacgag gaaggatgcc 420atgctgagct cgccgcctat tccgcggtgg
cggtggttgg tgacaacgag gtgacactcg 480cgtcccatgg tcgtaccggc gcatgcgacg
agatgatgca gctccaggtg gcggcggagg 540aagctttgag ttatggtgtt gatgcccatc
agccgtacgg gctcgatccc aggcagctga 600gtggtgggca cgagctgcca ccgctgcctg
cttcttgcat cttcctcgag gagcctgcag 660acgcatgcta ctctgtgtgt gacctcgacg
acggggacac cggtctgccc ggctcttact 720agagtagtag tagaagtttc ttaaggtagc
atcccgtgtg tgttggtgtc tgctagacgc 780tagtacgtct aattagcaaa gtttagctag
tactcgatca attgtctgtc tagttcgctc 840agagttaaag tatatgatga tgcatctgca
tatatgggct ctgtaattct gttatccgct 900gatcgcagat gatacaccgt atgtaatcac
atgtatgtat gttgcctaaa aaaaaaaa 9584219PRTZea mays 4Met Asp Pro Tyr
His Tyr Gln Thr Met Tyr Asp Pro Arg Gly Phe Pro1 5
10 15Ile Ile His Pro Gln Pro Tyr Leu Gln His
Pro Val Ala Gly Ala Leu 20 25
30Gly Asp Ser Arg Val Arg Gly Gly Gly Ser Gly Ala Arg Arg Arg Pro
35 40 45Gly Ala Lys Leu Ser Thr Asp Pro
Gln Ser Val Ala Ala Arg Glu Arg 50 55
60Arg His Arg Ile Ser Asp Arg Phe Arg Val Leu Arg Ser Leu Val Pro65
70 75 80Gly Gly Ser Lys Met
Asp Thr Val Ser Met Leu Glu Gln Ala Ile His 85
90 95Tyr Val Lys Phe Leu Lys Thr Gln Ile Ser Leu
His Gln Ala Ala Leu 100 105
110Met Gln His Glu Glu Gly Cys His Ala Glu Leu Ala Ala Tyr Ser Ala
115 120 125Val Ala Val Val Gly Asp Asn
Glu Val Thr Leu Ala Ser His Gly Arg 130 135
140Thr Gly Ala Cys Asp Glu Met Met Gln Leu Gln Val Ala Ala Glu
Glu145 150 155 160Ala Leu
Ser Tyr Gly Val Asp Ala His Gln Pro Tyr Gly Leu Asp Pro
165 170 175Arg Gln Leu Ser Gly Gly His
Glu Leu Pro Pro Leu Pro Ala Ser Cys 180 185
190Ile Phe Leu Glu Glu Pro Ala Asp Ala Cys Tyr Ser Val Cys
Asp Leu 195 200 205Asp Asp Gly Asp
Thr Gly Leu Pro Gly Ser Tyr 210 21551646PRTArabidopsis
thaliana 5Met Asp Leu His Pro Pro Gly Gly Ser Lys Lys Thr Ser Val Val
Val1 5 10 15Val Thr Leu
Asp Thr Gly Glu Val Tyr Val Ile Ala Ser Leu Leu Ser 20
25 30Lys Ala Asp Thr Gln Val Ile Tyr Ile Asp
Pro Thr Thr Gly Ile Leu 35 40
45Arg Tyr Asn Gly Lys Pro Gly Leu Asp Asn Phe Lys Ser Glu Arg Glu 50
55 60Ala Leu Asp Tyr Ile Thr Asn Gly Ser
Arg Gly Gly Val Arg Ser Ser65 70 75
80Val Tyr Ala Arg Ala Ile Leu Gly Tyr Ala Val Leu Gly Ser
Phe Gly 85 90 95Met Leu
Leu Val Ala Thr Arg Leu Asn Pro Ser Ile Pro Asp Leu Pro 100
105 110Gly Gly Gly Cys Val Tyr Thr Val Ala
Glu Ser Gln Trp Val Lys Ile 115 120
125Pro Leu Tyr Asn Pro Gln Pro Gln Gly Lys Gly Glu Thr Lys Asn Ile
130 135 140Gln Glu Leu Thr Glu Leu Asp
Ile Asp Gly Lys His Tyr Phe Cys Asp145 150
155 160Thr Arg Asp Ile Thr Arg Pro Phe Pro Ser Arg Met
Pro Leu Gln Ser 165 170
175Pro Asp Asp Glu Phe Val Trp Asn Arg Trp Leu Ser Val Pro Phe Lys
180 185 190Asn Ile Gly Leu Pro Glu
His Cys Val Ile Leu Leu Gln Gly Phe Ala 195 200
205Glu Tyr Arg Pro Phe Gly Ser Ser Gly Gln Leu Glu Gly Ile
Val Ala 210 215 220Leu Met Ala Arg Arg
Ser Arg Leu His Pro Gly Thr Arg Tyr Leu Ala225 230
235 240Arg Gly Ile Asn Ser Cys Ser Gly Thr Gly
Asn Glu Val Glu Cys Glu 245 250
255Gln Leu Val Trp Ile Pro Lys Arg Asn Gly Gln Ser Ile Ala Phe Asn
260 265 270Ser Tyr Ile Trp Arg
His Gly Thr Ile Pro Ile Trp Trp Gly Ala Glu 275
280 285Leu Lys Met Thr Ala Ala Glu Ala Glu Ile Tyr Val
Ala Asp Arg Asp 290 295 300Pro Tyr Lys
Gly Ser Thr Glu Tyr Tyr Gln Arg Leu Ser Lys Arg Tyr305
310 315 320Asp Thr Arg Asn Leu Asp Ala
Pro Val Gly Glu Asn Gln Lys Lys Lys 325
330 335Ala Phe Val Pro Ile Val Cys Val Asn Leu Leu Arg
Ser Gly Glu Gly 340 345 350Lys
Ser Glu Cys Ile Leu Val Gln His Phe Glu Glu Ser Met Asn Phe 355
360 365Ile Lys Ser Ser Gly Lys Leu Pro Tyr
Thr Arg Val His Leu Ile Asn 370 375
380Tyr Asp Trp His Ala Ser Val Lys Leu Lys Gly Glu Gln Gln Thr Ile385
390 395 400Glu Gly Leu Trp
Met Tyr Leu Lys Ser Pro Thr Met Ala Ile Gly Ile 405
410 415Ser Glu Gly Asp Tyr Leu Pro Ser Arg Gln
Arg Leu Lys Asp Cys Arg 420 425
430Gly Glu Val Ile Cys Ile Asp Asp Ile Glu Gly Ala Phe Cys Leu Arg
435 440 445Ser His Gln Asn Gly Val Ile
Arg Phe Asn Cys Ala Asp Ser Leu Asp 450 455
460Arg Thr Asn Ala Ala Ser Phe Phe Gly Gly Leu Gln Val Phe Val
Glu465 470 475 480Gln Cys
Arg Arg Leu Gly Ile Ser Leu Asp Thr Asp Leu Gly Tyr Gly
485 490 495His Asn Ser Val Asn Asn Gln
Gly Gly Tyr Asn Ala Pro Leu Pro Pro 500 505
510Gly Trp Glu Lys Arg Ala Asp Ala Val Thr Gly Lys Ser Tyr
Tyr Ile 515 520 525Asp His Asn Thr
Lys Thr Thr Thr Trp Ser His Pro Cys Pro Asp Lys 530
535 540Pro Trp Lys Arg Leu Asp Met Arg Phe Glu Glu Phe
Lys Arg Ser Thr545 550 555
560Ile Leu Ser Pro Val Ser Glu Leu Ala Asp Leu Phe Leu Gln Gln Gly
565 570 575Asp Ile His Ala Thr
Leu Tyr Thr Gly Ser Lys Ala Met His Ser Gln 580
585 590Ile Leu Asn Ile Phe Ser Glu Glu Ser Gly Ala Phe
Lys Gln Phe Ser 595 600 605Ala Ala
Gln Lys Asn Met Lys Ile Thr Leu Gln Arg Arg Tyr Lys Asn 610
615 620Ala Met Val Asp Ser Ser Arg Gln Lys Gln Leu
Glu Met Phe Leu Gly625 630 635
640Met Arg Leu Phe Lys His Leu Pro Ser Ile Pro Val Gln Pro Leu His
645 650 655Val Leu Ser Arg
Pro Ser Gly Phe Phe Leu Lys Pro Val Pro Asn Met 660
665 670Ser Glu Ser Ser Asn Asp Gly Ser Ser Leu Leu
Ser Ile Lys Arg Lys 675 680 685Asp
Ile Thr Trp Leu Cys Pro Gln Ala Ala Asp Ile Val Glu Leu Phe 690
695 700Ile Tyr Leu Ser Glu Pro Cys His Val Cys
Gln Leu Leu Leu Thr Ile705 710 715
720Ser His Gly Ala Asp Asp Leu Thr Cys Pro Ser Thr Val Asp Val
Arg 725 730 735Thr Gly Arg
His Ile Glu Asp Leu Lys Leu Val Val Glu Leu Val Gln 740
745 750Leu Asp Tyr Arg Leu Pro Val Ile Met Phe
Ser Gly Gln Gly Ala Ser 755 760
765Ile Pro Arg Cys Ala Asn Gly Thr Asn Leu Leu Val Pro Leu Pro Gly 770
775 780Pro Ile Ser Ser Glu Asp Met Ala
Val Thr Gly Ala Gly Ala Arg Leu785 790
795 800His Glu Lys Asp Thr Ser Ser Leu Ser Leu Leu Tyr
Asp Phe Glu Glu 805 810
815Leu Glu Gly Gln Leu Asp Phe Leu Thr Arg Val Val Ala Val Thr Phe
820 825 830Tyr Pro Ala Gly Ala Val
Arg Ile Pro Met Thr Leu Gly Gln Ile Glu 835 840
845Val Leu Gly Ile Ser Leu Pro Trp Lys Gly Met Phe Thr Cys
Glu Arg 850 855 860Thr Gly Gly Arg Leu
Ala Glu Leu Ala Arg Lys Pro Asp Glu Asp Gly865 870
875 880Ser Pro Phe Ser Ser Cys Ser Asp Leu Asn
Pro Phe Ala Ala Thr Thr 885 890
895Ser Leu Gln Ala Glu Thr Val Ser Thr Pro Val Gln Gln Lys Asp Pro
900 905 910Phe Pro Ser Asn Leu
Leu Asp Leu Leu Thr Gly Glu Asp Ser Ser Ser 915
920 925Asp Pro Phe Pro Gln Pro Val Val Glu Cys Ile Ala
Ser Gly Gly Asn 930 935 940Asp Met Leu
Asp Phe Leu Asp Glu Ala Val Val Glu Tyr Arg Gly Ser945
950 955 960Asp Thr Val Pro Asp Gly Ser
Val Pro Gln Asn Lys Arg Pro Lys Asp 965
970 975Ser Gly Ala His Leu Tyr Leu Asn Cys Leu Lys Ser
Leu Ala Gly Pro 980 985 990Asn
Met Ala Lys Lys Leu Glu Phe Val Glu Ala Met Lys Leu Glu Ile 995
1000 1005Glu Arg Leu Arg Leu Asn Ile Ser
Ala Ala Glu Arg Asp Arg Ala 1010 1015
1020Leu Leu Ser Ile Gly Ile Asp Pro Ala Thr Ile Asn Pro Asn Ser
1025 1030 1035Ser Tyr Asp Glu Leu Tyr
Ile Gly Arg Leu Cys Lys Ile Ala Asn 1040 1045
1050Ala Leu Ala Val Met Gly Gln Ala Ser Leu Glu Asp Lys Ile
Ile 1055 1060 1065Ala Ser Ile Gly Leu
Glu Lys Leu Glu Asn Asn Val Ile Asp Phe 1070 1075
1080Trp Asn Ile Thr Arg Ile Gly Glu Gly Cys Asp Gly Gly
Met Cys 1085 1090 1095Gln Val Arg Ala
Glu Val Asn Lys Ser Pro Val Gly Ser Ser Thr 1100
1105 1110Lys Ser Ser Arg Gly Glu Ser Gly Ser Val Phe
Leu Cys Phe Gln 1115 1120 1125Cys Met
Lys Lys Ala Cys Lys Phe Cys Cys Ala Gly Lys Gly Ala 1130
1135 1140Leu Leu Leu Ser Lys Ser Tyr Ser Arg Asp
Thr Ala Asn Gly Gly 1145 1150 1155Gly
Ser Leu Ala Asp Val Ser Ala Thr Ser Ile Gly Ser Asp His 1160
1165 1170Tyr Ile Cys Lys Lys Cys Cys Ser Ser
Ile Val Leu Glu Ala Leu 1175 1180
1185Ile Val Asp Tyr Val Arg Val Met Val Ser Leu Arg Arg Ser Gly
1190 1195 1200Arg Val Asp Asn Ala Gly
Arg Glu Ala Leu Asn Glu Val Phe Gly 1205 1210
1215Ser Asn Ile Thr Asn His Leu Ala Val Arg Gly Gln Pro Ser
Pro 1220 1225 1230Asn Arg Glu Asp Phe
Asn Phe Leu Arg Gln Ile Leu Gly Lys Glu 1235 1240
1245Glu Ser Leu Ser Glu Phe Pro Phe Ala Ser Phe Leu His
Lys Val 1250 1255 1260Glu Thr Ala Thr
Asp Ser Ala Pro Phe Phe Ser Leu Leu Thr Pro 1265
1270 1275Leu Asn Leu Ala Ser Ser Asn Ala Tyr Trp Lys
Ala Pro Pro Ser 1280 1285 1290Ala Asp
Ser Val Glu Ala Ala Ile Val Leu Asn Thr Leu Ser Asp 1295
1300 1305Val Ser Ser Val Ile Leu Leu Val Ser Pro
Cys Gly Tyr Ser Asp 1310 1315 1320Ala
Asp Ala Pro Thr Val Gln Ile Trp Ala Ser Ser Asp Ile Asn 1325
1330 1335Lys Glu Ala Arg Thr Leu Met Gly Lys
Trp Asp Val Gln Ser Phe 1340 1345
1350Ile Arg Ser Ser Pro Glu Leu Ser Gly Ser Glu Lys Ser Gly Arg
1355 1360 1365Ala Pro Arg His Ile Lys
Phe Ala Phe Lys Asn Pro Val Arg Cys 1370 1375
1380Arg Ile Ile Trp Ile Thr Leu Arg Leu Pro Arg Leu Gly Ser
Ser 1385 1390 1395Ser Ser Val Ser Leu
Asp Lys Asn Ile Asn Leu Leu Ser Leu Asp 1400 1405
1410Glu Asn Pro Phe Ala Pro Ile Pro Arg Arg Ala Ser Phe
Gly Ala 1415 1420 1425Thr Ile Glu Asn
Asp Pro Cys Ile His Ala Lys His Ile Leu Val 1430
1435 1440Thr Gly Asn Thr Val Arg Asp Lys Thr Leu Gln
Ser Val Glu Ser 1445 1450 1455Met Ser
Val Arg Asn Trp Leu Asp Arg Ala Pro Arg Leu Asn Arg 1460
1465 1470Phe Leu Ile Pro Leu Glu Thr Glu Arg Pro
Met Glu Asn Asp Leu 1475 1480 1485Val
Leu Glu Leu Tyr Leu Gln Pro Ala Ser Pro Leu Ala Ala Gly 1490
1495 1500Phe Arg Leu Asp Ala Phe Ser Ala Ile
Lys Pro Arg Val Thr His 1505 1510
1515Ser Pro Ser Ser Asp Val Val Asp Ile Trp Asp Pro Thr Ser Val
1520 1525 1530Ile Met Glu Asp Arg His
Val Ser Pro Ala Ile Leu Tyr Ile Gln 1535 1540
1545Val Ser Val Leu Gln Glu Gln Tyr Lys Met Val Thr Ile Ala
Glu 1550 1555 1560Tyr Arg Leu Pro Glu
Ala Arg Asp Gly Thr Lys Leu Tyr Phe Asp 1565 1570
1575Phe Pro Lys Gln Ile Gln Ala Gln Arg Val Ser Phe Lys
Leu Leu 1580 1585 1590Gly Asp Val Ala
Ala Phe Thr Asp Glu Pro Ala Glu Ala Val Asp 1595
1600 1605Leu Ser Ser Arg Ala Ser Pro Phe Ala Ala Gly
Leu Ser Leu Ala 1610 1615 1620Asn Arg
Ile Lys Leu Tyr Tyr Tyr Ala Asp Pro Tyr Glu Val Gly 1625
1630 1635Lys Trp Thr Ser Leu Ser Ser Val 1640
164566394DNAArabidopsis thaliana 6atggatctgc atccaccagg
ttagtttctt tttgaagatt caagtgtttt tttttttcac 60tctagctgtt ttcatcattt
ggttggctaa ttttcttatc ttcttggtgt aggtggttca 120aaaaagacat ctgtagttgt
tgtcacctta gacactggtg aagtctatgt cattgcaagt 180ttgttatcta aggctgatac
tcaagttatc tatatcgatc ctacgactgg tatcctgcgg 240tacaatggga agcctggcct
tgataatttt aagtcagagc gtgaagcctt agattatatt 300acgaatggat caagaggagg
tgttagaagc tctgtttatg ccagggcaat actcggttat 360gctgtcttgg ggagctttgg
gatgctttta gttgcgacca ggctaaatcc aagtattcca 420gatttgcctg gtggtggatg
tgtatataca gtggctgaga gtcaatgggt caaaatacca 480ctctacaatc cacagcctca
agggaaaggt gaaaccaaga atattcagga gttgactgag 540cttgacatcg acgggaagca
ctatttctgt gatactagag acatcactcg gcccttccca 600agccgtatgc cacttcaaag
ccctgatgat gaatttgttt ggaatagatg gttgtccgtg 660ccttttaaga atattgggct
acctgaacac tgtgtcattc ttctgcaggt tcgtcctcca 720ttaatgaatt gatgaaggtt
gatctaccta tccgatttcg gttttgtgtg aagattcaaa 780agcatataaa ttgatatgac
atcatcttct tatttgatgt taattatagg ggtttgcaga 840atatcgacct tttgggagct
caggccagct agaagggatt gttgctctaa tggcccgtcg 900tagcagactg catccaggga
ctcgttacct agctaggggc attaattcat gttctggcac 960aggtgacatg cccccctaaa
atgctcctct ctcttaattt ctttgttgtt ctcttaaaaa 1020gtgtggctct ggattgacta
gggcctaatt tagaagtata tgtgtagtgc aggtaacgaa 1080gttgagtgtg agcagcttgt
atggatacct aaaagaaatg gtcaaagcat tgctttcaac 1140tcgtacattt ggcgacatgg
caccatacca atatggtggg gtgcagaatt aaagatgact 1200gcggcagaag cagaaattta
tgtggcagat agggatcctt ataaaggcag tacagagtat 1260taccaaaggt taagcaagcg
atatgatact aggaatctag atgcacctgt tggagaaaac 1320cagaagaaaa aggcttttgt
tcctattgtg tgcgttaatt tactaagaag tggagaaggg 1380aaatcagaat gtatcttagt
acaacatttt gaagaatcga tgaactttat caaatccagt 1440ggaaagcttc cttatactcg
tgttcacctg ataaattatg attggcatgc cagcgtgaaa 1500ctaaaagggg aacagcaaac
tattgaagga ttgtggatgt atctaaaatc tcccactatg 1560gcaataggaa tttctgaagg
tgactatttg ccttcacgtc aaagactgaa agattgcaga 1620ggtgaggtaa tctgtattga
tgacattgaa ggtgccttct gtttgagatc acatcaaaat 1680ggggtgatac gttttaactg
cgctgattcc ttggatcgaa caaatgcggc tagtttcttt 1740ggtggtcttc aagtgtttgt
agagcaatgt agaaggctgg gaatatcact tgatactgat 1800cttggatatg gtcataattc
tgttaataat caggggggat ataacgctcc ccttccaccg 1860ggatgggaaa aaagagctga
tgccgtaact ggaaaatcat attatataga tcacaataca 1920aagacaacaa catggagtca
tccatgtcct gataaaccat ggaagagact tgacatgagg 1980tttgaggaat ttaagagatc
aactatctta tctcctgtgt cagaacttgc cgatcttttt 2040ctgcaacaag gtgatatcca
tgcaaccctc tatactggct cgaaagctat gcacagccaa 2100attctcaaca tcttcagtga
agaatcagga gcatttaaac agttttctgc agcacagaaa 2160aacatgaaga ttacactaca
gagaagatat aaaaatgcta tggttgatag ttcacggcaa 2220aaacagctcg agatgtttct
gggaatgagg cttttcaagc atcttccatc aattcctgtc 2280cagcctttac atgtaagcac
attgacacga ttccagtaaa aaattcccag ctctcagctc 2340tccttttttc catgtatatt
aactgctcta aagtatgaat cacgtttttt tgcgtgtatg 2400tagattttgt ttcatattgg
accgatacac tttgtttatt gtgtgtagat atataattcc 2460taagagcata acagattcgt
tgatttgtag actctactat ttgttgcttt ctatattctc 2520tgtctacttc tcactgcaaa
aatattcatg tcaggtactt tctcgaccat ctggtttctt 2580tctgaaacct gtacctaaca
tgtccgaaag ttccaatgat gggtccagtc tgctgagtat 2640caagaggaag gacataactt
gggtacctca acttagatat aagattacct cttagttctc 2700attacttgaa tacttgagct
aaaaaaattc catgcttttt gttacttttg cagctatgtc 2760cacaagctgc agatattgtt
gaattattta tctatctcag tgagccttgc catgtatgtc 2820aacttctact gaccatatca
cacggtgcgg atgatttgac atgtccatcc actgtggacg 2880tgagaactgg acgccacata
gaggacctta aattagttgt tgaggttgac tctcttttag 2940gtcttgtccc ttctgtttta
ttgtttgtag gagatctgta gacttacatg caagttagtt 3000caactggatt accgattacc
tgtaattatg ttttctggac agggtgcttc aataccacgc 3060tgtgcaaatg gtacaaatct
tctggtaccc ttaccagggc caattagttc tgaggatatg 3120gctgttactg gagctggtgc
acgtcttcat gaaaaagata cgtcaagtct ttcactgcta 3180tatgattttg aagaactaga
aggacagttg gatttcttaa cccgtgtagt tgctgttaca 3240ttttatccag ctggtgctgt
tagaattcct atgactcttg gtcaggtact tactagtttc 3300caaactatga attgatgaat
atctattagc ttcatcatgc tctgaactcg ttaaagttta 3360tgattagcta tcatcaaaag
aagaaaaaac aaacactttt tctgcattgt gtctatgccg 3420cagatagaag tccttggaat
ttctcttcca tggaaaggaa tgtttacttg tgaacgtact 3480ggaggaagat tagctgaact
tgcaaggaaa ccagatgaag atggaagtcc tttttcatct 3540tgttctgact tgaatccgtt
tgctgcaaca acatctttac aggctgaaac tgtttccaca 3600ccagtacaac agaaggatcc
ctttcccagt aatctgcttg accttttgac aggagaggac 3660tcttcttctg accccttccc
acaaccagtg gtggaatgta ttgcaagtgg aggcaatgac 3720atgcttgatt tcttagacga
agcagttgtt gaatatcgcg gctctgacac tgttcctgac 3780gggtctgtcc cacaaaataa
aaggcccaag gacagtggtg ctcatctgta cttaaattgc 3840ctaaagtccc ttgcgggtcc
aaacatggtg agatgcaatt acgtctttcc agttgccaca 3900aactgtagtt ctgattgtat
gaatctttca ttggtttaac tattcttctc atatgttatc 3960tttcttcgta aaattccatt
ttctatggta aatttaattt ctgcatgtct ggcataatac 4020aggcaaagaa gcttgagttt
gtagaagcta tgaagcttga aattgaacgt ctacgtctca 4080atatttctgc agcagaaaga
gatagggcac tgttatcgat tggaattgat ccagctacca 4140ttaatccaaa ctcttcatac
gacgagttat atattggaag attatgcaaa atagcaaatg 4200cacttgcagt tatgggccag
gcttctcttg aagataaaat tatagcttct attggtctag 4260agaagctgga aaataatgtg
atagatttct ggaacataac cagaattggt gagggttgtg 4320atggcggaat gtgtcaagtc
cgagccgagg tcaataaaag tccagttgga tcttctacca 4380agagttcaag aggagagtcg
ggctcagtgt tcttgtgctt ccaatgtatg aaaaaagctt 4440gcaagttttg ttgtgctgga
aaaggagctc ttctgctttc aaaatcctac tccagggaca 4500ctgcgaatgg aggtggaagt
cttgcagatg tctctgctac ttcgataggt tcagatcatt 4560acatttgtaa aaaatgctgc
agctcgatag tgcttgaagc cctgattgta gattatgtaa 4620gggtcatggt cagcttgcga
agaagtggcc gtgttgataa tgctggtcgg gaagctttga 4680atgaggtatt tggatctaac
attacaaatc accttgctgt tagaggtcaa ccttctccta 4740atcgagaaga cttcaatttc
cttcgtcaaa ttttgggtaa agaggagtcg ctttctgagt 4800tcccatttgc aagcttctta
cataaggtaa tatgcttctt atgtgtttta aaattactat 4860gatcacttca ttgtctttgg
aaagagctgt tatgtaatac ctaatcctcc tttctctctt 4920ttggatgttt gcataatgca
atttaggtcg aaactgcgac tgattcagca ccatttttct 4980cattgctcac ccctctgaat
cttgcttcaa gtaatgccta ctggaaagct cctccgtctg 5040cagactctgt tgaagccgcc
attgttctca acaccctttc agatgtcagc agtgtgattc 5100tactcgttag tccatgtggt
tactctgatg ctgatgctcc taccgtaagt ttgacttttc 5160tatcttcagt tgaattcttg
ttaacccatg ccattactta cgtgataatg ctgccactca 5220tttcaacatt ttatgtatct
tcattgcagg tccaaatttg ggcgagcagc gacataaaca 5280aggaagcacg gactttgatg
ggaaagtggg atgtacagtc ctttattaga tcttcgcctg 5340agctttctgg ttcagaaaag
tctggtagag cacctaggca tataaaattt gctttcaaga 5400atcctgtccg ttgccgcatt
atatggataa cactgcgtct tcctaggctt ggatctagta 5460gctcagttag tttggacaaa
aacatcaatc tcttatcttt ggatgagaac ccatttgctc 5520caattcctcg acgtgcctct
tttggagcaa ccattgagaa tgatccatgt attcatgcaa 5580aacacatctt ggtcactgga
aacaccgtga gggataaaac gctacaaagt gttgagagca 5640tgagcgtaag aaactggctg
gacagagccc cacgtttgaa tagattcctg gttagtgcct 5700tagagaactg ctcgttcctt
ttcacctttt tctgtggtat ttcgtttatt gtcactaata 5760tttgtttttt tcaccttcca
gataccatta gagactgaga gaccaatgga gaatgatcta 5820gtcttggaac tttatctgca
acctgcttca cctttagctg ctggattccg tttagatgct 5880tttagtgcga taaagcctcg
tgtaacccac tcgccttctt cagatgtagt tgacatttgg 5940gacccgacga gtgtcataat
ggaagataga cacgtctctc cggccatctt gtatatacaa 6000gtatctgttc tacaggtatc
tatctctcct cctcccggtt tattatatat cctcagaaac 6060caaaatgttg taataacttt
ttcatgttga tctgaaaaat gttaatctac aggagcaata 6120caaaatggtg acaatcgcgg
aatacagatt gcctgaggcg agagatggaa caaagttgta 6180ttttgacttc cctaaacaga
tacaagcaca gagagtatcg tttaaactgc taggagatgt 6240agcagctttt acagatgagc
ccgcagaggc tgttgatttg agcagccggg cttctccttt 6300tgctgcagga ctgtctttag
caaacaggat caagctatat tactatgcgg atccttacga 6360agtaggcaaa tggactagcc
tttcaagtgt ctga 63947438PRTArabidopsis
thaliana 7Met Leu Arg Glu Ser Asp Gly Glu Met Ser Leu Gly Thr Thr Asn
Ser1 5 10 15Pro Ile Ser
Ser Gly Thr Glu Ser Cys Ser Ser Phe Ser Arg Leu Ser 20
25 30Phe Asp Ala Pro Pro Ser Thr Ile Pro Glu
Glu Glu Ser Phe Leu Ser 35 40
45Leu Lys Pro His Arg Ser Ser Asp Phe Ala Tyr Ala Glu Ile Arg Arg 50
55 60Arg Lys Lys Gln Gly Leu Thr Phe Arg
Asp Phe Arg Leu Met Arg Arg65 70 75
80Ile Gly Ala Gly Asp Ile Gly Thr Val Tyr Leu Cys Arg Leu
Ala Gly 85 90 95Asp Glu
Glu Glu Ser Arg Ser Ser Tyr Phe Ala Met Lys Val Val Asp 100
105 110Lys Glu Ala Leu Ala Leu Lys Lys Lys
Met His Arg Ala Glu Met Glu 115 120
125Lys Thr Ile Leu Lys Met Leu Asp His Pro Phe Leu Pro Thr Leu Tyr
130 135 140Ala Glu Phe Glu Ala Ser His
Phe Ser Cys Ile Val Met Glu Tyr Cys145 150
155 160Ser Gly Gly Asp Leu His Ser Leu Arg His Arg Gln
Pro His Arg Arg 165 170
175Phe Ser Leu Ser Ser Ala Arg Phe Tyr Ala Ala Glu Val Leu Val Ala
180 185 190Leu Glu Tyr Leu His Met
Leu Gly Ile Ile Tyr Arg Asp Leu Lys Pro 195 200
205Glu Asn Ile Leu Val Arg Ser Asp Gly His Ile Met Leu Ser
Asn Phe 210 215 220Asp Leu Ser Leu Cys
Ser Asp Ser Ile Ala Ala Val Glu Ser Ser Ser225 230
235 240Ser Ser Pro Glu Asn Gln Gln Leu Arg Ser
Pro Arg Arg Phe Thr Arg 245 250
255Leu Ala Arg Leu Phe Gln Arg Val Leu Arg Ser Lys Lys Val Gln Thr
260 265 270Leu Glu Pro Thr Arg
Leu Phe Val Ala Glu Pro Val Thr Ala Arg Ser 275
280 285Gly Ser Phe Val Gly Thr His Glu Tyr Val Ala Pro
Glu Val Ala Ser 290 295 300Gly Gly Ser
His Gly Asn Ala Val Asp Trp Trp Ala Phe Gly Val Phe305
310 315 320Leu Tyr Glu Met Ile Tyr Gly
Lys Thr Pro Phe Val Ala Pro Thr Asn 325
330 335Asp Val Ile Leu Arg Asn Ile Val Lys Arg Gln Leu
Ser Phe Pro Thr 340 345 350Asp
Ser Pro Ala Thr Met Phe Glu Leu His Ala Arg Asn Leu Ile Ser 355
360 365Gly Leu Leu Asn Lys Asp Pro Thr Lys
Arg Leu Gly Ser Arg Arg Gly 370 375
380Ala Ala Glu Val Lys Val His Pro Phe Phe Lys Gly Leu Asn Phe Ala385
390 395 400Leu Ile Arg Thr
Leu Thr Pro Pro Glu Ile Pro Ser Ser Val Val Lys 405
410 415Lys Pro Met Lys Ser Ala Thr Phe Ser Gly
Arg Ser Ser Asn Lys Pro 420 425
430Ala Ala Phe Asp Tyr Phe 43581624DNAArabidopsis thaliana
8atgttacgag aatcagacgg tgagatgagt ttaggaacaa caaactcacc gataagcagc
60ggaacagaga gttgcagcag tttcagccgg ttatcattcg acgcgccgcc gtcaactatc
120cccgaagaag aaagcttcct ttctctcaaa cctcaccgat cctcagattt cgcttacgca
180gagatccgaa gacgaaaaaa acaaggccta accttccgag attttcgcct catgcgtcgt
240atcggcgccg gcgacatcgg aacagtttac ttatgccgtc tagccggaga cgaagaagag
300agccggagct cgtattttgc gatgaaagtt gtggataaag aagctcttgc gttgaagaag
360aagatgcata gagcagagat ggagaaaacg attttgaaaa tgcttgacca tccatttttg
420ccgactcttt acgctgagtt tgaagcctca catttctctt gcatcgttat ggaatattgc
480tccggtggtg atttacactc tctccgtcat agacaacctc accggcgatt ctccctctct
540tccgccaggt aaaaaatatc aaattttatt gaataattta atattatgga caaagtcaga
600ttttttttca aaaaaaaaaa attgtgaaaa aagattcatc atcatcaatg tatatatata
660ttttatagtt acatgcattg actctgttca catttgttat cttgttctgc aagaacagac
720ctgttcttat catgtcggtc ttttccagtt ctttgaattg ttatcaaaga gtctttttca
780gcccatcaca atttataaac gtcaataatt atgattttat tagctaatga gtatttattt
840tgtttttggt tacagatttt atgccgccga agttctagtg gcgttagaat atctacacat
900gttgggtatc atctacagag atctgaagcc tgaaaatatc ttagttagat ccgacggtca
960cattatgctc tctaactttg acctctctct atgctccgac tcaatcgcag ccgttgaatc
1020ttcctcgtct tcgccggaga atcaacaact ccgttcaccg cgacgattca ctcgtctcgc
1080tagacttttc caacgagtct tgcggtctaa aaaggttcag actttagaac caacccgtct
1140ctttgttgct gaaccggtta ctgcccggtc cggttcgttc gttggtacgc atgaatacgt
1200ggcaccagaa gttgcttcag gtggatcaca tggtaatgcc gttgactggt gggcctttgg
1260agtgtttctc tacgagatga tatatggcaa gactccgttc gttgcgccga ctaatgacgt
1320cattctccgt aacattgtga aaagacagtt gagtttcccg actgattcgc cggcgactat
1380gtttgagctt catgcgcgga atttgatttc cgggttgctt aacaaagatc cgactaaaag
1440acttgggtca cggcgaggtg cggcggaggt taaagtgcat ccttttttca aaggtctaaa
1500ctttgcgctc attcgtacgc ttactccgcc ggagattcct tcttccgtcg tcaagaagcc
1560gatgaaatcg gcgacgttta gtggtagaag tagtaacaaa ccagcggcgt tcgattactt
1620ttga
16249322PRTArabidopsis thaliana 9Met Met Asn Leu Gly Ser Leu Ser Leu Ser
Thr Ser Lys Ser Ser Lys1 5 10
15Pro Met Val Ser Ile Ser Phe Trp Ile Pro Tyr Phe Thr His Trp Gly
20 25 30Glu Ser Leu Leu Val Cys
Gly Ser Ala Pro Gly Leu Gly Ser Gly Asn 35 40
45Val Lys Lys Gly Leu Leu Leu Lys Pro Ser Gln Gln Asp Asp
Gln Leu 50 55 60Ile Trp Ser Gly Ser
Val Ser Val Pro Pro Gly Phe Ser Ser Asp Tyr65 70
75 80Cys Tyr Tyr Val Val Asp Asp Ser Lys Ser
Val Leu Arg Ser Glu Phe 85 90
95Gly Met Lys Arg Lys Leu Val Val Pro Glu Thr Leu Thr Gly Gly Glu
100 105 110Ser Val His Leu Arg
Asp Leu Trp Gln Ser Gly Asp Gln Ala Leu Pro 115
120 125Phe Arg Ser Ala Phe Lys Asp Val Ile Phe His His
Ser Phe Asp Val 130 135 140Lys Val Glu
Lys Pro Leu Gly Val Phe Met Asn Lys Ser Asp Gln Asp145
150 155 160Asp Ser Val Val Val Gln Phe
Lys Ile Cys Cys Pro Asp Ile Gly Glu 165
170 175Gly Thr Ser Val Tyr Val Leu Gly Thr Pro Glu Lys
Leu Gly Asn Trp 180 185 190Lys
Val Glu Asn Gly Leu Arg Leu Asn Tyr Val Asp Asp Ser Ile Trp 195
200 205Glu Ala Asp Cys Leu Ile Pro Lys Ala
Asp Phe Pro Ile Lys Tyr Arg 210 215
220Tyr Cys Lys Val Gln Lys Glu Asp Ser Ile Gly Phe Glu Ser Gly Gly225
230 235 240Asn Arg Glu Leu
Ser Leu His Ser Ile Gly Ser Lys Gln Glu Tyr Ile 245
250 255Val Met Ser Asp Gly Leu Phe Arg Ala Met
Pro Trp Arg Gly Ala Gly 260 265
270Val Ala Val Pro Met Phe Ser Val Arg Ser Glu Asp Asp Val Gly Val
275 280 285Gly Glu Phe Leu Asp Leu Lys
Leu Leu Val Asp Trp Ala Val Asp Ser 290 295
300Gly Leu His Leu Val Gln Leu Leu Pro Val Asn Asp Thr Ser Val
His305 310 315 320Lys
Met105412DNAArabidopsis thaliana 10atgatgaatc taggatctct ttcgttgagt
acgagcaagt cgagtaagcc aatggttagc 60atcagctttt ggataccgta tttcactcac
tggggagaga gcctgctggt ctgtggctcg 120gctcctggac ttggttccgg gaatgtgaaa
aaaggtttgc tgctaaagcc atcccagcaa 180gatgatcagc tcatctggag tggctctgtc
tcggttccac ctgggtttag ctctgactat 240tgttactatg tggtggatga ctcgaagagt
gtgttaaggt cggagtttgg gatgaaacga 300aaacttgtgg tgcctgagac attgaccggt
ggagagtctg ttcatcttcg tgatctctgg 360caggtttgag ttctttgtcc ttttcgagtg
tattatacaa attttgtaat ctgattttgt 420agtgttgtga ttgatgttct gttcctttaa
ttttagagtg gggatcaagc tcttccattt 480agaagtgcat ttaaagatgt catcttccac
cacagtttcg atgtgaaagt agaaaaacct 540ctaggggtct ttatgaataa gtcagatcaa
gatggtattc atttttttat ctaatcctat 600cacaccattg tctctgagat tctctagttt
gtgattgatc tctttgctaa ctgttgtggc 660cttatgactt taccatgtaa acagattcag
ttgttgtcca attcaaaatc tgttgtccag 720acattggaga gggaacatct gtaagccttt
ctctatgatc tttggttttc tttcttttca 780tgtttgagcc tgatgtggtg ttgtggccat
cattcttacc cttgattatg acatgtacag 840gtgtatgttc taggcacccc agaaaagttg
gggaattgga aagttgaaaa tggacttaga 900ctcaactatg tcgatgattc catatgggaa
gcagattgct tgatccctaa agcagacttt 960cctatcaaat atcctttctt tttactttct
tccagactaa ttagttgaac atagttgaca 1020cttcacttct catatgcttt ctgcatgttg
ggttcttaac gattctttca catatagata 1080ctgtaaagtt cagaaggaag atagcatagg
gtttgaatct ggtggtaatc gggagctgtc 1140tcttcactcc atcggtagta aacaggaata
cattgtcatg tcagatggct tgtttcgagt 1200aagtaacaga gacaagctat gaactagctc
taatgtgaca gaatcaatgt tgtcttacta 1260ttatttaatt ttacttgtag gcaatgccat
ggaggggtgc tggtgtagca gttcccatgt 1320tctctgtaag gtcagaagat gatgttggtg
tgggagagtt tctcgatctg aagttgcttg 1380ttgattgggc tgtagattcc gggttgcatc
tagtacaact tttaccagta aatgacacat 1440ccgttcataa gatgtgatgg gactcgtatc
cctacaggta tgatgattac atttatgtta 1500gttttgcagg agttcaataa aggacttctg
tcacttactt aacggatgga aacaaatatc 1560cagaattttt gttattattc tttcttgact
actgaattca taaatagttc attttttgct 1620tcaatttggt tttcgagacc ttgggttcca
tatactttgg ttgatgatta gtgactgagc 1680ctgaggctct gctgaatatt agtctctact
ctctacttag taaaatttct atatcaactc 1740attgaactgt gtgattcatt cttattgttc
ctttcctcat ttggattgct tctcttgcag 1800ctcgttatct gtgtttgcat tgcatccatt
atatcttaga gtgcaggctc tctctgaacg 1860tctgccagaa gatatcaagg taagtttcta
cacttattgt atcctggaag agactcctaa 1920gcaatatatt ggctgacctt tttcgttctc
tttcaggaag aaattcagaa ggcgaagaat 1980caactggaca agaatgtaat gagtcccttg
ttagacgttt catagatttt cttactttta 2040cgtccttgaa aaagatagca tcagataaac
gggatatagt atttttaaat ggtttgtgtc 2100tgtatcagca tgatgtgttt aaacgcctgc
aagcaattgg ttcatatgtt agttcgtcac 2160aatcagagtt tgagtgttgt aacctaatcc
tttaaagatt gttacgtgta ggatgttgat 2220tatgaggcta ccatggaaac taagctttca
attgccaaga aaatctttga catagagaaa 2280gaccagactt tgaactcaag caccttccag
aaattcttct ctgaaaacga ggtatgcttc 2340aaaatctgat attgtttcct tgatatctct
tgagaattgt ctgcatagaa cttatgcctc 2400caaaggccac gtcagcgttc cattggctta
cggatttttg ttatctttga aagggctggt 2460tgaaaccata tgcagctttc tgcttccttc
gtgacttttt tgagacttca gatcatagtc 2520agtgggggac cttttctgac tatacagacg
acaaggtata atttgctttt atgttgttca 2580agtttaagtt gtgatcgaat tagttcaagc
tttccagatg tatttcatat agtttcagta 2640atttctcaca tctccaaata acatttgcca
cagcttgaaa aattgatatc caaggacaac 2700ttgcactata acactatatg cttccactac
tacattcagt accatttaca tgtacaagta 2760agtatgttct gatttatcta gttatattcc
attcttgtaa gtgaagtcgg ttactggatg 2820tttcatctaa agtgtgttct ggttccattt
tcaaaagttg tcagcagcag cagaatatgc 2880aaggaagaaa ggagttgtgc tgaaaggaga
cctacctatt ggcgttgaca gaaacagtgt 2940tgatacgtgg gtttacagga atctgtttcg
catgaatacg tcaactggag cacctccaga 3000ctattttgac aaaaatggtc agaactgggg
atttcctact tataactggg aggaaatgtc 3060aaaagacaac tatgcctggt ggcgtgctcg
cctaacacag gttggaatta ttgtccttac 3120cagccacatc ctttgtagat ctgaaggctg
accacattgc atacttgctt tgcagatggg 3180gaaatatttc acagcataca ggattgatca
tatattggga ttcttcagaa tctgggagct 3240tccagctcat gctatgactg gtttagtggg
gaagttccgt ccatctattc ctttaagtca 3300ggtacacaag ctacatgctt acagttaaaa
aaaacttggc tttgagagtt acttgatgat 3360aaaactgctg ctgatgcagg aagagttgga
aaaggaggga atatgggatt ttgaccgctt 3420aagcaagccc tatatccaga agaagtttct
tgaggttagt tttccatcac atttattatg 3480atctgcctct gtcacagctt agttagtttg
tgagttttca atctcattac ctatttgttg 3540ataatatgtc attgttctgt tccttcagga
gaaatttgga gatttttggc cttttattgc 3600atcaaacttt cttaatgaaa ctcagaagga
catgtatgag gttagtaaaa atcctcattt 3660agagatcgtt ttcctttgga gaaccaagtc
ttaccttttt ctgtccaaaa ctctgtggtg 3720ttttttgcta agtggatgta ttaaatcttt
cttaattggt ttcctttaat tgtactctag 3780ttcaaggagg actgcaacac agagaagaag
attgtagcaa agctgaagtc attggctgag 3840aagtctttgt tgctagaaaa tgaggacaaa
gttcgacgtg atgtctttga cattttacgg 3900gtaaactttc atgcatcaca agggctttcc
agatttcttc tccttgttat ataaagtgac 3960tctgtccact tcacgtctct ctcaagtcca
ctgtcccaat tgtttatttc tgttgaactt 4020gtgtccctag aatgttgttc tgatcaaaga
tccagaggat gcaagaaaat tctatcctcg 4080ctttaatatt gaggatactt caagcttcca
ggatttggat gatcacaggt acattcaaaa 4140ccttttccca gtcacttccc aaatagagta
gtctgctctt gttcctttga ttttatatcg 4200cccatgaaca tttacacaga tcctctcttg
ctattgtatc gtgctgtatt tttctacttt 4260tcatcaatat gggaaaatct gattaaggtt
ttctcataca gcaaaaatgt tctgaagagg 4320ctatactatg actactattt ccaacgccaa
gaggatctat ggagaaaaaa tgctttgaaa 4380accttgcctg ctctgttgaa ttcatctaat
atgctggcat gtggggagga tctgggtctc 4440attccatctt gtgtacatcc tgtatgtgct
ctgttttctt tctcttggtt catccaactt 4500ctttataaaa cttgtgtaga agcatatgct
aaaattaata gttactcagg ttatgcaaga 4560actgggattg gttggccttc gcatccagcg
catgccaagt gagtccgatg tgaagtttgg 4620gattccgtct aattatgact atatgacggt
tagatatttt tcctcagact tctgtctttc 4680ttacttatag cttcctaccg ttcattttct
gagaattttg taatgggtga gtgattcact 4740gcataattca aactatcgtc ttaatatttt
aggtgtgtgc tccttcatgc cacgactgct 4800ctaccctgcg agcttggtgg gaagaggacg
aagagagaag acagcagtac ttcaaggaag 4860tgattggtgt agatggaatc cctccaagtc
agtgtattcc agagataact catttcattc 4920tgaggcaaca tgttgaagct ccatcaatgt
gggctatttt cccgcttcag gtaatcatca 4980ccactgtccg acttctcaga tttattagaa
ctttttatga ggctcaataa cagtatgtgt 5040tgtttgtttt tttcctctgg aacaatttag
gatatgatgg ctctgaaaga agagtacact 5100actcgtcctg caacagagga gacaatcaat
gatccaacaa accccaaaca ctactggaga 5160taccgtaagt atttactact aactaacaca
caaccctaag aactatacca gttttaactt 5220agactgtttt ttgtgcatat aggcgtacac
gtgactttgg actcgcttct aaaggacact 5280gacctgaagt caaccatcaa gaacctcgtt
tccagcagtg gaagatctgt tcctgctaat 5340gtttctggtg aagacatcaa caaaagccga
ggagaagtta tagccaatgg ctcgactaag 5400ccaaacccat aa
541211955PRTArabidopsis thaliana 11Met
Met Asn Leu Gly Ser Leu Ser Leu Ser Thr Ser Lys Ser Ser Lys1
5 10 15Pro Met Val Ser Ile Ser Phe
Trp Ile Pro Tyr Phe Thr His Trp Gly 20 25
30Glu Ser Leu Leu Val Cys Gly Ser Ala Pro Gly Leu Gly Ser
Gly Asn 35 40 45Val Lys Lys Gly
Leu Leu Leu Lys Pro Ser Gln Gln Asp Asp Gln Leu 50 55
60Ile Trp Ser Gly Ser Val Ser Val Pro Pro Gly Phe Ser
Ser Asp Tyr65 70 75
80Cys Tyr Tyr Val Val Asp Asp Ser Lys Ser Val Leu Arg Ser Glu Phe
85 90 95Gly Met Lys Arg Lys Leu
Val Val Pro Glu Thr Leu Thr Gly Gly Glu 100
105 110Ser Val His Leu Arg Asp Leu Trp Gln Ser Gly Asp
Gln Ala Leu Pro 115 120 125Phe Arg
Ser Ala Phe Lys Asp Val Ile Phe His His Ser Phe Asp Val 130
135 140Lys Val Glu Lys Pro Leu Gly Val Phe Met Asn
Lys Ser Asp Gln Asp145 150 155
160Asp Ser Val Val Val Gln Phe Lys Ile Cys Cys Pro Asp Ile Gly Glu
165 170 175Gly Thr Ser Val
Tyr Val Leu Gly Thr Pro Glu Lys Leu Gly Asn Trp 180
185 190Lys Val Glu Asn Gly Leu Arg Leu Asn Tyr Val
Asp Asp Ser Ile Trp 195 200 205Glu
Ala Asp Cys Leu Ile Pro Lys Ala Asp Phe Pro Ile Lys Tyr Arg 210
215 220Tyr Cys Lys Val Gln Lys Glu Asp Ser Ile
Gly Phe Glu Ser Gly Gly225 230 235
240Asn Arg Glu Leu Ser Leu His Ser Ile Gly Ser Lys Gln Glu Tyr
Ile 245 250 255Val Met Ser
Asp Gly Leu Phe Arg Ala Met Pro Trp Arg Gly Ala Gly 260
265 270Val Ala Val Pro Met Phe Ser Val Arg Ser
Glu Asp Asp Val Gly Val 275 280
285Gly Glu Phe Leu Asp Leu Lys Leu Leu Val Asp Trp Ala Val Asp Ser 290
295 300Gly Leu His Leu Val Gln Leu Leu
Pro Val Asn Asp Thr Ser Val His305 310
315 320Lys Met Trp Trp Asp Ser Tyr Pro Tyr Ser Ser Leu
Ser Val Phe Ala 325 330
335Leu His Pro Leu Tyr Leu Arg Val Gln Ala Leu Ser Glu Arg Leu Pro
340 345 350Glu Asp Ile Lys Glu Glu
Ile Gln Lys Ala Lys Asn Gln Leu Asp Lys 355 360
365Asn Asp Val Asp Tyr Glu Ala Thr Met Glu Thr Lys Leu Ser
Ile Ala 370 375 380Lys Lys Ile Phe Asp
Ile Glu Lys Asp Gln Thr Leu Asn Ser Ser Thr385 390
395 400Phe Gln Lys Phe Phe Ser Glu Asn Glu Gly
Trp Leu Lys Pro Tyr Ala 405 410
415Ala Phe Cys Phe Leu Arg Asp Phe Phe Glu Thr Ser Asp His Ser Gln
420 425 430Trp Gly Thr Phe Ser
Asp Tyr Thr Asp Asp Lys Leu Glu Lys Leu Ile 435
440 445Ser Lys Asp Asn Leu His Tyr Asn Thr Ile Cys Phe
His Tyr Tyr Ile 450 455 460Gln Tyr His
Leu His Val Gln Leu Ser Ala Ala Ala Glu Tyr Ala Arg465
470 475 480Lys Lys Gly Val Val Leu Lys
Gly Asp Leu Pro Ile Gly Val Asp Arg 485
490 495Asn Ser Val Asp Thr Trp Val Tyr Arg Asn Leu Phe
Arg Met Asn Thr 500 505 510Ser
Thr Gly Ala Pro Pro Asp Tyr Phe Asp Lys Asn Gly Gln Asn Trp 515
520 525Gly Phe Pro Thr Tyr Asn Trp Glu Glu
Met Ser Lys Asp Asn Tyr Ala 530 535
540Trp Trp Arg Ala Arg Leu Thr Gln Met Gly Lys Tyr Phe Thr Ala Tyr545
550 555 560Lys Ile Asp His
Ile Leu Gly Phe Phe Arg Ile Trp Glu Leu Pro Ala 565
570 575His Ala Met Thr Gly Leu Val Gly Lys Phe
Arg Pro Ser Ile Pro Leu 580 585
590Ser Gln Glu Glu Leu Glu Lys Glu Gly Ile Trp Asp Phe Asp Arg Leu
595 600 605Ser Lys Pro Tyr Ile Gln Lys
Lys Phe Leu Glu Glu Lys Phe Gly Asp 610 615
620Phe Trp Pro Phe Ile Ala Ser Asn Phe Leu Asn Glu Thr Gln Lys
Asp625 630 635 640Met Tyr
Glu Phe Lys Glu Asp Cys Asn Thr Glu Lys Lys Ile Val Ala
645 650 655Lys Leu Lys Ser Leu Ala Glu
Lys Ser Leu Leu Leu Glu Asn Glu Asp 660 665
670Lys Val Arg Arg Asp Val Phe Asp Ile Leu Arg Asn Val Val
Leu Ile 675 680 685Lys Asp Pro Glu
Asp Ala Arg Lys Phe Tyr Pro Arg Phe Asn Ile Glu 690
695 700Asp Thr Ser Ser Phe Gln Asp Leu Asp Asp His Ser
Lys Asn Val Leu705 710 715
720Lys Arg Leu Tyr Tyr Asp Tyr Tyr Phe Gln Arg Gln Glu Asp Leu Trp
725 730 735Arg Lys Asn Ala Leu
Lys Thr Leu Pro Ala Leu Leu Asn Ser Ser Asn 740
745 750Met Leu Ala Cys Gly Glu Asp Leu Gly Leu Ile Pro
Ser Cys Val His 755 760 765Pro Val
Met Gln Glu Leu Gly Leu Val Gly Leu Arg Ile Gln Arg Met 770
775 780Pro Ser Glu Ser Asp Val Lys Phe Gly Ile Pro
Ser Asn Tyr Asp Tyr785 790 795
800Met Thr Val Cys Ala Pro Ser Cys His Asp Cys Ser Thr Leu Arg Ala
805 810 815Trp Trp Glu Glu
Asp Glu Glu Arg Arg Gln Gln Tyr Phe Lys Glu Val 820
825 830Ile Gly Val Asp Gly Ile Pro Pro Ser Gln Cys
Ile Pro Glu Ile Thr 835 840 845His
Phe Ile Leu Arg Gln His Val Glu Ala Pro Ser Met Trp Ala Ile 850
855 860Phe Pro Leu Gln Asp Met Met Ala Leu Lys
Glu Glu Tyr Thr Thr Arg865 870 875
880Pro Ala Thr Glu Glu Thr Ile Asn Asp Pro Thr Asn Pro Lys His
Tyr 885 890 895Trp Arg Tyr
Arg Val His Val Thr Leu Asp Ser Leu Leu Lys Asp Thr 900
905 910Asp Leu Lys Ser Thr Ile Lys Asn Leu Val
Ser Ser Ser Gly Arg Ser 915 920
925Val Pro Ala Asn Val Ser Gly Glu Asp Ile Asn Lys Ser Arg Gly Glu 930
935 940Val Ile Ala Asn Gly Ser Thr Lys
Pro Asn Pro945 950
955125412DNAArabidopsis thaliana 12atgatgaatc taggatctct ttcgttgagt
acgagcaagt cgagtaagcc aatggttagc 60atcagctttt ggataccgta tttcactcac
tggggagaga gcctgctggt ctgtggctcg 120gctcctggac ttggttccgg gaatgtgaaa
aaaggtttgc tgctaaagcc atcccagcaa 180gatgatcagc tcatctggag tggctctgtc
tcggttccac ctgggtttag ctctgactat 240tgttactatg tggtggatga ctcgaagagt
gtgttaaggt cggagtttgg gatgaaacga 300aaacttgtgg tgcctgagac attgaccggt
ggagagtctg ttcatcttcg tgatctctgg 360caggtttgag ttctttgtcc ttttcgagtg
tattatacaa attttgtaat ctgattttgt 420agtgttgtga ttgatgttct gttcctttaa
ttttagagtg gggatcaagc tcttccattt 480agaagtgcat ttaaagatgt catcttccac
cacagtttcg atgtgaaagt agaaaaacct 540ctaggggtct ttatgaataa gtcagatcaa
gatggtattc atttttttat ctaatcctat 600cacaccattg tctctgagat tctctagttt
gtgattgatc tctttgctaa ctgttgtggc 660cttatgactt taccatgtaa acagattcag
ttgttgtcca attcaaaatc tgttgtccag 720acattggaga gggaacatct gtaagccttt
ctctatgatc tttggttttc tttcttttca 780tgtttgagcc tgatgtggtg ttgtggccat
cattcttacc cttgattatg acatgtacag 840gtgtatgttc taggcacccc agaaaagttg
gggaattgga aagttgaaaa tggacttaga 900ctcaactatg tcgatgattc catatgggaa
gcagattgct tgatccctaa agcagacttt 960cctatcaaat atcctttctt tttactttct
tccagactaa ttagttgaac atagttgaca 1020cttcacttct catatgcttt ctgcatgttg
ggttcttaac gattctttca catatagata 1080ctgtaaagtt cagaaggaag atagcatagg
gtttgaatct ggtggtaatc gggagctgtc 1140tcttcactcc atcggtagta aacaggaata
cattgtcatg tcagatggct tgtttcgagt 1200aagtaacaga gacaagctat gaactagctc
taatgtgaca gaatcaatgt tgtcttacta 1260ttatttaatt ttacttgtag gcaatgccat
ggaggggtgc tggtgtagca gttcccatgt 1320tctctgtaag gtcagaagat gatgttggtg
tgggagagtt tctcgatctg aagttgcttg 1380ttgattgggc tgtagattcc gggttgcatc
tagtacaact tttaccagta aatgacacat 1440ccgttcataa gatgtggtgg gactcgtatc
cctacaggta tgatgattac atttatgtta 1500gttttgcagg agttcaataa aggacttctg
tcacttactt aacggatgga aacaaatatc 1560cagaattttt gttattattc tttcttgact
actgaattca taaatagttc attttttgct 1620tcaatttggt tttcgagacc ttgggttcca
tatactttgg ttgatgatta gtgactgagc 1680ctgaggctct gctgaatatt agtctctact
ctctacttag taaaatttct atatcaactc 1740attgaactgt gtgattcatt cttattgttc
ctttcctcat ttggattgct tctcttgcag 1800ctcgttatct gtgtttgcat tgcatccatt
atatcttaga gtgcaggctc tctctgaacg 1860tctgccagaa gatatcaagg taagtttcta
cacttattgt atcctggaag agactcctaa 1920gcaatatatt ggctgacctt tttcgttctc
tttcaggaag aaattcagaa ggcgaagaat 1980caactggaca agaatgtaat gagtcccttg
ttagacgttt catagatttt cttactttta 2040cgtccttgaa aaagatagca tcagataaac
gggatatagt atttttaaat ggtttgtgtc 2100tgtatcagca tgatgtgttt aaacgcctgc
aagcaattgg ttcatatgtt agttcgtcac 2160aatcagagtt tgagtgttgt aacctaatcc
tttaaagatt gttacgtgta ggatgttgat 2220tatgaggcta ccatggaaac taagctttca
attgccaaga aaatctttga catagagaaa 2280gaccagactt tgaactcaag caccttccag
aaattcttct ctgaaaacga ggtatgcttc 2340aaaatctgat attgtttcct tgatatctct
tgagaattgt ctgcatagaa cttatgcctc 2400caaaggccac gtcagcgttc cattggctta
cggatttttg ttatctttga aagggctggt 2460tgaaaccata tgcagctttc tgcttccttc
gtgacttttt tgagacttca gatcatagtc 2520agtgggggac cttttctgac tatacagacg
acaaggtata atttgctttt atgttgttca 2580agtttaagtt gtgatcgaat tagttcaagc
tttccagatg tatttcatat agtttcagta 2640atttctcaca tctccaaata acatttgcca
cagcttgaaa aattgatatc caaggacaac 2700ttgcactata acactatatg cttccactac
tacattcagt accatttaca tgtacaagta 2760agtatgttct gatttatcta gttatattcc
attcttgtaa gtgaagtcgg ttactggatg 2820tttcatctaa agtgtgttct ggttccattt
tcaaaagttg tcagcagcag cagaatatgc 2880aaggaagaaa ggagttgtgc tgaaaggaga
cctacctatt ggcgttgaca gaaacagtgt 2940tgatacgtgg gtttacagga atctgtttcg
catgaatacg tcaactggag cacctccaga 3000ctattttgac aaaaatggtc agaactgggg
atttcctact tataactggg aggaaatgtc 3060aaaagacaac tatgcctggt ggcgtgctcg
cctaacacag gttggaatta ttgtccttac 3120cagccacatc ctttgtagat ctgaaggctg
accacattgc atacttgctt tgcagatggg 3180gaaatatttc acagcataca agattgatca
tatattggga ttcttcagaa tctgggagct 3240tccagctcat gctatgactg gtttagtggg
gaagttccgt ccatctattc ctttaagtca 3300ggtacacaag ctacatgctt acagttaaaa
aaaacttggc tttgagagtt acttgatgat 3360aaaactgctg ctgatgcagg aagagttgga
aaaggaggga atatgggatt ttgaccgctt 3420aagcaagccc tatatccaga agaagtttct
tgaggttagt tttccatcac atttattatg 3480atctgcctct gtcacagctt agttagtttg
tgagttttca atctcattac ctatttgttg 3540ataatatgtc attgttctgt tccttcagga
gaaatttgga gatttttggc cttttattgc 3600atcaaacttt cttaatgaaa ctcagaagga
catgtatgag gttagtaaaa atcctcattt 3660agagatcgtt ttcctttgga gaaccaagtc
ttaccttttt ctgtccaaaa ctctgtggtg 3720ttttttgcta agtggatgta ttaaatcttt
cttaattggt ttcctttaat tgtactctag 3780ttcaaggagg actgcaacac agagaagaag
attgtagcaa agctgaagtc attggctgag 3840aagtctttgt tgctagaaaa tgaggacaaa
gttcgacgtg atgtctttga cattttacgg 3900gtaaactttc atgcatcaca agggctttcc
agatttcttc tccttgttat ataaagtgac 3960tctgtccact tcacgtctct ctcaagtcca
ctgtcccaat tgtttatttc tgttgaactt 4020gtgtccctag aatgttgttc tgatcaaaga
tccagaggat gcaagaaaat tctatcctcg 4080ctttaatatt gaggatactt caagcttcca
ggatttggat gatcacaggt acattcaaaa 4140ccttttccca gtcacttccc aaatagagta
gtctgctctt gttcctttga ttttatatcg 4200cccatgaaca tttacacaga tcctctcttg
ctattgtatc gtgctgtatt tttctacttt 4260tcatcaatat gggaaaatct gattaaggtt
ttctcataca gcaaaaatgt tctgaagagg 4320ctatactatg actactattt ccaacgccaa
gaggatctat ggagaaaaaa tgctttgaaa 4380accttgcctg ctctgttgaa ttcatctaat
atgctggcat gtggggagga tctgggtctc 4440attccatctt gtgtacatcc tgtatgtgct
ctgttttctt tctcttggtt catccaactt 4500ctttataaaa cttgtgtaga agcatatgct
aaaattaata gttactcagg ttatgcaaga 4560actgggattg gttggccttc gcatccagcg
catgccaagt gagtccgatg tgaagtttgg 4620gattccgtct aattatgact atatgacggt
tagatatttt tcctcagact tctgtctttc 4680ttacttatag cttcctaccg ttcattttct
gagaattttg taatgggtga gtgattcact 4740gcataattca aactatcgtc ttaatatttt
aggtgtgtgc tccttcatgc cacgactgct 4800ctaccctgcg agcttggtgg gaagaggacg
aagagagaag acagcagtac ttcaaggaag 4860tgattggtgt agatggaatc cctccaagtc
agtgtattcc agagataact catttcattc 4920tgaggcaaca tgttgaagct ccatcaatgt
gggctatttt cccgcttcag gtaatcatca 4980ccactgtccg acttctcaga tttattagaa
ctttttatga ggctcaataa cagtatgtgt 5040tgtttgtttt tttcctctgg aacaatttag
gatatgatgg ctctgaaaga agagtacact 5100actcgtcctg caacagagga gacaatcaat
gatccaacaa accccaaaca ctactggaga 5160taccgtaagt atttactact aactaacaca
caaccctaag aactatacca gttttaactt 5220agactgtttt ttgtgcatat aggcgtacac
gtgactttgg actcgcttct aaaggacact 5280gacctgaagt caaccatcaa gaacctcgtt
tccagcagtg gaagatctgt tcctgctaat 5340gtttctggtg aagacatcaa caaaagccga
ggagaagtta tagccaatgg ctcgactaag 5400ccaaacccat aa
5412133576DNAArabidopsis thaliana
13atgaattgtc ttcagaatct tcccaggtat cttcttgctt tccgaaatta gcaaatgttg
60ttgatttgtg tggttctttg attcgatatg aatttgtgtg ttgttgatgt ctttgatcgt
120catttatttc agatgttcag tctcacctct gctgggattc gggtgcattc aaagagatca
180ttcttcttct tcttcttctt tgaagatgct aatatcgcct ccgatcaaag ccaatgatcc
240aaaatctcga cttgttttac atgtatacac ctcttttaag atatattgtt atatcagttt
300ttttcttcta acttttattg acacttttgt aaatgctttg agtgtaggca gtatcagagt
360caaaatccag ctcagagatg agtggtgttg caaaggatga agagaaatct gatgaatata
420gccaagacat gactcaagct atgggtgctg gtaaattgtt ttcatcatac ctatacatgt
480ccttagattg gaaactctaa tgatgtccac cattttttgt ctcatattgg aaatattgat
540ttagctcatt ggattgtatt ctactcttgt ttttcatctt tcatcttgat tagttttgtt
600catctgtcta ccaaggaatc ttatatcttt tgcttgttta ttgacaaact tatcttcttg
660ttctccttaa ttatgtattt cagtcactga ttcattctca tgtgtctttc atttcagttc
720taacttacag gcacgagtta ggaatgaact acaactttat tcgtccagat ctaattgttg
780gatcctgctt acaggttcac ttttatatcc ttctctcaaa tgatgatttt gttttggata
840tgtggtactt tcctcgtgtt cttactgtat gctctccttt attgctagac ccctgaagat
900gttgacaagc ttcgtaaaat tggagttaaa accatatttt gcttgcaaca agatccagac
960ctggagtatc cttccaaagc cctctttttc tgttcatgat tatgattttg ttttcacttt
1020ccttcgcatt gagtgaggaa gaatgatctg cagtgcagaa tgccttgacc aatccttcca
1080gatattttgg agtagacata agcagcatcc aagcctatgc taagaaatat agtgatattc
1140agcatattcg ctgtgaaatt aggtaaggca acgatctagc tgtgactttt taatatattg
1200gtattcaaag ctgatgcgta atgaatcgaa ggagagatca tgagttgatt tttagtttcc
1260tttccaatat ttttcttatt gcttgacttt ataagaagca tatgaggtag ttttttttcc
1320agaattatct gaataggcga gggaacaaag gtatatacac tgttggaggg tagtctcaga
1380tatcttttag gcacgcaatc tttaaattct tattgttttg tcagtgaagt gttatacttt
1440ctttttcttt gggctcttta cttttatcac gtttcctaag ctaatgggaa atcagcacac
1500ttttgtgtaa gcaataactg gtttatgacg aattgtgagt gatagttttt ttttaccggg
1560tgtgaaatca ctgctatcaa ctaacttttt tgttcgtcca cctttgtctg tttacaaact
1620agagtaactg tagaagttag tcatcatccc attctcacta aatttcttta ctgctaaaga
1680cttggaacca tcatcatccc ttttggcata tatgagactg ttcacgagca catgcataga
1740aaaattcacc atggacgtac ccactcacaa cgtaggggat tgacatatta tatatcattg
1800atgtaacaga gactttgatg catttgattt gagaatgcgt cttccagccg tggttggtac
1860tctttacaaa gctgttaagc gaaatggagg agttacatat gtgcactgca ctgctggaat
1920gggaagggct cctgctgttg cggtatgtta cagaaacctg ttactaagct cttctctttt
1980tccccctttt ctttccagta ttgaaaaaat gtaagactgt tgcaaacagt agttgtctta
2040attttgtgct tatttggttg actcatgaag gtatgaacaa gttttcacat tttgctgact
2100gatactgaga agggatatta atttcatttt cagttgacat acatgttctg ggtgcaaggc
2160tataagctta tggaagctca taaattactt atgataagat actaatttac ttgtggttta
2220tgttctctat atgtgtttct ttatacttat attaagtaac caacatgtta atcatgtttt
2280actagtgtta ctgcctgtta tattatgtga ggatgtttgg ggatttcctt cttctcctat
2340gaaccgacaa ttatgataac tagaatcata gaccattggg tcaggggttt gtagtaaata
2400tacactgaag atatagcact ttaaaagaca aaaatcttgg gttagttcat ccaaaatagt
2460ctagcattag acatttttag tttgatatct ctagatggct ggtgaaacat aaaggcacgc
2520gtaaatttgg tttatataac atttctgaaa ttttcttgca gagcaaaagg tcgtgctttc
2580cgaagctgga tgctatcaga aatgcaacaa ttgatattgt gagtcagcgt tcaactttca
2640caatgcttcc tatgtttgaa aaacctcagt tccattgatt agcatgagtg cgcaacagtt
2700acctggttcc ttgactcctg attattgttt ccgggtgtat atggataaaa gcttacagga
2760ctcaagagga agactgttac tctgacactg aaagataagg ggttctccag agtagaaatt
2820tctggccttg acattggatg gggacaggta aatatatttt atcttaaacc aatttattta
2880ttatgacacc ttgctcttaa atctatggtg tgccatataa gctttaaagc caggatgcac
2940ttttgtcact gctattaaaa gaatggcctc tttgataaga agtggaagag atgagtgttg
3000gatttctgaa aattattgtc ttatgattgt gcaaaactat ttggacgtat atatataaaa
3060ggtaatgtat ttctttagct cttttcgttt ttaccatttt tctctttatg tagaggatac
3120ctctaacact ggacaaggga acaggattct ggatcctaaa gagagaactg cctgtgagtt
3180ttcttcttaa ctgttaagca tagtgtctga tcattttcca acaaccacta caatttctga
3240aatgcaggaa ggacagtttg aatataaata catcatagat ggtgaatgga cacacaatga
3300ggccgaaccg tttataggac ctaacaaaga cggccatacc aacaattacg ctaaagtaaa
3360aatgtctctg tctcttaaca tatagctgaa caagttttga tgaatgagtt aatgctgtaa
3420tgttgttgtg tactaatgaa ttaggtagtg gacgatccaa caagtgtgga tggtacaact
3480cgggagagac tatcgagcga agaccctgag ctgttggagg aagaacgctc gaaactaatc
3540cagttcttgg agacttgttc tgaggcagaa gtttga
3576145380DNAArabidopsis thaliana 14atggaggcca gtgccggctt ggttgctgga
tcctaccgga gaaacgagct cgttcggatc 60cgacatgaat ctgatggcgg ggtctgttca
tcttcccttt ttcccatttt tttgttattg 120tttttcgttc ttacaatttt tgatgtgtag
atctcatcta gatttctctg tttctaaatc 180tcgtctcttt tggatccata attggatcat
tgaaactcag atttcgcttc ctttgactgt 240gtagttagtt agtgtcagtt gatcaagtaa
gtgtctgaaa atggaaactt ttctgctcca 300attcttcaaa ttgttgtgat ctatatcaat
taatgccgca tctgttttct taaaatctct 360tatggaaagt gtcggtggat ttcagttcgt
taactttttt aagctaaaat ctttgactct 420taaagtttag ctttacttat tgagatttag
ctcaactaga tctcgttagt tcccgccatg 480ggatacagac tgtgactcgc cttaattcag
atctgcattg attgttttga tttagatcct 540tgctcatctc tttctgtagt ttctaatact
caatgactaa caatgatgca atgttggtca 600aagtgcagac caaacctttg aagaatatga
atggccagat atgtcagatc tgtggtgatg 660atgttggact cgctgaaact ggagatgtct
ttgtcgcgtg taatgaatgt gccttccctg 720tgtgtcggcc ttgctatgag tacgagagga
aagatggaac tcagtgttgc cctcaatgca 780agactagatt cagacgacac aggggtcagt
tgtctttttc tttttgttgg caattgctat 840atatggattt tctctttttg tttctttgct
gttgtgttga acaatttttt ggaattttcc 900agggagtcct cgtgttgaag gagatgaaga
tgaggatgat gttgatgata tcgagaatga 960gttcaattac gcccagggag ctaacaaggc
gagacaccaa cgccatggcg aagagttttc 1020ttcttcctct agacatgaat ctcaaccaat
tcctcttctc acccatggcc atacggtagg 1080gacctacatt ttccctttag actctagagt
gatttgtatt actcaataaa tccctagagt 1140ggtcatttat tacttactat tcacgttaat
gttatatgtg aacaaatctt aacagaattt 1200ttttctgata gtacatggtc atccaaatta
agaaataata atagatgttg ttagttgtgt 1260ctgttttcaa tagattcatg acctttttct
atacacaggt ttctggagag attcgcacgc 1320ctgatacaca atctgtgcga actacatcag
gtcctttggg tccttctgac aggaatgcta 1380tttcatctcc atatattgat ccacggcaac
ctggtattca tatgtttttc ccttgtgcac 1440gtggtctttg ttaaatgtga ttcctattca
tttttacaac atatatattt tgtgtaccgt 1500aactgatagc tcccgctaaa aattgcagtc
cctgtaagaa tcgtggaccc gtcaaaagac 1560ttgaactctt atgggcttgg taatgttgac
tggaaagaaa gagttgaagg ctggaagctg 1620aagcaggaga aaaatatgtt acagatgact
ggtaaatacc atgaagggaa aggaggagaa 1680attgaaggga ctggttccaa tggcgaagaa
ctccaaatgt aagtggaaat actagaccaa 1740tatctttatt gtccaactca aacagctctt
ggccgtgatg ctaataacca ctcttggttt 1800cttattatgt attgatagac ataattaagt
atctgctttg ttacatttgt ttccttccac 1860tcaattatgg ttctcgtact tacagggctg
atgatacacg tcttcctatg agtcgtgtgg 1920tgcctatccc atcttctcgc ctaacccctt
atcgggttgt gattattctc cggcttatca 1980tcttgtgttt cttcttgcaa tatcgtacaa
ctcaccctgt gaaaaatgca tatcctttgt 2040ggttgacctc ggttatctgt gagatctggt
ttgcattttc ttggcttctt gatcagtttc 2100ccaaatggta ccccattaac agggagactt
atcttgaccg tctcgctata aggttggtct 2160ttaagtttat acatccccta ctctcatctc
tcttttatgt attaacttga tatcttctat 2220cacagttttc gatagttgac tttttccccc
tgtaaattta atttaaattt agacaatggt 2280gcatctgaat tttgattatg atatatctta
agaagattat gattgtaaat cttgaaattt 2340agtagaaaac catctgcaat ctactgacca
tgtgaagttt ccgactagac tatgatagaa 2400gcatgccaag tggagtgttt attaagatag
agcttagcta ttatactgat tttatatgtg 2460ttttgatttt ttggtttctt attgtagata
tgatcgagac ggtgaaccat cacagctcgt 2520tcctgttgat gtgtttgtta gtacagtgga
cccattgaaa gagcctcccc ttgttacagc 2580aaacacagtt ctctcgattc tttctgtgga
ctacccggta gataaagtag cctgttatgt 2640ttcagatgat ggttcagcta tgcttacctt
tgaatccctt tctgaaaccg ctgagtttgc 2700aaagaaatgg gtaccatttt gcaagaaatt
caacattgaa cctagggccc ctgaattcta 2760ttttgcccag aagatagatt acttgaagga
caagatccaa ccgtcttttg ttaaagagcg 2820acgagctatg aaggtcattt gaaaagtcca
cctgcttctc atccatacgg caaagagatt 2880gactgacttt ttctttggtt tgtattgaca
gagagagtat gaagagttta aagtgaggat 2940aaatgctctt gttgccaaag cacagaaaat
ccctgaagaa ggctggacaa tgcaggatgg 3000tactccctgg cctggtaaca acactagaga
tcatcctgga atgatacagg tacagtgtgg 3060caatcccttg attgtgacag agaggataac
gtaaaggaaa catgtttaca tcgttttgtt 3120tcaatttcag gtgttcttag gccatagtgg
gggtctggat accgatggaa atgagctgcc 3180tagactcatc tatgtttctc gtgaaaagcg
gcctggattt caacaccaca aaaaggctgg 3240agctatgaat gcattggttt gttaactttc
agaatcctat tgtgtcctct attttattct 3300cttgttcact gcctaagaaa cgttcttctt
gtgtagccgt tgcttcacat tctttttttt 3360ctaggctatg tgttctctcc taatttagta
tctctttact ttgacagatc cgtgtatctg 3420ctgttcttac caatggagca tatcttttga
acgtggattg tgatcattac tttaataaca 3480gtaaggctat taaagaagct atgtgtttca
tgatggaccc ggctattgga aagaagtgct 3540gctatgtcca gttccctcaa cgttttgacg
gtattgattt gcacgatcga tatgccaaca 3600ggaatatagt ctttttcgat gtgagtatca
cttccccatt gtcttttgtt tctcttttgt 3660tcatattttg gttggattta ctcgtttctg
ctatggcctg acttggatat ttgttctctt 3720gggcagatta acatgaaggg gttggatggt
atccagggtc cagtatatgt gggtactggt 3780tgttgtttta ataggcaggc tctatatggg
tatgatcctg ttttgacgga agaagattta 3840gaaccaaata ttattgtcaa gagctgttgc
gggtcaagga agaaaggtaa aagtagcaag 3900aagtataact acgaaaagag gagaggcatc
aacagaagtg actccaatgc tccacttttc 3960aatatggagg acatcgatga gggttttgaa
ggtttgattg agctgattgt gtaataacat 4020cacttcttta tgtaatgatt tatgtgatgg
tgaaatctta caatccttgt ttatgcaggt 4080tatgatgatg agaggtctat tctaatgtcc
cagaggagtg tagagaagcg ttttggtcag 4140tcgccggtat ttattgcggc aaccttcatg
gaacaaggcg gcattccacc aacaaccaat 4200cccgctactc ttctgaagga ggctattcat
gttataagct gtggttacga agacaagact 4260gaatggggca aagaggtcag ttttcaaatg
cagctacaga atcttcttat gttctctttc 4320ttacctgttt gatgacatct tatttggcac
ttttgttaga ttggttggat ctatggttcc 4380gtgacggaag atattcttac tgggttcaag
atgcatgccc ggggttggat atcgatctac 4440tgcaatcctc cacgccctgc gttcaaggga
tctgcaccaa tcaatctttc tgatcgtttg 4500aaccaagttc ttcgatgggc tttgggatct
atcgagattc ttcttagcag acattgtcct 4560atctggtatg gttaccatgg aaggttgaga
cttttggaga ggatcgctta tatcaacacc 4620atcgtctatc ctattacatc catccctctt
attgcgtatt gtattcttcc cgctttttgt 4680ctcatcaccg acagattcat catacccgag
gtttgtaaaa ctgaccacac tgctatttac 4740tatttgaatc ccattttgtg aatgcatttt
tttgtcatca tcattgttgc agataagcaa 4800ctacgcgagt atttggttca ttctactctt
catctcaatt gctgtgactg gaatcctgga 4860gctgagatgg agcggtgtga gcattgagga
ttggtggagg aacgagcagt tctgggtcat 4920tggtggcaca tccgcccatc tttttgctgt
cttccaaggt ctacttaagg ttcttgctgg 4980tatcgacacc aacttcaccg ttacatctaa
agccacagac gaagatgggg attttgcaga 5040actctacatc ttcaaatgga cagctcttct
cattccacca accaccgtcc tacttgtgaa 5100cctcataggc attgtggctg gtgtctctta
tgctgtaaac agtggctacc agtcgtgggg 5160tccgcttttc aggaagctct tcttcgcctt
atgggttatt gcccatctct accctttctt 5220gaaaggtctg ttgggaagac aaaaccgaac
accaaccatc gtcattgtct ggtctgttct 5280tctcgcctcc atcttctcgt tgctttgggt
caggatcaat ccctttgtgg acgccaatcc 5340caatgccaac aacttcaatg gcaaaggagg
tgtcttttag 5380151081PRTArabidopsis thaliana
15Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser Tyr Arg Arg Asn Glu1
5 10 15Leu Val Arg Ile Arg His
Glu Ser Asp Gly Gly Thr Lys Pro Leu Lys 20 25
30Asn Met Asn Gly Gln Ile Cys Gln Ile Cys Gly Asp Asp
Val Gly Leu 35 40 45Ala Glu Thr
Gly Asp Val Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 50
55 60Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Lys Asp
Gly Thr Gln Cys65 70 75
80Cys Pro Gln Cys Lys Thr Arg Phe Arg Arg His Arg Gly Ser Pro Arg
85 90 95Val Glu Gly Asp Glu Asp
Glu Asp Asp Val Asp Asp Ile Glu Asn Glu 100
105 110Phe Asn Tyr Ala Gln Gly Ala Asn Lys Ala Arg His
Gln Arg His Gly 115 120 125Glu Glu
Phe Ser Ser Ser Ser Arg His Glu Ser Gln Pro Ile Pro Leu 130
135 140Leu Thr His Gly His Thr Val Ser Gly Glu Ile
Arg Thr Pro Asp Thr145 150 155
160Gln Ser Val Arg Thr Thr Ser Gly Pro Leu Gly Pro Ser Asp Arg Asn
165 170 175Ala Ile Ser Ser
Pro Tyr Ile Asp Pro Arg Gln Pro Val Pro Val Arg 180
185 190Ile Val Asp Pro Ser Lys Asp Leu Asn Ser Tyr
Gly Leu Gly Asn Val 195 200 205Asp
Trp Lys Glu Arg Val Glu Gly Trp Lys Leu Lys Gln Glu Lys Asn 210
215 220Met Leu Gln Met Thr Gly Lys Tyr His Glu
Gly Lys Gly Gly Glu Ile225 230 235
240Glu Gly Thr Gly Ser Asn Gly Glu Glu Leu Gln Met Ala Asp Asp
Thr 245 250 255Arg Leu Pro
Met Ser Arg Val Val Pro Ile Pro Ser Ser Arg Leu Thr 260
265 270Pro Tyr Arg Val Val Ile Ile Leu Arg Leu
Ile Ile Leu Cys Phe Phe 275 280
285Leu Gln Tyr Arg Thr Thr His Pro Val Lys Asn Ala Tyr Pro Leu Trp 290
295 300Leu Thr Ser Val Ile Cys Glu Ile
Trp Phe Ala Phe Ser Trp Leu Leu305 310
315 320Asp Gln Phe Pro Lys Trp Tyr Pro Ile Asn Arg Glu
Thr Tyr Leu Asp 325 330
335Arg Leu Ala Ile Arg Tyr Asp Arg Asp Gly Glu Pro Ser Gln Leu Val
340 345 350Pro Val Asp Val Phe Val
Ser Thr Val Asp Pro Leu Lys Glu Pro Pro 355 360
365Leu Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ser Val Asp
Tyr Pro 370 375 380Val Asp Lys Val Ala
Cys Tyr Val Ser Asp Asp Gly Ser Ala Met Leu385 390
395 400Thr Phe Glu Ser Leu Ser Glu Thr Ala Glu
Phe Ala Lys Lys Trp Val 405 410
415Pro Phe Cys Lys Lys Phe Asn Ile Glu Pro Arg Ala Pro Glu Phe Tyr
420 425 430Phe Ala Gln Lys Ile
Asp Tyr Leu Lys Asp Lys Ile Gln Pro Ser Phe 435
440 445Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu
Glu Phe Lys Val 450 455 460Arg Ile Asn
Ala Leu Val Ala Lys Ala Gln Lys Ile Pro Glu Glu Gly465
470 475 480Trp Thr Met Gln Asp Gly Thr
Pro Trp Pro Gly Asn Asn Thr Arg Asp 485
490 495His Pro Gly Met Ile Gln Val Phe Leu Gly His Ser
Gly Gly Leu Asp 500 505 510Thr
Asp Gly Asn Glu Leu Pro Arg Leu Ile Tyr Val Ser Arg Glu Lys 515
520 525Arg Pro Gly Phe Gln His His Lys Lys
Ala Gly Ala Met Asn Ala Leu 530 535
540Ile Arg Val Ser Ala Val Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val545
550 555 560Asp Cys Asp His
Tyr Phe Asn Asn Ser Lys Ala Ile Lys Glu Ala Met 565
570 575Cys Phe Met Met Asp Pro Ala Ile Gly Lys
Lys Cys Cys Tyr Val Gln 580 585
590Phe Pro Gln Arg Phe Asp Gly Ile Asp Leu His Asp Arg Tyr Ala Asn
595 600 605Arg Asn Ile Val Phe Phe Asp
Ile Asn Met Lys Gly Leu Asp Gly Ile 610 615
620Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Cys Phe Asn Arg Gln
Ala625 630 635 640Leu Tyr
Gly Tyr Asp Pro Val Leu Thr Glu Glu Asp Leu Glu Pro Asn
645 650 655Ile Ile Val Lys Ser Cys Cys
Gly Ser Arg Lys Lys Gly Lys Ser Ser 660 665
670Lys Lys Tyr Asn Tyr Glu Lys Arg Arg Gly Ile Asn Arg Ser
Asp Ser 675 680 685Asn Ala Pro Leu
Phe Asn Met Glu Asp Ile Asp Glu Gly Phe Glu Gly 690
695 700Tyr Asp Asp Glu Arg Ser Ile Leu Met Ser Gln Arg
Ser Val Glu Lys705 710 715
720Arg Phe Gly Gln Ser Pro Val Phe Ile Ala Ala Thr Phe Met Glu Gln
725 730 735Gly Gly Ile Pro Pro
Thr Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala 740
745 750Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr
Glu Trp Gly Lys 755 760 765Glu Ile
Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr Gly 770
775 780Phe Lys Met His Ala Arg Gly Trp Ile Ser Ile
Tyr Cys Asn Pro Pro785 790 795
800Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu
805 810 815Asn Gln Val Leu
Arg Trp Ala Leu Gly Ser Ile Glu Ile Leu Leu Ser 820
825 830Arg His Cys Pro Ile Trp Tyr Gly Tyr His Gly
Arg Leu Arg Leu Leu 835 840 845Glu
Arg Ile Ala Tyr Ile Asn Thr Ile Val Tyr Pro Ile Thr Ser Ile 850
855 860Pro Leu Ile Ala Tyr Cys Ile Leu Pro Ala
Phe Cys Leu Ile Thr Asp865 870 875
880Arg Phe Ile Ile Pro Glu Ile Ser Asn Tyr Ala Ser Ile Trp Phe
Ile 885 890 895Leu Leu Phe
Ile Ser Ile Ala Val Thr Gly Ile Leu Glu Leu Arg Trp 900
905 910Ser Gly Val Ser Ile Glu Asp Trp Trp Arg
Asn Glu Gln Phe Trp Val 915 920
925Ile Gly Gly Thr Ser Ala His Leu Phe Ala Val Phe Gln Gly Leu Leu 930
935 940Lys Val Leu Ala Gly Ile Asp Thr
Asn Phe Thr Val Thr Ser Lys Ala945 950
955 960Thr Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr Ile
Phe Lys Trp Thr 965 970
975Ala Leu Leu Ile Pro Pro Thr Thr Val Leu Leu Val Asn Leu Ile Gly
980 985 990Ile Val Ala Gly Val Ser
Tyr Ala Val Asn Ser Gly Tyr Gln Ser Trp 995 1000
1005Gly Pro Leu Phe Arg Lys Leu Phe Phe Ala Leu Trp
Val Ile Ala 1010 1015 1020His Leu Tyr
Pro Phe Leu Lys Gly Leu Leu Gly Arg Gln Asn Arg 1025
1030 1035Thr Pro Thr Ile Val Ile Val Trp Ser Val Leu
Leu Ala Ser Ile 1040 1045 1050Phe Ser
Leu Leu Trp Val Arg Ile Asn Pro Phe Val Asp Ala Asn 1055
1060 1065Pro Asn Ala Asn Asn Phe Asn Gly Lys Gly
Gly Val Phe 1070 1075
1080165380DNAArabidopsis thaliana 16atggaggcca gtgccggctt ggttgctgga
tcctaccgga gaaacgagct cgttcggatc 60cgacatgaat ctgatggcgg ggtctgttca
tcttcccttt ttcccatttt tttgttattg 120tttttcgttc ttacaatttt tgatgtgtag
atctcatcta gatttctctg tttctaaatc 180tcgtctcttt tggatccata attggatcat
tgaaactcag atttcgcttc ctttgactgt 240gtagttagtt agtgtcagtt gatcaagtaa
gtgtctgaaa atggaaactt ttctgctcca 300attcttcaaa ttgttgtgat ctatatcaat
taatgccgca tctgttttct taaaatctct 360tatggaaagt gtcggtggat ttcagttcgt
taactttttt aagctaaaat ctttgactct 420taaagtttag ctttacttat tgagatttag
ctcaactaga tctcgttagt tcccgccatg 480ggatacagac tgtgactcgc cttaattcag
atctgcattg attgttttga tttagatcct 540tgctcatctc tttctgtagt ttctaatact
caatgactaa caatgatgca atgttggtca 600aagtgcagac caaacctttg aagaatatga
atggccagat atgtcagatc tgtggtgatg 660atgttggact cgctgaaact ggagatgtct
ttgtcgcgtg taatgaatgt gccttccctg 720tgtgtcggcc ttgctatgag tacgagagga
aagatggaac tcagtgttgc cctcaatgca 780agactagatt cagacgacac aggggtcagt
tgtctttttc tttttgttgg caattgctat 840atatggattt tctctttttg tttctttgct
gttgtgttga acaatttttt ggaattttcc 900agggagtcct cgtgttgaag gagatgaaga
tgaggatgat gttgatgata tcgagaatga 960gttcaattac gcccagggag ctaacaaggc
gagacaccaa cgccatggcg aagagttttc 1020ttcttcctct agacatgaat ctcaaccaat
tcctcttctc acccatggcc atacggtagg 1080gacctacatt ttccctttag actctagagt
gatttgtatt actcaataaa tccctagagt 1140ggtcatttat tacttactat tcacgttaat
gttatatgtg aacaaatctt aacagaattt 1200ttttctgata gtacatggtc atccaaatta
agaaataata atagatgttg ttagttgtgt 1260ctgttttcaa tagattcatg acctttttct
atacacaggt ttctggagag attcgcacgc 1320ctgatacaca atctgtgcga actacatcag
gtcctttggg tccttctgac aggaatgcta 1380tttcatctcc atatattgat ccacggcaac
ctggtattca tatgtttttc ccttgtgcac 1440gtggtctttg ttaaatgtga ttcctattca
tttttacaac atatatattt tgtgtaccgt 1500aactgatagc tcccgctaaa aattgcagtc
cctgtaagaa tcgtggaccc gtcaaaagac 1560ttgaactctt atgggcttgg taatgttgac
tggaaagaaa gagttgaagg ctggaagctg 1620aagcaggaga aaaatatgtt acagatgact
ggtaaatacc atgaagggaa aggaggagaa 1680attgaaggga ctggttccaa tggcgaagaa
ctccaaatgt aagtggaaat actagaccaa 1740tatctttatt gtccaactca aacagctctt
ggccgtgatg ctaataacca ctcttggttt 1800cttattatgt attgatagac ataattaagt
atctgctttg ttacatttgt ttccttccac 1860tcaattatgg ttctcgtact tacagggctg
atgatacacg tcttcctatg agtcgtgtgg 1920tgcctatccc atcttctcgc ctaacccctt
atcgggttgt gattattctc cggcttatca 1980tcttgtgttt cttcttgcaa tatcgtacaa
ctcaccctgt gaaaaatgca tatcctttgt 2040ggttgacctc ggttatctgt gagatctggt
ttgcattttc ttggcttctt gatcagtttc 2100ccaaatggta ccccattaac agggagactt
atcttgaccg tctcgctata aggttggtct 2160ttaagtttat acatccccta ctctcatctc
tcttttatgt attaacttga tatcttctat 2220cacagttttc gatagttgac tttttccccc
tgtaaattta atttaaattt agacaatggt 2280gcatctgaat tttgattatg atatatctta
agaagattat gattgtaaat cttgaaattt 2340agtagaaaac catctgcaat ctactgacca
tgtgaagttt ccgactagac tatgatagaa 2400gcatgccaag tggagtgttt attaagatag
agcttagcta ttatactgat tttatatgtg 2460ttttgatttt ttggtttctt attgtagata
tgatcgagac ggtgaaccat cacagctcgt 2520tcctgttgat gtgtttgtta gtacagtgga
cccattgaaa gagcctcccc ttgttacagc 2580aaacacagtt ctctcgattc tttctgtgga
ctacccggta gataaagtag cctgttatgt 2640ttcagatgat ggttcagcta tgcttacctt
tgaatccctt tctgaaaccg ctgagtttgc 2700aaagaaatgg gtaccatttt gcaagaaatt
caacattgaa cctagggccc ctgaattcta 2760ttttgcccag aagatagatt acttgaagga
caagatccaa ccgtcttttg ttaaagagcg 2820acgagctatg aaggtcattt gaaaagtcca
cctgcttctc atccatacgg caaagagatt 2880gactgacttt ttctttggtt tgtattgaca
gagagagtat gaagagttta aagtgaggat 2940aaatgctctt gttgccaaag cacagaaaat
ccctgaagaa ggctggacaa tgcaggatgg 3000tactccctgg cctggtaaca acactagaga
tcatcctgga atgatacagg tacagtgtgg 3060caatcccttg attgtgacag agaggataac
gtaaaggaaa catgtttaca tcgttttgtt 3120tcaatttcag gtgttcttag gccatagtgg
gggtctggat accgatggaa atgagctgcc 3180tagactcatc tatgtttctc gtgaaaagcg
gcctggattt caacaccaca aaaaggctgg 3240agctatgaat gcattggttt gttaactttc
agaatcctat tgtgtcctct attttattct 3300cttgttcact gcctaagaaa cgttcttctt
gtgtagccgt tgcttcacat tctttttttt 3360ctaggctatg tgttctctcc taatttagta
tctctttact ttgacagatc cgtgtatctg 3420ctgttcttac caatggagca tatcttttga
acgtggattg tgatcattac tttaataaca 3480gtaaggctat taaagaagct atgtgtttca
tgatggaccc ggctattgga aagaagtgct 3540gctatgtcca gttccctcaa cgttttgacg
gtattgattt gcacgatcga tatgccaaca 3600ggaatatagt ctttttcgat gtgagtatca
cttccccatt gtcttttgtt tctcttttgt 3660tcatattttg gttggattta ctcgtttctg
ctatggcctg acttggatat ttgttctctt 3720gggcagatta acatgaaggg gttggatggt
atccagggtc cagtatatgt gggtactggt 3780tgttgtttta ataggcaggc tctatatggg
tatgatcctg ttttgacgga agaagattta 3840gaaccaaata ttattgtcaa gagctgttgc
gggtcaagga agaaaggtaa aagtagcaag 3900aagtataact acgaaaagag gagaggcatc
aacagaagtg actccaatgc tccacttttc 3960aatatggagg acatcgatga gggttttgaa
ggtttgattg agctgattgt gtaataacat 4020cacttcttta tgtaatgatt tatgtgatgg
tgaaatctta caatccttgt ttatgcaggt 4080tatgatgatg agaggtctat tctaatgtcc
cagaggagtg tagagaagcg ttttggtcag 4140tcgccggtat ttattgcggc aaccttcatg
gaacaaggcg gcattccacc aacaaccaat 4200cccgctactc ttctgaagga ggctattcat
gttataagct gtggttacga agacaagact 4260gaatggggca aagaggtcag ttttcaaatg
cagctacaga atcttcttat gttctctttc 4320ttacctgttt gatgacatct tatttggcac
ttttgttaga ttggttggat ctatggttcc 4380gtgacggaag atattcttac tgggttcaag
atgcatgccc ggggttggat atcgatctac 4440tgcaatcctc cacgccctgc gttcaaggga
tctgcaccaa tcaatctttc tgatcgtttg 4500aaccaagttc ttcgatgggc tttgggatct
atcgagattc ttcttagcag acattgtcct 4560atctggtatg gttaccatgg aaggttgaga
cttttggaga ggatcgctta tatcaacacc 4620atcgtctatc ctattacatc catccctctt
attgcgtatt gtattcttcc cgctttttgt 4680ctcatcaccg acagattcat catacccgag
gtttgtaaaa ctgaccacac tgctatttac 4740tatttgaatc ccattttgtg aatgcatttt
tttgtcatca tcattgttgc agataagcaa 4800ctacgcgagt atttggttca ttctactctt
catctcaatt gctgtgactg gaatcctgga 4860gctgagatgg agcggtgtga gcattgagga
ttggtggagg aacgagcagt tctgggtcat 4920tggtggcaca tccgcccatc tttttgctgt
cttccaaggt ctacttaagg ttcttgctgg 4980tatcgacacc aacttcaccg ttacatctaa
agccacagac gaagatgggg attttgcaga 5040actctacatc ttcaaatgga cagctcttct
cattccacca accaccgtcc tacttgtgaa 5100cctcataggc attgtggctg gtgtctctta
tgctgtaaac agtggctacc agtcgtgggg 5160tctgcttttc gggaagctct tcttcgcctt
atgggttatt gcccatctct accctttctt 5220gaaaggtctg ttgggaagac aaaaccgaac
accaaccatc gtcattgtct ggtctgttct 5280tctcgcctcc atcttctcgt tgctttgggt
caggatcaat ccctttgtgg acgccaatcc 5340caatgccaac aacttcaatg gcaaaggagg
tgtcttttag 5380171081PRTArabidopsis thaliana
17Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser Tyr Arg Arg Asn Glu1
5 10 15Leu Val Arg Ile Arg His
Glu Ser Asp Gly Gly Thr Lys Pro Leu Lys 20 25
30Asn Met Asn Gly Gln Ile Cys Gln Ile Cys Gly Asp Asp
Val Gly Leu 35 40 45Ala Glu Thr
Gly Asp Val Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 50
55 60Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Lys Asp
Gly Thr Gln Cys65 70 75
80Cys Pro Gln Cys Lys Thr Arg Phe Arg Arg His Arg Gly Ser Pro Arg
85 90 95Val Glu Gly Asp Glu Asp
Glu Asp Asp Val Asp Asp Ile Glu Asn Glu 100
105 110Phe Asn Tyr Ala Gln Gly Ala Asn Lys Ala Arg His
Gln Arg His Gly 115 120 125Glu Glu
Phe Ser Ser Ser Ser Arg His Glu Ser Gln Pro Ile Pro Leu 130
135 140Leu Thr His Gly His Thr Val Ser Gly Glu Ile
Arg Thr Pro Asp Thr145 150 155
160Gln Ser Val Arg Thr Thr Ser Gly Pro Leu Gly Pro Ser Asp Arg Asn
165 170 175Ala Ile Ser Ser
Pro Tyr Ile Asp Pro Arg Gln Pro Val Pro Val Arg 180
185 190Ile Val Asp Pro Ser Lys Asp Leu Asn Ser Tyr
Gly Leu Gly Asn Val 195 200 205Asp
Trp Lys Glu Arg Val Glu Gly Trp Lys Leu Lys Gln Glu Lys Asn 210
215 220Met Leu Gln Met Thr Gly Lys Tyr His Glu
Gly Lys Gly Gly Glu Ile225 230 235
240Glu Gly Thr Gly Ser Asn Gly Glu Glu Leu Gln Met Ala Asp Asp
Thr 245 250 255Arg Leu Pro
Met Ser Arg Val Val Pro Ile Pro Ser Ser Arg Leu Thr 260
265 270Pro Tyr Arg Val Val Ile Ile Leu Arg Leu
Ile Ile Leu Cys Phe Phe 275 280
285Leu Gln Tyr Arg Thr Thr His Pro Val Lys Asn Ala Tyr Pro Leu Trp 290
295 300Leu Thr Ser Val Ile Cys Glu Ile
Trp Phe Ala Phe Ser Trp Leu Leu305 310
315 320Asp Gln Phe Pro Lys Trp Tyr Pro Ile Asn Arg Glu
Thr Tyr Leu Asp 325 330
335Arg Leu Ala Ile Arg Tyr Asp Arg Asp Gly Glu Pro Ser Gln Leu Val
340 345 350Pro Val Asp Val Phe Val
Ser Thr Val Asp Pro Leu Lys Glu Pro Pro 355 360
365Leu Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ser Val Asp
Tyr Pro 370 375 380Val Asp Lys Val Ala
Cys Tyr Val Ser Asp Asp Gly Ser Ala Met Leu385 390
395 400Thr Phe Glu Ser Leu Ser Glu Thr Ala Glu
Phe Ala Lys Lys Trp Val 405 410
415Pro Phe Cys Lys Lys Phe Asn Ile Glu Pro Arg Ala Pro Glu Phe Tyr
420 425 430Phe Ala Gln Lys Ile
Asp Tyr Leu Lys Asp Lys Ile Gln Pro Ser Phe 435
440 445Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu
Glu Phe Lys Val 450 455 460Arg Ile Asn
Ala Leu Val Ala Lys Ala Gln Lys Ile Pro Glu Glu Gly465
470 475 480Trp Thr Met Gln Asp Gly Thr
Pro Trp Pro Gly Asn Asn Thr Arg Asp 485
490 495His Pro Gly Met Ile Gln Val Phe Leu Gly His Ser
Gly Gly Leu Asp 500 505 510Thr
Asp Gly Asn Glu Leu Pro Arg Leu Ile Tyr Val Ser Arg Glu Lys 515
520 525Arg Pro Gly Phe Gln His His Lys Lys
Ala Gly Ala Met Asn Ala Leu 530 535
540Ile Arg Val Ser Ala Val Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val545
550 555 560Asp Cys Asp His
Tyr Phe Asn Asn Ser Lys Ala Ile Lys Glu Ala Met 565
570 575Cys Phe Met Met Asp Pro Ala Ile Gly Lys
Lys Cys Cys Tyr Val Gln 580 585
590Phe Pro Gln Arg Phe Asp Gly Ile Asp Leu His Asp Arg Tyr Ala Asn
595 600 605Arg Asn Ile Val Phe Phe Asp
Ile Asn Met Lys Gly Leu Asp Gly Ile 610 615
620Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Cys Phe Asn Arg Gln
Ala625 630 635 640Leu Tyr
Gly Tyr Asp Pro Val Leu Thr Glu Glu Asp Leu Glu Pro Asn
645 650 655Ile Ile Val Lys Ser Cys Cys
Gly Ser Arg Lys Lys Gly Lys Ser Ser 660 665
670Lys Lys Tyr Asn Tyr Glu Lys Arg Arg Gly Ile Asn Arg Ser
Asp Ser 675 680 685Asn Ala Pro Leu
Phe Asn Met Glu Asp Ile Asp Glu Gly Phe Glu Gly 690
695 700Tyr Asp Asp Glu Arg Ser Ile Leu Met Ser Gln Arg
Ser Val Glu Lys705 710 715
720Arg Phe Gly Gln Ser Pro Val Phe Ile Ala Ala Thr Phe Met Glu Gln
725 730 735Gly Gly Ile Pro Pro
Thr Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala 740
745 750Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr
Glu Trp Gly Lys 755 760 765Glu Ile
Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr Gly 770
775 780Phe Lys Met His Ala Arg Gly Trp Ile Ser Ile
Tyr Cys Asn Pro Pro785 790 795
800Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu
805 810 815Asn Gln Val Leu
Arg Trp Ala Leu Gly Ser Ile Glu Ile Leu Leu Ser 820
825 830Arg His Cys Pro Ile Trp Tyr Gly Tyr His Gly
Arg Leu Arg Leu Leu 835 840 845Glu
Arg Ile Ala Tyr Ile Asn Thr Ile Val Tyr Pro Ile Thr Ser Ile 850
855 860Pro Leu Ile Ala Tyr Cys Ile Leu Pro Ala
Phe Cys Leu Ile Thr Asp865 870 875
880Arg Phe Ile Ile Pro Glu Ile Ser Asn Tyr Ala Ser Ile Trp Phe
Ile 885 890 895Leu Leu Phe
Ile Ser Ile Ala Val Thr Gly Ile Leu Glu Leu Arg Trp 900
905 910Ser Gly Val Ser Ile Glu Asp Trp Trp Arg
Asn Glu Gln Phe Trp Val 915 920
925Ile Gly Gly Thr Ser Ala His Leu Phe Ala Val Phe Gln Gly Leu Leu 930
935 940Lys Val Leu Ala Gly Ile Asp Thr
Asn Phe Thr Val Thr Ser Lys Ala945 950
955 960Thr Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr Ile
Phe Lys Trp Thr 965 970
975Ala Leu Leu Ile Pro Pro Thr Thr Val Leu Leu Val Asn Leu Ile Gly
980 985 990Ile Val Ala Gly Val Ser
Tyr Ala Val Asn Ser Gly Tyr Gln Ser Trp 995 1000
1005Gly Leu Leu Phe Gly Lys Leu Phe Phe Ala Leu Trp
Val Ile Ala 1010 1015 1020His Leu Tyr
Pro Phe Leu Lys Gly Leu Leu Gly Arg Gln Asn Arg 1025
1030 1035Thr Pro Thr Ile Val Ile Val Trp Ser Val Leu
Leu Ala Ser Ile 1040 1045 1050Phe Ser
Leu Leu Trp Val Arg Ile Asn Pro Phe Val Asp Ala Asn 1055
1060 1065Pro Asn Ala Asn Asn Phe Asn Gly Lys Gly
Gly Val Phe 1070 1075
1080185380DNAArabidopsis thaliana 18atggaggcca gtgccggctt ggttgctgga
tcctaccgga gaaacgagct cgttcggatc 60cgacatgaat ctgatggcgg ggtctgttca
tcttcccttt ttcccatttt tttgttattg 120tttttcgttc ttacaatttt tgatgtgtag
atctcatcta gatttctctg tttctaaatc 180tcgtctcttt tggatccata attggatcat
tgaaactcag atttcgcttc ctttgactgt 240gtagttagtt agtgtcagtt gatcaagtaa
gtgtctgaaa atggaaactt ttctgctcca 300attcttcaaa ttgttgtgat ctatatcaat
taatgccgca tctgttttct taaaatctct 360tatggaaagt gtcggtggat ttcagttcgt
taactttttt aagctaaaat ctttgactct 420taaagtttag ctttacttat tgagatttag
ctcaactaga tctcgttagt tcccgccatg 480ggatacagac tgtgactcgc cttaattcag
atctgcattg attgttttga tttagatcct 540tgctcatctc tttctgtagt ttctaatact
caatgactaa caatgatgca atgttggtca 600aagtgcagac caaacctttg aagaatatga
atggccagat atgtcagatc tgtggtgatg 660atgttggact cgctgaaact ggagatgtct
ttgtcgcgtg taatgaatgt gccttccctg 720tgtgtcggcc ttgctatgag tacgagagga
aagatggaac tcagtgttgc cctcaatgca 780agactagatt cagacgacac aggggtcagt
tgtctttttc tttttgttgg caattgctat 840atatggattt tctctttttg tttctttgct
gttgtgttga acaatttttt ggaattttcc 900agggagtcct cgtgttgaag gagatgaaga
tgaggatgat gttgatgata tcgagaatga 960gttcaattac gcccagggag ctaacaaggc
gagacaccaa cgccatggcg aagagttttc 1020ttcttcctct agacatgaat ctcaaccaat
tcctcttctc acccatggcc atacggtagg 1080gacctacatt ttccctttag actctagagt
gatttgtatt actcaataaa tccctagagt 1140ggtcatttat tacttactat tcacgttaat
gttatatgtg aacaaatctt aacagaattt 1200ttttctgata gtacatggtc atccaaatta
agaaataata atagatgttg ttagttgtgt 1260ctgttttcaa tagattcatg acctttttct
atacacaggt ttctggagag attcgcacgc 1320ctgatacaca atctgtgcga actacatcag
gtcctttggg tccttctgac aggaatgcta 1380tttcatctcc atatattgat ccacggcaac
ctggtattca tatgtttttc ccttgtgcac 1440gtggtctttg ttaaatgtga ttcctattca
tttttacaac atatatattt tgtgtaccgt 1500aactgatagc tcccgctaaa aattgcagtc
cctgtaagaa tcgtggaccc gtcaaaagac 1560ttgaactctt atgggcttgg taatgttgac
tggaaagaaa gagttgaagg ctggaagctg 1620aagcaggaga aaaatatgtt acagatgact
ggtaaatacc atgaagggaa aggaggagaa 1680attgaaggga ctggttccaa tggcgaagaa
ctccaaatgt aagtggaaat actagaccaa 1740tatctttatt gtccaactca aacagctctt
ggccgtgatg ctaataacca ctcttggttt 1800cttattatgt attgatagac ataattaagt
atctgctttg ttacatttgt ttccttccac 1860tcaattatgg ttctcgtact tacagggctg
atgatacacg tcttcctatg agtcgtgtgg 1920tgcctatccc atcttctcgc ctaacccctt
atcgggttgt gattattctc cggcttatca 1980tcttgtgttt cttcttgcaa tatcgtacaa
ctcaccctgt gaaaaatgca tatcctttgt 2040ggttgacctc ggttatctgt gagatctggt
ttgcattttc ttggcttctt gatcagtttc 2100ccaaatggta ccccattaac agggagactt
atcttgaccg tctcgctata aggttggtct 2160ttaagtttat acatccccta ctctcatctc
tcttttatgt attaacttga tatcttctat 2220cacagttttc gatagttgac tttttccccc
tgtaaattta atttaaattt agacaatggt 2280gcatctgaat tttgattatg atatatctta
agaagattat gattgtaaat cttgaaattt 2340agtagaaaac catctgcaat ctactgacca
tgtgaagttt ccgactagac tatgatagaa 2400gcatgccaag tggagtgttt attaagatag
agcttagcta ttatactgat tttatatgtg 2460ttttgatttt ttggtttctt attgtagata
tgatcgagac ggtgaaccat cacagctcgt 2520tcctgttgat gtgtttgtta gtacagtgga
cccattgaaa gagcctcccc ttgttacagc 2580aaacacagtt ctctcgattc tttctgtgga
ctacccggta gataaagtag cctgttatgt 2640ttcagatgat ggttcagcta tgcttacctt
tgaatccctt tctgaaaccg ctgagtttgc 2700aaagaaatgg gtaccatttt gcaagaaatt
caacattgaa cctagggccc ctgaattcta 2760ttttgcccag aagatagatt acttgaagga
caagatccaa ccgtcttttg ttaaagagcg 2820acgagctatg aaggtcattt gaaaagtcca
cctgcttctc atccatacgg caaagagatt 2880gactgacttt ttctttggtt tgtattgaca
gagagagtat gaagagttta aagtgaggat 2940aaatgctctt gttgccaaag cacagaaaat
ccctgaagaa ggctggacaa tgcaggatgg 3000tactccctgg cctggtaaca acactagaga
tcatcctgga atgatacagg tacagtgtgg 3060caatcccttg attgtgacag agaggataac
gtaaaggaaa catgtttaca tcgttttgtt 3120tcaatttcag gtgttcttag gccatagtgg
gggtctggat accgatggaa atgagctgcc 3180tagactcatc tatgtttctc gtgaaaagcg
gcctggattt caacaccaca aaaaggctgg 3240agctatgaat gcattggttt gttaactttc
agaatcctat tgtgtcctct attttattct 3300cttgttcact gcctaagaaa cgttcttctt
gtgtagccgt tgcttcacat tctttttttt 3360ctaggctatg tgttctctcc taatttagta
tctctttact ttgacagatc cgtgtatctg 3420ctgttcttac caatggagca tatcttttga
acgtggattg tgatcattac tttaataaca 3480gtaaggctat taaagaagct atgtgtttca
tgatggaccc ggctattgga aagaagtgct 3540gctatgtcca gttccctcaa cgttttgacg
gtattgattt gcacgatcga tatgccaaca 3600ggaatatagt ctttttcgat gtgagtatca
cttccccatt gtcttttgtt tctcttttgt 3660tcatattttg gttggattta ctcgtttctg
ctatggcctg acttggatat ttgttctctt 3720gggcagatta acatgaaggg gttggatggt
atccagggtc cagtatatgt gggtactggt 3780tgttgtttta ataggcaggc tctatatggg
tatgatcctg ttttgacgga agaagattta 3840gaaccaaata ttattgtcaa gagctgttgc
gggtcaagga agaaaggtaa aagtagcaag 3900aagtataact acgaaaagag gagaggcatc
aacagaagtg actccaatgc tccacttttc 3960aatatggagg acatcgatga gggttttgaa
ggtttgattg agctgattgt gtaataacat 4020cacttcttta tgtaatgatt tatgtgatgg
tgaaatctta caatccttgt ttatgcaggt 4080tatgatgatg agaggtctat tctaatgtcc
cagaggagtg tagagaagcg ttttggtcag 4140tcgccggtat ttattgcggc aaccttcatg
gaacaaggcg gcattccacc aacaaccaat 4200cccgctactc ttctgaagga ggctattcat
gttataagct gtggttacga agacaagact 4260gaatggggca aagaggtcag ttttcaaatg
cagctacaga atcttcttat gttctctttc 4320ttacctgttt gatgacatct tatttggcac
ttttgttaga ttggttggat ctatggttcc 4380gtgacggaag atattcttac tgggttcaag
atgcatgccc ggggttggat atcgatctac 4440tgcaatcctc cacgccctgc gttcaaggga
tctgcaccaa tcaatctttc tgatcgtttg 4500aaccaagttc ttcgatgggc tttgggatct
atcgagattc ttcttagcag acattgtcct 4560atctggtatg gttaccatgg aaggttgaga
cttttggaga ggatcgctta tatcaacacc 4620atcgtctatc ctattacatc catccctctt
attgcgtatt gtattcttcc cgctttttgt 4680ctcatcaccg acagattcat catacccgag
gtttgtaaaa ctgaccacac tgctatttac 4740tatttgaatc ccattttgtg aatgcatttt
tttgtcatca tcattgttgc agataagcaa 4800ctacgcgagt atttggttca ttctactctt
catctcaatt gctgtgactg gaatcctgga 4860gctgagatgg agcggtgtga gcattgagga
ttggtggagg aacgagcagt tctgggtcat 4920tggtggcaca tccgcccatc tttttgctgt
cttccaaggt ctacttaagg ttcttgctgg 4980tatcgacacc aacttcaccg ttacatctaa
agccacagac gaagatgggg attttgcaga 5040actctacatc ttcaaatgga cagctcttct
cattccacca accaccgtcc tacttgtgaa 5100cctcataggc attgtggctg gtgtctctta
tgctgtaaac agtggctacc agtcgtggga 5160tccgcttttc gggaagctct tcttcgcctt
atgggttatt gcccatctct accctttctt 5220gaaaggtctg ttgggaagac aaaaccgaac
accaaccatc gtcattgtct ggtctgttct 5280tctcgcctcc atcttctcgt tgctttgggt
caggatcaat ccctttgtgg acgccaatcc 5340caatgccaac aacttcaatg gcaaaggagg
tgtcttttag 5380191081PRTArabidopsis thaliana
19Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser Tyr Arg Arg Asn Glu1
5 10 15Leu Val Arg Ile Arg His
Glu Ser Asp Gly Gly Thr Lys Pro Leu Lys 20 25
30Asn Met Asn Gly Gln Ile Cys Gln Ile Cys Gly Asp Asp
Val Gly Leu 35 40 45Ala Glu Thr
Gly Asp Val Phe Val Ala Cys Asn Glu Cys Ala Phe Pro 50
55 60Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Lys Asp
Gly Thr Gln Cys65 70 75
80Cys Pro Gln Cys Lys Thr Arg Phe Arg Arg His Arg Gly Ser Pro Arg
85 90 95Val Glu Gly Asp Glu Asp
Glu Asp Asp Val Asp Asp Ile Glu Asn Glu 100
105 110Phe Asn Tyr Ala Gln Gly Ala Asn Lys Ala Arg His
Gln Arg His Gly 115 120 125Glu Glu
Phe Ser Ser Ser Ser Arg His Glu Ser Gln Pro Ile Pro Leu 130
135 140Leu Thr His Gly His Thr Val Ser Gly Glu Ile
Arg Thr Pro Asp Thr145 150 155
160Gln Ser Val Arg Thr Thr Ser Gly Pro Leu Gly Pro Ser Asp Arg Asn
165 170 175Ala Ile Ser Ser
Pro Tyr Ile Asp Pro Arg Gln Pro Val Pro Val Arg 180
185 190Ile Val Asp Pro Ser Lys Asp Leu Asn Ser Tyr
Gly Leu Gly Asn Val 195 200 205Asp
Trp Lys Glu Arg Val Glu Gly Trp Lys Leu Lys Gln Glu Lys Asn 210
215 220Met Leu Gln Met Thr Gly Lys Tyr His Glu
Gly Lys Gly Gly Glu Ile225 230 235
240Glu Gly Thr Gly Ser Asn Gly Glu Glu Leu Gln Met Ala Asp Asp
Thr 245 250 255Arg Leu Pro
Met Ser Arg Val Val Pro Ile Pro Ser Ser Arg Leu Thr 260
265 270Pro Tyr Arg Val Val Ile Ile Leu Arg Leu
Ile Ile Leu Cys Phe Phe 275 280
285Leu Gln Tyr Arg Thr Thr His Pro Val Lys Asn Ala Tyr Pro Leu Trp 290
295 300Leu Thr Ser Val Ile Cys Glu Ile
Trp Phe Ala Phe Ser Trp Leu Leu305 310
315 320Asp Gln Phe Pro Lys Trp Tyr Pro Ile Asn Arg Glu
Thr Tyr Leu Asp 325 330
335Arg Leu Ala Ile Arg Tyr Asp Arg Asp Gly Glu Pro Ser Gln Leu Val
340 345 350Pro Val Asp Val Phe Val
Ser Thr Val Asp Pro Leu Lys Glu Pro Pro 355 360
365Leu Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ser Val Asp
Tyr Pro 370 375 380Val Asp Lys Val Ala
Cys Tyr Val Ser Asp Asp Gly Ser Ala Met Leu385 390
395 400Thr Phe Glu Ser Leu Ser Glu Thr Ala Glu
Phe Ala Lys Lys Trp Val 405 410
415Pro Phe Cys Lys Lys Phe Asn Ile Glu Pro Arg Ala Pro Glu Phe Tyr
420 425 430Phe Ala Gln Lys Ile
Asp Tyr Leu Lys Asp Lys Ile Gln Pro Ser Phe 435
440 445Val Lys Glu Arg Arg Ala Met Lys Arg Glu Tyr Glu
Glu Phe Lys Val 450 455 460Arg Ile Asn
Ala Leu Val Ala Lys Ala Gln Lys Ile Pro Glu Glu Gly465
470 475 480Trp Thr Met Gln Asp Gly Thr
Pro Trp Pro Gly Asn Asn Thr Arg Asp 485
490 495His Pro Gly Met Ile Gln Val Phe Leu Gly His Ser
Gly Gly Leu Asp 500 505 510Thr
Asp Gly Asn Glu Leu Pro Arg Leu Ile Tyr Val Ser Arg Glu Lys 515
520 525Arg Pro Gly Phe Gln His His Lys Lys
Ala Gly Ala Met Asn Ala Leu 530 535
540Ile Arg Val Ser Ala Val Leu Thr Asn Gly Ala Tyr Leu Leu Asn Val545
550 555 560Asp Cys Asp His
Tyr Phe Asn Asn Ser Lys Ala Ile Lys Glu Ala Met 565
570 575Cys Phe Met Met Asp Pro Ala Ile Gly Lys
Lys Cys Cys Tyr Val Gln 580 585
590Phe Pro Gln Arg Phe Asp Gly Ile Asp Leu His Asp Arg Tyr Ala Asn
595 600 605Arg Asn Ile Val Phe Phe Asp
Ile Asn Met Lys Gly Leu Asp Gly Ile 610 615
620Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Cys Phe Asn Arg Gln
Ala625 630 635 640Leu Tyr
Gly Tyr Asp Pro Val Leu Thr Glu Glu Asp Leu Glu Pro Asn
645 650 655Ile Ile Val Lys Ser Cys Cys
Gly Ser Arg Lys Lys Gly Lys Ser Ser 660 665
670Lys Lys Tyr Asn Tyr Glu Lys Arg Arg Gly Ile Asn Arg Ser
Asp Ser 675 680 685Asn Ala Pro Leu
Phe Asn Met Glu Asp Ile Asp Glu Gly Phe Glu Gly 690
695 700Tyr Asp Asp Glu Arg Ser Ile Leu Met Ser Gln Arg
Ser Val Glu Lys705 710 715
720Arg Phe Gly Gln Ser Pro Val Phe Ile Ala Ala Thr Phe Met Glu Gln
725 730 735Gly Gly Ile Pro Pro
Thr Thr Asn Pro Ala Thr Leu Leu Lys Glu Ala 740
745 750Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Thr
Glu Trp Gly Lys 755 760 765Glu Ile
Gly Trp Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr Gly 770
775 780Phe Lys Met His Ala Arg Gly Trp Ile Ser Ile
Tyr Cys Asn Pro Pro785 790 795
800Arg Pro Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg Leu
805 810 815Asn Gln Val Leu
Arg Trp Ala Leu Gly Ser Ile Glu Ile Leu Leu Ser 820
825 830Arg His Cys Pro Ile Trp Tyr Gly Tyr His Gly
Arg Leu Arg Leu Leu 835 840 845Glu
Arg Ile Ala Tyr Ile Asn Thr Ile Val Tyr Pro Ile Thr Ser Ile 850
855 860Pro Leu Ile Ala Tyr Cys Ile Leu Pro Ala
Phe Cys Leu Ile Thr Asp865 870 875
880Arg Phe Ile Ile Pro Glu Ile Ser Asn Tyr Ala Ser Ile Trp Phe
Ile 885 890 895Leu Leu Phe
Ile Ser Ile Ala Val Thr Gly Ile Leu Glu Leu Arg Trp 900
905 910Ser Gly Val Ser Ile Glu Asp Trp Trp Arg
Asn Glu Gln Phe Trp Val 915 920
925Ile Gly Gly Thr Ser Ala His Leu Phe Ala Val Phe Gln Gly Leu Leu 930
935 940Lys Val Leu Ala Gly Ile Asp Thr
Asn Phe Thr Val Thr Ser Lys Ala945 950
955 960Thr Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr Ile
Phe Lys Trp Thr 965 970
975Ala Leu Leu Ile Pro Pro Thr Thr Val Leu Leu Val Asn Leu Ile Gly
980 985 990Ile Val Ala Gly Val Ser
Tyr Ala Val Asn Ser Gly Tyr Gln Ser Trp 995 1000
1005Asp Pro Leu Phe Gly Lys Leu Phe Phe Ala Leu Trp
Val Ile Ala 1010 1015 1020His Leu Tyr
Pro Phe Leu Lys Gly Leu Leu Gly Arg Gln Asn Arg 1025
1030 1035Thr Pro Thr Ile Val Ile Val Trp Ser Val Leu
Leu Ala Ser Ile 1040 1045 1050Phe Ser
Leu Leu Trp Val Arg Ile Asn Pro Phe Val Asp Ala Asn 1055
1060 1065Pro Asn Ala Asn Asn Phe Asn Gly Lys Gly
Gly Val Phe 1070 1075
1080204690DNAArabidopsis thaliana 20atggaatccg aaggagaaac cgcggtatgc
ttttttgact cttgcttcat cattatactt 60acctttatcg aaatcaggaa ttatatgtac
tgaaattgat tgatttgggt gttgaattgt 120gtattggaga gatctgattt caaattttct
gttgaggttt ctaattttgg cttcattgat 180tcgacttgat ttgtagggaa agccgatgaa
gaacattgtt ccgcagactt gccagatctg 240tagtgacaat gttggcaaga ctgttgatgg
agatcgtttt gtggcttgtg atatttgttc 300attcccagtt tgtcggcctt gctacgagta
tgagaggaaa gatgggaatc aatcttgtcc 360tcagtgcaaa accagataca agaggctcaa
aggttctctt ttgatccttc tgaagtatac 420tgtcttcatt gttcatcgat agtttatcag
tatgttttga attttggatc agattggtat 480ttatagcaat ttgctaattt ctgattctag
gtagtcctgc tattcctggt gataaagacg 540aggatggctt agctgatgaa ggtactgttg
agttcaacta ccctcagaag gagaaaattt 600cagagcggat gcttggttgg catcttactc
gtgggaaggg agaggaaatg ggggaacccc 660agtatgataa agaggtctct cacaatcatc
ttcctcgtct cacgagcaga caagatgtaa 720ggcattgctg ttcattcttc cctcttaagc
attcgcatcc tcacgcaatt tagttttgga 780atctgatttt gtcatttgct tatttacaga
cttcaggaga gttttctgct gcctcacctg 840aacgcctctc tgtatcttct actatcgctg
ggggaaagcg ccttccctat tcatcagatg 900tcaatcaatc acgtaaatat cctttatttc
taactctctc gccaacacat atatttgtac 960ctaggcttct cttttatgtc aaaactctaa
acaataaaat ctgttgttgt cattcacgct 1020gcagcaaata gaaggattgt ggatcctgtt
ggactcggga atgtagcttg gaaggagaga 1080gttgatggct ggaaaatgaa gcaagagaag
aatactggtc ctgtcagcac gcaggctgct 1140tctgaaagag gtggagtaga tattgatgcc
agcacagata tcctagcaga tgaggctctg 1200ctgtgagttc ttgttttgta atcttgtttg
ttctgtcgtg gtgtaccgag cgtttttcct 1260attaagcaat gtcctgatac tcattttcca
attctttatt tattgtacag gaatgacgaa 1320gcgaggcagc ctctgtcaag gaaagtttca
attccttcat cacggatcaa tccttacaga 1380atggttatta tgctgcggct tgttatcctt
tgtctcttct tgcattaccg tataacaaac 1440ccagtgccaa atgcctttgc tctatggctg
gtctctgtga tatgtgagat ctggtttgcc 1500ttatcctgga ttttggatca gtttcccaag
tggtttcctg tgaaccgtga aacctacctc 1560gacaggcttg ctttaaggta agttctattt
ccccattctt ctgaagcaat tactcaaagg 1620attgtttgcc tatactgttt cccattttaa
tttgatcatg gtcaattttt gggacagata 1680tgatcgtgaa ggtgagccat cacagttagc
tgctgttgac attttcgtga gtactgttga 1740ccccttgaag gagccacccc ttgtgacagc
caacacagtg ctctctattc tggctgttga 1800ctacccagtt gacaaggtgt cctgttatgt
ttctgatgat ggtgctgcta tgttatcatt 1860tgaatcactt gcagaaacat cagagtttgc
tcgtaaatgg gtaccatttt gcaagaaata 1920tagcatagag cctcgtgcac cagaatggta
ctttgctgcg aaaatagatt acttgaagga 1980taaagttcag acatcatttg tcaaagatcg
tagagctatg aaggtaagtt tgtagtttta 2040gtcatctagt caccctcact ttgattttag
tgtatgctat attgaccttt tattttcttt 2100cagagggaat atgaggaatt taaaatccga
atcaatgcac ttgtttccaa agccctaaaa 2160tgtcctgaag aagggtgggt tatgcaagat
ggcacaccgt ggcctggaaa taatacaagg 2220gaccatccag gaatgatcca ggtaagaaat
tggttttaac tatggaatcg agaatgctct 2280ctctttctct ctagaagttc attattgaag
taccatttgc tgaatgcagg tcttcttagg 2340gcaaaatggt ggacttgatg cagagggcaa
tgagctcccg cgtttggtat atgtttctcg 2400agaaaagcga ccaggattcc agcaccacaa
aaaggctggt gctatgaatg cactggtaag 2460tttctgatct tggatttttg acttcttcat
tctgaccaat ttgttagtct aatctgggta 2520cttttcaaat gaataggtga gagtttcagc
agttcttacc aatggacctt tcatcttgaa 2580tcttgattgt gatcattaca taaataacag
caaagcctta agagaagcaa tgtgcttcct 2640gatggaccca aacctcggga agcaagtttg
ttatgttcag ttcccacaaa gatttgatgg 2700tatcgataag aacgatagat atgctaatcg
taataccgtg ttctttgatg taagtcacac 2760ttacctatac ttgcgtctaa ttttcttgtt
ctttcaaatt gcttttagac acgaatatac 2820attaaactca cagtttcttg agtttgtcgt
aatttttcca tgatatgttt tccagattaa 2880cttgagaggt ttagatggga ttcaaggacc
tgtatatgtc ggaactggat gtgttttcaa 2940cagaacagca ttatacggtt atgaacctcc
aataaaagta aaacacaaga agccaagtct 3000tttatctaag ctctgtggtg gatcaagaaa
gaagaattcc aaagctaaga aagagtcgga 3060caaaaagaaa tcaggcaggc atactgactc
aactgttcct gtattcaacc tcgatgacat 3120agaagaggga gttgaaggta caactgtttt
tatttcttct ttggtttccg ttatacccat 3180atgttgctgt ttgaaatatt gatccagggg
aggggattat ttatagttga cagttgtcta 3240aatagtttcc atactaggta tctcatcatg
tcttaactat ttggcatttg tgaaacttag 3300gtgctggttt tgatgatgaa aaggcgctct
taatgtcgca aatgagcctg gagaagcgat 3360ttggacagtc tgctgttttt gttgcttcta
ccctaatgga aaatggtggt gttcctcctt 3420cagcaactcc agaaaacctt ctcaaagagg
ctatccatgt cattagttgt ggttatgagg 3480ataagtcaga ttggggaatg gaggtataat
ctcatttgaa ctcctacatg aatctgcatt 3540gttctgacat atccactttg gcattcactt
tgtttatatt ttccgctgtc tttcttcaga 3600ttggatggat ctatggttct gtgacagaag
atattctgac tgggttcaaa atgcatgccc 3660gtggatggcg atccatttac tgcatgccta
agcttccagc tttcaagggt tctgctccta 3720tcaatctttc agatcgtctg aaccaagtgc
tgaggtgggc tttaggttca gttgagattc 3780tcttcagtcg gcattgtcct atatggtatg
gttacaatgg gaggctaaaa tttcttgaga 3840ggtttgcgta tgtgaacacc accatctacc
ctatcacctc cattcctctt ctcatgtatt 3900gtacattgcc agccgtttgt ctcttcacca
accagtttat tattcctcag gtttgacacc 3960tctctctgtc tatctatctc tatctctatc
tctatctcta gaacaaacct taattacgtt 4020ctgtttaact gaaaccatgt tgtgtttgtc
atctatttac ggttccaaat cctgatcagc 4080tggttctatt gttcctcttt tgcagattag
taacattgca agtatatggt ttctgtctct 4140ctttctctcc attttcgcca cgggtatact
agaaatgagg tggagtggcg taggcataga 4200cgaatggtgg agaaacgagc agttttgggt
cattggtgga gtatccgctc atttattcgc 4260tgtgtttcaa ggtatcctca aagtccttgc
cggtattgac acaaacttca cagttacctc 4320aaaagcttca gatgaagacg gagactttgc
tgagctctac ttgttcaaat ggacaacact 4380tctgattccg ccaacgacgc tgctcattgt
aaacttagtg ggagttgttg caggagtctc 4440ttatgctatc aacagtggat accaatcatg
gggaccactc tttggtaagt tgttctttgc 4500cttctgggtg attgttcact tgtacccttt
cctcaagggt ttgatgggtc gacagaaccg 4560gactcctacc attgttgtgg tctggtctgt
tctcttggct tctatcttct tgttgttgtg 4620ggttaggatt gatcccttca ctagccgagt
cactggcccg gacattctgg aatgtggaat 4680caactgttga
4690211065PRTArabidopsis thaliana 21Met
Glu Ser Glu Gly Glu Thr Ala Gly Lys Pro Met Lys Asn Ile Val1
5 10 15Pro Gln Thr Cys Gln Ile Cys
Ser Asp Asn Val Gly Lys Thr Val Asp 20 25
30Gly Asp Arg Phe Val Ala Cys Asp Ile Cys Ser Phe Pro Val
Cys Arg 35 40 45Pro Cys Tyr Glu
Tyr Glu Arg Lys Asp Gly Asn Gln Ser Cys Pro Gln 50 55
60Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Ala
Ile Pro Gly65 70 75
80Asp Lys Asp Glu Asp Gly Leu Ala Asp Glu Gly Thr Val Glu Phe Asn
85 90 95Tyr Pro Gln Lys Glu Lys
Ile Ser Glu Arg Met Leu Gly Trp His Leu 100
105 110Thr Arg Gly Lys Gly Glu Glu Met Gly Glu Pro Gln
Tyr Asp Lys Glu 115 120 125Val Ser
His Asn His Leu Pro Arg Leu Thr Ser Arg Gln Asp Thr Ser 130
135 140Gly Glu Phe Ser Ala Ala Ser Pro Glu Arg Leu
Ser Val Ser Ser Thr145 150 155
160Ile Ala Gly Gly Lys Arg Leu Pro Tyr Ser Ser Asp Val Asn Gln Ser
165 170 175Pro Asn Arg Arg
Ile Val Asp Pro Val Gly Leu Gly Asn Val Ala Trp 180
185 190Lys Glu Arg Val Asp Gly Trp Lys Met Lys Gln
Glu Lys Asn Thr Gly 195 200 205Pro
Val Ser Thr Gln Ala Ala Ser Glu Arg Gly Gly Val Asp Ile Asp 210
215 220Ala Ser Thr Asp Ile Leu Ala Asp Glu Ala
Leu Leu Asn Asp Glu Ala225 230 235
240Arg Gln Pro Leu Ser Arg Lys Val Ser Ile Pro Ser Ser Arg Ile
Asn 245 250 255Pro Tyr Arg
Met Val Ile Met Leu Arg Leu Val Ile Leu Cys Leu Phe 260
265 270Leu His Tyr Arg Ile Thr Asn Pro Val Pro
Asn Ala Phe Ala Leu Trp 275 280
285Leu Val Ser Val Ile Cys Glu Ile Trp Phe Ala Leu Ser Trp Ile Leu 290
295 300Asp Gln Phe Pro Lys Trp Phe Pro
Val Asn Arg Glu Thr Tyr Leu Asp305 310
315 320Arg Leu Ala Leu Arg Tyr Asp Arg Glu Gly Glu Pro
Ser Gln Leu Ala 325 330
335Ala Val Asp Ile Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro
340 345 350Leu Val Thr Ala Asn Thr
Val Leu Ser Ile Leu Ala Val Asp Tyr Pro 355 360
365Val Asp Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala
Met Leu 370 375 380Ser Phe Glu Ser Leu
Ala Glu Thr Ser Glu Phe Ala Arg Lys Trp Val385 390
395 400Pro Phe Cys Lys Lys Tyr Ser Ile Glu Pro
Arg Ala Pro Glu Trp Tyr 405 410
415Phe Ala Ala Lys Ile Asp Tyr Leu Lys Asp Lys Val Gln Thr Ser Phe
420 425 430Val Lys Asp Arg Arg
Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Ile 435
440 445Arg Ile Asn Ala Leu Val Ser Lys Ala Leu Lys Cys
Pro Glu Glu Gly 450 455 460Trp Val Met
Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp465
470 475 480His Pro Gly Met Ile Gln Val
Phe Leu Gly Gln Asn Gly Gly Leu Asp 485
490 495Ala Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val
Ser Arg Glu Lys 500 505 510Arg
Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn Ala Leu 515
520 525Val Arg Val Ser Ala Val Leu Thr Asn
Gly Pro Phe Ile Leu Asn Leu 530 535
540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu Arg Glu Ala Met545
550 555 560Cys Phe Leu Met
Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val Gln 565
570 575Phe Pro Gln Arg Phe Asp Gly Ile Asp Lys
Asn Asp Arg Tyr Ala Asn 580 585
590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly Leu Asp Gly Ile
595 600 605Gln Gly Pro Val Tyr Val Gly
Thr Gly Cys Val Phe Asn Arg Thr Ala 610 615
620Leu Tyr Gly Tyr Glu Pro Pro Ile Lys Val Lys His Lys Lys Pro
Ser625 630 635 640Leu Leu
Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys Asn Ser Lys Ala
645 650 655Lys Lys Glu Ser Asp Lys Lys
Lys Ser Gly Arg His Thr Asp Ser Thr 660 665
670Val Pro Val Phe Asn Leu Asp Asp Ile Glu Glu Gly Val Glu
Gly Ala 675 680 685Gly Phe Asp Asp
Glu Lys Ala Leu Leu Met Ser Gln Met Ser Leu Glu 690
695 700Lys Arg Phe Gly Gln Ser Ala Val Phe Val Ala Ser
Thr Leu Met Glu705 710 715
720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro Glu Asn Leu Leu Lys Glu
725 730 735Ala Ile His Val Ile
Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly 740
745 750Met Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu
Asp Ile Leu Thr 755 760 765Gly Phe
Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro 770
775 780Lys Leu Pro Ala Phe Lys Gly Ser Ala Pro Ile
Asn Leu Ser Asp Arg785 790 795
800Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu Ile Leu Phe
805 810 815Ser Arg His Cys
Pro Ile Trp Tyr Gly Tyr Asn Gly Arg Leu Lys Phe 820
825 830Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr Ile
Tyr Pro Ile Thr Ser 835 840 845Ile
Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val Cys Leu Phe Thr 850
855 860Asn Gln Phe Ile Ile Pro Gln Ile Ser Asn
Ile Ala Ser Ile Trp Phe865 870 875
880Leu Ser Leu Phe Leu Ser Ile Phe Ala Thr Gly Ile Leu Glu Met
Arg 885 890 895Trp Ser Gly
Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe Trp 900
905 910Val Ile Gly Gly Val Ser Ala His Leu Phe
Ala Val Phe Gln Gly Ile 915 920
925Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr Val Thr Ser Lys 930
935 940Ala Ser Asp Glu Asp Gly Asp Phe
Ala Glu Leu Tyr Leu Phe Lys Trp945 950
955 960Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile
Val Asn Leu Val 965 970
975Gly Val Val Ala Gly Val Ser Tyr Ala Ile Asn Ser Gly Tyr Gln Ser
980 985 990Trp Gly Pro Leu Phe Gly
Lys Leu Phe Phe Ala Phe Trp Val Ile Val 995 1000
1005His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg
Gln Asn Arg 1010 1015 1020Thr Pro Thr
Ile Val Val Val Trp Ser Val Leu Leu Ala Ser Ile 1025
1030 1035Phe Leu Leu Leu Trp Val Arg Ile Asp Pro Phe
Thr Ser Arg Val 1040 1045 1050Thr Gly
Pro Asp Ile Leu Glu Cys Gly Ile Asn Cys 1055 1060
1065224690DNAArabidopsis thaliana 22atggaatccg aaggagaaac
cgcggtatgc ttttttgact cttgcttcat cattatactt 60acctttatcg aaatcaggaa
ttatatgtac tgaaattgat tgatttgggt gttgaattgt 120gtattggaga gatctgattt
caaattttct gttgaggttt ctaattttgg cttcattgat 180tcgacttgat ttgtagggaa
agccgatgaa gaacattgtt ccgcagactt gccagatctg 240tagtgacaat gttggcaaga
ctgttgatgg agatcgtttt gtggcttgtg atatttgttc 300attcccagtt tgtcggcctt
gctacgagta tgagaggaaa gatgggaatc aatcttgtcc 360tcagtgcaaa accagataca
agaggctcaa aggttctctt ttgatccttc tgaagtatac 420tgtcttcatt gttcatcgat
agtttatcag tatgttttga attttggatc agattggtat 480ttatagcaat ttgctaattt
ctgattctag gtagtcctgc tattcctggt gataaagacg 540aggatggctt agctgatgaa
ggtactgttg agttcaacta ccctcagaag gagaaaattt 600cagagcggat gcttggttgg
catcttactc gtgggaaggg agaggaaatg ggggaacccc 660agtatgataa agaggtctct
cacaatcatc ttcctcgtct cacgagcaga caagatgtaa 720ggcattgctg ttcattcttc
cctcttaagc attcgcatcc tcacgcaatt tagttttgga 780atctgatttt gtcatttgct
tatttacaga cttcaggaga gttttctgct gcctcacctg 840aacgcctctc tgtatcttct
actatcgctg ggggaaagcg ccttccctat tcatcagatg 900tcaatcaatc acgtaaatat
cctttatttc taactctctc gccaacacat atatttgtac 960ctaggcttct cttttatgtc
aaaactctaa acaataaaat ctgttgttgt cattcacgct 1020gcagcaaata gaaggattgt
ggatcctgtt ggactcggga atgtagcttg gaaggagaga 1080gttgatggct ggaaaatgaa
gcaagagaag aatactggtc ctgtcagcac gcaggctgct 1140tctgaaagag gtggagtaga
tattgatgcc agcacagata tcctagcaga tgaggctctg 1200ctgtgagttc ttgttttgta
atcttgtttg ttctgtcgtg gtgtaccgag cgtttttcct 1260attaagcaat gtcctgatac
tcattttcca attctttatt tattgtacag gaatgacgaa 1320gcgaggcagc ctctgtcaag
gaaagtttca attccttcat cacggatcaa tccttacaga 1380atggttatta tgctgcggct
tgttatcctt tgtctcttct tgcattaccg tataacaaac 1440ccagtgccaa atgcctttgc
tctatggctg gtctctgtga tatgtgagat ctggtttgcc 1500ttatcctgga ttttggatca
gtttcccaag tggtttcctg tgaaccgtga aacctacctc 1560gacaggcttg ctttaaggta
agttctattt ccccattctt ctgaagcaat tactcaaagg 1620attgtttgcc tatactgttt
cccattttaa tttgatcatg gtcaattttt gggacagata 1680tgatcgtgaa ggtgagccat
cacagttagc tgctgttgac attttcgtga gtactgttga 1740ccccttgaag gagccacccc
ttgtgacagc caacacagtg ctctctattc tggctgttga 1800ctacccagtt gacaaggtgt
cctgttatgt ttctgatgat ggtgctgcta tgttatcatt 1860tgaatcactt gcagaaacat
cagagtttgc tcgtaaatgg gtaccatttt gcaagaaata 1920tagcatagag cctcgtgcac
cagaatggta ctttgctgcg aaaatagatt acttgaagga 1980taaagttcag acatcatttg
tcaaagatcg tagagctatg aaggtaagtt tgtagtttta 2040gtcatctagt caccctcact
ttgattttag tgtatgctat attgaccttt tattttcttt 2100cagagggaat atgaggaatt
taaaatccga atcaatgcac ttgtttccaa agccctaaaa 2160tgtcctgaag aagggtgggt
tatgcaagat ggcacaccgt ggcctggaaa taatacaagg 2220gaccatccag gaatgatcca
ggtaagaaat tggttttaac tatggaatcg agaatgctct 2280ctctttctct ctagaagttc
attattgaag taccatttgc tgaatgcagg tcttcttagg 2340gcaaaatggt ggacttgatg
cagagggcaa tgagctcccg cgtttggtat atgtttctcg 2400agaaaagcga ccaggattcc
agcaccacaa aaaggctggt gctatgaatg cactggtaag 2460tttctgatct tggatttttg
acttcttcat tctgaccaat ttgttagtct aatctgggta 2520cttttcaaat gaataggtga
gagtttcagc agttcttacc aatggacctt tcatcttgaa 2580tcttgattgt gatcattaca
taaataacag caaagcctta agagaagcaa tgtgcttcct 2640gatggaccca aacctcggga
agcaagtttg ttatgttcag ttcccacaaa gatttgatgg 2700tatcgataag aacgatagat
atgctaatcg taataccgtg ttctttgatg taagtcacac 2760ttacctatac ttgcgtctaa
ttttcttgtt ctttcaaatt gcttttagac acgaatatac 2820attaaactca cagtttcttg
agtttgtcgt aatttttcca tgatatgttt tccagattaa 2880cttgagaggt ttagatggga
ttcaaggacc tgtatatgtc ggaactggat gtgttttcaa 2940cagaacagca ttatacggtt
atgaacctcc aataaaagta aaacacaaga agccaagtct 3000tttatctaag ctctgtggtg
gatcaagaaa gaagaattcc aaagctaaga aagagtcgga 3060caaaaagaaa tcaggcaggc
atactgactc aactgttcct gtattcaacc tcgatgacat 3120agaagaggga gttgaaggta
caactgtttt tatttcttct ttggtttccg ttatacccat 3180atgttgctgt ttgaaatatt
gatccagggg aggggattat ttatagttga cagttgtcta 3240aatagtttcc atactaggta
tctcatcatg tcttaactat ttggcatttg tgaaacttag 3300gtgctggttt tgatgatgaa
aaggcgctct taatgtcgca aatgagcctg gagaagcgat 3360ttggacagtc tgctgttttt
gttgcttcta ccctaatgga aaatggtggt gttcctcctt 3420cagcaactcc agaaaacctt
ctcaaagagg ctatccatgt cattagttgt ggttatgagg 3480ataagtcaga ttggggaatg
gaggtataat ctcatttgaa ctcctacatg aatctgcatt 3540gttctgacat atccactttg
gcattcactt tgtttatatt ttccgctgtc tttcttcaga 3600ttggatggat ctatggttct
gtgacagaag atattctgac tgggttcaaa atgcatgccc 3660gtggatggcg atccatttac
tgcatgccta agcttccagc tttcaagggt tctgctccta 3720tcaatctttc agatcgtctg
aaccaagtgc tgaggtgggc tttaggttca gttgagattc 3780tcttcagtcg gcattgtcct
atatggtatg gttacaatgg gaggctaaaa tttcttgaga 3840ggtttgcgta tgtgaacacc
accatctacc ctatcacctc cattcctctt ctcatgtatt 3900gtacattgcc agccgtttgt
ctcttcacca accagtttat tattcctcag gtttgacacc 3960tctctctgtc tatctatctc
tatctctatc tctatctcta gaacaaacct taattacgtt 4020ctgtttaact gaaaccatgt
tgtgtttgtc atctatttac ggttccaaat cctgatcagc 4080tggttctatt gttcctcttt
tgcagattag taacattgca agtatatggt ttctgtctct 4140ctttctctcc attttcgcca
cgggtatact agaaatgagg tggagtggcg taggcataga 4200cgaatggtgg agaaacgagc
agttttgggt cattggtgga gtatccgctc atttattcgc 4260tgtgtttcaa ggtatcctca
aagtccttgc cggtattgac acaaacttca cagttacctc 4320aaaagcttca gatgaagacg
gagactttgc tgagctctac ttgttcaaat ggacaacact 4380tctgattccg ccaacgacgc
tgctcattgt aaacttagtg ggagttgttg caggagtctc 4440ttatgctatc aacagtggat
accaatcatg gggaccactc tttggtaagt tgttctttgc 4500cttctgggtg attgttcact
tgtacccttt cctcaagggt ttgatgggtc gacagaaccg 4560gactcctacc attgttgtgg
tctggtctgt tctcttggct tttatcttct cgttgttgtg 4620ggttaggatt gatcccttca
ctagccgagt cactggcccg gacattctgg aatgtggaat 4680caactgttga
4690231065PRTArabidopsis
thaliana 23Met Glu Ser Glu Gly Glu Thr Ala Gly Lys Pro Met Lys Asn Ile
Val1 5 10 15Pro Gln Thr
Cys Gln Ile Cys Ser Asp Asn Val Gly Lys Thr Val Asp 20
25 30Gly Asp Arg Phe Val Ala Cys Asp Ile Cys
Ser Phe Pro Val Cys Arg 35 40
45Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Asn Gln Ser Cys Pro Gln 50
55 60Cys Lys Thr Arg Tyr Lys Arg Leu Lys
Gly Ser Pro Ala Ile Pro Gly65 70 75
80Asp Lys Asp Glu Asp Gly Leu Ala Asp Glu Gly Thr Val Glu
Phe Asn 85 90 95Tyr Pro
Gln Lys Glu Lys Ile Ser Glu Arg Met Leu Gly Trp His Leu 100
105 110Thr Arg Gly Lys Gly Glu Glu Met Gly
Glu Pro Gln Tyr Asp Lys Glu 115 120
125Val Ser His Asn His Leu Pro Arg Leu Thr Ser Arg Gln Asp Thr Ser
130 135 140Gly Glu Phe Ser Ala Ala Ser
Pro Glu Arg Leu Ser Val Ser Ser Thr145 150
155 160Ile Ala Gly Gly Lys Arg Leu Pro Tyr Ser Ser Asp
Val Asn Gln Ser 165 170
175Pro Asn Arg Arg Ile Val Asp Pro Val Gly Leu Gly Asn Val Ala Trp
180 185 190Lys Glu Arg Val Asp Gly
Trp Lys Met Lys Gln Glu Lys Asn Thr Gly 195 200
205Pro Val Ser Thr Gln Ala Ala Ser Glu Arg Gly Gly Val Asp
Ile Asp 210 215 220Ala Ser Thr Asp Ile
Leu Ala Asp Glu Ala Leu Leu Asn Asp Glu Ala225 230
235 240Arg Gln Pro Leu Ser Arg Lys Val Ser Ile
Pro Ser Ser Arg Ile Asn 245 250
255Pro Tyr Arg Met Val Ile Met Leu Arg Leu Val Ile Leu Cys Leu Phe
260 265 270Leu His Tyr Arg Ile
Thr Asn Pro Val Pro Asn Ala Phe Ala Leu Trp 275
280 285Leu Val Ser Val Ile Cys Glu Ile Trp Phe Ala Leu
Ser Trp Ile Leu 290 295 300Asp Gln Phe
Pro Lys Trp Phe Pro Val Asn Arg Glu Thr Tyr Leu Asp305
310 315 320Arg Leu Ala Leu Arg Tyr Asp
Arg Glu Gly Glu Pro Ser Gln Leu Ala 325
330 335Ala Val Asp Ile Phe Val Ser Thr Val Asp Pro Leu
Lys Glu Pro Pro 340 345 350Leu
Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp Tyr Pro 355
360 365Val Asp Lys Val Ser Cys Tyr Val Ser
Asp Asp Gly Ala Ala Met Leu 370 375
380Ser Phe Glu Ser Leu Ala Glu Thr Ser Glu Phe Ala Arg Lys Trp Val385
390 395 400Pro Phe Cys Lys
Lys Tyr Ser Ile Glu Pro Arg Ala Pro Glu Trp Tyr 405
410 415Phe Ala Ala Lys Ile Asp Tyr Leu Lys Asp
Lys Val Gln Thr Ser Phe 420 425
430Val Lys Asp Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Ile
435 440 445Arg Ile Asn Ala Leu Val Ser
Lys Ala Leu Lys Cys Pro Glu Glu Gly 450 455
460Trp Val Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg
Asp465 470 475 480His Pro
Gly Met Ile Gln Val Phe Leu Gly Gln Asn Gly Gly Leu Asp
485 490 495Ala Glu Gly Asn Glu Leu Pro
Arg Leu Val Tyr Val Ser Arg Glu Lys 500 505
510Arg Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn
Ala Leu 515 520 525Val Arg Val Ser
Ala Val Leu Thr Asn Gly Pro Phe Ile Leu Asn Leu 530
535 540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu
Arg Glu Ala Met545 550 555
560Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val Gln
565 570 575Phe Pro Gln Arg Phe
Asp Gly Ile Asp Lys Asn Asp Arg Tyr Ala Asn 580
585 590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly
Leu Asp Gly Ile 595 600 605Gln Gly
Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn Arg Thr Ala 610
615 620Leu Tyr Gly Tyr Glu Pro Pro Ile Lys Val Lys
His Lys Lys Pro Ser625 630 635
640Leu Leu Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys Asn Ser Lys Ala
645 650 655Lys Lys Glu Ser
Asp Lys Lys Lys Ser Gly Arg His Thr Asp Ser Thr 660
665 670Val Pro Val Phe Asn Leu Asp Asp Ile Glu Glu
Gly Val Glu Gly Ala 675 680 685Gly
Phe Asp Asp Glu Lys Ala Leu Leu Met Ser Gln Met Ser Leu Glu 690
695 700Lys Arg Phe Gly Gln Ser Ala Val Phe Val
Ala Ser Thr Leu Met Glu705 710 715
720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro Glu Asn Leu Leu Lys
Glu 725 730 735Ala Ile His
Val Ile Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly 740
745 750Met Glu Ile Gly Trp Ile Tyr Gly Ser Val
Thr Glu Asp Ile Leu Thr 755 760
765Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro 770
775 780Lys Leu Pro Ala Phe Lys Gly Ser
Ala Pro Ile Asn Leu Ser Asp Arg785 790
795 800Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val
Glu Ile Leu Phe 805 810
815Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Asn Gly Arg Leu Lys Phe
820 825 830Leu Glu Arg Phe Ala Tyr
Val Asn Thr Thr Ile Tyr Pro Ile Thr Ser 835 840
845Ile Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val Cys Leu
Phe Thr 850 855 860Asn Gln Phe Ile Ile
Pro Gln Ile Ser Asn Ile Ala Ser Ile Trp Phe865 870
875 880Leu Ser Leu Phe Leu Ser Ile Phe Ala Thr
Gly Ile Leu Glu Met Arg 885 890
895Trp Ser Gly Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe Trp
900 905 910Val Ile Gly Gly Val
Ser Ala His Leu Phe Ala Val Phe Gln Gly Ile 915
920 925Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr
Val Thr Ser Lys 930 935 940Ala Ser Asp
Glu Asp Gly Asp Phe Ala Glu Leu Tyr Leu Phe Lys Trp945
950 955 960Thr Thr Leu Leu Ile Pro Pro
Thr Thr Leu Leu Ile Val Asn Leu Val 965
970 975Gly Val Val Ala Gly Val Ser Tyr Ala Ile Asn Ser
Gly Tyr Gln Ser 980 985 990Trp
Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val Ile Val 995
1000 1005His Leu Tyr Pro Phe Leu Lys Gly
Leu Met Gly Arg Gln Asn Arg 1010 1015
1020Thr Pro Thr Ile Val Val Val Trp Ser Val Leu Leu Ala Phe Ile
1025 1030 1035Phe Ser Leu Leu Trp Val
Arg Ile Asp Pro Phe Thr Ser Arg Val 1040 1045
1050Thr Gly Pro Asp Ile Leu Glu Cys Gly Ile Asn Cys 1055
1060 1065244690DNAArabidopsis thaliana
24atggaatccg aaggagaaac cgcggtatgc ttttttgact cttgcttcat cattatactt
60acctttatcg aaatcaggaa ttatatgtac tgaaattgat tgatttgggt gttgaattgt
120gtattggaga gatctgattt caaattttct gttgaggttt ctaattttgg cttcattgat
180tcgacttgat ttgtagggaa agccgatgaa gaacattgtt ccgcagactt gccagatctg
240tagtgacaat gttggcaaga ctgttgatgg agatcgtttt gtggcttgtg atatttgttc
300attcccagtt tgtcggcctt gctacgagta tgagaggaaa gatgggaatc aatcttgtcc
360tcagtgcaaa accagataca agaggctcaa aggttctctt ttgatccttc tgaagtatac
420tgtcttcatt gttcatcgat agtttatcag tatgttttga attttggatc agattggtat
480ttatagcaat ttgctaattt ctgattctag gtagtcctgc tattcctggt gataaagacg
540aggatggctt agctgatgaa ggtactgttg agttcaacta ccctcagaag gagaaaattt
600cagagcggat gcttggttgg catcttactc gtgggaaggg agaggaaatg ggggaacccc
660agtatgataa agaggtctct cacaatcatc ttcctcgtct cacgagcaga caagatgtaa
720ggcattgctg ttcattcttc cctcttaagc attcgcatcc tcacgcaatt tagttttgga
780atctgatttt gtcatttgct tatttacaga cttcaggaga gttttctgct gcctcacctg
840aacgcctctc tgtatcttct actatcgctg ggggaaagcg ccttccctat tcatcagatg
900tcaatcaatc acgtaaatat cctttatttc taactctctc gccaacacat atatttgtac
960ctaggcttct cttttatgtc aaaactctaa acaataaaat ctgttgttgt cattcacgct
1020gcagcaaata gaaggattgt ggatcctgtt ggactcggga atgtagcttg gaaggagaga
1080gttgatggct ggaaaatgaa gcaagagaag aatactggtc ctgtcagcac gcaggctgct
1140tctgaaagag gtggagtaga tattgatgcc agcacagata tcctagcaga tgaggctctg
1200ctgtgagttc ttgttttgta atcttgtttg ttctgtcgtg gtgtaccgag cgtttttcct
1260attaagcaat gtcctgatac tcattttcca attctttatt tattgtacag gaatgacgaa
1320gcgaggcagc ctctgtcaag gaaagtttca attccttcat cacggatcaa tccttacaga
1380atggttatta tgctgcggct tgttatcctt tgtctcttct tgcattaccg tataacaaac
1440ccagtgccaa atgcctttgc tctatggctg gtctctgtga tatgtgagat ctggtttgcc
1500ttatcctgga ttttggatca gtttcccaag tggtttcctg tgaaccgtga aacctacctc
1560gacaggcttg ctttaaggta agttctattt ccccattctt ctgaagcaat tactcaaagg
1620attgtttgcc tatactgttt cccattttaa tttgatcatg gtcaattttt gggacagata
1680tgatcgtgaa ggtgagccat cacagttagc tgctgttgac attttcgtga gtactgttga
1740ccccttgaag gagccacccc ttgtgacagc caacacagtg ctctctattc tggctgttga
1800ctacccagtt gacaaggtgt cctgttatgt ttctgatgat ggtgctgcta tgttatcatt
1860tgaatcactt gcagaaacat cagagtttgc tcgtaaatgg gtaccatttt gcaagaaata
1920tagcatagag cctcgtgcac cagaatggta ctttgctgcg aaaatagatt acttgaagga
1980taaagttcag acatcatttg tcaaagatcg tagagctatg aaggtaagtt tgtagtttta
2040gtcatctagt caccctcact ttgattttag tgtatgctat attgaccttt tattttcttt
2100cagagggaat atgaggaatt taaaatccga atcaatgcac ttgtttccaa agccctaaaa
2160tgtcctgaag aagggtgggt tatgcaagat ggcacaccgt ggcctggaaa taatacaagg
2220gaccatccag gaatgatcca ggtaagaaat tggttttaac tatggaatcg agaatgctct
2280ctctttctct ctagaagttc attattgaag taccatttgc tgaatgcagg tcttcttagg
2340gcaaaatggt ggacttgatg cagagggcaa tgagctcccg cgtttggtat atgtttctcg
2400agaaaagcga ccaggattcc agcaccacaa aaaggctggt gctatgaatg cactggtaag
2460tttctgatct tggatttttg acttcttcat tctgaccaat ttgttagtct aatctgggta
2520cttttcaaat gaataggtga gagtttcagc agttcttacc aatggacctt tcatcttgaa
2580tcttgattgt gatcattaca taaataacag caaagcctta agagaagcaa tgtgcttcct
2640gatggaccca aacctcggga agcaagtttg ttatgttcag ttcccacaaa gatttgatgg
2700tatcgataag aacgatagat atgctaatcg taataccgtg ttctttgatg taagtcacac
2760ttacctatac ttgcgtctaa ttttcttgtt ctttcaaatt gcttttagac acgaatatac
2820attaaactca cagtttcttg agtttgtcgt aatttttcca tgatatgttt tccagattaa
2880cttgagaggt ttagatggga ttcaaggacc tgtatatgtc ggaactggat gtgttttcaa
2940cagaacagca ttatacggtt atgaacctcc aataaaagta aaacacaaga agccaagtct
3000tttatctaag ctctgtggtg gatcaagaaa gaagaattcc aaagctaaga aagagtcgga
3060caaaaagaaa tcaggcaggc atactgactc aactgttcct gtattcaacc tcgatgacat
3120agaagaggga gttgaaggta caactgtttt tatttcttct ttggtttccg ttatacccat
3180atgttgctgt ttgaaatatt gatccagggg aggggattat ttatagttga cagttgtcta
3240aatagtttcc atactaggta tctcatcatg tcttaactat ttggcatttg tgaaacttag
3300gtgctggttt tgatgatgaa aaggcgctct taatgtcgca aatgagcctg gagaagcgat
3360ttggacagtc tgctgttttt gttgcttcta ccctaatgga aaatggtggt gttcctcctt
3420cagcaactcc agaaaacctt ctcaaagagg ctatccatgt cattagttgt ggttatgagg
3480ataagtcaga ttggggaatg gaggtataat ctcatttgaa ctcctacatg aatctgcatt
3540gttctgacat atccactttg gcattcactt tgtttatatt ttccgctgtc tttcttcaga
3600ttggatggat ctatggttct gtgacagaag atattctgac tgggttcaaa atgcatgccc
3660gtggatggcg atccatttac tgcatgccta agcttccagc tttcaagggt tctgctccta
3720tcaatctttc agatcgtctg aaccaagtgc tgaggtgggc tttaggttca gttgagattc
3780tcttcagtcg gcattgtcct atatggtatg gttacaatgg gaggctaaaa tttcttgaga
3840ggtttgcgta tgtgaacacc accatctacc ctatcacctc cattcctctt ctcatgtatt
3900gtacattgcc agccgtttgt ctcttcacca accagtttat tattcctcag gtttgacacc
3960tctctctgtc tatctatctc tatctctatc tctatctcta gaacaaacct taattacgtt
4020ctgtttaact gaaaccatgt tgtgtttgtc atctatttac ggttccaaat cctgatcagc
4080tggttctatt gttcctcttt tgcagattag taacattgca agtatatggt ttctgtctct
4140ctttctctcc attttcgcca cgggtatact agaaatgagg tggagtggcg taggcataga
4200cgaatggtgg agaaacgagc agttttgggt cattggtgga gtatccgctc atttattcgc
4260tgtgtttcaa ggtatcctca aagtccttgc cggtattgac acaaacttca cagttacctc
4320aaaagcttca gatgaagacg gagactttgc tgagctctac ttgttcaaat ggacaacact
4380tctgattccg ccaacgacgc tgctcattgt aaacttagtg ggagttgttg caggagtctt
4440ttatgctatc aacagtggat accaatcatg gggaccactc tttggtaagt tgttctttgc
4500cttctgggtg attgttcact tgtacccttt cctcaagggt ttgatgggtc gacagaaccg
4560gactcctacc attgttgtgg tctggtctgt tctcttggct tctatcttct cgttgttgtg
4620ggttaggatt gatcccttca ctagccgagt cactggcccg gacattctgg aatgtggaat
4680caactgttga
4690251065PRTArabidopsis thaliana 25Met Glu Ser Glu Gly Glu Thr Ala Gly
Lys Pro Met Lys Asn Ile Val1 5 10
15Pro Gln Thr Cys Gln Ile Cys Ser Asp Asn Val Gly Lys Thr Val
Asp 20 25 30Gly Asp Arg Phe
Val Ala Cys Asp Ile Cys Ser Phe Pro Val Cys Arg 35
40 45Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Asn Gln
Ser Cys Pro Gln 50 55 60Cys Lys Thr
Arg Tyr Lys Arg Leu Lys Gly Ser Pro Ala Ile Pro Gly65 70
75 80Asp Lys Asp Glu Asp Gly Leu Ala
Asp Glu Gly Thr Val Glu Phe Asn 85 90
95Tyr Pro Gln Lys Glu Lys Ile Ser Glu Arg Met Leu Gly Trp
His Leu 100 105 110Thr Arg Gly
Lys Gly Glu Glu Met Gly Glu Pro Gln Tyr Asp Lys Glu 115
120 125Val Ser His Asn His Leu Pro Arg Leu Thr Ser
Arg Gln Asp Thr Ser 130 135 140Gly Glu
Phe Ser Ala Ala Ser Pro Glu Arg Leu Ser Val Ser Ser Thr145
150 155 160Ile Ala Gly Gly Lys Arg Leu
Pro Tyr Ser Ser Asp Val Asn Gln Ser 165
170 175Pro Asn Arg Arg Ile Val Asp Pro Val Gly Leu Gly
Asn Val Ala Trp 180 185 190Lys
Glu Arg Val Asp Gly Trp Lys Met Lys Gln Glu Lys Asn Thr Gly 195
200 205Pro Val Ser Thr Gln Ala Ala Ser Glu
Arg Gly Gly Val Asp Ile Asp 210 215
220Ala Ser Thr Asp Ile Leu Ala Asp Glu Ala Leu Leu Asn Asp Glu Ala225
230 235 240Arg Gln Pro Leu
Ser Arg Lys Val Ser Ile Pro Ser Ser Arg Ile Asn 245
250 255Pro Tyr Arg Met Val Ile Met Leu Arg Leu
Val Ile Leu Cys Leu Phe 260 265
270Leu His Tyr Arg Ile Thr Asn Pro Val Pro Asn Ala Phe Ala Leu Trp
275 280 285Leu Val Ser Val Ile Cys Glu
Ile Trp Phe Ala Leu Ser Trp Ile Leu 290 295
300Asp Gln Phe Pro Lys Trp Phe Pro Val Asn Arg Glu Thr Tyr Leu
Asp305 310 315 320Arg Leu
Ala Leu Arg Tyr Asp Arg Glu Gly Glu Pro Ser Gln Leu Ala
325 330 335Ala Val Asp Ile Phe Val Ser
Thr Val Asp Pro Leu Lys Glu Pro Pro 340 345
350Leu Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp
Tyr Pro 355 360 365Val Asp Lys Val
Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu 370
375 380Ser Phe Glu Ser Leu Ala Glu Thr Ser Glu Phe Ala
Arg Lys Trp Val385 390 395
400Pro Phe Cys Lys Lys Tyr Ser Ile Glu Pro Arg Ala Pro Glu Trp Tyr
405 410 415Phe Ala Ala Lys Ile
Asp Tyr Leu Lys Asp Lys Val Gln Thr Ser Phe 420
425 430Val Lys Asp Arg Arg Ala Met Lys Arg Glu Tyr Glu
Glu Phe Lys Ile 435 440 445Arg Ile
Asn Ala Leu Val Ser Lys Ala Leu Lys Cys Pro Glu Glu Gly 450
455 460Trp Val Met Gln Asp Gly Thr Pro Trp Pro Gly
Asn Asn Thr Arg Asp465 470 475
480His Pro Gly Met Ile Gln Val Phe Leu Gly Gln Asn Gly Gly Leu Asp
485 490 495Ala Glu Gly Asn
Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys 500
505 510Arg Pro Gly Phe Gln His His Lys Lys Ala Gly
Ala Met Asn Ala Leu 515 520 525Val
Arg Val Ser Ala Val Leu Thr Asn Gly Pro Phe Ile Leu Asn Leu 530
535 540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys
Ala Leu Arg Glu Ala Met545 550 555
560Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val
Gln 565 570 575Phe Pro Gln
Arg Phe Asp Gly Ile Asp Lys Asn Asp Arg Tyr Ala Asn 580
585 590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu
Arg Gly Leu Asp Gly Ile 595 600
605Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn Arg Thr Ala 610
615 620Leu Tyr Gly Tyr Glu Pro Pro Ile
Lys Val Lys His Lys Lys Pro Ser625 630
635 640Leu Leu Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys
Asn Ser Lys Ala 645 650
655Lys Lys Glu Ser Asp Lys Lys Lys Ser Gly Arg His Thr Asp Ser Thr
660 665 670Val Pro Val Phe Asn Leu
Asp Asp Ile Glu Glu Gly Val Glu Gly Ala 675 680
685Gly Phe Asp Asp Glu Lys Ala Leu Leu Met Ser Gln Met Ser
Leu Glu 690 695 700Lys Arg Phe Gly Gln
Ser Ala Val Phe Val Ala Ser Thr Leu Met Glu705 710
715 720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro
Glu Asn Leu Leu Lys Glu 725 730
735Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly
740 745 750Met Glu Ile Gly Trp
Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr 755
760 765Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile
Tyr Cys Met Pro 770 775 780Lys Leu Pro
Ala Phe Lys Gly Ser Ala Pro Ile Asn Leu Ser Asp Arg785
790 795 800Leu Asn Gln Val Leu Arg Trp
Ala Leu Gly Ser Val Glu Ile Leu Phe 805
810 815Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Asn Gly
Arg Leu Lys Phe 820 825 830Leu
Glu Arg Phe Ala Tyr Val Asn Thr Thr Ile Tyr Pro Ile Thr Ser 835
840 845Ile Pro Leu Leu Met Tyr Cys Thr Leu
Pro Ala Val Cys Leu Phe Thr 850 855
860Asn Gln Phe Ile Ile Pro Gln Ile Ser Asn Ile Ala Ser Ile Trp Phe865
870 875 880Leu Ser Leu Phe
Leu Ser Ile Phe Ala Thr Gly Ile Leu Glu Met Arg 885
890 895Trp Ser Gly Val Gly Ile Asp Glu Trp Trp
Arg Asn Glu Gln Phe Trp 900 905
910Val Ile Gly Gly Val Ser Ala His Leu Phe Ala Val Phe Gln Gly Ile
915 920 925Leu Lys Val Leu Ala Gly Ile
Asp Thr Asn Phe Thr Val Thr Ser Lys 930 935
940Ala Ser Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr Leu Phe Lys
Trp945 950 955 960Thr Thr
Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile Val Asn Leu Val
965 970 975Gly Val Val Ala Gly Val Phe
Tyr Ala Ile Asn Ser Gly Tyr Gln Ser 980 985
990Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val
Ile Val 995 1000 1005His Leu Tyr
Pro Phe Leu Lys Gly Leu Met Gly Arg Gln Asn Arg 1010
1015 1020Thr Pro Thr Ile Val Val Val Trp Ser Val Leu
Leu Ala Ser Ile 1025 1030 1035Phe Ser
Leu Leu Trp Val Arg Ile Asp Pro Phe Thr Ser Arg Val 1040
1045 1050Thr Gly Pro Asp Ile Leu Glu Cys Gly Ile
Asn Cys 1055 1060
1065264690DNAArabidopsis thaliana 26atggaatccg aaggagaaac cgcggtatgc
ttttttgact cttgcttcat cattatactt 60acctttatcg aaatcaggaa ttatatgtac
tgaaattgat tgatttgggt gttgaattgt 120gtattggaga gatctgattt caaattttct
gttgaggttt ctaattttgg cttcattgat 180tcgacttgat ttgtagggaa agccgatgaa
gaacattgtt ccgcagactt gccagatctg 240tagtgacaat gttggcaaga ctgttgatgg
agatcgtttt gtggcttgtg atatttgttc 300attcccagtt tgtcggcctt gctacgagta
tgagaggaaa gatgggaatc aatcttgtcc 360tcagtgcaaa accagataca agaggctcaa
aggttctctt ttgatccttc tgaagtatac 420tgtcttcatt gttcatcgat agtttatcag
tatgttttga attttggatc agattggtat 480ttatagcaat ttgctaattt ctgattctag
gtagtcctgc tattcctggt gataaagacg 540aggatggctt agctgatgaa ggtactgttg
agttcaacta ccctcagaag gagaaaattt 600cagagcggat gcttggttgg catcttactc
gtgggaaggg agaggaaatg ggggaacccc 660agtatgataa agaggtctct cacaatcatc
ttcctcgtct cacgagcaga caagatgtaa 720ggcattgctg ttcattcttc cctcttaagc
attcgcatcc tcacgcaatt tagttttgga 780atctgatttt gtcatttgct tatttacaga
cttcaggaga gttttctgct gcctcacctg 840aacgcctctc tgtatcttct actatcgctg
ggggaaagcg ccttccctat tcatcagatg 900tcaatcaatc acgtaaatat cctttatttc
taactctctc gccaacacat atatttgtac 960ctaggcttct cttttatgtc aaaactctaa
acaataaaat ctgttgttgt cattcacgct 1020gcagcaaata gaaggattgt ggatcctgtt
ggactcggga atgtagcttg gaaggagaga 1080gttgatggct ggaaaatgaa gcaagagaag
aatactggtc ctgtcagcac gcaggctgct 1140tctgaaagag gtggagtaga tattgatgcc
agcacagata tcctagcaga tgaggctctg 1200ctgtgagttc ttgttttgta atcttgtttg
ttctgtcgtg gtgtaccgag cgtttttcct 1260attaagcaat gtcctgatac tcattttcca
attctttatt tattgtacag gaatgacgaa 1320gcgaggcagc ctctgtcaag gaaagtttca
attccttcat cacggatcaa tccttacaga 1380atggttatta tgctgcggct tgttatcctt
tgtctcttct tgcattaccg tataacaaac 1440ccagtgccaa atgcctttgc tctatggctg
gtctctgtga tatgtgagat ctggtttgcc 1500ttatcctgga ttttggatca gtttcccaag
tggtttcctg tgaaccgtga aacctacctc 1560gacaggcttg ctttaaggta agttctattt
ccccattctt ctgaagcaat tactcaaagg 1620attgtttgcc tatactgttt cccattttaa
tttgatcatg gtcaattttt gggacagata 1680tgatcgtgaa ggtgagccat cacagttagc
tgctgttgac attttcgtga gtactgttga 1740ccccttgaag gagccacccc ttgtgacagc
caacacagtg ctctctattc tggctgttga 1800ctacccagtt gacaaggtgt cctgttatgt
ttctgatgat ggtgctgcta tgttatcatt 1860tgaatcactt gcagaaacat cagagtttgc
tcgtaaatgg gtaccatttt gcaagaaata 1920tagcatagag cctcgtgcac cagaatggta
ctttgctgcg aaaatagatt acttgaagga 1980taaagttcag acatcatttg tcaaagatcg
tagagctatg aaggtaagtt tgtagtttta 2040gtcatctagt caccctcact ttgattttag
tgtatgctat attgaccttt tattttcttt 2100cagagggaat atgaggaatt taaaatccga
atcaatgcac ttgtttccaa agccctaaaa 2160tgtcctgaag aagggtgggt tatgcaagat
ggcacaccgt ggcctggaaa taatacaagg 2220gaccatccag gaatgatcca ggtaagaaat
tggttttaac tatggaatcg agaatgctct 2280ctctttctct ctagaagttc attattgaag
taccatttgc tgaatgcagg tcttcttagg 2340gcaaaatggt ggacttgatg cagagggcaa
tgagctcccg cgtttggtat atgtttctcg 2400agaaaagcga ccaggattcc agcaccacaa
aaaggctggt gctatgaatg cactggtaag 2460tttctgatct tggatttttg acttcttcat
tctgaccaat ttgttagtct aatctgggta 2520cttttcaaat gaataggtga gagtttcagc
agttcttacc aatggacctt tcatcttgaa 2580tcttgattgt gatcattaca taaataacag
caaagcctta agagaagcaa tgtgcttcct 2640gatggaccca aacctcggga agcaagtttg
ttatgttcag ttcccacaaa gatttgatgg 2700tatcgataag aacgatagat atgctaatcg
taataccgtg ttctttgatg taagtcacac 2760ttacctatac ttgcgtctaa ttttcttgtt
ctttcaaatt gcttttagac acgaatatac 2820attaaactca cagtttcttg agtttgtcgt
aatttttcca tgatatgttt tccagattaa 2880cttgagaggt ttagatggga ttcaaggacc
tgtatatgtc ggaactggat gtgttttcaa 2940cagaacagca ttatacggtt atgaacctcc
aataaaagta aaacacaaga agccaagtct 3000tttatctaag ctctgtggtg gatcaagaaa
gaagaattcc aaagctaaga aagagtcgga 3060caaaaagaaa tcaggcaggc atactgactc
aactgttcct gtattcaacc tcgatgacat 3120agaagaggga gttgaaggta caactgtttt
tatttcttct ttggtttccg ttatacccat 3180atgttgctgt ttgaaatatt gatccagggg
aggggattat ttatagttga cagttgtcta 3240aatagtttcc atactaggta tctcatcatg
tcttaactat ttggcatttg tgaaacttag 3300gtgctggttt tgatgatgaa aaggcgctct
taatgtcgca aatgagcctg gagaagcgat 3360ttggacagtc tgctgttttt gttgcttcta
ccctaatgga aaatggtggt gttcctcctt 3420cagcaactcc agaaaacctt ctcaaagagg
ctatccatgt cattagttgt ggttatgagg 3480ataagtcaga ttggggaatg gaggtataat
ctcatttgaa ctcctacatg aatctgcatt 3540gttctgacat atccactttg gcattcactt
tgtttatatt ttccgctgtc tttcttcaga 3600ttggatggat ctatggttct gtgacagaag
atattctgac tgggttcaaa atgcatgccc 3660gtggatggcg atccatttac tgcatgccta
agcttccagc tttcaagggt tctgctccta 3720tcaatctttc agatcgtctg aaccaagtgc
tgaggtgggc tttaggttca gttgagattc 3780tcttcagtcg gcattgtcct atatggtatg
gttacaatgg gaggctaaaa tttcttgaga 3840ggtttgcgta tgtgaacacc accatctacc
ctatcacctc cattcctctt ctcatgtatt 3900gtacattgcc agccgtttgt ctcttcacca
accagtttat tattcctcag gtttgacacc 3960tctctctgtc tatctatctc tatctctatc
tctatctcta gaacaaacct taattacgtt 4020ctgtttaact gaaaccatgt tgtgtttgtc
atctatttac ggttccaaat cctgatcagc 4080tggttctatt gttcctcttt tgcagattag
taacattgca agtatatggt ttctgtctct 4140ctttctctcc attttcgcca cgggtatact
agaaatgagg tggagtggcg taggcataga 4200cgaatggtgg agaaacgagc agttttgggt
cattggtgga gtatccgctc atttattcgc 4260tgtgtttcaa ggtatcctca aagtccttgc
cggtattgac acaaacttca cagttacctc 4320aaaagcttca gatgaagacg gagactttgc
tgagctctac ttgttcaaat ggacaacact 4380tctgattccg ccaacgacgc tgctcattgt
aaacttagtg ggagttgttg caggagtctc 4440ttatgctatc aacagtggat accaatcatg
gggaccactc tttagtaagt tgttctttgc 4500cttctgggtg attgttcact tgtacccttt
cctcaagggt ttgatgggtc gacagaaccg 4560gactcctacc attgttgtgg tctggtctgt
tctcttggct tctatcttct cgttgttgtg 4620ggttaggatt gatcccttca ctagccgagt
cactggcccg gacattctgg aatgtggaat 4680caactgttga
4690271065PRTArabidopsis thaliana 27Met
Glu Ser Glu Gly Glu Thr Ala Gly Lys Pro Met Lys Asn Ile Val1
5 10 15Pro Gln Thr Cys Gln Ile Cys
Ser Asp Asn Val Gly Lys Thr Val Asp 20 25
30Gly Asp Arg Phe Val Ala Cys Asp Ile Cys Ser Phe Pro Val
Cys Arg 35 40 45Pro Cys Tyr Glu
Tyr Glu Arg Lys Asp Gly Asn Gln Ser Cys Pro Gln 50 55
60Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Ala
Ile Pro Gly65 70 75
80Asp Lys Asp Glu Asp Gly Leu Ala Asp Glu Gly Thr Val Glu Phe Asn
85 90 95Tyr Pro Gln Lys Glu Lys
Ile Ser Glu Arg Met Leu Gly Trp His Leu 100
105 110Thr Arg Gly Lys Gly Glu Glu Met Gly Glu Pro Gln
Tyr Asp Lys Glu 115 120 125Val Ser
His Asn His Leu Pro Arg Leu Thr Ser Arg Gln Asp Thr Ser 130
135 140Gly Glu Phe Ser Ala Ala Ser Pro Glu Arg Leu
Ser Val Ser Ser Thr145 150 155
160Ile Ala Gly Gly Lys Arg Leu Pro Tyr Ser Ser Asp Val Asn Gln Ser
165 170 175Pro Asn Arg Arg
Ile Val Asp Pro Val Gly Leu Gly Asn Val Ala Trp 180
185 190Lys Glu Arg Val Asp Gly Trp Lys Met Lys Gln
Glu Lys Asn Thr Gly 195 200 205Pro
Val Ser Thr Gln Ala Ala Ser Glu Arg Gly Gly Val Asp Ile Asp 210
215 220Ala Ser Thr Asp Ile Leu Ala Asp Glu Ala
Leu Leu Asn Asp Glu Ala225 230 235
240Arg Gln Pro Leu Ser Arg Lys Val Ser Ile Pro Ser Ser Arg Ile
Asn 245 250 255Pro Tyr Arg
Met Val Ile Met Leu Arg Leu Val Ile Leu Cys Leu Phe 260
265 270Leu His Tyr Arg Ile Thr Asn Pro Val Pro
Asn Ala Phe Ala Leu Trp 275 280
285Leu Val Ser Val Ile Cys Glu Ile Trp Phe Ala Leu Ser Trp Ile Leu 290
295 300Asp Gln Phe Pro Lys Trp Phe Pro
Val Asn Arg Glu Thr Tyr Leu Asp305 310
315 320Arg Leu Ala Leu Arg Tyr Asp Arg Glu Gly Glu Pro
Ser Gln Leu Ala 325 330
335Ala Val Asp Ile Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro
340 345 350Leu Val Thr Ala Asn Thr
Val Leu Ser Ile Leu Ala Val Asp Tyr Pro 355 360
365Val Asp Lys Val Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala
Met Leu 370 375 380Ser Phe Glu Ser Leu
Ala Glu Thr Ser Glu Phe Ala Arg Lys Trp Val385 390
395 400Pro Phe Cys Lys Lys Tyr Ser Ile Glu Pro
Arg Ala Pro Glu Trp Tyr 405 410
415Phe Ala Ala Lys Ile Asp Tyr Leu Lys Asp Lys Val Gln Thr Ser Phe
420 425 430Val Lys Asp Arg Arg
Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Ile 435
440 445Arg Ile Asn Ala Leu Val Ser Lys Ala Leu Lys Cys
Pro Glu Glu Gly 450 455 460Trp Val Met
Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp465
470 475 480His Pro Gly Met Ile Gln Val
Phe Leu Gly Gln Asn Gly Gly Leu Asp 485
490 495Ala Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val
Ser Arg Glu Lys 500 505 510Arg
Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn Ala Leu 515
520 525Val Arg Val Ser Ala Val Leu Thr Asn
Gly Pro Phe Ile Leu Asn Leu 530 535
540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu Arg Glu Ala Met545
550 555 560Cys Phe Leu Met
Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val Gln 565
570 575Phe Pro Gln Arg Phe Asp Gly Ile Asp Lys
Asn Asp Arg Tyr Ala Asn 580 585
590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly Leu Asp Gly Ile
595 600 605Gln Gly Pro Val Tyr Val Gly
Thr Gly Cys Val Phe Asn Arg Thr Ala 610 615
620Leu Tyr Gly Tyr Glu Pro Pro Ile Lys Val Lys His Lys Lys Pro
Ser625 630 635 640Leu Leu
Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys Asn Ser Lys Ala
645 650 655Lys Lys Glu Ser Asp Lys Lys
Lys Ser Gly Arg His Thr Asp Ser Thr 660 665
670Val Pro Val Phe Asn Leu Asp Asp Ile Glu Glu Gly Val Glu
Gly Ala 675 680 685Gly Phe Asp Asp
Glu Lys Ala Leu Leu Met Ser Gln Met Ser Leu Glu 690
695 700Lys Arg Phe Gly Gln Ser Ala Val Phe Val Ala Ser
Thr Leu Met Glu705 710 715
720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro Glu Asn Leu Leu Lys Glu
725 730 735Ala Ile His Val Ile
Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly 740
745 750Met Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu
Asp Ile Leu Thr 755 760 765Gly Phe
Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro 770
775 780Lys Leu Pro Ala Phe Lys Gly Ser Ala Pro Ile
Asn Leu Ser Asp Arg785 790 795
800Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu Ile Leu Phe
805 810 815Ser Arg His Cys
Pro Ile Trp Tyr Gly Tyr Asn Gly Arg Leu Lys Phe 820
825 830Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr Ile
Tyr Pro Ile Thr Ser 835 840 845Ile
Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val Cys Leu Phe Thr 850
855 860Asn Gln Phe Ile Ile Pro Gln Ile Ser Asn
Ile Ala Ser Ile Trp Phe865 870 875
880Leu Ser Leu Phe Leu Ser Ile Phe Ala Thr Gly Ile Leu Glu Met
Arg 885 890 895Trp Ser Gly
Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe Trp 900
905 910Val Ile Gly Gly Val Ser Ala His Leu Phe
Ala Val Phe Gln Gly Ile 915 920
925Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr Val Thr Ser Lys 930
935 940Ala Ser Asp Glu Asp Gly Asp Phe
Ala Glu Leu Tyr Leu Phe Lys Trp945 950
955 960Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile
Val Asn Leu Val 965 970
975Gly Val Val Ala Gly Val Ser Tyr Ala Ile Asn Ser Gly Tyr Gln Ser
980 985 990Trp Gly Pro Leu Phe Ser
Lys Leu Phe Phe Ala Phe Trp Val Ile Val 995 1000
1005His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg
Gln Asn Arg 1010 1015 1020Thr Pro Thr
Ile Val Val Val Trp Ser Val Leu Leu Ala Ser Ile 1025
1030 1035Phe Ser Leu Leu Trp Val Arg Ile Asp Pro Phe
Thr Ser Arg Val 1040 1045 1050Thr Gly
Pro Asp Ile Leu Glu Cys Gly Ile Asn Cys 1055 1060
1065284690DNAArabidopsis thaliana 28atggaatccg aaggagaaac
cgcggtatgc ttttttgact cttgcttcat cattatactt 60acctttatcg aaatcaggaa
ttatatgtac tgaaattgat tgatttgggt gttgaattgt 120gtattggaga gatctgattt
caaattttct gttgaggttt ctaattttgg cttcattgat 180tcgacttgat ttgtagggaa
agccgatgaa gaacattgtt ccgcagactt gccagatctg 240tagtgacaat gttggcaaga
ctgttgatgg agatcgtttt gtggcttgtg atatttgttc 300attcccagtt tgtcggcctt
gctacgagta tgagaggaaa gatgggaatc aatcttgtcc 360tcagtgcaaa accagataca
agaggctcaa aggttctctt ttgatccttc tgaagtatac 420tgtcttcatt gttcatcgat
agtttatcag tatgttttga attttggatc agattggtat 480ttatagcaat ttgctaattt
ctgattctag gtagtcctgc tattcctggt gataaagacg 540aggatggctt agctgatgaa
ggtactgttg agttcaacta ccctcagaag gagaaaattt 600cagagcggat gcttggttgg
catcttactc gtgggaaggg agaggaaatg ggggaacccc 660agtatgataa agaggtctct
cacaatcatc ttcctcgtct cacgagcaga caagatgtaa 720ggcattgctg ttcattcttc
cctcttaagc attcgcatcc tcacgcaatt tagttttgga 780atctgatttt gtcatttgct
tatttacaga cttcaggaga gttttctgct gcctcacctg 840aacgcctctc tgtatcttct
actatcgctg ggggaaagcg ccttccctat tcatcagatg 900tcaatcaatc acgtaaatat
cctttatttc taactctctc gccaacacat atatttgtac 960ctaggcttct cttttatgtc
aaaactctaa acaataaaat ctgttgttgt cattcacgct 1020gcagcaaata gaaggattgt
ggatcctgtt ggactcggga atgtagcttg gaaggagaga 1080gttgatggct ggaaaatgaa
gcaagagaag aatactggtc ctgtcagcac gcaggctgct 1140tctgaaagag gtggagtaga
tattgatgcc agcacagata tcctagcaga tgaggctctg 1200ctgtgagttc ttgttttgta
atcttgtttg ttctgtcgtg gtgtaccgag cgtttttcct 1260attaagcaat gtcctgatac
tcattttcca attctttatt tattgtacag gaatgacgaa 1320gcgaggcagc ctctgtcaag
gaaagtttca attccttcat cacggatcaa tccttacaga 1380atggttatta tgctgcggct
tgttatcctt tgtctcttct tgcattaccg tataacaaac 1440ccagtgccaa atgcctttgc
tctatggctg gtctctgtga tatgtgagat ctggtttgcc 1500ttatcctgga ttttggatca
gtttcccaag tggtttcctg tgaaccgtga aacctacctc 1560gacaggcttg ctttaaggta
agttctattt ccccattctt ctgaagcaat tactcaaagg 1620attgtttgcc tatactgttt
cccattttaa tttgatcatg gtcaattttt gggacagata 1680tgatcgtgaa ggtgagccat
cacagttagc tgctgttgac attttcgtga gtactgttga 1740ccccttgaag gagccacccc
ttgtgacagc caacacagtg ctctctattc tggctgttga 1800ctacccagtt gacaaggtgt
cctgttatgt ttctgatgat ggtgctgcta tgttatcatt 1860tgaatcactt gcagaaacat
cagagtttgc tcgtaaatgg gtaccatttt gcaagaaata 1920tagcatagag cctcgtgcac
cagaatggta ctttgctgcg aaaatagatt acttgaagga 1980taaagttcag acatcatttg
tcaaagatcg tagagctatg aaggtaagtt tgtagtttta 2040gtcatctagt caccctcact
ttgattttag tgtatgctat attgaccttt tattttcttt 2100cagagggaat atgaggaatt
taaaatccga atcaatgcac ttgtttccaa agccctaaaa 2160tgtcctgaag aagggtgggt
tatgcaagat ggcacaccgt ggcctggaaa taatacaagg 2220gaccatccag gaatgatcca
ggtaagaaat tggttttaac tatggaatcg agaatgctct 2280ctctttctct ctagaagttc
attattgaag taccatttgc tgaatgcagg tcttcttagg 2340gcaaaatggt ggacttgatg
cagagggcaa tgagctcccg cgtttggtat atgtttctcg 2400agaaaagcga ccaggattcc
agcaccacaa aaaggctggt gctatgaatg cactggtaag 2460tttctgatct tggatttttg
acttcttcat tctgaccaat ttgttagtct aatctgggta 2520cttttcaaat gaataggtga
gagtttcagc agttcttacc aatggacctt tcatcttgaa 2580tcttgattgt gatcattaca
taaataacag caaagcctta agagaagcaa tgtgcttcct 2640gatggaccca aacctcggga
agcaagtttg ttatgttcag ttcccacaaa gatttgatgg 2700tatcgataag aacgatagat
atgctaatcg taataccgtg ttctttgatg taagtcacac 2760ttacctatac ttgcgtctaa
ttttcttgtt ctttcaaatt gcttttagac acgaatatac 2820attaaactca cagtttcttg
agtttgtcgt aatttttcca tgatatgttt tccagattaa 2880cttgagaggt ttagatggga
ttcaaggacc tgtatatgtc ggaactggat gtgttttcaa 2940cagaacagca ttatacggtt
atgaacctcc aataaaagta aaacacaaga agccaagtct 3000tttatctaag ctctgtggtg
gatcaagaaa gaagaattcc aaagctaaga aagagtcgga 3060caaaaagaaa tcaggcaggc
atactgactc aactgttcct gtattcaacc tcgatgacat 3120agaagaggga gttgaaggta
caactgtttt tatttcttct ttggtttccg ttatacccat 3180atgttgctgt ttgaaatatt
gatccagggg aggggattat ttatagttga cagttgtcta 3240aatagtttcc atactaggta
tctcatcatg tcttaactat ttggcatttg tgaaacttag 3300gtgctggttt tgatgatgaa
aaggcgctct taatgtcgca aatgagcctg gagaagcgat 3360ttggacagtc tgctgttttt
gttgcttcta ccctaatgga aaatggtggt gttcctcctt 3420cagcaactcc agaaaacctt
ctcaaagagg ctatccatgt cattagttgt ggttatgagg 3480ataagtcaga ttggggaatg
gaggtataat ctcatttgaa ctcctacatg aatctgcatt 3540gttctgacat atccactttg
gcattcactt tgtttatatt ttccgctgtc tttcttcaga 3600ttggatggat ctatggttct
gtgacagaag atattctgac tgggttcaaa atgcatgccc 3660gtggatggcg atccatttac
tgcatgccta agcttccagc tttcaagggt tctgctccta 3720tcaatctttc agatcgtctg
aaccaagtgc tgaagtgggc tttaggttca gttgagattc 3780tcttcagtcg gcattgtcct
atatggtatg gttacaatgg gaggctaaaa tttcttgaga 3840ggtttgcgta tgtgaacacc
accatctacc ctatcacctc cattcctctt ctcatgtatt 3900gtacattgcc agccgtttgt
ctcttcacca accagtttat tattcctcag gtttgacacc 3960tctctctgtc tatctatctc
tatctctatc tctatctcta gaacaaacct taattacgtt 4020ctgtttaact gaaaccatgt
tgtgtttgtc atctatttac ggttccaaat cctgatcagc 4080tggttctatt gttcctcttt
tgcagattag taacattgca agtatatggt ttctgtctct 4140ctttctctcc attttcgcca
cgggtatact agaaatgagg tggagtggcg taggcataga 4200cgaatggtgg agaaacgagc
agttttgggt cattggtgga gtatccgctc atttattcgc 4260tgtgtttcaa ggtatcctca
aagtccttgc cggtattgac acaaacttca cagttacctc 4320aaaagcttca gatgaagacg
gagactttgc tgagctctac ttgttcaaat ggacaacact 4380tctgattccg ccaacgacgc
tgctcattgt aaacttagtg ggagttgttg caggagtctc 4440ttatgctatc aacagtggat
accaatcatg gggaccactc tttggtaagt tgttctttgc 4500cttctgggtg attgttcact
tgtacccttt cctcaagggt ttgatgggtc gacagaaccg 4560gactcctacc attgttgtgg
tctggtctgt tctcttggct tctatcttct cgttgttgtg 4620ggttaggatt gatcccttca
ctagccgagt cactggcccg gacattctgg aatgtggaat 4680caactgttga
4690291065PRTArabidopsis
thaliana 29Met Glu Ser Glu Gly Glu Thr Ala Gly Lys Pro Met Lys Asn Ile
Val1 5 10 15Pro Gln Thr
Cys Gln Ile Cys Ser Asp Asn Val Gly Lys Thr Val Asp 20
25 30Gly Asp Arg Phe Val Ala Cys Asp Ile Cys
Ser Phe Pro Val Cys Arg 35 40
45Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Asn Gln Ser Cys Pro Gln 50
55 60Cys Lys Thr Arg Tyr Lys Arg Leu Lys
Gly Ser Pro Ala Ile Pro Gly65 70 75
80Asp Lys Asp Glu Asp Gly Leu Ala Asp Glu Gly Thr Val Glu
Phe Asn 85 90 95Tyr Pro
Gln Lys Glu Lys Ile Ser Glu Arg Met Leu Gly Trp His Leu 100
105 110Thr Arg Gly Lys Gly Glu Glu Met Gly
Glu Pro Gln Tyr Asp Lys Glu 115 120
125Val Ser His Asn His Leu Pro Arg Leu Thr Ser Arg Gln Asp Thr Ser
130 135 140Gly Glu Phe Ser Ala Ala Ser
Pro Glu Arg Leu Ser Val Ser Ser Thr145 150
155 160Ile Ala Gly Gly Lys Arg Leu Pro Tyr Ser Ser Asp
Val Asn Gln Ser 165 170
175Pro Asn Arg Arg Ile Val Asp Pro Val Gly Leu Gly Asn Val Ala Trp
180 185 190Lys Glu Arg Val Asp Gly
Trp Lys Met Lys Gln Glu Lys Asn Thr Gly 195 200
205Pro Val Ser Thr Gln Ala Ala Ser Glu Arg Gly Gly Val Asp
Ile Asp 210 215 220Ala Ser Thr Asp Ile
Leu Ala Asp Glu Ala Leu Leu Asn Asp Glu Ala225 230
235 240Arg Gln Pro Leu Ser Arg Lys Val Ser Ile
Pro Ser Ser Arg Ile Asn 245 250
255Pro Tyr Arg Met Val Ile Met Leu Arg Leu Val Ile Leu Cys Leu Phe
260 265 270Leu His Tyr Arg Ile
Thr Asn Pro Val Pro Asn Ala Phe Ala Leu Trp 275
280 285Leu Val Ser Val Ile Cys Glu Ile Trp Phe Ala Leu
Ser Trp Ile Leu 290 295 300Asp Gln Phe
Pro Lys Trp Phe Pro Val Asn Arg Glu Thr Tyr Leu Asp305
310 315 320Arg Leu Ala Leu Arg Tyr Asp
Arg Glu Gly Glu Pro Ser Gln Leu Ala 325
330 335Ala Val Asp Ile Phe Val Ser Thr Val Asp Pro Leu
Lys Glu Pro Pro 340 345 350Leu
Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp Tyr Pro 355
360 365Val Asp Lys Val Ser Cys Tyr Val Ser
Asp Asp Gly Ala Ala Met Leu 370 375
380Ser Phe Glu Ser Leu Ala Glu Thr Ser Glu Phe Ala Arg Lys Trp Val385
390 395 400Pro Phe Cys Lys
Lys Tyr Ser Ile Glu Pro Arg Ala Pro Glu Trp Tyr 405
410 415Phe Ala Ala Lys Ile Asp Tyr Leu Lys Asp
Lys Val Gln Thr Ser Phe 420 425
430Val Lys Asp Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Ile
435 440 445Arg Ile Asn Ala Leu Val Ser
Lys Ala Leu Lys Cys Pro Glu Glu Gly 450 455
460Trp Val Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg
Asp465 470 475 480His Pro
Gly Met Ile Gln Val Phe Leu Gly Gln Asn Gly Gly Leu Asp
485 490 495Ala Glu Gly Asn Glu Leu Pro
Arg Leu Val Tyr Val Ser Arg Glu Lys 500 505
510Arg Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn
Ala Leu 515 520 525Val Arg Val Ser
Ala Val Leu Thr Asn Gly Pro Phe Ile Leu Asn Leu 530
535 540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu
Arg Glu Ala Met545 550 555
560Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val Gln
565 570 575Phe Pro Gln Arg Phe
Asp Gly Ile Asp Lys Asn Asp Arg Tyr Ala Asn 580
585 590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly
Leu Asp Gly Ile 595 600 605Gln Gly
Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn Arg Thr Ala 610
615 620Leu Tyr Gly Tyr Glu Pro Pro Ile Lys Val Lys
His Lys Lys Pro Ser625 630 635
640Leu Leu Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys Asn Ser Lys Ala
645 650 655Lys Lys Glu Ser
Asp Lys Lys Lys Ser Gly Arg His Thr Asp Ser Thr 660
665 670Val Pro Val Phe Asn Leu Asp Asp Ile Glu Glu
Gly Val Glu Gly Ala 675 680 685Gly
Phe Asp Asp Glu Lys Ala Leu Leu Met Ser Gln Met Ser Leu Glu 690
695 700Lys Arg Phe Gly Gln Ser Ala Val Phe Val
Ala Ser Thr Leu Met Glu705 710 715
720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro Glu Asn Leu Leu Lys
Glu 725 730 735Ala Ile His
Val Ile Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly 740
745 750Met Glu Ile Gly Trp Ile Tyr Gly Ser Val
Thr Glu Asp Ile Leu Thr 755 760
765Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro 770
775 780Lys Leu Pro Ala Phe Lys Gly Ser
Ala Pro Ile Asn Leu Ser Asp Arg785 790
795 800Leu Asn Gln Val Leu Lys Trp Ala Leu Gly Ser Val
Glu Ile Leu Phe 805 810
815Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Asn Gly Arg Leu Lys Phe
820 825 830Leu Glu Arg Phe Ala Tyr
Val Asn Thr Thr Ile Tyr Pro Ile Thr Ser 835 840
845Ile Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val Cys Leu
Phe Thr 850 855 860Asn Gln Phe Ile Ile
Pro Gln Ile Ser Asn Ile Ala Ser Ile Trp Phe865 870
875 880Leu Ser Leu Phe Leu Ser Ile Phe Ala Thr
Gly Ile Leu Glu Met Arg 885 890
895Trp Ser Gly Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe Trp
900 905 910Val Ile Gly Gly Val
Ser Ala His Leu Phe Ala Val Phe Gln Gly Ile 915
920 925Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr
Val Thr Ser Lys 930 935 940Ala Ser Asp
Glu Asp Gly Asp Phe Ala Glu Leu Tyr Leu Phe Lys Trp945
950 955 960Thr Thr Leu Leu Ile Pro Pro
Thr Thr Leu Leu Ile Val Asn Leu Val 965
970 975Gly Val Val Ala Gly Val Ser Tyr Ala Ile Asn Ser
Gly Tyr Gln Ser 980 985 990Trp
Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val Ile Val 995
1000 1005His Leu Tyr Pro Phe Leu Lys Gly
Leu Met Gly Arg Gln Asn Arg 1010 1015
1020Thr Pro Thr Ile Val Val Val Trp Ser Val Leu Leu Ala Ser Ile
1025 1030 1035Phe Ser Leu Leu Trp Val
Arg Ile Asp Pro Phe Thr Ser Arg Val 1040 1045
1050Thr Gly Pro Asp Ile Leu Glu Cys Gly Ile Asn Cys 1055
1060 1065304690DNAArabidopsis thaliana
30atggaatccg aaggagaaac cgcggtatgc ttttttgact cttgcttcat cattatactt
60acctttatcg aaatcaggaa ttatatgtac tgaaattgat tgatttgggt gttgaattgt
120gtattggaga gatctgattt caaattttct gttgaggttt ctaattttgg cttcattgat
180tcgacttgat ttgtagggaa agccgatgaa gaacattgtt ccgcagactt gccagatctg
240tagtgacaat gttggcaaga ctgttgatgg agatcgtttt gtggcttgtg atatttgttc
300attcccagtt tgtcggcctt gctacgagta tgagaggaaa gatgggaatc aatcttgtcc
360tcagtgcaaa accagataca agaggctcaa aggttctctt ttgatccttc tgaagtatac
420tgtcttcatt gttcatcgat agtttatcag tatgttttga attttggatc agattggtat
480ttatagcaat ttgctaattt ctgattctag gtagtcctgc tattcctggt gataaagacg
540aggatggctt agctgatgaa ggtactgttg agttcaacta ccctcagaag gagaaaattt
600cagagcggat gcttggttgg catcttactc gtgggaaggg agaggaaatg ggggaacccc
660agtatgataa agaggtctct cacaatcatc ttcctcgtct cacgagcaga caagatgtaa
720ggcattgctg ttcattcttc cctcttaagc attcgcatcc tcacgcaatt tagttttgga
780atctgatttt gtcatttgct tatttacaga cttcaggaga gttttctgct gcctcacctg
840aacgcctctc tgtatcttct actatcgctg ggggaaagcg ccttccctat tcatcagatg
900tcaatcaatc acgtaaatat cctttatttc taactctctc gccaacacat atatttgtac
960ctaggcttct cttttatgtc aaaactctaa acaataaaat ctgttgttgt cattcacgct
1020gcagcaaata gaaggattgt ggatcctgtt ggactcggga atgtagcttg gaaggagaga
1080gttgatggct ggaaaatgaa gcaagagaag aatactggtc ctgtcagcac gcaggctgct
1140tctgaaagag gtggagtaga tattgatgcc agcacagata tcctagcaga tgaggctctg
1200ctgtgagttc ttgttttgta atcttgtttg ttctgtcgtg gtgtaccgag cgtttttcct
1260attaagcaat gtcctgatac tcattttcca attctttatt tattgtacag gaatgacgaa
1320gcgaggcagc ctctgtcaag gaaagtttca attccttcat cacggatcaa tccttacaga
1380atggttatta tgctgcggct tgttatcctt tgtctcttct tgcattaccg tataacaaac
1440ccagtgccaa atgcctttgc tctatggctg gtctctgtga tatgtgagat ctggtttgcc
1500ttatcctgga ttttggatca gtttcccaag tggtttcctg tgaaccgtga aacctacctc
1560gacaggcttg ctttaaggta agttctattt ccccattctt ctgaagcaat tactcaaagg
1620attgtttgcc tatactgttt cccattttaa tttgatcatg gtcaattttt gggacagata
1680tgatcgtgaa ggtgagccat cacagttagc tgctgttgac attttcgtga gtactgttga
1740ccccttgaag gagccacccc ttgtgacagc caacacagtg ctctctattc tggctgttga
1800ctacccagtt gacaaggtgt cctgttatgt ttctgatgat ggtgctgcta tgttatcatt
1860tgaatcactt gcagaaacat cagagtttgc tcgtaaatgg gtaccatttt gcaagaaata
1920tagcatagag cctcgtgcac cagaatggta ctttgctgcg aaaatagatt acttgaagga
1980taaagttcag acatcatttg tcaaagatcg tagagctatg aaggtaagtt tgtagtttta
2040gtcatctagt caccctcact ttgattttag tgtatgctat attgaccttt tattttcttt
2100cagagggaat atgaggaatt taaaatccga atcaatgcac ttgtttccaa agccctaaaa
2160tgtcctgaag aagggtgggt tatgcaagat ggcacaccgt ggcctggaaa taatacaagg
2220gaccatccag gaatgatcca ggtaagaaat tggttttaac tatggaatcg agaatgctct
2280ctctttctct ctagaagttc attattgaag taccatttgc tgaatgcagg tcttcttagg
2340gcaaaatggt ggacttgatg cagagggcaa tgagctcccg cgtttggtat atgtttctcg
2400agaaaagcga ccaggattcc agcaccacaa aaaggctggt gctatgaatg cactggtaag
2460tttctgatct tggatttttg acttcttcat tctgaccaat ttgttagtct aatctgggta
2520cttttcaaat gaataggtga gagtttcagc agttcttacc aatggacctt tcatcttgaa
2580tcttgattgt gatcattaca taaataacag caaagcctta agagaagcaa tgtgcttcct
2640gatggaccca aacctcggga agcaagtttg ttatgttcag ttcccacaaa gatttgatgg
2700tatcgataag aacgatagat atgctaatcg taataccgtg ttctttgatg taagtcacac
2760ttacctatac ttgcgtctaa ttttcttgtt ctttcaaatt gcttttagac acgaatatac
2820attaaactca cagtttcttg agtttgtcgt aatttttcca tgatatgttt tccagattaa
2880cttgagaggt ttagatggga ttcaaggacc tgtatatgtc ggaactggat gtgttttcaa
2940cagaacagca ttatacggtt atgaacctcc aataaaagta aaacacaaga agccaagtct
3000tttatctaag ctctgtggtg gatcaagaaa gaagaattcc aaagctaaga aagagtcgga
3060caaaaagaaa tcaggcaggc atactgactc aactgttcct gtattcaacc tcgatgacat
3120agaagaggga gttgaaggta caactgtttt tatttcttct ttggtttccg ttatacccat
3180atgttgctgt ttgaaatatt gatccagggg aggggattat ttatagttga cagttgtcta
3240aatagtttcc atactaggta tctcatcatg tcttaactat ttggcatttg tgaaacttag
3300gtgctggttt tgatgatgaa aaggcgctct taatgtcgca aatgagcctg gagaagcgat
3360ttggacagtc tgctgttttt gttgcttcta ccctaatgga aaatggtggt gttcctcctt
3420cagcaactcc agaaaacctt ctcaaagagg ctatccatgt cattagttgt ggttatgagg
3480ataagtcaga ttggggaatg gaggtataat ctcatttgaa ctcctacatg aatctgcatt
3540gttctgacat atccactttg gcattcactt tgtttatatt ttccgctgtc tttcttcaga
3600ttggatggat ctatggttct gtgacagaag atattctgac tgggttcaaa atgcatgccc
3660gtggatggcg atccatttac tgcatgccta agcttccagc tttcaagggt tctgctccta
3720tcaatttttc agatcgtctg aaccaagtgc tgaggtgggc tttaggttca gttgagattc
3780tcttcagtcg gcattgtcct atatggtatg gttacaatgg gaggctaaaa tttcttgaga
3840ggtttgcgta tgtgaacacc accatctacc ctatcacctc cattcctctt ctcatgtatt
3900gtacattgcc agccgtttgt ctcttcacca accagtttat tattcctcag gtttgacacc
3960tctctctgtc tatctatctc tatctctatc tctatctcta gaacaaacct taattacgtt
4020ctgtttaact gaaaccatgt tgtgtttgtc atctatttac ggttccaaat cctgatcagc
4080tggttctatt gttcctcttt tgcagattag taacattgca agtatatggt ttctgtctct
4140ctttctctcc attttcgcca cgggtatact agaaatgagg tggagtggcg taggcataga
4200cgaatggtgg agaaacgagc agttttgggt cattggtgga gtatccgctc atttattcgc
4260tgtgtttcaa ggtatcctca aagtccttgc cggtattgac acaaacttca cagttacctc
4320aaaagcttca gatgaagacg gagactttgc tgagctctac ttgttcaaat ggacaacact
4380tctgattccg ccaacgacgc tgctcattgt aaacttagtg ggagttgttg caggagtctc
4440ttatgctatc aacagtggat accaatcatg gggaccactc tttggtaagt tgttctttgc
4500cttctgggtg attgttcact tgtacccttt cctcaagggt ttgatgggtc gacagaaccg
4560gactcctacc attgttgtgg tctggtctgt tctcttggct tctatcttct cgttgttgtg
4620ggttaggatt gatcccttca ctagccgagt cactggcccg gacattctgg aatgtggaat
4680caactgttga
4690311065PRTArabidopsis thaliana 31Met Glu Ser Glu Gly Glu Thr Ala Gly
Lys Pro Met Lys Asn Ile Val1 5 10
15Pro Gln Thr Cys Gln Ile Cys Ser Asp Asn Val Gly Lys Thr Val
Asp 20 25 30Gly Asp Arg Phe
Val Ala Cys Asp Ile Cys Ser Phe Pro Val Cys Arg 35
40 45Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Asn Gln
Ser Cys Pro Gln 50 55 60Cys Lys Thr
Arg Tyr Lys Arg Leu Lys Gly Ser Pro Ala Ile Pro Gly65 70
75 80Asp Lys Asp Glu Asp Gly Leu Ala
Asp Glu Gly Thr Val Glu Phe Asn 85 90
95Tyr Pro Gln Lys Glu Lys Ile Ser Glu Arg Met Leu Gly Trp
His Leu 100 105 110Thr Arg Gly
Lys Gly Glu Glu Met Gly Glu Pro Gln Tyr Asp Lys Glu 115
120 125Val Ser His Asn His Leu Pro Arg Leu Thr Ser
Arg Gln Asp Thr Ser 130 135 140Gly Glu
Phe Ser Ala Ala Ser Pro Glu Arg Leu Ser Val Ser Ser Thr145
150 155 160Ile Ala Gly Gly Lys Arg Leu
Pro Tyr Ser Ser Asp Val Asn Gln Ser 165
170 175Pro Asn Arg Arg Ile Val Asp Pro Val Gly Leu Gly
Asn Val Ala Trp 180 185 190Lys
Glu Arg Val Asp Gly Trp Lys Met Lys Gln Glu Lys Asn Thr Gly 195
200 205Pro Val Ser Thr Gln Ala Ala Ser Glu
Arg Gly Gly Val Asp Ile Asp 210 215
220Ala Ser Thr Asp Ile Leu Ala Asp Glu Ala Leu Leu Asn Asp Glu Ala225
230 235 240Arg Gln Pro Leu
Ser Arg Lys Val Ser Ile Pro Ser Ser Arg Ile Asn 245
250 255Pro Tyr Arg Met Val Ile Met Leu Arg Leu
Val Ile Leu Cys Leu Phe 260 265
270Leu His Tyr Arg Ile Thr Asn Pro Val Pro Asn Ala Phe Ala Leu Trp
275 280 285Leu Val Ser Val Ile Cys Glu
Ile Trp Phe Ala Leu Ser Trp Ile Leu 290 295
300Asp Gln Phe Pro Lys Trp Phe Pro Val Asn Arg Glu Thr Tyr Leu
Asp305 310 315 320Arg Leu
Ala Leu Arg Tyr Asp Arg Glu Gly Glu Pro Ser Gln Leu Ala
325 330 335Ala Val Asp Ile Phe Val Ser
Thr Val Asp Pro Leu Lys Glu Pro Pro 340 345
350Leu Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp
Tyr Pro 355 360 365Val Asp Lys Val
Ser Cys Tyr Val Ser Asp Asp Gly Ala Ala Met Leu 370
375 380Ser Phe Glu Ser Leu Ala Glu Thr Ser Glu Phe Ala
Arg Lys Trp Val385 390 395
400Pro Phe Cys Lys Lys Tyr Ser Ile Glu Pro Arg Ala Pro Glu Trp Tyr
405 410 415Phe Ala Ala Lys Ile
Asp Tyr Leu Lys Asp Lys Val Gln Thr Ser Phe 420
425 430Val Lys Asp Arg Arg Ala Met Lys Arg Glu Tyr Glu
Glu Phe Lys Ile 435 440 445Arg Ile
Asn Ala Leu Val Ser Lys Ala Leu Lys Cys Pro Glu Glu Gly 450
455 460Trp Val Met Gln Asp Gly Thr Pro Trp Pro Gly
Asn Asn Thr Arg Asp465 470 475
480His Pro Gly Met Ile Gln Val Phe Leu Gly Gln Asn Gly Gly Leu Asp
485 490 495Ala Glu Gly Asn
Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys 500
505 510Arg Pro Gly Phe Gln His His Lys Lys Ala Gly
Ala Met Asn Ala Leu 515 520 525Val
Arg Val Ser Ala Val Leu Thr Asn Gly Pro Phe Ile Leu Asn Leu 530
535 540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys
Ala Leu Arg Glu Ala Met545 550 555
560Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val
Gln 565 570 575Phe Pro Gln
Arg Phe Asp Gly Ile Asp Lys Asn Asp Arg Tyr Ala Asn 580
585 590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu
Arg Gly Leu Asp Gly Ile 595 600
605Gln Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn Arg Thr Ala 610
615 620Leu Tyr Gly Tyr Glu Pro Pro Ile
Lys Val Lys His Lys Lys Pro Ser625 630
635 640Leu Leu Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys
Asn Ser Lys Ala 645 650
655Lys Lys Glu Ser Asp Lys Lys Lys Ser Gly Arg His Thr Asp Ser Thr
660 665 670Val Pro Val Phe Asn Leu
Asp Asp Ile Glu Glu Gly Val Glu Gly Ala 675 680
685Gly Phe Asp Asp Glu Lys Ala Leu Leu Met Ser Gln Met Ser
Leu Glu 690 695 700Lys Arg Phe Gly Gln
Ser Ala Val Phe Val Ala Ser Thr Leu Met Glu705 710
715 720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro
Glu Asn Leu Leu Lys Glu 725 730
735Ala Ile His Val Ile Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly
740 745 750Met Glu Ile Gly Trp
Ile Tyr Gly Ser Val Thr Glu Asp Ile Leu Thr 755
760 765Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile
Tyr Cys Met Pro 770 775 780Lys Leu Pro
Ala Phe Lys Gly Ser Ala Pro Ile Asn Phe Ser Asp Arg785
790 795 800Leu Asn Gln Val Leu Arg Trp
Ala Leu Gly Ser Val Glu Ile Leu Phe 805
810 815Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Asn Gly
Arg Leu Lys Phe 820 825 830Leu
Glu Arg Phe Ala Tyr Val Asn Thr Thr Ile Tyr Pro Ile Thr Ser 835
840 845Ile Pro Leu Leu Met Tyr Cys Thr Leu
Pro Ala Val Cys Leu Phe Thr 850 855
860Asn Gln Phe Ile Ile Pro Gln Ile Ser Asn Ile Ala Ser Ile Trp Phe865
870 875 880Leu Ser Leu Phe
Leu Ser Ile Phe Ala Thr Gly Ile Leu Glu Met Arg 885
890 895Trp Ser Gly Val Gly Ile Asp Glu Trp Trp
Arg Asn Glu Gln Phe Trp 900 905
910Val Ile Gly Gly Val Ser Ala His Leu Phe Ala Val Phe Gln Gly Ile
915 920 925Leu Lys Val Leu Ala Gly Ile
Asp Thr Asn Phe Thr Val Thr Ser Lys 930 935
940Ala Ser Asp Glu Asp Gly Asp Phe Ala Glu Leu Tyr Leu Phe Lys
Trp945 950 955 960Thr Thr
Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile Val Asn Leu Val
965 970 975Gly Val Val Ala Gly Val Ser
Tyr Ala Ile Asn Ser Gly Tyr Gln Ser 980 985
990Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val
Ile Val 995 1000 1005His Leu Tyr
Pro Phe Leu Lys Gly Leu Met Gly Arg Gln Asn Arg 1010
1015 1020Thr Pro Thr Ile Val Val Val Trp Ser Val Leu
Leu Ala Ser Ile 1025 1030 1035Phe Ser
Leu Leu Trp Val Arg Ile Asp Pro Phe Thr Ser Arg Val 1040
1045 1050Thr Gly Pro Asp Ile Leu Glu Cys Gly Ile
Asn Cys 1055 1060
1065324690DNAArabidopsis thaliana 32atggaatccg aaggagaaac cgcggtatgc
ttttttgact cttgcttcat cattatactt 60acctttatcg aaatcaggaa ttatatgtac
tgaaattgat tgatttgggt gttgaattgt 120gtattggaga gatctgattt caaattttct
gttgaggttt ctaattttgg cttcattgat 180tcgacttgat ttgtagggaa agccgatgaa
gaacattgtt ccgcagactt gccagatctg 240tagtgacaat gttggcaaga ctgttgatgg
agatcgtttt gtggcttgtg atatttgttc 300attcccagtt tgtcggcctt gctacgagta
tgagaggaaa gatgggaatc aatcttgtcc 360tcagtgcaaa accagataca agaggctcaa
aggttctctt ttgatccttc tgaagtatac 420tgtcttcatt gttcatcgat agtttatcag
tatgttttga attttggatc agattggtat 480ttatagcaat ttgctaattt ctgattctag
gtagtcctgc tattcctggt gataaagacg 540aggatggctt agctgatgaa ggtactgttg
agttcaacta ccctcagaag gagaaaattt 600cagagcggat gcttggttgg catcttactc
gtgggaaggg agaggaaatg ggggaacccc 660agtatgataa agaggtctct cacaatcatc
ttcctcgtct cacgagcaga caagatgtaa 720ggcattgctg ttcattcttc cctcttaagc
attcgcatcc tcacgcaatt tagttttgga 780atctgatttt gtcatttgct tatttacaga
cttcaggaga gttttctgct gcctcacctg 840aacgcctctc tgtatcttct actatcgctg
ggggaaagcg ccttccctat tcatcagatg 900tcaatcaatc acgtaaatat cctttatttc
taactctctc gccaacacat atatttgtac 960ctaggcttct cttttatgtc aaaactctaa
acaataaaat ctgttgttgt cattcacgct 1020gcagcaaata gaaggattgt ggatcctgtt
ggactcggga atgtagcttg gaaggagaga 1080gttgatggct ggaaaatgaa gcaagagaag
aatactggtc ctgtcagcac gcaggctgct 1140tctgaaagag gtggagtaga tattgatgcc
agcacagata tcctagcaga tgaggctctg 1200ctgtgagttc ttgttttgta atcttgtttg
ttctgtcgtg gtgtaccgag cgtttttcct 1260attaagcaat gtcctgatac tcattttcca
attctttatt tattgtacag gaatgacgaa 1320gcgaggcagc ctctgtcaag gaaagtttca
attccttcat cacggatcaa tccttacaga 1380atggttatta tgctgcggct tgttatcctt
tgtctcttct tgcattaccg tataacaaac 1440ccagtgccaa atgcctttgc tctatggctg
gtctctgtga tatgtgagat ctggtttgcc 1500ttatcctgga ttttggatca gtttcccaag
tggtttcctg tgaaccgtga aacctacctc 1560gacaggcttg ctttaaggta agttctattt
ccccattctt ctgaagcaat tactcaaagg 1620attgtttgcc tatactgttt cccattttaa
tttgatcatg gtcaattttt gggacagata 1680tgatcgtgaa ggtgagccat cacagttagc
tgctgttgac attttcgtga gtactgttga 1740ccccttgaag gagccacccc ttgtgacagc
caacacagtg ctctctattc tggctgttga 1800ctacccagtt gacaaggtgt cctgttatgt
ttttgatgat ggtgctgcta tgttatcatt 1860tgaatcactt gcagaaacat cagagtttgc
tcgtaaatgg gtaccatttt gcaagaaata 1920tagcatagag cctcgtgcac cagaatggta
ctttgctgcg aaaatagatt acttgaagga 1980taaagttcag acatcatttg tcaaagatcg
tagagctatg aaggtaagtt tgtagtttta 2040gtcatctagt caccctcact ttgattttag
tgtatgctat attgaccttt tattttcttt 2100cagagggaat atgaggaatt taaaatccga
atcaatgcac ttgtttccaa agccctaaaa 2160tgtcctgaag aagggtgggt tatgcaagat
ggcacaccgt ggcctggaaa taatacaagg 2220gaccatccag gaatgatcca ggtaagaaat
tggttttaac tatggaatcg agaatgctct 2280ctctttctct ctagaagttc attattgaag
taccatttgc tgaatgcagg tcttcttagg 2340gcaaaatggt ggacttgatg cagagggcaa
tgagctcccg cgtttggtat atgtttctcg 2400agaaaagcga ccaggattcc agcaccacaa
aaaggctggt gctatgaatg cactggtaag 2460tttctgatct tggatttttg acttcttcat
tctgaccaat ttgttagtct aatctgggta 2520cttttcaaat gaataggtga gagtttcagc
agttcttacc aatggacctt tcatcttgaa 2580tcttgattgt gatcattaca taaataacag
caaagcctta agagaagcaa tgtgcttcct 2640gatggaccca aacctcggga agcaagtttg
ttatgttcag ttcccacaaa gatttgatgg 2700tatcgataag aacgatagat atgctaatcg
taataccgtg ttctttgatg taagtcacac 2760ttacctatac ttgcgtctaa ttttcttgtt
ctttcaaatt gcttttagac acgaatatac 2820attaaactca cagtttcttg agtttgtcgt
aatttttcca tgatatgttt tccagattaa 2880cttgagaggt ttagatggga ttcaaggacc
tgtatatgtc ggaactggat gtgttttcaa 2940cagaacagca ttatacggtt atgaacctcc
aataaaagta aaacacaaga agccaagtct 3000tttatctaag ctctgtggtg gatcaagaaa
gaagaattcc aaagctaaga aagagtcgga 3060caaaaagaaa tcaggcaggc atactgactc
aactgttcct gtattcaacc tcgatgacat 3120agaagaggga gttgaaggta caactgtttt
tatttcttct ttggtttccg ttatacccat 3180atgttgctgt ttgaaatatt gatccagggg
aggggattat ttatagttga cagttgtcta 3240aatagtttcc atactaggta tctcatcatg
tcttaactat ttggcatttg tgaaacttag 3300gtgctggttt tgatgatgaa aaggcgctct
taatgtcgca aatgagcctg gagaagcgat 3360ttggacagtc tgctgttttt gttgcttcta
ccctaatgga aaatggtggt gttcctcctt 3420cagcaactcc agaaaacctt ctcaaagagg
ctatccatgt cattagttgt ggttatgagg 3480ataagtcaga ttggggaatg gaggtataat
ctcatttgaa ctcctacatg aatctgcatt 3540gttctgacat atccactttg gcattcactt
tgtttatatt ttccgctgtc tttcttcaga 3600ttggatggat ctatggttct gtgacagaag
atattctgac tgggttcaaa atgcatgccc 3660gtggatggcg atccatttac tgcatgccta
agcttccagc tttcaagggt tctgctccta 3720tcaatctttc agatcgtctg aaccaagtgc
tgaggtgggc tttaggttca gttgagattc 3780tcttcagtcg gcattgtcct atatggtatg
gttacaatgg gaggctaaaa tttcttgaga 3840ggtttgcgta tgtgaacacc accatctacc
ctatcacctc cattcctctt ctcatgtatt 3900gtacattgcc agccgtttgt ctcttcacca
accagtttat tattcctcag gtttgacacc 3960tctctctgtc tatctatctc tatctctatc
tctatctcta gaacaaacct taattacgtt 4020ctgtttaact gaaaccatgt tgtgtttgtc
atctatttac ggttccaaat cctgatcagc 4080tggttctatt gttcctcttt tgcagattag
taacattgca agtatatggt ttctgtctct 4140ctttctctcc attttcgcca cgggtatact
agaaatgagg tggagtggcg taggcataga 4200cgaatggtgg agaaacgagc agttttgggt
cattggtgga gtatccgctc atttattcgc 4260tgtgtttcaa ggtatcctca aagtccttgc
cggtattgac acaaacttca cagttacctc 4320aaaagcttca gatgaagacg gagactttgc
tgagctctac ttgttcaaat ggacaacact 4380tctgattccg ccaacgacgc tgctcattgt
aaacttagtg ggagttgttg caggagtctc 4440ttatgctatc aacagtggat accaatcatg
gggaccactc tttggtaagt tgttctttgc 4500cttctgggtg attgttcact tgtacccttt
cctcaagggt ttgatgggtc gacagaaccg 4560gactcctacc attgttgtgg tctggtctgt
tctcttggct tctatcttct cgttgttgtg 4620ggttaggatt gatcccttca ctagccgagt
cactggcccg gacattctgg aatgtggaat 4680caactgttga
4690331065PRTArabidopsis thaliana 33Met
Glu Ser Glu Gly Glu Thr Ala Gly Lys Pro Met Lys Asn Ile Val1
5 10 15Pro Gln Thr Cys Gln Ile Cys
Ser Asp Asn Val Gly Lys Thr Val Asp 20 25
30Gly Asp Arg Phe Val Ala Cys Asp Ile Cys Ser Phe Pro Val
Cys Arg 35 40 45Pro Cys Tyr Glu
Tyr Glu Arg Lys Asp Gly Asn Gln Ser Cys Pro Gln 50 55
60Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Ala
Ile Pro Gly65 70 75
80Asp Lys Asp Glu Asp Gly Leu Ala Asp Glu Gly Thr Val Glu Phe Asn
85 90 95Tyr Pro Gln Lys Glu Lys
Ile Ser Glu Arg Met Leu Gly Trp His Leu 100
105 110Thr Arg Gly Lys Gly Glu Glu Met Gly Glu Pro Gln
Tyr Asp Lys Glu 115 120 125Val Ser
His Asn His Leu Pro Arg Leu Thr Ser Arg Gln Asp Thr Ser 130
135 140Gly Glu Phe Ser Ala Ala Ser Pro Glu Arg Leu
Ser Val Ser Ser Thr145 150 155
160Ile Ala Gly Gly Lys Arg Leu Pro Tyr Ser Ser Asp Val Asn Gln Ser
165 170 175Pro Asn Arg Arg
Ile Val Asp Pro Val Gly Leu Gly Asn Val Ala Trp 180
185 190Lys Glu Arg Val Asp Gly Trp Lys Met Lys Gln
Glu Lys Asn Thr Gly 195 200 205Pro
Val Ser Thr Gln Ala Ala Ser Glu Arg Gly Gly Val Asp Ile Asp 210
215 220Ala Ser Thr Asp Ile Leu Ala Asp Glu Ala
Leu Leu Asn Asp Glu Ala225 230 235
240Arg Gln Pro Leu Ser Arg Lys Val Ser Ile Pro Ser Ser Arg Ile
Asn 245 250 255Pro Tyr Arg
Met Val Ile Met Leu Arg Leu Val Ile Leu Cys Leu Phe 260
265 270Leu His Tyr Arg Ile Thr Asn Pro Val Pro
Asn Ala Phe Ala Leu Trp 275 280
285Leu Val Ser Val Ile Cys Glu Ile Trp Phe Ala Leu Ser Trp Ile Leu 290
295 300Asp Gln Phe Pro Lys Trp Phe Pro
Val Asn Arg Glu Thr Tyr Leu Asp305 310
315 320Arg Leu Ala Leu Arg Tyr Asp Arg Glu Gly Glu Pro
Ser Gln Leu Ala 325 330
335Ala Val Asp Ile Phe Val Ser Thr Val Asp Pro Leu Lys Glu Pro Pro
340 345 350Leu Val Thr Ala Asn Thr
Val Leu Ser Ile Leu Ala Val Asp Tyr Pro 355 360
365Val Asp Lys Val Ser Cys Tyr Val Phe Asp Asp Gly Ala Ala
Met Leu 370 375 380Ser Phe Glu Ser Leu
Ala Glu Thr Ser Glu Phe Ala Arg Lys Trp Val385 390
395 400Pro Phe Cys Lys Lys Tyr Ser Ile Glu Pro
Arg Ala Pro Glu Trp Tyr 405 410
415Phe Ala Ala Lys Ile Asp Tyr Leu Lys Asp Lys Val Gln Thr Ser Phe
420 425 430Val Lys Asp Arg Arg
Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Ile 435
440 445Arg Ile Asn Ala Leu Val Ser Lys Ala Leu Lys Cys
Pro Glu Glu Gly 450 455 460Trp Val Met
Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp465
470 475 480His Pro Gly Met Ile Gln Val
Phe Leu Gly Gln Asn Gly Gly Leu Asp 485
490 495Ala Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val
Ser Arg Glu Lys 500 505 510Arg
Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn Ala Leu 515
520 525Val Arg Val Ser Ala Val Leu Thr Asn
Gly Pro Phe Ile Leu Asn Leu 530 535
540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu Arg Glu Ala Met545
550 555 560Cys Phe Leu Met
Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val Gln 565
570 575Phe Pro Gln Arg Phe Asp Gly Ile Asp Lys
Asn Asp Arg Tyr Ala Asn 580 585
590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly Leu Asp Gly Ile
595 600 605Gln Gly Pro Val Tyr Val Gly
Thr Gly Cys Val Phe Asn Arg Thr Ala 610 615
620Leu Tyr Gly Tyr Glu Pro Pro Ile Lys Val Lys His Lys Lys Pro
Ser625 630 635 640Leu Leu
Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys Asn Ser Lys Ala
645 650 655Lys Lys Glu Ser Asp Lys Lys
Lys Ser Gly Arg His Thr Asp Ser Thr 660 665
670Val Pro Val Phe Asn Leu Asp Asp Ile Glu Glu Gly Val Glu
Gly Ala 675 680 685Gly Phe Asp Asp
Glu Lys Ala Leu Leu Met Ser Gln Met Ser Leu Glu 690
695 700Lys Arg Phe Gly Gln Ser Ala Val Phe Val Ala Ser
Thr Leu Met Glu705 710 715
720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro Glu Asn Leu Leu Lys Glu
725 730 735Ala Ile His Val Ile
Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly 740
745 750Met Glu Ile Gly Trp Ile Tyr Gly Ser Val Thr Glu
Asp Ile Leu Thr 755 760 765Gly Phe
Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro 770
775 780Lys Leu Pro Ala Phe Lys Gly Ser Ala Pro Ile
Asn Leu Ser Asp Arg785 790 795
800Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val Glu Ile Leu Phe
805 810 815Ser Arg His Cys
Pro Ile Trp Tyr Gly Tyr Asn Gly Arg Leu Lys Phe 820
825 830Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr Ile
Tyr Pro Ile Thr Ser 835 840 845Ile
Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val Cys Leu Phe Thr 850
855 860Asn Gln Phe Ile Ile Pro Gln Ile Ser Asn
Ile Ala Ser Ile Trp Phe865 870 875
880Leu Ser Leu Phe Leu Ser Ile Phe Ala Thr Gly Ile Leu Glu Met
Arg 885 890 895Trp Ser Gly
Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe Trp 900
905 910Val Ile Gly Gly Val Ser Ala His Leu Phe
Ala Val Phe Gln Gly Ile 915 920
925Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr Val Thr Ser Lys 930
935 940Ala Ser Asp Glu Asp Gly Asp Phe
Ala Glu Leu Tyr Leu Phe Lys Trp945 950
955 960Thr Thr Leu Leu Ile Pro Pro Thr Thr Leu Leu Ile
Val Asn Leu Val 965 970
975Gly Val Val Ala Gly Val Ser Tyr Ala Ile Asn Ser Gly Tyr Gln Ser
980 985 990Trp Gly Pro Leu Phe Gly
Lys Leu Phe Phe Ala Phe Trp Val Ile Val 995 1000
1005His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg
Gln Asn Arg 1010 1015 1020Thr Pro Thr
Ile Val Val Val Trp Ser Val Leu Leu Ala Ser Ile 1025
1030 1035Phe Ser Leu Leu Trp Val Arg Ile Asp Pro Phe
Thr Ser Arg Val 1040 1045 1050Thr Gly
Pro Asp Ile Leu Glu Cys Gly Ile Asn Cys 1055 1060
1065344690DNAArabidopsis thaliana 34atggaatccg aaggagaaac
cgcggtatgc ttttttgact cttgcttcat cattatactt 60acctttatcg aaatcaggaa
ttatatgtac tgaaattgat tgatttgggt gttgaattgt 120gtattggaga gatctgattt
caaattttct gttgaggttt ctaattttgg cttcattgat 180tcgacttgat ttgtagggaa
agccgatgaa gaacattgtt ccgcagactt gccagatctg 240tagtgacaat gttggcaaga
ctgttgatgg agatcgtttt gtggcttgtg atatttgttc 300attcccagtt tgtcggcctt
gctacgagta tgagaggaaa gatgggaatc aatcttgtcc 360tcagtgcaaa accagataca
agaggctcaa aggttctctt ttgatccttc tgaagtatac 420tgtcttcatt gttcatcgat
agtttatcag tatgttttga attttggatc agattggtat 480ttatagcaat ttgctaattt
ctgattctag gtagtcctgc tattcctggt gataaagacg 540aggatggctt agctgatgaa
ggtactgttg agttcaacta ccctcagaag gagaaaattt 600cagagcggat gcttggttgg
catcttactc gtgggaaggg agaggaaatg ggggaacccc 660agtatgataa agaggtctct
cacaatcatc ttcctcgtct cacgagcaga caagatgtaa 720ggcattgctg ttcattcttc
cctcttaagc attcgcatcc tcacgcaatt tagttttgga 780atctgatttt gtcatttgct
tatttacaga cttcaggaga gttttctgct gcctcacctg 840aacgcctctc tgtatcttct
actatcgctg ggggaaagcg ccttccctat tcatcagatg 900tcaatcaatc acgtaaatat
cctttatttc taactctctc gccaacacat atatttgtac 960ctaggcttct cttttatgtc
aaaactctaa acaataaaat ctgttgttgt cattcacgct 1020gcagcaaata gaaggattgt
ggatcctgtt ggactcggga atgtagcttg gaaggagaga 1080gttgatggct ggaaaatgaa
gcaagagaag aatactggtc ctgtcagcac gcaggctgct 1140tctgaaagag gtggagtaga
tattgatgcc agcacagata tcctagcaga tgaggctctg 1200ctgtgagttc ttgttttgta
atcttgtttg ttctgtcgtg gtgtaccgag cgtttttcct 1260attaagcaat gtcctgatac
tcattttcca attctttatt tattgtacag gaatgacgaa 1320gcgaggcagc ctctgtcaag
gaaagtttca attccttcat cacggatcaa tccttacaga 1380atggttatta tgctgcggct
tgttatcctt tgtctcttct tgcattacca tataacaaac 1440ccagtgccaa atgcctttgc
tctatggctg gtctctgtga tatgtgagat ctggtttgcc 1500ttatcctgga ttttggatca
gtttcccaag tggtttcctg tgaaccgtga aacctacctc 1560gacaggcttg ctttaaggta
agttctattt ccccattctt ctgaagcaat tactcaaagg 1620attgtttgcc tatactgttt
cccattttaa tttgatcatg gtcaattttt gggacagata 1680tgatcgtgaa ggtgagccat
cacagttagc tgctgttgac attttcgtga gtactgttga 1740ccccttgaag gagccacccc
ttgtgacagc caacacagtg ctctctattc tggctgttga 1800ctacccagtt gacaaggtgt
cctgttatgt ttctgatgat ggtgctgcta tgttatcatt 1860tgaatcactt gcagaaacat
cagagtttgc tcgtaaatgg gtaccatttt gcaagaaata 1920tagcatagag cctcgtgcac
cagaatggta ctttgctgcg aaaatagatt acttgaagga 1980taaagttcag acatcatttg
tcaaagatcg tagagctatg aaggtaagtt tgtagtttta 2040gtcatctagt caccctcact
ttgattttag tgtatgctat attgaccttt tattttcttt 2100cagagggaat atgaggaatt
taaaatccga atcaatgcac ttgtttccaa agccctaaaa 2160tgtcctgaag aagggtgggt
tatgcaagat ggcacaccgt ggcctggaaa taatacaagg 2220gaccatccag gaatgatcca
ggtaagaaat tggttttaac tatggaatcg agaatgctct 2280ctctttctct ctagaagttc
attattgaag taccatttgc tgaatgcagg tcttcttagg 2340gcaaaatggt ggacttgatg
cagagggcaa tgagctcccg cgtttggtat atgtttctcg 2400agaaaagcga ccaggattcc
agcaccacaa aaaggctggt gctatgaatg cactggtaag 2460tttctgatct tggatttttg
acttcttcat tctgaccaat ttgttagtct aatctgggta 2520cttttcaaat gaataggtga
gagtttcagc agttcttacc aatggacctt tcatcttgaa 2580tcttgattgt gatcattaca
taaataacag caaagcctta agagaagcaa tgtgcttcct 2640gatggaccca aacctcggga
agcaagtttg ttatgttcag ttcccacaaa gatttgatgg 2700tatcgataag aacgatagat
atgctaatcg taataccgtg ttctttgatg taagtcacac 2760ttacctatac ttgcgtctaa
ttttcttgtt ctttcaaatt gcttttagac acgaatatac 2820attaaactca cagtttcttg
agtttgtcgt aatttttcca tgatatgttt tccagattaa 2880cttgagaggt ttagatggga
ttcaaggacc tgtatatgtc ggaactggat gtgttttcaa 2940cagaacagca ttatacggtt
atgaacctcc aataaaagta aaacacaaga agccaagtct 3000tttatctaag ctctgtggtg
gatcaagaaa gaagaattcc aaagctaaga aagagtcgga 3060caaaaagaaa tcaggcaggc
atactgactc aactgttcct gtattcaacc tcgatgacat 3120agaagaggga gttgaaggta
caactgtttt tatttcttct ttggtttccg ttatacccat 3180atgttgctgt ttgaaatatt
gatccagggg aggggattat ttatagttga cagttgtcta 3240aatagtttcc atactaggta
tctcatcatg tcttaactat ttggcatttg tgaaacttag 3300gtgctggttt tgatgatgaa
aaggcgctct taatgtcgca aatgagcctg gagaagcgat 3360ttggacagtc tgctgttttt
gttgcttcta ccctaatgga aaatggtggt gttcctcctt 3420cagcaactcc agaaaacctt
ctcaaagagg ctatccatgt cattagttgt ggttatgagg 3480ataagtcaga ttggggaatg
gaggtataat ctcatttgaa ctcctacatg aatctgcatt 3540gttctgacat atccactttg
gcattcactt tgtttatatt ttccgctgtc tttcttcaga 3600ttggatggat ctatggttct
gtgacagaag atattctgac tgggttcaaa atgcatgccc 3660gtggatggcg atccatttac
tgcatgccta agcttccagc tttcaagggt tctgctccta 3720tcaatctttc agatcgtctg
aaccaagtgc tgaggtgggc tttaggttca gttgagattc 3780tcttcagtcg gcattgtcct
atatggtatg gttacaatgg gaggctaaaa tttcttgaga 3840ggtttgcgta tgtgaacacc
accatctacc ctatcacctc cattcctctt ctcatgtatt 3900gtacattgcc agccgtttgt
ctcttcacca accagtttat tattcctcag gtttgacacc 3960tctctctgtc tatctatctc
tatctctatc tctatctcta gaacaaacct taattacgtt 4020ctgtttaact gaaaccatgt
tgtgtttgtc atctatttac ggttccaaat cctgatcagc 4080tggttctatt gttcctcttt
tgcagattag taacattgca agtatatggt ttctgtctct 4140ctttctctcc attttcgcca
cgggtatact agaaatgagg tggagtggcg taggcataga 4200cgaatggtgg agaaacgagc
agttttgggt cattggtgga gtatccgctc atttattcgc 4260tgtgtttcaa ggtatcctca
aagtccttgc cggtattgac acaaacttca cagttacctc 4320aaaagcttca gatgaagacg
gagactttgc tgagctctac ttgttcaaat ggacaacact 4380tctgattccg ccaacgacgc
tgctcattgt aaacttagtg ggagttgttg caggagtctc 4440ttatgctatc aacagtggat
accaatcatg gggaccactc tttggtaagt tgttctttgc 4500cttctgggtg attgttcact
tgtacccttt cctcaagggt ttgatgggtc gacagaaccg 4560gactcctacc attgttgtgg
tctggtctgt tctcttggct tctatcttct cgttgttgtg 4620ggttaggatt gatcccttca
ctagccgagt cactggcccg gacattctgg aatgtggaat 4680caactgttga
4690351065PRTArabidopsis
thaliana 35Met Glu Ser Glu Gly Glu Thr Ala Gly Lys Pro Met Lys Asn Ile
Val1 5 10 15Pro Gln Thr
Cys Gln Ile Cys Ser Asp Asn Val Gly Lys Thr Val Asp 20
25 30Gly Asp Arg Phe Val Ala Cys Asp Ile Cys
Ser Phe Pro Val Cys Arg 35 40
45Pro Cys Tyr Glu Tyr Glu Arg Lys Asp Gly Asn Gln Ser Cys Pro Gln 50
55 60Cys Lys Thr Arg Tyr Lys Arg Leu Lys
Gly Ser Pro Ala Ile Pro Gly65 70 75
80Asp Lys Asp Glu Asp Gly Leu Ala Asp Glu Gly Thr Val Glu
Phe Asn 85 90 95Tyr Pro
Gln Lys Glu Lys Ile Ser Glu Arg Met Leu Gly Trp His Leu 100
105 110Thr Arg Gly Lys Gly Glu Glu Met Gly
Glu Pro Gln Tyr Asp Lys Glu 115 120
125Val Ser His Asn His Leu Pro Arg Leu Thr Ser Arg Gln Asp Thr Ser
130 135 140Gly Glu Phe Ser Ala Ala Ser
Pro Glu Arg Leu Ser Val Ser Ser Thr145 150
155 160Ile Ala Gly Gly Lys Arg Leu Pro Tyr Ser Ser Asp
Val Asn Gln Ser 165 170
175Pro Asn Arg Arg Ile Val Asp Pro Val Gly Leu Gly Asn Val Ala Trp
180 185 190Lys Glu Arg Val Asp Gly
Trp Lys Met Lys Gln Glu Lys Asn Thr Gly 195 200
205Pro Val Ser Thr Gln Ala Ala Ser Glu Arg Gly Gly Val Asp
Ile Asp 210 215 220Ala Ser Thr Asp Ile
Leu Ala Asp Glu Ala Leu Leu Asn Asp Glu Ala225 230
235 240Arg Gln Pro Leu Ser Arg Lys Val Ser Ile
Pro Ser Ser Arg Ile Asn 245 250
255Pro Tyr Arg Met Val Ile Met Leu Arg Leu Val Ile Leu Cys Leu Phe
260 265 270Leu His Tyr His Ile
Thr Asn Pro Val Pro Asn Ala Phe Ala Leu Trp 275
280 285Leu Val Ser Val Ile Cys Glu Ile Trp Phe Ala Leu
Ser Trp Ile Leu 290 295 300Asp Gln Phe
Pro Lys Trp Phe Pro Val Asn Arg Glu Thr Tyr Leu Asp305
310 315 320Arg Leu Ala Leu Arg Tyr Asp
Arg Glu Gly Glu Pro Ser Gln Leu Ala 325
330 335Ala Val Asp Ile Phe Val Ser Thr Val Asp Pro Leu
Lys Glu Pro Pro 340 345 350Leu
Val Thr Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp Tyr Pro 355
360 365Val Asp Lys Val Ser Cys Tyr Val Ser
Asp Asp Gly Ala Ala Met Leu 370 375
380Ser Phe Glu Ser Leu Ala Glu Thr Ser Glu Phe Ala Arg Lys Trp Val385
390 395 400Pro Phe Cys Lys
Lys Tyr Ser Ile Glu Pro Arg Ala Pro Glu Trp Tyr 405
410 415Phe Ala Ala Lys Ile Asp Tyr Leu Lys Asp
Lys Val Gln Thr Ser Phe 420 425
430Val Lys Asp Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu Phe Lys Ile
435 440 445Arg Ile Asn Ala Leu Val Ser
Lys Ala Leu Lys Cys Pro Glu Glu Gly 450 455
460Trp Val Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg
Asp465 470 475 480His Pro
Gly Met Ile Gln Val Phe Leu Gly Gln Asn Gly Gly Leu Asp
485 490 495Ala Glu Gly Asn Glu Leu Pro
Arg Leu Val Tyr Val Ser Arg Glu Lys 500 505
510Arg Pro Gly Phe Gln His His Lys Lys Ala Gly Ala Met Asn
Ala Leu 515 520 525Val Arg Val Ser
Ala Val Leu Thr Asn Gly Pro Phe Ile Leu Asn Leu 530
535 540Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu
Arg Glu Ala Met545 550 555
560Cys Phe Leu Met Asp Pro Asn Leu Gly Lys Gln Val Cys Tyr Val Gln
565 570 575Phe Pro Gln Arg Phe
Asp Gly Ile Asp Lys Asn Asp Arg Tyr Ala Asn 580
585 590Arg Asn Thr Val Phe Phe Asp Ile Asn Leu Arg Gly
Leu Asp Gly Ile 595 600 605Gln Gly
Pro Val Tyr Val Gly Thr Gly Cys Val Phe Asn Arg Thr Ala 610
615 620Leu Tyr Gly Tyr Glu Pro Pro Ile Lys Val Lys
His Lys Lys Pro Ser625 630 635
640Leu Leu Ser Lys Leu Cys Gly Gly Ser Arg Lys Lys Asn Ser Lys Ala
645 650 655Lys Lys Glu Ser
Asp Lys Lys Lys Ser Gly Arg His Thr Asp Ser Thr 660
665 670Val Pro Val Phe Asn Leu Asp Asp Ile Glu Glu
Gly Val Glu Gly Ala 675 680 685Gly
Phe Asp Asp Glu Lys Ala Leu Leu Met Ser Gln Met Ser Leu Glu 690
695 700Lys Arg Phe Gly Gln Ser Ala Val Phe Val
Ala Ser Thr Leu Met Glu705 710 715
720Asn Gly Gly Val Pro Pro Ser Ala Thr Pro Glu Asn Leu Leu Lys
Glu 725 730 735Ala Ile His
Val Ile Ser Cys Gly Tyr Glu Asp Lys Ser Asp Trp Gly 740
745 750Met Glu Ile Gly Trp Ile Tyr Gly Ser Val
Thr Glu Asp Ile Leu Thr 755 760
765Gly Phe Lys Met His Ala Arg Gly Trp Arg Ser Ile Tyr Cys Met Pro 770
775 780Lys Leu Pro Ala Phe Lys Gly Ser
Ala Pro Ile Asn Leu Ser Asp Arg785 790
795 800Leu Asn Gln Val Leu Arg Trp Ala Leu Gly Ser Val
Glu Ile Leu Phe 805 810
815Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Asn Gly Arg Leu Lys Phe
820 825 830Leu Glu Arg Phe Ala Tyr
Val Asn Thr Thr Ile Tyr Pro Ile Thr Ser 835 840
845Ile Pro Leu Leu Met Tyr Cys Thr Leu Pro Ala Val Cys Leu
Phe Thr 850 855 860Asn Gln Phe Ile Ile
Pro Gln Ile Ser Asn Ile Ala Ser Ile Trp Phe865 870
875 880Leu Ser Leu Phe Leu Ser Ile Phe Ala Thr
Gly Ile Leu Glu Met Arg 885 890
895Trp Ser Gly Val Gly Ile Asp Glu Trp Trp Arg Asn Glu Gln Phe Trp
900 905 910Val Ile Gly Gly Val
Ser Ala His Leu Phe Ala Val Phe Gln Gly Ile 915
920 925Leu Lys Val Leu Ala Gly Ile Asp Thr Asn Phe Thr
Val Thr Ser Lys 930 935 940Ala Ser Asp
Glu Asp Gly Asp Phe Ala Glu Leu Tyr Leu Phe Lys Trp945
950 955 960Thr Thr Leu Leu Ile Pro Pro
Thr Thr Leu Leu Ile Val Asn Leu Val 965
970 975Gly Val Val Ala Gly Val Ser Tyr Ala Ile Asn Ser
Gly Tyr Gln Ser 980 985 990Trp
Gly Pro Leu Phe Gly Lys Leu Phe Phe Ala Phe Trp Val Ile Val 995
1000 1005His Leu Tyr Pro Phe Leu Lys Gly
Leu Met Gly Arg Gln Asn Arg 1010 1015
1020Thr Pro Thr Ile Val Val Val Trp Ser Val Leu Leu Ala Ser Ile
1025 1030 1035Phe Ser Leu Leu Trp Val
Arg Ile Asp Pro Phe Thr Ser Arg Val 1040 1045
1050Thr Gly Pro Asp Ile Leu Glu Cys Gly Ile Asn Cys 1055
1060 1065364779DNAArabidopsis thaliana
36atgaacaccg gtggtcggtt aatcgccggt tctcacaaca ggaatgagtt tgtcctcatt
60aatgccgatg agaatgcccg agtatgtttc tcctcttctt ttgtttccaa ttctctgtct
120tttgatctgt gtttctctat ctctgttcaa aagtctctga ctttttttac ttttcttgtg
180gatctggctc ttaccactgc aaatcaatta agatttaggg tttttagtac tagtattaag
240attacgtacc cttgtagcta attttatcaa gaattgattg tgtcggtggg atggattttt
300ccggatttga cttgtcttaa ttctccaatt taagagattt cttcaattgc aattatgaat
360ctatcaatgt gaagagtaat aattatgtta ttgggttact ttgatctggt gtgagatcca
420gtctgatagt gtcactacta tgatctgatg tatttaactc tactgttttg tgcagataag
480atcagtccaa gagctgagtg gacagacatg tcaaatctgc agagatgaga tcgaattgac
540tgttgatgga gaaccgtttg tggcatgtaa cgaatgtgca ttccctgtgt gtagaccttg
600ctatgagtac gaaagacgag aaggcaatca agcttgtcca cagtgcaaaa cccgtttcaa
660acgtcttaaa ggttcgttgt ttgttagacc aaatttcttt ggttttttgt gaatgtagaa
720gattttctga tttgttcggc ctatgttgtt gtttgttagg aagtccaaga gttgaaggtg
780atgaagagga agatgacatt gatgatttag acaatgagtt tgagtatgga aataatggga
840ttggatttga tcaggtttct gaaggtatgt caatctctcg tcgcaactcc ggtttcccac
900aatctgattt ggattcagct ccacctggct ctcagattcc attgctgact tacggcgacg
960aggtaaaaat ctcagaatgt atccacattg tataacccat cttcagtaat tggctcactc
1020agatttctct tttgttttat tacaggacgt tgagatttct tctgatagac atgctcttat
1080tgttcctcct tcacttggtg gtcatggcaa tagagttcat cctgtttctc tttctgaccc
1140gaccgtggct gcacatccaa ggcctatggt acctcagaaa gatcttgcgg tttatggtta
1200tggaagtgtc gcttggaaag atcggatgga ggaatggaag agaaagcaga atgagaaact
1260tcaggttgtt aggcatgaag gagatcctga ttttgaagat ggtgatgatg ctgattttcc
1320aatgtaaggc aaagaatata attttttttg ttgatgtctt gttccgttgc agtgatattt
1380atcaagcctt ttttttccat tttaggatgg atgagggaag gcagccattg tctaggaaga
1440taccaatcaa atcgagcaag ataaatcctt accggatgtt aattgtgcta cgtcttgtga
1500ttcttggtct cttctttcac taccgtattc ttcaccccgt caaagatgca tatgctttgt
1560ggcttatttc tgttatatgt gagatatggt ttgctgtttc atgggttctt gatcagttcc
1620ctaaatggta ccctatcgag cgagaaacgt acttggaccg actctcatta aggtacttac
1680atcttgtggg ttattacact tggaaatgtt aaaactttgt tttggggata taatccttat
1740tttttttgtt tgcagatatg agaaagaagg gaaaccgtcg ggactatccc ctgtggatgt
1800atttgttagt acagtggatc cattgaaaga gcctccgctt attactgcaa atactgtctt
1860gtctattctt gctgttgatt atcctgtcga taaggttgct tgttacgtat ctgatgatgg
1920tgctgctatg cttactttcg aagctctttc tgagaccgct gaattcgcaa ggaaatgggt
1980tcctttctgc aagaaatatt gtattgagcc tcgtgctccc gaatggtatt tctgccataa
2040aatggactac ttgaagaata aagttcatcc cgcatttgtt agggagcggc gagccatgaa
2100ggttactagt tcttactttt ttataaattt gatttgatga gaaaagtttt ggtctaattg
2160attcttgctt tagaaaaaaa aaattcatga gaaaagttat caatcttttg ttatatgggc
2220tcttatgaaa gaagatggtg gctttgaaaa ttgatttgaa agattgtgtg ttttactggt
2280tttgacagag agattatgaa gaattcaaag taaagatcaa tgctttagta gcaacagcac
2340agaaagtgcc tgaggatggt tggactatgc aagacggtac accttggccc ggtaatagtg
2400tgcgagatca tcctggcatg attcaggtga gtttcaaatg cttcttattt ctgaaaagcc
2460ttcttatgtg ttgtccttca aaatttaatt atactttgtt ttcttgttaa aggtcttcct
2520tggaagtgac ggtgttcgtg atgtcgaaaa caacgagttg cctcgattag tttacgtttc
2580tcgtgagaag agacccggat ttgatcacca taagaaggct ggagctatga attccctggt
2640aaatgatata ctttttaaag ctctaaacct tcttctttgt aaattacgtc ttgccattta
2700ttgaaatggt tcctgactct tgatttcatc tacaaaactt ttgttgaaga tacgagtctc
2760tggggttcta tcaaatgctc cttaccttct gaatgtcgat tgtgatcact acatcaacaa
2820tagcaaagct cttagagaag caatgtgttt catgatggat cctcagtcag gaaagaaaat
2880ctgttatgtt cagttccctc aaaggttcga tgggattgat aggcacgatc gatactcaaa
2940tcgcaatgtt gtgttctttg atgtaagtac agccaccact ttcctattgt atcccttttt
3000cttgagattt ctgtagaata ccaactaatg aatctttatt tacagatcaa tatgaaaggt
3060ttggatgggc tacaagggcc tatatacgtc ggtacaggtt gtgttttcag gaggcaagcg
3120ctttacggat ttgatgcacc gaagaagaag aagggcccac gtaagacatg caattgctgg
3180ccaaaatggt gtctcctatg ttttggttca agaaagaatc gtaaagcaaa gacagtggct
3240gcggataaga agaagaagaa tagggaagcg tcaaagcaga tccacgcatt agaaaatatc
3300gaagagggcc gcgtcactaa aggtatcata caaatcctgt ttgttgttaa actctttcgt
3360tagtcggtgc attttactaa aaaaataaaa tttaaaaaac attctaggtt ctaacgtaga
3420acagtcaacc gaggcaatgc aaatgaagtt ggagaagaaa tttgggcagt ctcctgtatt
3480tgttgcatct gcgcgtatgg agaatggtgg gatggctaga aacgcaagcc cggcttgtct
3540gcttaaagaa gccatccaag tcattagttg cggatatgaa gataaaactg aatggggaaa
3600agaggtaagc agccggtttt aaacctttgt tgtgtttatt caatcaattc ttgattttga
3660tgatgacctt gtgaaaaaaa tctcagattg ggtggatcta tggttctgtt accgaagata
3720ttcttacggg ttttaagatg cattctcatg gttggagatc tgtttattgt acaccaaagt
3780tagcggcttt caaaggatca gctccaatca atctttcgga tcgtctccat caagttcttc
3840gatgggcgct tgggtcggtt gagattttct tgagtaggca ttgtcctatt tggtatggtt
3900atggaggtgg gttgaaatgg cttgagcggt tgtcctacat taactctgtg gtttacccgt
3960ggacctctct accgctcatc gtttactgtt ctctccctgc catctgtctt ctcactggaa
4020aattcatcgt tcccgaggta aaacaatcat cttgagttct caaaatatga atctttattt
4080cacgttttgt gcttattcat tttccttgcc actgggggtt aaaagtatca tatgaatctt
4140tattccaagt tgtgtgtttt aagaccggaa aacgattctt gttccttctt tttccagatt
4200agcaactatg cgagtatcct cttcatggcg ctcttctcgt cgattgcaat aacgggtatt
4260ctcgagatgc aatggggcaa agttgggatc gatgattggt ggagaaacga acagttttgg
4320gtcattggag gtgtttctgc gcatctgttt gctctcttcc aaggtctcct caaggttctt
4380gctggtgtcg acactaactt cacagtcaca tcaaaagcag ctgatgatgg agagttctct
4440gacctttacc tcttcaaatg gacttcactt ctcatccctc caatgactct actcatcata
4500aacgtcattg gagtcatagt cggagtcttt gatgccatca gcaatggata cgactcgtgg
4560ggaccgcttt tcggaagact gttctttgca ctttgggtca tcattcatct ttacccgttc
4620cttaaaggtt tgcttgggaa acaagataga atgccaacca ttattgtcgt ctggtccatc
4680ctcctggcct cgattcttac acttctttgg gtccgggtta atccgtttgt ggcgaaaggc
4740ggtcctattc tcgagatctg tggtttagac tgcttgtga
4779371084PRTArabidopsis thaliana 37Met Asn Thr Gly Gly Arg Leu Ile Ala
Gly Ser His Asn Arg Asn Glu1 5 10
15Phe Val Leu Ile Asn Ala Asp Glu Asn Ala Arg Ile Arg Ser Val
Gln 20 25 30Glu Leu Ser Gly
Gln Thr Cys Gln Ile Cys Arg Asp Glu Ile Glu Leu 35
40 45Thr Val Asp Gly Glu Pro Phe Val Ala Cys Asn Glu
Cys Ala Phe Pro 50 55 60Val Cys Arg
Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Asn Gln Ala65 70
75 80Cys Pro Gln Cys Lys Thr Arg Phe
Lys Arg Leu Lys Gly Ser Pro Arg 85 90
95Val Glu Gly Asp Glu Glu Glu Asp Asp Ile Asp Asp Leu Asp
Asn Glu 100 105 110Phe Glu Tyr
Gly Asn Asn Gly Ile Gly Phe Asp Gln Val Ser Glu Gly 115
120 125Met Ser Ile Ser Arg Arg Asn Ser Gly Phe Pro
Gln Ser Asp Leu Asp 130 135 140Ser Ala
Pro Pro Gly Ser Gln Ile Pro Leu Leu Thr Tyr Gly Asp Glu145
150 155 160Asp Val Glu Ile Ser Ser Asp
Arg His Ala Leu Ile Val Pro Pro Ser 165
170 175Leu Gly Gly His Gly Asn Arg Val His Pro Val Ser
Leu Ser Asp Pro 180 185 190Thr
Val Ala Ala His Pro Arg Pro Met Val Pro Gln Lys Asp Leu Ala 195
200 205Val Tyr Gly Tyr Gly Ser Val Ala Trp
Lys Asp Arg Met Glu Glu Trp 210 215
220Lys Arg Lys Gln Asn Glu Lys Leu Gln Val Val Arg His Glu Gly Asp225
230 235 240Pro Asp Phe Glu
Asp Gly Asp Asp Ala Asp Phe Pro Met Met Asp Glu 245
250 255Gly Arg Gln Pro Leu Ser Arg Lys Ile Pro
Ile Lys Ser Ser Lys Ile 260 265
270Asn Pro Tyr Arg Met Leu Ile Val Leu Arg Leu Val Ile Leu Gly Leu
275 280 285Phe Phe His Tyr Arg Ile Leu
His Pro Val Lys Asp Ala Tyr Ala Leu 290 295
300Trp Leu Ile Ser Val Ile Cys Glu Ile Trp Phe Ala Val Ser Trp
Val305 310 315 320Leu Asp
Gln Phe Pro Lys Trp Tyr Pro Ile Glu Arg Glu Thr Tyr Leu
325 330 335Asp Arg Leu Ser Leu Arg Tyr
Glu Lys Glu Gly Lys Pro Ser Gly Leu 340 345
350Ser Pro Val Asp Val Phe Val Ser Thr Val Asp Pro Leu Lys
Glu Pro 355 360 365Pro Leu Ile Thr
Ala Asn Thr Val Leu Ser Ile Leu Ala Val Asp Tyr 370
375 380Pro Val Asp Lys Val Ala Cys Tyr Val Ser Asp Asp
Gly Ala Ala Met385 390 395
400Leu Thr Phe Glu Ala Leu Ser Glu Thr Ala Glu Phe Ala Arg Lys Trp
405 410 415Val Pro Phe Cys Lys
Lys Tyr Cys Ile Glu Pro Arg Ala Pro Glu Trp 420
425 430Tyr Phe Cys His Lys Met Asp Tyr Leu Lys Asn Lys
Val His Pro Ala 435 440 445Phe Val
Arg Glu Arg Arg Ala Met Lys Arg Asp Tyr Glu Glu Phe Lys 450
455 460Val Lys Ile Asn Ala Leu Val Ala Thr Ala Gln
Lys Val Pro Glu Asp465 470 475
480Gly Trp Thr Met Gln Asp Gly Thr Pro Trp Pro Gly Asn Ser Val Arg
485 490 495Asp His Pro Gly
Met Ile Gln Val Phe Leu Gly Ser Asp Gly Val Arg 500
505 510Asp Val Glu Asn Asn Glu Leu Pro Arg Leu Val
Tyr Val Ser Arg Glu 515 520 525Lys
Arg Pro Gly Phe Asp His His Lys Lys Ala Gly Ala Met Asn Ser 530
535 540Leu Ile Arg Val Ser Gly Val Leu Ser Asn
Ala Pro Tyr Leu Leu Asn545 550 555
560Val Asp Cys Asp His Tyr Ile Asn Asn Ser Lys Ala Leu Arg Glu
Ala 565 570 575Met Cys Phe
Met Met Asp Pro Gln Ser Gly Lys Lys Ile Cys Tyr Val 580
585 590Gln Phe Pro Gln Arg Phe Asp Gly Ile Asp
Arg His Asp Arg Tyr Ser 595 600
605Asn Arg Asn Val Val Phe Phe Asp Ile Asn Met Lys Gly Leu Asp Gly 610
615 620Leu Gln Gly Pro Ile Tyr Val Gly
Thr Gly Cys Val Phe Arg Arg Gln625 630
635 640Ala Leu Tyr Gly Phe Asp Ala Pro Lys Lys Lys Lys
Gly Pro Arg Lys 645 650
655Thr Cys Asn Cys Trp Pro Lys Trp Cys Leu Leu Cys Phe Gly Ser Arg
660 665 670Lys Asn Arg Lys Ala Lys
Thr Val Ala Ala Asp Lys Lys Lys Lys Asn 675 680
685Arg Glu Ala Ser Lys Gln Ile His Ala Leu Glu Asn Ile Glu
Glu Gly 690 695 700Arg Val Thr Lys Gly
Ser Asn Val Glu Gln Ser Thr Glu Ala Met Gln705 710
715 720Met Lys Leu Glu Lys Lys Phe Gly Gln Ser
Pro Val Phe Val Ala Ser 725 730
735Ala Arg Met Glu Asn Gly Gly Met Ala Arg Asn Ala Ser Pro Ala Cys
740 745 750Leu Leu Lys Glu Ala
Ile Gln Val Ile Ser Cys Gly Tyr Glu Asp Lys 755
760 765Thr Glu Trp Gly Lys Glu Ile Gly Trp Ile Tyr Gly
Ser Val Thr Glu 770 775 780Asp Ile Leu
Thr Gly Phe Lys Met His Ser His Gly Trp Arg Ser Val785
790 795 800Tyr Cys Thr Pro Lys Leu Ala
Ala Phe Lys Gly Ser Ala Pro Ile Asn 805
810 815Leu Ser Asp Arg Leu His Gln Val Leu Arg Trp Ala
Leu Gly Ser Val 820 825 830Glu
Ile Phe Leu Ser Arg His Cys Pro Ile Trp Tyr Gly Tyr Gly Gly 835
840 845Gly Leu Lys Trp Leu Glu Arg Leu Ser
Tyr Ile Asn Ser Val Val Tyr 850 855
860Pro Trp Thr Ser Leu Pro Leu Ile Val Tyr Cys Ser Leu Pro Ala Ile865
870 875 880Cys Leu Leu Thr
Gly Lys Phe Ile Val Pro Glu Ile Ser Asn Tyr Ala 885
890 895Ser Ile Leu Phe Met Ala Leu Phe Ser Ser
Ile Ala Ile Thr Gly Ile 900 905
910Leu Glu Met Gln Trp Gly Lys Val Gly Ile Asp Asp Trp Trp Arg Asn
915 920 925Glu Gln Phe Trp Val Ile Gly
Gly Val Ser Ala His Leu Phe Ala Leu 930 935
940Phe Gln Gly Leu Leu Lys Val Leu Ala Gly Val Asp Thr Asn Phe
Thr945 950 955 960Val Thr
Ser Lys Ala Ala Asp Asp Gly Glu Phe Ser Asp Leu Tyr Leu
965 970 975Phe Lys Trp Thr Ser Leu Leu
Ile Pro Pro Met Thr Leu Leu Ile Ile 980 985
990Asn Val Ile Gly Val Ile Val Gly Val Phe Asp Ala Ile Ser
Asn Gly 995 1000 1005Tyr Asp Ser
Trp Gly Pro Leu Phe Gly Arg Leu Phe Phe Ala Leu 1010
1015 1020Trp Val Ile Ile His Leu Tyr Pro Phe Leu Lys
Gly Leu Leu Gly 1025 1030 1035Lys Gln
Asp Arg Met Pro Thr Ile Ile Val Val Trp Ser Ile Leu 1040
1045 1050Leu Ala Ser Ile Leu Thr Leu Leu Trp Val
Arg Val Asn Pro Phe 1055 1060 1065Val
Ala Lys Gly Gly Pro Ile Leu Glu Ile Cys Gly Leu Asp Cys 1070
1075 1080Leu
User Contributions:
Comment about this patent or add new information about this topic: