Patent application title: Method for Producing Recombinant Glycoproteins Having Beta1,2-Bound Xylose Residues
Inventors:
Josef Glössl (Vienna, AT)
Josef Glössl (Vienna, AT)
Richard Strasser (Vienna, AT)
Jan Mucha (Bratislava, SK)
Lukas Mach (Vienna, AT)
Friedrich Altmann (Vienna, AT)
Iain B. Wilson (Vienna, AT)
Herta Steinkellner (Vienna, AT)
IPC8 Class: AC12P2100FI
USPC Class:
800278
Class name: Multicellular living organisms and unmodified parts thereof and related processes method of introducing a polynucleotide molecule into or rearrangement of genetic material within a plant or plant part
Publication date: 2012-12-27
Patent application number: 20120331578
Abstract:
A DNA molecule is provided which comprises a sequence according to SEQ ID
NO: 8 having an open reading frame from base pair 227 to base pair 1831
or having at least 50% homology to the above-indicated sequence, or
hybridizing with the above-indicated sequence under stringent conditions,
or comprising a sequence which has degenerated to the above-indicated DNA
sequence because of the genetic code, the sequence coding for a plant
protein having β1,2-xylosyltransferase activity or being
complementary thereto.Claims:
1-32. (canceled)
33. A method of producing a recombinant glycoprotein, comprising expressing the recombinant glycoprotein in a plant or plant cell in which production of an endogenous β1,2-xylosyltransferase is suppressed or completely stopped, wherein said endogenous β1,2-xylosyltransferase can be identified by sequence comparison with the β1,2-xylosyltransferase sequence according to SEQ ID NO:8 with an open reading frame from base pair 227 to base pair 1831.
34. The method of claim 1, wherein said endogenous β1,2-xylosyltransferase is identified by sequence comparison with the β1,2-xylosyltransferase sequence according to SEQ ID NO:8 with an open reading frame from base pair 227 to base pair 1831 bp the program fastDB.
35. The method according to claim 1, wherein the recombinant glycoprotein is a human protein.
36. The method of claim 1, wherein the expression of the β1,2-xylosyltransferase is suppressed or completely blocked by a knock-out mutation of the endogenous β1,2-xylosyltransferase gene in said plant or plant cell.
37. The method of claim 1, wherein the expression of the β1,2-xylosyltransferase is suppressed or completely blocked by antisense inhibition in said plant or plant cell.
38. The method of claim 1, wherein the expression of the β1,2-xylosyltransferase is suppressed or completely blocked by transfection with a polynucleotide comprising a sequence of at least 50 nucleotides that is complementary to the sequence coding for the endogenous β1,2-xylosyltransferase.
39. The method of claim 1, wherein the endogenous β1,2-xylosyltransferase activity in said plant or plant cell is less than 50% of the β1,2-xylosyltransferase occurring in natural plants or plant cells without β1,2-xylosyltransferase suppression.
40. A method of producing a recombinant glycoprotein, comprising: (a) identifying an endogenous β1,2-xylosyltransferase in a plant or plant cell by (i) sequence comparison with the β1,2-xylosyltransferase sequence according to SEQ ID NO:8 with an open reading frame from base pair 227 to base pair 1831, and (ii) assay for β1,2-xylosyltransferase activity; (b) suppressing or completely stopping said endogenous β1,2-xylosyltransferase production in the plant or plant cells; and (c) expressing the recombinant glycoprotein in the plant or plant cell in which production of an endogenous β1,2-xylosyltransferase is suppressed or completely stopped.
41. A method of producing a plant or plant cell with reduced or no endogenous β1,2-xylosyltransferase activity, comprising: (a) identifying an endogenous β1,2-xylosyltransferase by (i) sequence comparison with the β1,2-xylosyltransferase sequence according to SEQ ID NO:8 with an open reading frame from base pair 227 to base pair 1831 and (ii) assay for β1,2-xylosyltransferase activity; and (b) suppressing or completely stopping said endogenous β1,2-xylosyltransferase production.
42. The method of claim 41, wherein said endogenous β1,2-xylosyltransferase is identified by sequence comparison with the β1,2-xylosyltransferase sequence according to SEQ ID NO:8 with an open reading frame from base pair 227 to base pair 1831 bp the program fastDB.
43. The method of claim 41, wherein the expression of the β1,2-xylosyltransferase is suppressed or completely blocked by a knock-out mutation of the endogenous β1,2-xylosyltransferase gene in said plant or plant cell.
44. The method of claim 41, wherein the expression of the β1,2-xylosyltransferase is suppressed or completely blocked by antisense inhibition in said plant or plant cell.
45. The method of claim 41, wherein the expression of the β1,2-xylosyltransferase is suppressed or completely blocked by transfection with a polynucleotide comprising a sequence of at least 50 nucleotides that is complementary to the sequence coding for the endogenous β1,2-xylosyltransferase.
46. The method of claim 41, wherein the endogenous β1,2-xylosyltransferase activity in said plant or plant cell is less than 50% of the β1,2-xylosyltransferase occurring in natural plants or plant cells without β1,2-xylosyltransferase suppression.
Description:
[0001] The invention relates to polynucleotides coding for a
β1,2-xylosyltransferase. Furthermore, the invention relates to
vectors comprising these polynucleotides, recombinant host cells, plants
and insects transfected with the polynucleotides or with DNA derived
therefrom, respectively, as well as to glycoproteins produced in these
systems.
[0002] Glycoproteins exhibit a variety and complexity of carbohydrate units, the composition and arrangement of the carbohydrates being characteristic of different organisms. The oligosaccharide units of the glycoproteins have a number of tasks, e.g. they are important in regulating metabolism, they are involved in transmitting cell-cell interactions, they determine the circulation periods of proteins in circulation, and they are decisive for recognizing epitopes in antigen-antibody reactions.
[0003] The glycosylation of glycoproteins starts in the endoplasmatic reticulum (ER), where the oligosaccharides are either bound to asparagine side chains by N-glycosidic bonds or to serine or threonine side chains by O-glycosidic bonds. The N-bound oligosaccharides contain a common core from a penta-saccharide unit which consists of three mannose and two N-acetyl glucose amine residues. To modify the carbohydrate units further, the proteins are transported from the ER to the Golgi complex. The structure of the N-bound oligosaccharide units of glycoproteins is determined by their conformation and by the composition of the glycosyl transferases of the Golgi compartments in which they are processed.
[0004] It has been shown that the core pentasaccharide unit of the N-glycans of some plants is substituted by β1,2-bound xylose and α1,3-bound fucose (Lerouge et al., 1998, Plant Mol. Biol. 38, 31-48; Rayon et al., 1998, J. Exp. Bot. 49, 1463-1472). The hepta-saccharide "MMXF3" constitutes the main oligosaccharide type in plants (Kurosaka et al., 1991, J. Biol. Chem., 266, 4168-4172; Wilson and Altmann, 1998, Glycoconj. J. 15, 1055-1070). These structures are also termed complex N-glycans or mannose-deficient or truncated N-glycans, respectively. The α-mannosyl residues may be further replaced by GlcNAc, to which galactose and fucose are bound so that a structure is prepared which corresponds to the human Lewis a-epitope (Melo et al., 1997, FEBS Lett 415, 186-191; Fitchette-Laine et al., 1997, Plant J. 12, 1411-1417).
[0005] Neither β1,2-xylose nor the α1,3-bound fucose exist in mammalian glycoproteins. It has been found that the β1,2-xylose together with α1,3-fucose plays an important role in the epitope recognition of antibodies which are directed against plant N-bound oligosaccharides, and thereby trigger immune reactions in human or animal bodies against these oligosaccharides (Faye et al., 1993, Anal. Biochem. 209, 104-108). The β1,2-xylose and/or α1,3-fucose containing N-glycans furthermore seem to be one of the main causes for the wide-spread allergic cross reactivity between various plant and insect allergens and is also termed "cross-reactive carbohydrate determinant" (CCD). Due to the frequent occurrence of immunological cross reactions, the CCDs moreover mask allergy diagnoses.
[0006] The immunological reactions triggered in the human body by plant proteins are the main problem in the medicinal use of recombinant human proteins produced in plants. To circumvent this problem, β1,2-xylosylation together with α1,3-fucosylation would have to be prevented. According to a study, a mutant of the plant Arabidopsis thaliana was isolated in which the activity of N-acetyl-glucosaminyl transferase I, the first enzyme in the biosynthesis of complex glycans, is missing. The biosynthesis of the complex glycoproteins in this mutant thus is disturbed. Nevertheless, these mutant plants are capable of developing normally under certain conditions (A. Schaewen et al, 1993, Plant Physiol. 102; 1109-1118).
[0007] To block specifically the transfer of the β1,2-xylose to an oligosaccharide without also interfering in other glycosylation steps, solely that enzyme would have to be inactivated which is directly responsible for this specific glycosylation, i.e. the β1,2-xylosyltransferase. This transferase which only occurs in plants and some non-vertebrate animal species, e.g. in Schistosoma sp. (Khoo et al., 1997, Glycobiology 7, 663-677) and snail (e.g. Mulder et al., 1995, Eur. J. Biochem. 232, 272-283), yet not in human beings or in other vertebrates, would have to be inactivated on purpose or suppressed so that human proteins which are produced in plants or in plant cells, respectively, would no longer contain this immune-reaction-triggering epitope, as has been the case so far.
[0008] β1,2-xylosyltransferase transfers the D-xylose from UDP-xylose to the beta-linked mannose of plant N-linked oligosaccharides.
[0009] This enzyme was purified from soybean microsomes in 1997; Zeng et al.: J. Biol. Chem., 272, 31340-31347, 1997). According to this article, the best acceptor for xylose transfer was GlcNAc2Man3GlcNAc2-T, but GlcNAc1Man3GlcNAc2, with the GLcNAc on the 3-branch, was also a good acceptor. Furthermore, a number of other N-linked oligosaccharides were poor acceptors, especially those with galactose units at the nonreducing termini.
[0010] In the article by Rayon et al. (Plant Physiology, 1999, 119, 725-733) it is mentioned that Arabidopsis proteins are N-glycosylated by high-mannose-type N-glycans and by xylose- and fucose containing oligosaccharides. TEZUKA et al. (Eur. J. Biochem. 203, 401-413 (1992)) measured the activities of different enzymes, for example β1,2-xylosyltransferase in the Golgi fraction of suspension-cultured cells of sycamore. They demonstrated that xylose was transferred onto the inner mannose by β1,2-xylosyltransferase. Furthermore, they mentioned that xylose containing oligosaccharides are widely distributed throughout the plant kingdom although xylose containing N-linked oligosaccharides were also found in glycoproteins from gastropods and Chlorophyceae.
[0011] For the specific supression or inactivation of proteins it is best to carry this out at the level of transcription and translation step, respectively. For this it is necessary to isolate and sequence the nucleotide sequence which codes for the active protein.
[0012] As mentioned above, the soybean β1,2-xylosyltransferase was isolated and purified in 1997. Only a part of the xylosyltransferase cDNA has been isolated, s. WO99/29835 A1; SEQ ID NO 6 and 7), however, the complete cDNA, which codes for the active protein could not be isolated and characterized so far. The reason why the nucleotide sequence has not been identified so far could be major problems in the procedure due to very low abundance of the mRNA which codes for the xylosyltransferase in the organisms, as for example soybeans. Until now, although several groups have tried to identify the entire nucleotide sequence of this gene, usually from soybeans, it was not possible to produce full length cDNA which corresponds to the xylosyltransferase mRNA with the help of conventional methods known to be effective in usual cases, for example with the help of the RACE-amplification (rapid amplification of cDNA ends). With this method unknown sequences are amplified with the help of specific amplification primers. Potential reasons for unsuccessful 5'-RACE experiments can be an inadequate choice of specific PCR primers as well as the presence of reverse transcriptase-inhibiting components during cDNA synthesis.
[0013] One problem of the isolated soybean β1,2-xylosyl transferase is that its solubility and activity depends on the presence of detergents.
[0014] Additionally to the problem of the extremely low concentration of β1,2-xylosyltransferase mRNA there is furthermore the problem that the secondary structure at the 5'-end of the RNA seems to hinder the amplification of this region. In these cases, the RACE amplification which is in itself a sensitive method, does not result in the correct and complete xylosyltransferase-cDNA sequence.
[0015] It is of course also very likely that the mRNA and the cDNA derived thereof, beside the fact that it is present only in very low concentrations, recombines and mutates vary easily in the course of the various manipulations. For these and other potential reasons the cloning and the expression of this specific gene was impossible until now.
[0016] It is an object of the present invention to clone and to sequence the whole gene which codes for a plant β1,2-xylosyltransferase, and to prepare vectors comprising this gene or an altered DNA or a DNA derived therefrom, to transfect plants as well as cells thereof with one of these vectors, to produce glycoproteins that do not contain the normally occurring β1,2-xylose, as well as to provide corresponding methods therefor.
[0017] A further object is the production of large quantities of purified recombinant enzyme in order to allow in vitro synthesis of homogenous N-glycans or glycoconjugates containing β1,2-xylose. This will aid the further elucidation of the role of β1,2-xylose in the immunogenicity and allergenicity of plant glycoproteins.
[0018] The object according to the invention is achieved by a DNA molecule comprising a sequence according to SEQ ID NO: 8 with an open reading frame from base pair 227 to base pair 1831 or being at least 50% homologous to the above sequence or hybridizing with the above-indicated sequence under stringent conditions, or comprising a sequence which has degenerated to the above DNA sequence due to the genetic code, the sequence coding for a plant protein which has β1,2-xylosyltransferase activity or is complementary thereto. This complete sequence which has never been described is particularly useful for experiments, analysis and production processes which concern the β1,2-xylosyltransferase activity. This sequence can be used especially for the inactivation or suppression of the β1,2-xylosyltransferase as well as for overexpression and production of the recombinant enzyme.
[0019] Upon searching GenBank+EMBL+DDBJ+PDB databases using the soybean xylosyltransferase-derived peptides as mentioned above several polypeptide sequences (from Arabidopsis and Drosophila) with significant homologies were retrieved. However, these sequences were otherwise unrelated to each other as well as to the β1,2-xylosyltransferase sequence finally identified. Successful retrieval of candidate sequence for β1,2-xylosyltransferase was possible only by properly assembling three soybean β1,2-xylosyl transferase-derived peptide sequences to a single sequence. All search strategies we used according to the present state of the art (i.e. using the peptide sequences separately or in combination with each other) did not lead to a successful retrieval of the correct sequence for β1,2-xylosyltransferase.
[0020] The isolation and purification of this gene was achieved by searching in the DDBJ+GenBank+EMBL+PDB-databases corresponding sequences to three known peptides (used as assembled peptides) of the soybean xylosyltransferase (Patent WO99/29835 A1, SEQ ID NO: 3 and 5). It was found that one DNA sequence of Arabidopsis thaliana which has not yet been assigned to any protein before showed homology to two of the three peptides. With the help of the gene-finder program a predicted protein sequence was found according to which sequence specific primers for a RT-PCR were designed. It was possible to produce a first strand cDNA corresponding to the mRNA of the A. thaliana-xylosyltransferase gene after which the first strand cDNA was subjected to a PCR using the specifically designed primers. The reason for the successful production of A. thaliana xylosyltransferase-cDNA may be on the one hand that the xylosyltransferase-mRNA of A. thaliana is less problematic compared to other plant species, on the other hand the PCR was performed with optimally designed gene-specific primers.
[0021] The open reading frame of the SEQ ID NO: 8 codes for a protein with 534 amino acids and with a theoretical molecular weight of 60.2 kDa, a transmembrane portion presumably being present in the region between Ile11 and Phe29. The calculated pI value of the encoded protein of the sequence according to SEQ ID NO: 9 is 7.52.
[0022] The activity of the plant β1,2-xylosyltransferase is detected by a method and measured, the xylosyltransferase being added to a sample containing UDP-xylose and a labelled acceptor (e.g. a glycopeptide or labelled oligosaccharide). After the reaction time, the content of bound xylose is measured. The activity of the xylosyltransferase in this case is seen as positive if the activity measurement is higher by at least 10 to 20%, in particular at least 30 to 50%, than the activity measurement of the negative control. The structure of the oligosaccharide may additionally be verified by means of HPLC. Such protocols are prior art (Staudacher et al., 1998, Anal. Biochem. 246, 96-101; Staudacher et al., 1991, Eur. J. Biochem. 199, 745-751). Whether the xylose is bound or not to the acceptor substrate can furthermore be determined by measuring the mass of the product by means of mass spectrometry.
[0023] The pairing of two DNA molecules can be changed by selection of the temperature and ionic strength of the sample. By stringent conditions, according to the invention conditions are understood which allow for an exact, stringent, binding. For instance, the DNA molecules are hybridized in 7% sodium dodecyl sulfate (SDS), 0.5M NaPO4, pH 7.0, mM EDTA at 50° C., and washed with 1% SDS at 42° C.
[0024] Whether sequences have an at least 50% homology to SEQ ID NO: 8 can be determined e.g. by means of the program FastDB of EMBL or SWISSPROT data bank.
[0025] There exist a number of relevant differences between recombinant β1,2-xylosyltransferase encoded by the DNA molecule of the present invention and the respective enzyme from soybean as described in WO99/29835 A1:
[0026] 1. The recombinant enzyme is soluble without detergents (e.g. Triton X-100), whereas the solubility of the enzyme from soybean depends on the presence of detergents.
[0027] 2. The recombinant enzyme is fully active in the absence of detergents (e.g. Triton X-100).
[0028] 3. The recombinant enzyme is N-glycosylated, whereas the enzyme from soybean is described to be unglycosylated.
[0029] 4. The enzyme from A. thaliana exhibits full enzymatic activity also as a truncated form lacking the 32 N-terminal amino acids.
[0030] 5. In contrast to the enzyme from soybean the enzyme from A. thaliana has a broad pH-optimum and shows pronounced activity in the range of pH 6-8.
[0031] 6. The cDNA sequence coding for the soybean enzyme corresponds only to amino acids (aa) 199-469 of the A. thaliana protein, see FIG. 11.
[0032] 7. The cDNA sequence coding for A. thaliana xylosyltransferase contains two insertions (corresponding to aa 375-382 and aa 425-429 of the predicted protein sequence) compared to the partial sequence of the soybean enzyme, see FIG. 11.
[0033] 8. None of the five peptides (see FIG. 4 of WO99/29835 A1) isolated from the soybean enzyme is identical to the corresponding regions of the enzyme from A. thaliana:
[0034] Peptide SEQ ID NO. 1: homologous to aa 411-422 of the A. thaliana enzyme
[0035] Peptide SEQ ID NO. 2: homologous to aa 192-205 of the A. thaliana enzyme
[0036] Peptide SEQ ID NO. 3: homologous to aa 451-477 of the A. thaliana enzyme
[0037] Peptide SEQ ID NO. 4: homologous to aa 191-205 of the A. thaliana enzyme
[0038] Peptide SEQ ID NO. 5: homologous to aa 503-512 of the A. thaliana enzyme (remark: the cDNA sequence listed in WO99/29835 A1 does not contain a coding sequence for peptide 5).
[0039] Therefore the DNA molecule according to the present invention is particularly advantageous since it encodes for an active recombinant enzyme which shows surprisingly advantageous characteristics and effects over the known purified enzyme.
[0040] Preferably, the sequence of the DNA molecule of the invention encodes a protein with a β1,2-xylosyltransferase activity. This specific protein is especially useful for analysis, experiments and production methods which relate to the β1,2-xylosyltransferase.
[0041] Preferably, the DNA molecule according to the invention is at least 70%, preferably at least 80%, particularly preferred at least 95%, homologous with the sequence according to SEQ ID NO: 8. This sequence codes for a particularly active β1,2-xylosyltransferase. The homology preferably is determined with a program which recognizes insertions and deletions and which does not consider these in the homology calculation.
[0042] According to a further advantageous embodiment, the DNA molecule comprises 1750 to 1850, in particular 1831, base pairs.
[0043] In doing so, it is particularly advantageous if one of the above-indicated DNA molecules is covalently associated with a detectable marker substance. As the marker (labelling) substance, any common marker can be used, such as, e.g., fluorescent, luminescent, radioactive markers, biotin, etc. In this manner, reagents are provided which are suitable for the detection, selection and quantitation of corresponding DNA molecules in solid tissue samples (e.g. from plants) or also in liquid samples, by means of hybridizing methods.
[0044] Preferably, the DNA molecule according to the invention includes a sequence which comprises a deletion, insertion and/or substitution mutation. The number of mutant nucleotides is variable and varies from a single one to several deleted, inserted or substituted nucleotides. It is also possible that the reading frame is shifted by the mutation. In such a "knock-out gene" it is merely important that the expression of a β1,2-xylosyltransferase is disturbed, and the formation of an active, functional enzyme is prevented. In doing so, the site of the mutation is variable, as long as expression of an enzymatically active protein is prevented. Preferably, the mutation in the catalytic region of the enzyme which is located in the C-terminal region. The method of inserting mutations in DNA sequences are well known to the skilled artisan, and therefore the various possibilities of mutageneses need not be discussed here in detail. Coincidental mutageneses as well as, in particular, directed mutageneses, e.g. the site-directed mutagenesis, oligonucleotide-controlled mutagenesis or mutageneses by aid of restriction enzymes may be employed in this instance.
[0045] The invention further provides a DNA molecule which codes for a ribozyme which comprises two sequence sections, each of which has a length of at least 10 to 15 base pairs each, which are complementary to sequence sections of an inventive DNA molecule as described above so that the ribozyme complexes and cleaves the mRNA which is transcribed by a natural β1,2-xylosyltransferase DNA molecule. The publication by John M. Burke "Clearing the way for ribozymes" (Nature Biotechnology 15:414-415; 1997) relates to the general mode of function of ribozymes. The ribozyme will recognize the mRNA of the β1,2-xylosyltransferase by complementary base pairing with the mRNA. Subsequently, the ribozyme will cleave and destroy the RNA in a sequence-specific manner, before the enzyme is translated. After dissociation from the cleaved substrate, the ribozyme will repeatedly hybridize with RNA molecules and act as specific endonuclease. In general, ribozymes may specifically be produced for inactivation of a certain mRNA, even if not the entire DNA sequence which codes for the protein is known. Ribozymes are particularly efficient if the ribosomes move slowly along the mRNA. In that case it is easier for the ribozyme to find a ribosome-free site on the mRNA. For this reason, slow ribosome mutants are also suitable as a system for ribozymes (J. Burke, 1997, Nature Biotechnology; 15, 414-415).
[0046] One possible way is also to use a varied form of a ribozmye, i.e. a minizyme. Minizymes are efficient particularly for cleaving larger mRNA molecules. A minizyme is a hammer head ribozyme which has a short oligonucleotide linker instead of the stem/loop II. Dimer-minizymes are particularly efficient (Kuwabara et al., 1998, Nature Biotechnology, 16; 961-965).
[0047] A further aspect of the invention relates to a biologically functional vector which comprises one of the above-indicated DNA molecules. For transfection into host cells, an independent vector capable of amplification is necessary, wherein, depending on the host cell, transfection mechanism, task and size of the DNA molecule, a suitable vector can be used. Since a large number of different vectors is known, an enumeration thereof would go beyond the limits of the present application and there fore is done without here, particularly since the vectors are very well known to the skilled artisan (as regards the vectors as well as all the techniques and terms used in this specification which are known to the skilled artisan, cf. also Maniatis). Ideally, the vector has a small molecule mass and should comprise selectable genes so as to lead to an easily recognizable phenotype in a cell so thus enable an easy selection of vector-containing and vector-free host cells. To obtain a high yield of DNA and corresponding gene products, the vector should comprise a strong promoter, as well as an enhancer, gene amplification signals and regulator sequences. For an autonomous replication of the vector, furthermore, a replication origin is important. Polyadenylation sites are responsible for correct processing of the mRNA and splice signals for the RNA transcripts. If phages, viruses or virus particles are used as the vectors, packaging signals will control the packaging of the vector DNA. For instance, for transcription in plants, Ti plasmids are suitable, and for transcription in insect cells, baculoviruses, and in insects, respectively, transposons, such as the P element.
[0048] If the above-described inventive vector is inserted into a plant or into a plant cell, a post-transcriptional suppression of the gene expression of the endogenous β1,2-xylosyltransferase gene is attained by transcription of a transgene homologous thereto or of parts thereof, in sense orientation. For this sense technique, furthermore, reference is made to the publications by Baucombe 1996, Plant. Mol. Biol., 9:373-382, and Brigneti et al., 1998, EMBO J. 17:6739-6746. This strategy of "gene silencing" is an effective way of suppressing the expression of the β1,2-xylosyltransferase gene, cf. also Waterhouse et al., 1998, Proc. Natl. Acad. Sci. USA, 95:13959-13964.
[0049] Furthermore, the invention relates to a biologically functional vector comprising a DNA molecule according to one of the above-described embodiments, being inversely orientated with respect to the promoter. If this vector is transfected in a host cell, an "antisense mRNA" will be read which is complementary to the mRNA of the β1,2-xylosyltransferase and complexes the latter. This bond will either hinder correct processing, transportation, stability or, by preventing ribosome annealing, it will hinder translation and thus the normal gene expression of the β1,2-xylosyltransferase.
[0050] Although the entire sequence of the DNA molecule could be inserted into the vector, partial sequences thereof because of their smaller size may be advantageous for certain purposes. With the antisense aspect, e.g., it is important that the DNA molecule is large enough to form a sufficiently large antisense mRNA which will bind to the transferase mRNA. A suitable antisense RNA molecule comprises, e.g., from 50 to 200 nucleotides since many of the known, naturally occurring antisense RNA molecules comprise approximately 100 nucleotides.
[0051] For a particularly effective inhibition of the expression of an active β1,2-xylosyltransferase, a combination of the sense technique and the antisense technique is suitable (Waterhouse et al., 1998, Proc. Natl. Acad. Sci., USA, 95:13959-13964).
[0052] Advantageously, rapidly hybridizing RNA molecules are used. The efficiency of antisense RNA molecules which have a size of more than 50 nucleotides will depend on the annealing kinetics in vitro. Thus, e.g., rapidly annealing antisense RNA molecules exhibit a greater inhibition of protein expression than slowly hybridizing RNA molecules (Wagner et al., 1994, Annu. Rev. Microbiol., 48:713-742; Rittner et al., 1993, Nucl. Acids Res., 21: 1381-1387). Such rapidly hybridizing antisense RNA molecules particularly comprise a large number of external bases (free ends and connecting sequences), a large number of structural subdomains (components) as well as a low degree of loops (Patzel et al. 1998; Nature Biotechnology, 16; 64-68). The hypothetical secondary structures of the antisense RNA molecule may, e.g., be determined by aid of a computer program, according to which a suitable antisense RNA DNA sequence is chosen.
[0053] Different sequence regions of the DNA molecule may be inserted into the vector. One possibility consists, e.g., in inserting into the vector only that part which is responsible for ribosome annealing. Blocking in this region of the mRNA will suffice to stop the entire translation. A particularly high efficiency of the antisense molecules also results for the 5'- and 3'-non translated regions of the gene.
[0054] The invention also relates to a biologically functional vector which comprises one of the two last-mentioned DNA molecules (mutation or ribozyme-DNA molecule). What has been said above regarding vectors also applies in this instance.
[0055] According to the invention, there is provided a method of preparing a cDNA comprising the DNA molecule of the invention, where in RNA is isolated from a plant cell, in particular from leaf cells, by means of which a reverse transcription is carried out after the addition of a reverse transcriptase and primers. The individual steps of this method are carried out according to protocols known per se. For the reverse transcription, on the one hand, it is possible to produce the cDNA of the entire mRNA with the help of oligo(dT) primers, and only then to carry out a PCR by means of selected primers so as to prepare DNA molecules comprising the β1,2-xylosyltransferase gene. On the other hand, the selected primers may directly be used for the reverse transcription so as to obtain short, specific cDNA. The suitable primers may be prepared e.g. synthetically according to the pattern of cDNA sequences of the transferase.
[0056] The invention furthermore relates to a method of cloning a β1,2-xylosyltransferase, characterized in that the DNA molecule of the invention is cloned into a vector which subsequently is transfected into a host cell or host, respectively, wherein, by selection and amplification of transfected host cells, cell lines are obtained which express the active β1,2-xylosyltransferase. The DNA molecule is inserted into the vector by aid of restriction endonucleases, e.g. For the vector, there applies what has already been said above. What is important in this method is that an efficient host-vector system is chosen. To obtain an active enzyme, eukaryotic host cells are particularly suitable. One possible way is to transfect the vector in insect cells. In doing so, in particular an insect virus would have to be used as vector, such as, e.g., baculovirus.
[0057] Of course, plants or plant cells, human or other vertebrate cells can also be transfected, in which case the latter would express an enzyme foreign to them.
[0058] Preferably, a method of preparing recombinant host cells, in particular plant cells or plants, respectively, with a suppressed or completely stopped β1,2-xylosyltransferase production is provided, which is characterized in that at least one of the vectors according to the invention, i.e. that one comprising the inventive DNA molecule, the mutant DNA molecule or the DNA molecule coding for ribozymes or the one comprising the DNA molecule in inverse orientation to the promoter, is inserted into the host cell or plant, respectively. What has been said above for the transfection also is applicable in this case.
[0059] As the host cells, plant cells may, e.g., be used, wherein, e.g., the Ti plasmid with the agrobacterium system is eligible. With the agrobacterium system it is possible to transfect a plant directly: agrobacteria cause root stem galls in plants. If agrobacteria infect an injured plant, the bacteria themselves do not get into the plant, but they insert the recombinant DNA portion, the so-called T-DNA, from the annular, extrachromosomal, tumour-inducing Ti-plasmid into the plant cells. The T-DNA, and thus also the DNA molecule inserted therein, are installed in the chromosomal DNA of the cell in a stable manner so that the genes of the T-DNA will be expressed in the plant.
[0060] There exist numerous known, efficient transfection mechanisms for different host systems. Some examples are electroporation, the calcium phosphate method, microinjection, liposome method.
[0061] Subsequently, the transfected cells are selected, e.g. on the basis of antibiotic resistences for which the vector comprises genes, or other marker genes. Then the transfected cell lines are amplified, either in small amounts, e.g. in Petri dishes, or in large amounts, e.g. in fermentors. Furthermore, plants have a particular characteristic, i.e. they are capable to re-develop from one (transfected) cell or from a protoplast, respectively, to a complete plant which can be grown.
[0062] Depending on the vector used, processes will occur in the host so that the enzyme expression will be suppressed or completely blocked:
[0063] If the vector comprising the DNA molecule with the deletion, insertion or substitution mutation is transfected, a homologous recombination will occur: the mutant DNA molecule will recognize the identical sequence in the genome of the host cell despite its mutation and will be inserted exactly on that place so that a "knock-out gene" is formed. In this manner, a mutation is introduced into the gene for the β1,2-xylosyltransferase which is capable of inhibiting the faultless expression of the β1,2-xylosyl transferase. As has been explained above, with this technique it is important that the mutation suffices to block the expression of the active protein. After selection and amplification, the gene may be sequenced as an additional check so as to determine the success of the homologous recombination or the degree of mutation, respectively.
[0064] If the vector comprising the DNA molecule coding for a ribozyme is transfected, the active ribozyme will be expressed in the host cell. The ribozyme complexes the complementary mRNA sequence of the β1,2-xylosyltransferase at least at a certain site, cleaves this site, and in this manner it can inhibit the translation of the enzyme. In this host cell as well as in cell lines, or optionally, plant, respectively, derived therefrom, β1,2-xylosyltransferase will not be expressed.
[0065] In case the vector comprises the inventive DNA molecule in sense or inverse direction to the promoter, a sense or antisense-mRNA will be expressed in the transfected cell (or plant, respectively). The antisense mRNA is complementary at least to a part of the mRNA sequence of the β1,2-xylosyltransferase and may likewise inhibit translation of the enzyme. As an example of a method of suppressing the expression of a gene by antisense technique, reference is made to the publication by Smith et al., 1990, Mol. Gen. Genet. 224:477-481, wherein in this publication the expression of a gene involved in the maturing process of tomatoes is inhibited. Double-stranded RNA (dsRNA) has recently been shown to trigger sequence-specific gene silencing in a wide variety of organisms, including nematodes, plants, trypanosomes, fruit flies and planaria; an as yet uncharacterized RNA trigger has been shown to induce DNA methylation in several different plant systems leading to selective interference with gene function (for review see Fire A., 1999, Trends Genet 15 (9): 358-363).
[0066] In all the systems, expression of the β1,2-xylosyltransferase is at least suppressed, preferably even completely blocked. The degree of the disturbance of the gene expression will depend on the degree of complexing, homologous recombination, on possible sub-sequent coincidental mutations and on other processes in the region of the genome. The transfected cells are checked for β1,2-xylosyltransferase activity and selected.
[0067] Moreover, it is possible to still further increase the above-described suppression of the expression of the β1,2-xylosyltransferase by introducing into the host a vector comprising a gene coding for a mammalian protein, e.g. β1,4-galactosyltransferase, in addition to the insertion of an above-described vector. Xylosylation may be reduced by the action of other mammalian enzymes, the combination of the inhibition of the expression of an active β1,2-xylosyltransferase by means of the inventive vector and by means of a mammalian enzyme vector being particularly efficient.
[0068] Any type of plant may be used for transfection, e.g. mung bean, tobacco plant, tomato and/or potato plant.
[0069] Another advantageous method of producing recombinant host cells, in particular plant cells, or plants, respectively, consists in that the DNA molecule comprising the mutation is inserted into the genome of the host cell, or plant, respectively, in the place of the non-mutated homologous sequence (Schaefer et al., 1997, Plant J.; 11(6):1195-1206). This method thus does not function with a vector, but with a pure DNA molecule. The DNA molecule is inserted into the host e.g. by gene bombardment, microinjection or electroporation, to mention just three examples. As has already been explained, the DNA molecule binds to the homologous sequence in the genome of the host so that a homologous recombination and thus reception of the deletion, insertion or substitution mutation, respectively, will result in the genome: Expression of the β1,2-xylosyltransferase can be suppressed or completely blocked, respectively.
[0070] Preferably, recombinant plants or plant cells, respectively, are provided which have been prepared by one of the methods described above, their β1,2-xylosyltransferase production being suppressed or completely blocked, respectively. Preferably, their β1,2-xylosyltransferase activity is less than 50%, in particular less than 20%, particularly preferred 0%, of the β1,2-xylosyltransferase activity occurring in natural plants or plant cells, respectively. The advantage of these plants or plant cells, respectively, is that the glycoproteins produced by them do not comprise any or hardly comprise any β1,2-bound xylose. If products of these plants are taken up by human or vertebrate bodies, there will be no immune reaction due to the β1,2-xylose epitope.
[0071] The invention also relates to a PNA molecule comprising a base sequence complementary to the sequence of the DNA molecule according to the invention. PNA (peptide nucleic acid) is a DNA-like sequence, the nucleobases being bound to a pseudo-peptide backbone. PNA generally hybridizes with complementary DNA-, RNA- or PNA-oligomers by Watson-Crick base pairing and helix formation. The peptide backbone ensures a greater resistance to enzymatic degradation. The PNA molecule thus is an improved antisense agent.
[0072] Neither nucleases nor proteases are capable of attacking a PNA molecule. The stability of the PNA molecule, if bound to a complementary sequence, comprises a sufficient steric blocking of DNA and RNA polymerases, reverse transcriptase, telomerase and ribosomes. The publication by Pooga et al., "Cell penetrating PNA constructs regulate galanin receptor levels and modify pain transmission in vivo" (Nature Biotechnology 16:857-861; 1998) relates to PNA molecules in general and specifically to a PNA molecule that is complementary to human galanin receptor type 1 mRNA.
[0073] If the PNA molecule comprises the above-mentioned sequence, it will bind to the DNA or to a site of the DNA, respectively, which codes for β1,2-xylosyltransferase and in this way is capable of inhibiting transcription of this enzyme. As it is neither transcribed nor translated, the PNA molecule will be prepared synthetically, e.g. by aid of the the t-Boc technique.
[0074] Advantageously, a PNA molecule is provided which comprises a base sequence which corresponds to the sequence of the inventive DNA molecule. This PNA molecule will complex the mRNA or a site of the mRNA of β1,2-xylosyltransferase so that the translation of the enzyme will be inhibited. Similar arguments as set forth for the antisense RNA apply in this case. Thus, e.g., a particularly efficient complexing region is the translation start region or also the 5'-non-translated regions of mRNA.
[0075] A further aspect of the present invention relates to a method of preparing plants or plant cells, respectively, in particular plant cells which comprise a blocked expression of the β1,2-xylosyltransferase at the transcription or translation level, respectively, which is characterized in that inventive PNA molecules are inserted in the cells. To insert the PNA molecule or the PNA molecules, respectively, in the cell, again conventional methods, such as, e.g., electroporation or microinjection, are used. Particularly efficient is insertion if the PNA oligomers are bound to cell penetration peptides, e.g. transportan or pAntp (Pooga et al., 1998, Nature Biotechnology, 16; 857-861).
[0076] The invention provides a method of preparing recombinant glycoproteins which is characterized in that the inventive, recombinant plants or plant cells, respectively, whose β1,2-xylosyltransferase production is suppressed or completely blocked, respectively, or plants or cells, respectively, in which the PNA molecules have been inserted according to the method of the invention, are transfected with the gene that expresses the glycoprotein so that the recombinant glycoproteins are expressed. In doing so, as has already been described above, vectors comprising genes for the desired proteins are transfected into the host or host cells, respectively, as has also already been described above. The transfected plant cells will express the desired proteins, and they have no or hardly any β1,2-bound xylose. Thus, they do not trigger the immune reactions already mentioned above in the human or vertebrate body. Any proteins may be produced in these systems.
[0077] Advantageously, a method of preparing recombinant human glycoproteins is provided which is characterized in that the recombinant plants or plant cells, respectively, whose β1,2-xylosyltransferase production is suppressed or completely blocked, or plants or cells, respectively, in which PNA molecules have been inserted according to the method of the invention, are transfected with the gene that expresses the glycoprotein so that the recombinant glycoproteins are expressed. By this method it becomes possible to produce human proteins in plants (plant cells) which, if taken up by the human body, do not trigger any immune reaction directed against β1,2-bound xylose residues. There, it is possible to utilize plant types for producing the recombinant glycoproteins which serve as food stuffs, e.g. banana, potato and/or tomato. The tissues of this plant comprise the recombinant glycoprotein so that, e.g. by extraction of the recombinant glycoprotein from the tissue and subsequent administration, or directly by eating the plant tissue, respectively, the recombinant glycoprotein is taken up in the human body.
[0078] Preferably, a method of preparing recombinant human glycoproteins for medical use is provided, wherein the inventive, recombinant plants or plant cells, respectively, whose β1,2-xylosyltransferase production is suppressed or completely blocked, respectively, or plants or cells, respectively, into which the PNA molecules have been inserted according to the method of the invention, are transfected with the gene that expresses the glycoprotein so that the recombinant glycoproteins are expressed. In doing so, any protein can be used which is of medical interest.
[0079] Moreover, the present invention relates to recombinant glycoproteins according to a method described above, wherein they have been prepared in plant systems and wherein their peptide sequence comprises less than 50%, in particular less than 20%, particularly preferred 0%, of the β1,2-bound xylose residues occurring in proteins expressed in non-xylosyltransferase-reduced plant systems. Naturally, glycoproteins which do not comprise β1,2-bound xylose residues are to be preferred. The amount of β1,2-bound xylose will depend on the degree of the above-described suppression of the β1,2-xylosyltransferase.
[0080] Preferably, the invention relates to recombinant human glycoproteins which have been produced in plant systems according to a method described above and whose peptide sequence comprises less than 50%, in particular less than 20%, particularly preferred 0%, of the β1,2-bound xylose residues occurring in the proteins expressed in non-xylosyltransferase-reduced plant systems.
[0081] A particularly preferred embodiment relates to recombinant human glycoproteins for medical use which have been prepared in plant systems according to a method described above and whose peptide sequence comprises less than 50%, in particular less than 20%, particularly preferred 0%, of the β1,2-bound xylose residues occurring in the proteins expressed in non-xylosyltransferase-reduced plant systems.
[0082] The glycoproteins according to the invention may include other bound oligosaccharide units specific for plants, whereby--in the case of human glycoproteins--they differ from these natural glycoproteins. Nevertheless, by the glycoproteins according to the invention, a slighter immune reaction or no immune reaction at all, respectively, is triggered in the human body, since, as has already been explained in the introductory portion of the specification, the β1,2-bound xylose residues, together with α1,3-fucose residues, are the main cause for the immune reactions or cross immune reaction, respectively, to plant glycoproteins.
[0083] A further aspect comprises a pharmaceutical composition comprising the glycoproteins according to the invention. In addition to the glycoproteins of the invention, the pharmaceutical composition comprises further additions common for such compositions. These are, e.g., suitable diluting agents of various buffer contents (e.g. Tris-HCl, acetate, phosphate, pH and ionic strength), additives, such as tensides and solubilizers (e.g. Tween 80, Polysorbate 80), preservatives (e.g. Thimerosal, benzyl alcohol), adjuvants, antioxidants (e.g. ascorbic acid, sodium metabisulfite), emulsifiers, fillers (e.g. lactose, mannitol), covalent bonds of polymers, such as polyethylene glycol, to the protein, incorporation of the material in particulate compositions of polymeric compounds, such as polylactic acid, polyglycolic acid, etc. or in liposomes, auxiliary agents and/or carrier substances which are suitable in the respective treatment. Such compositions will influence the physical condition, stability, rate of in vivo liberation and rate of in vivo excretion of the glycoproteins of the invention.
[0084] The invention also provides a method of selecting DNA molecules which code for a β1,2-xylosyltransferase, in a sample, wherein the labelled DNA molecules of the invention or partial sequences thereof, are added to the sample, which bind to the DNA molecules that code for a β1,2-xylosyltransferase. The hybridized DNA molecules can be detected, quantitated and selected. For the sample to contain single strand DNA with which the labelled DNA molecules can hybridize, the sample is denatured, e.g. by heating.
[0085] One possible way is to separate the DNA to be assayed, possibly after the addition of endonucleases, by gel electrophoresis on an agarose gel. After having been transferred to a membrane of nitrocellulose, the labelled DNA molecules according to the invention are admixed which hybridize to the corresponding homologous DNA molecule ("Southern blotting").
[0086] Another possible way consists in finding homologous genes from other species by PCR-dependent methods using specific and/or degenerated primers, derived from the sequence of the DNA molecule according to the invention.
[0087] Advantageously the labelled DNA molecules of the invention or partial sequences thereof are immobilized onto carrier matrices. The use of DNA microarrays ("gene chips") is a further possible way to find homologous genes or to study the expression level of homologous genes. To this end, DNA representing either the entire genomic gene sequence, the full-length cDNA sequence, parts of these sequences or any combination of partial sequences, is immobilized onto carrier matrices, in order that homologous genes, after adding the sample to the carrier matrices, hybridize with the labelled DNA molecules (for examples see e.g. Ferea T. L. & Brown, P.O., 1999, Current Opinion in Genetics & Development 9: 715-722 and references cited herein).
[0088] Preferably, the sample for the above-identified inventive method comprises genomic DNA of a plant organism. By this method, a large number of plants or other species is assayed in a very rapid and efficient manner for the presence of the β1,2-xylosyltransferase gene. In this manner, it is respectively possible to select plants or individuals of other species which do not comprise this gene, or to suppress or completely block, respectively, the expression of the β1,2-xylosyltransferase in such plants or other organisms which comprise this gene, by an above-described method of the invention, so that subsequently they may be used for the transfection and production of (human) glycoproteins.
[0089] The invention also relates to DNA molecules which code for a β1,2-xylosyltransferase which have been selected according to the three last-mentioned methods and subsequently have been isolated from the sample. These molecules can be used for further assays. They can be sequenced and in turn can be used as DNA probes for finding β1,2-xylosyltransferases. These--labelled--DNA molecules will function for organisms, which are related to the organisms from which they have been isolated, more efficiently as probes than the DNA molecules of the invention.
[0090] A further aspect of the invention relates to a preparation of β1,2-xylosyltransferase cloned according to the invention which comprises isoforms having pI values of between 6.0 and 9.0, in particular between 7.50 and 8.00. The pI values of a protein is that pH value at which its net charge is zero and is dependent on the amino acid sequence, the glycosylation pattern as well as on the spatial structure of the protein. The β1,2-xylosyltransferase may comprise several isoforms which have a pI value in this range. The reason for the various isoforms of the transferase are, e.g., different glycosylations as well as limited proteolysis. The pI value of a protein can be determined by isoelectric focussing, which is known to the skilled artisan.
[0091] The main isoform of the enzyme has an apparent molecular weight of 60.2 kDa.
[0092] In particular, the preparation of the invention comprises an isoform having a pI value of 7.52.
[0093] The invention also relates to a method of preparing "plantified" carbohydrate units of human and other vertebrate glycoproteins or other glycoconjugates, wherein UDP-xylose as well as β1,2-xylosyltransferase encoded by an above-described DNA molecule are added to a sample that comprises a carbohydrate unit or a glycoprotein, respectively, so that xylose in β1,2-position is bound by the β1,2-xylosyltransferase to the carbohydrate unit or to the glycoprotein, respectively. By the method according to the invention for cloning β1,2-xylosyltransferase it is possible to produce large amounts of purified enzyme. To obtain a fully active transferase, suitable reaction conditions are provided.
[0094] The invention will be explained in more detail by way of the following examples and drawing figures to which, of course, it shall not be restricted. In detail, in the drawings,
[0095] FIG. 1 shows the amino acid sequence of soybean peptide 2 and 3 (patent WO99/29835; SEQ ID NO: 3 and 5), the homology between these peptides and a A. thaliana sequence as well as the DNA sequence of four primers 1-4;
[0096] FIG. 2 shows the cDNA sequence of β1,2-xylosyltransferase including 226 nt of the 5'-untranslated region;
[0097] FIG. 3 shows the amino acid sequence of β1,2-xylosyltransferase derived therefrom;
[0098] FIGS. 4a, 4b and 4c show the alignment of A. thaliana β1,2-xylosyltransferase cDNAs, one genomic DNA and one EST sequence;
[0099] FIG. 5 shows the alignment of amino acid sequences of β1,2-xylosyltransferase derived from the cDNAs, from a genomic DNA and from a EST sequence;
[0100] FIG. 6 is a schematic representation of the β1,2-xylosyltransferase as well as the PCR-products and the hydrophobicity of the amino acid residues;
[0101] FIG. 7 shows a comparison of the β1,2-xylosyltransferase activity of insect cells transfected with the β1,2-xylosyltransferase gene with that of a negative control;
[0102] FIGS. 8a and 8b shows the structure of the acceptor substrate and product of the β1,2-xylosyltransferase;
[0103] FIG. 9 shows mass spectra;
[0104] FIG. 10 shows the analysis of the β1,2-xylosyltransferase product by reversed-phase HPLC, and
[0105] FIG. 11 shows the alignment of the predicted amino acid sequence derived from the cDNA of the present application with the amino acid sequence of the β1,2-xylosyltransferase purified from soybean.
EXAMPLE 1
RT-PCR and cDNA Cloning
[0106] Primers for the amplification of the putative β1,2-xylosyltransferase cDNA by RT-PCR were designed as follows: A BLASTP search of the DDBJ database using two soybean peptides (SEQ ID NO: 1 and 2; corresponding to SEQ ID NO: 3 and 5 in FIG. 4 in patent WO99/29835 A1; however, the C-terminal amino acids LG were omitted from SEQ ID NO: 5) (see FIG. 1) showed one protein sequence derived from a Arabidopsis thaliana genomic DNA sequence (Acc. Nr. AB015479) with more than 80% homology (SEQ ID NO: 3). Primers 3 (SEQ ID NO: 4) and 4 (SEQ ID NO: 5) were based on the A. thaliana sequence homologous to the soybean peptides 2 and 3. Analysis of the homologous genomic DNA sequence using Gene-Finder at the BCM Search Launcher resulted in one predicted protein. Primer 1 (SEQ ID NO: 6) was designed to include the start codon of the predicted protein, whereas primer 2 (SEQ ID NO: 7) contains the stop codon of the predicted protein.
[0107] The entire RNA was isolated from young leaves of Arabidopsis thaliana var Columbia using the TRIzol reagent (Life Technologies). The RNA was treated with DNAse (Promega, RQ1 RNase Free DNase) to remove traces of genomic DNA. First-strand cDNA was synthesised from 1 μg of total RNA at 42° C. using oligo(dT) primers (Sigma) and AMV reverse transcriptase (Promega).
[0108] The first strand cDNA was subjected to a PCR, wherein different combinations of sense and antisense primers were used (illustrated in FIG. 6): The product of primer 3 and primer 4 was a DNA fragment with length of 174 bp (P1), the product of primer 1 and primer 2 was a 1605 bp (P2) DNA fragment, the product of primer 1 and primer 4 was a DNA fragment with length of 1525 bp (P3) and primer 3 and primer 4 produced a DNA of 254 bp (P4). For amplification of the putative open reading frame primer 1 and primer 2 were used. A PCR reaction contained in a total volume of 50 μl 0.2 μmol of each primer, 0.05 mM dNTPs, 2 mM MgSO4, 20 mM Tris-HCl (pH 8.2 at 25° C.), 10 mM KCl, 10 mM (NH4)2SO4, 0.1% Triton X-100, 5 μg nuclease-free BSA and 2.5 units Pfu DNA polymerase from Promega. After a first denaturing step at 94° C. for 2 min, 30 cycles of 1 min at 92° C., 40 sec at 53° C. and 3 min and 30 sec at 72° C. were performed. The last extension step was carried out at 72° C. for 8 min. PCR products were subcloned into SmaI linearised and dephosphorylated puc19 vector, and sequenced: The sequences of the subcloned fragments were obtained by means of the didesoxynucleotide method (ABI PRISM Dye Termination Cycle Sequencing Ready reaction Kit and ABI PRISM 310 Genetic analyzer from Perkin Elmer). As a result of the RT-PCR three slightly different cDNA sequences were obtained. The sequence of cDNA 6 which has a size of 1605 bp (xt-Ath6; SEQ ID NO: 8) and codes for a protein of 534 amino acids having a molecular weight of 60.2 kDA and a theoretical pI value of 7.52 (see FIG. 3) is identical to the nucleotide sequence derived from the genomic clone after removing of two introns (xt-Athgen.seq). cDNA 9 shows 4 base pair changes compared to cDNA 6, whereas cDNA 16 shows 6 base pair changes compared to cDNA 6 (illustrated in FIGS. 4a, 4b and 4c). Therefore the amino acid sequence derived from cDNA 9 comprises two changes compared to the amino acid sequence derived from cDNA 6 (SEQ ID NO: 8), and the amino acid sequence of cDNA 16 shows four changed residues (illustrated in FIG. 5).
[0109] FIG. 3 shows the cDNA-derived amino acid sequence (SEQ ID NO: 9) of the β1,2-xylosyltransferase (xt-Ath6; SEQ ID NO: 8). Potential sites for the asparagine-bound glycosylation are at Asn51, Asn301 and Asn479.
[0110] FIGS. 4a, 4b and 4c show the alignment of β1,2-xylosyltransferase nucleotide sequences from the A. thaliana cDNA 6 (xt-Ath6; SEQ ID NO: 8), the A. thaliana cDNA 9 (xt-Ath9), the A. thaliana cDNA 16 (xt-Ath16), the A. thaliana genomic DNA sequence after removing of two introns (xt-Athgen), and from a A. thaliana EST sequence (xtAthEST). The dotted line stands for the consensus sequence; the dashed line for a gap.
[0111] The genomic sequence (xt-Athgen; Acc. No. AB015479, start codon at position 58185-58187, stop codon at position 60214-60216 of the genomic DNA) results from removing of two putative introns (intron 1: from position 58881 to 59116 of the genomic DNA; intron 2: from position 59268 to 59458 of the genomic DNA) using the splice site prediction server NetPlantGene. The A. thaliana EST sequence (xt-AthEST; Acc. No. A1994524) is the result of a database search using BLASTN.
[0112] FIG. 5 shows the alignment of amino acid sequences from β1,2-xylosyltransferase derived from A. thaliana cDNA 6 (xt-Ath6; SEQ ID NO: 9), from A. thaliana cDNA 9 (xt-Ath9), from A. thaliana cDNA 16 (xt-Ath16), from the A. thaliana genomic sequence (xt-Athgen), and derived from a A. thaliana EST sequence (xt-AthEST). The dotted line stands for a consensus sequence; the dashed line stands for a gap.
[0113] In FIG. 6, the schematic predicted β1,2-xylosyltransferase protein (top) and the derived hydrophobicity index using ProtScale, of the encoded protein (bottom) are illustrated, a positive hydrophobicity index meaning an increased hydrophobicity. Therebetween the sizes of the four above-indicated PCR products (P1-P4) are shown in relationship to the cDNA. "C" coding for the postulated cytoplasmic region, "T" for the postulated transmembrane region, and "G" for the postulated Golgi lumen catalytic region of the transferase. The analysis of the protein sequence by "TMpred" (from EMBnet,) gave an assumed transmembrane region from Ile11 to Phe29. The C-terminal region of the enzyme probably comprises the catalytic region and consequently should point into the lumen of the Golgi apparatus. According to this, this transferase seems to be a type II transmembrane protein like all the hitherto analysed glycosyltransferases which are involved in glycoprotein biosynthesis (Joziasse, 1992, Glyccrbiology 2, 271-277). The grey regions represent the position of the two peptides, the hexagons represent the potential N-glycosylation sites. A BLAST search in data banks accessible via NCBI showed only high homology to one other plant sequence (hybrid aspen, Acc. No. A162640).
EXAMPLE 2
Expression of Recombinant β1,2-xylosyltransferase in Insect Cells
[0114] The entire coding region of the assumed β1,2-xylosyltransferase including cytoplasmatic and transmembrane region was removed from the puc19 vector by BamHI and EcoRI digestion and subcloned into BamHI/EcoRI digested and dephosphorylated baculovirus transfer vector pVL 1393 (PharMingen, San Diego, Calif.). Correct cloning was confirmed by sequencing using pVL1393 forward primer 5'-AACCATCTCGCAAATAAATAAGTA-3'(SEQ ID NO: 10) and pVL1393 reverse primer 5'-GTCGGGTTTAACATTACGGATTTC-3' (SEQ ID NO: 11). To ensure a homologous recombination, 1 μg of the transfer vector was cotransfected with 200 ng linear Baculo-Gold viral DNA (PharMingen, San Diego, Calif.) into 1×106 Sf-9 cells in IPL-41 Medium using Lipofectin (Life Technologies) according to the manufacturer's protocol. After an incubation of 5 days at 27° C., various volumes of the supernatant with the recombinant virus were used for infection of Sf-21 insect cells. Cells were incubated in IPL-41 medium supplemented with 5% heat-inactivated fetal calf serum for 4 days at 27° C., then harvested and washed 2× with phosphate-buffered saline solution. The cells were resuspended and homogenised in the following buffer (1 ml per 107 cells): 100 mM MES buffer, pH 7.0, with 1% Triton X-100, 1 mM DTT, 1 mM PMSF, 5 μg/ml Leupeptin (Sigma), 5 μg/ml E-64 (Serva) and incubated on ice for 30 min.
EXAMPLE 3
Assay for β1,2-xylosyltransferase Activity
[0115] The cell homogenates were assayed for β1,2-xylosyltransferase activity. Negative controls were carried out with the same number of uninfected cells. The assay mixtures contained, in a total volume of 20 μl, 13 μl of homogenised cells, 2 nmol dabsylated GnGn hexapeptide or GnGn-pyridylamine as acceptor substrate (FIG. 8a), 1 mM UDP-xylose as donor substrate, 10 mM ATP, 20 mM MnCl2 and 1 mM 2-acetamido-1,2-dideoxy-nojirimycin was included to prevent degradation of product by N-acetylhexosaminidase. The samples were incubated for 1 hour at 37° C. and analysed by MALDI-TOF mass spectrometry.
[0116] FIG. 7 shows the measured enzyme activity of the recombinant β1,2-xylosyltransferase as well as of the negative control. Grey bars show the activity when GnGn hexapeptide was used as a substrate, whereas black bars indicate the use of GnGn-pyridylamin as a substrate. The enzyme activity of the cotransfected cells was 30× higher than that of the negative controls.
[0117] The structure of the acceptor substrate of β1,2-xylosyltransferase is shown in FIG. 8a, and the postulated product in FIG. 8b, where R represents either a pyridylamine or dabsylated hexapeptide residue.
EXAMPLE 4
Mass Spectrometry of the Xylosyltransferase Product
[0118] Mass spectrometry was performed on a DYNAMO (BioAnalysis, Santa Fe, N.M.), a MALDITOF MS which is capable of dynamic extraction (synonym for late extraction). Two types of sample matrix preparations were used: dabsylated glycopeptides were dissolved in 5% formic acid, and aliquots were applied to the target, airdried, and covered with 1% alpha-cyano-4-hydroxy cinnamic acid. Pyridylaminated glycans were diluted with water, applied to the target and air-dried. After addition of 2% 2.5-dihydroxy benzoic acid, the samples were immediately dried by applying vacuum.
[0119] FIG. 9 shows the mass spectrum of these samples, (A) being the negative control: The main peak (S) shows the dabsyl-Val-Gly-Glu-(GlcNAc4Man3)Asn-Arg-Thr substrate, the calculated [M+H].sup.+ value being 2262.3. This substrate also appears as sodium addition product and as smaller ion which has been formed by fragmentation of the Azo function of the dabsyl group, at (S*). The peak at m/z=2424.4 shows the incomplete de-galactosylation of the substrate. The mass spectrum (B) shows the sample with recombinant β1,2-xylosyltransferase after incubation for 1 h at 37° C. The main peak (P), having a [M+H].sup.+ value of 2393.4, represents the xylosylated product.
EXAMPLE 5
HPLC-Analysis of the Xylosyltransferase Product
[0120] Xylosyltransferase assays were performed as described above under example 3 except that 10 nmol of GnGn-pyridylamine were used as the acceptor substrate. After 4 h of incubation the sample was analyzed both by MALDI-TOF mass spectrometry and by rever sedphase HPLC to verify the structure of the product. The presumed product peak eluting slightly ahead of the substrate GnGn-PA was collected. By MALDI-TOF MS the product's mass was determined to be 1550.9 which is in good agreement with being GnGnX-PA. Upon digestion with β-N-acetylglucosaminidase from bovine kidney (in 50 mM sodium citrate buffer of pH 5.0 for 20 h at 37° C. with 25 mU of enzyme), the glycan eluted with about the retention of MM. This is in keeping with published data on the retention of MM-PA and MMX-PA (Wilson & Altmann, 1998, Glycoconj. J. 15, 1055-1070). Further digestion with alpha-mannosidase from jack beans under the chosen conditions (20 h at 37° C. with 50 mU of enzyme) resulted in the appearance of two new peaks. As the alpha-1,3-linked mannose is considerably more sensitive to mannosidase than the alpha-1,6-linked mannose, the peaks are assigned to 00X and M0X (in the order of elution). Indeed, MOX-pyridylamine prepared from bromelain by defucosylation with acid coeluted with the presumed MOX derived from the xylosyltransferase product. The fairly high shift of elution time due to the removal of the alpha-1,3-linked mannose residue is a strong indication of the β-mannose being substituted by xylose (Wilson & Altmann, 1998, Glycoconj. J. 15, 1055-1070; Altmann, 1998, Glycoconj. J. 15, 79-82).
[0121] FIG. 10 shows the analysis of the xylosyltransferase product by reversed-phase HPLC. (A) transferase incubation mixture; (B) isolated xylosyltransferase product; (C) isolated xylosyltransferase product after digestion with P-N-acetylglucosaminidase; (D) isolated xylosyltransferase product after further digestion with alpha-mannosidase. The assignments of peaks are as follows: 1, GnGn-PA; 2, GnGnX-PA; 3, MMX-PA; 4, MOX-PA; 5, 00X-PAA; 6, M0-PA (from trace of substrate in isolated product). For abbreviations of N-glycan structures see Wilson I. B. H. and Altmann, F., 1998, Glycoconj. J. 15, 1055-1077.
[0122] FIG. 11 shows the alignment of the predicted amino acid sequence according to the WO 99/29835 A1. This alignment shows that the amino acid sequence of the purified soybean enzyme corresponds only to amino acids 199-469 of the sequence derived from the cDNA according to the present invention. Furthermore, the predicted amino acid sequence derived from the cDNA of the present application contains two insertions (corresponding to aa 375-382 and aa 425-429 of the predicted sequence) compared to the sequence of the purified soybean enzyme.
Sequence CWU
1
23127PRTsoyabean 1Ser Gln Val Gln Ala Ile His Asp Ala Ser Val Ile Ile Gly
Ala His1 5 10 15Gly Ala
Gly Leu Thr His Ile Val Ser Ala Leu 20
2528PRTsoyabean 2Gly Leu Glu Tyr His Ala Ile Asn1
5360PRTarabidopsis thaliana 3Asp Gln Val Arg Ala Ile Gln Asp Ala Ser Val
Ile Ile Gly Ala His1 5 10
15Gly Ala Gly Leu Thr His Ile Val Ser Ala Thr Pro Asn Thr Thr Ile
20 25 30Phe Glu Ile Ile Ser Val Glu
Phe Gln Arg Pro His Phe Glu Leu Ile 35 40
45Ala Lys Trp Lys Gly Leu Glu Tyr His Ala Met His 50
55 60421DNAArtificial SequenceDescription of
Artificial Sequenceprimer 4gatcaagtcc gagccattca a
21521DNAArtificial SequenceDescription of
Artificial Sequenceprimer 5cgcgtgatac tccaatcctt t
21621DNAArtificial SequenceDescription of
Artificial Sequenceprimer 6atgagtaaac ggaatccgaa g
21721DNAArtificial SequenceDescription of
Artificial Sequenceprimer 7ttagcagcca aggctcttca t
2181831DNAArabidopsis thaliana 8aaatctgcag
actctcaaaa ttccgattca tcttattgaa gaacaatttt ccggcgaaac 60agccgatgaa
gtctcgcctg aatcttctgt acctttcacc ggcgattgac ttcacttcag 120aatcgagaga
gaagaaatcg atggaaaact aaaaatagaa agagtttcaa attctcgctc 180tctcttcaaa
accgcaaatc aagggaacga gagacgagag agagagatga gtaaacggaa 240tccgaagatt
ctgaagattt ttctgtatat gttacttctc aactctctct ttctcatcat 300ctacttcgtt
tttcactcat cgtcgttttc accggagcag tcacagcctc ctcatatata 360ccacgtttca
gtgaataacc aatcggcgat tcagaaaccg tggccgatct taccttctta 420cctcccatgg
acgccgccgc agaggaatct accaactggc tcctgcgaag gttacttcgg 480gaatggattt
acaaagagag ttgacttcct taagccgagg attggaggag gaggagaagg 540aagctggttc
cgatgttttt acagtgagac attacagagt tcgatttgtg aaggaaggaa 600tctgagaatg
gttccggatc ggattgttat gtcgagagga ggtgagaagt tagaggaagt 660tatggggagg
aaagaggagg aggagcttcc tgcgtttcga caaggtgcgt ttgaggtagc 720ggaagaggtt
tcttcacggt taggttttaa gagacaccgt cgttttggtg gaggagaagg 780aggtagtgcg
gtttctcggc ggctggtgaa tgatgagatg ttgaatgaat atatgcaaga 840aggtggaatt
gatagacata caatgagaga tttggttgct tcgattcgtg ctgttgatac 900caatgatttc
gtttgtgaag agtgggtgga ggaaccgacc ttgcttgtca ctagattcga 960gtacgcaaat
ctcttccata ctgtgacaga ttggtatagt gcctatgttt cgtctagagt 1020caccggtttg
cctaatcgac ctcacgttgt tttcgttgac ggacactgca cgacgcagct 1080agaagaaaca
tggacagctt tgttttccgg aatcagatac gcaaagaact tcaccaaacc 1140ggtttgtttc
cgccacgcga ttctttcacc attgggatac gaaaccgctc tttttaaagg 1200cttgtccgga
gaaatagact gcaagggaga ttcagctcac aatctgtggc aaaacccgga 1260cgataaaagg
actgcgagga tatcagagtt tggtgaaatg atcagagcag ctttcgggtt 1320gcctgtcaat
agacaccgct cattagaaaa gccgctatca tcatcatcat catcagcctc 1380agtttataat
gttctttttg tccgccgtga agattactta gcccatcctc gtcatggcgg 1440taaagtccag
tctcggctca tcaatgagga agaagtgttc gactcgttgc atcattgggt 1500tgcaactggg
tccaccggtc tgaccaaatg cgggattaac cttgtgaatg gcttgcttgc 1560acacatgtca
atgaaagatc aagtccgagc cattcaagat gcttcagtga tcataggagc 1620tcatggagca
ggactgactc acattgtctc tgcaacacca aacacaacga tatttgagat 1680aataagcgtc
gagtttcaga gacctcattt cgagcttata gctaagtgga aaggattgga 1740gtatcacgcg
atgcatctgg cgaactcacg agcggaacca acggctgtga ttgagaagtt 1800aacggagatc
atgaagagcc ttggctgcta a
18319534PRTArabidopsis thaliana 9Met Ser Lys Arg Asn Pro Lys Ile Leu Lys
Ile Phe Leu Tyr Met Leu1 5 10
15Leu Leu Asn Ser Leu Phe Leu Ile Ile Tyr Phe Val Phe His Ser Ser
20 25 30Ser Phe Ser Pro Glu Gln
Ser Gln Pro Pro His Ile Tyr His Val Ser 35 40
45Val Asn Asn Gln Ser Ala Ile Gln Lys Pro Trp Pro Ile Leu
Pro Ser 50 55 60Tyr Leu Pro Trp Thr
Pro Pro Gln Arg Asn Leu Pro Thr Gly Ser Cys65 70
75 80Glu Gly Tyr Phe Gly Asn Gly Phe Thr Lys
Arg Val Asp Phe Leu Lys 85 90
95Pro Arg Ile Gly Gly Gly Gly Glu Gly Ser Trp Phe Arg Cys Phe Tyr
100 105 110Ser Glu Thr Leu Gln
Ser Ser Ile Cys Glu Gly Arg Asn Leu Arg Met 115
120 125Val Pro Asp Arg Ile Val Met Ser Arg Gly Gly Glu
Lys Leu Glu Glu 130 135 140Val Met Gly
Arg Lys Glu Glu Glu Glu Leu Pro Ala Phe Arg Gln Gly145
150 155 160Ala Phe Glu Val Ala Glu Glu
Val Ser Ser Arg Leu Gly Phe Lys Arg 165
170 175His Arg Arg Phe Gly Gly Gly Glu Gly Gly Ser Ala
Val Ser Arg Arg 180 185 190Leu
Val Asn Asp Glu Met Leu Asn Glu Tyr Met Gln Glu Gly Gly Ile 195
200 205Asp Arg His Thr Met Arg Asp Leu Val
Ala Ser Ile Arg Ala Val Asp 210 215
220Thr Asn Asp Phe Val Cys Glu Glu Trp Val Glu Glu Pro Thr Leu Leu225
230 235 240Val Thr Arg Phe
Glu Tyr Ala Asn Leu Phe His Thr Val Thr Asp Trp 245
250 255Tyr Ser Ala Tyr Val Ser Ser Arg Val Thr
Gly Leu Pro Asn Arg Pro 260 265
270His Val Val Phe Val Asp Gly His Cys Thr Thr Gln Leu Glu Glu Thr
275 280 285Trp Thr Ala Leu Phe Ser Gly
Ile Arg Tyr Ala Lys Asn Phe Thr Lys 290 295
300Pro Val Cys Phe Arg His Ala Ile Leu Ser Pro Leu Gly Tyr Glu
Thr305 310 315 320Ala Leu
Phe Lys Gly Leu Ser Gly Glu Ile Asp Cys Lys Gly Asp Ser
325 330 335Ala His Asn Leu Trp Gln Asn
Pro Asp Asp Lys Arg Thr Ala Arg Ile 340 345
350Ser Glu Phe Gly Glu Met Ile Arg Ala Ala Phe Gly Leu Pro
Val Asn 355 360 365Arg His Arg Ser
Leu Glu Lys Pro Leu Ser Ser Ser Ser Ser Ser Ala 370
375 380Ser Val Tyr Asn Val Leu Phe Val Arg Arg Glu Asp
Tyr Leu Ala His385 390 395
400Pro Arg His Gly Gly Lys Val Gln Ser Arg Leu Ile Asn Glu Glu Glu
405 410 415Val Phe Asp Ser Leu
His His Trp Val Ala Thr Gly Ser Thr Gly Leu 420
425 430Thr Lys Cys Gly Ile Asn Leu Val Asn Gly Leu Leu
Ala His Met Ser 435 440 445Met Lys
Asp Gln Val Arg Ala Ile Gln Asp Ala Ser Val Ile Ile Gly 450
455 460Ala His Gly Ala Gly Leu Thr His Ile Val Ser
Ala Thr Pro Asn Thr465 470 475
480Thr Ile Phe Glu Ile Ile Ser Val Glu Phe Gln Arg Pro His Phe Glu
485 490 495Leu Ile Ala Lys
Trp Lys Gly Leu Glu Tyr His Ala Met His Leu Ala 500
505 510Asn Ser Arg Ala Glu Pro Thr Ala Val Ile Glu
Lys Leu Thr Glu Ile 515 520 525Met
Lys Ser Leu Gly Cys 5301024DNAArtificial SequenceDescription of
Artificial Sequenceprimer 10aaccatctcg caaataaata agta
241124DNAArtificial SequenceDescription of
Artificial Sequenceprimer 11gtcgggttta acattacgga tttc
24121605DNAArabidopsis thaliana 12atgagtaaac
ggaatccgaa gattctgaag atttttctgt atatgttact tctcaactct 60ctctttctca
tcatctactt cgtttttcac tcatcgtcgt tttcaccgga gcagtcacag 120cctcctcata
tataccacgt ttcagtgaat aaccaatcgg cgattcagaa accgtggccg 180atcttacctt
cttacctccc atggacgccg ccgcagagga atctaccaac tggctcctgc 240gaaggttact
tcgggaatgg atttacaaag agagttgact tccttaagcc gaggattgga 300ggaggaggag
aaggaagctg gttccgatgt ttttacagtg agacattaca gagttcgatt 360tgtgaaggaa
ggaatctgag aatggttccg gatcggattg ttatgtcgag aggaggtgag 420aagttagagg
aagttatggg gaggaaagag gaggaggagc ttcctgcgtt tcgacaaggt 480gcgtttgagg
tagcggaaga ggtttcttca cggttaggtt ttaagagaca ccgtcgtttt 540ggtggaggag
aaggaggtag tgcggtttct cggcggctgg tgaatgatga gatgttgaat 600gaatatatgc
aagaaggtgg aattgataga catacaatga gagatttggt tgcttcgatt 660cgtgctgttg
ataccaatga tttcgtttgt gaagagtggg tggaggaacc gaccttgctt 720gtcactagat
tcgagtacgc aaatctcttc catactgtga cagattggta tagtgcctat 780gtttcgtcta
gagtcaccgg tttgcctaat cgacctcacg ttgttttcgt tgacggacac 840tgcacgacgc
agctagaaga aacatggaca gctttgtttt ccggaatcag atacgcaaag 900aacttcacca
aaccggtttg tttccgccac gcgattcttt caccattggg atacgaaacc 960gctcttttta
aaggcttgtc cggagaaata gactgcaagg gagattcagc tcacaatctg 1020tggcaaaacc
cggacgataa aaggactgcg aggatatcag agtttggtga aatgatcaga 1080gcagctttcg
ggttgcctgt caatagacac cgctcattag aaaagccgct atcatcatca 1140tcatcatcag
cctcagttta taatgttctt tttgtccgcc gtgaagatta cttagcccat 1200cctcgtcatg
gcggtaaagt ccagtctcgg ctcatcaatg aggaagaagt gttcgactcg 1260ttgcatcatt
gggttgcaac tgggtccacc ggtctgacca aatgcgggat taaccttgtg 1320aatggcttgc
ttgcacacat gtcaatgaaa gatcaagtcc gagccattca agatgcttca 1380gtgatcatag
gagctcatgg agcaggactg actcacattg tctctgcaac accaaacaca 1440acgatatttg
agataataag cgtcgagttt cagagacctc atttcgagct tatagctaag 1500tggaaaggat
tggagtatca cgcgatgcat ctggcgaact cacgagcgga accaacggct 1560gtgattgaga
agttaacgga gatcatgaag agccttggct gctaa
1605131605DNAArabidopsis thaliana 13atgagtaaac ggaatccgaa gattctgaag
atttttctgt atatgttact tctcaactct 60ctctttctca tcatctactt cgtttttcac
tcatcgtcgt tttcaccgga gcagtcacag 120cctcctcata tataccacgt atcagtgaat
aaccaatcgg cgattcagaa accgtggccg 180atcttacctt cttacctccc atggacgccg
ccgcagagga atctaccaac tggctcctgc 240gaaggttact tcgggaatgg atttacaaag
agagttgact tccttaagcc gaggattgga 300ggaggaggag aaggaagctg gttccgatgt
ttttacagtg agacattaca gagttcgatt 360tgtgaaggaa ggaatctgag aatggttccg
gatcggattg ttatgtcgag aggaggtgag 420aagttagagg aagttatggg gaggaaagag
gaggaggagc ttcctgcgtt tcgacaaggt 480gcgtttgagg tagcggaaga ggtttcttca
cggttaggtt ttaagagaca ccgtcgtttt 540ggtggaggag aaggaggtag tgcggtttct
cggcggctgg tgaatgatga gatgttgaat 600gaatatatgc aagaaggtgg aattgataga
catacaatga gagatttggt tgcttcgatt 660cgtgctgttg ataccaatga tttcgtttgt
gaagagtggg tggaggaacc gaccttgctt 720gtcactagat tcgagtacgc aaatctcttc
catactgtga cagattggta tagtgcctat 780gtttcgtcta gagtcaccgg tttgcctaat
cgacctcacg ttgttttcgt tgacggacac 840tgcacgacgc agctagaaga aacatggaca
gctttgtttt ccggaatcag atacgcaaag 900aacttcacca aaccggtttg tttccgccac
acgattcttt caccattggg atacgaaacc 960gctcttttta aaggcttgtc cggagaaata
gactgcaggg gagattcagc tcacaatctg 1020tggcaaaacc cggacgataa aaggactgcg
aggatatcag agtttggtga aatgatcaga 1080gcagctttcg ggttgcctgt caatagacac
cgctcattag aaaagccgct atcatcatca 1140tcatcgtcag cctcagttta taatgttctt
tttgtccgcc gtgaagatta cttagcccat 1200cctcgtcatg gcggtaaagt ccagtctcgg
ctcatcaatg aggaagaagt gttcgactcg 1260ttgcatcatt gggttgcaac tgggtccacc
ggtctgacca aatgcgggat taaccttgtg 1320aatggcttgc ttgcacacat gtcaatgaaa
gatcaagtcc gagccattca agatgcttca 1380gtgatcatag gagctcatgg agcaggactg
actcacattg tctctgcaac accaaacaca 1440acgatatttg agataataag cgtcgagttt
cagagacctc atttcgagct tatagctaag 1500tggaaaggat tggagtatca cgcgatgcat
ctggcgaact cacgagcgga accaacggct 1560gtgattgaga agttaacgga gatcatgaag
agccttggct gctaa 1605141605DNAArabidopsis thaliana
14atgagtaaac ggaatccgaa gattctgaag atttttctgt atatgttact tctcaactct
60ctctttctca tcatctactt cgtttttcac tcatcgtcgt tttcaccgga gcagtcacag
120cctcctcata tataccacgt ttcagtgaat aaccaatcgg cgattcagaa accgtggccg
180atcttacctt cttacctccc atggacgccg ccgcagagga atctaccaac tggctcctgc
240gaaggttact tcgggaatgg atttacaaag agagttgact tccttaagct gaggattgga
300ggaggaggag aaggaagctg gttccgatgt ttttacagtg agacattaca gagttcgatt
360tgtgaaggaa ggaatctgag aatggttccg gatcggattg ttatgtcgag aggaggtgag
420aagttaaagg aagttatggg gaggaaagag gaggaggagc ttcctgcgtt tcgacaaggt
480gcgtttgagg tagcggaaga ggtttcttca cggttaggtt ttaagagaca ccgtcgtttt
540ggtggaggag aaggaggtag tgcggtttct cggcggctgg tgaatgatga gatgttgaac
600gaatatatgc aagaaggtgg aattgataga catacaatga gagatttggt tgcttcgatt
660cgtgctgttg ataccaatga tttcgtttgt gaagagtggg tggaggaacc gaccttgctt
720gtcactagat tcgagtacgc aaatctcttc catactgtga cagattggta tagtgcctat
780gtttcgtcta aagtcaccgg tttgcctaat cgacctcacg ttgttttcgt tgacggacac
840tgcacgacgc agctagaaga aacatggaca gctttgtttt ccggaatcag atacgcagag
900aacttcacca aaccggtttg tttccgccac gcgattcttt caccattggg atacgaaacc
960gctcttttta aaggcttgtc cggagaaata gactgcaagg gagattcagc tcacaatctg
1020tggcaaaacc cggacgataa aaggactgcg aggatatcag agtttggtga aatgatcaga
1080gcagctttcg ggttgcctgt caatagacac cgctcattag aaaagccgct atcatcatca
1140tcatcatcag cctcagttta taatgttctt tttgtccgcc gtgaagatta cttagcccat
1200cctcgtcatg gcggtaaagt ccagtctcgg ctcatcaatg aggaagaagt gttcgactcg
1260ttgcatcatt gggttgcaac tgggtccacc ggtctgacca aatgcgggat taaccttgtg
1320aatggcttgc ttgcacacat gtcaatgaaa gatcaagtcc gagccattca agatgcttca
1380gtgatcatag gagctcatgg agcaggactg actcacattg tctctgcaac accaaacaca
1440acgatatttg agataataag cgtcgagttt cagagacctc atttcgagct tatagctaag
1500tggaaaggat tggagtatca cgcgatgcat ctggcgaact cgcgagcgga accaacggct
1560gtgattgaga agttaacgga gatcatgaag agccttggct gctaa
1605151605DNAArabidopsis thaliana 15atgagtaaac ggaatccgaa gattctgaag
atttttctgt atatgttact tctcaactct 60ctctttctca tcatctactt cgtttttcac
tcatcgtcgt tttcaccgga gcagtcacag 120cctcctcata tataccacgt ttcagtgaat
aaccaatcgg cgattcagaa accgtggccg 180atcttacctt cttacctccc atggacgccg
ccgcagagga atctaccaac tggctcctgc 240gaaggttact tcgggaatgg atttacaaag
agagttgact tccttaagcc gaggattgga 300ggaggaggag aaggaagctg gttccgatgt
ttttacagtg agacattaca gagttcgatt 360tgtgaaggaa ggaatctgag aatggttccg
gatcggattg ttatgtcgag aggaggtgag 420aagttagagg aagttatggg gaggaaagag
gaggaggagc ttcctgcgtt tcgacaaggt 480gcgtttgagg tagcggaaga ggtttcttca
cggttaggtt ttaagagaca ccgtcgtttt 540ggtggaggag aaggaggtag tgcggtttct
cggcggctgg tgaatgatga gatgttgaat 600gaatatatgc aagaaggtgg aattgataga
catacaatga gagatttggt tgcttcgatt 660cgtgctgttg ataccaatga tttcgtttgt
gaagagtggg tggaggaacc gaccttgctt 720gtcactagat tcgagtacgc aaatctcttc
catactgtga cagattggta tagtgcctat 780gtttcgtcta gagtcaccgg tttgcctaat
cgacctcacg ttgttttcgt tgacggacac 840tgcacgacgc agctagaaga aacatggaca
gctttgtttt ccggaatcag atacgcaaag 900aacttcacca aaccggtttg tttccgccac
gcgattcttt caccattggg atacgaaacc 960gctcttttta aaggcttgtc cggagaaata
gactgcaagg gagattcagc tcacaatctg 1020tggcaaaacc cggacgataa aaggactgcg
aggatatcag agtttggtga aatgatcaga 1080gcagctttcg ggttgcctgt caatagacac
cgctcattag aaaagccgct atcatcatca 1140tcatcatcag cctcagttta taatgttctt
tttgtccgcc gtgaagatta cttagcccat 1200cctcgtcatg gcggtaaagt ccagtctcgg
ctcatcaatg aggaagaagt gttcgactcg 1260ttgcatcatt gggttgcaac tgggtccacc
ggtctgacca aatgcgggat taaccttgtg 1320aatggcttgc ttgcacacat gtcaatgaaa
gatcaagtcc gagccattca agatgcttca 1380gtgatcatag gagctcatgg agcaggactg
actcacattg tctctgcaac accaaacaca 1440acgatatttg agataataag cgtcgagttt
cagagacctc atttcgagct tatagctaag 1500tggaaaggat tggagtatca cgcgatgcat
ctggcgaact cacgagcgga accaacggct 1560gtgattgaga agttaacgga gatcatgaag
agccttggct gctaa 1605161605DNAArabidopsis thaliana
16atgagtaaac ggaatccgaa gattctgaag atttttctgt atatgttact tctcaactct
60ctctttctca tcatctactt cgtttttcac tcatcgtcgt tttcaccgga gcagtcacag
120cctcctcata tataccacgt ttcagtgaat aaccaatcgg cgattcagaa accgtggccg
180atcttacctt cttacctccc atggacgccg ccgcagagga atctaccaac tggctcctgc
240gaaggttact tcgggaatgg atttacaaag agagttgact tccttaagcc gaggattgga
300ggaggaggag aaggaagctg gttccgatgt ttttacagtg agacattaca gagttcgatt
360tgtgaaggaa ggaatctgag aatggttccg gatcggattg ttatgtcgag aggaggtgag
420aagttagagg aagttatggg gaggaaagag gaggaggagc ttcctgcgtt tcgacaaggt
480gcgtttgagg tagcggaaga ggtttcttca cggttaggtt ttaagagaca ccgtcgtttt
540ggtggaggag aaggaggtag tgcggtttct cggcggctgg tgaatgatga gatgttgaat
600gaatatatgc aagaaggtgg aattgataga catacaatga gagatttggt tgcttcgatt
660cgtgctgttg ataccaatga tttcgtttgt gaagagtggg tggaggaacc gaccttgctt
720gtcactagat tcgagtacgc aaatctcttc catactgtga cagattggta tagtgcctat
780gtttcgtcta gagtcaccgg tttgcctaat cgacctcacg ttgttttcgt tgacggacac
840tgcacgacgc agctagaaga aacatggaca gctttgtttt ccggaatcag atacgcaaag
900aacttcacca aaccggtttg tttccgccac gcgattcttt caccattggg atacgaaacc
960gctcttttta aaggcttgtc cggagaaata gactccaagg gagattcagc tcacaatctg
1020tggcaaaacc cggacgataa aaggactgcg aggatatcag agtttcgtga gatgatcaga
1080gcagctttcg ggttgcctgg tcatagacta cgctcattac aaaagccgct atcatcatca
1140tcatcatcag cctcagttta taatgttctt tttgtccgcc gtgaagatta cttagcccat
1200cctcgtcatg gcggtaaagt ccagtctcgg ctcatcaatg aggaagaagt gttcgactcg
1260ttgcatcatt gggttgcaac tgggtccacc ggtctgacca aatgcgggat taaccttgtg
1320aatggcttgc ttgcacacat gtcaatgaaa gatcaagtcc gagccattca agatgcttca
1380gtgatcatag gagctcatgg agcaggactg actcacattg tctctgcaac accaaacaca
1440acgatatttg agataataag cgtcgagttt cagagacctc atttcgagct tatagctaag
1500tggaaaggat tggagtatca cgcgatgcat ctggcgaact cacgagcgga accaacggct
1560gtgattgaga agttaacgga gatcatgaag agccttggct gctaa
160517534PRTArabidopsis thaliana 17Met Ser Lys Arg Asn Pro Lys Ile Leu
Lys Ile Phe Leu Tyr Met Leu1 5 10
15Leu Leu Asn Ser Leu Phe Leu Ile Ile Tyr Phe Val Phe His Ser
Ser 20 25 30Ser Phe Ser Pro
Glu Gln Ser Gln Pro Pro His Ile Tyr His Val Ser 35
40 45Val Asn Asn Gln Ser Ala Ile Gln Lys Pro Trp Pro
Ile Leu Pro Ser 50 55 60Tyr Leu Pro
Trp Thr Pro Pro Gln Arg Asn Leu Pro Thr Gly Ser Cys65 70
75 80Glu Gly Tyr Phe Gly Asn Gly Phe
Thr Lys Arg Val Asp Phe Leu Lys 85 90
95Pro Arg Ile Gly Gly Gly Gly Glu Gly Ser Trp Phe Arg Cys
Phe Tyr 100 105 110Ser Glu Thr
Leu Gln Ser Ser Ile Cys Glu Gly Arg Asn Leu Arg Met 115
120 125Val Pro Asp Arg Ile Val Met Ser Arg Gly Gly
Glu Lys Leu Glu Glu 130 135 140Val Met
Gly Arg Lys Glu Glu Glu Glu Leu Pro Ala Phe Arg Gln Gly145
150 155 160Ala Phe Glu Val Ala Glu Glu
Val Ser Ser Arg Leu Gly Phe Lys Arg 165
170 175His Arg Arg Phe Gly Gly Gly Glu Gly Gly Ser Ala
Val Ser Arg Arg 180 185 190Leu
Val Asn Asp Glu Met Leu Asn Glu Tyr Met Gln Glu Gly Gly Ile 195
200 205Asp Arg His Thr Met Arg Asp Leu Val
Ala Ser Ile Arg Ala Val Asp 210 215
220Thr Asn Asp Phe Val Cys Glu Glu Trp Val Glu Glu Pro Thr Leu Leu225
230 235 240Val Thr Arg Phe
Glu Tyr Ala Asn Leu Phe His Thr Val Thr Asp Trp 245
250 255Tyr Ser Ala Tyr Val Ser Ser Arg Val Thr
Gly Leu Pro Asn Arg Pro 260 265
270His Val Val Phe Val Asp Gly His Cys Thr Thr Gln Leu Glu Glu Thr
275 280 285Trp Thr Ala Leu Phe Ser Gly
Ile Arg Tyr Ala Lys Asn Phe Thr Lys 290 295
300Pro Val Cys Phe Arg His Ala Ile Leu Ser Pro Leu Gly Tyr Glu
Thr305 310 315 320Ala Leu
Phe Lys Gly Leu Ser Gly Glu Ile Asp Cys Lys Gly Asp Ser
325 330 335Ala His Asn Leu Trp Gln Asn
Pro Asp Asp Lys Arg Thr Ala Arg Ile 340 345
350Ser Glu Phe Gly Glu Met Ile Arg Ala Ala Phe Gly Leu Pro
Val Asn 355 360 365Arg His Arg Ser
Leu Glu Lys Pro Leu Ser Ser Ser Ser Ser Ser Ala 370
375 380Ser Val Tyr Asn Val Leu Phe Val Arg Arg Glu Asp
Tyr Leu Ala His385 390 395
400Pro Arg His Gly Gly Lys Val Trp Ser Arg Leu Ile Asn Glu Glu Glu
405 410 415Val Phe Asp Ser Leu
His His Trp Val Ala Thr Gly Ser Thr Gly Leu 420
425 430Thr Lys Cys Gly Ile Asn Leu Val Asn Gly Leu Leu
Ala His Met Ser 435 440 445Met Lys
Asp Gln Val Arg Ala Ile Gln Asp Ala Ser Val Ile Ile Gly 450
455 460Ala His Gly Ala Gly Leu Thr His Ile Val Ser
Ala Thr Pro Asn Thr465 470 475
480Thr Ile Phe Glu Ile Ile Ser Val Glu Phe Gln Arg Pro His Phe Glu
485 490 495Leu Ile Ala Lys
Trp Lys Gly Leu Glu Tyr His Ala Met His Leu Ala 500
505 510Asn Ser Arg Ala Glu Pro Thr Ala Val Ile Glu
Lys Leu Thr Glu Ile 515 520 525Met
Lys Ser Leu Gly Cys 53018534PRTArabidopsis thaliana 18Met Ser Lys Arg
Asn Pro Lys Ile Leu Lys Ile Phe Leu Tyr Met Leu1 5
10 15Leu Leu Asn Ser Leu Phe Leu Ile Ile Tyr
Phe Val Phe His Ser Ser 20 25
30Ser Phe Ser Pro Glu Gln Ser Gln Pro Pro His Ile Tyr His Val Ser
35 40 45Val Asn Asn Gln Ser Ala Ile Gln
Lys Pro Trp Pro Ile Leu Pro Ser 50 55
60Tyr Leu Pro Trp Thr Pro Pro Gln Arg Asn Leu Pro Thr Gly Ser Cys65
70 75 80Glu Gly Tyr Phe Gly
Asn Gly Phe Thr Lys Arg Val Asp Phe Leu Lys 85
90 95Pro Arg Ile Gly Gly Gly Gly Glu Gly Ser Trp
Phe Arg Cys Phe Tyr 100 105
110Ser Glu Thr Leu Gln Ser Ser Ile Cys Glu Gly Arg Asn Leu Arg Met
115 120 125Val Pro Asp Arg Ile Val Met
Ser Arg Gly Gly Glu Lys Leu Glu Glu 130 135
140Val Met Gly Arg Lys Glu Glu Glu Glu Leu Pro Ala Phe Arg Gln
Gly145 150 155 160Ala Phe
Glu Val Ala Glu Glu Val Ser Ser Arg Leu Gly Phe Lys Arg
165 170 175His Arg Arg Phe Gly Gly Gly
Glu Gly Gly Ser Ala Val Ser Arg Arg 180 185
190Leu Val Asn Asp Glu Met Leu Asn Glu Tyr Met Gln Glu Gly
Gly Ile 195 200 205Asp Arg His Thr
Met Arg Asp Leu Val Ala Ser Ile Arg Ala Val Asp 210
215 220Thr Asn Asp Phe Val Cys Glu Glu Trp Val Glu Glu
Pro Thr Leu Leu225 230 235
240Val Thr Arg Phe Glu Tyr Ala Asn Leu Phe His Thr Val Thr Asp Trp
245 250 255Tyr Ser Ala Tyr Val
Ser Ser Arg Val Thr Gly Leu Pro Asn Arg Pro 260
265 270His Val Val Phe Val Asp Gly His Cys Thr Thr Gln
Leu Glu Glu Thr 275 280 285Trp Thr
Ala Leu Phe Ser Gly Ile Arg Tyr Ala Lys Asn Phe Thr Lys 290
295 300Pro Val Cys Phe Arg His Thr Ile Leu Ser Pro
Leu Gly Tyr Glu Thr305 310 315
320Ala Leu Phe Lys Gly Leu Ser Gly Glu Ile Asp Cys Arg Gly Asp Ser
325 330 335Ala His Asn Leu
Trp Gln Asn Pro Asp Asp Lys Arg Thr Ala Arg Ile 340
345 350Ser Glu Phe Gly Glu Met Ile Arg Ala Ala Phe
Gly Leu Pro Val Asn 355 360 365Arg
His Arg Ser Leu Glu Lys Pro Leu Ser Ser Ser Ser Ser Ser Ala 370
375 380Ser Val Tyr Asn Val Leu Phe Val Arg Arg
Glu Asp Tyr Leu Ala His385 390 395
400Pro Arg His Gly Gly Lys Val Trp Ser Arg Leu Ile Asn Glu Glu
Glu 405 410 415Val Phe Asp
Ser Leu His His Trp Val Ala Thr Gly Ser Thr Gly Leu 420
425 430Thr Lys Cys Gly Ile Asn Leu Val Asn Gly
Leu Leu Ala His Met Ser 435 440
445Met Lys Asp Gln Val Arg Ala Ile Gln Asp Ala Ser Val Ile Ile Gly 450
455 460Ala His Gly Ala Gly Leu Thr His
Ile Val Ser Ala Thr Pro Asn Thr465 470
475 480Thr Ile Phe Glu Ile Ile Ser Val Glu Phe Gln Arg
Pro His Phe Glu 485 490
495Leu Ile Ala Lys Trp Lys Gly Leu Glu Tyr His Ala Met His Leu Ala
500 505 510Asn Ser Arg Ala Glu Pro
Thr Ala Val Ile Glu Lys Leu Thr Glu Ile 515 520
525Met Lys Ser Leu Gly Cys 53019534PRTArabidopsis
thaliana 19Met Ser Lys Arg Asn Pro Lys Ile Leu Lys Ile Phe Leu Tyr Met
Leu1 5 10 15Leu Leu Asn
Ser Leu Phe Leu Ile Ile Tyr Phe Val Phe His Ser Ser 20
25 30Ser Phe Ser Pro Glu Gln Ser Gln Pro Pro
His Ile Tyr His Val Ser 35 40
45Val Asn Asn Gln Ser Ala Ile Gln Lys Pro Trp Pro Ile Leu Pro Ser 50
55 60Tyr Leu Pro Trp Thr Pro Pro Gln Arg
Asn Leu Pro Thr Gly Ser Cys65 70 75
80Glu Gly Tyr Phe Gly Asn Gly Phe Thr Lys Arg Val Asp Phe
Leu Lys 85 90 95Leu Arg
Ile Gly Gly Gly Gly Glu Gly Ser Trp Phe Arg Cys Phe Tyr 100
105 110Ser Glu Thr Leu Gln Ser Ser Ile Cys
Glu Gly Arg Asn Leu Arg Met 115 120
125Val Pro Asp Arg Ile Val Met Ser Arg Gly Gly Glu Lys Leu Lys Glu
130 135 140Val Met Gly Arg Lys Glu Glu
Glu Glu Leu Pro Ala Phe Arg Gln Gly145 150
155 160Ala Phe Glu Val Ala Glu Glu Val Ser Ser Arg Leu
Gly Phe Lys Arg 165 170
175His Arg Arg Phe Gly Gly Gly Glu Gly Gly Ser Ala Val Ser Arg Arg
180 185 190Leu Val Asn Asp Glu Met
Leu Asn Glu Tyr Met Gln Glu Gly Gly Ile 195 200
205Asp Arg His Thr Met Arg Asp Leu Val Ala Ser Ile Arg Ala
Val Asp 210 215 220Thr Asn Asp Phe Val
Cys Glu Glu Trp Val Glu Glu Pro Thr Leu Leu225 230
235 240Val Thr Arg Phe Glu Tyr Ala Asn Leu Phe
His Thr Val Thr Asp Trp 245 250
255Tyr Ser Ala Tyr Val Ser Ser Lys Val Thr Gly Leu Pro Asn Arg Pro
260 265 270His Val Val Phe Val
Asp Gly His Cys Thr Thr Gln Leu Glu Glu Thr 275
280 285Trp Thr Ala Leu Phe Ser Gly Ile Arg Tyr Ala Lys
Asn Phe Thr Arg 290 295 300Pro Val Cys
Phe Arg His Ala Ile Leu Ser Pro Leu Gly Tyr Glu Thr305
310 315 320Ala Leu Phe Lys Gly Leu Ser
Gly Glu Ile Asp Cys Lys Gly Asp Ser 325
330 335Ala His Asn Leu Trp Gln Asn Pro Asp Asp Lys Arg
Thr Ala Arg Ile 340 345 350Ser
Glu Phe Gly Glu Met Ile Arg Ala Ala Phe Gly Leu Pro Val Asn 355
360 365Arg His Arg Ser Leu Glu Lys Pro Leu
Ser Ser Ser Ser Ser Ser Ala 370 375
380Ser Val Tyr Asn Val Leu Phe Val Arg Arg Glu Asp Tyr Leu Ala His385
390 395 400Pro Arg His Gly
Gly Lys Val Trp Ser Arg Leu Ile Asn Glu Glu Glu 405
410 415Val Phe Asp Ser Leu His His Trp Val Ala
Thr Gly Ser Thr Gly Leu 420 425
430Thr Lys Cys Gly Ile Asn Leu Val Asn Gly Leu Leu Ala His Met Ser
435 440 445Met Lys Asp Gln Val Arg Ala
Ile Gln Asp Ala Ser Val Ile Ile Gly 450 455
460Ala His Gly Ala Gly Leu Thr His Ile Val Ser Ala Thr Pro Asn
Thr465 470 475 480Thr Ile
Phe Glu Ile Ile Ser Val Glu Phe Gln Arg Pro His Phe Glu
485 490 495Leu Ile Ala Lys Trp Lys Gly
Leu Glu Tyr His Ala Met His Leu Ala 500 505
510Asn Ser Arg Ala Glu Pro Thr Ala Val Ile Glu Lys Leu Thr
Glu Ile 515 520 525Met Lys Ser Leu
Gly Cys 53020534PRTArabidopsis thaliana 20Met Ser Lys Arg Asn Pro Lys
Ile Leu Lys Ile Phe Leu Tyr Met Leu1 5 10
15Leu Leu Asn Ser Leu Phe Leu Ile Ile Tyr Phe Val Phe
His Ser Ser 20 25 30Ser Phe
Ser Pro Glu Gln Ser Gln Pro Pro His Ile Tyr His Val Ser 35
40 45Val Asn Asn Gln Ser Ala Ile Gln Lys Pro
Trp Pro Ile Leu Pro Ser 50 55 60Tyr
Leu Pro Trp Thr Pro Pro Gln Arg Asn Leu Pro Thr Gly Ser Cys65
70 75 80Glu Gly Tyr Phe Gly Asn
Gly Phe Thr Lys Arg Val Asp Phe Leu Lys 85
90 95Pro Arg Ile Gly Gly Gly Gly Glu Gly Ser Trp Phe
Arg Cys Phe Tyr 100 105 110Ser
Glu Thr Leu Gln Ser Ser Ile Cys Glu Gly Arg Asn Leu Arg Met 115
120 125Val Pro Asp Arg Ile Val Met Ser Arg
Gly Gly Glu Lys Leu Glu Glu 130 135
140Val Met Gly Arg Lys Glu Glu Glu Glu Leu Pro Ala Phe Arg Gln Gly145
150 155 160Ala Phe Glu Val
Ala Glu Glu Val Ser Ser Arg Leu Gly Phe Lys Arg 165
170 175His Arg Arg Phe Gly Gly Gly Glu Gly Gly
Ser Ala Val Ser Arg Arg 180 185
190Leu Val Asn Asp Glu Met Leu Asn Glu Tyr Met Gln Glu Gly Gly Ile
195 200 205Asp Arg His Thr Met Arg Asp
Leu Val Ala Ser Ile Arg Ala Val Asp 210 215
220Thr Asn Asp Phe Val Cys Glu Glu Trp Val Glu Glu Pro Thr Leu
Leu225 230 235 240Val Thr
Arg Phe Glu Tyr Ala Asn Leu Phe His Thr Val Thr Asp Trp
245 250 255Tyr Ser Ala Tyr Val Ser Ser
Arg Val Thr Gly Leu Pro Asn Arg Pro 260 265
270His Val Val Phe Val Asp Gly His Cys Thr Thr Gln Leu Glu
Glu Thr 275 280 285Trp Thr Ala Leu
Phe Ser Gly Ile Arg Tyr Ala Lys Asn Phe Thr Lys 290
295 300Pro Val Cys Phe Arg His Ala Ile Leu Ser Pro Leu
Gly Tyr Glu Thr305 310 315
320Ala Leu Phe Lys Gly Leu Ser Gly Glu Ile Asp Cys Lys Gly Asp Ser
325 330 335Ala His Asn Leu Trp
Gln Asn Pro Asp Asp Lys Arg Thr Ala Arg Ile 340
345 350Ser Glu Phe Gly Glu Met Ile Arg Ala Ala Phe Gly
Leu Pro Val Asn 355 360 365Arg His
Arg Ser Leu Glu Lys Pro Leu Ser Ser Ser Ser Ser Ser Ala 370
375 380Ser Val Tyr Asn Val Leu Phe Val Arg Arg Glu
Asp Tyr Leu Ala His385 390 395
400Pro Arg His Gly Gly Lys Val Trp Ser Arg Leu Ile Asn Glu Glu Glu
405 410 415Val Phe Asp Ser
Leu His His Trp Val Ala Thr Gly Ser Thr Gly Leu 420
425 430Thr Lys Cys Gly Ile Asn Leu Val Asn Gly Leu
Leu Ala His Met Ser 435 440 445Met
Lys Asp Gln Val Arg Ala Ile Gln Asp Ala Ser Val Ile Ile Gly 450
455 460Ala His Gly Ala Gly Leu Thr His Ile Val
Ser Ala Thr Pro Asn Thr465 470 475
480Thr Ile Phe Glu Ile Ile Ser Val Glu Phe Gln Arg Pro His Phe
Glu 485 490 495Leu Ile Ala
Lys Trp Lys Gly Leu Glu Tyr His Ala Met His Leu Ala 500
505 510Asn Ser Arg Ala Glu Pro Thr Ala Val Ile
Glu Lys Leu Thr Glu Ile 515 520
525Met Lys Ser Leu Gly Cys 53021534PRTArabidopsis thaliana 21Met Ser
Lys Arg Asn Pro Lys Ile Leu Lys Ile Phe Leu Tyr Met Leu1 5
10 15Leu Leu Asn Ser Leu Phe Leu Ile
Ile Tyr Phe Val Phe His Ser Ser 20 25
30Ser Phe Ser Pro Glu Gln Ser Gln Pro Pro His Ile Tyr His Val
Ser 35 40 45Val Asn Asn Gln Ser
Ala Ile Gln Lys Pro Trp Pro Ile Leu Pro Ser 50 55
60Tyr Leu Pro Trp Thr Pro Pro Gln Arg Asn Leu Pro Thr Gly
Ser Cys65 70 75 80Glu
Gly Tyr Phe Gly Asn Gly Phe Thr Lys Arg Val Asp Phe Leu Lys
85 90 95Pro Arg Ile Gly Gly Gly Gly
Glu Gly Ser Trp Phe Arg Cys Phe Tyr 100 105
110Ser Glu Thr Leu Gln Ser Ser Ile Cys Glu Gly Arg Asn Leu
Arg Met 115 120 125Val Pro Asp Arg
Ile Val Met Ser Arg Gly Gly Glu Lys Leu Glu Glu 130
135 140Val Met Gly Arg Lys Glu Glu Glu Glu Leu Pro Ala
Phe Arg Gln Gly145 150 155
160Ala Phe Glu Val Ala Glu Glu Val Ser Ser Arg Leu Gly Phe Lys Arg
165 170 175His Arg Arg Phe Gly
Gly Gly Glu Gly Gly Ser Ala Val Ser Arg Arg 180
185 190Leu Val Asn Asp Glu Met Leu Asn Glu Tyr Met Gln
Glu Gly Gly Ile 195 200 205Asp Arg
His Thr Met Arg Asp Leu Val Ala Ser Ile Arg Ala Val Asp 210
215 220Thr Asn Asp Phe Val Cys Glu Glu Trp Val Glu
Glu Pro Thr Leu Leu225 230 235
240Val Thr Arg Phe Glu Tyr Ala Asn Leu Phe His Thr Val Thr Asp Trp
245 250 255Tyr Ser Ala Tyr
Val Ser Ser Arg Val Thr Gly Leu Pro Asn Arg Pro 260
265 270His Val Val Phe Val Asp Gly His Cys Thr Thr
Gln Leu Glu Glu Thr 275 280 285Trp
Thr Ala Leu Phe Ser Gly Ile Arg Tyr Ala Lys Asn Phe Thr Lys 290
295 300Pro Val Cys Phe Arg His Ala Ile Leu Ser
Pro Leu Gly Tyr Glu Thr305 310 315
320Ala Leu Phe Lys Gly Leu Ser Gly Glu Ile Asp Arg Lys Gly Asp
Ser 325 330 335Ala His Asn
Leu Trp Gln Asn Pro Asp Asp Lys Arg Thr Ala Arg Ile 340
345 350Ser Glu Phe Arg Glu Met Ile Arg Ala Ala
Phe Gly Leu Pro Gly His 355 360
365Ser Asn Arg Ser Leu Arg Lys Pro Leu Ser Ser Ser Ser Ser Ser Ala 370
375 380Ser Val Tyr Asn Val Leu Phe Val
Arg Arg Glu Asp Tyr Leu Ala His385 390
395 400Pro Arg His Gly Gly Lys Val Trp Ser Arg Leu Ile
Asn Glu Glu Glu 405 410
415Val Phe Asp Ser Leu His His Trp Val Ala Thr Gly Ser Thr Gly Leu
420 425 430Thr Lys Cys Gly Ile Asn
Leu Val Asn Gly Leu Leu Ala His Met Ser 435 440
445Met Lys Asp Gln Val Arg Ala Ile Gln Asp Ala Ser Val Ile
Ile Gly 450 455 460Ala His Gly Ala Gly
Leu Thr His Ile Val Ser Ala Thr Pro Asn Thr465 470
475 480Thr Ile Phe Glu Ile Ile Ser Val Glu Phe
Gln Arg Pro His Phe Glu 485 490
495Leu Ile Ala Lys Trp Lys Gly Leu Glu Tyr His Ala Met His Leu Ala
500 505 510Asn Ser Arg Ala Glu
Pro Thr Ala Val Ile Glu Lys Leu Thr Glu Ile 515
520 525Met Lys Ser Leu Gly Cys
53022468PRTSoybeanmisc_feature(2)..(180)Xaa can be any naturally
occurring amino acid 22Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa1 5 10
15Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20 25 30Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 35 40
45Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa 50 55 60Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa65 70
75 80Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa 85 90
95Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
100 105 110Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 115 120
125Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa 130 135 140Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa145 150
155 160Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa 165 170
175Xaa Xaa Xaa Xaa Ser Gly Pro Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
180 185 190Xaa Xaa Xaa Xaa Xaa
Xaa Leu Asp Glu Tyr Val Pro Arg Gly Gly Ile 195
200 205Asp Arg His Thr Met Arg Asp Leu Ile Ala Lys Ile
Arg Ile Val Arg 210 215 220Gly Lys Asp
Glu Gln Cys Asp Glu Trp Ile Glu Glu Pro Thr Leu Leu225
230 235 240Val Arg Phe Glu Tyr Ala Asn
Leu Phe His Thr Val Thr Asp Trp Tyr 245
250 255Ser Ala Tyr Val Ser Ser Arg Val Thr Ala Leu Pro
Asn Arg Pro His 260 265 270Val
Ile Phe Val Asp Gly His Cys Lys Ala Pro Leu Glu Glu Thr Trp 275
280 285Lys Ala Leu Phe Ser Ser Val Gly Tyr
Ala Lys Ser Phe Ser Gly Ser 290 295
300Val Cys Phe His His Ala Ile Leu Ser Pro Leu Gly Tyr Glu Thr Ala305
310 315 320Met Phe Arg Gly
Leu Ser Glu His Ile Asp Cys Tyr Gly Ala Pro Ala 325
330 335Gln Glu Leu Leu Gln Asn Leu Asn Asp His
Lys Thr Ala Arg Leu Ser 340 345
350Glu Phe Gly Glu Met Val Arg Ala Ala Phe Gly Leu Pro Leu Asn Val
355 360 365Asn His Xaa Met Glu Xaa Xaa
Xaa Xaa Xaa Xaa Xaa Xaa Pro Leu Ala 370 375
380Gly His Asn Val Leu Phe Val Arg Arg Glu Asp Tyr Leu Ala His
Pro385 390 395 400Arg Gly
Ser Gly Lys Leu Glu Ser Arg Leu Ser Asn Glu Gln Glu Val
405 410 415Phe Asn Ser Leu Lys Ser Trp
Xaa Xaa Xaa Xaa Xaa Ser Asn Tyr Lys 420 425
430Gly Cys Lys Ile Asn Leu Val Asn Gly Leu Phe Ala His Met
Ser Met 435 440 445Lys Asp Gln Val
Gln Ala Ile His Asp Ala Ser Val Ile Ile Gly Ala 450
455 460His Gly Ala Gly46523534PRTArabidopsis thaliana
23Met Ser Lys Arg Asn Pro Lys Ile Leu Lys Ile Phe Leu Tyr Met Leu1
5 10 15Leu Leu Asn Ser Leu Phe
Leu Ile Ile Tyr Phe Val Phe His Ser Ser 20 25
30Ser Phe Ser Pro Glu Gln Ser Gln Pro Pro His Ile Tyr
His Val Ser 35 40 45Val Asn Asn
Gln Ser Ala Ile Gln Lys Pro Trp Pro Ile Leu Pro Ser 50
55 60Tyr Leu Pro Trp Thr Pro Pro Gln Arg Asn Leu Pro
Thr Gly Ser Cys65 70 75
80Glu Gly Tyr Gly Phe Asn Gly Phe Thr Lys Arg Val Asp Phe Leu Lys
85 90 95Pro Arg Ile Gly Gly Gly
Gly Glu Gly Ser Trp Phe Arg Cys Phe Tyr 100
105 110Ser Glu Thr Leu Gln Ser Ser Ile Cys Glu Gly Arg
Asn Leu Arg Met 115 120 125Val Pro
Asp Arg Ile Val Met Ser Arg Gly Gly Glu Lys Leu Glu Glu 130
135 140Val Met Gly Arg Lys Glu Glu Glu Glu Leu Pro
Ala Phe Arg Gln Gly145 150 155
160Ala Phe Glu Val Ala Glu Glu Val Ser Ser Arg Leu Gly Phe Lys Arg
165 170 175His Arg Arg Phe
Gly Gly Gly Glu Gly Gly Ser Ala Val Ser Arg Arg 180
185 190Leu Val Asn Asp Glu Met Leu Asn Glu Tyr Met
Gln Glu Gly Gly Ile 195 200 205Asp
Arg His Thr Met Arg Asp Leu Val Ala Ser Ile Arg Ala Val Asp 210
215 220Thr Asn Asp Phe Val Cys Glu Glu Trp Val
Glu Glu Pro Thr Leu Leu225 230 235
240Val Thr Arg Phe Glu Tyr Ala Asn Leu Phe His Thr Val Thr Asp
Trp 245 250 255Tyr Ser Ala
Tyr Val Ser Ser Arg Val Thr Gly Leu Pro Asn Arg Pro 260
265 270His Val Val Phe Val Asp Gly His Cys Thr
Thr Gln Leu Glu Glu Thr 275 280
285Trp Thr Ala Leu Phe Ser Gly Ile Arg Tyr Ala Lys Asn Phe Thr Lys 290
295 300Pro Val Cys Phe Arg His Ala Ile
Leu Ser Pro Leu Gly Tyr Glu Thr305 310
315 320Ala Leu Phe Thr Gly Leu Ser Gly Glu Ile Asp Cys
Lys Gly Asp Ser 325 330
335Ala His Asn Leu Trp Gln Asn Pro Asp Asp Lys Arg Thr Ala Arg Ile
340 345 350Ser Glu Phe Gly Glu Met
Ile Arg Ala Ala Phe Gly Leu Pro Val Asn 355 360
365Arg His Arg Ser Leu Glu Lys Pro Leu Ser Ser Ser Ser Ser
Ser Ala 370 375 380Ser Val Tyr Asn Val
Leu Phe Val Arg Arg Glu Asp Tyr Leu Ala His385 390
395 400Pro Arg His Gly Gly Lys Val Gln Ser Arg
Leu Ile Asn Glu Glu Glu 405 410
415Val Phe Asp Ser Leu His His Trp Val Ala Thr Gly Ser Thr Gly Leu
420 425 430Thr Lys Cys Gly Ile
Asn Leu Val Asn Gly Leu Leu Ala His Met Ser 435
440 445Met Lys Asp Gln Val Arg Ala Ile Gln Asp Ala Ser
Val Ile Ile Gly 450 455 460Ala His Gly
Ala Gly Leu Thr His Ile Val Ser Ala Thr Pro Asn Thr465
470 475 480Thr Ile Phe Glu Ile Ile Ser
Val Glu Phe Gln Arg Pro His Phe Glu 485
490 495Leu Ile Ala Lys Trp Lys Gly Leu Glu Tyr His Ala
Met His Leu Ala 500 505 510Asn
Ser Arg Ala Glu Pro Thr Ala Val Ile Glu Lys Leu Thr Glu Ile 515
520 525Met Lys Ser Leu Gly Cys 530
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20140362899 | RECEIVING CIRCUIT |
20140362898 | BASE STATION ASSISTED FREQUENCY DOMAIN EQUALIZATION |
20140362897 | MITIGATION OF WIRELESS SIGNAL INTERFERENCE |
20140362896 | CALIBRATION METHOD AND SYSTEM FOR ESTIMATING A PATH DIFFERENCE OF A TARGET SIGNAL TRANSMITTED BY A SPACECRAFT OR AIRCRAFT |
20140362895 | METHOD, PROGRAM PRODUCT, AND TEST DEVICE FOR TESTING BIT ERROR RATE OF NETWORK MODULE |