Patent application title: GENE SILENCING
Inventors:
Caius Rommens (Boise, ID, US)
Hua Yan (Boise, ID, US)
Hua Yan (Boise, ID, US)
Oleg Bougri (Boise, ID, US)
Kathy M.m. Swords (Boise, ID, US)
IPC8 Class: AA01H106FI
USPC Class:
426615
Class name: Food or edible material: processes, compositions, and products products per se, or processes of preparing or treating compositions involving chemical reaction by addition, combining diverse food material, or permanent additive plant material is basic ingredient other than extract, starch or protein
Publication date: 2012-10-11
Patent application number: 20120258233
Abstract:
The present invention relates to unique strategies and constructs for
producing a nucleic acid product that downregulates or prevents
expression of a desired target polynucleotide.Claims:
1-13. (canceled)
14. A construct, comprising an expression cassette which comprises in the 5' to 3' orientation (i) a promoter, (ii) a first polynucleotide that comprises a sequence that shares sequence identity with at least a part of a promoter sequence associated with a target gene, (iii) a second polynucleotide comprising a sequence that is a perfect or imperfect inverse complement of the first polynucleotide, and (iv) a terminator, wherein the first promoter is operably linked to the 5'-end of the first polynucleotide and the second polynucleotide is operably linked to the terminator.
15-32. (canceled)
33. The construct of claim 14, wherein the first polynucleotide and the promoter sequence associated with the target gene are fully identical in sequence over at least 23 nucleotides.
34. The construct of claim 14, wherein the second polynucleotide and the first polynucleotide do not comprise any sequence located downstream of the transcription start of the target gene.
35. The construct of claim 14, wherein (a) at least part of the first desired polynucleotide is in the antisense orientation; or (b) at least part of the first desired polynucleotide is in the sense orientation.
36. The construct of claim 14, further comprising a spacer polynucleotide positioned between the first and second polynucleotides, wherein the spacer polynucleotide is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or more than 500 nucleotides long.
37. The construct of claim 14, wherein the promoters and the terminator are functional in a plant and wherein the expression cassette is located between transfer-DNA border sequences of a plasmid that is suitable for bacterium-mediated plant transformation, wherein the bacterium is a strain of Agrobacterium, Rhizobium, or Phyllobacterium.
38. The construct of claim 14, wherein the promoter is a constitutive promoter, a near-constitutive promoter, a tissue-specific promoter, or an inducible promoter.
39. The construct of claim 14, wherein the promoter is a functional plant promoter specific to a plant tissue selected from the group consisting of in tubers, seeds, leaves, roots, vascular system, flowers, pollen, and ovules.
40. The construct of claim 14, wherein the target gene is a COMT gene involved in lignin biosynthesis, a CCOMT gene involved in lignin biosynthesis, any other gene involved in lignin biosynthesis, an R1 gene involved in starch phosphorylation, a phosphorylase gene involved in starch phosphorylation, a PPO gene involved in oxidation of polyphenols, a polygalacturonase gene involved in pectin degradation, a gene involved in the production of allergens, or a gene involved in fatty acid biosynthesis.
41. The construct of claim 14, wherein the promoter sequence associated with the target gene is selected from the group consisting of (1) a starch-associated R1 gene promoter, (2) a polyphenol oxidase gene promoter, (3) a fatty acid desaturase 12 gene promoter, (4) a microsomal omega-6 fatty acid desaturase gene promoter, (5) a cotton stearoyl-acyl-carrier protein delta 9-desaturase gene promoter, (6) an oleoyl-phosphatidylcholine omega 6-desaturase gene promoter, (7) a Medicago truncatula caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene promoter, (8) a Medicago sativa (alfalfa) caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene promoter, (9) a Medicago truncatula caffeoyl CoA 3-O-methyltransferase (CCOMT) gene promoter, (10) a Medicago sativa (alfalfa) caffeoyl CoA 3-O-methyltransferase (CCOMT) gene promoter, (11) a Zea mays (maize) COMT gene, (12) a major apple allergen Mal d 1 gene promoter, (13) a major peanut allergen Ara h 2 gene promoter, (14) a major soybean allergen Gly m Bd 30 K gene promoter, (15) a polygalacturonase gene promoter, (16) any other endogenous promoter.
42. A method for downregulating the expression of a target gene in a plant cell, comprising expressing the construct of claim 14 in a plant cell, wherein the expression level of the target gene is downregulated compared to the expression level of the target gene in an untransformed plant cell.
43. A method for enhancing tolerance to black spot bruising in a tuber, comprising expressing the construct of claim 14 in a cell of a tuber, wherein (a) the first polynucleotide comprises the sequence that shares sequence identity with at least a part of a tuber-expressed polyphenol oxidase gene promoter, (b) the promoter is GBSS or AGP, and (c) expression of the construct in the cell reduces transcription and/or translation of a polyphenol oxidase gene in the tuber cell genome, thereby enhancing the tolerance of the tuber to black spot bruising.
44. A method for reducing cold-induced sweetening in a tuber, comprising expressing the construct of claim 14 in a cell of a tuber, wherein (a) the first polynucleotide comprises the sequence that shares sequence identity with at least a part of an R1 gene promoter, (b) the promoters is GBSS or AGP, and (c) expression of the construct in the cell reduces transcription and/or translation of an R1 gene in the tuber cell genome, thereby reducing cold-induced sweetening in the tuber.
45. A method for increasing oleic acid levels in an oil-bearing plant, comprising expressing the construct of claim 14 in a cell of a seed of an oil-bearing plant, wherein (a) the first polynucleotide comprises a sequence that shares sequence identity with at least a part of a Fad2 gene promoter, (b) the promoter is a napin gene promoter, a Fad2 gene promoter, or a stearoyl-ACP desaturase gene promoter, and (c) expression of the construct in the cell reduces transcription and/or translation of a Fad2 gene in the cell of the seed of the oil-bearing plant, thereby increasing the oil content of the seed.
46. A method for reducing lignin content in a plant, comprising expressing the construct of claim 14 in a cell of the plant, wherein (a) the first polynucleotide comprises a sequence that shares sequence identity with at least part of the sequence of the promoter associated with a gene selected from the group consisting of caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene and caffeoyl CoA 3-O-methyltransferase (CCOMT) gene, (b) the promoter is a petE or Pal gene promoter, and (c) expression of the construct in the cell reduces transcription and/or translation of a COMT or CCOMT gene in the cell of the plant, thereby reducing lignin content in a plant.
47. A method for reducing the degradation of pectin in a fruit of a plant, comprising expressing the construct of claim 14 in a fruit cell of the plant, wherein (a) the first polynucleotide comprises a sequence that shares sequence identity with at least a part of a polygalacturonase gene promoter, (b) the promoter is a fruit-specific promoter, and (c) expression of the construct in the fruit cell reduces transcription and/or translation of a polygalacturonase gene in the cell of the plant, thereby reducing the degradation of pectin in the fruit.
48. A method for reducing the allergenicity of a food produced by an apple plant, comprising expressing the construct of claim 14 in a cell of the apple plant, wherein (a) the first polynucleotide comprises a sequence from the Mal d I gene promoter, and (b) expression of the construct in the apple plant reduces transcription and/or translation of Mal d I in the apple.
49. A method for reducing the allergenicity of a food produced by a peanut plant, comprising expressing the construct of claim 14 in a cell of the peanut plant, wherein (a) the first polynucleotide comprises a sequence from the Ara h 2 gene promoter, and (b) expression of the construct in the peanut plant reduces transcription and/or translation of Ara h 2 in the peanut.
50. A method for reducing the allergenicity of a food produced by a soybean plant, comprising expressing the construct of claim 14 in a cell of the soybean plant, wherein (a) the first polynucleotide comprises a sequence from the Gly m Bd gene promoter, and (b) expression of the construct in the soybean plant reduces transcription and/or translation of Gly m Bd in the soybean.
51. A plant cell obtained according to the method of claim 42.
52. A plant comprising the plant cell of claim 51.
53. A food product made from the plant of claim 52.
Description:
CROSS-REFERENCE TO RELATED PATENT APPLICATIONS
[0001] This application is a Divisional of U.S. Ser. No. 11/662,872, filed Oct. 15, 2007, which is a U.S. National Stage of PCT/US2005/033992, filed Sep. 23, 2005, which claims priority to U.S. provisional application Ser. Nos. 60/612,638, filed on Sep. 24, 2004, 60/619,959, filed on Oct. 20, 2004, 60/653,609, filed on Feb. 16, 2005, and 60/668,071, filed on Apr. 5, 2005, all of which are incorporated herein by reference.
FIELD OF THE INVENTION
[0002] The present invention relates to unique constructs for producing a nucleic acid product that downregulates or prevents expression of a desired target polynucleotide.
BACKGROUND OF THE INVENTION
[0003] Suppression of gene expression may be accomplished by constructs that trigger post-transcriptional or transcriptional gene silencing. These silencing mechanisms may downregulate desired polynucleotide or gene expression by chromatin modification, RNA cleavage, translational repression, or via hitherto unknown mechanisms. See Meister G. and Tuschl T., Nature, vol. 431, pp. 343-349, 2004.
[0004] A construct that is typically used in this regard contains a desired polynucleotide, which shares sequence identity with at least part of a target gene that is operably linked to a promoter and a terminator. As is well appreciated, the promoter initiates transcription, while the terminator ends transcription at a specific site and subsequently mediates polyadenylation. Such transcript processing is important for stability of the transcript and its transport from the nucleus and into the cytoplasm.
[0005] In this regard, the terminator plays an important role in conventional gene silencing constructs. For instance, WO 99/53050 describes a construct that comprises a promoter, a polynucleotide comprising a first sequence with homology to a target gene and a second sequence that is inverse complementary to the target gene, and a terminator. A terminator of conventional constructs does not necessarily have to be positioned immediately downstream from the desired polynucleotide. For instance, Mette and co-workers described a plasmid that contains a desired polynucleotide that is separated from an operably linked terminator by a hygromycin gene (Mette et al., EMBO J 18: 241-8, 1999; Mette et al., EMBO J 19: 5194-201, 2000).
[0006] Other conventional constructs designed to silence genes contain a polynucleotide in the sense or antisense orientation between promoter and terminator. Such a conventional gene silencing construct typically produces RNA transcripts that are similar in size, determined by the distance from transcription start to termination cleavage site and the poly-adenylated tail.
[0007] The present invention relates to new strategies and constructs for gene silencing that are generally more effective than conventional constructs. Furthermore, the present invention relates to new strategies and constructs for gene silencing using a polynucleotide that is not operably linked to a promoter and a terminator but is instead operably linked to two convergently-oriented promoters.
SUMMARY OF THE INVENTION
[0008] Strategies and constructs of the present invention can be characterized by certain features. A construct of the present invention, for instance, may not comprise a DNA region, such as a terminator, that is involved in 3'-end formation and polyadenylation. Alternatively, the construct may comprise a non-functional terminator that is naturally non-functional or which has been modified or mutated to become non-functional.
[0009] A construct may also be characterized in the arrangement of promoters at either side of a desired polynucleotide. Hence, a construct of the present invention may comprise two or more promoters which flank one or more desired polynucleotides or which flank copies of a desired polynucleotide, such that both strands of the desired polynucleotide are transcribed. That is, one promoter may be oriented to initiate transcription of the 5'-end of a desired polynucleotide, while a second promoter may be operably oriented to initiate transcription from the 3'-end of the same desired polynucleotide. The oppositely-oriented promoters may flank multiple copies of the desired polynucleotide. Hence, the "copy number" may vary so that a construct may comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100, or more than 100 copies, or any integer in-between, of a desired polynucleotide ultimately flanked by promoters that are oriented to induce convergent transcription.
[0010] Alternatively, a first promoter may be operably linked to a first polynucleotide in "cassette A," for instance, and a second promoter may be operably linked to a second polynucleotide, e.g., "cassette B." The polynucleotides of each cassette may or may not comprise the same nucleotide sequence, but may share some percentage of sequence identity with a target nucleic acid of interest. The cassettes may be tandemly arranged, i.e., so that they are adjacent to one another in the construct. Furthermore, cassette B, for instance, may be oriented in the inverse complementary orientation to cassette A. In this arrangement, therefore, transcription from the promoter of cassette B will proceed in the direction toward the promoter of cassette A. Hence, the cassettes are arranged to induce "convergent transcription."
[0011] If neither cassette comprises a terminator sequence, then such a construct, by virtue of the convergent transcription arrangement, may produce RNA transcripts that are of different lengths.
[0012] In this situation, therefore, there may exist subpopulations of partially or fully transcribed RNA transcripts that comprise partial or full-length sequences of the transcribed desired polynucleotide from the respective cassette. Alternatively, in the absence of a functional terminator, the transcription machinery may proceed past the end of a desired polynucleotide to produce a transcript that is longer than the length of the desired polynucleotide.
[0013] In a construct that comprises two copies of a desired polynucleotide, therefore, where one of the polynucleotides may or may not be oriented in the inverse complementary direction to the other, and where the polynucleotides are operably linked to promoters to induce convergent transcription, and there is no functional terminator in the construct, the transcription machinery that initiates from one desired polynucleotide may proceed to transcribe the other copy of the desired polynucleotide and vice versa. The multiple copies of the desired polynucleotide may be oriented in various permutations: in the case where two copies of the desired polynucleotide are present in the construct, the copies may, for example, both be oriented in same direction, in the reverse orientation to each other, or in the inverse complement orientation to each other, for example.
[0014] In an arrangement where one of the desired polynucleotides is oriented in the inverse complementary orientation to the other polynucleotide, an RNA transcript may be produced that comprises not only the "sense" sequence of the first polynucleotide but also the "antisense" sequence from the second polynucleotide. If the first and second polynucleotides comprise the same or substantially the same DNA sequences, then the single RNA transcript may comprise two regions that are complementary to one another and which may, therefore, anneal. Hence, the single RNA transcript that is so transcribed, may form a partial or full hairpin duplex structure.
[0015] On the other hand, if two copies of such a long transcript were produced, one from each promoter, then there will exist two RNA molecules, each of which would share regions of sequence complementarity with the other. Hence, the "sense" region of the first RNA transcript may anneal to the "antisense" region of the second RNA transcript and vice versa. In this arrangement, therefore, another RNA duplex may be formed which will consist of two separate RNA transcripts, as opposed to a hairpin duplex that forms from a single self-complementary RNA transcript.
[0016] Alternatively, two copies of the desired polynucleotide may be oriented in the same direction so that, in the case of transcription read-through, the long RNA transcript that is produced from one promoter may comprise, for instance, the sense sequence of the first copy of the desired polynucleotide and also the sense sequence of the second copy of the desired polynucleotide. The RNA transcript that is produced from the other convergently-oriented promoter, therefore, may comprise the antisense sequence of the second copy of the desired polynucleotide and also the antisense sequence of the first polynucleotide. Accordingly, it is likely that neither RNA transcript would contain regions of exact complementarity and, therefore, neither RNA transcript is likely to fold on itself to produce a hairpin structure. On the other hand the two individual RNA transcripts could hybridize and anneal to one another to form an RNA duplex.
[0017] Hence, in one aspect, the present invention provides a construct that lacks a terminator or lacks a terminator that is preceded by self-splicing ribozyme encoding DNA region, but which comprises a first promoter that is operably linked to a first polynucleotide and a second promoter that is operably linked to second polynucleotide, whereby (1) the first and second polynucleotide share at least some sequence identity with each other, (2) the first promoter is oriented such that the direction of transcription initiated by this promoter proceeds towards the second promoter, and vice versa, and (3) this convergent arrangement produces a range of RNA transcripts that are generally different in length.
[0018] The desired polynucleotides may be perfect or imperfect repeats of one another, or perfect or imperfect inverse complementary repeats of one another. In the case of a construct that comprises a first polynucleotide and a second polynucleotide, the second polynucleotide may be fully or partially identical in nucleotide sequence to the first polynucleotide and oriented in the direct or inverse complementary orientation with respect to the first polynucleotide. Hence, the first and second polynucleotides may be perfect repeats of one another. On the other hand, the second polynucleotide may be an imperfect repeat of the first polynucleotide, that is the second polynucleotide may share sequence identity with the first polynucleotide, but is not fully or partially identical in sequence, i.e., the second polynucleotide is an imperfect repeat. That second polynucleotide also may be oriented as a direct repeat or positioned in the inverse complementary orientation with respect to the first polynucleotide.
[0019] Any of the polynucleotides described herein, such as a desired polynucleotide, or a first or second polynucleotide, for instance, may be identical to at least a part of a target sequence, or may share sequence identity with at least a part of a target sequence. When a desired polynucleotide comprises a sequence that is homologous to a fragment of a target sequence, i.e., it shares sequence identity with "at least a part of" a target sequence, then it may be desirable that the nucleotide sequence of the fragment is specific to the target gene, and/or the partial perfect or imperfect sequence of the target that is present in the desired polynucleotide is of sufficient length to confer target-specificity. Hence the portion of the desired polynucleotide that shares sequence identity with a part of a target sequence may comprise a characteristic domain, binding site, or nucleotide sequence typically conserved by isoforms or homologs of the target sequence. It is possible, therefore, to design a desired polynucleotide that is optimal for targeting a target nucleic acid in a cell.
[0020] In another embodiment, the desired polynucleotide comprises a sequence of preferably between 4 and 5,000 nucleotides, more preferably between 50 and 1,000 nucleotides, and most preferably between 150 and 500 nucleotides that share sequence identity with the DNA or RNA sequence of a target nucleic acid. The desired polynucleotide may share sequence identity with at least 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 300, 400, 500, or more than 500 contiguous nucleotides, or any integer in between, that are 100% identical in sequence with a sequence in a target sequence, or a desired polynucleotide comprises a sequence that shares about 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, 50%, 49%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41%, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31%, 30%, 29%, 8%, 27%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% nucleotide sequence identity with a sequence of the target sequence. In other words the desired polynucleotide may be homologous to or share homology with the full-length sequence of a target sequence or a fragment thereof of a target sequence.
[0021] Hence, the present invention provides an isolated nucleic acid molecule comprising a polynucleotide that shares homology with a target sequence and which, therefore, may hybridize under stringent or moderate hybridization conditions to a portion of a target sequence described herein. By a polynucleotide which hybridizes to a "portion" of a polynucleotide is intended a polynucleotide (either DNA or RNA) hybridizing to at least about 15 nucleotides, and more preferably at least about 20 nucleotides, and still more preferably at least about 30 nucleotides, and even more preferably more than 30 nucleotides of the reference polynucleotide. For the purpose of the invention, two sequences that share homology, i.e., a desired polynucleotide and a target sequence, may hybridize when they form a double-stranded complex in a hybridization solution of 6×SSC, 0.5% SDS, 5×Denhardt's solution and 100 μg of non-specific carrier DNA. See Ausubel et al., section 2.9, supplement 27 (1994). Such sequence may hybridize at "moderate stringency," which is defined as a temperature of 60° C. in a hybridization solution of 6×SSC, 0.5% SDS, 5×Denhardt's solution and 100 μg of non-specific carrier DNA. For "high stringency" hybridization, the temperature is increased to 68° C. Following the moderate stringency hybridization reaction, the nucleotides are washed in a solution of 2×SSC plus 0.05% SDS for five times at room temperature, with subsequent washes with 0.1×SSC plus 0.1% SDS at 60° C. for lh. For high stringency, the wash temperature is increased to typically a temperature that is about 68° C. Hybridized nucleotides may be those that are detected using 1 ng of a radiolabeled probe having a specific radioactivity of 10,000 cpm/ng, where the hybridized nucleotides are clearly visible following exposure to X-ray film at -70° C. for no more than 72 hours.
[0022] In one embodiment, a construct of the present invention may comprise an expression cassette that produces a nucleic acid that reduces the expression level of a target gene that is normally expressed by a cell containing the construct, by 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, 50%, 49%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41%, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31%, 30%, 29%, 28%, 27%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% in comparison to a cell that does not contain the construct.
[0023] Any polynucleotide of the present invention, be it a "desired polynucleotide," a "first" polynucleotide, a "second" polynucleotide may share a certain percentage sequence identity with a target sequence. As explained herein, a target sequence may be, but is not limited to, a sequence, partial or full-length, of a gene, regulatory element, such as a promoter or terminator, exon, intron, an untranslated region, or any sequence upstream or downstream of a target genomic sequence. Accordingly, a polynucleotide of the present invention, may comprise a sequence that is identical over the length of that sequence to such a target sequence. On the other hand, the polynucleotide of the present invention, may comprise a sequence that shares sequence identity to such a target sequence. Hence, a desired polynucleotide of the present invention may share about 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, or 60% nucleotide sequence identity with a sequence of the target sequence.
[0024] In another embodiment, a desired polynucleotide comprises a sequence that is derived from a target promoter. The target promoter may either be naturally present in a cell genome, that is, the target promoter is endogenous to the cell genome, or may be introduced into that genome through transformation. The derived promoter of the polynucleotide may be functionally active and contain a TATA box or TATA box-like sequence but neither the transcription start nor any transcribed sequences beyond the transcription start. Alternatively, the derived promoter of the polynucleotide may be functionally inactive by, for instance, the absence of a TATA box. Such a derived promoter may represent only part of the target promoter.
[0025] In another embodiment, the desired polynucleotide comprises a sequence that is specific to an intron that is endogenous to a cell genome.
[0026] In another embodiment, the desired polynucleotide comprises a sequence that is part of a terminator that is endogenous to a cell genome.
[0027] In another embodiment, the construct comprises two identical promoters that are functionally active in a target tissue. In another embodiment, the construct comprises two different promoters, each of which is functionally active in a target tissue.
[0028] A construct of the present invention may further comprise one or more additional polynucleotides between cassette A and cassette B. For instance, in the 5'- to 3'-orientation, a construct may comprise (i) a first promoter, (ii) a desired polynucleotide, (iii) an additional polynucleotide spacer, e.g., an intron, (iv) the inverse complement copy of the desired polynucleotide, and (v) a second promoter, where the first and second promoters are operably linked to the desired polynucleotide and the complementary copy, respectively, and are oriented to induce convergent transcription.
[0029] The additional spacer polynucleotide may be of any length. That is, the spacer polynucleotide may be an intron that is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 300, 400, 500, or more than 500 nucleotides, or any integer in between in length. If the spacer polynucleotide between two desired polynucleotides is long enough, transcription may never proceed from promoter to the other. That is, for whatever reason, transcription may stop whilst the transcription machinery is located in the spacer that does not contain a functionally active terminator element. Accordingly, the resultant transcript may comprise the full-length sequence of a first desired polynucleotide and a partial sequence of the intron, but no part of the second desired polynucleotide. Thus, it may be possible to design a construct as described herein with a spacer polynucleotide that prevents transcription from proceeding from one desired polynucleotide to the other. In such a situation, and if one of the desired polynucleotides is oriented as the inverse complementary copy of the other, then the prevention of transcription read-through would, therefore, avoid the synthesis of an RNA transcript that is self-complementary.
[0030] Accordingly, depending on any of (i) the convergent arrangement of promoters and desired polynucleotides, (ii) the copy number of the desired polynucleotides, (iii) the absence of a terminator region from the construct, and (iv) the complementarity and length of the resultant transcripts, various populations of RNA molecules may be produced from the present constructs.
[0031] Hence, a single construct of the present invention may produce (i) a single stranded "sense" RNA transcript, (ii) a single-stranded "antisense" RNA transcript, (iii) a hairpin duplex formed by a single-stranded RNA transcript that anneals to itself, or (iv) an RNA duplex formed from two distinct RNA transcripts that anneal to each other. A single construct may be designed to produce only sense or only antisense RNA transcripts from each convergently-arranged promoter.
[0032] The present invention also provides a method of reducing expression of a gene normally capable of being expressed in a plant cell, by stably incorporating any of the constructs described herein into the genome of a cell.
[0033] In this regard, any type of cell from any species may be exposed to or stably- or transiently-transformed with a construct of the present invention. Hence, a bacterial cell, viral cell, fungal cell, algae cell, worm cell, plant cell, insect cell, reptile cell, bird cell, fish cell, or mammalian cell may be transformed with a construct of the present invention. The target sequence, therefore, may be located in the nucleus or a genome of any on of such cell types. The target sequence, therefore, may be located in a gene in the cell genome. Hence, the target sequence may be located in at least one of a regulatory element of the gene, an exon of the gene, an intron of the gene, the 5'-untranslated region of the gene, or the 3'-untranslated region of the gene. In one embodiment, the regulatory element of the gene is at least one of the promoter or an enhancer element of the gene.
[0034] Alternatively, the target sequence may be located in an RNA transcript that is present in one of these cells and which may or may not be normally produced by the cell. That is, the RNA transcript that comprises the target sequence may be produced from a source that is foreign to the host cell. For instance, the RNA transcript that comprises the target sequence may be of viral origin but exists in a plant cell.
[0035] The present invention also contemplates in vitro, ex vivo, ex planta and in vivo exposure and integration of the desired construct into a cell genome or isolated nucleic acid preparations.
[0036] The constructs of the present invention, for example, may be inserted into Agrobacterium-derived transformation plasmids that contain requisite T-DNA border elements for transforming plant cells. Accordingly, a culture of plant cells may be transformed with such a transformation construct and, successfully transformed cells, grown into a desired transgenic plant that expresses the convergently operating promoter/polynucleotide cassettes.
[0037] The promoters may be constitutive or inducible promoters or permutations thereof "Strong" promoters, for instance, can be those isolated from viruses, such as rice tungro bacilliform virus, maize streak virus, cassava vein virus, mirabilis virus, peanut chlorotic streak caulimovirus, figwort mosaic virus and chlorella virus. Other promoters can be cloned from bacterial species such as the promoters of the nopaline synthase and octopine synthase gene. There are various inducible promoters, but typically an inducible promoter can be a temperature-sensitive promoter, a chemically-induced promoter, or a temporal promoter. Specifically, an inducible promoter can be a Ha hsp17.7 G4 promoter, a wheat wcs 120 promoter, a Rab 16A gene promoter, an α-amylase gene promoter, a pin2 gene promoter, or a carboxylase promoter.
[0038] Another aspect of the present invention is a construct, comprising an expression cassette which comprises (i) a first promoter operably linked to a first polynucleotide and (ii) a second promoter operably linked to a second polynucleotide, wherein (a) neither the first nor the second polynucleotide is operably linked to a terminator, (b) at least part of the second polynucleotide is substantially identical in nucleotide sequence to at least part of the sequence of the first polynucleotide but is positioned within the cassette in a different orientation to the first polynucleotide, and (c) the direction of transcription initiated from the first promoter is toward the second promoter and the direction of transcription initiated from the second promoter is toward the first promoter.
[0039] In one embodiment, at least part of the second polynucleotide is oriented as an inverse complement copy of at least part of the first polynucleotide.
[0040] In another embodiment, the sequence that terminates transcription, to which neither polynucleotide is operably linked, is a sequence at the 3'-end of a gene that is involved in 3'-end formation and polyadenylation of the transcript of that gene.
[0041] In a preferred embodiment, the sequence that is involved in 3-end formation and polyadenylation is a terminator.
[0042] In another embodiment, the expression cassette does not comprise (i) a nos gene terminator, (ii) the 3' untranslated sequence of T-DNA gene 7, (iii) the 3' untranslated sequences of the major inclusion body protein gene of cauliflower mosaic virus, (iv) the 3' untranslated sequences of the pea ribulose 1,5-bisphosphate carboxylase small subunit, (v) the 3' untranslated sequences of the potato ubiquitin-3 gene, or (vi) the 3' untranslated sequences of the potato proteinase inhibitor II gene, (vii) the 3' untranslated sequences of opine genes, (viii) the 3' untranslated sequences of endogenous genes.
[0043] In one embodiment, the first polynucleotide comprises a sequence that shares sequence identity with a target gene or at least one of a regulatory element that is associated with the target gene, an exon of the target gene, an intron of the target gene, the 5'-untranslated region of the target gene, or the 3'-untranslated region of the target gene.
[0044] In another embodiment, the first polynucleotide comprises a sequence that shares about 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51%, 50%, 49%, 48%, 47%, 46%, 45%, 44%, 43%, 42%, 41%, 40%, 39%, 38%, 37%, 36%, 35%, 34%, 33%, 32%, 31%, 30%, 29%, 8%, 27%, 26%, 25%, 24%, 23%, 22%, 21%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% nucleotide sequence identity with a sequence of the target sequence.
[0045] In one embodiment, the target gene is a COMT gene involved in lignin biosynthesis, a CCOMT gene involved in lignin biosynthesis, any other gene involved in lignin biosynthesis, an R1 gene involved in starch phosphorylation, a phosphorylase gene involved in starch phosphorylation, a PPO gene involved in oxidation of polyphenols, a polygalacturonase gene involved in pectin degradation, a gene involved in the production of allergens, a gene involved in fatty acid biosynthesis such as FAD2.
[0046] In another embodiment, (a) the regulatory element of the target gene is the promoter or an enhancer element associated with the target gene or (b) the first polynucleotide comprises a sequence that shares sequence identity with an intron of a target gene, wherein the intron comprises the sequence of SEQ ID NO: 44.
[0047] In a particular embodiment, the target gene is located in the genome of a cell. Hence, the cell may be a cell from a bacteria, virus, fungus, yeast, plant, reptile, bird, fish, or mammal.
[0048] In one embodiment, the target sequence is located in a DNA sequence that encodes an RNA transcript.
[0049] In another embodiment, the first and second promoters are functional in a plant.
[0050] In a preferred embodiment, the expression cassette is located between transfer-DNA border sequences of a plasmid that is suitable for bacterium-mediated plant transformation.
[0051] In yet another embodiment, the bacterium is Agrobacterium, Rhizobium, or Phyllobacterium. In one embodiment, the bacterium is Agrobacterium tumefaciens, Rhizobium trifolii, Rhizobium leguminosarum, Phyllobacterium myrsinacearum, SinoRhizobium meliloti, and MesoRhizobium loti.
[0052] In one embodiment, the construct further comprises a spacer polynucleotide positioned between the first and second polynucleotides. The spacer polynucleotide may be 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 300, 400, 500, or more than 500 nucleotides long.
[0053] In another embodiment, the first promoter is a near-constitutive promoter, a tissue-specific promoter, or an inducible promoter and wherein the second promoter is a near-constitutive promoter, a tissue-specific promoter, or an inducible promoter.
[0054] In a particular embodiment, the constitutive strong promoter is selected from the group consisting of a potato ubiquitin-7 promoter, a potato ubiquitin-3 promoter, a tomato ubiquitin promoter, an alfalfa petE promoter, an alfalfa Pal promoter, a canola napin promoter, a maize ubiquitin promoter, a rice ubiquitin promoter, a sugarcane ubiquitin promoter, a rice actin promoter, a rubisco small subunit promoter, and a rubisco activase promoter.
[0055] In one embodiment, the tissue-specific promoter is a granule-bound starch synthase promoter or an ADP glucose pyrophosphorylase gene promoter.
[0056] In one embodiment, the inducible promoter is a temperature-sensitive promoter, a chemically-induced promoter, or a temporal promoter.
[0057] In one embodiment, the inducible promoter is selected from the group consisting of an Ha hsp17.7 G4 promoter, a wheat wcs120 promoter, a Rab 16A gene promoter, an α-amylase gene promoter, a pin2 gene promoter, and a carboxylase promoter.
[0058] Another aspect of the present invention is a transformation plasmid, comprising an expression cassette, which comprises in the 5' to 3' orientation (1) a first promoter that is operably linked to (2) a first desired polynucleotide, which abuts (3) at least one optional spacer polynucleotide, where the 3'-end of one of the spacer polynucleotides abuts a (4) a second desired polynucleotide, which is operably linked to (5) a second promoter, wherein neither desired polynucleotide in the expression cassette is operably linked to any known transcription terminator.
[0059] In one embodiment, at least part of the first desired polynucleotide is in the antisense orientation and wherein at least part of the second desired polynucleotide is oriented as the inverse complement of the first desired polynucleotide.
[0060] In another embodiment, at least part of the first desired polynucleotide is in the sense orientation and wherein at least part of the second desired polynucleotide is oriented as the inverse complement of the first desired polynucleotide.
[0061] In another embodiment, at least part of the first desired polynucleotide is a promoter sequence.
[0062] In a further embodiment, the promoter sequence is from a promoter selected from the group consisting of (1) a starch-associated R1 gene promoter, (2) a polyphenol oxidase gene promoter, (3) a fatty acid desaturase 12 gene promoter, (4) a microsomal omega-6 fatty acid desaturase gene promoter, (5) a cotton stearoyl-acyl-carrier protein delta 9-desaturase gene promoter, (6) an oleoyl-phosphatidylcholine omega 6-desaturase gene promoter, (7) a Medicago truncatula caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene promoter, (8) a Medicago sativa (alfalfa) caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene promoter, (9) a Medicago truncatula caffeoyl CoA 3-O-methyltransferase (CCOMT) gene promoter, (10) a Medicago sativa (alfalfa) caffeoyl CoA 3-O-methyltransferase (CCOMT) gene promoter, (11) a major apple allergen Mal d 1 gene promoter, (12) a major peanut allergen Ara h 2 gene promoter, (13) a major soybean allergen Gly m Bd 30 K gene promoter, and (14) a polygalacturonase gene promoter.
[0063] In one embodiment, (i) at least one of the first and second promoters is a GBSS promoter, and (ii) the first desired polynucleotide is a sequence from a polyphenol oxidase gene.
[0064] In another embodiment, the first and second promoters are GBSS promoters.
[0065] In one embodiment, both the first promoter is a GBSS promoter and the second promoter is an AGP promoter.
[0066] Another aspect of the present invention is a method of reducing expression of a gene normally capable of being expressed in a plant cell, comprising exposing a plant cell to any construct described herein, wherein the construct is maintained in a bacterium strain, wherein the desired polynucleotide comprises a sequence that shares sequence identity to a target sequence in the plant cell genome.
[0067] In one embodiment, the bacterium strain is Agrobacterium tumefaciens, Rhizobium trifolii, Rhizobium leguminosarum, Phyllobacterium myrsinacearum, SinoRhizobium meliloti, and MesoRhizobium loti.
[0068] Another aspect of the present invention is a construct, comprising an expression cassette which comprises in the 5' to 3' orientation (i) a first promoter, (ii) a first polynucleotide that comprises a sequence that shares sequence identity with at least a part of a promoter sequence of a target gene, (iii) a second polynucleotide comprising a sequence that shares sequence identity with the inverse complement of at least part of the promoter of the target gene, and (iv) a second promoter, wherein the first promoter is operably linked to the 5'-end of the first polynucleotide and the second promoter is operably linked to the 3'-end of the second polynucleotide.
[0069] Another aspect of the present invention is a construct, comprising an expression cassette which comprises in the 5' to 3' orientation (i) a first promoter, (ii) a first polynucleotide that comprises a sequence that shares sequence identity with at least a part of a promoter sequence of a target gene, (iii) a second polynucleotide comprising a sequence that shares sequence identity with the inverse complement of at least part of the promoter of the target gene, (iv) a terminator, wherein the first promoter is operably linked to the 5'-end of the first polynucleotide and the second polynucleotide is operably linked to the terminator.
[0070] Another aspect of the present invention is a plant transformation plasmid, comprising the sequence depicted in SEQ ID NO: 40 or 42.
[0071] Another aspect of the present invention is a method for reducing cold-induced sweetening in a tuber, comprising expressing any construct described herein in a cell of a tuber, wherein (a) the first polynucleotide comprises the sequence of part of an R1 gene, (b) the second polynucleotide is the inverse complement of the first polynucleotide compared to the first polynucleotide, (c) one or both of the first and second promoters are GBSS or AGP, and (d) expression of the construct in the cell reduces transcription and/or translation of an R1 gene in the tuber cell genome, thereby reducing cold-induced sweetening in the tuber. In one embodiment, the first polynucleotide comprises the sequence depicted in SEQ ID NO: 23 or 24. In another embodiment, the tuber is a potato. In another embodiment, the first polynucleotide comprises two copies of the sequence of SEQ ID NO: 23 or 24.
[0072] Another aspect of the present invention is a method for enhancing tolerance to black spot bruising in a tuber, comprising expressing any construct described herein in a cell of a tuber, wherein (a) the first polynucleotide comprises the sequence of part of a polyphenol oxidase gene, (b) the second polynucleotide is the inverse complement of the first polynucleotide, (c) one or both of the first and second promoters are GBSS or AGP, and (d) expression of the construct in the cell reduces transcription and/or translation of a polyphenol oxidase gene in the tuber cell genome, thereby enhancing the tolerance of the tuber to black spot bruising. In one embodiment, the first polynucleotide comprises the sequence of SEQ ID NO: 26 or 27. In another embodiment, the tuber is a potato. In another embodiment, the first polynucleotide comprises two copies of the sequence of SEQ ID NO: 26 or 27.
[0073] Another aspect of the present invention is a method for increasing oleic acid levels in an oil-bearing plant, comprising expressing any construct described herein in a cell of a seed of an oil-bearing plant, wherein (a) the first polynucleotide comprises the sequence of part of a Fad2 gene, (b) the second polynucleotide is the inverse complement of the first polynucleotide, (c) one or both of the first and second promoters are napin gene, Fad2 gene, or stearoyl-ACP desaturase gene promoters, and (d) expression of the construct in the cell reduces transcription and/or translation of a Fad2 gene in the cell of the seed of the oil-bearing plant, thereby increasing the oil content of the seed. In one embodiment, the first polynucleotide comprises the sequence depicted in SEQ ID NO: 28. In another embodiment, the sequence of the napin gene promoter comprises the sequence depicted in SEQ ID NO: 30.
[0074] In one embodiment, the sequence of the stearoyl-ACP desaturase gene promoter comprises the sequence depicted in SEQ ID NO: 31.
[0075] In another embodiment, the sequence of the Fad2 gene promoter comprises the sequence depicted in SEQ ID NO: 32.
[0076] In one embodiment, the oil-bearing plant is a Brassica plant, canola plant, soybean plant, cotton plant, or a sunflower plant.
[0077] Another aspect of the present invention is a method for reducing lignin content in a plant, comprising expressing any construct described herein in a cell of the plant, wherein (a) the first polynucleotide comprises the sequence of part of a caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene, (b) the second polynucleotide is the inverse complement of the first polynucleotide, (c) one or both of the first and second promoters are petE or Pal gene promoters, and (d) expression of the construct in the cell reduces transcription and/or translation of a COMT gene in the cell of the plant, thereby reducing lignin content in a plant. In one embodiment, the cell is in the vascular system of the plant. In a preferred embodiment, the plant is an alfalfa plant. In another embodiment, the first polynucleotide comprises the sequence depicted in SEQ ID NO: 33 or 37.
[0078] Another aspect of the present invention is a method for reducing the degradation of pectin in a fruit of a plant, comprising expressing any construct described herein in a fruit cell of the plant, wherein (a) the first polynucleotide comprises the sequence of part of polygalacturonase gene, (b) the second polynucleotide is the inverse complement of the first polynucleotide, (c) both of the first and second promoters are fruit-specific promoters, and (d) expression of the construct in the fruit cell reduces transcription and/or translation of a polygalacturonase gene in the cell of the plant, thereby reducing the degradation of pectin in the fruit. In one embodiment, the first polynucleotide comprises the sequence depicted in SEQ ID NO: 39.
[0079] Another aspect of the present invention is a method for reducing the allergenicity of a food produced by a plant, comprising expressing any construct described herein in a cell of a plant, wherein (a) the first polynucleotide comprises the sequence of part of a gene that encodes an allergen, (b) the second polynucleotide is the inverse complement of the first polynucleotide, and (c) the expression of the construct reduces transcription and/or translation of the allergen, thereby reducing the allergenicity of a food produced by the plant.
[0080] In one embodiment, (a) the plant is an apple plant, (b) the food is an apple, (c) the first polynucleotide comprises a sequence from the Mal d I gene promoter, and (d) expression of the construct in the apple plant reduces transcription and/or translation of Mal d I in the apple.
[0081] In another embodiment, (a) the plant is a peanut plant, (b) the food is a peanut, (c) the first polynucleotide comprises a sequence from the Ara h 2 gene promoter, and (d) expression of the construct in the peanut plant reduces transcription and/or translation of Ara h 2 in the peanut.
[0082] In another embodiment, (a) the plant is a soybean plant, (b) the food is a soybean, (c) the first polynucleotide comprises a sequence from the Gly m Bd gene promoter, and (d) expression of the construct in the soybean plant reduces transcription and/or translation of Gly m Bd in the soybean.
[0083] Another aspect of the present invention is a method for downregulating the expression of multiple genes in a plant, comprising expressing in a cell of a plant a construct comprising the sequence depicted in SEQ ID NO: 40, which downregulates expression of polyphenol oxidase, phosphorylase L gene, and the R1 gene in the plant cell.
[0084] Another aspect of the present invention is a method for downregulating the expression of multiple genes in a plant, comprising expressing in a cell of a plant a construct comprising the sequence depicted in SEQ ID NO: 42, which downregulates expression of polyphenol oxidase, phosphorylase L gene, and the R1 gene in the plant cell.
[0085] Another aspect of the present invention is a construct, comprising a desired promoter that is operably linked to (i) a first promoter at its 5'-end and (ii) a second promoter at its 3'-end, wherein the desired promoter shares sequence identity with a target promoter in a genome of interest.
[0086] Another aspect of the present invention is a construct, comprising a two convergently-oriented copies of a desired promoter that are separated by a polynucleotide, wherein the desired promoter shares sequence identity with a target promoter in a desired genome of interest. In one embodiment, the polynucleotide that separates the convergently-oriented promoters is an intron.
[0087] Another aspect of the present invention is a construct, comprising two desired promoters that are operably linked to a promoter and a terminator, wherein the desired promoters share sequence identity with a target promoter in a genome of interest. In one embodiment, the two desired promoters share, over at least a part of their respective lengths, sequence identity with each other and where one of the desired promoters is oriented as the inverse complement of the other.
[0088] In another aspect is a construct, comprising two desired promoters that are operably linked to a promoter and a terminator, wherein the desired promoters share sequence identity with a target promoter in a genome of interest. In one embodiment, the two desired promoters share, over at least a part of their respective lengths, sequence identity with each other and where one of the desired promoters is oriented as the inverse complement of the other.
[0089] In another aspect a construct is provided that comprises four direct repeats of a polynucleotide of interest, which are preceded by an antisense DNA fragment of the polynucleotide of interest. Such a construct is depicted by pSIM1111.
[0090] The present invention also provides a method for reducing the expression level of an endogenous gene in an alfalfa plant, comprising introducing a cassette into an alfalfa cell, wherein the cassette comprises two alfalfa-specific promoters arranged in a convergent orientation to each other, wherein the activity of the promoters in the cassette reduces the expression level of an endogenous alfalfa gene, which is operably linked in the alfalfa genome to a promoter that has a sequence that shares sequence identity with at least a part of one of the promoters in the cassette. In one embodiment, the sequence of at least one of the promoters is depicted in SEQ ID NO: 54 or SEQ ID NO: 55.
[0091] The present invention also provides a method for reducing the expression of a Comt gene, comprising expressing a Comt gene fragment or Comt promoter fragment in a cell that comprises a Comt gene in its genome.
[0092] The present invention also provides a method for reducing the expression of a Comt gene or Ccomt gene, comprising expressing the construct of any construct described herein in a cell that comprises a Comt gene or a Ccomt gene in its genome, wherein the first polynucleotide comprises a sequence of a Comt gene or Comt gene promoter or a Ccomt gene or Ccomt gene promoter.
BRIEF DESCRIPTION OF THE DRAWINGS
[0093] FIG. 1 depicts schematic diagrams for T-DNAs of binary vectors that (a) represent a negative control (pSIM714), and (b) comprise constructs that represent conventional silencing constructs, pSIM374, pSIM718, and pSIM755. "B" denotes a transfer-DNA border sequence; "T" denotes a terminator sequence; "hptII" is a resistance gene that confers hygromycin resistance to a plant; "P1" denotes a promoter sequence and, in this example, is a promoter that is identical to the promoter driving a functionally active bete-glucuronidase (gus) gene in the transgenic gus plant; "P2" denotes a promoter sequence that is also functionally active but different from P1; "gus-S" denotes a gus gene fragment; "gus-A" denotes an inverse complement of the gus gene fragment; "I" denotes an intron. With respect to gus-S and gus-A, the solid thick arrows signify (part of the) RNA transcripts that share identity with a part of the transcript produced by expressing the gus gene; the dotted thick arrows signify (part of the) RNA transcripts that share identity with a part of the inverse complement of the gus gene transcript; the thin lines signify parts of the transcript with homology or inverse complementarity to another sequence such as the intron of the construct. In this respect, the leftward pointing open arrow (which denotes the "gus-A" element in the cassette) indicates that the gus-A element is oriented in the expression cassette as the inverse complement of the gus-S, the rightward pointing arrow. Hence, P1 and P2 promoters are oriented so that transcription from each proceeds in a convergent manner, i.e., transcription of P1 proceeds toward P2 and vice versa.
[0094] FIG. 2 depicts schematic diagrams for T-DNAs of binary vectors comprising constructs that resemble conventional silencing constructs except that they lack a terminator, pSIM728, pSIM140, and pSIM758. With respect to gus-S and gus-A, the solid thick arrows signify the part of the RNA transcripts that share identity with a part of the transcript produced by expressing the gus gene; the dotted thick arrows signify parts of the RNA transcripts that share identity with a part of the inverse complement of the gus gene transcript; the thin lines signify parts of the transcript with homology or inverse complementarity to another sequence such as the intron of the construct.
[0095] FIG. 3 depicts schematic diagrams for T-DNAs comprising "terminator-free colliding transcription" (TFCT) constructs. Specifically, it illustrates the T-DNAs of pSIM715, pSIM717, pSIM756, and pSIM771. The key to the identified elements and solid and dotted arrows is the same as those explained in the legend of FIG. 1. In pSIM717, read-through of transcription originated from both P1 and P2 over the intron produces transcripts that contain 5'-sequences identical to part of the gus gene transcript and 3'-sequences that are inverse complementary to the gus gene transcript. These transcripts may fold to produce partially double-stranded RNA. Depending on the ability of the P1 transcription complex to proceed unencumbered, an RNA transcript, initiated from the P1 promoter, could conceivably transcribe sequences downstream of the gus-S sequence to which it is operably linked. Accordingly, when reading the "top," i.e., sense strand of pSIM717, in a 5'- to 3'-direction, a transcript from P1 may comprise the sequence of the intervening intron ("I"), as well as the sequence of the inverse complement gus-S element. The "top" strand sequence of the inverse complement gus-S element is the antisense of gus-S.
[0096] FIG. 4 depicts schematic diagrams for T-DNAs comprising "terminator-free colliding transcription" (TFCT) constructs. Specifically, it illustrates the T-DNAs of pSIM754, pSIM773, and pSIM767. The key to the identified elements and solid and dotted arrows is the same as those explained in the legend of FIG. 1. P1n indicates the part of the P1 promoter that is upstream from the TATA box. This sequence is not functional as promoter.
[0097] FIG. 5 depicts schematic diagrams for T-DNAs comprising "terminator-free colliding transcription" (TFCT) constructs. Specifically, it illustrates the T-DNAs of pSIM782. The key to the identified elements and solid and dotted arrows is the same as those explained in the legend of FIG. 1. gusI indicates the intronm of the gus gene.
[0098] FIG. 6 depicts schematic diagrams for T-DNAs comprising "terminator-free colliding transcription" (TFCT) constructs. Specifically, it illustrates the T-DNAs of pSIM765, pSIM922F, pSIM922G, pSIM774, and pSIM775. The key to the identified elements and solid and dotted arrows is the same as those explained in the legend of FIG. 1. PPO indicates a fragment of the tobacco PPO gene.
[0099] FIG. 7 shows ethidium bromide-stained agarose gels containing the products of RT-PCRs. +=positive plasmid control; -=negative control; M=marker; T1=transcript from P1 promoter; T2=transcript from P2 promoter.
[0100] FIG. 8 shows autoradiograms of RNA gel blots. The probe used for hybridization was derived from the gus gene.
[0101] FIG. 9 shows a sequence analysis of the various promoter fragments and identifies a 89-bp sequence that may be methylated during promoter-based silencing.
[0102] FIG. 10 depicts plasmid maps. G: gus gene fragment; H: expression cassette for hptII gene; LB: left border region; RB: right border region; T: terminator; P1: P1 promoter; Pln: non-functional P1 promoter lacking a TATA box; P2: P2 promoter; P3: P3 promoter; GB: GBSS promoter; PP: PPO gene fragment; PT=fragment of tobacco PPO gene. Direction of transcription is indicated with a small black solid arrow.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0103] A construct of the present invention can be used to efficiently reduce or prevent the transcription or translation of a target nucleic acid by triggering convergent transcription of a desired polynucleotide. Hence one goal of the present invention is to provide constructs that produce nucleic acid molecules that prevent or reduce expression of a gene or of a gene product, such as an RNA transcript or protein.
[0104] One particular characteristic of such a construct is that, in contrast to conventional silencing constructs, no functional terminator is inserted and operably linked to the 3'-end of a desired polynucleotide. It is well established that a terminator is a nucleotide sequence, typically located at the 3'-end of a gene, that is involved in cleavage of the RNA transcript that is transcribed from the gene and in polyadenylation of that transcript. Typically, a terminator is located downstream of the gene's stop codon.
[0105] Terminators that were used for the construction of conventional silencing cassettes, and which are excluded from constructs of the present invention, were derived from such 3'-regions of certain genes and often also included even more downstream non-transcribed DNA sequences. The choice of which terminator to use has more often than not, simply been a matter of convenience. Hence, opine terminators or termination regions from endogenous and previously characterized genes have been used in conventional silencing constructs. One of the more frequently used terminators, for instance, is the Agrobacterium nopaline synthase (nos) gene terminator, which comprises both 3' untranslated sequences and some additional downstream DNA. Other terminators include:
[0106] The 3' untranslated sequences of T-DNA gene 7 (Genbank accession V00090).
[0107] The 3' untranslated sequences of the major inclusion body protein gene of cauliflower mosaic virus.
[0108] The 3' untranslated sequences of the pea ribulose 1,5-bisphosphate carboxylase small subunit (Genbank accession M21375).
[0109] The 3' untranslated sequences of the potato ubiquitin-3 gene (Genbank accession Z11669).
[0110] The 3' untranslated sequences of the potato proteinase inhibitor II gene (Genbank accession CQ889094).
[0111] The 3' untranslated sequences of opine genes.
[0112] The 3' untranslated sequences of endogenous genes; that is genes that are normally expressed by the genome of an organism.
[0113] With respect to the present invention, however, none of such terminators, indeed, no functional terminator, is directly operably linked to a desired polynucleotide of the present construct. Nor is a desired polynucleotide directly operably linked to a terminator that is preceded by a self-splicing ribozyme-encoding sequence.
[0114] Another characteristic of the construct of the present invention is that it promotes convergent transcription of one or more copies of polynucleotide that is or are not directly operably linked to a terminator, via two opposing promoters. Due to the absence of a termination signal, the length of the pool of RNA molecules that is transcribed from the first and second promoters may be of various lengths.
[0115] Occasionally, for instance, the transcriptional machinery may continue to transcribe past the last nucleotide that signifies the "end" of the desired polynucleotide sequence. Accordingly, in this particular arrangement, transcription termination may occur either through the weak and unintended action of downstream sequences that, for instance, promote hairpin formation or through the action of unintended transcriptional terminators located in plant DNA flanking the transfer DNA integration site.
[0116] A terminator-free colliding transcription (TFCT) construct of the present invention, therefore, may comprise a first promoter operably linked to a first polynucleotide and a second promoter operably linked to a second polynucleotide, whereby (1) the first and second polynucleotides share at least some sequence identity with each other and a target sequence, and (2) the first promoter is oriented such that the direction of transcription initiated by this promoter proceeds towards the second promoter, and vice versa, (3) the construct produces RNA molecules that are generally different in size, some transcripts representing the RNA counterparts of at least part of the polynucleotide and others comprising the counterparts of at least some of both the polynucleotide and its inverse complement.
[0117] The desired polynucleotide may be linked in two different orientations to the promoter. In one orientation, e.g., "sense", at least the 5'-part of the resultant RNA transcript will share sequence identity with at least part of at least one target transcript. An example of this arrangement is shown in FIG. 3 as pSIM717. In the other orientation designated as "antisense", at least the 5'-part of the predicted transcript will be identical or homologous to at least part of the inverse complement of at least one target transcript. An example of the latter arrangement is shown in FIG. 3 as pSIM756.
[0118] As used herein, "sequence identity" or "identity" in the context of two nucleic acid or polypeptide sequences includes reference to the residues in the two sequences which are the same when aligned for maximum correspondence over a specified region. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g. charge or hydrophobicity) and therefore do not change the functional properties of the molecule. Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences which differ by such conservative substitutions are said to have "sequence similarity" or "similarity". Means for making this adjustment are well-known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., according to the algorithm of Meyers and Miller, Computer Applic. Biol. Sci., 4: 11-17 (1988) e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, Calif., USA).
[0119] As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity.
[0120] Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman, Adv. Appl. Math. 2: 482 (1981); by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970); by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. 85: 2444 (1988); by computerized implementations of these algorithms, including, but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain View, Calif.; GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, Wis., USA; the CLUSTAL program is well described by Higgins and Sharp, Gene 73: 237-244 (1988); Higgins and Sharp, CABIOS 5: 151-153 (1989); Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al., Computer Applications in the Biosciences 8: 155-65 (1992), and Pearson, et al., Methods in Molecular Biology 24: 307-331 (1994).
[0121] The BLAST family of programs which can be used for database similarity searches includes: BLASTN for nucleotide query sequences against nucleotide database sequences; BLASTX for nucleotide query sequences against protein database sequences; BLASTP for protein query sequences against protein database sequences; TBLASTN for protein query sequences against nucleotide database sequences; and TBLASTX for nucleotide query sequences against nucleotide database sequences. See, Current Protocols in Molecular Biology, Chapter 19, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (1995); Altschul et al., J. Mol. Biol., 215:403-410 (1990); and, Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997).
[0122] Software for performing BLAST analyses is publicly available, e.g., through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold. These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, a cutoff of 100, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915).
[0123] In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90:5873-5877 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance.
[0124] BLAST searches assume that proteins can be modeled as random sequences. However, many real proteins comprise regions of nonrandom sequences which may be homopolymeric tracts, short-period repeats, or regions enriched in one or more amino acids. Such low-complexity regions may be aligned between unrelated proteins even though other regions of the protein are entirely dissimilar. A number of low-complexity filter programs can be employed to reduce such low-complexity alignments. For example, the SEG (Wooten and Federhen, Comput. Chem., 17:149-163 (1993)) and XNU (Clayerie and States, Comput. Chem., 17:191-201 (1993)) low-complexity filters can be employed alone or in combination.
[0125] Multiple alignment of the sequences can be performed using the CLUSTAL method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwise alignments using the CLUSTAL method are KTUPLE 1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5.
[0126] Any or all of the elements and DNA sequences that are described herein may be endogenous to one or more plant genomes. Accordingly, in one particular embodiment of the present invention, all of the elements and DNA sequences, which are selected for the ultimate transfer cassette are endogenous to, or native to, the genome of the plant that is to be transformed. For instance, all of the sequences may come from a potato genome. Alternatively, one or more of the elements or DNA sequences may be endogenous to a plant genome that is not the same as the species of the plant to be transformed, but which function in any event in the host plant cell. Such plants include potato, tomato, and alfalfa plants. The present invention also encompasses use of one or more genetic elements from a plant that is interfertile with the plant that is to be transformed.
[0127] Public concerns were addressed through development of an all-native approach to making genetically engineered plants, as disclosed by Rommens et al. in WO2003/069980, US-2003-0221213, US-2004-0107455, and WO2005/004585, which are all incorporated herein by reference. Rommens et al. teach the identification and isolation of genetic elements from plants that can be used for bacterium-mediated plant transformation. Thus, Rommens teaches that a plant-derived transfer-DNA ("P-DNA"), for instance, can be isolated from a plant genome and used in place of an Agrobacterium T-DNA to genetically engineer plants.
[0128] In this regard, a "plant" of the present invention includes, but is not limited to angiosperms and gymnosperms such as potato, tomato, tobacco, avocado, alfalfa, lettuce, carrot, strawberry, sugarbeet, cassava, sweet potato, soybean, pea, bean, cucumber, grape, brassica, maize, turf grass, wheat, rice, barley, sorghum, oat, oak, eucalyptus, walnut, and palm. Thus, a plant may be a monocot or a dicot. "Plant" and "plant material," also encompasses plant cells, seed, plant progeny, propagule whether generated sexually or asexually, and descendents of any of these, such as cuttings or seed. "Plant material" may refer to plant cells, cell suspension cultures, callus, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, seeds, germinating seedlings, and microspores. Plants may be at various stages of maturity and may be grown in liquid or solid culture, or in soil or suitable media in pots, greenhouses or fields. Expression of an introduced leader, trailer or gene sequences in plants may be transient or permanent.
[0129] Thus, any one of such plants and plant materials may be transformed according to the present invention. In this regard, transformation of a plant is a process by which DNA is stably integrated into the genome of a plant cell. "Stably" refers to the permanent, or non-transient retention and/or expression of a polynucleotide in and by a cell genome. Thus, a stably integrated polynucleotide is one that is a fixture within a transformed cell genome and can be replicated and propagated through successive progeny of the cell or resultant transformed plant. Transformation may occur under natural or artificial conditions using various methods well known in the art. See, for instance, METHODS IN PLANT MOLECULAR BIOLOGY AND BIOTECHNOLOGY, Bernard R. Glick and John E. Thompson (eds), CRC Press, Inc., London (1993); Chilton, Scientific American, 248)(6), pp. 36-45, 1983; Bevan, Nucl. Acids. Res., 12, pp. 8711-8721, 1984; and Van Montague et al., Proc R Soc Lond B Biol Sci., 210(1180), pp. 351-65, 1980. Plants also may be transformed using "Refined Transformation" and "Precise Breeding" techniques. See, for instance, Rommens et al. in WO2003/069980, US-2003-0221213, US-2004-0107455, WO2005/004585, US-2004-0003434, US-2005-0034188, WO2005/002994, and WO2003/079765, which are all incorporated herein by reference.
[0130] One or more traits of a tuber-bearing plant of the present invention may be modified using the transformation sequences and elements described herein. A "tuber" is a thickened, usually underground, food-storing organ that lacks both a basal plate and tunic-like covering, which corms and bulbs have. Roots and shoots grow from growth buds, called "eyes," on the surface of the tuber. Some tubers, such as caladiums, diminish in size as the plants grow, and form new tubers at the eyes. Others, such as tuberous begonias, increase in size as they store nutrients during the growing season and develop new growth buds at the same time. Tubers may be shriveled and hard or slightly fleshy. They may be round, flat, odd-shaped, or rough. Examples of tubers include, but are not limited to ahipa, apio, arracacha, arrowhead, arrowroot, baddo, bitter casava, Brazilian arrowroot, cassava, Chinese artichoke, Chinese water chestnut, coco, cocoyam, dasheen, eddo, elephant's ear, girasole, goo, Japanese artichoke, Japanese potato, Jerusalem artichoke, jicama, lilly root, ling gaw, mandioca, manioc, Mexican potato, Mexican yam bean, old cocoyam, potato, saa got, sato-imo, seegoo, sunchoke, sunroot, sweet casava, sweet potatoes, tanier, tannia, tannier, tapioca root, topinambour, water lily root, yam bean, yam, and yautia. Examples of potatoes include, but are not limited to Russet Potatoes, Round White Potatoes, Long White Potatoes, Round Red Potatoes, Yellow Flesh Potatoes, and Blue and Purple Potatoes.
[0131] Tubers may be classified as "microtubers," "minitubers," "near-mature" tubers, and "mature" tubers. Microtubers are tubers that are grown on tissue culture medium and are small in size. By "small" is meant about 0.1 cm-1 cm. A "minituber" is a tuber that is larger than a microtuber and is grown in soil. A "near-mature" tuber is derived from a plant that starts to senesce, and is about 9 weeks old if grown in a greenhouse. A "mature" tuber is one that is derived from a plant that has undergone senescence. A mature tuber is, for example, a tuber that is about 12 or more weeks old.
[0132] In this respect, a plant-derived transfer-DNA ("P-DNA") border sequence of the present invention is not identical in nucleotide sequence to any known bacterium-derived T-DNA border sequence, but it functions for essentially the same purpose. That is, the P-DNA can be used to transfer and integrate one polynucleotide into another. A P-DNA can be inserted into a tumor-inducing plasmid, such as a Ti-plasmid from Agrobacterium in place of a conventional T-DNA, and maintained in a bacterium strain, just like conventional transformation plasmids. The P-DNA can be manipulated so as to contain a desired polynucleotide, which is destined for integration into a plant genome via bacteria-mediated plant transformation. See Rommens et al. in WO2003/069980, US-2003-0221213, US-2004-0107455, and WO2005/004585, which are all incorporated herein by reference.
[0133] Thus, a P-DNA border sequence is different by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more nucleotides from a known T-DNA border sequence from an Agrobacterium species, such as Agrobacterium tumefaciens or Agrobacterium rhizogenes.
[0134] A P-DNA border sequence is not greater than 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60%, 59%, 58%, 57%, 56%, 55%, 54%, 53%, 52%, 51% or 50% similar in nucleotide sequence to an Agrobacterium T-DNA border sequence.
[0135] Methods were developed to identify and isolate transfer DNAs from plants, particularly potato and wheat, and made use of the border motif consensus described in US-2004-0107455, which is incorporated herein by reference.
[0136] In this respect, a plant-derived DNA of the present invention, such as any of the sequences, cleavage sites, regions, or elements disclosed herein is functional if it promotes the transfer and integration of a polynucleotide to which it is linked into another nucleic acid molecule, such as into a plant chromosome, at a transformation frequency of about 99%, about 98%, about 97%, about 96%, about 95%, about 94%, about 93%, about 92%, about 91%, about 90%, about 89%, about 88%, about 87%, about 86%, about 85%, about 84%, about 83%, about 82%, about 81%, about 80%, about 79%, about 78%, about 77%, about 76%, about 75%, about 74%, about 73%, about 72%, about 71%, about 70%, about 69%, about 68%, about 67%, about 66%, about 65%, about 64%, about 63%, about 62%, about 61%, about 60%, about 59%, about 58%, about 57%, about 56%, about 55%, about 54%, about 53%, about 52%, about 51%, about 50%, about 49%, about 48%, about 47%, about 46%, about 45%, about 44%, about 43%, about 42%, about 41%, about 40%, about 39%, about 38%, about 37%, about 36%, about 35%, about 34%, about 33%, about 32%, about 31%, about 30%, about 29%, about 28%, about 27%, about 26%, about 25%, about 24%, about 23%, about 22%, about 21%, about 20%, about 15%, or about 5% or at least about 1%.
[0137] Any of such transformation-related sequences and elements can be modified or mutated to change transformation efficiency. Other polynucleotide sequences may be added to a transformation sequence of the present invention. For instance, it may be modified to possess 5'- and 3'-multiple cloning sites, or additional restriction sites. The sequence of a cleavage site as disclosed herein, for example, may be modified to increase the likelihood that backbone DNA from the accompanying vector is not integrated into a plant genome.
[0138] Any desired polynucleotide may be inserted between any cleavage or border sequences described herein. For example, a desired polynucleotide may be a wild-type or modified gene that is native to a plant species, or it may be a gene from a non-plant genome. For instance, when transforming a potato plant, an expression cassette can be made that comprises a potato-specific promoter that is operably linked to a desired potato gene or fragment thereof and a potato-specific terminator. The expression cassette may contain additional potato genetic elements such as a signal peptide sequence fused in frame to the 5'-end of the gene, and a potato transcriptional enhancer. The present invention is not limited to such an arrangement and a transformation cassette may be constructed such that the desired polynucleotide, while operably linked to a promoter, is not operably linked to a terminator sequence.
[0139] In addition to plant-derived elements, such elements can also be identified in, for instance, fungi and mammals. See, for instance, SEQ ID NOs: 173-182. Several of these species have already been shown to be accessible to Agrobacterium-mediated transformation. See Kunik et al., Proc Natl Acad Sci USA 98: 1871-1876, 2001, and Casas-Flores et al., Methods Mol Biol 267: 315-325, 2004, which are incorporated herein by reference.
[0140] When a transformation-related sequence or element, such as those described herein, are identified and isolated from a plant, and if that sequence or element is subsequently used to transform a plant of the same species, that sequence or element can be described as "native" to the plant genome.
[0141] Thus, a "native" genetic element refers to a nucleic acid that naturally exists in, originates from, or belongs to the genome of a plant that is to be transformed. In the same vein, the term "endogenous" also can be used to identify a particular nucleic acid, e.g., DNA or RNA, or a protein as "native" to a plant. Endogenous means an element that originates within the organism. Thus, any nucleic acid, gene, polynucleotide, DNA, RNA, mRNA, or cDNA molecule that is isolated either from the genome of a plant or plant species that is to be transformed or is isolated from a plant or species that is sexually compatible or interfertile with the plant species that is to be transformed, is "native" to, i.e., indigenous to, the plant species. In other words, a native genetic element represents all genetic material that is accessible to plant breeders for the improvement of plants through classical plant breeding. Any variants of a native nucleic acid also are considered "native" in accordance with the present invention. In this respect, a "native" nucleic acid may also be isolated from a plant or sexually compatible species thereof and modified or mutated so that the resultant variant is greater than or equal to 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, or 60% similar in nucleotide sequence to the unmodified, native nucleic acid isolated from a plant. A native nucleic acid variant may also be less than about 60%, less than about 55%, or less than about 50% similar in nucleotide sequence.
[0142] A "native" nucleic acid isolated from a plant may also encode a variant of the naturally occurring protein product transcribed and translated from that nucleic acid. Thus, a native nucleic acid may encode a protein that is greater than or equal to 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, 75%, 74%, 73%, 72%, 71%, 70%, 69%, 68%, 67%, 66%, 65%, 64%, 63%, 62%, 61%, 60% similar in amino acid sequence to the unmodified, native protein expressed in the plant from which the nucleic acid was isolated.
[0143] In a terminator-free construct that so comprises two copies of the desired polynucleotide, one desired polynucleotide may be oriented so that its sequence is the inverse complement of the other. The schematic diagram of pSIM717 in FIG. 3 illustrates such an arrangement. That is, the "top," "upper," or "sense" strand of the construct would comprise, in the 5'- to 3'-direction, (1) a target gene fragment, and (2) the inverse complement of a target gene fragment. In this arrangement, a second promoter that is operably linked to that inverse complement of the desired polynucleotide will likely produce an RNA transcript that is at least partially identical in sequence to the transcript produced from the other desired polynucleotide.
[0144] The desired polynucleotide and its inverse complement may be separated by a spacer DNA sequence, such as an intron, that is of any length. It may be desirable, for instance, to reduce the chance of transcribing the inverse complement copy of the desired polynucleotide from the opposing promoter by inserting a long intron or other DNA sequence between the 3'-terminus of the desired polynucleotide and the 5'-terminus of its inverse complement. For example, in the case of pSIM717 (FIG. 3) the size of the intron ("I") may be lengthened so that the transcriptional complex of P1 is unlikely to reach the sequence of the inverse complement of gus-S before becoming interrupted or dislodged. Accordingly, there may be about 50, 100, 250, 500, 2000 or more than 2000 nucleotides positioned between the sense and antisense copies of the desired polynucleotide.
[0145] A desired polynucleotide of the present invention, e.g., a "first" or "second" polynucleotide as described herein may share sequence identity with all or at least part of a sequence of a structural gene or regulatory element. For instance, a first polynucleotide may share sequence identity with a coding or non-coding sequence of a target gene or with a portion of a promoter of the target gene. In one embodiment, the polynucleotide in question shares about 100%, 99%, about 98%, about 97%, about 96%, about 95%, about 94%, about 93%, about 92%, about 91%, about 90%, about 89%, about 88%, about 87%, about 86%, about 85%, about 84%, about 83%, about 82%, about 81%, about 80%, about 79%, about 78%, about 77%, about 76%, about 75%, about 74%, about 73%, about 72%, about 71%, about 70%, about 69%, about 68%, about 67%, about 66%, about 65%, about 64%, about 63%, about 62%, about 61%, about 60%, about 59%, about 58%, about 57%, about 56%, about 55%, about 54%, about 53%, about 52%, about 51%, about 50%, about 49%, about 48%, about 47%, about 46%, about 45%, about 44%, about 43%, about 42%, about 41%, about 40%, about 39%, about 38%, about 37%, about 36%, about 35%, about 34%, about 33%, about 32%, about 31%, about 30%, about 29%, about 28%, about 27%, about 26%, about 25%, about 24%, about 23%, about 22%, about 21%, about 20%, about 15%, or about 5% or at least about 1% sequence identity with a target gene or target regulatory element, such as a target promoter.
[0146] For ease, the term "desired polynucleotide" as used herein is not limiting but includes other terms used herein such as "first polynucleotide" and "second polynucleotide" or any polynucleotide that is used in a construct of the present invention to reduce expression of a target gene or sequence. Hence a "desired polynucleotide" may be a first or second polynucleotide or both.
[0147] In a simpler form, a construct of the present invention does not contain two copies of the polynucleotide but only one copy. Accordingly, the polynucleotide is operably linked to promoters at both its 5'- and 3' termini. In this particular arrangement, RNA transcripts will be produced that comprise sequences from each strand of the DNA duplex. An example of this arrangement is shown in FIG. 3 as pSIM772.
[0148] A terminator-free cassette may exist as an extrachromosomal DNA molecule in a cell or it may be integrated by any one of a variety of mechanisms into the nucleus, chromosome, or other endogenous nucleic acid of the cell. If the terminator-free cassette is stably integrated into the genome of the cell, then it may be possible to produce a cell line, cell culture, biological tissue, plant, or organism that comprises the cassette in subsequent cell or organism generations.
[0149] Expression of such a construct in a plant will reduce or prevent expression of gene(s) that display either shares sequence identity or inverse complementarity with at least part of a desired polynucleotide. The invention is not bound by any particular theory or mechanism, but the transcripts may, directly or indirectly, affect the activity of a regulatory sequence, such as a promoter, that is normally associated with the expression of a target gene in a cell; or the transcript may negatively affect the accumulation of a transcript that is endogenously produced in the target cell. Accordingly, either or both of transcript accumulation and transcript translation may be altered by the activity of the transcript produced by the expression cassette of the present invention.
[0150] A plant of the present invention may be a monocotyledonous plant, for instance, alfalfa, canola, wheat, turf grass, maize, rice, oat, barley, sorghum, orchid, iris, lily, onion, banana, sugarcane, and palm. Alternatively, the plant may be a dicotyledonous plant, for instance, potato, tobacco, tomato, avocado, pepper, sugarbeet, broccoli, cassava, sweet potato, cotton, poinsettia, legumes, alfalfa, soybean, pea, bean, cucumber, grape, brassica, carrot, strawberry, lettuce, oak, maple, walnut, rose, mint, squash, daisy, and cactus.
[0151] The effect of the RNA molecule, which is produced by a terminator-free expression cassette of the present invention, may be assessed by measuring, directly or indirectly, the target nucleic acid or protein level in the cell or environment in which the expression cassette is present. Thus, the effect of an expression cassette of the present invention in downregulating, suppressing, reducing, or preventing or eliminating target gene expression may be identified by a reduction in the amount of RNA transcript that is produced by the target gene, or a reduction in the amount of target gene protein product, or both.
[0152] A desired polynucleotide of a terminator-free construct described herein may be identical to, or share sequence identity with different kinds of DNA regions, such as (1) at least part of the sequence that encodes a target transcript, (2) at least part of the intron of a gene that encodes a target transcript, (3) at least part of the promoter of a gene that encodes a target transcript, (4) part of the terminator of a gene that encodes a target transcript, whereby the polynucleotide is not a terminator, (5) the 3'-untranslated region of a gene, and (6) the 5'-untranslated region of a gene. One or more nucleotides of any one of these regions may be mutated, altered, or substituted to increase sequence identity with a target sequence or to otherwise increase or enhance silencing of the target sequence.
[0153] The location of the target sequence, therefore, may be in, but is not limited to, (i) the genome of a cell; (ii) at least one RNA transcript normally produced in a cell; or (iii) in a plasmid, construct, vector, or other DNA or RNA vehicle. The cell that contains the genome or which produces the RNA transcript may be the cell of a bacteria, virus, fungus, yeast, fly, worm, plant, reptile, bird, fish, or mammal.
[0154] Hence, the target nucleic acid may be one that is normally transcribed into RNA from a cell nucleus, which is then in turn translated into an encoding polypeptide. Alternatively, the target nucleic acid may not actually be expressed in a particular cell or cell type. For instance, a target nucleic acid may be a genomic DNA sequence residing in a nucleus, chromosome, or other genetic material, such as a DNA sequence of mitochondrial DNA. Such a target nucleic acid may be of, but not limited to, a regulatory region, an untranslated region of a gene, or a non-coding sequence.
[0155] Alternatively, the target nucleic acid may be foreign to a host cell but is present or expressed by a non-host organism. For instance, a target nucleic acid may be the DNA or RNA molecule endogenous to, or expressed by, an invading parasite, virus, or bacteria.
[0156] Furthermore, the target nucleic acid may be a DNA or RNA molecule present or expressed by a disease cell. For instance, the disease cell may be a cancerous cell that expresses an RNA molecule that is not normally expressed in the non-cancerous cell type.
[0157] In plants, the desired polynucleotide may share sequence identity with a target nucleic acid that is responsible for a particular trait of a plant. For instance, a desired polynucleotide may produce a transcript that targets and reduces the expression of a polyphenol oxidase gene target in a plant and, thereby, modifies one or more traits or phenotypes associated with black spot bruising. Similarly, a desired polynucleotide may produce a transcript that targets and reduces the expression of a starch-associated R1 target nucleic acid or phosphorylase target nucleic acid in a plant, thereby modifying one or more traits or phenotypes associated with cold-induced sweetening.
[0158] An expression cassette in a construct of the present invention may be flanked by one or more transfer-DNA ("T-DNA") border sequences. Any of the expression cassettes described herein, for instance, may be inserted into the T-DNA of an Agrobacterium-derived plasmid, such as a Ti plasmid from A. tumefaciens.
[0159] A border sequence may comprise a sequence that is similar to a traditional Agrobacterium T-DNA border sequence, but actually is a sequence that is native to a plant, but which can facilitate transfer and integration of one nucleic acid into another. For instance, such plant-derived transfer-DNA ("P-DNA") border sequences can be isolated from potato (SEQ ID NO: 44), tomato (SEQ ID NOs: 45-46), pepper (SEQ ID NO: 47), alfalfa (SEQ ID NO: 48), barley (SEQ ID NO: 49), and rice (SEQ ID NO: 50) shown in the sequence table elsewhere in this application.
[0160] Accordingly, any one of the expression cassettes described herein may be inserted into a transfer-DNA that is delimited by such P-DNA border sequences, which are capable of integrating the cassette into another nucleic acid, such as a plant genome or plant chromosome.
[0161] Accordingly, an Agrobacterium plasmid, which contains an expression cassette described herein that does not comprise a DNA region that is involved in 3-end formation and polyadenylation of an RNA transcript, may be stably integrated into the genome of a plant via Agrobacterium-mediated transformation. The progeny of that transformed plant, therefore, will continue to express the transcripts associated with the expression cassette.
[0162] The promoters that are used to initiate transcription of the desired polynucleotide may be constitutive, tissue-preferred, or inducible promoters or permutations thereof. "Strong" promoters, for instance, include the potato ubiquitin-7 and ubiquitin-3 promoters, and ubiquitin promoters from maize, rice, and sugarcane. They also include the rice actin promoter, various rubisco small subunit promoters, rubisco activase promoters, and rice actin promoters. Good tissue-preferred promoters that are mainly expressed in potato tubers include the promoters of the granule-bound starch synthase and ADP glucose pyrophosphorylase genes. There are various inducible promoters, but typically an inducible promoter can be a temperature-sensitive promoter, a chemically-induced promoter, or a temporal promoter. Specifically, an inducible promoter can be a Ha hsp17.7 G4 promoter, a wheat wcs120 promoter, a Rab 16A gene promoter, an α-amylase gene promoter, a pin2 gene promoter, or a carboxylase promoter.
[0163] Accordingly, to facilitate identification of a plant that has been successfully transformed with a terminator-free expression cassette, it may be desirable to include within the region delineated by the transfer-DNA border sequences a selectable or screenable marker. Inclusion of a marker is a standard procedure in Agrobacterium-mediated transformation and is employed to make it possible to readily identify successfully-transformed plant material. In the expression cassettes depicted in FIGS. 1-4, for instance, the marker is hygromycin phosphtransferase ("hptII"), which confers hygromycin resistance to a plant that expresses that marker. In such cassettes, therefore, a terminator or DNA region that is involved in 3-end formation and polyadenylation of an RNA transcript is operably linked to the hptII gene sequence. Other selectable and screenable markers may be used instead of hptII.
EXAMPLES
Example 1
Conventional Silencing Constructs
[0164] The efficacy of various silencing constructs was tested by targeting the beta glucuronidase (gus) reporter gene operationally linked to the strong constitutive promoter of figwort mosaic virus, designated here as "P1" (SEQ ID NO: 1). This test system is stringent because the gus protein is highly stable. Thus, only relatively large reductions in gus transcripts result in phenotypically detectable reductions of gus protein levels. Most silencing constructs contain at least one copy of the same 304-bp gus gene fragment (SEQ ID NO: 2), operably linked in either the sense or antisense orientation to a strong constitutive promoter and in some cases followed by the terminator of the Agrobacterium nopaline synthase gene. The silencing constructs were inserted next to an expression cassette for the hygromycin phosphotransferase (hptII) selectable marker gene between the T-DNA borders of transformation vectors. Resulting vectors were used to retransform a tobacco plant that had been transformed before with a T-DNA containing an expression cassette for the gus gene (see also FIGS. 1 and 2).
[0165] The following transformation vectors were produced to study the role of a terminator element in conventional silencing constructs:
[0166] pSIM714: The negative control vector pSIM714, which does not contain a silencing construct.
[0167] pSIM718: Vector pSIM718, which contains a `sense` gus gene fragment operably linked to the terminator of the nopaline synthase gene (SEQ ID NO: 3) that represents strategies described in, e.g., U.S. Pat. Nos. 5,283,184 and 5,231,020. This vector contains the gus gene fragment operably linked in the sense orientation to the promoter and followed by the terminator.
[0168] pSIM140: Vector pSIM140, which is identical to pSIM718 except that the silencing construct does not contain a terminator.
[0169] pSIM755: Vector pSIM755, which contains a terminator-containing `antisense` construct that represents strategies described in, e.g., U.S. Pat. Nos. 5,107,065 and 5,759,829. This vector contains the gus gene fragment operably linked in the sense orientation to the promoter and followed by the terminator.
[0170] pSIM758: Vector pSIM758, which is identical to pSIM755 except that the silencing construct does not contain a terminator.
[0171] pSIM374: Vector pSIM374, which contains a terminator-containing construct that comprises both a sense and antisense gus gene fragment and represents strategies described in, e.g., WO 99/53050A1. This vector contains two copies of the gus gene fragment, one in the sense orientation and the other one in the antisense orientation and separated from each other by an intron, depicted in SEQ ID NO: 4, and inserted between promoter and terminator.
[0172] pSIM728 and 777: Vector pSIM728, which is identical to pSIM374 except that the silencing construct does not contain a terminator. Vector pSIM777 is identical to pSIM728 except that the P2 promoter is at the other side of the expression cassette.
[0173] Binary vectors containing the various constructs were introduced into Agrobacterium. Ten-fold dilutions of overnight-grown cultures of the resulting strains were grown for five to six hours, precipitated for 15 minutes at 2,800 RPM, washed with MS liquid medium (PhytoTechnology, KS) supplemented with sucrose (3%, pH 5.7) and resuspended in the same medium to 0.2 OD/600 nm. The suspension was then used to infect leaf explants of the transgenic in vitro grown Nicotiana tabacum (tobacco) plant expressing the gus gene. Infected explants were incubated for two days on co-culture medium (1/10 MS salts, 3% sucrose, pH 5.7) containing 6 g/L agar at 25° C. in a Percival growth chamber (16/8 hr photoperiod) and subsequently transferred to M401/agar (PhytoTechnology) medium containing timentin (150 mg/L) and hygromycin (20 mg/L). Resulting shoots were transferred to hormone-free rooting medium, and three leaves of each resulting plant were stained for gus expression.
[0174] Table 1 shows that all plants retransformed with pSIM714 displayed the same levels of gus expression as the original gus plant, confirming that retransformation, proliferation of single cells, and regeneration does not negatively affect expression of the reporter gene.
[0175] Table 1 also shows that the constructs representing the three different conventional silencing methods trigger gus gene silencing with varying efficiencies. In agreement with what has been reported in the literature, pSIM374 is most effective. About half of plants that were retransformed with this constructs display at least some reduced level of gus activity. The two other constructs support a reduction in gus activity in only about 6% of retransformed plants.
[0176] Importantly, Table 1 also demonstrates that removal of the terminator dramatically lowers the efficacy of the silencing constructs. For instance, pSIM374 is more than six-fold more efficacious than its terminator-free derivative, pSIM728. Hardly any activity is observed with the terminator-free pSIM758.
[0177] It can be concluded that the terminator plays an essential role in optimizing the activity of conventional silencing constructs.
Example 2
Effective Gene Silencing with Terminator-Free Constructs Comprising at Least Two Copies of a Target Gene Fragment that Trigger Convergent Transcription
[0178] The following transformation vectors were produced to study the effect of convergent transcription on gene silencing (see also FIG. 3):
[0179] pSIM715: Vector pSIM715 contains a construct that comprises a first segment consisting of the gus gene fragment operationally linked to the promoter (P1) and a second segment in the opposite orientation that consists of the same gus gene fragment operationally linked to the constitutive 35 S promoter of cauliflower mosaic virus, designated `P2` and depicted in SEQ ID NO: 5, whereby the first and second segment are separated by two different introns.
[0180] pSIM717: Vector pSIM717 is identical to pSIM715 except that the two segments of the construct are separated by a single intron.
[0181] pSIM789: Vector pSIM789 is identical to pSIM717 except that the P2 promoter is replaced by a P1 promoter.
[0182] pSIM771: Vector pSIM771 is identical to pSIM717 except that the P1 promoter is replaced by the potato ubiquitin-7 promoter, which is depicted in SEQ ID NO: 6 and named here `P3`.
[0183] pSIM770: Vector pSIM770 is identical to pSIM717 except that the P2 promoter that drives expression of the selectable marker gene is replaced by P1, and the P1 promoter of the silencing construct is replaced by P2.
[0184] pSIM772: Vector pSIM772 contains the gus gene fragment inserted between two different oppositely oriented promoters, P2 and P3
[0185] pSIM756: Vector pSIM756 is identical to pSIM717 except that the gus gene fragments are oriented in the inverse complementary orientation relative to the promoter to which they are immediately linked.
[0186] pSIM779: Vector pSIM779 is an example of a tandem repeat of gus gene fragments inserted between two convergent promoters.
[0187] pSIM787: Vector pSIM787 is similar as pSIM779 but contains four direct repeats of the target gene fragment inserted between convergent promoters.
[0188] pSIM1111 is identical to pSIM779 except that the four direct repeats are preceded by an antisense DNA fragment of the gus gene that is different from SEQ ID NO: 2 and depicted as SEQ ID NO: 7.
[0189] Gus assays performed on re-transformed gus plants demonstrate that all tested terminator-free constructs that contain two segments, each containing a different promoter driving the gus gene fragment, in the inverse complementary orientation, pSIM715, 717, and 756, 771 are more efficaceous in silencing the gus gene than pSIM374, the construct that represents the best conventional approach. Furthermore, pSIM789 also confers effective gene silencing to many of the double transformants.
[0190] The experiment also shows that the use of a single gus gene fragment (pSIM779) is not as efficaceous. This result suggests that convergent transcription of at least two copies of the desired polynucleotide is important for effective silencing.
[0191] The experiment also showed that a construct with two direct repeats (pSIM780) triggered gene silencing. However, this arrangement was not as effective as the inverted repeat organization of pSIM756 (Table 1). Furthermore, four directs repeats (pSIM787) are more effective than two direct repeats (Table 1).
[0192] To study the molecular basis of terminator-free silencing, RNA was isolated from three plants that had been retransformed with pSIM717 and three additional plants retransformed with pSIM715. In each case, one plant represented an ineffective silencing event whereas the other two plants displayed near-complete gus gene silencing.
[0193] Reverse-transcription polymerase chain reactions (RT-PCRs) were performed to study to production of transcripts from the two different promoters used in pSIM715 and pSIM717. The first primer used for these experiments (PG, shown in SEQ ID NO: 8) is specific for a sequence of the gus gene fragment and anneals to transcripts produced from either strand. The second primer was designed to anneal to intron sequences of one of the strands only (pIF, shown in SEQ ID NO: 9, anneals to a sequence of the GBSS-intron derived spacer region of transcripts produced by the P1 promoter, and PIR, shown in SEQ ID NO: 10, anneals to transcripts produced by the P2 promoter). Interestingly, these studies demonstrated that the construct of the non-silenced plants 717-7 and 717-13 only contained transcripts produced from one of the two strands, either T1 or T2 (FIG. 7). In contrast, the silenced plants 715-19, 715-38, 717-55, and 717-36 produced transcripts from both strands (FIG. 7). Thus, effective silencing is accomplished only if both promoters of the construct are functionally active simultaneously.
[0194] Hybridization of subsequent RNA gel blots with radioactively labeled probes derived from the gus gene demonstrated that effective silencing in 715-38, 715-55, 717-12, 717-36, and 717-19 is correlated with a strong decrease in gus RNA accumulation (FIG. 8). Furthermore, the transcripts produced by the silencing construct in fully silenced plants were generally found to be varying in size from about 0.2-kb to about 1.0-kb (FIG. 8). Although RT-PCR revealed the presence of additional large transcripts that comprise not only the polynucleotide but also downstream promoter sequences, the presence of such transcripts could hardly be detected on RNA gel blots. For instance, hybridization with a probe derived from the P1 promoter required a seven-day exposure time before an extremely faint smear could be observed.
Example 3
Silencing in Potato Tubers
[0195] Various vectors were developed to test the concept of terminator-free silencing in tubers. These vectors contained an expression cassette for the neomycin phosphotransferase (nptII) gene as selectable marker system (see also FIG. 6). The driver promoters used for gene silencing in potato tubers were selected from the group consisting of: (1) the strong potato ubiquitin-7 promoter, (2) the strong tuber and stolon-specific promoter of the granule-bound starch synthase (GBSS) gene (SEQ ID NO: 12), and (3) the strong tuber-specific promoter of the potato ADP glucose pyrophosphorylase (AGP) gene (SEQ ID NO: 13). See FIG. 4 for maps of the transfer DNAs.
[0196] pSIM764: Vector pSIM764 contains a `tuber-silencing` construct that comprises a first segment consisting of the 154-bp trailer of the potato tuber-expressed PPO gene (SEQ ID NO: 14) operably linked to the GBSS promoter and a second segment in the opposite orientation that consists of the same trailer fragment operably linked to the GBSS promoter whereby the first and second segment are separated by the intron of the potato ubiquitin-7 gene depicted in SEQ ID NO: 15.
[0197] pSIM765: Vector pSIM765 is identical to pSIM764 except that the PPO gene fragments are oriented in the opposite orientation.
[0198] pSIM217 represents the control plasmid and contains the two copies of the PPO gene inserted as inverted repeat between GBSS promoter and ubiquitin terminator.
[0199] Ten-fold dilutions of overnight-grown cultures were grown for 5-6 hours, precipitated for 15 minutes at 2,800 RPM, washed with MS liquid medium (Phytotechnology) supplemented with sucrose (3%, pH 5.7), and resuspended in the same medium to 0.2 OD/600 nm. The resuspended cells were mixed and used to infect 0.4-0.6 mm internodal segments of the potato variety "Ranger Russet". Infected stems were incubated for two days on co-culture medium (1/10 MS salts, 3% sucrose, pH 5.7) containing 6 g/L agar at 22° C. in a Percival growth chamber (16 hrs light) and subsequently transferred to callus induction medium (CIM, MS medium supplemented with 3% sucrose 3, 2.5 mg/L of zeatin riboside, 0.1 mg/L of naphthalene acetic acid, and 6 g/L of agar) containing timentin (150 mg/L) and kanamycin (100 mg/L). After one month of culture on CIM, explants were transferred to shoot induction medium (SIM, MS medium supplemented with 3% sucrose, 2.5 mg/L of zeatin riboside, 0.3 mg/L of giberellic acid GA3, and 6 g/L of agar) containing timentin and kanamycin (150 and 100 mg/L respectively) until shoots arose. Shoots arising at the end of regeneration period were transferrred to MS medium with 3% sucrose, 6 g/L of agar and timentin (150 mg/L). Transgenic plants were transferred to soil and placed in a growth chamber (11 hours light, 25° C.). After three weeks, at least 3 minitubers/line were assayed for PPO activity. For this purpose, 1 g of potato tubers is pulverized in liquid nitrogen, added to 5 ml of 50 mM MOPS (3-(N-morpholino) propane-sulfonic acid) buffer (pH 6.5) containing 50 mM catechol, and incubated at room temperature with rotation for about 1 hour. The solid fraction was precipitated, and the supernatant transferred to another tube to determine PPO activity. For this purpose, 1 g of potato tubers was pulverized in liquid nitrogen. This powder was then added to 5 ml of 50 mM MOPS (3-(N-morpholino) propane-sulfonic acid) buffer (pH 6.5) containing 50 mM catechol, and incubated at room temperature with rotation for about 1 hour. The solid fraction was then precipitated, and the supernatant transferred to another tube to determine PPO activity by measuring the change of OD-410 over time. The experiment demonstrated that pSIM764 and 765 trigger effective silencing in potato tubers (Table 2). A comparison with data presented in WO 2003/069980 demonstrates that the method of the present invention can be more effective than that of conventional terminator-based gene silencing as exemplified by pSIM217, the PPO control.
Example 4
Multi-Gene Silencing in Tobacco
[0200] Two constructs were created to study the effect of the position of gene fragments within the silencing construct. For this purpose, the two copies of the gus gene fragment of pSIM771 were replaced by two copies of the gus gene fragment linked to a fragment of the tobacco polyphenol oxidase (PPO) gene (SEQ ID NO: 16) (also, see FIG. 6):
[0201] pSIM774: Vector pSIM774 contains a silencing construct with the gus gene fragments immediately linked to the promoters and the adjacent PPO gene fragments linked to the central intron.
[0202] pSIM775: Vector pSIM775 contains a silencing construct with the PPO gene fragments immediately linked to the promoters and the adjacent gus gene fragments linked to the central intron.
[0203] Retransformation of gus plants with these vectors is expected to trigger silencing as efficiently as pSIM771.
Example 5
Multi-Gene Silencing in Potato
[0204] Multiple gene silencing is implemented by simultaneously targeting three undesirable potato tuber genes.
[0205] Plasmid pSIM1121 (Russet Boise II) comprises an all-native transfer DNA depicted in SEQ ID NO: 17 comprising a silencing construct comprising two copies of a DNA segment, separated by the intron of the potato ubiquitin-7 gene and positioned as inverted repeat between two convergent GBSS promoters, whereby the DNA segment comprises (i) a fragment of the trailer of the tuber-expressed polyphenol oxidase gene of the wild potato relative Solanum verrucosum Schltdl. TRHRG 193, accession number 498062 (see: USDA, ARS, National Genetic Resources Program. Germplasm Resources Information Network--(GRIN). [Online Database] National Germplasm Resources Laboratory, Beltsville, Md. Available: http://www.ars-grin.gov2/cgi-bininpgs/html/acchtml.pl?1392998, 12 Sep. 2005) (SEQ ID NO: 18) (ii) a fragment of the leader of the phosphorylase L gene (SEQ ID NO: 19), and (iii) a fragment of the leader of the R1 gene (SEQ ID NO: 20).
[0206] Employment of this plasmid makes it possible to produce transformed potato plants that only contain native DNA and display the following new traits: (1) bruise tolerance due to silencing of the tuber-expressed PPO gene, (2) reduced cold-induced glucose accumulation due to silencing of the phosphorylase and R1 genes.
Example 6
Highly Effective Promoter Targeting
[0207] The following transformation vectors were produced to demonstrate that sequences of the target promoter can be used to silence expression of the target gene (see also FIG. 4):
[0208] pSIM773: Vector pSIM773 contains a construct that comprises a first segment comprising the P3 promoter linked to P1, and a second segment, which is oriented in the opposite orientation, and which comprises the P2 linked to P1. The first and second segment are separated by an intron. Thus, this construct contains four functionally active promoters. The two promoters in the middle are identical, represent the target promoter, and are in convergent orientation. The two outside promoters are different to each other and in convergent orientation. All four promoters contain a TATA box and proceed up to a base pair upstream from the transcription start.
[0209] pSIM1101: Vector pSIM1101 is identical to pSIM773 except that the P3 promoter was replaced by the nos terminator.
[0210] pSIM788: Vector pSIM788 is similar to pSIM773 except that the two central P1 promoters of the target gus gene only contain sequences upstream from the TATA box, (SEQ ID NO: 21), thus representing non-functional promoters.
[0211] pSIM1120: Vector pSIM1120 is similar to pSIM773 except that the two central promoters of the target gene lack a TATA box and are not in convergent but divergent orientation.
[0212] pSIM1112: Vector pSIM1112 contains a single non-functional P1 promoter inserted between convergent P2 and P3 promoter.
[0213] pSIM1113: Vector pSIM1113 contains two convergent P1 promoters separated by an intron.
[0214] pSIM754: Control vector pSIM754 contains the P1 promoter driving expression of the P2 promoter, and vice versa.
[0215] Retransformation of gus plants with pSIM773 yielded 35 hygromycin resistant plants. PCR analysis confirmed the presence of the transfer DNA of pSIM773. Surprisingly, subsequent gus staining revealed an extremely effective complete silencing of the gus gene (Table 1). Twenty plants (57%) did not display any detectable gus expression. Thus, promoter targeting using the pSIM773 strategy is highly desirable.
[0216] Similar results were obtained with the target promoters in divergent orienteation inserted between two convergent driver promoters, with 77% of plants that had been retransformed with pSIM1120 displaying full gus gene silencing (Table 1).
[0217] Table 1 shows that gene silencing was also accomplished by using a single target promoter inserted between two convergent driver promoters (pSIM1112). However, this method may be less effective than methods that employ two copies of the target promoter oriented as inverted repeat.
[0218] Furthermore, efficacy of pSIM1113 demonstrates that driver promoters are not always necessary. It is possible to effectively silencing a gene by simply employing two convergent target promoters (Table 1).
[0219] Many (44%) of the plants that were retransformed with pSIM1101 also displayed full gene silencing (Table 1). This finding demonstrates that promoter-based silencing does not require convergent transcription.
[0220] Conventional silencing methods have often been found to not provide stable gene silencing in subsequent generations. In contrast, four-promoter constructs represented by pSIM773 gave full silencing that is completely maintained upon transmission of the silencing cassette to the next generation. The enhanced stability was demonstrated by allowing double transformed tobacco plants to mature, and subsequently determining gus expression levels in T1 progenies. This study showed that 100% of the progeny plants that were derived from a pSIM773 plant and contained both gus gene and silencing cassette displayed full gus gene silencing (Table 3). In contrast, none of the T1 plants carrying the gus gene and pSIM374 silencing cassette displayed full gus gene silencing (Table 3). An intermediair phenotype was observed by analyzing the progeny of a plant carrying the gus gene and the silencing cassette of pSIM717 (Table 3).
Example 7
Sequence Requirements for Promoter Targeting
[0221] The above experiments demonstrated that promoter sequences can be used to effectively trigger gene silencing. However, they should not be understood to imply that any promoter fragment of the target gene could be employed for this purpose.
[0222] To study the sequence requirements for promoter-based silencing, two vectors were created that comprise two copies of only part of the P1 promoter inserted as inverted repeat between the driver promoters.
[0223] pSIM1118: Vector pSIM1118 contains two copies of an upstream 300-bp fragment of the promoter shown in SEQ ID NO: 11.
[0224] pSIM1119: Vector pSIM1119 contains two copies of a central 300-bp region of the P1 promoter shown in SEQ ID NO: 51.
[0225] Retransformation of gus plants with the two different constructs yielded 34 and 20 plants, respectively, that were analyzed histochemically. Interestingly, none of the analyzed plants displayed any reduced gus expression, indicating that the employed promoter fragments did not effectively trigger gene silencing (Table 1).
[0226] FIG. 9 shows a sequence analysis of the various promoter fragments. The fragment that facilitates effective gene silencing is present in pSIM773, 788, 1101, and 1120 but not in pSIM1118 and 1119.
Example 8
Reduced Cold-Sweetening in Tubers of Potato Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Promoter of the R1 Gene
[0227] The sequence of the promoter of the potato starch-associated R1 gene, including leader and start codon, is shown in SEQ ID NO: 22. Two copies of a short (342-bp) R1 promoter fragment (SEQ ID NO: 23) were inserted as inverted repeat between either two convergently oriented promoters of the GBSS promoter (in plasmid pSIM1038) or a GBSS and AGP promoter in convergent orientation (in plasmid pSIM1043). The resulting binary vectors were used to produce transformed potato plants. These plants will be allowed to develop tubers, and the tubers will be stored for about a month or longer at 4° C. Glucose analysis of the cold-stored tubers will demonstrate that the transformed plants accumulate less glucose than untransformed control plants. The reduced accumulation of glucose will lower color formation during French fry processing and, thus, make it possible to reduce blanch time and preserve more of the original potato flavor. Furthermore, promoter-mediated R1 gene silencing will limit starch phosphorylation and, therefore, reduce the environmental issues related to the release of waste water containing potato starch. Other benefits of the transformed tubers include: (1) resulting French fries will contain lower amounts of the toxic compound acrylamide, which is formed through a reaction between glucose and asparagine, and (2) resulting fries will display a crisper phenotype, as evaluated by professional sensory panels, due to the slightly altered structure of the starch.
[0228] Similar results can be obtained by employing a shorter (151-bp) part of the R1 promoter, shown in SEQ ID NO. 24. Binary vector pSIM1056 comprises two copies of this fragment inserted as inverted repeat between two convergently oriented GBSS promoters; pSIM1062 comprises the fragments inserted between convergently oriented GBSS and AGP promoters. This vector was used to produce 25 transformed plants, which can be shown to display reduced cold-induced glucose accumulation and all benefits associated with that trait.
Example 9
Enhanced Blackspot Bruise Tolerance in Tubers of Potato Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Promoter of the Polyphenol Oxidase Gene
[0229] The sequence of the promoter of the potato tuber-expressed polyphenol oxidase gene is shown in SEQ ID NO: 25. Two copies of a 200-bp PPO promoter fragment (SEQ ID NO: 26) were inserted as inverted repeat between convergent GBSS and AGP promoters. A binary vector comprising this silencing construct, designated pSIM1046, was used to produce twenty-five transformed potato plants. The plants can be allowed to develop tubers, and the tubers can be assayed for polyphenol oxidase activity. Such an analysis will show that the expression level of the targeted PPO gene is reduced if compared to levels in untransformed controls.
[0230] In a similar way, plasmid pSIM1045, which contains two copies of a 460-bp PPO promoter fragment (SEQ ID NO: 27) inserted between convergent GBSS and AGP promoters, can be used to lower PPO gene expression.
[0231] Similar strategies can be used in other crop species to limit bruise. For instance, the promoter of the leaf-expressed PPO gene of lettuce can be used to reduce bruise in lettuce leaves, the promoter of the fruit-expressed PPO gene of apple can be used to reduce bruise in apple fruit, and the promoter of the seed-expressed PPO gene of wheat can be used to reduce bruise in wheat grains. In all these and other cases, the promoter can be isolated straightforwardly by designing primers that anneal to the known PPO gene sequences, and performing well-known DNA isolation methods such as inverse PCR.
Example 10
Improved Oil Content in Seeds of Canola Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Promoter of the Fad2 Gene
[0232] The sequence of the promoter of the Brassica Fad2 gene, including leader, intron, and start codon, is shown in SEQ ID NO: 28. Two copies of a fragment of this promoter lacking any transcribed sequences such as the 441-bp fragment shown in SEQ ID 29 can be placed as inverted repeat between two convergently oriented promoters that are expressed in Brassica seeds. Examples of `driver` promoters are: the promoter of a napin (1.7S seed storage protein gene) gene shown in SEQ ID NO: 30 or the promoter of a stearoyl-ACP desaturase gene (SEQ ID NO: 31).
[0233] The silencing cassette can be placed within the transfer DNA sequence of a binary vector, and this binary vector can be used to transform Brassica. Some of the resulting plants will produce seed that contains increased amounts of oleic acid.
[0234] Other promoters that can be used in silencing constructs to improve oil composition in oilseed crops such as canola, soybean, cotton, and sunflower include promoters of other genes of the fatty acid biosynthesis pathway. For instance, a promoter of a target fatty acid desaturase 12, or microsomal omega-6 fatty acid desaturase, (FAD12) gene (e.g., Genbank Accession Nr. AF243045 for canola and AB188250 for soybean) such as the soybean FAD12 promoter shown in SEQ ID NO: 32 can be used to increase oleic acid levels in crops such as canola and soybean.
[0235] Furthermore, promoters of the cotton stearoyl-acyl-carrier protein delta 9-desaturase and oleoyl-phosphatidylcholine omega 6-desaturase genes can be used to increase stearic acid and oleic acid levels, respectively, in cotton. This promoter can be identified by performing methods such as inverse PCR using the known sequence of the target genes (Liu et al., Plant Physiol 129:1732-43, 2002). Two copies of the newly isolated promoter can then be used in strategies similar to that shown for pSIM773 whereby the `driver` seed-specific promoters can either represent foreign DNA or native DNA.
Example 11
Reduced Lignin Content in the Vascular System of Alfalfa Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Promoter of the Comt Gene
[0236] The promoter of the Medicago sativa (alfalfa) caffeic acid/5-hydroxyferulic acid 3/5-O-methyltransferase (COMT) gene, including leader, is shown in SEQ ID NO: 33.
[0237] Two copies of a 448-bp promoter fragment that lacks transcribed sequences (SEQ ID NO: 34) were inserted as inverted repeat between two convergently oriented driver promoters. The first driver promoter is the promoter of the petE gene shown in SEQ ID NO: 35; the second promoter is the promoter of the Pal gene shown in SEQ ID NO: 36. A binary vector comprising this silencing construct, designated pSIM1117, was used to produce transformed alfalfa plants. Stem tissues of the plants were assayed and shown to contain reduced levels of lignin.
[0238] Reduced lignin content can be determined according to the following protocol: (i) cut stem sections and place them on watch glass, (ii) immerse the cut stems in 1% potassium permanganate for 5 min at room temperature, (iii) discard the potassium permanganate solution using a disposable pipette and wash the samples twice with water to remove excess potassium permanganate, (iv) add 6% HC1 (V/V) and let the color of the sections turn from black or dark brown to light brown, (v) if necessary, add additional HC1 to facilitate the removal of dark color, (vi) discard the HC1 and wash the samples twice with water, (vii) add few drops of 15% sodium bicarbonate solution (some times it may not go into solution completely), a dark red or red-purple color develops for hardwoods (higher in S units) and brown color for softwood (higher in G units).
[0239] Nineteen transformed alfalfa lines were tested for reduced lignin content, and six plants were found to accumulate reduced amounts of the S-unit of lignin.
[0240] Instead of the promoter of the COMT gene, it is also possible to use the promoter of the caffeoyl CoA 3-O-methyltransferase (CCOMT) gene. The sequence of this promoter, together with downstream leader, is shown in SEQ ID NO: 37. A fragment of SEQ ID NO: 29 that lacks transcribed sequences as depicted in SEQ ID NR: 38 can be used to lower lignin content.
[0241] Similarly, lignin can be reduced in trees by using promoters of genes involved in lignin biosynthesis. It is also possible to use SEQ ID NO: 59 and reduce lignin content in maize by employing the above-described promoter-based silencing approach.
Example 12
Increased Shelf Life of Fruits of Tomato Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Promoter of the Polygalacturonase Gene
[0242] A promoter of a target polygalacturonase gene such as the tomato promoter shown in SEQ ID NO: 39 can be used to reduce breakdown of pectin, thus slowing cell wall degradation, delaying softening, enhancing viscosity characteristics, and increasing shelf life in tomato by inserting two copies of the promoter fragment as inverted repeat between convergent fruit-specific driver promoters.
Example 13
Reduced Allergenicity of Foods from Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Promoter of Genes Encoding Allergens
[0243] The promoter of the major apple allergen Mal d 1 gene can be isolated by employing inverse PCR methods using the known gene sequence (Gilissen et al., J Allergy Clin Immunol 115:364-9, 2005), and this promoter can then be used to develop apple varieties that contain lower allergenicity levels.
[0244] Similarly, the promoter of the major peanut allergen Ara h 2 (Dodo et al., Curr Allergy Asthma Rep 5, 67-73, 2005) can be isolated using inverse PCR methods, and used to develop peanut varieties that contain lower allergenicity levels.
[0245] Furthermore, the promoter of the major soybean allergen Gly m Bd 30 K (Herman et al., Plant Physiol 132, 36-43, 2003) can be isolated using inverse PCR methods, and used to develop peanut varieties that contain lower allergenicity levels.
Example 14
Multi-Gene Silencing Approach Based on a Combination of Gene and Promoter Fragments
[0246] Plasmid pSIM870 (Russet Boise III) comprises an all-native transfer DNA depicted in SEQ ID NO. 40 comprising (1) a first silencing cassette comprising two copies of a DNA segment positioned as inverted repeat between two convergent GBSS promoters whereby the DNA segment comprises (i) a fragment of the trailer of the tuber-expressed polyphenol oxidase gene of Solanum verrucosum, (ii) a fragment of the leader of the phosphorylase L gene, and (iii) a fragment of the trailer of the phosphorylase L gene (SEQ ID NO: 41), and (2) a second silencing cassette comprising two copies of the R1 promoter positioned as inverted repeat between the driver promoters of the AGP and GBSS genes, respectively.
[0247] Plasmid pSIM899 (Russet Boise IV) comprises an all-native transfer DNA depicted in SEQ ID NO: 42 comprising a first silencing cassette comprising two copies of a DNA segment positioned as inverted repeat between two convergent GBSS promoters whereby the DNA segment comprises (i) a fragment of the trailer of the tuber-expressed polyphenol oxidase gene of Solanum verrucosum, and (ii) a fragment of the leader of the phosphorylase L gene, and a second silencing cassette comprising four copies of the leader of the R1 gene operably linked to the AGP promoter and followed by an inverted repeat comprising a sense and antisense fragment of the R1 gene.
[0248] Potato transformation with any of these three plasmids will produce plants that, compared to untransformed plants, display the following characteristics: (1) reduced expression of the tuber-expressed polyphenol oxidase gene and, consequently, (i) increased tuber polyphenol content as can be determined by xx, and (ii) enhanced tolerance to tuber black spot bruise as can be determined by xx, and (2) strongly reduced expression of the phosphorylase and R1 genes and, consequently, (i) reduced starch phosphorylation and, consequently, lowered phosphate content of waste waters containing potato starch, and (ii) a reduced conversion of starch into glucose during cold-storage as determined by using the glucose oxidase/peroxidase reagent (Megazyme, Ireland), resulting in (a) less caramelization, and consequently, reduced color formation during frying, which makes it possible to store at higher temperatures and/or blanch for shorter time periods (b) less formation of acrylamide, and (c) increased crispness of fries.
Example 15
Intron Targeting
[0249] The polynucleotide used to generate a TFCT construct can contain the intron of a gene that produces the target transcript. The concept of intron-targeted silencing can be demonstrated by using the intron of the gus gene that is expressed in transgenic tobacco.
[0250] The following transformation vector was produced to demonstrate that sequences of the target intron can be used to silence expression of the target gene (see also FIG. 5):
[0251] Vector pSIM782, which contains a construct that comprises a first segment consisting of the intron of the gus gene operationally linked to the promoter (P1) and a second segment in the opposite orientation that consists of the same gus gene intron operationally linked to a second constitutive promoter (P2) whereby the first and second segment are separated by an intron.
[0252] An example of an intron that can be used to silence a gene is the intron of the Solanum vernei starch-associated R1 gene SEQ ID NO: 44. R1 gene silencing will reduce the extent of cold-induced sweetening in tubers during storage.
Example 16
Terminator Targeting
[0253] The polynucleotide used to generate a TFCT construct can comprise sequences downstream from the transcribed sequences of a target gene. This concept can be demonstrated by using the sequences downstream from the gus gene that is expressed in transgenic tobacco.
Example 17
Reduced Lignin Content in the Vascular System of Alfalfa Plants Containing a Silencing Construct Comprising Two Copies of a Fragment of the Comt Gene
[0254] A binary vector designated pSIM856 was assembled comprising an expression cassette comprising two Comt gene fragments depicted in SEQ ID NOs: 52 and 53, positioned as inverted repeat between two convergent alfalfa promoters shown in SEQ ID NOs: 54 and 55 in such a way that the promoters are operably linked to first the antisense fragment and then the sense fragment. The expression cassette is inserted between alfalfa derived sequences that function as replacement for Agrobacterium borders and are shown in SEQ ID NOs: 56 and 57. The entire transfer DNA, depicted in SEQ ID NO: 58 is inserted into a plasmid that carries an expression cassette for the Agrobacterium ipt gene in its backbone.
[0255] Transformations were carried out as described in Weeks and Rommens, US patent application US20050034188A1, which is incorporated herein by reference. Two transformed plants were tested for lignin content, and both were found to not visibly accumulate the S-unit.
Tables
TABLE-US-00001 [0256] TABLE 1 Efficacy of conventional and terminator-free silencing constructs. Tobacco Construct for 2nd plants gus expression transformation assayed 50-100% 10-50% 1-10% 0% none 3 3 (100%) 0 0 0 PSIM714 8 8 (100%) 0 0 0 PSIM374 36 13 (36%) 11 (31%) 9 (25%) 3 (8%) PSIM718 35 33 (95%) 1 (3%) 1 (3%) 0 PSIM728 23 15 (65%) 5 (22%) 3 (13%) 0 PSIM715 37 10 (27%) 11 (30%) 15 (41%) 1 (3%) PSIM717 35 11 (31%) 3 (9%) 19 (54%) 2 (6%) pSIM754 38 38 (100%) 0 0 0 PSIM755 36 35 (97%) 0 0 1 (3%) pSIM756 37 18 (49%) 12 (32%) 5 (14%) 2 (5%) PSIM758 29 29 (100%) 0 0 0 PSIM770 38 35 (92%) 3 (8%) 0 0 PSIM771 35 20 (57%) 3 (9%) 9 (26%) 3 (9%) PSIM772 35 34 (97%) 0 1 (3%) 0 PSIM773 35 15 (43%) 0 0 20 (57%) PSIM774 35 31 (89%) 2 (6%) 2 (6%) 1 (3%) PSIM775 36 22 (61%) 6 (17%) 7 (19%) 1 (3%) PSIM777 36 33 (92%) 1 (3%) 1 (3%) 1 (3%) PSIM778 36 32 (89%) 2 (6%) 2 (6%) 0 PSIM779 36 33 (92%) 1 (3%) 2 (6%) 0 PSIM782 35 34 (97%) 0 1 (3%) 0 PSIM787 32 20 (63%) 7 (22%) 3 (9%) 2 (6%) PSIM788 35 14 (40%) 0 0 21 (60%) PSIM789 35 19 (54%) 4 (11%) 6 (17%) 6 (17%) PSIM1101 34 14 (41%) 0 5 (15%) 15 (44%) PSIM1111 36 21 (58%) 9 (25%) 6 (17%) 0 pSIM1112 36 33 (92%) 1 (3%) 0 2 (6%) pSIM1113 34 24 (71%) 2 (6%) 3 (9%) 5 (15%) PSIM1118 34 34 (100%) 0 0 0 PSIM1119 20 20 (100%) 0 0 0 pSIM1120 35 8 (23%) 0 0 27 (77%)
TABLE-US-00002 TABLE 2 PPO activity in potato mini tubers. rep-1(OD) rep-2 (OD) rep-3 (OD) % of WT S.E. Control wt-1 0.127 0.121 0.137 87 2.6 wt-2 0.129 0.141 0.125 89 2.7 wt-3 0.138 0.146 0.123 92 3.7 wt-4 0.134 0.157 0.159 101 4.4 wt-5 0.152 0.173 0.169 111 3.6 wt-6 0.153 0.152 0.151 103 0.3 wt-7 0.173 0.158 0.167 112 2.4 wt-8 0.149 0.165 0.152 105 2.7 401-1 0.138 0.155 0.174 105 5.7 401-2 0.182 0.193 0.163 121 4.8 401-3 0.139 0.145 0.152 98 2.1 pSIM764 1 0.051 0.055 0.060 37 1.4 2 0.071 0.072 0.068 48 0.7 3 0.063 0.070 0.075 47 1.9 4 0.035 0.032 0.030 22 0.8 5 0.045 0.031 0.030 24 2.7 6 0.053 0.056 0.056 37 0.6 7 0.079 0.108 0.117 68 6.3 8 0.035 0.042 0.041 27 1.2 9 0.039 0.042 0.043 28 0.7 10 0.081 0.073 0.077 52 1.3 11 0.059 0.061 0.052 39 1.5 12 0.055 0.046 0.053 35 1.5 13 0.036 0.039 0.032 24 1.1 14 0.052 0.068 0.062 41 2.6 15 0.037 0.033 0.034 23 0.7 16 0.066 0.057 0.066 43 1.7 17 0.063 0.061 0.057 41 1.0 18 0.063 0.041 0.047 34 3.6 18 0.045 0.049 0.041 30 1.3 20 0.061 0.051 0.048 36 2.2 21 0.043 0.039 0.039 27 0.7 22 0.111 0.102 0.112 73 1.8 23 0.058 0.049 0.057 37 1.6 24 0.043 0.041 0.042 28 0.3 25 0.041 0.040 0.045 28 0.8 26 0.044 0.042 0.042 29 0.4 PSIM765 1 0.044 0.035 0.039 27 1.4 2 0.041 0.048 0.055 32 2.2 3 0.064 0.060 0.058 41 1.0 5 0.122 0.118 0.102 77 3.4 10 0.042 0.066 0.059 38 3.9 14 0.087 0.103 0.111 68 3.9 15 0.045 0.049 0.059 34 2.3 16 0.033 0.042 0.035 25 1.5 19 0.033 0.048 0.045 28 2.5 20 0.043 0.040 0.052 30 2.0 21 0.044 0.035 0.033 25 1.9 24 0.046 0.049 0.047 32 0.5 28 0.046 0.048 0.033 29 2.6 29 0.071 0.082 0.078 52 1.8 30 0.051 0.059 0.056 37 1.3 32 0.105 0.134 0.129 83 4.9 34 0.045 0.047 0.038 29 1.5 35 0.143 0.168 0.171 109 4.9 36 0.115 0.128 0.097 77 5.0 37 0.057 0.049 0.040 33 2.7 38 0.062 0.067 0.063 43 0.8 39 0.046 0.055 0.045 33 1.8 40 0.040 0.036 0.036 25 0.7 41 0.083 0.069 0.072 50 2.3 `wt` = untransformed wild type plants; `401` = transformed plants carrying a transfer DNA only comprising an expression cassette for the nptII seelectable marker gene, `OD` = OD260 measurement, `S.E.` = standard error.
TABLE-US-00003 TABLE 3 PCR positive for both gus gene and Parental line silencing construct Partially silenced Fully silenced 374-18 25/50 (50%) 24/25 (96%) 0 717-54 35/50 (70%) 28/35 (80%) 3/35 (9%) 773-4 23/50 (46%) 0 23/23 (100%)
Sequence CWU
1
601581DNAFigwort mosaic virus 1atttagcagc attccagatt gggttcaatc aacaaggtac
gagccatatc actttattca 60aattggtatc gccaaaacca agaaggaact cccatcctca
aaggtttgta aggaagaatt 120ctcagtccaa agcctcaaca aggtcagggt acagagtctc
caaaccatta gccaaaagct 180acaggagatc aatgaagaat cttcaatcaa agtaaactac
tgttccagca catgcatcat 240ggtcagtaag tttcagaaaa agacatccac cgaagactta
aagttagtgg gcatctttga 300aagtaatctt gtcaacatcg agcagctggc ttgtggggac
cagacaaaaa aggaatggtg 360cagaattgtt aggcgcacct accaaaagca tctttgcctt
tattgcaaag ataaagcaga 420ttcctctagt acaagtgggg aacaaaataa cgtggaaaag
agctgtcctg acagcccact 480cactaatgcg tatgacgaac gcagtgacga ccacaaaaga
attccctcta tataagaagg 540cattcattcc catttgaagg atcatcagat actcaaccaa t
5812304DNAArtificial SequenceDescription of
Artificial Sequence Synthetic construct 2caacgcgtaa actcgacccg
acgcgtccga tcacctgcgt caatgtaatg ttctgcgacg 60ctcacaccga taccatcagc
gatctctttg atgtgctgtg cctgaaccgt tattacggat 120ggtatgtcca aagcggcgat
ttggaaacgg cagagaaggt actggaaaaa gaacttctgg 180cctggcagga gaaactgcat
cagccgatta tcatcaccga atacggcgtg gatacgttag 240ccgggctgca ctcaatgtac
accgacatgt ggagtgaaga gtatcagtgt gcatggctgg 300atat
3043253DNAArtificial
SequenceDescription of Artificial Sequence Synthetic construct
3cgttcaaaca tttggcaata aagtttctta agattgaatc ctgttgccgg tcttgcgatg
60attatcatat aatttctgtt gaattacgtt aagcatgtaa taattaacat gtaatgcatg
120acgttattta tgagatgggt ttttatgatt agagtcccgc aattatacat ttaatacgcg
180atagaaaaca aaatatagcg cgcaaactag gataaattat cgcgcgcggt gtcatctatg
240ttactagatc ggg
2534244DNAArtificial SequenceDescription of Artificial Sequence Synthetic
construct 4gtggtaactt ttactcatct cctccaatta tttctgattt catgcatgtt
tccctacatt 60ctattatgaa tcgtgttatg gtgtataaac gttgtttcat atctcatctc
atctattctg 120attttgattc tcttgcctac tgaatttgac cctactgtaa tcggtgataa
atgtgaatgc 180ttcctcttct tcttcttctt ctcagaaatc aatttctgtt ttgtttttgt
tcatctgtag 240cttg
2445545DNACauliflower mosaic virus 5tagcttcatg gagtcaaaga
ttcaaataga ggacctaaca gaactcgccg taaagactgg 60cgaacagttc atacagagtc
tcttacgact caatgacaag aagaaaatct tcgtcaacat 120ggtggagcac gacacacttg
tctactccaa aaatatcaaa gatacagtct cagaagacca 180aagggcaatt gagacttttc
aacaaagggt aatatccgga aacctcctcg gattccattg 240cccagctatc tgtcacttta
ttgtgaagat agtggaaaag gaaggtggct cctacaaatg 300ccatcattgc gataaaggaa
aggccatcgt tgaagatgcc tctgccgaca gtggtcccaa 360agatggaccc ccacccacga
ggagcatcgt ggaaaaagaa gacgttccaa ccacgtcttc 420aaagcaagtg gattgatgtg
atatctccac tgacgtaagg gatgacgcac aatcccacta 480tccttcgcaa gacccttcct
ctatataagg aagttcattt catttggaga gaacacgggg 540gactc
54561024DNASolanum tuberosum
6tcgagcacat tgattgagtt ttatatgcaa tatagtaata ataataatat ttcttataaa
60gcaagaggtc aatttttttt tattatacca acgtcactaa attatatttg ataatgtaaa
120acaattcaat tttacttaaa tatcatgaaa taaactattt ttataaccaa attactaaat
180ttttccaata aaaaaaagtc attaagaaga cataaaataa atttgagtaa aaagagtgaa
240gtcgactgac tttttttttt ttatcataag aaaataaatt attaacttta acctaataaa
300acactaatat aatttcatgg aatctaatac ttacctctta gaaataagaa aaagtgtttc
360taatagaccc tcaatttaca ttaaatattt tcaatcaaat ttaaataaca aatatcaata
420tgaggtcaat aacaatatca aaataatatg aaaaaagagc aatacataat ataagaaaga
480agatttaagt gcgattatca aggtagtatt atatcctaat ttgctaatat ttaaactctt
540atatttaagg tcatgttcat gataaacttg aaatgcgcta tattagagca tatattaaaa
600taaaaaaata cctaaaataa aattaagtta tttttagtat atattttttt acatgaccta
660catttttctg ggtttttcta aaggagcgtg taagtgtcga cctcattctc ctaattttcc
720ccaccacata aaaattaaaa aggaaaggta gcttttgcgt gttgttttgg tacactacac
780ctcattatta cacgtgtcct catataattg gttaacccta tgaggcggtt tcgtctagag
840tcggccatgc catctataaa atgaagcttt ctgcacctca tttttttcat cttctatctg
900atttctatta taatttctct caattgcctt caaatttctc tttaaggtta gaaatcttct
960ctatttttgg tttttgtctg tttagattct cgaattagct aatcaggtgc tgttatagcc
1020ctta
10247175DNAArtificial SequenceDescription of Artificial Sequence
Synthetic construct 7ccttcacccg gttgccagag gtgcggattc accacttgca
aagtcccgct agtgccttgt 60ccagttgcaa ccacctgttg atccgcatca cgcagttcaa
cgctgacatc accattggcc 120accacctgcc agtcaacaga cgcgtggtta cagtcttgcg
cgacatgcgt cacca 175827DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 8caacgcgtaa actcgacccg acgcgtc
27927DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
9ttgtttttgt tcatctgtag cttctgc
271027DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 10tggaggagat gagtaaaagt taccacg
2711300DNAFigwort mosaic virus 11atttagcagc attccagatt
gggttcaatc aacaaggtac gagccatatc actttattca 60aattggtatc gccaaaacca
agaaggaact cccatcctca aaggtttgta aggaagaatt 120ctcagtccaa agcctcaaca
aggtcagggt acagagtctc caaaccatta gccaaaagct 180acaggagatc aatgaagaat
cttcaatcaa agtaaactac tgttccagca catgcatcat 240ggtcagtaag tttcagaaaa
agacatccac cgaagactta aagttagtgg gcatctttga 30012658DNASolanum
tuberosum 12gaaccatgca tctcaatctt aatactaaaa aatgcaacaa aattctagtg
gagggaccag 60taccagtaca ttagatatta tcttttatta ctataataat attttaatta
acacgagaca 120taggaatgtc aagtggtagc ggtaggaggg agttggttca gttttttaga
tactaggaga 180cagaaccgga ggggcccatt gcaaggccca agttgaagtc cagccgtgaa
tcaacaaaga 240gagggcccat aatactgtcg atgagcattt ccctataata cagtgtccac
agttgccttc 300cgctaaggga tagccacccg ctattctctt gacacgtgtc actgaaacct
gctacaaata 360aggcaggcac ctcctcattc tcacactcac tcactcacac agctcaacaa
gtggtaactt 420ttactcatct cctccaatta tttctgattt catgcatgtt tccctacatt
ctattatgaa 480tcgtgttatg gtgtataaac gttgtttcat atctcatctc atctattctg
attttgattc 540tcttgcctac tgaatttgac cctactgtaa tcggtgataa atgtgaatgc
ttcctcttct 600tcttcttctt ctcagaaatc aatttctgtt ttgtttttgt tcatctgtag
cttggtag 658131024DNASolanum tuberosum 13ccgcagtgtg ccagggctgt
cggcagatgg acataaatgg cacaccgctc ggctcgtgga 60aagagtatgg tcagtttcat
tgataagtat ttactcgtat tcggtgttta catcaagtta 120atatgttcaa acacatgtga
tatcatacat ccattagtta agtataaatg ccaacttttt 180acttgaatcg ccgaataaat
ttacttacgt ccaatattta gttttgtgtg tcaaacatat 240catgcactat ttgattaaga
ataaataaac gatgtgtaat ttgaaaacca attagaaaag 300aagtatgacg ggattgatgt
tctgtgaaat cactggtaaa ttggacggac gatgaaattt 360gatcgtccat ttaagcatag
caacatgggt ctttagtcat catcattatg ttataattat 420tttcttgaaa cttgatacac
caactttcat tgggaaagtg acagcatagt ataaactata 480atatcaattc tggcaatttc
gaattattcc aaatctcttt tgtcatttca tttcctcccc 540tatgtctgca agtaccaatt
atttaagtac aaaaaatctt gattaaacaa tttattttct 600cactaataat cacatttaat
catcaacggt tcatacacgt ctgtcactct ttttttattc 660tctcaagcgc atgtgatcat
accaattatt taaatacaaa aaatcttgat taaacaattc 720agtttctcac taataatcac
atttaatcat caacggttca tacacatccg tcactctttt 780tttattctct caagcgcatg
tgatcatacc aattatttaa atacaaaaaa tcttgattaa 840acaattcatt ttctcactaa
taatcacatt taatcatcaa cggtttatac acgtccgcca 900ctcttttttt attctctcaa
gcgtatgtga tcatatctaa ctctcgtgca aacaagtgaa 960atgacgttca ctaataaata
atcttttgaa tactttgttc agtttaattt atttaatttg 1020ataa
102414154DNASolanum tuberosum
14ttagtctcta ttgaatctgc tgagattaca ctttgatgga tgatgctctg tttttgtttt
60cttgttctgt tttttcctct gttgaaatca gctttgttgc ttgatttcat tgaagttgtt
120attcaagaat aaatcagtta caattatgtt tggg
15415569DNASolanum tuberosum 15gttagaaatc ttctctattt ttggtttttg
tctgtttaga ttctcgaatt agctaatcag 60gtgctgttat agcccttaat tttgagtttt
ttttcggttg ttttgatgga aaaggcctaa 120aatttgagtt tttttacgtt ggtttgatgg
aaaaggccta caattggagt tttccccgtt 180gttttgatga aaaagcccct agtttgagat
tttttttctg tcgattcgat tctaaaggtt 240taaaattaga gtttttacat ttgtttgatg
aaaaaggcct taaatttgag tttttccggt 300tgatttgatg aaaaagccct agaatttgtg
ttttttcgtc ggtttgattc tgaaggccta 360aaatttgagt ttctccggct gttttgatga
aaaagcccta aatttgagtt tctccggctg 420ttttgatgaa aaagccctaa atttgagttt
tttccccgtg ttttagattg tttggtttta 480attctcgaat cagctaatca gggagtgtga
aaagccctaa aatttgagtt tttttcgttg 540ttctgattgt tgtttttatg aatttgcag
56916154DNANicotiana tabacum
16ttagtctcta ttgaatctgc tgagattaca ctttgatgga tgatgctctg tttttgtttt
60cttgttctgt tttttcctct gttgaaatca gctttgttgc ttgatttcat tgaagttgtt
120attcaagaat aaatcagtta caattatgtt tggg
154173272DNASolanum tuberosum 17tggcaggata tataccggtg taaacgaagt
gtgtgtggtt gatccaaaat ctatcgtacc 60tttagaaagt gtagctatga aggatagtct
cacttatgaa gaactaccta ttgagattct 120tgatcgtcag gtccgaaggt tgagaaaaat
agaagtcgct tcagttacgg ctttgtggag 180gagtaagggt accgaaccat gcatctcaat
cttaatacta aaaaatgcaa caaaattcta 240gtggagggac cagtaccagt acattagata
ttatctttta ttactataat aatattttaa 300ttaacacgag acataggaat gtcaagtggt
agcggtagga gggagttggt tcagtttttt 360agatactagg agacagaacc ggaggggccc
attgcaaggc ccaagttgaa gtccagccgt 420gaatcaacaa agagagggcc cataatactg
tcgatgagca tttccctata atacagtgtc 480cacagttgcc ttccgctaag ggatagccac
ccgctattct cttgacacgt gtcactgaaa 540cctgctacaa ataaggcagg cacctcctca
ttctcacact cactcactca cacagctcaa 600caagtggtaa cttttactca tctcctccaa
ttatttctga tttcatgcat gtttccctac 660attctattat gaatcgtgtt atggtgtata
aacgttgttt catatctcat ctcatctatt 720ctgattttga ttctcttgcc tactgaattt
gaccctactg taatcggtga taaatgtgaa 780tgcttcctct tcttcttctt cttctcagaa
atcaatttct gttttgtttt tgttcatctg 840tagcttggta gattcccctt tttgtagacc
acacatcacg gatcccccaa acataattgt 900aactgattta ttcttgaata acaacttcaa
tgaaatcaag caacaaagct gatttcaaca 960tgaaaaaaca gaacaagaaa acgaaaacag
agcatcatcc atcaaagtgt aatctcagca 1020gattcaatag agactaactc gaggtgctct
ctatgcaaat ctagcttttc gaatgagagt 1080gataagagag tgaggattgt gaattatttt
attgatgaag attggagaag tcaattattg 1140attcacacac aggaattaag tgtgttgtgt
tgcgtcctct tgtggaaatt aaatgtcacc 1200ctttttttat ttatcaataa aagcacgaaa
atctcctgca ctactcccct gcactctctt 1260atatttgtcc atttcccaca aatccctaac
ttaattactt acccacactc aagcttcaac 1320actgttgagg ttaggaatcc ctggtacagc
aagttattcc ctaaggaatt actcatatcc 1380tcccactggc ttaattcact caagttcagc
tagaaacgtc gatttctagt gaagtaacga 1440ggaaattagc gaagaagcgt cgagaaattc
gatgaagatg aattcacgaa gcaaaatgaa 1500gattggagca gagagtatgg ggattggaga
gtggaaagtg gtagtgaaat aaggtccgcg 1560ggttaaattc atgattttat gaactcaata
gcttttcata atgagcaata ttatctttct 1620tcagtagcaa atccacatgc tcttatgctc
gctgaaatag ttttggccgt ggagtttcac 1680catctatgtt tacaattgat tcttgtagct
gcagacctta tttcactacc actttccact 1740ctccaatccc catactctct gctccaatct
tcattttgct tcgtgaattc atcttcatcg 1800aatttctcga cgcttcttcg ctaatttcct
cgttacttca ctagaaatcg acgtttctag 1860ctgaacttga gtgaattaag ccagtgggag
gatatgagta attccttagg gaataacttg 1920ctgtaccagg gattcctaac ctcaacagtg
ttgaagcttg agtgtgggta agtaattaag 1980ttagggattt gtgggaaatg gacaaatata
agagagtgca ggggagtagt gcaggagatt 2040ttcgtgcttt tattgataaa taaaaaaagg
gtgacattta atttccacaa gaggacgcaa 2100cacaacacac ttaattcctg tgtgtgaatc
aataattgac ttctccaatc ttcatcaata 2160aaataattca caatcctcac tctcttatca
ctctcattcg aaaagctaga tttgcataga 2220gagcacctcg agttagtctc tattgaatct
gctgagatta cactttgatg gatgatgctc 2280tgttttcgtt ttcttgttct gttttttcat
gttgaaatca gctttgttgc ttgatttcat 2340tgaagttgtt attcaagaat aaatcagtta
caattatgtt tgggtctaga gtgatgtgtg 2400gtctacaaaa aggggaatct accaagctac
agatgaacaa aaacaaaaca gaaattgatt 2460tctgagaaga agaagaagaa gaggaagcat
tcacatttat caccgattac agtagggtca 2520aattcagtag gcaagagaat caaaatcaga
atagatgaga tgagatatga aacaacgttt 2580atacaccata acacgattca taatagaatg
tagggaaaca tgcatgaaat cagaaataat 2640tggaggagat gagtaaaagt taccacttgt
tgagctgtgt gagtgagtga gtgtgagaat 2700gaggaggtgc ctgccttatt tgtagcaggt
ttcagtgaca cgtgtcaaga gaatagcggg 2760tggctatccc ttagcggaag gcaactgtgg
acactgtatt atagggaaat gctcatcgac 2820agtattatgg gccctctctt tgttgattca
cggctggact tcaacttggg ccttgcaatg 2880ggcccctccg gttctgtctc ctagtatcta
aaaaactgaa ccaactccct cctaccgcta 2940ccacttgaca ttcctatgtc tcgtgttaat
taaaatatta ttatagtaat aaaagataat 3000atctaatgta ctggtactgg tccctccact
agaattttgt tgcatttttt agtattaaga 3060ttgagatgca tggttcgagc tccttcaaca
tgttataaac ttcacatatt cagttgggaa 3120taggctttat aatgagttgg actacgttat
gtccccctca agtcccagaa ttatgtgccc 3180ccgtatgtta taagtcccct ctgcgggcat
caatttagtg atcacgccag acatgcctct 3240atacctcggc caggatatat ttgttggtaa
tg 327218152DNASolanum verrucosum
18cccaaacata attgtaactg atttattctt gaataacaac ttcaatgaaa tcaagcaaca
60aagctgattt caacatgaaa aaacagaaca agaaaacgaa aacagagcat catccatcaa
120agtgtaatct cagcagattc aatagagact aa
15219267DNASolanum tuberosum 19gtgctctcta tgcaaatcta gcttttcgaa
tgagagtgat aagagagtga ggattgtgaa 60ttattttatt gatgaagatt ggagaagtca
attattgatt cacacacagg aattaagtgt 120gttgtgttgc gtcctcttgt ggaaattaaa
tgtcaccctt tttttattta tcaataaaag 180cacgaaaatc tcctgcacta ctcccctgca
ctctcttata tttgtccatt tcccacaaat 240ccctaactta attacttacc cacactc
26720239DNASolanum tuberosum
20caacactgtt gaggttagga atccctggta cagcaagtta ttccctaagg aattactcat
60atcctcccac tggcttaatt cactcaagtt cagctagaaa cgtcgatttc tagtgaagta
120acgaggaaat tagcgaagaa gcgtcgagaa attcgatgaa gatgaattca cgaagcaaaa
180tgaagattgg agcagagagt atggggattg gagagtggaa agtggtagtg aaataaggt
23921520DNAArtificial SequenceDescription of Artificial Sequence
Synthetic construct 21atttagcagc attccagatt gggttcaatc aacaaggtac
gagccatatc actttattca 60aattggtatc gccaaaacca agaaggaact cccatcctca
aaggtttgta aggaagaatt 120ctcagtccaa agcctcaaca aggtcagggt acagagtctc
caaaccatta gccaaaagct 180acaggagatc aatgaagaat cttcaatcaa agtaaactac
tgttccagca catgcatcat 240ggtcagtaag tttcagaaaa agacatccac cgaagactta
aagttagtgg gcatctttga 300aagtaatctt gtcaacatcg agcagctggc ttgtggggac
cagacaaaaa aggaatggtg 360cagaattgtt aggcgcacct accaaaagca tctttgcctt
tattgcaaag ataaagcaga 420ttcctctagt acaagtgggg aacaaaataa cgtggaaaag
agctgtcctg acagcccact 480cactaatgcg tatgacgaac gcagtgacga ccacaaaaga
520221024DNASolanum tuberosum 22ttcaaatttc
atttgtgtca tataaattga gacatataat tgtcggcaca tgctcatgta 60tccaaacaag
gataatttga tcatctattc ttatatattt gaaaattacg ataataatac 120tttaaatcac
aataattaac aagttaaaat atttaaaagt catataaaaa attaattgac 180tctcaaaatt
ctgtaagtac tataaattaa aataaataac aacttaagaa tttcaaagtc 240ataaaaaatt
tggtggctct ctaaaatata tcaatgtcac ataaaaagta acatatatta 300ttcagaaatt
acgtaaaaga taccacaaat tacaataatt aacaacttga aatatttaaa 360atacataaaa
ataattaatt ttagaaattc caggcgtgcc acataaattg ggacaacgaa 420ataatatata
ctattatttt aaaattatgt aaaaaaataa ttctaaatca tgataattaa 480taacttaaaa
tattattaaa aatcatataa aaatttaaat aattgctcag gtttcagccg 540tattacataa
attaggataa aaaataatat atattgggcc ccgtgctggc acgggggccc 600gtatctagtt
tatataataa atatcgtttc tagtctatct cttctgatgc taaataaagt 660ctgtgattat
cttttaattt tttctactca gcatggggtg ccgtatctag tttatataat 720aaatatcgtt
tctagtctat ctcttctgat gctaaataaa gtcagtgatt attttttaat 780tttttctact
aggtaatgta aaattcttat gttaaccaaa taaattgaga caaattaatt 840cagttaacca
gagttaagag taaagtacta ttgcaagaaa atatcaaagg caaaagaaaa 900gatcatgaaa
gaaaatatca aagaaaaaga agaggttaca atcaaactcc cataaaactc 960caaaaataaa
cattcaaatt gcaaaaacat ccaatcaaat tgctctactt cacggggccc 1020acgc
102423342DNASolanum tuberosum 23aaaattctta tgttaaccaa ataaattgag
acaaattaat tcagttaacc agagttaaga 60gtaaagtact attgcaagaa aatatcaaag
gcaaaagaaa agatcatgaa agaaaatatc 120aaagaaaaag aagaggttac aatcaaactc
ccataaaact ccaaaaataa acattcaaat 180tgcaaaaaca tccaatcaaa ttgctctact
tcacggggcc cacgccggct gcatctcaaa 240ctttcccacg tgacatccca taacaaatca
ccaccgtaac ccttctcaaa actcgacacc 300tcactctttt tctctatatt acaataaaaa
atatacgtgt cc 34224151DNASolanum tuberosum
24cattcaaatt gcaaaaacat ccaatcaaat tgctctactt cacggggccc acgccggctg
60catctcaaac tttcccacgt gacatcccat aacaaatcac caccgtaacc cttctcaaaa
120ctcgacacct cactcttttt ctctatatta c
151251024DNASolanum tuberosum 25taatataaca taccatgggt ggagctagaa
gtctgattac aaatttcgtc aaattcaaca 60atatttgctt aaataatata tttgtatagt
aatttttttt acaaaatata tacaaattta 120ggtcaaggat tcagttatta accctttaaa
atcgtgtcat aaaattcaat gttaaaattc 180tgactttccc cgtgcttaac attacttatc
aaatttatgt ttctgtgtag aaaagtacta 240gtactactct ttgactcgtc tagacgtcta
ctatagatct ccttagatta aaaactccag 300ttttaatatt ttcctcacaa ttattattct
taatctacca cctaccggag tcacaaatat 360attaaatgaa aatattctat ctattaattt
atgatctacc tattgataat ttgtaatcta 420gtcaaaatga tggcaaaaaa aatataatat
ctagactgaa gttcttagtc aatagcgtaa 480atgaaagaaa aaaaaaaaag ctcaagaaga
aacatgatat ctttgttgct ctgattcgta 540aaaaaaaaaa catagtaact tcataaaata
tcttatcctt tggacagagc gatgaaaaaa 600atatattact agtaatactg agattagtta
cctgagacta tttcctatct tctgttttga 660tttgatttat taaggaaaat tatgtttcaa
cggccatgct tatccatgca ttattaatga 720tcaatatatt actaaatgct attactatag
gttgcttata tgttctgtaa tactgaatat 780gatgtataac taatacatac attaaattct
ctaataaatc tatcaacaga agcctaagag 840attaacaaat actactatta tccagactaa
gttatttttc tgtttactac agatccttcc 900aagaacaaaa acttaataat tgtatggctg
ctatacataa ttccccacct accgcttcct 960ggaataattg atatggaagc cgcctctaaa
attgaataat tatactgttt tacatattat 1020ataa
102426200DNASolanum tuberosum
26aagttatttt tctgtttact acagatcctt ccaagaacaa aaacttaata attgtatggc
60tgctatacat aattccccac ctaccgcttc ctggaataat tgatatggaa gccgcctcta
120aaattgaata attatactgt tttacatatt atataaagca aggtatagcc caatgaattt
180tcattcaaaa gctagcaata
20027460DNASolanum tuberosum 27ctagtaatac tgagattagt tacctgagac
tatttcctat cttctgtttt gatttgattt 60attaaggaaa attatgtttc aacggccatg
cttatccatg cattattaat gatcaatata 120ttactaaatg ctattactat aggttgctta
tatgttctgt aatactgaat atgatgtata 180actaatacat acattaaatt ctctaataaa
tctatcaaca gaagcctaag agattaacaa 240atactactat tatccagact aagttatttt
tctgtttact acagatcctt ccaagaacaa 300aaacttaata attgtatggc tgctatacat
aattccccac ctaccgcttc ctggaataat 360tgatatggaa gccgcctcta aaattgaata
attatactgt tttacatatt atataaagca 420aggtatagcc caatgaattt tcattcaaaa
gctagcaata 460281024DNABrassica sp. 28attgagcttg
aaggaacatt cgagcagata aacgaagcga gcgcaatggt tagagagctg 60attgggaggc
ttaattccgc atctaggaga ccacctggtg gcggtggtgg cgggggtggg 120cttggttctg
aagggaaacc acatccagga agcaacttca agacgaagat gtgtgagaga 180ttctctaaag
gaagctgtac atttggtgat agatgtcact ttgctcacgg ggaagcagag 240ctacgcaggt
catgaattgc gcctagagtt actggtgaaa caagtctctt tcatttgttg 300tggtgattcc
taatatcatc ttctcctact tgtttttagt tgtcttcgtt ttttgaaact 360acaatgttta
gttttcattg tcagtgtaag ttttccccat ttggtgtttt tttagaatct 420agtttgaatt
tgagatgggg caagcttgat gaatgattgg caaaacagtg gttaggattt 480gtgtgctgtc
tctacttaat atttcatgtt ttatctactt tattttggtc agcaagttga 540tgtgtttctc
tgatgtgtgt gtgattatca gcttagatta ttttgtgagt atgctagact 600gtataactaa
tcgttgtcga tgttatagtt ctcttataat gtttgataga ctatataact 660aaaaattcat
gttattaata gccgtcgctg atagtaacag ctgaataaat gaaatgaaat 720catggtaggt
gatgatcttt aaagaatgtt aaaaataatg tgtcgttata agcggtaatg 780catagaaaaa
ctctaatcat cttaacataa gagagagcga tagctttaat aaagtactta 840aattaattac
tagtcggcag tcgctgccta cttgtgtacc acctaaatta atttattata 900atatatgacg
aatctccaaa gtacatcaca cacactcggg gctattcacg tgatctcaac 960cacaatgtct
gcagatattt ttttaagttt tcttctcaca tgggagaaga agaagccaag 1020cacg
102429441DNABrassica sp. 29ccggctacca ctaacttcta cagttctact tgtgagtcgg
caaggacgtt tcctcatatt 60aaagtaaaga catcaaatac cataatctta atgctaatta
acgtaacgga tgagttctat 120aacataaccc aaactagtct ttgtgaacat taggattggg
taaaccaata tttacatttt 180aaaaacaaaa tacaaaaaga aacgtgataa actttataaa
agcaattata tgatcacggc 240atctttttca cttttccgta aatatatata agtggtgtaa
atatcagata tttggagtag 300aaaaaaaaaa aaagaaaaaa gaaatatgaa gagaggaaat
aatggagggg cccacttgta 360aaaaagaaag aaaagagatg tcactcaatc gtctcacacg
ggcccccgtc aatttaaacg 420gcctgccttc tgcccaatcg c
441301024DNABrassica sp. 30aagctttctt catcggtgat
tgattccttt aaagacttat gtttcttatc ttgcttctga 60ggcaagtatt cagttaccac
ttatattctg gactttctga ctgcatcctc atttttccaa 120cattttaaat ttcactattg
gctgaatgct tcttctttga ggaagaaaca attcagatgg 180cagaaatgta tcaaccaatg
catatataca aatgtacctc ttgttctcaa aacatctatc 240ggatggttcc atttgctttg
tcatccaatt agtgactact ttatattatt cactcctctt 300tattactatt ttcatgcgag
gttgccatgt acattatatt tgtaaggatt gacgctattg 360agcgtttttc ttcaattttc
tttattttag acatgggtat gaaatggttg ttagagttgg 420gttgaatgag atatacgttc
aagtgaatgg cataccgttc tcgagtaagg atgacctacc 480cattcttgag acaaatgtta
cattttagta tcagagtaaa atgtgtacct ataactcaaa 540ttcgattgac atgtatccat
tcaacataaa attaaaccag cctgcacctg catccacatt 600tcaagtattt tcaaaccgtt
cggctcctat ccaccgggtg taacaagacg gattccgaat 660ttggaagatt ttgactcaaa
ttcccaattt atattgaccg tgactaaatc aactttaact 720tctataattc tgattaagct
cccaatttat attcccaacg gcactacctc caaaatttat 780agactctcat ccccttttaa
accaacttag taaacgtttt tttttttaat tttatgaagt 840taagttttta ccttgttttt
aaaaagaatc gttcataaga tgccatgcca gaacattagc 900tacacgttac acatagcatg
cagccgcgga gaattgtttt tcttcgccac ttgtcactcc 960cttcaaacac ctaagagctt
ctctctcaca gcacacacat acaatcacat gcgtgcatgc 1020atta
1024312164DNABrassica napus
31ctgcaggtac aaagaggagc tctacttagt ttatgacttt atgcccagtg gaagccttga
60caagtacctc tacaccgaat cagatcaaga atatatgaag ttcacaagaa aaatcttagt
120atttgtttac tctatctttc tatgtaaatg tgtttttgct tttcaaaaaa gagctttgag
180aaaaattaaa gaagataact tgtcttaacc tatttttggt tcgggttttg cggagaactt
240ttgaaaataa tgacaactag gtgttttgcc ctcgatgcgg atttaaacat tttcataatt
300tttgaaaagt tctttgtaca ctatattcat tatactaaaa taaatcttaa aataatttaa
360tattatattt taaattatat aattaaaaaa caattatttg attaatattt aattatataa
420tttatgtttt aactttttac taaatacttt ttcagataac aatacaatac atatatatag
480aaattatctt ttttttaaat tatattttta gatcttggat aatttaatat tatattttta
540ttatataatt aagaatttta tttgattaat atttagttat ataattcatg ttttaaactt
600tatatatata attcatgttt ttaacttttt actaaaatac tttttcagat aacaatacaa
660tatatacata aattatctct ttttaaaatt atattttcag attttggata atgcttacta
720ttattaattt taatcaatta tctaatcaaa taaattaaaa ttttgtttta taggatataa
780acgatattaa tcattctaat ttttaacgtg agagttcgat tccaaaaatt tacttcgcaa
840ataatagtat atatctagct tattagggct ttaaaaggtt taggtttctt tacgctttaa
900ttgttttttt taactataat tgtaaacgtg ttaaacataa ctaatcagtg ttaaaacttg
960ctttatttta tttttccaac ttttagatta aagcataaag tgttaccata aaaaagaaga
1020ttaaagcata aagagatatc atttggtata atatttatgc caacgtataa tttgttttta
1080tcttttatgc aaacgcatac acatgtggac ttgaaagaaa cgacaatgag gacacttaac
1140acaaactccc aaaatgtcac ttaaagctat agttctgtca cggtctctca atggaaaatc
1200gtggtgctat caatgaaaaa acgttgtgct gaaactggca gagcacaaaa ctatagtcta
1260aaaaggattg aatgaagcaa aaaatgcaag aaccaaaggc acaacgattc tcacatttga
1320atgatattag aaatattagt ttcatttgcc aagtggacac atccacgtgg tagaggagcc
1380atgccacttg tctctttcgt gggttctcac gcccgagtta catttgaaaa ttacaaaata
1440aaagaacatt ttgtatatgt attgacattt ttacccttgc atatacatgt gtttagatct
1500aaattcacaa ctaatccatc tcttatcatt tttagttaac taagagcatc aatgttaacc
1560atgattctaa tttgaaatta attcatgatt tgatatttta ttattttatt tttatatttt
1620ttggttaaaa acagactctt atatctttta tttaagagat agttcttaat tttcttaatt
1680aaaagttaag aaacggttct taaccaaatg taaaaaccat attgtaagag ctcggattta
1740ttatgatcta aggaactcac gagtcaattc acctaatcaa atctaaaata tagtaattat
1800agctttaccg acatgtgata ctgccaaaaa taataaataa tatatagaca caagaaggat
1860gtgatagtga ggaatctgga gggcatttta gaactgatgc tcgattaaaa acagaaaata
1920atcttcaaaa ttttaattta cacgatagat gacgtcattt ttccatttgt tttgttaatt
1980agttatttat ttccttctct ctctttcctc cgagtgtaga ctcttccctc aaaccgctgt
2040ttctcataac catattctct ttctgtggac gaaactcaac cttaagagac cagagagagc
2100attagcctag agagagctcg ctcgtgtctg aaagaacatc aaacctcgta tcaaaaaaaa
2160gaaa
2164321024DNAGlycine maxmodified_base(143)a, c, g, t, unknown or other
32cagaaaaagg gaatagtttg gataaaatag attttaggtc tctcaattcc tagtcaaaat
60tagtctcaat cattatgctt taaaaatgat gattttgaca cttgggagat aacaatattt
120cttaaagttt gattctcaag ttngtatata tgaaatagtg tgttgggaga agtaaactct
180taaaataaat ttttatattt tagagatgat tctctcattc ttatgaagga gatatactag
240aaaaaaaatg atttttattt ttttattttt tattatgaaa cttaagaatt aagataccag
300gatgagggac aaaagtcatt aattattaaa aaaaaataca agaatcaaga ttattatttt
360taaaatataa aaaaaactaa ttttgatata taaagaaatc caggggatat aatacacact
420ctatcccaaa tatttggtta aacccccagg ggcccaatgt ttcgtctttc ctcaacagta
480taaattgcta atgatattat ttgtcttgaa ttggttcctg tggctagcat atctctgcaa
540cttgtgcaac catttggtaa ttcaattaag aatatataat atactttaaa tttactagga
600tgcataaaaa accctgtgac ttgtctgacc aagacttgcc aaattttttt atcatgcatt
660acaaaaacca gccatttgtt tttatttttt ggatttctat tctttccaaa tgaaggccta
720acagataaat tgcatgtcta atttcccctt gttattagag aaataagaaa ttataagctt
780ttgctttgac ttttgaacat attttacact ctttgcaggt tgctttttat cttggaagac
840cagaggagtc aaaataacag tgtcgcggta agtaagtgct cgacattctg gaatagtctc
900ttattgcgta ttgtgccatc attttgaggc cttgtnggct tgcatcacca ttgaaagaaa
960ttagtttgat ggttaaaatg gtataccttt tgtcttcatt attactcgaa ttacatttag
1020aaag
1024332024DNAMedicago sativa 33aaatgaaaga gagttaagga ttgaaatgaa
actggtaaaa aacagcttat tttaaaacat 60cttattcaaa acaacttatt ttatttaaaa
caatttattt tattcaaaac atgttttgaa 120taagttgttt tttgaaaata agctgttttg
aataagctgt ttttaaaata aggtgttttt 180cataaaataa gttgtttttg ttaaaataag
ttgttttttc aaataagctg ttttgaataa 240gctgtttttt tttaaataag ttgttttgaa
taagctgttt tttttaaata agttgttttt 300ttaaataagc tgttttgaat aagttgtttt
aaaataaggt gttttgcata aaataagctg 360ttttgaataa gttgttttga ataagttgtt
ttgaataagc tgtttttttt aaaaataaat 420tgttttcata aaataagctg tttttaaaat
aaggtgtttt gtataaataa gctttttaaa 480ataagctatt caaataagtt gtttttttgg
aaagatccaa caaagagttc aagtggtttc 540tttaaaataa aataaaaagt tcaagtggtt
tggttcggtt caaacggttc ggttcggttc 600aagatggttc ggttatggtt caagaactgt
taataaatta acggttcggt tcgtgaacca 660ttataacgat tcggttattt ttggttcggt
tcggttcgcg cggttcggtt cggttcatgg 720ttctttttgc ccacccctaa agaaaataaa
tgaatggtgg ttgagtattc ttaaaatgat 780ttgttttcta gaataaagag ttaataaggg
ggtcaaaaga gcaaccatct aaggtaaact 840ctcacattta gagttgatgc ggttaaaatt
tggatataac acttttgttg accaaaatgt 900ctcttatgaa taagactgaa agaagtaata
atttaaaaaa aaaaaatccg gctgttgcat 960tttttaaaac attaatccga agaaaagatg
tttgaaaatt gtttataatg agaagttatt 1020ttgagttttt tttccttcta aaaaaataat
gttattttca ttatgtttaa cacccataaa 1080actacttctg tttttttaaa gaatctctaa
aaatcaattt ctaaacgtca aaagtttttt 1140atacaattag tttagggtgt ttctatgagg
gtttgataat atttctacga ctatatatat 1200ttttttttta aggaaattct acgactactt
gtagttggaa tatgggaata cgactacttt 1260tctatgaaga gcaggttacg gtagacacaa
aagctgactc ttgcgcaaag cttgttcaac 1320ccaatagtga catattagga aatgaaaaat
accctaatgc ctccttttca atactcaaga 1380aaagtcctcc ttaccatatt gtcccatttt
ctttaagagc agagaagaac acatttgttc 1440acaccaacat gatttttgta tgcttgtaaa
tgaaaagctt ctagttatcc agctcaaccc 1500gtgactaagg tctattcaat ttgcttagaa
atgaggcatc aattatgatg caaatttttg 1560tactcattac tcaattcaaa aactatatga
acttgtggtg tcacgtaagt gaataacact 1620atctaaattt gagtacagta cttctcctgt
cacggggaga aaaacactca aaatcaattg 1680ttagagataa attttgtatc ataaattaat
taattttaca attacatcaa taaatgtcat 1740tgtttaatca aataatatat gacaaaactt
ctttgaaaat atactgagca aaaacaaaac 1800tattaattgc atgcaacggc aacacatttc
tgtttacaat tatattcggt gagtactcag 1860tcagtataac ccaattacca catatgcacg
aattctctta gtgggtccac attgtggtgg 1920ttgagtggga cccaattgta atggatggcc
cacatacacc aaactcaacc aaacaatttc 1980tcataaagtt ctatataata gcaatccact
ttgcatcatt gagg 202434448DNAArtificial
SequenceDescription of Artificial Sequence Synthetic construct
34caccaacatg atttttgtat gcttgtaaat gaaaagcttc tagttatcca gctcaacccg
60tgactaaggt ctattcaatt tgcttagaaa tgaggcatca attatgatgc aaatttttgt
120actcattact caattcaaaa actatatgaa cttatggtgt cacgtaagtg aataacacta
180tctaaatttg agtacttctc ctgtcacggg gagaaaaaca ctcaaaatca attgcatgca
240acggcaacac atttctgttt acaattatat tcggtgagta ctcagtcagt ataacccaat
300taccacatat gcacgaattc tcttagtggg tccacattgt ggtggttgag tgggacccaa
360ttgtaatgga tggcccacat acaccaaact caaccaaaca atttctcata aagttctata
420taatagcaat ccactttgca tcattgag
448351021DNAMedicago sativa 35atagtggacc agttaggtag gtggagaaag aaattattaa
aaaaatatat ttatatgttg 60tcaaataact caaaaatcat aaaagtttaa gttagcaagt
gtgcacattt ttatttggac 120aaaagtattc acctactact gttataaatc attattaaac
attagagtaa agaaatatgg 180atgataagaa taagagtagt gatattttga caacaatttt
gttacaacat ttgagaaaat 240tttgttgttc tctcttttca ttggtcaaaa acaatagaga
gagagagaga aaaaggaaga 300gggagaataa aaacataatg tgagtatgag agagaaagtt
gtacaaaagt tgtaccaaaa 360tggttgtaca aatatcattg aggaatttga caaaagctac
acaaataagg gttaattgct 420gtaaataaat aaggatgacg cattagagag atgtaccatt
agagaatttt tggcaagtca 480ttaaaaagaa agaataaatt atttttaaaa ttaaaagttg
agtcatttga ttaaacatgt 540gattatttaa tgaattgatg agagagttgg attaaagttg
tattaatgat tagaatttgg 600tgtcaaattt aatttgacat ttgatctttt cctatatatt
gccccataga gtcatttaac 660tcatttttat atttcataga tcaaataaga gaaataacgg
tatattaatc cctccaacaa 720aaaaaaaaaa aaaacggtat atttactaaa aaatctaagc
cacgtaggag gataacatcc 780aatccaacca atcacaacaa tcctgatgag ataacccact
ttaagcccac gcactctgtg 840gcacatctac attatctaaa tcacacattc ttccacacat
ctgagccaca caaaaaccaa 900tccacatctt tatcatccat tctataaaaa atcacacttt
gtgagtctac actttgattc 960ccttcaaaca catacaaaga gaagagacta attaattaat
taatcatctt gagagaaagc 1020c
102136858DNAMedicago sativa 36agagaggagg cagtgtacac
aggggcagag agaggtgagt cgtctttctg gtagggctgg 60tgttggggat agtggttggt
ttgagagtca ggtggtgagg agggttggcg atggggttga 120tacgttgttt tggttggata
ggtggttagg agatgctcct ttttgtgttt gtttcaggag 180gttgtttgag ttaacagaga
acaaatttgt gtctgtggct aatttgttat ctgttgactc 240ggagcagtgg ggggaggtgt
tgaggtgaag cgtatggtgg cagaggtggt ggcagaggtg 300aagcgtatgg tggcagctga
gggaggcagt gtacacagag gtggagagag aggagagaga 360agagagaaga gagagaaaat
ggagaagaga gaagagaaga gagagaagac aaatttttgt 420gtgtgtgacc aaaccaaaat
tcttggtcct ggtccacaca agattttctc ccaaccaagg 480tacaagaata ccacgatcca
agagtgccac gttgcaacat cataaccgtt caatagtaag 540agataatcga acggccataa
ttaattttca acaaacccac ttttttcctc ctacttttgc 600aacttgtccc tcatcaccta
ccaaacacac atagcacacc aacacacata ataatattat 660aataattgta aatatatgta
gcctccaaat tagaaagaaa cctctatata aagcctaact 720acttccttca caaatcagga
aattcacaac tctaatattc atttctttcc taatcattag 780aatttccatt cttataaaat
tctaggtacc accacacaac aaataaagga acattaatca 840atactattaa gatggatc
858371008DNAMedicago sativa
37cttctattaa tgatttaatc aacctttttt aaaatacgaa ggtgacctta ttttgcaaat
60aatccatgca tggaaatgca tcatcctttt gaaaatggga ttatctgaat tcttaagtta
120cgtgaaaatt taatacattt cattttagat aaatttatta ttaaaattca cacttagatg
180gcctaaaaat taacacttat ttttaacaat tcaaataaaa tatacgacga aatgagtgta
240atttagttgg ttaagcatcg tcaagcttgg agagaaagat catagtttga tctttgaaaa
300ctacactatt gaaaagggtg aagatatcta aacatccaaa caaaatttat tttgatagtc
360gattcaaatt atcaaaattt gtgaaaatat tttgtaaatt gttaagttgg caaaaatatg
420ttaattttca aattaccatt tgcacatttt tctaatctca aatcacattt aagggatgtt
480gactacttta gttttgtaca aatctttaca attttaacat ttataaaatg tgtttcggta
540gataaaaagt gtgagtattg tttataagag attgtgtttt tcttttgttt aaacttataa
600aataaatata tattttattt tattttaatg tgagattgta agaattcatt ataagattat
660gtcattccct caaaagaaaa ttagatgatg tcattttcat aactcatttt ctataaatac
720agaaaatcct caaaaatgaa aaacctcagt caaaaaataa aagaaaaaca tcaatagtgg
780actggcccac actcattgct ttgctttagt ataagaaagt agacctcacc aaccacgaac
840cggacgccaa ccggttcaac caaacattac accaattttc cttaaccata ccggtttttc
900cctcccttat ataaccatct tcctacctct tatctaacca agctccattc aactcttcaa
960cacatatcag aaacagaaaa agaagcaaaa cattccaaga atttaaca
100838171DNAArtificial SequenceDescription of Artificial Sequence
Synthetic construct 38catcaatagt ggactggccc acactcattg ctttgcttta
gtataagaaa gtagacctca 60ccaaccacga accggacgcc aaccggttca accaaacatt
acaccaattt tccttaacca 120taccggtttt tccctccctt atataaccat cttcctacct
cttatctaac c 171391024DNALycopersicon esculentum 39aagcttctta
aaaaggcaaa ttgattaatt tgaagtcaaa ataattaatt ataacaatgg 60taaagcacct
taagaaacca tagtttgaaa ggttaccaat gcgctatata ttaatcaact 120tgataatata
aaaaaaattt caattcgaaa agggcctaaa atattctcaa agtattcgaa 180atggtacaaa
actaccatcc gtccacctat tgactccaaa ataaaattat tatccacctt 240tgagtttaaa
attgactact tatataacaa ttctaaattt aaactatttt aatactttta 300aaaatacatg
gcgttcaaat atttaatata atttaattta tgaatatcat ttataaacca 360accaactacc
aactcattaa tcattaaatc ccacccaaat tctactatca aaattgtcct 420aaacactact
aaaacaagac gaaattgttc gagtccgaat cgaagcacca atctaattta 480ggttgagccg
catatttagg aggacacttt caatagtatt tttttcaagc atgaatttga 540aatttaagat
taatggtaaa gaagtagtac acccgaatta attcatgcct tttttaaata 600taattatata
aatatttatg atttgtttta aatattaaaa cttgaatata ttatttttaa 660aaaaattatc
tattaagtac catcacataa ttgagacgag gaataattaa gatgaacata 720gtgtttaatt
agtaatggat gggtagtaaa tttatttata aattatatca ataagttaaa 780ttataacaaa
tatttgagcg ccatgtattt taaaaaatat taaataagtt tgaatttaaa 840accgttagat
aaatggtcaa ttttgaaccc aaaagtggat gagaagggta ttttagagcc 900aataggggga
tgagaaggat attttgaagc caatatgtga tggatggagg ataattttgt 960atcatttcta
atactttaaa gatattttag gtcattttcc cttctttagt ttatagacta 1020tagt
1024401024DNASolanum tuberosum 40tggcaggata tataccggtg taaacgaagt
gtgtgtggtt gatccaaaat ctatcgtacc 60tttagaaagt gtagctatga aggatagtct
cacttatgaa gaactaccta ttgagattct 120tgatcgtcag gtccgaaggt tgagaaaaat
agaagtcgct tcagttacgg ctttgtggag 180gagtaagggt accgaaccat gcatctcaat
cttaatacta aaaaatgcaa caaaattcta 240gtggagggac cagtaccagt acattagata
ttatctttta ttactataat aatattttaa 300ttaacacgag acataggaat gtcaagtggt
agcggtagga gggagttggt tcagtttttt 360agatactagg agacagaacc ggaggggccc
attgcaaggc ccaagttgaa gtccagccgt 420gaatcaacaa agagagggcc cataatactg
tcgatgagca tttccctata atacagtgtc 480cacagttgcc ttccgctaag ggatagccac
ccgctattct cttgacacgt gtcactgaaa 540cctgctacaa ataaggcagg cacctcctca
ttctcacact cactcactca cacagctcaa 600caagtggtaa cttttactca tctcctccaa
ttatttctga tttcatgcat gtttccctac 660attctattat gaatcgtgtt atggtgtata
aacgttgttt catatctcat ctcatctatt 720ctgattttga ttctcttgcc tactgaattt
gaccctactg taatcggtga taaatgtgaa 780tgcttcctct tcttcttctt cttctcagaa
atcaatttct gttttgtttt tgttcatctg 840tagcttggta gattcccctt tttgtagacc
acacatcacg gatcccccaa acataattgt 900aactgattta ttcttgaata acaacttcaa
tgaaatcaag caacaaagct gatttcaaca 960tgaaaaaaca gaacaagaaa acgaaaacag
agcatcatcc atcaaagtgt aatctcagca 1020gatt
102441139DNASolanum tuberosum
41gagggggaag tgaatgaaaa ataacaaagg cacagtaagt agtttctctt tttatcatgt
60gatgaaggta tataatgtat gtgtaagagg atgatgttat taccacataa taagagatga
120agagtctcat tttctgctt
139426087DNASolanum tuberosum 42tggcaggata tataccggtg taaacgaagt
gtgtgtggtt gatccaaaat ctatcgtacc 60tttagaaagt gtagctatga aggatagtct
cacttatgaa gaactaccta ttgagattct 120tgatcgtcag gtccgaaggt tgagaaaaat
agaagtcgct tcagttacgg ctttgtggag 180gagtaagggt accgaaccat gcatctcaat
cttaatacta aaaaatgcaa caaaattcta 240gtggagggac cagtaccagt acattagata
ttatctttta ttactataat aatattttaa 300ttaacacgag acataggaat gtcaagtggt
agcggtagga gggagttggt tcagtttttt 360agatactagg agacagaacc ggaggggccc
attgcaaggc ccaagttgaa gtccagccgt 420gaatcaacaa agagagggcc cataatactg
tcgatgagca tttccctata atacagtgtc 480cacagttgcc ttccgctaag ggatagccac
ccgctattct cttgacacgt gtcactgaaa 540cctgctacaa ataaggcagg cacctcctca
ttctcacact cactcactca cacagctcaa 600caagtggtaa cttttactca tctcctccaa
ttatttctga tttcatgcat gtttccctac 660attctattat gaatcgtgtt atggtgtata
aacgttgttt catatctcat ctcatctatt 720ctgattttga ttctcttgcc tactgaattt
gaccctactg taatcggtga taaatgtgaa 780tgcttcctct tcttcttctt cttctcagaa
atcaatttct gttttgtttt tgttcatctg 840tagcttggta gattcccctt tttgtagacc
acacatcacg gatcccccaa acataattgt 900aactgattta ttcttgaata acaacttcaa
tgaaatcaag caacaaagct gatttcaaca 960tgaaaaaaca gaacaagaaa acgaaaacag
agcatcatcc atcaaagtgt aatctcagca 1020gattcaatag agactaactc gaggtgctct
ctatgcaaat ctagcttttc gaatgagagt 1080gataagagag tgaggattgt gaattatttt
attgatgaag attggagaag tcaattattg 1140attcacacac aggaattaag tgtgttgtgt
tgcgtcctct tgtggaaatt aaatgtcacc 1200ctttttttat ttatcaataa aagcacgaaa
atctcctgca ctactcccct gcactctctt 1260atatttgtcc atttcccaca aatccctaac
ttaattactt acccacactc aagcttaagc 1320agaaaatgag actcttcatc tcttattatg
tggtaataac atcatcctct tacacataca 1380ttatatacct tcatcacatg ataaaaagag
aaactactta ctgtgccttt gttatttttc 1440attcacttcc ccctcccgcg ggttaaattc
atgattttat gaactcaata gcttttcata 1500atgagcaata ttatctttct tcagtagcaa
atccacatgc tcttatgctc gctgaaatag 1560ttttggccgt ggagtttcac catctatgtt
tacaattgat tcttgtagct gcaggagggg 1620gaagtgaatg aaaaataaca aaggcacagt
aagtagtttc tctttttatc atgtgatgaa 1680ggtatataat gtatgtgtaa gaggatgatg
ttattaccac ataataagag atgaagagtc 1740tcattttctg cttaagcttg agtgtgggta
agtaattaag ttagggattt gtgggaaatg 1800gacaaatata agagagtgca ggggagtagt
gcaggagatt ttcgtgcttt tattgataaa 1860taaaaaaagg gtgacattta atttccacaa
gaggacgcaa cacaacacac ttaattcctg 1920tgtgtgaatc aataattgac ttctccaatc
ttcatcaata aaataattca caatcctcac 1980tctcttatca ctctcattcg aaaagctaga
tttgcataga gagcacctcg agttagtctc 2040tattgaatct gctgagatta cactttgatg
gatgatgctc tgttttcgtt ttcttgttct 2100gttttttcat gttgaaatca gctttgttgc
ttgatttcat tgaagttgtt attcaagaat 2160aaatcagtta caattatgtt tgggtctaga
gtgatgtgtg gtctacaaaa aggggaatct 2220accaagctac agatgaacaa aaacaaaaca
gaaattgatt tctgagaaga agaagaagaa 2280gaggaagcat tcacatttat caccgattac
agtagggtca aattcagtag gcaagagaat 2340caaaatcaga atagatgaga tgagatatga
aacaacgttt atacaccata acacgattca 2400taatagaatg tagggaaaca tgcatgaaat
cagaaataat tggaggagat gagtaaaagt 2460taccacttgt tgagctgtgt gagtgagtga
gtgtgagaat gaggaggtgc ctgccttatt 2520tgtagcaggt ttcagtgaca cgtgtcaaga
gaatagcggg tggctatccc ttagcggaag 2580gcaactgtgg acactgtatt atagggaaat
gctcatcgac agtattatgg gccctctctt 2640tgttgattca cggctggact tcaacttggg
ccttgcaatg ggcccctccg gttctgtctc 2700ctagtatcta aaaaactgaa ccaactccct
cctaccgcta ccacttgaca ttcctatgtc 2760tcgtgttaat taaaatatta ttatagtaat
aaaagataat atctaatgta ctggtactgg 2820tccctccact agaattttgt tgcatttttt
agtattaaga ttgagatgca tggttcgagc 2880tcccgcagtg tgccagggct gtcggcagat
ggacataaat ggcacaccgc tcggctcgtg 2940gaaagagtat ggtcagtttc attgataagt
atttactcgt attcggtgtt tacatcaagt 3000taatatgttc aaacacatgt gatatcatac
atccattagt taagtataaa tgccaacttt 3060ttacttgaat cgccgaataa atttacttac
gtccaatatt tagttttgtg tgtcaaacat 3120atcatgcact atttgattaa gaataaataa
acgatgtgta atttgaaaac caattagaaa 3180agaagtatga cgggattgat gttctgtgaa
atcactggta aattggacgg acgatgaaat 3240ttgatcgtcc atttaagcat agcaacatgg
gtctttagtc atcatcatta tgttataatt 3300attttcttga aacttgatac accaactttc
attgggaaag tgacagcata gtataaacta 3360taatatcaat tctggcaatt tcgaattatt
ccaaatctct tttgtcattt catttcctcc 3420cctatgtctg caagtaccaa ttatttaagt
acaaaaaatc ttgattaaac aatttatttt 3480ctcactaata atcacattta atcatcaacg
gttcatacac gtctgtcact ctttttttat 3540tctctcaagc gcatgtgatc ataccaatta
tttaaataca aaaaatcttg attaaacaat 3600tcagtttctc actaataatc acatttaatc
atcaacggtt catacacatc cgtcactctt 3660tttttattct ctcaagcgca tgtgatcata
ccaattattt aaatacaaaa aatcttgatt 3720aaacaattca ttttctcact aataatcaca
tttaatcatc aacggtttat acacgtccgc 3780cactcttttt ttattctctc aagcgtatgt
gatcatatct aactctcgtg caaacaagtg 3840aaatgacgtt cactaataaa taatcttttg
aatactttgt tcagtttaat ttatttaatt 3900tgataagaat ttttttatta ttgaattttt
attgttttaa attaaaaata agttaaatat 3960atcaaaatat cttttaattt tatttttgaa
aaataacgta gttcaaacaa attaaaattg 4020agtaactgtt tttcgaaaaa taatgattct
aatagtatat tctttttcat cattagatat 4080tttttttaag ctaagtacaa aagtcatatt
tcaatcccca aaatagcctc aatcacaaga 4140aatgcttaaa tccccaaaat accctcaatc
acaagacgtg tgtaccaatc atacctatgg 4200tcctctcgta aattccgaca aaatcaggtc
tataaagtta cccttgatat cagtattata 4260aaactaaaaa tctcagctgt aattcaagtg
caatcacact ctaccacaca ctctctagta 4320gagagatcag ttgataacaa gcttgttaac
ggatccaaaa ttcttatgtt aaccaaataa 4380attgagacaa attaattcag ttaaccagag
ttaagagtaa agtactattg caagaaaata 4440tcaaaggcaa aagaaaagat catgaaagaa
aatatcaaag aaaaagaaga ggttacaatc 4500aaactcccat aaaactccaa aaataaacat
tcaaattgca aaaacatcca atcaaattgc 4560tctacttcac ggggcccacg ccggctgcat
ctcaaacttt cccacgtgac atcccataac 4620aaatcaccac cgtaaccctt ctcaaaactc
gacacctcac tctttttctc tatattacaa 4680taaaaaatat acgtgtcccc gcgggttaaa
ttcatgattt tatgaactca atagcttttc 4740ataatgagca atattatctt tcttcagtag
caaatccaca tgctcttatg ctcgctgaaa 4800tagttttggc cgtggagttt caccatctat
gtttacaatt gattcttgta gctgcaggga 4860cacgtatatt ttttattgta atatagagaa
aaagagtgag gtgtcgagtt ttgagaaggg 4920ttacggtggt gatttgttat gggatgtcac
gtgggaaagt ttgagatgca gccggcgtgg 4980gccccgtgaa gtagagcaat ttgattggat
gtttttgcaa tttgaatgtt tatttttgga 5040gttttatggg agtttgattg taacctcttc
tttttctttg atattttctt tcatgatctt 5100ttcttttgcc tttgatattt tcttgcaata
gtactttact cttaactctg gttaactgaa 5160ttaatttgtc tcaatttatt tggttaacat
aagaattttt ctagagtgat gtgtggtcta 5220caaaaagggg aatctaccaa gctacagatg
aacaaaaaca aaacagaaat tgatttctga 5280gaagaagaag aagaagagga agcattcaca
tttatcaccg attacagtag ggtcaaattc 5340agtaggcaag agaatcaaaa tcagaataga
tgagatgaga tatgaaacaa cgtttataca 5400ccataacacg attcataata gaatgtaggg
aaacatgcat gaaatcagaa ataattggag 5460gagatgagta aaagttacca cttgttgagc
tgtgtgagtg agtgagtgtg agaatgagga 5520ggtgcctgcc ttatttgtag caggtttcag
tgacacgtgt caagagaata gcgggtggct 5580atcccttagc ggaaggcaac tgtggacact
gtattatagg gaaatgctca tcgacagtat 5640tatgggccct ctctttgttg attcacggct
ggacttcaac ttgggccttg caatgggccc 5700ctccggttct gtctcctagt atctaaaaaa
ctgaaccaac tccctcctac cgctaccact 5760tgacattcct atgtctcgtg ttaattaaaa
tattattata gtaataaaag ataatatcta 5820atgtactggt actggtccct ccactagaat
tttgttgcat tttttagtat taagattgag 5880atgcatggtt cgagctcctt caacatgtta
taaacttcac atattcagtt gggaataggc 5940tttataatga gttggactac gttatgtccc
cctcaagtcc cagaattatg tgcccccgta 6000tgttataagt cccctctgcg ggcatcaatt
tagtgatcac gccagacatg cctctatacc 6060tcggccagga tatatttgtt ggtaatg
608743654DNASolanum tuberosum
43gtaaatttct agtgattata ctgtacattt cgcataattt aggatcgtat ttgatatgtt
60ttacgcttga ttgatcgaga acttaaagct tttctgatct gaaatttgtt ttttggcata
120ctcgagttga gatcctggtt aaatcagtgt tatttcgatt gaattttagc aaaatttggt
180gttgattttc agtattttca tggtttaatg tatataaaca agcttaattt ttcaaattca
240agctcgttta accttttaat tacagcatat ttctggaaaa aagtttggtg atttctctag
300atgttttatt cgagaaaaaa acaaaaacga aaaaagggga aatgctgttc tgtatgtaca
360aaaagtgatt gatcagcttt tggtcaccga catacatttg attagtacat acacgagtca
420tacgagtata tttccgtgtg cactttattg ttttgaagga attctggatt tggttgattc
480ctttttaaaa cttctaagtt ttttttgtta cattttactc taattaagtc ttctctgtga
540actgacaaat actcaccagg cacacattac aaccttcatt tgattatccg cgaacgatcc
600attgcttttg tgtatattgc ttttgtattg actgattttg tattgtatta gcag
6544425DNASolanum tuberosum 44cattaccaac aaatatatcc tggcc
254525DNALycopersicon esculentum 45cattaccaac
aaatatatcc tggcc
254625DNALycopersicon esculentum 46ctctacctct gaatatatcc tgcgg
254725DNACapsicum sp. 47cattaccaac
aaatatatcc tggcc
254825DNAMedicago sativa 48gtatacctct gtatacatcc tgccg
254925DNAHordeum sp. 49atataccaaa tgatacatcc tgccc
255025DNAOryza sativa
50acttactcaa ggatatatcc tggct
2551300DNAFigwort mosaic virus 51gcctcaacaa ggtcagggta cagagtctcc
aaaccattag ccaaaagcta caggagatca 60atgaagaatc ttcaatcaaa gtaaactact
gttccagcac atgcatcatg gtcagtaagt 120ttcagaaaaa gacatccacc gaagacttaa
agttagtggg catctttgaa agtaatcttg 180tcaacatcga gcagctggct tgtggggacc
agacaaaaaa ggaatggtgc agaattgtta 240ggcgcaccta ccaaaagcat ctttgccttt
attgcaaaga taaagcagat tcctctagta 30052304DNAArtificial
SequenceDescription of Artificial Sequence Synthetic construct
52aatgccccca tcaaggactg catcttttag gtggtaccag ctttccatga gcactttatc
60ctgattcatg agattaagag cagaaatgga tacaccatct tcattcttaa ccaaatactt
120agcaacagta gccaaaccat aaagtctctg aacctttcca tcttgttgag tacgaactga
180acaagtgagg atattgtaac aagccaagag acgcaacatt cggtccaaca taactggtgc
240atcagggtta gttgttggta gctgagaagc aatttcaata ggtgaaattt gagcaccagg
300tcca
30453492DNAArtificial SequenceDescription of Artificial Sequence
Synthetic construct 53aatgccccca tcaaggactg catcttttag gtggtaccag
ctttccatga gcactttatc 60ctgattcatg agattaagag cagaaatgga tacaccatct
tcattcttaa ccaaatactt 120agcaacagta gccaaaccat aaagtctctg aacctttcca
tcttgttgag tacgaactga 180acaagtgagg atattgtaac aagccaagag acgcaacatt
cggtccaaca taactggtgc 240atcagggtta gttgttggta gctgagaagc aatttcaata
ggtgaaattt gagcaccagg 300tccagaattc aatctcacaa aaacctcatc aatcacaacc
atgggttcaa caggtgaaac 360tcaaataaca ccaacccaca tatcagatga agaagcaaac
ctcttcgcca tgcaactagc 420aagtgcttca gttcttccca tgattttgaa atcagctctt
gaacttgatc tcttagaaat 480cattgctaaa gc
492541027DNAMedicago sativa 54gggcccatag
tggaccagtt aggtaggtgg agaaagaaat tattaaaaaa atatatttat 60atgttgtcaa
ataactcaaa aatcataaaa gtttaagtta gcaagtgtgc acatttttat 120ttggacaaaa
gtattcacct actactgtta taaatcatta ttaaacatta gagtaaagaa 180atatggatga
taagaataag agtagtgata ttttgacaac aattttgtta caacatttga 240gaaaattttg
ttgttctctc ttttcattgg tcaaaaacaa tagagagaga gagagaaaaa 300ggaagaggga
gaataaaaac ataatgtgag tatgagagag aaagttgtac aaaagttgta 360ccaaaatggt
tgtacaaata tcattgagga atttgacaaa agctacacaa ataagggtta 420attgctgtaa
ataaataagg atgacgcatt agagagatgt accattagag aatttttggc 480aagtcattaa
aaagaaagaa taaattattt ttaaaattaa aagttgagtc atttgattaa 540acatgtgatt
atttaatgaa ttgatgagag agttggatta aagttgtatt aatgattaga 600atttggtgtc
aaatttaatt tgacatttga tcttttccta tatattgccc catagagtca 660tttaactcat
ttttatattt catagatcaa ataagagaaa taacggtata ttaatccctc 720caacaaaaaa
aaaaaaaaaa cggtatattt actaaaaaat ctaagccacg taggaggata 780acatccaatc
caaccaatca caacaatcct gatgagataa cccactttaa gcccacgcac 840tctgtggcac
atctacatta tctaaatcac acattcttcc acacatctga gccacacaaa 900aaccaatcca
catctttatc atccattcta taaaaaatca cactttgtga gtctacactt 960tgattccctt
caaacacata caaagagaag agactaatta attaattaat catcttgaga 1020gaaagcc
102755853DNAMedicago sativa 55agagaggagg cagtgtacac aggggcagag agaggtgagt
cgtctttctg gtagggctgg 60tgttggggat agtggttggt ttgagagtca ggtggtgagg
agggttggcg atggggttga 120tacgttgttt tggttggata ggtggttagg agatgctcct
ttttgtgttt gtttcaggag 180gttgtttgag ttaacagaga acaaatttgt gtctgtggct
aatttgttat ctgttgactc 240ggagcagtgg ggggaggtgt tgaggtgaag cgtatggtgg
cagaggtggt ggcagaggtg 300aagcgtatgg tggcagctga gggaggcagt gtacacagag
gtggagagag aggagagaga 360agagagaaga gagagaaaat ggagaagaga gaagagaaga
gagagaagac aaatttttgt 420gtgtgtgacc aaaccaaaat tcttggtcct ggtccacaca
agattttctc ccaaccaagg 480tacaagaata ccacgatcca agagtgccac gttgcaacat
cataaccgtt caatagtaag 540agataatcga acggccataa ttaattttca acaaacccac
ttttttcctc ctacttttgc 600aacttgtccc tcatcaccta ccaaacacac atagcacacc
aacacacata ataatattat 660aataattgta aatatatgta gcctccaaat tagaaagaaa
cctctatata aagcctaact 720acttccttca caaatcagga aattcacaac tctaatattc
atttctttcc taatcattag 780aatttccatt cttataaaat tctaggtacc accacacaac
aaataaagga acattaatca 840atactattaa gat
85356153DNAMedicago sativa 56cggcaggatg tatacagagg
tatacaattt tatattacat ttatatttgt gttaattcat 60tgaattttca cttttatttt
ttactttgat aatcaactgt gtaaagaatt atttgaaaaa 120tatatataat ttatagaatt
tttttttgtt atg 1535791DNAMedicago sativa
57ctagattatg cgggctaacg ggctgcccgc ggccctttcg ggctagccct aacgggtacc
60gggccccggc aggatgtata cagaggtata c
91582932DNAArtificial SequenceDescription of Artificial Sequence
Synthetic construct 58cggcaggatg tatacagagg tatacaattt tatattacat
ttatatttgt gttaattcat 60tgaattttca cttttatttt ttactttgat aatcaactgt
gtaaagaatt atttgaaaaa 120tatatataat ttatagaatt tttttttgtt atggggccca
tagtggacca gttaggtagg 180tggagaaaga aattattaaa aaaatatatt tatatgttgt
caaataactc aaaaatcata 240aaagtttaag ttagcaagtg tgcacatttt tatttggaca
aaagtattca cctactactg 300ttataaatca ttattaaaca ttagagtaaa gaaatatgga
tgataagaat aagagtagtg 360atattttgac aacaattttg ttacaacatt tgagaaaatt
ttgttgttct ctcttttcat 420tggtcaaaaa caatagagag agagagagaa aaaggaagag
ggagaataaa aacataatgt 480gagtatgaga gagaaagttg tacaaaagtt gtaccaaaat
ggttgtacaa atatcattga 540ggaatttgac aaaagctaca caaataaggg ttaattgctg
taaataaata aggatgacgc 600attagagaga tgtaccatta gagaattttt ggcaagtcat
taaaaagaaa gaataaatta 660tttttaaaat taaaagttga gtcatttgat taaacatgtg
attatttaat gaattgatga 720gagagttgga ttaaagttgt attaatgatt agaatttggt
gtcaaattta atttgacatt 780tgatcttttc ctatatattg ccccatagag tcatttaact
catttttata tttcatagat 840caaataagag aaataacggt atattaatcc ctccaacaaa
aaaaaaaaaa aaacggtata 900tttactaaaa aatctaagcc acgtaggagg ataacatcca
atccaaccaa tcacaacaat 960cctgatgaga taacccactt taagcccacg cactctgtgg
cacatctaca ttatctaaat 1020cacacattct tccacacatc tgagccacac aaaaaccaat
ccacatcttt atcatccatt 1080ctataaaaaa tcacactttg tgagtctaca ctttgattcc
cttcaaacac atacaaagag 1140aagagactaa ttaattaatt aatcatcttg agagaaagcc
ctgcagaatg cccccatcaa 1200ggactgcatc ttttaggtgg taccagcttt ccatgagcac
tttatcctga ttcatgagat 1260taagagcaga aatggataca ccatcttcat tcttaaccaa
atacttagca acagtagcca 1320aaccataaag tctctgaacc tttccatctt gttgagtacg
aactgaacaa gtgaggatat 1380tgtaacaagc caagagacgc aacattcggt ccaacataac
tggtgcatca gggttagttg 1440ttggtagctg agaagcaatt tcaataggtg aaatttgagc
accaggtcca gctttagcaa 1500tgatttctaa gagatcaagt tcaagagctg atttcaaaat
catgggaaga actgaagcac 1560ttgctagttg catggcgaag aggtttgctt cttcatctga
tatgtgggtt ggtgttattt 1620gagtttcacc tgttgaaccc atggttgtga ttgatgaggt
ttttgtgaga ttgaattctg 1680gacctggtgc tcaaatttca cctattgaaa ttgcttctca
gctaccaaca actaaccctg 1740atgcaccagt tatgttggac cgaatgttgc gtctcttggc
ttgttacaat atcctcactt 1800gttcagttcg tactcaacaa gatggaaagg ttcagagact
ttatggtttg gctactgttg 1860ctaagtattt ggttaagaat gaagatggtg tatccatttc
tgctcttaat ctcatgaatc 1920aggataaagt gctcatggaa agctggtacc acctaaaaga
tgcagtcctt gatgggggca 1980ttggatccat cttaatagta ttgattaatg ttcctttatt
tgttgtgtgg tggtacctag 2040aattttataa gaatggaaat tctaatgatt aggaaagaaa
tgaatattag agttgtgaat 2100ttcctgattt gtgaaggaag tagttaggct ttatatagag
gtttctttct aatttggagg 2160ctacatatat ttacaattat tataatatta ttatgtgtgt
tggtgtgcta tgtgtgtttg 2220gtaggtgatg agggacaagt tgcaaaagta ggaggaaaaa
agtgggtttg ttgaaaatta 2280attatggccg ttcgattatc tcttactatt gaacggttat
gatgttgcaa cgtggcactc 2340ttggatcgtg gtattcttgt accttggttg ggagaaaatc
ttgtgtggac caggaccaag 2400aattttggtt tggtcacaca cacaaaaatt tgtcttctct
ctcttctctt ctctcttctc 2460cattttctct ctcttctctc ttctctctcc tctctctcca
cctctgtgta cactgcctcc 2520ctcagctgcc accatacgct tcacctctgc caccacctct
gccaccatac gcttcacctc 2580aacacctccc cccactgctc cgagtcaaca gataacaaat
tagccacaga cacaaatttg 2640ttctctgtta actcaaacaa cctcctgaaa caaacacaaa
aaggagcatc tcctaaccac 2700ctatccaacc aaaacaacgt atcaacccca tcgccaaccc
tcctcaccac ctgactctca 2760aaccaaccac tatccccaac accagcccta ccagaaagac
gactcacctc tctctgcccc 2820tgtgtacact gcctcctctc tctagattat gcgggctaac
gggctgcccg cggccctttc 2880gggctagccc taacgggtac cgggccccgg caggatgtat
acagaggtat ac 293259676DNAZea mays 59ttccacggca gctgccaccg
tcgctatcgc tgaccaaccc ggctggtcgc ctctgtgctc 60catccatgca tgttacaact
atgcagatgc agccgaaaca aacactggct agaaaggcag 120cccaacgggc ctactgtcat
tcgctccggc atcctactgg tgggcccact tgcaccggcc 180gatgaccagt tcatcatttt
tctcgacgaa tttgtgcaca gaatttgcta aaaattcttc 240gcacgtggca aaaccagggg
gaaaatcgac aactagtcgg ggttttttta attccctgat 300agaatagtcc ctgctaatca
tccatgaaaa ccaaacacgt actctacgtc accgtcatgg 360atggagcgag tgaactgatg
attttttccc catcccgcac gcaacagcat gggtgacaac 420aaccactccc gctgcggttg
ggcgagcaca tctctacgca cttgacactc acgcaaacct 480aacgcatact agagtaatca
tcgccaccaa ctatcggcga cagaaacgat gggccccgct 540tctcttaatc acggtgcttg
aattagtgcg cgcatagtag tgaaaaataa tagtgaaaaa 600taagcagtgc gtgttttggt
gtggtggttg gtgagccgtc cgccccaata aaaacccctc 660gcaccacctc gtccct
6766089DNAFigwort mosaic
virus 60caagtgggga acaaaataac gtggaaaaga gctgtcctga cagcccactc actaatgcgt
60atgacgaacg cagtgacgac cacaaaaga
89
User Contributions:
Comment about this patent or add new information about this topic: