Patent application title: COMPOSITIONS AND METHODS FOR INSECTICIDAL CONTROL OF STINKBUGS
Inventors:
IPC8 Class: AC12N1582FI
USPC Class:
1 1
Class name:
Publication date: 2019-03-07
Patent application number: 20190071690
Abstract:
Methods and compositions are provided which employ a silencing element
that, when ingested by a pest, such as a Pentatomidae plant pest,
decrease the expression of a target sequence in the pest. The present
invention provides various target polynucleotides set forth in any one of
SEQ ID NOS: 6-12, 18-40 or active variants and fragments thereof, wherein
a decrease in expression of one or more the sequences in the target pest
controls the pest (i.e., has insecticidal activity). Plants, plant part,
bacteria and other host cells comprising the silencing elements or an
active variant or fragment thereof of the invention are also provided.Claims:
1. An expression cassette, comprising a nucleotide sequence selected from
the group consisting of: (a) a nucleotide sequence comprising any one of
SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement
thereof; (b) a nucleotide sequence comprising at least 90% sequence
identity to any one of SEQ ID NOS: 6-12, 18-40, a fragment or variant
thereof, or a complement thereof, wherein said polynucleotide encodes a
silencing element having insecticidal activity against a Pentatomidae
plant pest; (c) a nucleotide sequence comprising at least 19 consecutive
nucleotides of any one of SEQ ID NOS: 6-12, 18-40, a fragment or variant
thereof, or a complement thereof, wherein said polynucleotide encodes a
silencing element having insecticidal activity against a Pentatomidae
plant pest; and, (d) a nucleotide sequence that hybridizes under
stringent conditions to the full length complement of the nucleotide
sequence of a), wherein said stringent conditions comprise hybridization
in 50% formamide, 1 M NaCl, 1% SDS at 37.degree. C., and a wash in
0.1.times.SSC at 60.degree. C. to 65.degree. C., wherein said
polynucleotide encodes a silencing element having insecticidal activity
against a Pentatomidae plant pest.
2. The expression cassette of claim 1, wherein said Pentatomidae plant pest is a N. viridula plant pest.
3. The expression cassette of claim 1, wherein said polynucleotide is operably linked to a heterologous promoter.
4. The expression cassette of claim 1, wherein said polynucleotide is expressed as a double stranded RNA.
5. The expression cassette of claim 1, wherein said polynucleotide comprise a silencing element which is expressed as a hairpin RNA.
6. The expression cassette of claim 5, wherein the silencing element comprises, a first segment, a second segment, and a third segment, wherein a) said first segment comprises at least about 19 nucleotides having at least 90% sequence complementarity to a target sequence set forth in SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof; b) said second segment comprises a loop of sufficient length to allow the silencing element to be transcribed as a hairpin RNA; and, c) said third segment comprises at least about 19 nucleotides having at least 85% complementarity to the first segment.
7. The expression cassette of claim 1, wherein said polynucleotide is flanked by a first operably linked convergent promoter at one terminus of the polynucleotide and a second operably linked convergent promoter at the opposing terminus of the polynucleotide, wherein the first and the second convergent promoters are capable of driving expression of the polynucleotide.
8. A host cell comprising a heterologous expression cassette of claim 1.
9. A plant cell having stably incorporated into its genome a heterologous polynucleotide comprising a silencing element, wherein said silencing element comprises a) a fragment of at least 19 consecutive nucleotides of SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof; or, b) the nucleotide sequence comprising at least 90% sequence identity to any one of SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof, wherein said silencing element, when ingested by a Pentatomidae plant pest, reduces the level of a target sequence in said Pentatomidae plant pest and thereby controls the Pentatomidae plant pest.
10. The plant cell of claim 9, wherein the Pentatomidae plant pest is a N. viridula plant pest.
11. The plant cell of claim 9, wherein said silencing element comprises a) a polynucleotide comprising the sequence set forth in SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof; b) a polynucleotide comprising at least 75 consecutive nucleotides of the sequence set forth in SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof.
12. The plant cell of claim 9, wherein said plant cell comprises the expression cassette of claim 7.
13. The plant cell of claim 9, wherein said silencing element expresses a double stranded RNA.
14. The plant cell of claim 9, wherein said silencing element expresses a hairpin RNA.
15. The plant cell of claim 14, wherein said polynucleotide comprising the silencing element comprises, a first segment, a second segment, and a third segment, wherein a) said first segment comprises at least about 19 nucleotides having at least 90% sequence complementarity to a target sequence set forth in SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof; b) said second segment comprises a loop of sufficient length to allow the silencing element to be transcribed as a hairpin RNA; and, c) said third segment comprises at least about 19 nucleotides having at least 85% complementarity to the first segment.
16. The plant cell of claim 9, wherein said silencing element is operably linked to a heterologous promoter.
17. The plant cell of claim 9, wherein said plant cell is from a monocot.
18. The plant cell of claim 17, wherein said monocot is maize, barley, millet, wheat or rice.
19-24. (canceled)
25. The plant cell of claim 9, wherein said plant cell is from a dicot.
26. The plant cell of claim 25, wherein said dicot is soybean, canola, alfalfa, sunflower, safflower, tobacco, Arabidopsis, or cotton.
27-32. (canceled)
33. A plant or plant part comprising a plant cell of claim 9.
34. A transgenic seed from the plant of claim 33.
35. A method for controlling a Pentatomidae plant pest comprising feeding to a Pentatomidae plant pest a composition comprising a silencing element, wherein said silencing element, when ingested by said Pentatomidae plant pest, reduces the level of a target Pentatomidae plant pest sequence and thereby controls the Pentatomidae plant pest, wherein said target Pentatomidae plant pest sequence comprise a nucleotide sequence comprising at least 90% sequence identity to any one of SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof.
36. The method of claim 35, wherein said Pentatomidae plant pest comprises a N. viridula plant pest.
37. The method of claim 35, wherein said silencing element comprises a) a fragment of at least 19 consecutive nucleotides of SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof or, b) a nucleotide sequence comprising at least 90% sequence identity to any one of SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof.
38. The method of claim 35, wherein said composition comprises a plant or plant part having stably incorporated into its genome a polynucleotide comprising said silencing element.
39. The method of claim 38, wherein said silencing element comprises a) a polynucleotide comprising the sense or antisense sequence of the sequence set forth in SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof; b) a polynucleotide comprising the sense or antisense sequence of a sequence having at least 95% sequence identity to the sequence set forth in SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof; c) a polynucleotide comprising the sense or antisense sequence of a sequence having at least 75 contiguous nucleotides of SEQ ID NOS: 6-12, 18-40, a fragment or variant thereof, or a complement thereof.
40. The method of claim 39, wherein said silencing element expresses a double stranded RNA.
41. The method of claim 39, wherein said silencing element comprises a hairpin RNA.
42. The method of claim 41, wherein said polynucleotide comprising the silencing element comprises, a first segment, a second segment, and a third segment, wherein a) said first segment comprises at least about 20 nucleotides having at least 90% sequence complementarity to the target polynucleotide; b) said second segment comprises a loop of sufficient length to allow the silencing element to be transcribed as a hairpin RNA; and, c) said third segment comprises at least about 20 nucleotides having at least 85% complementarity to the first segment.
43. The method of claim 35, wherein said silencing element is operably linked to a heterologous promoter.
44. The method of claim 35, wherein said silencing element is flanked by a first operably linked convergent promoter at one terminus of the silencing element and a second operably linked convergent promoter at the opposing terminus of the polynucleotide, wherein the first and the second convergent promoters are capable of driving expression of the silencing element.
45. The method of claim 35, wherein said plant is a monocot.
46. The method of claim 45, wherein said monocot is maize, barley, millet, wheat or rice.
47. The method of claim 38, wherein said plant is a monocot.
48-54. (canceled)
55. The method of any one of claims 35-37, wherein said plant is a dicot.
56. The method of claim 45, wherein said dicot is soybean, canola, alfalfa, sunflower, safflower, tobacco, Arabidopsis, or cotton.
57-66. (canceled)
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This Application is a divisional application of U.S. Nonprovisional Application Ser. No. 14/775,282, filed on Sep. 11, 2015, which claims the benefit of International Application Number PCT/US2014/025274 filed Mar. 13, 2014, which claims the benefit of U.S. Provisional Application No. 61/779,643, filed on Mar. 13, 2013, each of which is incorporated herein by reference in its entirety.
REFERENCE TO SEQUENCE LISTING
[0002] The Sequence Listing submitted Mar. 13, 2014 as a text file named "36446_0006U1_2013_03_13_Sequences_as_Filed," created on Mar. 7, 2014, and having a size of 133,534 bytes is hereby incorporated by reference pursuant to 37 C.F.R. .sctn. 1.52(e)(5).
FIELD OF THE INVENTION
[0003] The present invention relates generally to methods of molecular biology and gene silencing to control pests.
BACKGROUND OF THE INVENTION
[0004] Insect pests are a serious problem in agriculture. They destroy millions of acres of staple crops such as corn, soybeans, peas, and cotton. Yearly, these pests cause over $100 billion dollars in crop damage in the U.S. alone. In an ongoing seasonal battle, farmers must apply billions of gallons of synthetic pesticides to combat these pests. Other methods employed in the past delivered insecticidal activity by microorganisms or genes derived from microorganisms expressed in transgenic plants. For example, certain species of microorganisms of the genus Bacillus are known to possess pesticidal activity against a broad range of insect pests including Lepidoptera, Diptera, Coleoptera, Hemiptera, and others. In fact, microbial pesticides, particularly those obtained from Bacillus strains, have played an important role in agriculture as alternatives to chemical pest control. Agricultural scientists have developed crop plants with enhanced insect resistance by genetically engineering crop plants to produce insecticidal proteins from Bacillus. For example, corn and cotton plants genetically engineered to produce Cry toxins (see, e.g., Aronson (2002) Cell Mol. Life Sci. 59(3):417-425; Schnepf et al. (1998) Microbiol. Mol. Biol. Rev. 62(3):775-806) are now widely used in American agriculture and have provided the farmer with an alternative to traditional insect-control methods. However, these Bt insecticidal proteins only protect plants from a relatively narrow range of pests. Moreover, these modes of insecticidal activity provided varying levels of specificity and, in some cases, caused significant environmental consequences.
[0005] Previous control of stinkbugs relied on broad spectrum insecticides. With the adoption of transgenic controls for major lepidopteran pests in several crops, these insecticides are no longer used and stinkbugs have become a major secondary pest. No successful use of transgenic control of stinkbugs has been described or adopted. This may be due in part to the extra oral digestion employed by stinkbugs where digestive enzymes are injected into the host plant prior to feeding. This makes it difficult to find proteins that survive long enough to manifest activity against these insects. RNAi may overcome that feeding behavior by relying on double stranded RNAs rather than proteins. Thus, there is an immediate need for alternative methods to control pests.
BRIEF SUMMARY OF THE INVENTION
[0006] Methods and compositions are provided which employ a silencing element that, when ingested by a pest, such as a Pentatomidae plant pest including for example, a N. viridula (southern green stinkbug), Acrosternum hilare (green stinkbug), Piezodorus guildini (redbanded stinkbug), Euschistus servus (brown stinkbug), and/or Halymorpha halys (brown marmorated stinkbug) plant pest, is capable of decreasing the expression of a target sequence in the pest. In specific embodiments, the decrease in expression of the target sequence controls the pest and thereby the methods and compositions are capable of limiting damage to a plant. The present invention provides various target polynucleotides as set forth in SEQ ID NOS: 6-12, 18-40, or active variants or fragments thereof, or complements thereof, wherein a decrease in expression of one or more the sequences in the target pest controls the pest (i.e., has insecticidal activity). Further provided are silencing elements, which when ingested by the pest, decrease the level of expression of one or more of the target polynucleotides. Plants, plant parts, plant cells, bacteria and other host cells comprising the silencing elements or an active variant or fragment thereof are also provided.
[0007] In another embodiment, a method for controlling a pest, such as a Pentatomidae plant pest, such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, Euschistus servus (brown stinkbug), and/or Halymorpha halys plant pest, is provided. The method comprises feeding to a pest a composition comprising a silencing element, wherein the silencing element, when ingested by the pest, reduces the level of a target sequence in the pest and thereby controls the pest. Further provided are methods to protect a plant from a pest. Such methods comprise introducing into the plant or plant part, or alternatively onto the plant as part of a topical formulation, a silencing element of the invention. When the pest ingests a plant comprising the silencing element, the level of the target sequence is decreased in the pest and the pest is controlled.
[0008] In specific embodiments, the silencing element comprises at least 15, 20, or 22 consecutive nucleotides of any one or more of SEQ ID NOS: 6-12, 18-40. In specific embodiments, the pest that is controlled is a Pentatomidae plant pest, such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, Euschistus servus (brown stinkbug), and/or Halymorpha halys plant pest. Plants, plant parts, plant cells, bacteria and other host cells comprising the silencing element comprising at least 15, 20, or 22 consecutive nucleotides of any one or more of SEQ ID NOS: 6-12, 18-40 or an active variant or fragment thereof, or complements thereof, are also provided.
[0009] In another embodiment, a method for controlling a pest, such as a pest from Pentatomidae plant pest, such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, Euschistus servus (brown stinkbug), and/or Halymorpha halys (Hemiptera order) is provided. The method comprises feeding to a pest a composition comprising a silencing element comprising at least 15, 20, or 22 consecutive nucleotides of any one or more of SEQ ID NOS: 6-12, 18-40, wherein the silencing element, when ingested by the pest, reduces the level of a target sequence in the pest and thereby controls the pest. Further provided are methods to protect a plant from a pest. Such methods comprise introducing into the plant or plant part, or alternatively onto the plant as part of a topical formulation, a silencing element of the invention. When the pest ingests a plant expressing the silencing element, the level of the target sequence is decreased in the pest and the pest is controlled.
BRIEF DESCRIPTION OF THE FIGURES
[0010] FIG. 1 is a map of plasmid PHP 36164.
[0011] FIG. 2 is a map of plasmid PHP 59032.
[0012] FIG. 3 is a map of plasmid PHP 62151.
DETAILED DESCRIPTION OF THE INVENTION
[0013] The present inventions now will be described more fully hereinafter with reference to the accompanying drawings, in which some, but not all embodiments of the inventions are shown. Indeed, these inventions may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will satisfy applicable legal requirements. Like numbers refer to like elements throughout. It is to be understood that this invention is not limited to the particular methodology, protocols, cell lines, genera, and reagents described, as such may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention.
[0014] Many modifications and other embodiments of the inventions set forth herein will come to mind to one skilled in the art to which these inventions pertain having the benefit of the teachings presented in the foregoing descriptions and the associated drawings. Therefore, it is to be understood that the inventions are not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.
[0015] As used herein the singular forms "a", "and", and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a cell" includes a plurality of such cells and reference to "the protein" includes reference to one or more proteins and equivalents thereof known to those skilled in the art, and so forth. All technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this invention belongs unless clearly indicated otherwise.
I. Overview
[0016] The present invention comprises methods and compositions employing one or more silencing elements that, when ingested by a pest, such as a Pentatomidae plant pest such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest, is capable of decreasing the expression of a target sequence in the pest. In specific embodiments, the decrease in expression of the target sequence controls the pest and thereby the methods and compositions are capable of limiting damage to a plant or plant part. The present invention provides target polynucleotides as set forth in SEQ ID NOS: 6-12, 18-40, or active variants and fragments thereof, or complements thereof. Silencing elements comprising sequences, complementary sequences, active fragments or variants of these target polynucleotides are provided which, when ingested by a pest, decrease the expression of one or more of the target sequences and thereby controls the pest (i.e., has insecticidal activity).
[0017] As used herein, by "controlling a pest" or "controls a pest" is intended any effect on a pest that results in limiting the damage that the pest causes. Controlling a pest includes, but is not limited to, killing the pest, inhibiting development of the pest, altering fertility or growth of the pest in such a manner that the pest provides less damage to the plant, decreasing the number of offspring produced, producing less fit pests, producing pests more susceptible to predator attack, or deterring the pests from eating the plant.
[0018] Reducing the level of expression of the target polynucleotide or the polypeptide encoded thereby, in the pest results in the suppression, control, and/or killing of the invading pathogenic organism. Reducing the level of expression of the target sequence of the pest will reduce the disease symptoms resulting from pathogen challenge by at least about 2% to at least about 6%, at least about 5% to about 50%, at least about 10% to about 60%, at least about 30% to about 70%, at least about 40% to about 80%, or at least about 50% to about 90% or greater. Hence, the methods of the invention can be utilized to control pests, particularly, Pentatomidae plant pests such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest.
[0019] Assays measuring the control of a pest are commonly known in the art, as are methods to quantitate disease resistance in plants following pathogen infection. See, for example, U.S. Pat. No. 5,614,395, herein incorporated by reference. Such techniques include, measuring over time, the average lesion diameter, the pathogen biomass, and the overall percentage of decayed plant tissues. See, for example, Thomma et al. (1998) Plant Biology 95:15107-15111, herein incorporated by reference. See, also Baum et al. (2007) Nature Biotech 11:1322-1326 and WO 2007/035650 which proved both whole plant feeding assays and corn root feeding assays. Both of these references are herein incorporated by reference in their entirety. See, also the examples below.
[0020] The invention comprises compositions and methods for protecting plants from a plant pest, such as Pentatomidae plant pests such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pests or inducing resistance in a plant to a plant pest, such as Pentatomidae plant pests such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pests. As used herein "Pentatomidae plant pest" is used to refer to any member of the Pentatomidae family. Accordingly, the compositions and methods are also useful in protecting plants against any Pentatomidae plant pest including representative genera and species such as, but not limited to, Acrocorisellus (A. serraticollis), Acrosternum (A. adelpha, A. hilare, A. herbidum, A. scutellatum), Agonoscelis (A. nubila), Alcaeorrhynchus (A. grandis, A. phymatophorus), Amaurochrous (A. brevitylus), Apateticus (A. anatarius, A. bracteatus, A. cynicus, A. lineolatus, A. marginiventris), Apoecilus, Arma (A. custos), Arvelius, Bagrada, Bagrada hilaris, Banasa (B. calva, B. dimiata, B. grisea, B. induta, B. sordida), Brochymena (B. affinis, B. cariosa, B. haedula, B. hoppingi, B. sulcata), Carbula (C. obtusangula, C. sinica), Chinavia, Chlorochroa (C. belfragii, C. kanei, C. norlandi, C. senilis, C. viridicata), Chlorocoris (C. distinctus, C. flaviviridis, C. hebetatus, C. subrugosus, C. tau), Codophila (C. remota, C. sulcata, C. varius), Coenus (C. delius, C. inermis, C. tarsalis), Cosmopepla (C. bimaculata, C. binotata, C. carnifex, C. decorata, C. intergressus), Dalpada (D. oculata), Dendrocoris (D. arizonesis, D. fruticicola, D. humeralis, D. parapini, D. reticulatus), Dolycoris (D. baccarum (sloe bug)), Dybowskyia (D. reticulata), Edessa, Erthesina (E. fullo), Eurydema (E. dominulus, E. gebleri (shield bug), E. pulchra, E. rugosa), Euschistus (E. biformis, E. integer, E. quadrator, E. servus, E. tristigma), Euthyrhynchus (E. floridanus, E. macronemis), Gonopsis (G. coccinea), Graphosoma (G. lineatum (stinkbug), G. rubrolineatum), Halyomorpha (H. halys (brown marmorated stinkbug)), Halys (H. sindillus, H. sulcatus), Holcostethus (H. abbreviatus, H. fulvipes, H. limbolarius, H. piceus, H. sphacelatus), Homalogonia (H. obtusa), Hymenarcys (H. aequalis, H. crassa, H. nervosa, H. perpuncata, H. reticulata), Lelia (L. decempunctata), Lineostethus, Loxa (L. flavicollis, L. viridis), Mecidea (M. indicia, M. major, M. minor), Megarrhamphus (M. hastatus), Menecles (M. insertus, M. portacrus), Mormidea (M. cubrosa, M. lugens, M. pama, M. pictiventris, M. ypsilon), Moromorpha (M. tetra), Murgantia (M. angularis, M. tessellata, M. varicolor, M. violascens), Neottiglossa (N. californica, N. cavifrons, N. coronaciliata, N. sulcifrons, N. undata), Nezara (N. smaragdulus, N. viridula (southern green stinkbug)), Oebalus
[0021] (O. grisescens, O. insularis, O. mexicanus, O. pugnax, O. typhoeus), Oechalia (O. schellenbergii (spined predatory shield bug)), Okeanos (O. quelpartensis), Oplomus (O. catena, O. dichrous, O. tripustulatus), Palomena (P. prasina (green shield bug)), Parabrochymena, Pentatoma (P. angulata, P. illuminata, P. japonica, P. kunmingensis, P. metallifera, P. parataibaiensis, P. rufipes, P. semiannulata, P. viridicornuta), Perillus (P. bioculatus, P. confluens, P. strigipes), Picromerus (P. griseus), Piezodorus (P. degeeri, P. guildinii, P. lituratus (gorse shield bug)), Pinthaeus (P. humeralis), Plautia (P. crossota, P. stali (brown-winged green bug)), Podisus (P. maculiventris), Priassus (P. testaceus), Prionosoma, Proxys (P. albopunctulatus, P. punctulatus, P. victor), Rhaphigaster (R. nebulosa), Scotinophara (S. horvathi), Stiretrus (S. anchorago, S. fimbriatus), Thyanta (T. accerra, T. calceata, T. casta, T. perditor, T. pseudocasta), Trichopepla (T. aurora, T. dubia, T. pilipes, T. semivittata, T. vandykei), Tylospilus, and Zicrona. Other order and species for which the present invention is intended include Hemiptera, Kudzu bug, Megacopta cribraria (fa. Plataspidae) and Sunn pest, Eurygaster integriceps (fa. Scutelleridae).
II. Target Sequences
[0022] As used herein, a "target sequence" or "target polynucleotide" comprises any sequence in the pest that one desires to reduce the level of expression. In specific embodiments, decreasing the level of the target sequence in the pest controls the pest. For instance, the target sequence may be essential for growth and development. While the target sequence can be expressed in any tissue of the pest, in specific embodiments, the sequences targeted for suppression in the pest are expressed in cells of the gut tissue of the pest, cells in the midgut of the pest, and cells lining the gut lumen or the midgut. Such target sequences can be involved in, for example, gut cell metabolism, growth or differentiation. Non-limiting examples of target sequences of the invention include a polynucleotide set forth in SEQ ID NOS: 6-12, 18-40, active fragments or variants thereof, or complements thereof. As exemplified elsewhere herein, decreasing the level of expression of one or more of these target sequences in a Pentatomidae plant pest such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest controls the pest.
III. Silencing Elements
[0023] By "silencing element" is intended a polynucleotide which when ingested by a pest, is capable of reducing or eliminating the level or expression of a target polynucleotide or the polypeptide encoded thereby. The silencing element employed can reduce or eliminate the expression level of the target sequence by influencing the level of the target RNA transcript or, alternatively, by influencing translation and thereby affecting the level of the encoded polypeptide. Methods to assay for functional silencing elements that are capable of reducing or eliminating the level of a sequence of interest are disclosed elsewhere herein. A single polynucleotide employed in the methods of the invention can comprise one or more silencing elements to the same or different target polynucleotides. The silencing element can be produced in vivo (i.e., in a host cell such as a plant or microorganism) or in vitro.
[0024] In other embodiments, while the silencing element controls pests, preferably the silencing element has no effect on the normal plant or plant part.
[0025] As discussed in further detail below, silencing elements can include, but are not limited to, a sense suppression element, an antisense suppression element, a double stranded RNA, a siRNA, an amiRNA, a miRNA, or a hairpin suppression element. Silencing elements of the present invention may comprise a chimera where two or more sequences of the present invention or active fragments or variants, or complements thereof, are found in the same RNA molecule. Further, a sequence of the present invention or active fragment or variant, or complement thereof, may be present as more than one copy in a DNA construct, silencing element, DNA molecule or RNA molecule. Non-limiting examples of silencing elements that can be employed to decrease expression of these target Pentatomidae plant pest sequences such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest sequences comprise, or alternatively consist of, fragments and variants of the sense or antisense sequences set forth in SEQ ID NOS: 6-12, 18-40 or one or more variants or fragments thereof. The silencing element can further comprise additional sequences that advantageously effect transcription and/or the stability of a resulting transcript.
[0026] By "reduces" or "reducing" the expression level of a polynucleotide or a polypeptide encoded thereby is intended to mean, the polynucleotide or polypeptide level of the target sequence is statistically lower than the polynucleotide level or polypeptide level of the same target sequence in an appropriate control pest which is not exposed to (i.e., has not ingested) the silencing element. In particular embodiments of the invention, reducing the polynucleotide level and/or the polypeptide level of the target sequence in a pest according to the invention results in less than 95%, less than 90%, less than 80%, less than 70%, less than 60%, less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, or less than 5% of the polynucleotide level, or the level of the polypeptide encoded thereby, of the same target sequence in an appropriate control pest. Methods to assay for the level of the RNA transcript, the level of the encoded polypeptide, or the activity of the polynucleotide or polypeptide are discussed elsewhere herein.
[0027] a. Sense Suppression Elements
[0028] As used herein, a "sense suppression element" comprises a polynucleotide designed to express an RNA molecule corresponding to at least a part of a target messenger RNA in the "sense" orientation. Expression of the RNA molecule comprising the sense suppression element reduces or eliminates the level of the target polynucleotide or the polypeptide encoded thereby. The polynucleotide comprising the sense suppression element may correspond to all or part of the sequence of the target polynucleotide, all or part of the 5' and/or 3' untranslated region of the target polynucleotide, all or part of the coding sequence of the target polynucleotide, or all or part of both the coding sequence and the untranslated regions of the target polynucleotide.
[0029] Typically, a sense suppression element has substantial sequence identity to the target polynucleotide, typically greater than about 65% sequence identity, greater than about 85% sequence identity, about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity. See, U.S. Pat. Nos. 5,283,184 and 5,034,323; herein incorporated by reference. The sense suppression element can be any length so long as it allows for the suppression of the targeted sequence. The sense suppression element can be, for example, 15, 16, 17, 18 19, 20, 22, 25, 30, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600, 700, 900, 1000, 1100, 1200, 1300 nucleotides or longer of the target polynucleotides set forth in any of SEQ ID NOS: 6-12, 18-40. In other embodiments, the sense suppression element can be, for example, about 15-25, 25-100, 100-150, 150-200, 200-250, 250-300, 300-350, 350-400, 450-500, 500-550, 550-600, 600-650, 650-700, 700-750, 750-800, 800-850, 850-900, 900-950, 950-1000, 1000-1050, 1050-1100, 1100-1200, 1200-1300, 1300-1400, 1400-1500, 1500-1600, 1600-1700, 1700-1800 nucleotides or longer of the target polynucleotides set forth in any of SEQ ID NOS: 6-12, 18-40.
[0030] b. Antisense Suppression Elements
[0031] As used herein, an "antisense suppression element" comprises a polynucleotide which is designed to express an RNA molecule complementary to all or part of a target messenger RNA. Expression of the antisense RNA suppression element reduces or eliminates the level of the target polynucleotide. The polynucleotide for use in antisense suppression may correspond to all or part of the complement of the sequence encoding the target polynucleotide, all or part of the complement of the 5' and/or 3' untranslated region of the target polynucleotide, all or part of the complement of the coding sequence of the target polynucleotide, or all or part of the complement of both the coding sequence and the untranslated regions of the target polynucleotide. In addition, the antisense suppression element may be fully complementary (i.e., 100% identical to the complement of the target sequence) or partially complementary (i.e., less than 100% identical to the complement of the target sequence) to the target polynucleotide. In specific embodiments, the antisense suppression element comprises at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence complementarity to the target polynucleotide. Antisense suppression may be used to inhibit the expression of multiple proteins in the same plant. See, for example, U.S. Pat. No. 5,942,657. Furthermore, the antisense suppression element can be complementary to a portion of the target polynucleotide. Generally, sequences of at least 15, 20, 22, 25, 50, 100, 200, 300, 400, 450 nucleotides or greater of the sequence set forth in any of SEQ ID NOS: 6-12, 18-40 may be used. Methods for using antisense suppression to inhibit the expression of endogenous genes in plants are described, for example, in Liu et al (2002) Plant Physiol. 129:1732-1743 and U.S. Pat. Nos. 5,759,829 and 5,942,657, each of which is herein incorporated by reference.
[0032] c. Double Stranded RNA Suppression Element
[0033] A "double stranded RNA silencing element" or "dsRNA" comprises at least one transcript that is capable of forming a dsRNA either before or after ingestion by a pest. Thus, a "dsRNA silencing element" includes a dsRNA, a transcript or polyribonucleotide capable of forming a dsRNA or more than one transcript or polyribonucleotide capable of forming a dsRNA. "Double stranded RNA" or "dsRNA" refers to a polyribonucleotide structure formed either by a single self-complementary RNA molecule or a polyribonucleotide structure formed by the expression of least two distinct RNA strands. The dsRNA molecule(s) employed in the methods and compositions of the invention mediate the reduction of expression of a target sequence, for example, by mediating RNA interference ("RNAi") or gene silencing in a sequence-specific manner. In the context of the present invention, the dsRNA is capable of reducing or eliminating the level of expression of a target polynucleotide or the polypeptide encoded thereby in a pest.
[0034] The dsRNA can reduce or eliminate the expression level of the target sequence by influencing the level of the target RNA transcript, by influencing translation and thereby affecting the level of the encoded polypeptide, or by influencing expression at the pre-transcriptional level (i.e., via the modulation of chromatin structure, methylation pattern, etc., to alter gene expression). See, for example, Verdel et al. (2004) Science 303:672-676; Pal-Bhadra et al. (2004) Science 303:669-672; Allshire (2002) Science 297:1818-1819; Volpe et al. (2002) Science 297:1833-1837; Jenuwein (2002) Science 297:2215-2218; and Hall et al. (2002) Science 297:2232-2237. Methods to assay for functional dsRNA that are capable of reducing or eliminating the level of a sequence of interest are disclosed elsewhere herein. Accordingly, as used herein, the term "dsRNA" is meant to encompass other terms used to describe nucleic acid molecules that are capable of mediating RNA interference or gene silencing, including, for example, short-interfering RNA (siRNA), double-stranded RNA (dsRNA), micro-RNA (miRNA), hairpin RNA, short hairpin RNA (shRNA), post-transcriptional gene silencing RNA (ptgsRNA), and others.
[0035] In specific embodiments, at least one strand of the duplex or double-stranded region of the dsRNA shares sufficient sequence identity or sequence complementarity to the target polynucleotide to allow for the dsRNA to reduce the level of expression of the target sequence. As used herein, the strand that is complementary to the target polynucleotide is the "antisense strand" and the strand homologous to the target polynucleotide is the "sense strand."
[0036] In another embodiment, the dsRNA comprises a hairpin RNA. A hairpin RNA comprises an RNA molecule that is capable of folding back onto itself to form a double stranded structure. Multiple structures can be employed as hairpin elements. In specific embodiments, the dsRNA suppression element comprises a hairpin element which comprises in the following order, a first segment, a second segment, and a third segment, where the first and the third segment share sufficient complementarity to allow the transcribed RNA to form a double-stranded stem-loop structure.
[0037] The "second segment" of the hairpin comprises a "loop" or a "loop region." These terms are used synonymously herein and are to be construed broadly to comprise any nucleotide sequence that confers enough flexibility to allow self-pairing to occur between complementary regions of a polynucleotide (i.e., segments 1 and 3 which form the stem of the hairpin). For example, in some embodiments, the loop region may be substantially single stranded and act as a spacer between the self-complementary regions of the hairpin stem-loop. In some embodiments, the loop region can comprise a random or nonsense nucleotide sequence and thus not share sequence identity to a target polynucleotide. In other embodiments, the loop region comprises a sense or an antisense RNA sequence or fragment thereof that shares identity to a target polynucleotide. See, for example, International Patent Publication No. WO 02/00904, herein incorporated by reference. In specific embodiments, the loop region can be optimized to be as short as possible while still providing enough intramolecular flexibility to allow the formation of the base-paired stem region. Accordingly, the loop sequence is generally less than 1000, 900, 800, 700, 600, 500, 400, 300, 200, 100, 50, 25, 20, 15, 10 nucleotides or less.
[0038] The "first" and the "third" segment of the hairpin RNA molecule comprise the base-paired stem of the hairpin structure. The first and the third segments are inverted repeats of one another and share sufficient complementarity to allow the formation of the base-paired stem region. In specific embodiments, the first and the third segments are fully complementary to one another. Alternatively, the first and the third segment may be partially complementary to each other so long as they are capable of hybridizing to one another to form a base-paired stem region. The amount of complementarity between the first and the third segment can be calculated as a percentage of the entire segment. Thus, the first and the third segment of the hairpin RNA generally share at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, up to and including 100% complementarity.
[0039] The first and the third segment are at least about 1000, 500, 400, 300, 200, 100, 50, 40, 30, 25, 22, 20, 19, 18, 17, 16, 15 or 10 nucleotides in length. In specific embodiments, the length of the first and/or the third segment is about 10-100 nucleotides, about 10 to about 75 nucleotides, about 10 to about 50 nucleotides, about 10 to about 40 nucleotides, about 10 to about 35 nucleotides, about 10 to about 30 nucleotides, about 10 to about 25 nucleotides, about 10 to about 19 nucleotides, about 50 nucleotides to about 100 nucleotides, about 100 nucleotides to about 150 nucleotides, about 150 nucleotides to about 200 nucleotides, about 200 nucleotides to about 250 nucleotides, about 250 nucleotides to about 300 nucleotides, about 300 nucleotides to about 350 nucleotides, about 350 nucleotides to about 400 nucleotides, about 400 nucleotides to about 500 nucleotides, about 600 nucleotides, about 700 nucleotides, about 800 nucleotides, about 900 nucleotides, about 1000 nucleotides, about 1100 nucleotides, about 1200 nucleotides, 1300 nucleotides, 1400 nucleotides, 1500 nucleotides, 1600 nucleotides, 1700 nucleotides, 1800 nucleotides, 1900 nucleotides, 2000 nucleotides or longer. In other embodiments, the length of the first and/or the third segment comprises at least 10-19 nucleotides; 19-35 nucleotides; 30-45 nucleotides; 40-50 nucleotides; 50-100 nucleotides; 100-300 nucleotides; about 500-700 nucleotides; about 700-900 nucleotides; about 900-1100 nucleotides; about 1300-1500 nucleotides; about 1500-1700 nucleotides; about 1700-1900 nucleotides; about 1900-2100 nucleotides; about 2100-2300 nucleotides; or about 2300-2500 nucleotides. See, for example, International Publication No. WO 0200904. In specific embodiments, the first and the third segment comprise at least 19 nucleotides having at least 85% complementary to the first segment. In still other embodiments, the first and the third segments which form the stem-loop structure of the hairpin comprises 3' or 5' overhang regions having unpaired nucleotide residues.
[0040] Hairpin molecules or double-stranded RNA molecules of the present invention may have more than one sequence of the present invention or active fragments or variants, or complements thereof, found in the same portion of the RNA molecule. For example, in a chimeric hairpin structure, the first segment of a hairpin molecule comprises two polynucleotide sections, each with a different sequence of the present invention. For example, reading from one terminus of the hairpin, the first segment is composed of sequences from two separate genes (A followed by B). This first segment is followed by the second segment, the loop portion of the hairpin. The loop segment is followed by the third segment, where the complementary strands of the sequences in the first segment are found (B* followed by A*) In forming the stem-loop, hairpin structure, the stem contains SeqA-A* at the distal end of the stem and SeqB-B* proximal to the loop region.
[0041] In specific embodiments, the sequences used in the first, the second, and/or the third segments comprise domains that are designed to have sufficient sequence identity to a target polynucleotide of interest and thereby have the ability to decrease the level of expression of the target polynucleotide. The specificity of the inhibitory RNA transcripts is therefore generally conferred by these domains of the silencing element. Thus, in some embodiments of the invention, the first, second and/or third segment of the silencing element comprise a domain having at least 10, at least 15, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 30, at least 40, at least 50, at least 100, at least 200, at least 300, at least 500, at least 1000, or more than 1000 nucleotides that share sufficient sequence identity to the target polynucleotide to allow for a decrease in expression levels of the target polynucleotide when expressed in an appropriate cell. In other embodiments, the domain is between about 15 to 50 nucleotides, about 19-35 nucleotides, about 25-50 nucleotides, about 19 to 75 nucleotides, about 40-90 nucleotides, about 15-100 nucleotides, 10-100 nucleotides, about 10 to about 75 nucleotides, about 10 to about 50 nucleotides, about 10 to about 40 nucleotides, about 10 to about 35 nucleotides, about 10 to about 30 nucleotides, about 10 to about 25 nucleotides, about 10 to about 19 nucleotides, about 50 nucleotides to about 100 nucleotides, about 100 nucleotides to about 150 nucleotides, about 150 nucleotides to about 200 nucleotides, about 200 nucleotides to about 250 nucleotides, about 250 nucleotides to about 300 nucleotides, about 300 nucleotides to about 350 nucleotides, about 350 nucleotides to about 400 nucleotides, about 400 nucleotide to about 500 nucleotides or longer. In other embodiments, the length of the first and/or the third segment comprises at least 10-19 nucleotides, 19-35 nucleotides, 30-45 nucleotides, 40-50 nucleotides, 50-100 nucleotides, or about 100-300 nucleotides.
[0042] In specific embodiments, the domain of the first, the second, and/or the third segment has 100% sequence identity to the target polynucleotide. In other embodiments, the domain of the first, the second and/or the third segment having homology to the target polypeptide have at least 50%, 60%, 70%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or greater sequence identity to a region of the target polynucleotide. The sequence identity of the domains of the first, the second and/or the third segments to the target polynucleotide need only be sufficient to decrease expression of the target polynucleotide of interest. See, for example, Chuang and Meyerowitz (2000) Proc. Natl. Acad. Sci. USA 97:4985-4990; Stoutjesdijk et al. (2002) Plant Physiol. 129:1723-1731; Waterhouse and Helliwell (2003) Nat. Rev. Genet. 4:29-38; Pandolfini et al. BMC Biotechnology 3:7, and U.S. Patent Publication No. 20030175965; each of which is herein incorporated by reference. A transient assay for the efficiency of hpRNA constructs to silence gene expression in vivo has been described by Panstruga et al. (2003) Mol. Biol. Rep. 30:135-140, herein incorporated by reference.
[0043] The amount of complementarity shared between the first, second, and/or third segment and the target polynucleotide or the amount of complementarity shared between the first segment and the third segment (i.e., the stem of the hairpin structure) may vary depending on the organism in which gene expression is to be controlled. Some organisms or cell types may require exact pairing or 100% identity, while other organisms or cell types may tolerate some mismatching. In some cells, for example, a single nucleotide mismatch in the targeting sequence abrogates the ability to suppress gene expression. In these cells, the suppression cassettes of the invention can be used to target the suppression of mutant genes, for example, oncogenes whose transcripts comprise point mutations and therefore they can be specifically targeted using the methods and compositions of the invention without altering the expression of the remaining wild-type allele.
[0044] Any region of the target polynucleotide can be used to design the domain of the silencing element that shares sufficient sequence identity to allow expression of the hairpin transcript to decrease the level of the target polynucleotide. For instance, the domain can be designed to share sequence identity to the 5' untranslated region of the target polynucleotide(s), the 3' untranslated region of the target polynucleotide(s), exonic regions of the target polynucleotide(s), intronic regions of the target polynucleotide(s), and any combination thereof. In specific embodiments, a domain of the silencing element shares sufficient homology to at least about 15, 16, 17, 18, 19, 20, 22, 25 or 30 consecutive nucleotides from about nucleotides 1-50, 25-75, 75-125, 50-100, 125-175, 175-225, 100-150, 150-200, 200-250, 225-275, 275-325, 250-300, 325-375, 375-425, 300-350, 350-400, 425-475, 400-450, 475-525, 450-500, 525-575, 575-625, 550-600, 625-675, 675-725, 600-650, 625-675, 675-725, 650-700, 725-825, 825-875, 750-800, 875-925, 925-975, 850-900, 925-975, 975-1025, 950-1000, 1000-1050, 1025-1075, 1075-1125, 1050-1100, 1125-1175, 1100-1200, 1175-1225, 1225-1275, 1200-1300, 1325-1375, 1375-1425, 1300-1400, 1425-1475, 1475-1525, 1400-1500, 1525-1575, 1575-1625, 1625-1675, 1675-1725, 1725-1775, 1775-1825, 1825-1875, 1875-1925, 1925-1975, 1975-2025, 2025-2075, 2075-2125, 2125-2175, 2175-2225, 1500-1600, 1600-1700, 1700-1800, 1800-1900, 1900-2000 of the target sequence. In some instances to optimize the siRNA sequences employed in the hairpin, the synthetic oligodeoxyribonucleotide/RNase H method can be used to determine sites on the target mRNA that are in a conformation that is susceptible to RNA silencing. See, for example, Vickers et al. (2003) J. Biol. Chem 278:7108-7118 and Yang et al. (2002) Proc. Natl. Acad. Sci. USA 99:9442-9447, herein incorporated by reference. These studies indicate that there is a significant correlation between the RNase-H-sensitive sites and sites that promote efficient siRNA-directed mRNA degradation.
[0045] The hairpin silencing element may also be designed such that the sense sequence or the antisense sequence do not correspond to a target polynucleotide. In this embodiment, the sense and antisense sequence flank a loop sequence that comprises a nucleotide sequence corresponding to all or part of the target polynucleotide. Thus, it is the loop region that determines the specificity of the RNA interference. See, for example, WO 02/00904, herein incorporated by reference.
[0046] In addition, transcriptional gene silencing (TGS) may be accomplished through use of a hairpin suppression element where the inverted repeat of the hairpin shares sequence identity with the promoter region of a target polynucleotide to be silenced. See, for example, Aufsatz et al. (2002) PNAS 99 (Suppl. 4):16499-16506 and Mette et al. (2000) EMBO J 19(19):5194-5201.
[0047] d. MicroRNA (miRNA) Silencing Element
[0048] In other embodiments, the silencing element can comprise a microRNA (miRNA). "MicroRNAs" or "miRNAs" are regulatory agents comprising about 19 to about 24 ribonucleotides in length, which are highly efficient at inhibiting the expression of target polynucleotides. See, for example Javier et al. (2003) Nature 425: 257-263, herein incorporated by reference. For miRNA interference, the silencing element can be designed to express a dsRNA molecule that forms a partially base-paired structure containing a 19, 20, 21, 22, 23, 24 or 25 nucleotide sequence that is complementary to the target polynucleotide of interest. The miRNA can be synthetically made, or transcribed as a longer RNA which is subsequently cleaved to produce the active miRNA. The miRNA can be an "artificial miRNA" or "amiRNA" which comprises a miRNA sequence that is synthetically designed to silence a target sequence.
[0049] When expressing an miRNA, the final (mature) miRNA is present in a duplex in a precursor backbone structure, the two strands being referred to as the miRNA (the strand that will eventually base pair with the target) and miRNA* (star sequence). It has been demonstrated that miRNAs can be transgenically expressed and target genes of interest efficiently silenced (Highly specific gene silencing by artificial microRNAs in Arabidopsis Schwab R, Ossowski S, Riester M, Warthmann N, Weigel D. Plant Cell. 2006 May; 18(5):1121-33. Epub 2006 Mar. 10 & Expression of artificial microRNAs in transgenic Arabidopsis thaliana confers virus resistance. Niu Q W, Lin S S, Reyes J L, Chen K C, Wu H W, Yeh S D, Chua N H. Nat Biotechnol. 2006 November; 24(11):1420-8. Epub 2006 Oct. 22. Erratum in: Nat Biotechnol. 2007 February; 25(2):254.)
[0050] The silencing element for miRNA interference comprises a miRNA primary sequence. The miRNA primary sequence comprises a DNA sequence having the miRNA and star sequences separated by a loop as well as additional sequences flanking this region that are important for processing. When expressed as an RNA, the structure of the primary miRNA is such as to allow for the formation of a hairpin RNA structure that can be processed into a mature miRNA. In some embodiments, the miRNA backbone comprises a genomic or cDNA miRNA precursor sequence, wherein said sequence comprises a native primary in which a heterologous (artificial) mature miRNA and star sequence are inserted.
[0051] As used herein, a "star sequence" is the sequence within a miRNA precursor backbone that is complementary to the miRNA and forms a duplex with the miRNA to form the stem structure of a hairpin RNA. In some embodiments, the star sequence can comprise less than 100% complementarity to the miRNA sequence. Alternatively, the star sequence can comprise at least 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80% or lower sequence complementarity to the miRNA sequence as long as the star sequence has sufficient complementarity to the miRNA sequence to form a double stranded structure. In still further embodiments, the star sequence comprises a sequence having 1, 2, 3, 4, 5 or more mismatches with the miRNA sequence and still has sufficient complementarity to form a double stranded structure with the miRNA sequence resulting in production of miRNA and suppression of the target sequence.
[0052] The miRNA precursor backbones can be from any plant. In some embodiments, the miRNA precursor backbone is from a monocot. In other embodiments, the miRNA precursor backbone is from a dicot. In further embodiments, the backbone is from maize or soybean. MicroRNA precursor backbones have been described previously. For example, US20090155910A1 (WO 2009/079532) discloses the following soybean miRNA precursor backbones: 156c, 159, 166b, 168c, 396b and 398b, and US20090155909A1 (WO 2009/079548) discloses the following maize miRNA precursor backbones: 159c, 164h, 168a, 169r, and 396h. Each of these references is incorporated by reference in their entirety.
[0053] Thus, the primary miRNA can be altered to allow for efficient insertion of heterologous miRNA and star sequences within the miRNA precursor backbone. In such instances, the miRNA segment and the star segment of the miRNA precursor backbone are replaced with the heterologous miRNA and the heterologous star sequences, designed to target any sequence of interest, using a PCR technique and cloned into an expression construct. It is recognized that there could be alterations to the position at which the artificial miRNA and star sequences are inserted into the backbone. Detailed methods for inserting the miRNA and star sequence into the miRNA precursor backbone are described in, for example, US Patent Applications 20090155909A1 and US20090155910A1, herein incorporated by reference in their entirety.
[0054] When designing a miRNA sequence and star sequence, various design choices can be made. See, for example, Schwab R, et al. (2005) Dev Cell 8: 517-27. In non-limiting embodiments, the miRNA sequences disclosed herein can have a "U" at the 5'-end, a "C" or "G" at the 19.sup.th nucleotide position, and an "A" or "U" at the 10th nucleotide position. In other embodiments, the miRNA design is such that the miRNA have a high free delta-G as calculated using the ZipFold algorithm (Markham, N. R. & Zuker, M. (2005) Nucleic Acids Res. 33: W577-W581.) Optionally, a one base pair change can be added within the 5' portion of the miRNA so that the sequence differs from the target sequence by one nucleotide.
[0055] The methods and compositions of the invention employ silencing elements that when transcribed "form" a dsRNA molecule. Accordingly, the heterologous polynucleotide being expressed need not form the dsRNA by itself, but can interact with other sequences in the plant cell or in the pest gut after ingestion to allow the formation of the dsRNA. For example, a chimeric polynucleotide that can selectively silence the target polynucleotide can be generated by expressing a chimeric construct comprising the target sequence for a miRNA or siRNA to a sequence corresponding to all or part of the gene or genes to be silenced. In this embodiment, the dsRNA is "formed" when the target for the miRNA or siRNA interacts with the miRNA present in the cell. The resulting dsRNA can then reduce the level of expression of the gene or genes to be silenced. See, for example, US Application Publication 2007-0130653, entitled "Methods and Compositions for Gene Silencing", herein incorporated by reference. The construct can be designed to have a target for an endogenous miRNA or alternatively, a target for a heterologous and/or synthetic miRNA can be employed in the construct. If a heterologous and/or synthetic miRNA is employed, it can be introduced into the cell on the same nucleotide construct as the chimeric polynucleotide or on a separate construct. As discussed elsewhere herein, any method can be used to introduce the construct comprising the heterologous miRNA.
[0056] e. Silencing Elements
[0057] A silencing element may comprise a chimeric construction molecule comprising two or more sequences of the present invention. For example, the chimeric construction may be a hairpin or dsRNA as disclosed herein. A chimera may comprise two or more sequences of the present invention. Providing at least two different sequences in a single silencing element may allow for targeting multiple genes using one silencing element and/or for example, one expression cassette. Targeting multiple genes may allow for slowing or reducing the possibility of resistance by the pest, and providing the multiple targeting ability in one expressed molecule may reduce the expression burden of the transformed plant or plant product, or provide topical treatments that are capable of targeting multiple hosts with one application.
[0058] IV. Variants and Fragments
[0059] By "fragment" is intended a portion of the polynucleotide or a portion of the amino acid sequence and hence protein encoded thereby. Fragments of a polynucleotide may encode protein fragments that retain the biological activity of the native protein. Alternatively, fragments of a polynucleotide that are useful as a silencing element do not need to encode fragment proteins that retain biological activity. Thus, fragments of a nucleotide sequence may range from at least about 10, about 15, about 16, about 17, about 18, about 19, about 20 nucleotides, about 22 nucleotides, about 50 nucleotides, about 75 nucleotides, about 100 nucleotides, 200 nucleotides, 300 nucleotides, 400 nucleotides, 500 nucleotides, 600 nucleotides, 700 nucleotides and up to the full-length polynucleotide employed in the invention. Alternatively, fragments of a nucleotide sequence may range from 1-50, 25-75, 75-125, 50-100, 125-175, 175-225, 100-150, 150-200, 200-250, 225-275, 275-325, 250-300, 325-375, 375-425, 300-350, 350-400, 425-475, 400-450, 475-525, 450-500, 525-575, 575-625, 550-600, 625-675, 675-725, 600-650, 625-675, 675-725, 650-700, 725-825, 825-875, 750-800, 875-925, 925-975, 850-900, 925-975, 975-1025, 950-1000, 1000-1050, 1025-1075, 1075-1125, 1050-1100, 1125-1175, 1100-1200, 1175-1225, 1225-1275, 1200-1300, 1325-1375, 1375-1425, 1300-1400, 1425-1475, 1475-1525, 1400-1500, 1525-1575, 1575-1625, 1625-1675, 1675-1725, 1725-1775, 1775-1825, 1825-1875, 1875-1925, 1925-1975, 1975-2025, 2025-2075, 2075-2125, 2125-2175, 2175-2225, 1500-1600, 1600-1700, 1700-1800, 1800-1900, 1900-2000 of any one of SEQ ID NOS: 6-12, 18-40. Methods to assay for the activity of a desired silencing element are described elsewhere herein.
[0060] "Variants" is intended to mean substantially similar sequences. For polynucleotides, a variant comprises a deletion and/or addition of one or more nucleotides at one or more internal sites within the native polynucleotide and/or a substitution of one or more nucleotides at one or more sites in the native polynucleotide. A variant of a polynucleotide that is useful as a silencing element will retain the ability to reduce expression of the target polynucleotide and, in some embodiments, thereby control a pest of interest. As used herein, a "native" polynucleotide or polypeptide comprises a naturally occurring nucleotide sequence or amino acid sequence, respectively. For polynucleotides, conservative variants include those sequences that, because of the degeneracy of the genetic code, encode the amino acid sequence of one of the polypeptides employed in the invention. Variant polynucleotides also include synthetically derived polynucleotides, such as those generated, for example, by using site-directed mutagenesis, but continue to retain the desired activity. Generally, variants of a particular polynucleotide of the invention (i.e., a silencing element) will have at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to that particular polynucleotide as determined by sequence alignment programs and parameters described elsewhere herein.
[0061] Variants of a particular polynucleotide of the invention (i.e., the reference polynucleotide) can also be evaluated by comparison of the percent sequence identity between the polypeptide encoded by a variant polynucleotide and the polypeptide encoded by the reference polynucleotide. Percent sequence identity between any two polypeptides can be calculated using sequence alignment programs and parameters described elsewhere herein. Where any given pair of polynucleotides employed in the invention is evaluated by comparison of the percent sequence identity shared by the two polypeptides they encode, the percent sequence identity between the two encoded polypeptides is at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more sequence identity.
[0062] The following terms are used to describe the sequence relationships between two or more polynucleotides or polypeptides: (a) "reference sequence", (b) "comparison window", (c) "sequence identity", and, (d) "percentage of sequence identity."
[0063] (a) As used herein, "reference sequence" is a defined sequence used as a basis for sequence comparison. A reference sequence may be a subset or the entirety of a specified sequence; for example, as a segment of a full-length cDNA or gene sequence, or the complete cDNA or gene sequence.
[0064] (b) As used herein, "comparison window" makes reference to a contiguous and specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two polynucleotides. Generally, the comparison window is at least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that to avoid a high similarity to a reference sequence due to inclusion of gaps in the polynucleotide sequence a gap penalty is typically introduced and is subtracted from the number of matches.
[0065] Unless otherwise stated, sequence identity/similarity values provided herein refer to the value obtained using GAP Version 10 using the following parameters: % identity and % similarity for a nucleotide sequence using GAP Weight of 50 and Length Weight of 3, and the nwsgapdna.cmp scoring matrix; % identity and % similarity for an amino acid sequence using GAP Weight of 8 and Length Weight of 2, and the BLOSUM62 scoring matrix; or any equivalent program thereof. By "equivalent program" is intended any sequence comparison program that, for any two sequences in question, generates an alignment having identical nucleotide or amino acid residue matches and an identical percent sequence identity when compared to the corresponding alignment generated by GAP Version 10.
[0066] (c) As used herein, "sequence identity" or "identity" in the context of two polynucleotides or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins it is recognized that residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. When sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have "sequence similarity" or "similarity". Means for making this adjustment are well known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., as implemented in the program PC/GENE (Intelligenetics, Inc., Mountain View, Calif.).
[0067] (d) As used herein, "percentage of sequence identity" means the value determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity.
[0068] A method is provided for identifying a silencing element from the target polynucleotides set forth in SEQ ID NOS: 6-12, 18-40. Such methods comprise obtaining a candidate fragment of any one or more of SEQ ID NOS: 6-12, 18-40 which is of sufficient length to act as a silencing element and thereby reduce the expression of the target polynucleotide and/or control a desired pest; expressing said candidate polynucleotide fragment in an appropriate expression cassette to produce a candidate silencing element and determining if said candidate polynucleotide fragment has the activity of a silencing element, thereby reducing the expression of the target polynucleotide and/or controlling a desired pest. Methods of identifying such candidate fragments based on the desired pathway for suppression are known. For example, various bioinformatics programs can be employed to identify the region of the target polynucleotides that could be exploited to generate a silencing element. See, for example, Elbahir et al. (2001) Genes and Development 15:188-200, Schwartz et al. (2003) Cell 115:199-208, Khvorova et al. (2003) Cell 115:209-216. See also, siRNA at Whitehead (jura.wi.mit.edu/bioc/siRNAext/) which calculates the binding energies for both sense and antisense siRNAs. See, also genscript.com/ssl-bin/app/rnai?op=known; Block-iT.TM. RNAi designer from Invitrogen and GenScript siRNA Construct Builder.
V. DNA Constructs
[0069] The use of the term "polynucleotide" is not intended to limit the present invention to polynucleotides comprising DNA. Those of ordinary skill in the art will recognize that polynucleotides can comprise ribonucleotides and combinations of ribonucleotides and deoxyribonucleotides. Such deoxyribonucleotides and ribonucleotides include both naturally occurring molecules and synthetic analogues. The polynucleotides of the invention also encompass all forms of sequences including, but not limited to, single-stranded forms, double-stranded forms, hairpins, stem-and-loop structures, and the like.
[0070] The polynucleotide encoding the silencing element(s) or in specific embodiments employed in the methods and compositions of the invention can be provided in expression cassettes for expression in a plant or organism of interest. It is recognized that multiple silencing elements including multiple identical silencing elements, multiple silencing elements targeting different regions of the target sequence, or multiple silencing elements from different target sequences can be used. In this embodiment, it is recognized that each silencing element can be contained in a single or separate cassette, DNA construct, or vector. As discussed, any means of providing the silencing element is contemplated. A plant or plant cell can be transformed with a single cassette comprising DNA encoding one or more silencing elements or separate cassettes comprising each silencing element can be used to transform a plant or plant cell or host cell. Likewise, a plant transformed with one component can be subsequently transformed with the second component. One or more silencing elements can also be brought together by sexual crossing. That is, a first plant comprising one component is crossed with a second plant comprising the second component. Progeny plants from the cross will comprise both components.
[0071] The expression cassette can include 5' and 3' regulatory sequences operably linked to the polynucleotide of the invention. "Operably linked" is intended to mean a functional linkage between two or more elements. For example, an operable linkage between a polynucleotide of the invention and a regulatory sequence (i.e., a promoter) is a functional link that allows for expression of the polynucleotide of the invention. Operably linked elements may be contiguous or non-contiguous. When used to refer to the joining of two protein coding regions, by operably linked is intended that the coding regions are in the same reading frame. The cassette may additionally contain at least one additional polynucleotide to be co-transformed into the organism. Alternatively, the additional polypeptide(s) can be provided on multiple expression cassettes. Expression cassettes can be provided with a plurality of restriction sites and/or recombination sites for insertion of the polynucleotide to be under the transcriptional regulation of the regulatory regions. The expression cassette may additionally contain selectable marker genes.
[0072] The expression cassette can include in the 5'-3' direction of transcription, a transcriptional and translational initiation region (i.e., a promoter), a polynucleotide comprising the silencing element employed in the methods and compositions of the invention, and a transcriptional and translational termination region (i.e., termination region) functional in plants. In another embodiment, the double stranded RNA is expressed from a suppression cassette. Such a cassette can comprise two convergent promoters that drive transcription of an operably linked silencing element. "Convergent promoters" refers to promoters that are oriented on either terminus of the operably linked silencing element such that each promoter drives transcription of the silencing element in opposite directions, yielding two transcripts. In such embodiments, the convergent promoters allow for the transcription of the sense and anti-sense strand and thus allow for the formation of a dsRNA.
[0073] The regulatory regions (i.e., promoters, transcriptional regulatory regions, and translational termination regions) and/or the polynucleotides employed in the invention may be native/analogous to the host cell or to each other. Alternatively, the regulatory regions and/or the polynucleotide employed in the invention may be heterologous to the host cell or to each other. As used herein, "heterologous" in reference to a sequence is a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous polynucleotide is from a species different from the species from which the polynucleotide was derived, or, if from the same/analogous species, one or both are substantially modified from their original form and/or genomic locus, or the promoter is not the native promoter for the operably linked polynucleotide. As used herein, a chimeric gene comprises a coding sequence operably linked to a transcription initiation region that is heterologous to the coding sequence.
[0074] The termination region may be native with the transcriptional initiation region, may be native with the operably linked polynucleotide encoding the silencing element, may be native with the plant host, or may be derived from another source (i.e., foreign or heterologous) to the promoter, the polynucleotide comprising silencing element, the plant host, or any combination thereof. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See also Guerineau et al. (1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158; Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; and Joshi et al. (1987) Nucleic Acids Res. 15:9627-9639.
[0075] Additional sequence modifications are known to enhance gene expression in a cellular host. These include elimination of sequences encoding spurious polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and other such well-characterized sequences that may be deleterious to gene expression. The G-C content of the sequence may be adjusted to levels average for a given cellular host, as calculated by reference to known genes expressed in the host cell. When possible, the sequence is modified to avoid predicted hairpin secondary mRNA structures.
[0076] In preparing the expression cassette, the various DNA fragments may be manipulated, so as to provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. Toward this end, adapters or linkers may be employed to join the DNA fragments or other manipulations may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, resubstitutions, e.g., transitions and transversions, may be involved.
[0077] A number of promoters can be used in the practice of the invention. The polynucleotide encoding the silencing element can be combined with constitutive, tissue-preferred, or other promoters for expression in plants.
[0078] Such constitutive promoters include, for example, the core promoter of the Rsyn7 promoter and other constitutive promoters disclosed in WO 99/43838 and U.S. Pat. No. 6,072,050; the core CaMV 35S promoter (Odell et al. (1985) Nature 313:810-812); rice actin (McElroy et al. (1990) Plant Cell 2:163-171); ubiquitin (Christensen et al. (1989) Plant Mol. Biol. 12:619-632 and Christensen et al. (1992) Plant Mol. Biol. 18:675-689); pEMU (Last et al. (1991) Theor. Appl. Genet. 81:581-588); MAS (Velten et al. (1984) EMBO J. 3:2723-2730); ALS promoter (U.S. Pat. No. 5,659,026), and the like. Other constitutive promoters include, for example, U.S. Pat. Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142; and 6,177,611.
[0079] An inducible promoter, for instance, a pathogen-inducible promoter could also be employed. Such promoters include those from pathogenesis-related proteins (PR proteins), which are induced following infection by a pathogen; e.g., PR proteins, SAR proteins, beta-1,3-glucanase, chitinase, etc. See, for example, Redolfi et al. (1983) Neth. J. Plant Pathol. 89:245-254; Uknes et al. (1992) Plant Cell 4:645-656; and Van Loon (1985) Plant Mol. Virol. 4:111-116. See also WO 99/43819, herein incorporated by reference.
[0080] Additionally, as pathogens find entry into plants through wounds or insect damage, a wound-inducible promoter may be used in the constructions of the invention. Such wound-inducible promoters include potato proteinase inhibitor (pin II) gene (Ryan (1990) Ann. Rev. Phytopath. 28:425-449; Duan et al. (1996) Nature Biotechnology 14:494-498); wun1 and wun2, U.S. Pat. No. 5,428,148; win1 and win2 (Stanford et al. (1989) Mol. Gen. Genet. 215:200-208); systemin (McGurl et al. (1992) Science 225:1570-1573); WIP1 (Rohmeier et al. (1993) Plant Mol. Biol. 22:783-792; Eckelkamp et al. (1993) FEBS Letters 323:73-76); MPI gene (Corderok et al. (1994) Plant J. 6(2):141-150); and the like, herein incorporated by reference.
[0081] Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator. Depending upon the objective, the promoter may be a chemical-inducible promoter, where application of the chemical induces gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides, and the tobacco PR-la promoter, which is activated by salicylic acid. Other chemical-regulated promoters of interest include steroid-responsive promoters (see, for example, the glucocorticoid-inducible promoter in Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88:10421-10425 and McNellis et al. (1998) Plant J. 14(2):247-257) and tetracycline-inducible and tetracycline-repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227:229-237, and U.S. Pat. Nos. 5,814,618 and 5,789,156), herein incorporated by reference.
[0082] Tissue-preferred promoters can be utilized to target enhanced expression within a particular plant tissue. Tissue-preferred promoters include Yamamoto et al. (1997) Plant J. 12(2):255-265; Kawamata et al. (1997) Plant Cell Physiol. 38(7):792-803; Hansen et al. (1997) Mol. Gen Genet. 254(3):337-343; Russell et al. (1997) Transgenic Res. 6(2):157-168; Rinehart et al. (1996) Plant Physiol. 112(3):1331-1341; Van Camp et al. (1996) Plant Physiol. 112(2):525-535; Canevascini et al. (1996) Plant Physiol. 112(2):513-524; Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778; Lam (1994) Results Probl. Cell Differ. 20:181-196; Orozco et al. (1993) Plant Mol Biol. 23(6):1129-1138; Matsuoka et al. (1993) Proc Natl. Acad. Sci. USA 90(20):9586-9590; and Guevara-Garcia et al. (1993) Plant J. 4(3):495-505. Such promoters can be modified, if necessary, for weak expression.
[0083] Leaf-preferred promoters are known in the art. See, for example, Yamamoto et al. (1997) Plant J. 12(2):255-265; Kwon et al. (1994) Plant Physiol. 105:357-67; Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778; Gotor et al. (1993) Plant J. 3:509-18; Orozco et al. (1993) Plant Mol. Biol. 23(6):1129-1138; and Matsuoka et al. (1993) Proc. Natl. Acad. Sci. USA 90(20):9586-9590.
[0084] Root-preferred promoters are known and can be selected from the many available from the literature or isolated de novo from various compatible species. See, for example, Hire et al. (1992) Plant Mol. Biol. 20(2):207-218 (soybean root-specific glutamine synthetase gene); Keller and Baumgartner (1991) Plant Cell 3(10):1051-1061 (root-specific control element in the GRP 1.8 gene of French bean); Sanger et al. (1990) Plant Mol. Biol. 14(3):433-443 (root-specific promoter of the mannopine synthase (MAS) gene of Agrobacterium tumefaciens); and Miao et al. (1991) Plant Cell 3(1):11-22 (full-length cDNA clone encoding cytosolic glutamine synthetase (GS), which is expressed in roots and root nodules of soybean). See also Bogusz et al. (1990) Plant Cell 2(7):633-641, where two root-specific promoters isolated from hemoglobin genes from the nitrogen-fixing nonlegume Parasponia andersonii and the related non-nitrogen-fixing nonlegume Trema tomentosa are described. The promoters of these genes were linked to a .beta.-glucuronidase reporter gene and introduced into both the nonlegume Nicotiana tabacum and the legume Lotus corniculatus, and in both instances root-specific promoter activity was preserved. Leach and Aoyagi (1991) describe their analysis of the promoters of the highly expressed rolC and rolD root-inducing genes of Agrobacterium rhizogenes (see Plant Science (Limerick) 79(1):69-76). They concluded that enhancer and tissue-preferred DNA determinants are dissociated in those promoters. Teeri et al. (1989) used gene fusion to lacZ to show that the Agrobacterium T-DNA gene encoding octopine synthase is especially active in the epidermis of the root tip and that the TR2' gene is root specific in the intact plant and stimulated by wounding in leaf tissue, an especially desirable combination of characteristics for use with an insecticidal or larvicidal gene (see EMBO J. 8(2): 343-350). The TR1' gene, fused to nptII (neomycin phosphotransferase II) showed similar characteristics. Additional root-preferred promoters include the VfENOD-GRP3 gene promoter (Kuster et al. (1995) Plant Mol. Biol. 29(4):759-772); and rolB promoter (Capana et al. (1994) Plant Mol. Biol. 25(4):681-691. See also U.S. Pat. Nos. 5,837,876; 5,750,386; 5,633,363; 5,459,252; 5,401,836; 5,110,732; and 5,023,179.
[0085] In one embodiment of this invention the plant-expressed promoter is a vascular-specific promoter such as a phloem-specific promoter. A "vascular-specific" promoter, as used herein, is a promoter which is at least expressed in vascular cells, or a promoter which is preferentially expressed in vascular cells. Expression of a vascular-specific promoter need not be exclusively in vascular cells, expression in other cell types or tissues is possible. A "phloem-specific promoter" as used herein, is a plant-expressible promoter which is at least expressed in phloem cells, or a promoter which is preferentially expressed in phloem cells.
[0086] Expression of a phloem-specific promoter need not be exclusively in phloem cells, expression in other cell types or tissues, e.g., xylem tissue, is possible. In one embodiment of this invention, a phloem-specific promoter is a plant-expressible promoter at least expressed in phloem cells, wherein the expression in non-phloem cells is more limited (or absent) compared to the expression in phloem cells. Examples of suitable vascular-specific or phloem-specific promoters in accordance with this invention include but are not limited to the promoters selected from the group consisting of: the SCSV3, SCSV4, SCSVS, and SCSV7 promoters (Schunmann et al. (2003) Plant Functional Biology 30:453-60; the rolC gene promoter of Agrobacterium rhizogenes (Kiyokawa et al. (1994) Plant Physiology 104:801-02; Pandolfini et al. (2003) BioMedCentral (BMC) Biotechnology 3:7, (www.biomedcentral.com/1472-6750/3/7); Graham et al. (1997) Plant Mol. Biol. 33:729-35; Guivarc'h et al. (1996); Almon et al. (1997) Plant Physiol. 115:1599-607; the rolA gene promoter of Agrobacterium rhizogenes (Dehio et al. (1993) Plant Mol. Biol. 23:1199-210); the promoter of the Agrobacterium tumefaciens T-DNA gene 5 (Korber et al. (1991) EMBO J. 10:3983-91); the rice sucrose synthase RSs1 gene promoter (Shi et al. (1994) J. Exp. Bot. 45:623-31); the CoYMV or Commelina yellow mottle badnavirus promoter (Medberry et al. (1992) Plant Cell 4:185-92; Zhou et al. (1998) Chin. J. Biotechnol. 14:9-16); the CFDV or coconut foliar decay virus promoter (Rohde et al. (1994) Plant Mol. Biol. 27:623-28; Hehn and Rhode (1998) J. Gen. Virol. 79:1495-99); the RTBV or rice tungro bacilliform virus promoter (Yin and Beachy (1995) Plant J. 7:969-80; Yin et al. (1997) Plant J. 12:1179-80); the pea glutamine synthase GS3A gene (Edwards et al. (1990) Proc. Natl. Acad. Sci. USA 87:3459-63; Brears et al. (1991) Plant J. 1:235-44); the inv CD111 and inv CD141 promoters of the potato invertase genes (Hedley et al. (2000) J. Exp. Botany 51:817-21); the promoter isolated from Arabidopsis shown to have phloem-specific expression in tobacco by Kertbundit et al. (1991) Proc. Natl. Acad. Sci. USA 88:5212-16); the VAHOX1 promoter region (Tornero et al. (1996) Plant J. 9:639-48); the pea cell wall invertase gene promoter (Zhang et al. (1996) Plant Physiol. 112:1111-17); the promoter of the endogenous cotton protein related to chitinase of US published patent application 20030106097, an acid invertase gene promoter from carrot (Ramloch-Lorenz et al. (1993) The Plant J. 4:545-54); the promoter of the sulfate transporter gene Sultr1;3 (Yoshimoto et al. (2003) Plant Physiol. 131:1511-17); a promoter of a sucrose synthase gene (Nolte and Koch (1993) Plant Physiol. 101:899-905); and the promoter of a tobacco sucrose transporter gene (Kuhn et al. (1997) Science 275-1298-1300).
[0087] Possible promoters also include the Black Cherry promoter for Prunasin Hydrolase (PH DL1.4 PRO) (U.S. Pat. No. 6,797, 859), thioredoxin H promoter from cucumber and rice (Fukuda A et al. (2005). Plant Cell Physiol. 46(11):1779-86), Rice (RSs1) (Shi, T. Wang et al. (1994). J. Exp. Bot. 45(274): 623-631) and maize sucrose synthase -1 promoters (Yang., N-S. et al. (1990) PNAS 87:4144-4148), PP2 promoter from pumpkin (Guo, H. et al. (2004) Transgenic Research 13:559-566), At SUC2 promoter (Truernit, E. et al. (1995) Planta 196(3):564-70., At SAM1 (S-adenosylmethionine synthetase) (Mijnsbrugge KV. et al. (1996) Planr. Cell. Physiol. 37(8): 1108-1115), and the Rice tungro bacilliform virus (RTBV) promoter (Bhattacharyya-Pakrasi et al. (1993) Plant J. 4(1):71-79).
[0088] The expression cassette can also comprise a selectable marker gene for the selection of transformed cells. Selectable marker genes are utilized for the selection of transformed cells or tissues. Marker genes include genes encoding antibiotic resistance, such as those encoding neomycin phosphotransferase II (NEO) and hygromycin phosphotransferase (HPT), as well as genes conferring resistance to herbicidal compounds, such as glufosinate ammonium, bromoxynil, imidazolinones, and 2,4-dichlorophenoxyacetate (2,4-D). Additional selectable markers include phenotypic markers such as .beta.-galactosidase and fluorescent proteins such as green fluorescent protein (GFP) (Su et al. (2004) Biotechnol Bioeng 85:610-9 and Fetter et al. (2004) Plant Cell 16:215-28), cyan florescent protein (CYP) (Bolte et al. (2004) J. Cell Science 117:943-54 and Kato et al. (2002) Plant Physiol 129:913-42), and yellow florescent protein (PhiYFP.TM. from Evrogen, see, Bolte et al. (2004) J. Cell Science 117:943-54). For additional selectable markers, see generally, Yarranton (1992) Curr. Opin. Biotech. 3:506-511; Christopherson et al. (1992) Proc. Natl. Acad. Sci. USA 89:6314-6318; Yao et al. (1992) Cell 71:63-72; Reznikoff (1992) Mol. Microbiol. 6:2419-2422; Barkley et al. (1980) in The Operon, pp. 177-220; Hu et al. (1987) Cell 48:555-566; Brown et al. (1987) Cell 49:603-612; Figge et al. (1988) Cell 52:713-722; Deuschle et al. (1989) Proc. Natl. Acad. Sci. USA 86:5400-5404; Fuerst et al. (1989) Proc. Natl. Acad. Sci. USA 86:2549-2553; Deuschle et al. (1990) Science 248:480-483; Gossen (1993) Ph.D. Thesis, University of Heidelberg; Reines et al. (1993) Proc. Natl. Acad. Sci. USA 90:1917-1921; Labow et al. (1990) Mol. Cell. Biol. 10:3343-3356; Zambretti et al. (1992) Proc. Natl. Acad. Sci. USA 89:3952-3956; Bairn et al. (1991) Proc. Natl. Acad. Sci. USA 88:5072-5076; Wyborski et al. (1991) Nucleic Acids Res. 19:4647-4653; Hillenand-Wissman (1989) Topics Mol. Struc. Biol. 10:143-162; Degenkolb et al. (1991) Antimicrob. Agents Chemother. 35:1591-1595; Kleinschnidt et al. (1988) Biochemistry 27:1094-1104; Bonin (1993) Ph.D. Thesis, University of Heidelberg; Gossen et al. (1992) Proc. Natl. Acad. Sci. USA 89:5547-5551; Oliva et al. (1992) Antimicrob. Agents Chemother. 36:913-919; Hlavka et al. (1985) Handbook of Experimental Pharmacology, Vol. 78 (Springer-Verlag, Berlin); Gill et al. (1988) Nature 334:721-724. Such disclosures are herein incorporated by reference. The above list of selectable marker genes is not meant to be limiting. Any selectable marker gene can be used in the present invention.
[0089] VI. Compositions Comprising Silencing Elements
[0090] One or more of the polynucleotides comprising a silencing element can be provided as an external composition such as a spray or powder to the plant, plant part, seed, a pest, or an area of cultivation. In another example, a plant is transformed with a DNA construct or expression cassette for expression of at least one silencing element. In either composition, the silencing element, when ingested by an insect, can reduce the level of a target pest sequence and thereby control the pest (i.e., a Pentatomidae plant pest including a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys). It is recognized that the composition can comprise a cell (such as plant cell or a bacterial cell), in which a polynucleotide encoding one or more silencing elements is stably incorporated into the genome and operably linked to promoters active in the cell. Compositions comprising a mixture of cells, some cells expressing at least one silencing element are also encompassed. In other embodiments, compositions comprising the silencing elements are not contained in a cell. In such embodiments, the composition can be applied to an area inhabited by a pest. In one embodiment, the composition is applied externally to a plant (i.e., by spraying a field or area of cultivation) to protect the plant from the pest. Methods of applying nucleotides in such a manner are known to those skilled in the art.
[0091] The composition of the invention can further be formulated as bait. In this embodiment, the compositions comprise a food substance or an attractant which enhances the attractiveness of the composition to the pest.
[0092] The composition comprising the silencing element can be formulated in an agriculturally suitable and/or environmentally acceptable carrier. Such carriers can be any material that the animal, plant or environment to be treated can tolerate. Furthermore, the carrier must be such that the composition remains effective at controlling a pest. Examples of such carriers include water, saline, Ringer's solution, dextrose or other sugar solutions, Hank's solution, and other aqueous physiologically balanced salt solutions, phosphate buffer, bicarbonate buffer and Tris buffer. In addition, the composition may include compounds that increase the half-life of a composition.
[0093] It is recognized that the polynucleotides comprising sequences encoding the silencing element(s) can be used to transform organisms to provide for host organism production of these components, and subsequent application of the host organism to the environment of the target pest(s). Such host organisms include baculoviruses, bacteria, and the like. In this manner, the combination of polynucleotides encoding the silencing element(s) may be introduced via a suitable vector into a microbial host, and said host applied to the environment, or to plants or animals.
[0094] The term "introduced" in the context of inserting a nucleic acid into a cell, means "transfection" or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be stably incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
[0095] Microbial hosts that are known to occupy the "phytosphere" (phylloplane, phyllosphere, rhizosphere, and/or rhizoplana) of one or more crops of interest may be selected. These microorganisms are selected so as to be capable of successfully competing in the particular environment with the wild-type microorganisms, provide for stable maintenance and expression of the sequences encoding the silencing element, and desirably, provide for improved protection of the components from environmental degradation and inactivation.
[0096] Such microorganisms include bacteria, algae, and fungi. Of particular interest are microorganisms such as bacteria, e.g., Pseudomonas, Erwinia, Serratia, Klebsiella, Xanthomonas, Streptomyces, Rhizobium, Rhodopseudomonas, Methylius, Agrobacterium, Acetobacter, Lactobacillus, Arthrobacter, Azotobacter, Leuconostoc, and Alcaligenes, fungi, particularly yeast, e.g., Saccharomyces, Cryptococcus, Kluyveromyces, Sporobolomyces, Rhodotorula, and Aureobasidium. Of particular interest are such phytosphere bacterial species as Pseudomonas syringae, Pseudomonas fluorescens, Serratia marcescens, Acetobacter xylinum, Agrobacteria, Rhodopseudomonas spheroides, Xanthomonas campestris, Rhizobium melioti, Alcaligenes entrophus, Clavibacter xyli and Azotobacter vinlandir, and phytosphere yeast species such as Rhodotorula rubra, R. glutinis, R. marina, R. aurantiaca, Cryptococcus albidus, C. diffluens, C. laurentii, Saccharomyces rosei, S. pretoriensis, S. cerevisiae, Sporobolomyces rosues, S. odorus, Kluyveromyces veronae, and Aureobasidium pollulans. Of particular interest are the pigmented microorganisms.
[0097] A number of ways are available for introducing the polynucleotide comprising the silencing element(s) into the microbial host under conditions that allow for stable maintenance and expression of such nucleotide encoding sequences. For example, expression cassettes can be constructed which include the nucleotide constructs of interest operably linked with the transcriptional and translational regulatory signals for expression of the nucleotide constructs, and a nucleotide sequence homologous with a sequence in the host organism, whereby integration will occur, and/or a replication system that is functional in the host, whereby integration or stable maintenance will occur.
[0098] Transcriptional and translational regulatory signals include, but are not limited to, promoters, transcriptional initiation start sites, operators, activators, enhancers, other regulatory elements, ribosomal binding sites, an initiation codon, termination signals, and the like. See, for example, U.S. Pat. Nos. 5,039,523 and 4,853,331; EPO 0480762A2; Sambrook et al. (2000); Molecular Cloning: A Laboratory Manual (3.sup.rd ed.; Cold Spring Harbor Laboratory Press, Plainview, N.Y.); Davis et al. (1980) Advanced Bacterial Genetics (Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.); and the references cited therein.
[0099] Suitable host cells include the prokaryotes and the lower eukaryotes, such as fungi. Illustrative prokaryotes, both Gram-negative and Gram-positive, include Enterobacteriaceae, such as Escherichia, Erwinia, Shigella, Salmonella, and Proteus; Bacillaceae; Rhizobiceae, such as Rhizobium; Spirillaceae, such as photobacterium, Zymomonas, Serratia, Aeromonas, Vibrio, Desulfovibrio, Spirillum; Lactobacillaceae; Pseudomonadaceae, such as Pseudomonas and Acetobacter; Azotobacteraceae and Nitrobacteraceae. Among eukaryotes are fungi, such as Phycomycetes and Ascomycetes, which includes yeast, such as Saccharomyces and Schizosaccharomyces; and Basidiomycetes yeast, such as Rhodotorula, Aureobasidium, Sporobolomyces, and the like.
[0100] Characteristics of particular interest in selecting a host cell for purposes of the invention include ease of introducing the coding sequence into the host, availability of expression systems, efficiency of expression, stability in the host, and the presence of auxiliary genetic capabilities. Characteristics of interest for use as a pesticide microcapsule include protective qualities, such as thick cell walls, pigmentation, and intracellular packaging or formation of inclusion bodies; leaf affinity; lack of mammalian toxicity; attractiveness to pests for ingestion; and the like. Other considerations include ease of formulation and handling, economics, storage stability, and the like.
[0101] Host organisms of particular interest include yeast, such as Rhodotorula spp., Aureobasidium spp., Saccharomyces spp., and Sporobolomyces spp., phylloplane organisms such as Pseudomonas spp., Envinia spp., and Flavobacterium spp., and other such organisms, including Pseudomonas aeruginosa, Pseudomonas fluorescens, Saccharomyces cerevisiae, Bacillus thuringiensis, Escherichia coli, Bacillus subtilis, and the like.
[0102] The sequences encoding the silencing elements encompassed by the invention can be introduced into microorganisms that multiply on plants (epiphytes) to deliver these components to potential target pests. Epiphytes, for example, can be gram-positive or gram-negative bacteria.
[0103] The silencing element can be fermented in a bacterial host and the resulting bacteria processed and used as a microbial spray in the same manner that Bacillus thuringiensis strains have been used as insecticidal sprays. Any suitable microorganism can be used for this purpose. By way of example, Pseudomonas has been used to express Bacillus thuringiensis endotoxins as encapsulated proteins and the resulting cells processed and sprayed as an insecticide Gaertner et al. (1993), in Advanced Engineered Pesticides, ed. L. Kim (Marcel Decker, Inc.).
[0104] Alternatively, the components of the invention are produced by introducing heterologous genes into a cellular host. Expression of the heterologous sequences results, directly or indirectly, in the intracellular production of the silencing element. These compositions may then be formulated in accordance with conventional techniques for application to the environment hosting a target pest, e.g., soil, water, and foliage of plants. See, for example, EPA 0192319, and the references cited therein.
[0105] In the present invention, a transformed microorganism can be formulated with an acceptable carrier into separate or combined compositions that are, for example, a suspension, a solution, an emulsion, a dusting powder, a dispersible granule, a wettable powder, and an emulsifiable concentrate, an aerosol, an impregnated granule, an adjuvant, a coatable paste, and also encapsulations in, for example, polymer substances.
[0106] Such compositions disclosed above may be obtained by the addition of a surface-active agent, an inert carrier, a preservative, a humectant, a feeding stimulant, an attractant, an encapsulating agent, a binder, an emulsifier, a dye, a UV protectant, a buffer, a flow agent or fertilizers, micronutrient donors, or other preparations that influence plant growth. One or more agrochemicals including, but not limited to, herbicides, insecticides, fungicides, bactericides, nematicides, molluscicides, acaracides, plant growth regulators, harvest aids, and fertilizers, can be combined with carriers, surfactants or adjuvants customarily employed in the art of formulation or other components to facilitate product handling and application for particular target pests. Suitable carriers and adjuvants can be solid or liquid and correspond to the substances ordinarily employed in formulation technology, e.g., natural or regenerated mineral substances, solvents, dispersants, wetting agents, tackifiers, binders, or fertilizers. The active ingredients of the present invention (i.e., at least one silencing element) are normally applied in the form of compositions and can be applied to the crop area, plant, or seed to be treated. For example, the compositions may be applied to grain in preparation for or during storage in a grain bin or silo, etc. The compositions may be applied simultaneously or in succession with other compounds. Methods of applying an active ingredient or a composition that contains at least one silencing element include, but are not limited to, foliar application, seed coating, and soil application. The number of applications and the rate of application depend on the intensity of infestation by the corresponding pest.
[0107] Suitable surface-active agents include, but are not limited to, anionic compounds such as a carboxylate of, for example, a metal; carboxylate of a long chain fatty acid; an N-acylsarcosinate; mono- or di-esters of phosphoric acid with fatty alcohol ethoxylates or salts of such esters; fatty alcohol sulfates such as sodium dodecyl sulfate, sodium octadecyl sulfate, or sodium cetyl sulfate; ethoxylated fatty alcohol sulfates; ethoxylated alkylphenol sulfates; lignin sulfonates; petroleum sulfonates; alkyl aryl sulfonates such as alkyl-benzene sulfonates or lower alkylnaphtalene sulfonates, e.g., butyl-naphthalene sulfonate; salts of sulfonated naphthalene-formaldehyde condensates; salts of sulfonated phenol-formaldehyde condensates; more complex sulfonates such as the amide sulfonates, e.g., the sulfonated condensation product of oleic acid and N-methyl taurine; or the dialkyl sulfosuccinates, e.g., the sodium sulfonate or dioctyl succinate. Non-ionic agents include condensation products of fatty acid esters, fatty alcohols, fatty acid amides or fatty-alkyl- or alkenyl-substituted phenols with ethylene oxide, fatty esters of polyhydric alcohol ethers, e.g., sorbitan fatty acid esters, condensation products of such esters with ethylene oxide, e.g., polyoxyethylene sorbitan fatty acid esters, block copolymers of ethylene oxide and propylene oxide, acetylenic glycols such as 2,4,7,9-tetraethyl-5-decyn-4,7-diol, or ethoxylated acetylenic glycols. Examples of a cationic surface-active agent include, for instance, an aliphatic mono-, di-, or polyamine such as an acetate, naphthenate or oleate; or oxygen-containing amine such as an amine oxide of polyoxyethylene alkylamine; an amide-linked amine prepared by the condensation of a carboxylic acid with a di- or polyamine; or a quaternary ammonium salt.
[0108] Examples of inert materials include, but are not limited to, inorganic minerals such as kaolin, phyllosilicates, carbonates, sulfates, phosphates, or botanical materials such as cork, powdered corncobs, peanut hulls, rice hulls, and walnut shells.
[0109] The compositions comprising the silencing element(s) can be in a suitable form for direct application or as a concentrate of primary composition that requires dilution with a suitable quantity of water or other dilutant before application.
[0110] The compositions (including the transformed microorganisms) can be applied to the environment of an insect pest (such as a Pentatomidae plant pest such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest) by, for example, spraying, atomizing, dusting, scattering, coating or pouring, introducing into or on the soil, introducing into irrigation water, by seed treatment or general application or dusting at the time when the pest has begun to appear or before the appearance of pests as a protective measure. For example, the composition(s) and/or transformed microorganism(s) may be mixed with grain to protect the grain during storage. It is generally important to obtain good control of pests in the early stages of plant growth, as this is the time when the plant can be most severely damaged. The compositions can conveniently contain another insecticide if this is thought necessary. In an embodiment of the invention, the composition(s) is applied directly to the soil, at a time of planting, in granular form of a composition of a carrier and dead cells of a Bacillus strain or transformed microorganism of the invention. Another embodiment is a granular form of a composition comprising an agrochemical such as, for example, a herbicide, an insecticide, a fertilizer, in an inert carrier, and dead cells of a Bacillus strain or transformed microorganism of the invention.
VII. Plants, Plant Parts, and Methods of Introducing Sequences into Plants
[0111] In one embodiment, the methods of the invention involve introducing a polynucleotide into a plant. "Introducing" is intended to mean presenting to the plant the polynucleotide in such a manner that the sequence gains access to the interior of a cell of the plant. The methods of the invention do not depend on a particular method for introducing a sequence into a plant, only that the polynucleotides or polypeptides gain access to the interior of at least one cell of the plant. Methods for introducing polynucleotides into plants are known in the art including, but not limited to, stable transformation methods, transient transformation methods, and virus-mediated methods.
[0112] "Stable transformation" is intended to mean that the nucleotide construct introduced into a plant integrates into the genome of the plant and is capable of being inherited by the progeny thereof "Transient transformation" is intended to mean that a polynucleotide is introduced into the plant and does not integrate into the genome of the plant or a polypeptide is introduced into a plant.
[0113] Transformation protocols as well as protocols for introducing polypeptides or polynucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing polypeptides and polynucleotides into plant cells include microinjection (Crossway et al. (1986) Biotechniques 4:320-334), electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606, Agrobacterium-mediated transformation (U.S. Pat. No. 5,563,055 and U.S. Pat. No. 5,981,840), direct gene transfer (Paszkowski et al. (1984) EMBO J. 3:2717-2722), and ballistic particle acceleration (see, for example, U.S. Pat. Nos. 4,945,050; U.S. Pat. No. 5,879,918; U.S. Pat. No. 5,886,244; and, 5,932,782; Tomes et al. (1995) in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin); McCabe et al. (1988) Biotechnology 6:923-926); and Lecl transformation (WO 00/28058). Also see Weissinger et al. (1988) Ann. Rev. Genet. 22:421-477; Sanford et al. (1987) Particulate Science and Technology 5:27-37 (onion); Christou et al. (1988) Plant Physiol. 87:671-674 (soybean); McCabe et al. (1988) Bio/Technology 6:923-926 (soybean); Finer and McMullen (1991) In Vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh et al. (1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); U.S. Pat. Nos. 5,240,855; 5,322,783; and, 5,324,646; Klein et al. (1988) Plant Physiol. 91:440-444 (maize); Fromm et al. (1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren et al. (1984) Nature (London) 311:763-764; U.S. Pat. No. 5,736,369 (cereals); Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation of Ovule Tissues, ed. Chapman et al. (Longman, New York), pp. 197-209 (pollen); Kaeppler et al. (1990) Plant Cell Reports 9:415-418 and Kaeppler et al. (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 12:250-255 and Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda et al. (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference.
[0114] In specific embodiments, the silencing element sequences of the invention can be provided to a plant using a variety of transient transformation methods. Such transient transformation methods include, but are not limited to, the introduction of the protein or variants and fragments thereof directly into the plant or the introduction of the transcript into the plant. Such methods include, for example, microinjection or particle bombardment. See, for example, Crossway et al. (1986) Mol Gen. Genet. 202:179-185; Nomura et al. (1986) Plant Sci. 44:53-58; Hepler et al. (1994) Proc. Natl. Acad. Sci. 91: 2176-2180 and Hush et al. (1994) The Journal of Cell Science 107:775-784, all of which are herein incorporated by reference. Alternatively, polynucleotides can be transiently transformed into the plant using techniques known in the art. Such techniques include viral vector systems and the precipitation of the polynucleotide in a manner that precludes subsequent release of the DNA. Thus, the transcription from the particle-bound DNA can occur, but the frequency with which it is released to become integrated into the genome is greatly reduced. Such methods include the use of particles coated with polyethyleneimine (PEI; Sigma-Aldrich Corp., St. Louis, Mo., Catalog No. P3143).
[0115] In other embodiments, the polynucleotide of the invention may be introduced into plants by contacting plants with a virus or viral nucleic acids. Generally, such methods involve incorporating a nucleotide construct of the invention within a viral DNA or RNA molecule. Further, it is recognized that promoters of the invention also encompass promoters utilized for transcription by viral RNA polymerases. Methods for introducing polynucleotides into plants and expressing a protein encoded therein, involving viral DNA or RNA molecules, are known in the art. See, for example, U.S. Pat. Nos. 5,889,191, 5,889,190, 5,866,785, 5,589,367, 5,316,931, and Porta et al. (1996) Molecular Biotechnology 5:209-221; herein incorporated by reference.
[0116] Methods are known in the art for the targeted insertion of a polynucleotide at a specific location in the plant genome. In one embodiment, the insertion of the polynucleotide at a desired genomic location is achieved using a site-specific recombination system. See, for example, WO99/25821, WO99/25854, WO99/25840, WO99/25855, and WO99/25853, all of which are herein incorporated by reference. Briefly, the polynucleotide of the invention can be contained in a transfer cassette flanked by two non-recombinogenic recombination sites. The transfer cassette is introduced into a plant having stably incorporated into its genome a target site which is flanked by two non-recombinogenic recombination sites that correspond to the sites of the transfer cassette. An appropriate recombinase is provided and the transfer cassette is integrated at the target site. The polynucleotide of interest is thereby integrated at a specific chromosomal position in the plant genome.
[0117] The cells that have been transformed may be grown into plants in accordance with conventional ways. See, for example, McCormick et al. (1986) Plant Cell Reports 5:81-84. These plants may then be grown, and either pollinated with the same transformed strain or different strains, and the resulting progeny having constitutive expression of the desired phenotypic characteristic identified. Two or more generations may be grown to ensure that expression of the desired phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure expression of the desired phenotypic characteristic has been achieved. In this manner, the present invention provides transformed seed (also referred to as "transgenic seed") having a polynucleotide of the invention, for example, an expression cassette of the invention, stably incorporated into their genome.
[0118] As used herein, the term plant also includes plant cells, plant protoplasts, plant cell tissue cultures from which plants can be regenerated, plant calli, plant clumps, and plant cells that are intact in plants or parts of plants such as embryos, pollen, ovules, seeds, leaves, flowers, branches, fruit, kernels, ears, cobs, husks, stalks, roots, root tips, anthers, and the like. Grain is intended to mean the mature seed produced by commercial growers for purposes other than growing or reproducing the species. Progeny, variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced polynucleotides.
[0119] The present invention may be used for transformation of any plant species, including, but not limited to, monocots and dicots. Examples of plant species of interest include, but are not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, B. raga, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa (Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor, Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea batatus), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus casica), guava (Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, barley, vegetables, ornamentals, and conifers.
[0120] Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C. cantalupensis), and musk melon (C. melo). Ornamentals include azalea (Rhododendron spp.), hydrangea (Macrophylla hydrangea), hibiscus (Hibiscus rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia pulcherrima), and chrysanthemum.
[0121] Conifers that may be employed in practicing the present invention include, for example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata); Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska yellow-cedar (Chamaecyparis nootkatensis). In specific embodiments, plants of the present invention are crop plants (for example, corn, alfalfa, sunflower, Brassica, soybean, cotton, safflower, peanut, sorghum, wheat, millet, tobacco, etc.). In other embodiments, corn and soybean plants and sugarcane plants are optimal, and in yet other embodiments corn plants are optimal.
[0122] Other plants of interest include grain plants that provide seeds of interest, oil-seed plants, and leguminous plants. Seeds of interest include grain seeds, such as corn, wheat, barley, rice, sorghum, rye, etc. Oil-seed plants include cotton, soybean, safflower, sunflower, Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, mungbean, lima bean, fava bean, lentils, chickpea, etc.
VIII. Methods of Use
[0123] Methods of the invention comprise methods for controlling a pest (i.e., a Pentatomidae plant pest, such as, N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest). In one embodiment, the method comprises feeding to a pest a composition comprising a silencing element of the invention, wherein said silencing element, when ingested by a pest (i.e., a Pentatomidae plant pest including N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys), reduces the level of a target polynucleotide of the pest and thereby controls the pest. The pest can be fed the silencing element(s) in a variety of ways. For example, in one embodiment, a polynucleotide comprising the silencing element(s) is introduced into a plant. As the Pentatomidae plant pest such as, for example, a N. viridula, Acrosternum hilare, Piezodorus guildini, and/or Halymorpha halys plant pest feeds on the plant or part thereof expressing these sequences, the silencing element is delivered to the pest. When the silencing element is delivered to the plant in this manner, it is recognized that the silencing element can be expressed constitutively or alternatively, it may be produced in a stage-specific manner by employing the various inducible or tissue-preferred or developmentally regulated promoters that are discussed elsewhere herein. In specific embodiments, the silencing element(s) is expressed in the roots, stalk or stem, leaf including pedicel, xylem and phloem, fruit or reproductive tissue, silk, flowers and all parts therein or any combination thereof.
[0124] In another method, a composition comprising at least one silencing element of the invention is applied to a plant. In such embodiments, the silencing element can be formulated in an agronomically suitable and/or environmentally acceptable carrier, which is preferably, suitable for dispersal in fields. In addition, the carrier can also include compounds that increase the half-life of the composition. In specific embodiments, the composition comprising the silencing element is formulated in such a manner such that it persists in the environment for a length of time sufficient to allow it to be delivered to a pest. In such embodiments, the composition can be applied to an area inhabited by a pest. In one embodiment, the composition is applied externally to a plant (i.e., by spraying a field) to protect the plant from pests.
[0125] In certain embodiments, the constructs of the present invention can be stacked with any combination of polynucleotide sequences of interest in order to create plants with a desired trait. A trait, as used herein, refers to the phenotype derived from a particular sequence or groups of sequences. For example, the polynucleotides of the present invention may be stacked with any other polynucleotides encoding polypeptides having pesticidal and/or insecticidal activity, such as other Bacillus thuringiensis toxic proteins (described in U.S. Pat. Nos. 5,366,892; 5,747,450; 5,737,514; 5,723,756; 5,593,881; and Geiser et al. (1986) Gene 48:109), lectins (Van Damme et al. (1994) Plant Mol. Biol. 24:825, pentin (described in U.S. Pat. No. 5,981,722), and the like. The combinations generated can also include multiple copies of any one of the polynucleotides of interest. The polynucleotides of the present invention can also be stacked with any other gene or combination of genes to produce plants with a variety of desired trait combinations including, but not limited to, traits desirable for animal feed such as high oil genes (e.g., U.S. Pat. No. 6,232,529); balanced amino acids (e.g., hordothionins (U.S. Pat. Nos. 5,990,389; 5,885,801; 5,885,802; and 5,703,409); barley high lysine (Williamson et al. (1987) Eur. J. Biochem. 165:99-106; and WO 98/20122) and high methionine proteins (Pedersen et al. (1986) J. Biol. Chem. 261:6279; Kirihara et al. (1988) Gene 71:359; and Musumura et al. (1989) Plant Mol. Biol. 12:123)); increased digestibility (e.g., modified storage proteins (U.S. application Ser. No. 10/053,410, filed Nov. 7, 2001); and thioredoxins (U.S. application Ser. No. 10/005,429, filed Dec. 3, 2001)); the disclosures of which are herein incorporated by reference.
[0126] The polynucleotides of the present invention can also be stacked with traits desirable for disease or herbicide resistance (e.g., fumonisin detoxification genes (U.S. Pat. No. 5,792,931); avirulence and disease resistance genes (Jones et al. (1994) Science 266:789; Martin et al. (1993) Science 262:1432; Mindrinos et al. (1994) Cell 78:1089); acetolactate synthase (ALS) mutants that lead to herbicide resistance such as the S4 and/or Hra mutations; inhibitors of glutamine synthase such as phosphinothricin or basta (e.g., bar gene); and glyphosate resistance (EPSPS gene)); and traits desirable for processing or process products such as high oil (e.g., U.S. Pat. No. 6,232,529); modified oils (e.g., fatty acid desaturase genes (U.S. Pat. No. 5,952,544; WO 94/11516)); modified starches (e.g., ADPG pyrophosphorylases (AGPase), starch synthases (SS), starch branching enzymes (SBE), and starch debranching enzymes (SDBE)); and polymers or bioplastics (e.g., U.S. Pat. No. 5,602,321; beta-ketothiolase, polyhydroxybutyrate synthase, and acetoacetyl-CoA reductase (Schubert et al. (1988) J. Bacteriol. 170:5837-5847) facilitate expression of polyhydroxyalkanoates (PHAs)); the disclosures of which are herein incorporated by reference. One could also combine the polynucleotides of the present invention with polynucleotides providing agronomic traits such as male sterility (e.g., see U.S. Pat. No. 5,583,210), stalk strength, flowering time, or transformation technology traits such as cell cycle regulation or gene targeting (e.g., WO 99/61619, WO 00/17364, and WO 99/25821); the disclosures of which are herein incorporated by reference.
[0127] These stacked combinations can be created by any method including, but not limited to, cross-breeding plants by any conventional or TopCross methodology, or genetic transformation. If the sequences are stacked by genetically transforming the plants (i.e., molecular stacks), the polynucleotide sequences of interest can be combined at any time and in any order. For example, a transgenic plant comprising one or more desired traits can be used as the target to introduce further traits by subsequent transformation. The traits can be introduced simultaneously in a co-transformation protocol with the polynucleotides of interest provided by any combination of transformation cassettes. For example, if two sequences will be introduced, the two sequences can be contained in separate transformation cassettes (trans) or contained on the same transformation cassette (cis). Expression of the sequences can be driven by the same promoter or by different promoters. In certain cases, it may be desirable to introduce a transformation cassette that will suppress the expression of the polynucleotide of interest. This may be combined with any combination of other suppression cassettes or overexpression cassettes to generate the desired combination of traits in the plant. It is further recognized that polynucleotide sequences can be stacked at a desired genomic location using a site-specific recombination system. See, for example, WO99/25821, WO99/25854, WO99/25840, WO99/25855, and WO99/25853, all of which are herein incorporated by reference.
[0128] The following examples are offered by way of illustration and not by way of limitation.
EXPERIMENTAL
Example 1
Selection of DNAs
[0129] DNAs were selected by two different methods. cDNA libraries were constructed using the SMART cDNA Synthesis Kit (Clontech) from mRNA isolated from second instar southern green stinkbug (Nezara viridula (Linnaeus)) or mRNA isolated from the head of second and third instar southern green stinkbugs. Select clones were sequenced and subject to BLAST analysis to create an expressed sequence tag (EST) library. The library was BLAST queried with sequences of interest and southern green stinkbug homologs were identified.
[0130] Additionally, a transcriptome of second instar southern green stinkbug was created using Illumina sequencing. Sequences were assembled using Oases (Schulz et al. 2012) and annotated using a proprietary functional annotation pipeline. The transcriptome was BLAST queried with sequences of interest and southern green stinkbug homologs were identified. DNAs were synthesized using RT-PCR. In brief, mRNA from second instar southern green stinkbug was reverse transcribed using the SuperScript.RTM. III First-Strand Synthesis System (Invitrogen; catalog # 18080-051) using random primers. Sequences of interest were PCR amplified using gene specific primers and ReadyMix Taq PCR Reaction Mix (Sigma-Aldrich Corp., St. Louis, Mo., Catalog No. P4600). The resulting DNA was analyzed on TAE agarose gels and cloned into pCR2.1 (Invitrogen). The resulting clones were sequenced and sequence verified clones were used to produce double stranded RNA.
Example 2
Production of Double Stranded RNA
[0131] Either EST clones or clones derived from RT-PCR and cloned into pCR2.1 were used as template for PCR. Sequences flanking the insert were fused with the T7 promoter sequence (TAATACGACTCACTATAGGG, SEQ ID 1) and used to generate primers (Table 1) to PCR amplify DNA. This PCR amplified DNA was used to synthesize double stranded RNA (dsRNA) using the MEGAscript.RTM. kit (Ambion, Catalog No. AM1334) following the manufacturer's protocol. Products of PCR as well as dsRNA synthesis were run on 1% agarose gel to verify amplification. With the EST clones, after the initial screening, fragments of the EST clones were amplified using gene specific primers fused with the T7 promoter sequence.
TABLE-US-00001 TABLE 1 EST TAATACGACTCACTATAGGGATGCCCGGGA SEQ ID 2 PRIMER 1 ATTCGGCCATTACG EST TAATACGACTCACTATAGGGCGCGCCAAAC SEQ ID 3 PRIMER 2 GAATGGTCTAGAAAGC pCR2.1 TAATACGACTCACTATAGGGCTAGTAACGG SEQ ID 4 Primer 1 CCGCCAGTGTGCTG pCR2.1 TAATACGACTCACTATAGGGGGCCGCCAGT SEQ ID 5 Primer 1 GTGATGGATATCTG
Example 3
Selected Clones
[0132] The following clones (Table 2) were selected for use in the bioassay.
TABLE-US-00002 TABLE 2 length SEQ ID NO Clone name bp DESCRIPTION Corresponding full-length DNA SEQ ID NO ta01222.002 Fragment 1 362 WD domain, G-beta ta01222.002_nezvi 21 repeat protein SEQ ID 6 SEQ ID NO ta01222.002 Fragment 2 369 WD domain, G-beta ta01222.002_nezvi 22 repeat protein SEQ ID 6 SEQ ID NO ta01222.002 Fragment 3 374 WD domain, G-beta ta01222.002_nezvi 23 repeat protein SEQ ID 6 SEQ ID NO ta02948.001 Fragment 1 355 Coatomer protein ta02948.001_nezvi 24 complex, subunit beta 1, SEQ ID 7 SEQ ID NO ta02948.001 Fragment 2 382 Coatomer protein ta02948.001_nezvi 25 complex, subunit beta 1, SEQ ID 7 SEQ ID NO ta02948.001 Fragment 3 376 Coatomer protein ta02948.001_nezvi 26 complex, subunit beta 1, SEQ ID 7 SEQ ID NO ta00781.001 Fragment 1 340 Coatomer, gamma ta00781.001_nezvi 27 subunit, SEQ ID 8 SEQ ID NO ta00781.001 Fragment 2 383 Coatomer, gamma ta00781.001_nezvi 28 subunit, SEQ ID 8 SEQ ID NO ta00781.001 Fragment 3 388 Coatomer, gamma ta00781.001_nezvi 29 subunit, SEQ ID 8 SEQ ID NO nezvi_22408.WL.1 412 Ryanodine receptor nezvi_22408.WL.1 30 Fragment 3 SEQ ID 9 SEQ ID NO nezvi_22408.WL.1 342 Ryanodine receptor nezvi_22408.WL.1 31 Fragment 6 SEQ ID 9 SEQ ID NO nezvi_22408.WL.1 432 Ryanodine receptor nezvi_22408.WL.1 32 Fragment 7 SEQ ID 9 SEQ ID NO nezvi_22408.WL.1 367 Ryanodine receptor nezvi_22408.WL.1 33 Fragment 9 SEQ ID 9 SEQ ID NO nezvi_22408.WL.1 396 Ryanodine receptor nezvi_22408.WL.1 34 Fragment 14 SEQ ID 9 SEQ ID NO inv2c.pk011.b22.f 965 26S proteasome non- inv2c.pk011.b22.f 10 ATPase regulatory SEQ ID 10 subunit 7 SEQ ID NO inv2c.pk011.b22.f 557 26S proteasome non- inv2c.pk011.b22.f 35 Fragment 1 ATPase regulatory SEQ ID 10 subunit 7 SEQ ID NO inv2c.pk011.b22f 530 26S proteasome non- inv2c.pk011.b22.f 36 Fragment 2 ATPase regulatory SEQ ID 10 subunit 7 SEQ ID NO inv2c.pk020.119.f 924 Proteasome subunit alpha inv2c.pk020.119.f 11 type-2 SEQ ID 11 SEQ ID NO inv2c.pk020.119.f 544 Proteasome subunit alpha inv2c.pk020.119.f 37 Fragment 1 type-2 SEQ ID 11 SEQ ID NO inv2c.pk020.119.f 587 Proteasome subunit alpha inv2c.pk020.119.f 38 Fragment 2 type-2 SEQ ID 11 SEQ ID NO inv3c.pk002.i8.f 946 26S protease regulatory inv3c.pk002.i8.f 12 subunit 8-like SEQ ID 12 SEQ ID NO inv3c.pk002.i8.f Fragment 550 26S protease regulatory inv3c.pk002.i8.f 39 1 subunit 8-like SEQ ID 12 SEQ ID NO inv3c.pk002.i8.f Fragment 580 26S protease regulatory inv3c.pk002.i8.f 40 2 subunit 8-like SEQ ID 12
Example 4
Stinkbug Collection and Bioassay
[0133] Southern green stinkbug eggs were collected from a laboratory maintained colony and kept in an incubator at 27.degree. C. with 65% relative humidity. After hatching, the insects were allowed to feed on green beans with or without the addition of green peas. Thereafter freshly molted second instar stinkbugs were transferred onto a modified artificial Lygus diet (Bioserve; Lygus Hesperus diet, catalog # F9644B) supplemented either with dsRNA or water (as control). Five second instar stinkbugs per bioassay were fed with 200 ppm dsRNA supplemented in the artificial diet. The diet with dsRNA or water was changed every two days and the bioassay observations on stunting and/or mortality were taken on day 7. All insects were weighed at the conclusion of the assay.
Example 5
Results of dsRNA Feeding
[0134] Five second instars per experiment were either fed upon a diet mixed with select dsRNA or water (control) and each experiment was replicated two to six times. The number of replicates is reported in column four (labeled N) of Table Three. Feeding of select dsRNAs to second instar southern green stinkbug significantly inhibited the growth when compared with controls. At the conclusion of the bioassay (day 7), the control stinkbugs developed into late third instars and weighed on an average 11.3.+-.0.9 mg (group A) or 8.4.+-.0.8 mg (group B). It is understood by those in the field that bioassay data can vary depending upon the time the assays are run which explains the differences in control weight. Insects fed selected dsRNA developed poorly and were still in second instar stage and only developed to 46-66% of the control weight. See Table 3.
TABLE-US-00003 TABLE 3 SEQ ID % control DNA NO Group weight N ta01222.002 Fragment 1 21 A 66 4 ta01222.002 Fragment 2 22 A 59 4 ta01222.002 Fragment 3 23 A 51 4 ta02948.001 Fragment 1 24 A 49 4 ta02948.001 Fragment 2 25 A 57 4 ta02948.001 Fragment 3 26 A 63 4 ta00781.001 Fragment 1 27 A 52 4 ta00781.001 Fragment 2 28 A 59 4 ta00781.001 Fragment 3 29 A 58 4 nezvi_22408.WL.1 Fragment 7 32 A 58 3 inv2c.pk011.b22.f 10 B 46 2 inv2c.pk011.b22.f Fragment 1 35 B 48 2 inv2c.pk011.b22f Fragment 2 36 B 63 6 inv2c.pk020.119.f 11 B 48 2 inv2c.pk020.119.f Fragment 1 37 B 61 2 inv2c.pk020.119.f Fragment 2 38 B 50 6 inv3c.pk002.i8.f 12 B 49 2 inv3c.pk002.i8.f Fragment 1 39 B 52 2 inv3c.pk002.i8.f Fragment 2 40 B 55 2 nezvi_22408.WL.1 Fragment 3 30 B 54 6 nezvi_22408.WL.1 Fragment 6 31 B 55 6 nezvi_22408.WL.1 Fragment 7 32 B 47 6 nezvi_22408.WL.1 Fragment 9 33 B 57 6 nezvi_22408.WL.1 Fragment 14 34 B 60 6
Example 6
Construction of Hairpin Constructs for Plant Transformation
[0135] A selection of the fragments that showed activity in the in vitro insect assay were used to make constructs for plant transformation. Fragments were amplified using gene specific primers flanked by sequence encoding an ATT B4 recombinase sequence (CAACTTTGTATAGAAAAGTTG; SEQ ID 13) on one side and an ATT B3 recombinase sequence (CAACTTTGTATAATAAAGTTG; SEQ ID 14) on the other side. The resulting amplified DNA was cloned into pCR2.1 and clones were sequenced. Sequence verified clones were recombined into plasmid PHP36164 (FIG. 1, SEQ ID 15) using a BP Gateway Reaction (Invitrogen). The resulting clones were then recombined into PHP59032 (FIG. 2, SEQ ID 16) using a LR Gateway Reaction (Invitrogen). The resulting plasmid contains a hairpin-structured transcript controlled by the seed specific promoter kit. The cassette comprising a promoter and terminator separated by a unique Not I restriction endonuclease site comprises the KTi3 promoter, a unique Not I restriction endonuclease site, and the KTi3 terminator region. This cassette comprises about 2088 nucleotides of the KTi3 promoter, a unique Not I restriction endonuclease site, and about 202 nucleotides of the KTi3 transcription terminator. The gene encoding KTi3 has been described (Jofuku, K. D. and Goldberg, R. B., Plant Cell 1:1079-1093 (1989)).
[0136] It is understood that such a hairpin-structured transcript will form a dsRNA in vivo. The plasmid also contains a promoterless Glycine max acetolactate synthase (P178S) which is useful as a selectable marker. These two cassettes are flanked by FRT1 and FRT87 sites that are required for site specific integration during soybean transformation. An example of such a plasmid is PHP62151 (FIG. 3, SEQ ID 17).
Example 7
Transformation and Regeneration of Soybean (Glycine max)
[0137] Transgenic soybean lines are generated by the method of particle gun bombardment (Klein et al., Nature (London) 327:70-73 (1987); U.S. Pat. No. 4,945,050) using a BIORAD Biolistic PDS1000/He instrument and either plasmid or fragment DNA.
[0138] Integration of DNA into the soybean genome after particle gun-mediated transformation may be random, or it may be through site-specific integration (SSI), achieved by recombinase-mediated cassette exchange (RMCE) at a previously created transgenic target site (U.S. Pat. No. 7,102,055 issued Sep. 5, 2006). Recombinase-mediated DNA cassette exchange RMCE using different recombinase systems have been achieved successfully in several plants (Nanto K, Yamada-Watanabet K, Ebinuma H(2005) Agrobacterium-mediated RMCE approach for gene replacement. Plant Biotechnol J, 3: 203-214; Louwerse JD et al. 2007. Stable recombinase-mediated cassette exchange in Arabidopsis using Agrobacterium tumefaciens. Plant Physiol 145: 1282-1293; Li Z. et al. 2009, Site-specific integration of transgenes in soybean via recombinase-mediated DNA cassette exchange. Plant Physiol 151: 1087-1095). Groups of transgenes can be stacked to the same site through multiple rounds of RMCE (Li et al 2010, Published online before print August 2010, doi:10.1104/pp.110.160093; Plant Physiology October 2010 vol. 154 no. 2 622-631). Taking advantage of reversible DNA cassette exchange in RMCE, an RMCE product can be used as a new target for subsequent SSI transformation.
[0139] The transgenic target site for RMCE may contain a promoter followed by recombination sites surrounding a selectable marker gene such as the hygromycin phosphotransferase (HPT) gene, with or without additional components. After bombardment with donor DNA, the target DNA previously integrated into the soybean genome recombines with the donor DNA at recombination sites such as FRT1 and FRT87 with the help of a transiently expressed recombinase such as the FLP recombinase. The portion of the DNA cassette in the target which contains the original selectable marker gene flanked by dissimilar recombination sites such as FRT1 and FRT87 is replaced by the donor DNA cassette flanked by the same FRT1 and FRT87 sites, resulting in site-specific integration of the donor cassette to the exact same genomic site of the target. The promoter existing upstream of the recombination sites in the transgenic target remains after RMCE to regulate expression of the new selectable marker gene delivered to the site as part of the donor cassette. Successful RMCE events may be identified by chemical selection for cells expressing the selectable marker gene of the donor.
Culture Media and Stock Solutions
[0140] The following stock solutions and media are used for transformation and regeneration of soybean plants:
Stock Solutions:
[0141] Sulfate 100.times.Stock:
[0142] 37.0 g MgSO.sub.4.7H.sub.2O, 1.69 g MnSO.sub.4.H.sub.2O, 0.86 g ZnSO.sub.4.7H.sub.2O, 0.0025 g CuSO.sub.4.5H.sub.2O
Halides 100.times.Stock:
[0143] 30.0 g CaCl.sub.2.2H.sub.2O, 0.083 g KI, 0.0025 g CoCl.sub.2.6H.sub.2O
P, B, Mo 100.times.Stock:
[0144] 18.5 g KH.sub.2PO.sub.4, 0.62 g H.sub.3BO.sub.3, 0.025 g Na.sub.2MoO.sub.4.2H.sub.2O
Fe EDTA 100.times.Stock:
[0145] 3.724 g Na.sub.2EDTA, 2.784 g FeSO.sub.4.7H.sub.2O
2,4-D Stock:
[0146] 10 mg/mL Vitamin
B5 vitamins, 1000.times.Stock: 100.0 g myo-inositol, 1.0 g nicotinic acid, 1.0 g pyridoxine HCl, 10 g thiamine.HCL.
Media (per Liter):
SB199 Solid Medium:
[0147] 1 package MS salts (Gibco/BRL--Cat. No. 11117-066), 1 mL B5 vitamins 1000.times.stock, 30 g sucrose, 4 ml 2, 4-D (40 mg/L final concentration), pH 7.0, 2 g Gelrite.TM.
SB1 Solid Medium:
[0148] 1 package MS salts (Gibco/BRL--Cat. No. 11117-066), 1 mL B5 vitamins 1000.times.stock, 31.5 g Glucose, 2 mL 2, 4-D (20 mg/L final concentration), pH 5.7, 8 g TC agar
SB196:
[0149] 10 mL of each of the above stock solutions 1-4, 1 mL B5 Vitamin stock, 0.463 g (NH.sub.4).sub.2SO.sub.4, 2.83 g KNO.sub.3, 1 mL 2,4 D stock, 1 g asparagine, 10 g sucrose, pH 5.7
SB71-4:
[0150] Gamborg's B5 salts, 20 g sucrose, 5 g TC agar, pH 5.7.
SB103:
[0151] 1 pk. Murashige & Skoog salts mixture, 1 mL B5 Vitamin stock, 750 mg MgCl.sub.2 hexahydrate, 60 g maltose, 2 g Gelrite.TM., pH 5.7.
SB166:
[0152] SB103 supplemented with 5 g per liter activated charcoal.
Soybean Embryogenic Suspension Culture Initiation:
[0153] Pods with immature seeds from available soybean plants 45-55 days after planting are picked, removed from their shells and placed into a sterilized magenta box. The soybean seeds are sterilized by shaking them for 15 min in a 5% Clorox solution with soap or other surfactants at 1 drop per 100 mL solution. Seeds are rinsed with sterile distilled water, and those less than 4 mm are placed on a sterile surface under microscope. The small ends of seeds are cut, and the cotyledons are pressed out of the seed coats. Cotyledons are transferred to plates containing SB199 medium (25-30 cotyledons per plate) for 2 weeks, then transferred to SB1 for 2-4 weeks. Plates are wrapped with fiber tape and cultured for 8 weeks in growth chamber room with temperature set at 24.4-26.degree. C. and light on a 16:8 h day/night photoperiod at an intensity of 45-65 .mu.E/m2/s . After this time, secondary embryos are cut and placed into SB196 liquid medium for 7 days.
Culture Conditions:
[0154] Soybean embryogenic suspension cultures are maintained in 50 mL liquid medium SB196 on a rotary shaker at a speed of 100-150 rpm. The cultures are set in a growth chamber with temperature set at 24.4-26.degree. C. and light on a 16:8 h day/night photoperiod at intensity of 80-100 .mu.E/m2/s for liquid culture and 80-120 .mu.E/m2/s for maturation and germination. Cultures are subcultured every 7-14 days by inoculating up to .sup.1/.sub.2 dime size quantity of tissue into 50 mL of fresh liquid SB196.
Preparation of DNA for Bombardment:
[0155] In particle gun bombardment procedures it is possible to use purified 1) entire plasmid DNA; or 2) DNA fragments containing only the recombinant DNA expression cassette(s) of interest. For every bombardment experiment, 85 .mu.L of suspension is prepared containing 1 to 90 pg of plasmid DNA per base pair of DNA. To prepare for an SSI transformation, the donor plasmid is mixed with plasmid DNA containing the FLP recombinase gene cassette in a ratio such as 3:1. Both recombinant DNA plasmids are co-precipitated onto gold particles as follows. The DNAs in suspension are added to 50 .mu.L of a 10-60 mg/mL 0.6 .mu.m gold particle suspension and then combined with 50 .mu.L CaCl.sub.2 (2.5 M) and 20 .mu.L spermidine (0.1 M). The mixture is vortexed for 5 sec, spun in a microcentrifuge for 5 sec, and the supernatant removed. The DNA-coated particles are then washed once with 150 .mu.L of 100% ethanol, vortexed and spun in a microcentrifuge again, then resuspended in 85 .mu.L of anhydrous ethanol. Five .mu.L of the DNA-coated gold particles are then loaded onto each macrocarrier disk.
Tissue Preparation and Bombardment with DNA:
[0156] Approximately 100-200 mg of two-week-old suspension culture is placed in an empty 60 mm.times.15 mm petri plate and the residual liquid removed from the tissue using a pipette. The tissue is placed about 3.5 inches away from the retaining screen. Membrane rupture pressure is set at 650 psi and the bombardment chamber of the particle gun is evacuated to -28 inches of Hg prior to bombardment. Typically, each plate of tissue is bombarded once.
Selection of Transformed Embryos and Plant Regeneration:
[0157] After bombardment, tissue from each bombarded plate is divided and placed into one to two flasks of SB196 liquid culture maintenance medium per plate of tissue, one flask per 100 mg tissue. Seven days post bombardment, the liquid medium in each flask is replaced with fresh SB196 culture maintenance medium supplemented with 100 ng/ml selective agent (selection medium). For selection of transformed soybean cells after random transformation or RMCE, the selective agent used can be a sulfonylurea (SU) compound with the chemical name, 2-chloro-N-((4-methoxy-6 methyl-1,3,5-triazine-2-yl)aminocarbonyl)benzenesulfonamide (common names: DPX-W4189 and chlorsulfuron). Chlorsulfuron is the active ingredient in the DuPont sulfonylurea herbicide, GLEAN.RTM.. The selection medium containing SU is replaced every two weeks for 8 weeks. After the 8 week selection period, islands of green, transformed tissue are observed growing from untransformed, necrotic embryogenic clusters. The putative transgenic randomly integrated or RMCE events are isolated and kept in SB196 liquid medium with SU at 100 ng/ml for another 5 weeks with media changes every 1-2 weeks to generate new, clonally propagated, transformed embryogenic suspension cultures. Embryos spend a total of around 13 weeks in contact with SU. Suspension cultures are subcultured and maintained as clusters of immature transgenic embryos and also regenerated into whole plants by maturation and germination of individual somatic embryos.
[0158] Transgenic somatic embryos become suitable for germination after four weeks on maturation medium (1 week on SB166 followed by 3 weeks on SB103). They are then removed from the maturation medium and dried in empty petri dishes, or with a small amount of medium, for approximately seven days. The dried embryos are then planted in SB71-4 medium where they are allowed to germinate under the same light and temperature conditions as described above. Germinated embryos are allowed to develop into small plantlets and are then transferred to potting medium and grown to maturity for seed production.
Example 8
Bioassay of Soybean Plants
[0159] After transformation, transgenic soybean plants will be grown in the greenhouse and seeds will be harvested from these transformed plants and designated as T1 seeds. T1 seeds will be chipped manually and DNA extracted from the chips will be used to determine zygosity using a quantitative PCR assay. Homozygous seeds will be sown in 2.5 inch pots, maintained in the growth chambers in 16:8 (light: dark) cycle in an insecticide free environment. After about 4 weeks, these plants will be transplanted to a larger pot and maintained at 14:10 (light: dark) cycle for 2 weeks. After two weeks, the plants will be maintained in 12:12 (light: dark) cycle to induce flowering and delivered for bioassay at R3 stage. Fertilizer will be provided as needed and chambers are maintained at 50% relative humidity. Ten second instar southern green stinkbugs will be used to infest soybean pods at various stages: R3 (beginning pod), R4 (full pod), R5 (beginning seed), R6 (full seed) and R7 (beginning maturity). Insects will be maintained on the pods using enclosures. Developmental stage, stunting (% control as outlined in example 5) and mortality will be recorded at 8-10 days after initial infest of the transgenic soybean pods.
Example 9
Alternative Sequences for nezvi_22408.WL.1 and inv2c.pk011.b22.f
[0160] A transcriptome is a collection of all the transcripts present in a given cell. As such, a transcriptome includes alternative spliced variants that are present within the cell. For nezvi_22408.WL.1 two alternatively spliced variants are predicted: nezvi_22408.WL.2 (SEQ ID 18) and nezvi_22408.WL.3 (SEQ ID 19). RT-PCR as described in Example 1 along with primers that were designed to amplify transcript specific sequences as well as cloning and sequence verification show that all three transcripts are real and exist in second instar southern green stinkbug mRNA.
[0161] Similarly, it is understood that cDNA library sequences may not encode the entire transcript. The sequence for the clone inv2c.pk011.b22.f (SEQ ID 10) was used to BLAST query the transcriptome and a longer sequence named nezvi_3755.WL.1 (SEQ ID 20) was found.
TABLE-US-00004 SEQUENCES SEQ ID 1 T7 promoter sequence TAATACGACTCACTATAGGG SEQ ID 2 EST PRIMER 1 TAATACGACTCACTATAGGGATGCCCGGGAATTCGGCCATTACG SEQ ID 3 EST PRIMER 2 TAATACGACTCACTATAGGGCGCGCCAAACGAATGGTCTAGAAAGC SEQ ID 4 pCR2.1 Primer 1 TAATACGACTCACTATAGGGCTAGTAACGGCCGCCAGTGTGCTG SEQ ID 5 pCR2.1 Primer 1 TAATACGACTCACTATAGGGGGCCGCCAGTGTGATGGATATCTG SEQ ID 6 ta01222.002_nezvi cagagatggcagagatgggaagttgatagccagatctgacagagtcaagtgtgttgacttacatccatcagaac- catggatgttggcttcttt atacaatggaaacgttcatatttggaatcatgagacccagcagctagtaaagtcttttgaagtatgcgaccaac- cagttcgtgctgcagtattt gttcctcgcaagaactggattgtaacagggtcagatgatatgcagatcagagtttttaattacaatactcttga- aagagtaaatgcatttgaagc tcattcagactatgtcagatgtatagcagttcacccagcccatccttatattctgacatcatcagatgatatgt- taatcaaattgtggaattggtct aaggcttgggtctgccaacaaatatttgaaggacatacccattatgtaatgcaagttgttataaatccaaaaga- taataatacatttgcatctgct tcattagatcggactgttaaagtttggcagttaggctctgctgctccaaattttactttagaaggtcatgaaaa- aggagttaattctgtcgattatt atcatggtggtgacaaaccttatctcatatctggcgccgacgatcatcttgtcaaaatatgggattatcaaaat- aagacttgtgttcaaaccttg gagggccatgcccagaatattactgcagtttgttttcacactgaactacctattataataactgggtcagaaga- tggaactgttcgattatggca ctcagcaacttacagattggaatcatctcttaactatggcctagaacgtgtttggactattgcgaggctgaaag- gatcaaacaatatagctcttg gatatgatgaagggagtatcatggtgaagataggacgtgaagaaccagcaatttcaatggatgtgaatggtgaa- aaaatagtttgggccag acattctgaaattgaacaggtaaacttgaagcaagtttcaggagaagtaagagatggcgaacgtcttgctttag- ctccaaaagaaatgggac catgtgaaatatatcctcaaagtatttcacataatccaaatggaagatttgtcgttgtttgtggagatggtgaa- tacataatttatactgctatggct ttaagaaacaaaagttttggatcagcccaagaatttgtatgggcacaagatagttctgactatgctataagaga- aggaacatctactgtaaaac tatttagacagttcaaggagcgcaagacacttaagccagagtttggtgctgaaggtatatttggtggacaattg- cttggtgtcagatcagtctc aggattatgtttatatgattgggaaactctggaattaatcagaagaatagaaattcaggcaaaatctctccatt- ggtctgattctggacatcttctt gctattgtaacggatgattcctattatatattgaagtatgattcatccgcaatcgccagtgctcaagagagaac- tcctgatggtgttgaagctgc attttctcttgtcggagaagtaaatgacacagtaaagacaggtttgtgggttggcgattgttttatttacacca- atgctgttgggcgaataaattat tacgttggaggagaaatagtgactgttgctcacttggattgcactatgtacctgttgggatatgtggctaggca- aaatcttttatacctttctgata aacatcacaatattgtttgttatacattattactttctgttcttgaatatcaaacagctgttatgagaggagat- tttgaaacagctgaccgtgtgttgc caacaattccagttcagcatcgttcccggtagcccacttcttggaaaaacagggctttaaaaaagaagctctgg- ctgtatctactgatccaga acataaatttgaattagctcttggactaaaagagctcgatacggttgttcagttagctgaggaaataggtagca- cagccaagtggggtcaag ccgctgaattagcaacgagacaagccaggcttgatgttgcgcaagcagctcttcacagagcccaacattatggt- ggacttctgcttctctcca catcagcaggaaatcgggaaatgatggaaaaacaggaaagagttcaggagaaaatggaaaaaataatgttagct- tccttgcatatttcctgc ttggagaccttgccaaatgtcttcaaattcttattgacactgatcgcattccagaagctgccttttttgccagg- acatatttgccgagtgaggttc ctcgagttgttgggttatggcgaggtttagcaaaggcaggacagagccttgcagatccttcgcagtatctaatc- tctttccaggttatgcagat gattaaaaactgaacagtatttagcaaagaatcctgtgtgactaaaccagatatatcataaaaatattaaggtt- atatttttattagtattattcat atatattatgtatattatatagcatgtaattgggtacttgagcagaaaaaaatacatgtcaatttgagacatag- tagaatataagtgacaaagagc atatataacattgagatagcattttttaaattacaaaaaaaagagctcatatttgactaaaaacttgaaataac- agtgtgcctgggggctaccaa agtggggctggggtgtcctgagaacacacctgaaaaatattttagtatgaatgaaatttctaggtaatgaaaaa- atatatcaatatacattttattt gtaaaaaaaaagtggaaaaacataaaatgtaattttcatctaaaagaattcttagtgtacatttataaaaatgg- gccattattaaattatttcataaa gccatgaaaattctatgtagagatttttttttaaactttcgagatacagaggtttgactattcttcaggtctaa- ccaatatttttgtttgctagaaacaa cctaatcgtaat SEQ ID 7 ta02948.001_nezvi aatattaaattatgagagttttttttgttgatatgaaataacaagtgcttggctgttttatttcccaaagaagt- attgagtgaataaatatcaagatatt gaattataatttcctatttaaggatggctgtggtggaacaaccttgttatactctgatcaattttccatctgat- ttagagcctcctaatgaaatgcag ctaaaatctgatttagaaaatggagacactaaagcgaaaattgaagctttgaaaaatattattcatttaattgc- aaatggagagcgtctacctgg tttacttatgcatatcatacgttttgttttgccatcacaagaccataccataaaaaaattactgcttatatttt- gggaaatcgttcctaaaactactcc agatggcaaacttctccaggaaatgattttggtttgtgatgcctatcggaaggacttacaacatcctaatgaat- ttgtcaggggatctacattac gttttctatgtaaacttaaagaacctgaattgatgagcctttaatgcctgctataagagcttgtttagagcatc- gggtttcatatgtacgaagaaa tgcagtacttgcaatatttaccatttataggaattttgaattatagctcctgatgcaccagaacttattgctaa- tttcttagatggggagcaagac atgtcatgtaaaagaaatgattataatgctcctacatgctgaccaagaacgtgccttatcctacttagcttcat- gtcttgatcaagtgacttcctt tggcgatatacttcaattagttattgttgaattaatttataaggtttgccatgctaacccttctgaacgttctc- gatttatacgttgcatttataatttact caattctaacagtcctgctgtgcgatatgaagctgctggaactttaatcacactttcgaatgctcctactgcaa- taaaagctgctgcttcttgtta cattgatttgataataaaggaaagtgataataatgttaaattaattgtattagatcgtatatatctttaaaaga- aattcctactcatgaacgggttct tcaagatttagttatggatatattacgtgtgctagccagtcctgacatggaagtaaagaaaaaagccttaagcc- tagcactggatctcactactt cacggtgtgttgaagaaatggttttaatgttaaaaaaagaagttgctaagacacataacttgacagaacatgaa- gatgctggaaaatatcgtc aacttcttgttagaactcttcattcctgttgcatgaagtttccagatgttgctgcttcagttataccagtatta- atggaatttctctcagatacaagtg aactagatcgtatgatgttatatatttgtccgagaagcaattcataagtttgattctttaagggttttgatcat- agagaaattattagaagcgtttc caaccataaaatctatgaaagttcatcgagctgctctttggatattgggtgaatatactacttcagttacagat- attaaagaagtcatgaaacaa ataaaacatgcccttggagagataccacttgtcgatgatgaaataaaaagagcttctggagagaaagttgagga- agttgatcatcgagatca agtaaaactggttacatctgatggaacatatgctacacaatcaatatttaacaccattctggcaattaaaaaag- aggatcgacctcctctcaga caatacttgattgatggagacttttttattggtgtatctgtggcttctacgcttgtgaaattagcattacgtta- taaagagcttgttcagcaggaaaa tatgtaccataaattttttgctgagtgtatgctaatcatttcatctatagttcgtctgggtaaatctggatatc- cttcgaaacagctgagctatgatg attatgaacgaatgttactttgtctaaaggttctctctgaaaataatgcacctattgtaaaaattttcaacact- gattgtcgcaatgctcttgctaata tgttagttgctcaacagaatgaggagtactcacttattaaggccaaagaaaaatccgtccataccatccaagtt- gatgatcctgtatcatttttac aattatcaacgatacgatcatctgattttggttcagaaaatgtttttgagcttagtttaaatcaagctgtcggg- gggccaaatacagctacaaac acagctgaacttccattttcagccagtaaattgaataaagtaactcagctgacagggttttcagatccagttta- tgcagaagcatatgttcatgtc aaccagtatgacattgtacttgatgttttgatcgttaatcaaacaggtgatacacttcaaaactgtacgcttga- actagcaactcttggtgatctg aagcttgtcgaaaaacctcaaccttgtgttctagctccttatgatttctgcaacattaaagcaaatgtgaaggt- tgcatctactgaaaatggaatc atatttggcaatattgtttatgatattagtggagctgcttctgacagaaacgttgttgttcttaatgatatcca- tatagatatcatggattatattgttc ctgctatctgttcagacacagaatttcgtcagatgtgggctgaatttgagtgggaaaacaaggtgtcagttaac- acttatttggtcgatcttcatg aatatcttggccatttattgaagagcactaatatgaaatgtttaacaccagaaaaagctctatgtgggcaatgt- ggatttatggctgctaatatgt atgcccgttcaatttttggtgaagatgcacttgctaacttaagcatcgaaaaaccgtttaataagcctaatgca- cctgttactgggcatattagaa tcagagctaaaagccagggtatggcattaagataggagataaaattaatatgacccagaagaaacctactatca- tggctcagtgaaacata atagtttcattgttaaaatgcatttcaagattgtttagactttttatatctattggtttataagtatttggaat- tatgggattcacaactctgaatttgttaa agtattttaaatcaagttatcaaaaattatttttacttcaatctaatagttgtacattattattagatgtgagt- acctacaaatatatagattttttgtacct ttctatgacttattaaaatatttcatttgatgtacattatatattgttgacctaaattaaaaaagcttatgtat- cttattcttaaatttgtttttattattctt agaataatgctattatattttgtgatatctcaatttgaaaatagtatatatgtgtgtgtgttaatatacatgtg- tatattattaatattctaataataaatatt tatatttaaaagtcagtaaaattatatgtatgtttgtataactatactgggtgtcctgtaaatagagagacatt- tttgtaggagttatagcagatctca agaactacaaaaaaattcatataaacaatgggccttaattttcagctgaaaagtaggaaattcaatttcttttt- ttttagcaacctacacaaaattta cctcaaattaaaatacagtttgtcatcgttaaaaccatttaaagaggactttaaaatggctaaattaagcttaa- aaaaatcataaataactagtga tttttttttgcaattattcatcatctaaagtggtactttgtttttaataagatctatattattggaattaattc- atacttttaagtgatgcaataactgtttct agtcgtaacatttacatttaaaataaagaaacctgctgattgcttcaattattttcatgaaaacgtaaatatta- ggatcaggaacatttattttatcact tataaacacgaatctattcctataatataatattcctataatatagatcttagtaaaaaacgaatttttagtac- cactttagatgatgaaccatcgca aaaacattattttaaacttttttttaaaatttttatttatgcatttatatttattataatacatcccagatcat- ggaaaaacatgaaattaaagttttgcaac agtttattgtagtctttttgctgaaaatacgggtattttttggatgactggttaaaaaaacacaaattgttttt- acataacttgatgtggatctcaaatg tttgcctaataaacctcaacttaattaatattgacttaaaggttattaaacctcggaagtttttctaaaagtga- ggtatctgtagtacagttagagga tgataaaaacaaatcgtgaataacttatgattatttttaaaaaacagaacttaggttagtttagccatttaaag- atcctctttaaatggttttagcaat gacaaattgtattttgatttgaggtaaattttgtgtagtttgctgaggagcaattgttaattatataaataaaa- tatttttttagttgtcgctttattttact aaaaaagaggatcattaaaaaatatataacttggatttcctacttttctgctgaaaattagcccattgtttgta- taacatttttgtagctcttgaggt ctgctatctctcctataaaaatttccctctacttacaggacatcctgtatatgtgttgaagtttgcatgaatgt- ttactttgttttttgtttttttaatttttc aggtcaagttaatacatatattattaattttatataaatatatatatgaattatttttggaccattattaaaaa- tatgttgtaactaaaataaataaaaat aatttattaaaagtccaaaaaaaaa SEQ ID 8 ta00781.001_nezvi ctgttgacgttgacgtgggatgtgtagttaatgtttaataattatttgtgtaatttttaatttgtaatatatta- ataacatatttataaccaataaaaatg gcaataaaacgagataagaaagaagaagaagatggtggaaacccctttcagagtcttgataagaccagtgttct- tcaggatgccagaacttt taatgaaacaccagttgaacctcgcaaatgcaccccaatattgaccaaaattctgtatcttttaaaccaaggag- aacagcttggtcctgctgaa gcaacagaaacattttttgctgttacgaagctttttcaatcaaataatactttgcttcgacgaatggtatatct- tggcataaaagagttatctctaatt gctcaagatgttatcatcgttacttctagccttacaaaagacatgactgggaaagaagatttatatcgagcagc- tgcaattcgagcattatgca gtataacagatgctactatgctgcagacgattgaaagatatatgaaacaagcaatcgttgatagaaacccagct- gttgctagtgctgctcttgt tagttcactgcatatgagtaggatcgctagcgatgtcgtcaagagatgggttaatgaagcacaagaagctgtta- attctgacagtataatggtc caatatcatgctctgggcctccttttccatattaggaaaaatgacagattggctgtaacaaaattagttgctaa- attaactagaatgtcgttgaaa tctccatttcgcagtttgtatgttgattcgaattgcatgtaaattattggaagaagaaagctctggagaatatg- cagactctccactttttgattttat tgaagcatgtttacgccacaaaagtgaaacagttgtttatgaagcagctgctgctcttgtaaacttacgccaca- ctactaccagacaaatcac gcctgcagtaagtgttcttcaattattttgttcttctccaaaaccagcgcttcgttttgctgctgtgagaactc- ttaataaggtagcaatgacacat cccactgctgtaacgtcatgcaatattgacttagagaaccttataacggattcaaatcggtccatagctacctt- ggccataactactcttctaaa aactggagctgaatcagctgtggacagacttatgaagcagatagcatctttcgtttcagaaataagtgatgaat- tcaagattgttgtagtgcag gccattagagcactatgcttgaaattccctcgaaaacatggaacactcatgacgtttttgtctgcgatgctgag- ggatgagggaggattggag tataaggcttcaatcgccgatacacttatatctcttatcgaagggaaccctgaagcgaaagagtctggcctcgc- tcatttgtgtgaattcatcga ggattgtgagcatacttccctggctgtcaggatactgcatctgcttggtaaagaaggaccaaaaacaaaacagc- cttctaggtacataagatt tatctataatagagtcattctggaaaatgcagtagtacgagcagctgctgtttctgcattgtctcaatttggag- ctcagtgccctgatcttcttgag aacatactagtcctcctcgcccggtgccaaatggatacagacgatgaagttagggacagggccacatattattt- cagtattttacaaaatcaa gatcgacatttgattaataattacatagttgaaccacctcaggtgtgtgtttccagtttagaaaaagccttaat- gctgcatttgatggaaactcca gaagaagtatttgacttgagttctgttccgttggcaccccctcctctatccgacgaagttcaggctgctccaac- tgttgtacaggaaccattag cggattgggacgtcctgcggtctccaaagaagagagtgcttctgatagacttcgagctattccagaactttctt- ggattcagggtccactctt caaaagttccgatcctatcagtcttacggaatctgagacagaatatcaagttagagtcacgaagcatgttttca- aaaatcatattgttcttcagttt gactgtacaaataccatgagtgaccagctactggagaaagttcgagtgcagttagaagtgagcgaaggttacca- gatcgtagctgaggtcc cctgccaaagattagcctgttcggaaacatcacctacttatattgccctgcaatttccagatgcccctaatctt- actgtcacaaactttgctgcta ctctgaggtttgttgtaaaggattgcgacccaatgaccggtatccctaactcagatgatggttatgaagaagat- tatatgcttgaagatgtcga agtgatgcttgctgaccaaatgcagcgacttacgaagagcaacttcggtgctgcatgggaggaaggcgaatcgt- atagtgagctagagga cacttataacttgtcaggaataaacagcctcgaagaggcagtgaggagtgttgtcagtttcatggggatgcagc- ctgctgacaggagcgac agggtacagcctgataaatcttcacacactgtctacctcggaggcatgttccgtggtggagttgaagtgttagc- tagagctaaactggccatg
ggtaattccccaggcgttgccatgcaacttacagtccgctctccaaatccagatatttgtgaactgattatttc- tgtagtcgggtaaaaaaaatat ataaatatatttgagaagtacacagtttcctctcagatgttgtacagaatcaaacattgaacataaagtatata- tcatatgaactgtattagttgact agctgcttgggaaaattttggttacgcaataatcaatcttttatatgtatcagattttaattaaagtatttaaa- atacaagtgttgctgtataaaatgat gttttgaaacatttttaaagtatttaagttatatgttttaatttaagcaacccagttattttttatgttatgat- atgggaattttattttatataaaatacatt ttttttattcgagataggtgtaaatttaaacttgaattttttccaaaggcatttgtctaatttattaaataata- tatgatttattatatatattttttattaat ccaataaatacttataag SEQ ID 9 >nezvi_22408.WL.1 caacttcctaacgacgaggtagttcttggatatgtaatacgggagagccatccacttctttcactggtctgcta- agtagagaggatggccgac agcgaaggaggatccgagcaggacgatgtttcgttcctgaggacggaggatatggtgtgcctatcatgcacagc- aactggagagagagtt tgcttagcagctgagggctttggtaaccgtcactgttttctagaaaatattgctgataagaatataccaccaga- tctttcaacatgtgtatttgttat tgaacaagctctatcagtaagagcacttcaggagttagttacagcagctggatctgaagagggaaagggaactg- gatctggtcacaggact cttctttatggaaatgctatactactccggcaccaaaacagtgacatgtatctggcttgtttatctaccagttc- atcaaatgacaagctctcatttg atgttggtttacaagaacattcccaaggggaagcttgttggtggaccgtacaccctgcttctaaacagagatca- gaaggtgaaaaagtgaga gttggtgatgatttaattcttgtgtctgtagccactgaaagatatttgcatactgctaaagaaaacgatcaatc- tattgtaaatgcatctttccatgt aactcattggtctgttcagccttatggaactggtatcagcaaaatgaagtatgttggttatgtgttcggaggag- atgtgttaagatttttccatggt ggggatgaatgccttaccattccatcaacttggagtgaaacccctggacaaaatgtggtagtttatgaaggagg- gagtgttttgagtcaagct cgttcactttggagattggaactggctaggacaaaatggtctggtggtttcattaattggtatcatccaatgag- gatacgacatctcaccactg gtagatacttaggagttaatgaaaataatgaattacacctcgttgttagggaggaagccacaacagcattatct- acattcattttaagacaaga aaaagatgaccaaaaagtagtaatggaagataaggatttagaagtaataggagctccaataataaaatatggtg- acagtactgttttagtcca acattcagaaagtggtttatggttaacttataagtcattcgaaactaagaaaaaaggtgtgggtaaagtagaag- aaaaacaagctgtacttcat gaggagggaaaaatggatgatggattagactttagtagaagtcaagaagaagaatcaaggactgctagagtaat- aaggaaatgttcgtcac ttttcactcaatttattaggggtctagaaactctgcaaatgaatcgaagacattctctgttttgcgctagtgta- aatttaaatgaaatggtcatgtgt ttagaagatttaattaattactttgcccagcctgaggaagatatggaacatgaggaaaaacaaaaccggttaag- agctttgagaaacagaca agatttgttccaagaagaaggaattttaaatcttatcttagaagccattgataaaattaatgttataacatccc- aaggtttcttagtcagtttagctg gagatgagtctggacagagctgggatataatctcaggatatttgtatcaactgctagctgccatcataaaagga- aatcatactaattgtgctca gtttgctaacacaaatagattaaactggttatttagcagactaggttctcaagcttcaagtgagggcacaggta- tgttggatgtacttcattgcgt cttaattgattctccagaagctttgaatatgatgagagatgaacatataaaagtaatcatttcactgctagaaa- aacatgggcgagatccaaga gttttagatgtactttgttcactttgtgttggtaatggtgtagcagtccgtagctcacaaaacaacatctgtga- tttccttctgccaggaaaaaactt gcttctacaaacgcaacttgtggatcatgttgccagtgtcaggccaaatatttttgtgggtcgagtcgaaggtt- ctgctgtttatcaaaaatggta ttttgaagtgactttagatcatatggagcaaaccacccatatgacaccgcatctaagaattggctgggctaaca- cttctggttatgttccctttcc tggcggtggtgaaaaatggggcggtaatggagttggtgatgatctctactcttttggttttgatggagctgcat- tatggacaggtggaagaaa aactgtagtccttcctcatgctatggaaccttacataagaaagggagatgttattggttgtgctttcgatctga- ctgttccaattattacatttacttt taatggaacattaatccgaggatcatttagggattttaatcttcaaggaatgttctttccagttataagctgtt- cctcaaaacttagttgtcgtttttta ctgggaggtgatcatggaagattaaaatatgcacctcctgaagaattttctcctctcgttgaaagtttgcttcc- tcaacaagtgctttctattgatc catgtttttattttggcaacctgaataaatgtgtattggctggtccttatcctgttgaagatgattgtgctttt- gttccagttccagttgacacatctat ggtaaatttacccgttcatgttgatacaatacgcgatcgtttagctgaaaacatccatgaaatgtgggctatga- ataaaattgaagcaggatgg atttatggagatgtaagagatgatataagaagaatacatccatgtcttgtgcaatttgaaaaactacctcctgc- agaaaagcgatatgacactc aacttgctgtacaaactttaaaaaccatcattgcactgggctaccatataacaatggaaaaaccaccatctaga- ataaagaacattcgtttgcc gaatgaaccatttttacaatctaatggttacaagccagctcctcttgatctcagtgccataacactaataccta- aaatggaggaacttgttgacc aactcgctgaaaatactcacaacttgtgggcaaaagaaagaatccaacaaggctggacctatggtcttaatgag- gatcctgatttgtcccga agtcctcacctcgtcccttacagtaaagttgatgatttaattaaaaaagccaacagggataccgcaagtgaaac- tgtcaggactcttcttgttta tggttataatttagaccctcctacaggtgaacaaactgaagctctcttagcagaagcaagccgtttgaagcaga- tgcagtttagaacctatcg ggctgaaaagacatatgcagtaaccagtggcaaatggtattttgaatttgaaattcttactgctgggccaatga- gagtaggttgggccattgct gattataatccaggttcccagatcggaagtgatgaagcatcctgggcatatgatggttataatgaggaaaaggt- ttattctggggttgctgaaa cgtttggaagacaatggcaagttggagacgttgtaggagtttttcttgatctattggatcatactattagtttc- tctctaaatggtgaactgcttatg gatgcacttgggggagaaacatcttttgcagatgttcagggagaaggatttgttccagcatttacacttggagt- aggacaaaaagcaaaatta gtgtttgggcaagatgttaactcacttaagttctttactacctgtggtttgcaagaaggttatgaacctttctg- tgtaaacatgaacagggcagtt accttttggtacaccaaagatcatcctatatttgaaaatactgatgattatattgatactaaaattgatgcaac- gcgtattcctgctggttctgaca caccaccatgtcttaaaattagtcataatacttttgagacaatggagaaagccaattgggaatttcttagactt- tctttacctgttcaatgtttacca tcattcataaatgaacaagaaaaagtacgtaggtggcaagaaataaggataagacaacacagacttcttgtgga- agctgaccaaaccactc ctgctcacattgaacagattatgaagtctggttttagtatgagtgatattaagggtcttcaaagaagttataca- gaagatggaatggaaggaga agaaggattggcaccaagctcatcaccacttacaaggactaagtcaaaagtgactccagctcgtccacctagga- aaggctccttaccacga aatggagatgttattaatatgaacgggacattagaaccaggtggaggaaaaatgaaccgttctaatagtgagct- tgatttccaacgtttcaatg gtgaaatgcccgatggcgataacaagaaaaagcgtgggagatctccatttaggttcttttcaagaaaaaagggg- gagcgtgatactagtgg agaaaatgcaaaaaatgtacatatgtctgagcctatgggtaatttccttgagcctccaaggactccaatgcagc- aaagaggtggaagtgctc tgcgttcttctcctcaacctaaagtacaggagttaactaagccaccatccccattagttgaaagaagtggaccc- aaagcaatgtctgtgcctgt tggaactggcatcgaaactattggaaatgaaatatttgatgtagagtgtttgaaattgattaatgaatacttct- acggtgtcaggatatttccagg tcaagacccaactcatgtatatgtcggttgggttacaactcaattccatctacgtagtaaagactttaatcaga- atcgagtgctaaagagcact gtagtagtatgtgatgaattcaatcgtgtaatagacagtattcagcggcagagttgttttatggtaagagctga- tgaattatacaatcaagtaact caggatgcctctggtaaaggtgcttcacaaggaatgtttattggatgtttcctggatactgctactggttatgt- gacgttcacatgtgaaggaaa agaaactaaccacaagtataagatggaacctgatacaaaattatttccagctatatttgttgaagctacaagca- aagaaattctacaaattgag cttggtcgtacatcaactacactgcctttatcagcagctgttctccaaaattcagaaagacatgtcattcctca- gtttccaccaagacttaaagtt cagtgtctaaaaccacatcagtgggcacgtgttcctaatatttcattgcatgtccacgctctgaaattatcaga- tataagaggttggagtatgctt tgtgaagatccagtttcaatgttagcattacatatacctgaagaagatagatgtattgatattttagaacttat- tgaaatggacaaactactttcatt ccatgctcatacattgacactttatgcagcactatgttaccaatccaattatcgtgcaggacatgttctctgca- aacatgtagaccaaaagcaac ttcagtatgctattaggtctgaattcatatctggatctttacgcttgggattttatgacctcttgattgcttta- cacattgaatcacatgcaacaacaa tggaagtttgtaaaaatgaattcataataccccttggtctagacttgaaagatttatatgaagatccagatatg- aagcacagcttacgatctttaa aaactgtctctattttacctcaaatgagtatgacagacattacggaaaatattgaaagcatcaatacattatat- agtccttattttcctcttgatgca gttaaggattatggaatgactgcattagaagaggctgtaagcatgaatcaacttcacaatagagaccctgtagg- tggttcaaatgaaaacttg tttctacccttgttgaaactggtagatagattattgcttgttgggatactacgagatgaagatgttacaaagct- actaattatgtttgatcctgaaac ttgggattcaaattttgaaaaggatggcaaagatgaacatcgtaagggtttacttcaaatgaaaatggcagagg- gggcaaaactacagatgt gctatctcttacagcatttatgcgatatacaattgcggcatcgggttgaagccattattaattttagttatgac- tatattgctgatcttcagcaggat cagttgagaagatatgttgatattaagcagtctgatcttccatcatcagttgctgcaagaaaaacaagagagtt- tcgttgccctccaagagaac agatgaatgctatcataaattttaaaaatttagaagaagatgacaaagaaaactgtccatgtggtgaagaactg- agggagagattaaacacat ttcatgaagaaactatgagtaaagtttcacttgttgctctccaagagccacaagaagatgagaacggtgaaaca- ccagaaaagccgggtgtt ttcaaaaaattatacaattttattaatgctgttaaagaattggaagaacctcctaaaatagaagaagaacctgt- taagaaaactcctgaagaaat atttagaaaagtattaattagtacaattgttagatgggctgaagaatcccagattgaaacaccaaaattagtca- gagaaatgttcagtctattgg taaggcagtacgacactgtaggtgaattaatcagatctcttggaaacacttatgtgataaatgacaaaacgaaa- gaagatgtagctcagatgt gggtagggttgagccagatcagagctctcctacctgttcaaatgtctcaagatgaagaaggtcttatgcgaatg- aggctatggaaattagtta acaatcacacattctttcaacatcctgatttgattagagttcttcgtgttcatgaaaatgttatggctgttatg- atcaataccttgggtagaagatca caagcacaatctgatgcttctcaagctggtcaagaaggtgaacctgcagctaaggagaaagatacgtcccatga- aatggtggtagcatgtt gtcgtttcctgtgttatttttgcagaacttcacgtcaaaatcagaaagcaatgtttgaccatttaacattttta- ttagaaaacagtaatattttactttc aagaccttcacttagaggaagtacccctcttgatgttgcctattcctctctcatggaaaataccgaactggcat- tagctcttagagaacattattt agagaagatagctgtttacttgtctcgctgtggattacaatctaattcagaattggtagaaaagggttaccctg- atttgggttgggatccagttg agggagaaagatatttagactttttacgcttctgtgtttgggttaacggtgaaagtgttgaagaaaatgcaaat- ctggttatacggctccttatac gtcgaccagaatgtttgggtcctgcacttcgtggagaaggtgaaggattactgagagcaattatagatgctaat- aagatgtctgaaagaattt cagatcgcagaaaaatgatggaggaacctgaaaattctgcccatcatcagtttgaacatccacttcctgagtct- gatgaagatgaggactata ttgatacaggagcagcaatactggcattctattgtactctggtcgatcttttaggtcgctgtgctccagatgct- agtgtgattgctcagggaaag aatgagtctcttagagctagagctattttgagatctttagtacctcttgaagatttatttggtgtcttgagttt- aaagtttacacttaccaatccagct attggagaagaaaggccaaaaagtgatataccatctggtctaataccatctcataagcaaagtattgttttatt- tttagagagagtatatggtatt gaacagcaagatctcttcttcagattactcgaggaagcatttttacctgatttaagagcagcaactatgctaga- tagaactgatggttctgaatc agaaatggcattagctatgaatcgctatattggaaattctattctccctttgttgataaagcattaccagtttt- atagtggtgcagataactatgca agtcttttagatgctacacttcatacagtgtatcgcctatcaaaaaatcgaatgctaactaaaggtcagcgaga- ggcagtatcagattttttggtt gctctcacaagtcaattacagccaagcatgttactcaaacttcttcgaaagttaaccgttgatgtatcaaagct- ttctgagtataccacagttgct ttaaggttgcttactttacactatgagcgttgtgcaaaatattatggaactactggtggacaagctggtggatc- tagtgatgaagaaaaaaggc tcactatgttactcttcagtaatatttttgattctttatcaaaaatggattatgatcctgaattatttggaaaa- gcgcttccctgcttgagtgctatagg atgtgcacttccacccgattattcactgtccaagaattatgatgaagaatggtatagttcaaagggttcagaac- cgactgatgggccttataatc cactgcccatcaatacttctatggtttctctaaataatgatttaaacacaattgttcaaaaattttctgaacat- tatcatgatgcatgggctagtcga aaaatggaaaatggttgggtatatggtgagcagtggtctgacagctctaaaactcatcctcgtttaaaacctta- tacattgcttaatgattatgaa aaagagagatacaaagaaccggttagagagtcattgaaagctctgttagctataggatggaatgtagagcatac- tgaagttgatattccttct aataacagaggatcatcagtcagaagatcttctaaagcaaatacatctgatggttcaacaccatttaattatca- tcccaacccaattgatatgac taatttaacattgagtagagaaatgcaaaatatggcagagaggttagctgaaaactcacatgatatttgggcaa- aaaagaagaaagaagaa cttgtttcatgtggtggtggtatacacccacagcttgttccatatgatcttttaacagacaaagagaagaggaa- agatagagaaagatctcaag aatttttgaaatatttacaatatcaaggatacaaactccacaggcctactcgaggaagtgctgatgagcaacag- gccgctgcagctgctgcc acaggagagtccagatttgcttacagtctactcgagaaacttatacaatatactgataaagcttctattaatat- gaaactactaaagccttctggt acattcagtagacgctccagttttaaaacttgttcaagagacataaaattcttttccaaagtggtattgctatt- ggttgagaagtatttcagcactc acagaaattacttcattgctgttgccactgcttctaataatgtaggagcagcctctttaaaagaaaaagaaatg- gttgccagtttgttctgtaagc tggcaaatttaattcgaacaaagctggctgcttttggtgcagatgttcgaattactgtccgttgtctacaagtg- ctagtgaaagctatagatgcc aagtcattggtaaagaattgtcctgaatttataaggacttcaatgctgacatttttcaataatacagctgatga- cttaggccaaactattcagtgttt gcaagagggtcgttacagtcaccttagaggcactcatcttaaaacatctacttctttattttatataaatgatg- ttgtactacctgttctcacttctat gtttgatcatttggctgtgtgtgattatggtagcgacttgttacttgatgaaattcaagtggcctcatatagaa- tgttgggtagtttatataatttagg aattgatccaactttaactcatgacagaaaatatttaaaaacagaaattgaaaggcataggcctgccattggtg- cttgtcttggtgcattttcatc aacatttccagtcgcttatcttgaaccccatttaaataaacataatcagttttcattagttaatagaattgctg- aacattctcttgaagcacaggata ttctagctagaatggaaaacaccatgcctacattggatgcgatcctttctgaagttgatcagttcattgaatcc- gaaaagagtcatacttcagca ccacatgttattgatgtgattttgcctctgctttgtgcttatttgccaagttggtggagtcaaggtcctgataa- tgtcagtctcacagcagggaatt atgtaacaatggttactagtgatcatatgaatcaactcctaaaaaatgtactaaaattaatcaaaaataatatt- ggaaatgaaaatgctccctgg atgacgagaatagcagcttacacccagcagatcatcataaactcttctgaagaactgttgaaagatccattcct- tccattaacacaagttgttaa gaagaggatagacaatatgtttcaccgtgaagaatctcttcgaggatttctaaaatcttcaactgaagatacct- ctcaagttgaagcagaaatt caggagggctggcatcttattgttagagatatatattctttttatccactactaattaaatatgttgatttaca- aagaaatcactggttacgtaataat attccggaagctgaatacttgtatactcatgttgctgatatatttaatatttggtctaaatcacagtactttct- aaaagaagaacagaatttcatatct gccaacgaaatagacaatatggctctaattatgcccactgcaactaggagatctgcagttgttttggatggaac- agctcctgctggaggtgg aaagaagaaaaagaagcatcgtgataagaaaagagataagaataaagaaatccaagcaagcttaatggtagctt- gcttaaaacgtttattac cagttggtcttaacctattcgctggaagagaacaagagttagttcagcattgtaaagacagatatttgaagaaa- atgccagaatatgaaatagt ggattttgccaaaatccaattaactcttcctgacaagatagatcctggagatgagatgtcttggcagcattatt- tgtactcaaaactgggaaata aaaaagatatcagctctgaaaaaccacagcaaatcgatgaggtagttgataggattgtggctatggcaaaagtt- ctttttgggcttcatatgatt gatcatccacaactacagagcaagacacaatacagatctgttgtatccacacagagaaagcgtgctgtcatagc-
ttgtttccggcaactatca ctacatgccttaccaagcatgcaaataaacctccacctcaccaatctggatggaaaagagttctttcagcagcg- agaaaacgggctgctatt gcttgtcttagaactcaacctttgtatacccttccaaggcatcgagtaattaacatatttgctcgcgcttattg- tgagctgtggctgcaagaagag aatgttggtcaagaaatcatgattgaagatcttacacaaacttttgaagatgctgaattgaaaaaaagagattc- tgaagaagatgaaagcaaa cctgatccacttacccaattagttacaacattttgtcggggtgcaatgactgaaaggagtggagctttgcaaga- agacccactttatatgtccta tgcagaaattactgcaaaatcatgtggagaagaagaagaagaaggtggagatgaggaagaaggtggagacgaag- aaggaggggcatc tatccataagacaatggcaaaattagtggaacaagaaatggaaaaacagaaactcttattccatcaagctcggc- tagccaacagaggtgttg cagaaatggtattgttacatatttcagcttgtaaaggtgttcccagtgaaatggttatgaaaactctccagctg- ggtatttctgttttacgtggtgg taatcttgatattcaaatgggtatgctaaatcatttgaaagaaaaaaaggatgttggattttttacttctatag- ctggcttgatgaactcctgcagtg tgttggatttagatgcatttgaaagaaacacaaaagctgaaggcttaggagttggttcagaaggtgctgctggt- gaaaagaacatgcatgat gctgaattcacctgtactcttttcagatttattcaacttacctgtgaagggcataacttagaatggcagaatta- tcttagaacccaagctggaaat acaacaacagttaatgttgttatttgtactgttgattaccttttgagattacaggaatcaattatggacttcta- ttggcactattcgagtaaagaatta attgatcctgctggaaaagccaactttttcaaagcaattggtgtggctagtcaagtatttaatacactctctga- agtaattcaagggccttgccc acaaaatcaacaagctctggctcattcaagattgtgggatgctgttggaggatttttgtttcttttctctcata- tgcaagataagctatcaaaacat tctagtcaagtagacttactgaaagaacttttgaatttacagaaagatatgataacaatgatgctatcaatgtt- ggaaggtaatgttgtgaatggt actattggaaaacagatggtagacacattagttgaatctgcctcaaatgtggaattgattttgaagtacttcga- catgtttttgaaattgaaagatt tgacatcctctgctagcttcttggaacttgatccaaaccatgaaggctgggtaacacctaaagattttaaagaa- aaaatggaacagcagaaaa gttatactccagaagaaatagacttcatgttacagtgctgtgaaaccaatcatgacggtaaaattgactatgtt- ggcttcacggatagattccat gagccggccaaggaaattggttttaacctagctgttcttctcacaaatttatctgagcatatgccaaatgaacc- gagacttgctcgctttttaga aacagctggtagtgttcttaactactttgaacctttcctgggacgaattgaaatattaggtagtagtaaacgaa- tcgagcgtgtatatttcgagat taaagaatcaaatattgaacagtgggaaaaacctcaaatcaaggaatctaaacgagcatttttctattcaattg- tcactgaaggaggtgacaa agaaaaattggaagcttttgttaatttttgtgaagatgccatatttgagatgacacatgccagtgggcttatgg- caactgatgatggtacaggct ctggaggaggaaaacaaagagcatcctcttattcttatatggaagatgaagatgaagaaaggaatccaatcaga- cgtggttggcaagcaac taaagatggaatttactttatgttctcaatgttatctcctagcaatattaaacataaaattattgaaatgcaac- aaatgtcaattattgaactaatgat tggttttataaaactatttttctacatgttttattactcaggatattctgtatcagttgtactgaagtatattg- gtggtattatattttcattgatgagggg accacaaattgaagagccagttgtagaagttaaagaggaagaaaaatctggacctctgaggataatgcctgctt- tgccaccacctgaagat agctctctgcttccatctgatgggtcaagagacatgaaaaaagaagacagtcagcctccatcaaaagtcataga- aggggctattcccataga agaaggaggtgagaggagctcagaggaacatgcgggagaccatgtaaaaccagaaaatgaagagcaacctccaa- caccaacacttgct gatatattgggtggagaagcagcaagaaaagaagcagcacaaagagcagaagtcgctgctgaacaagaagcagt- tatggctgcttttga ggcagaatctaaaatagaaaaagtttcagagccttctgctgtctctcaaattgattttaacaagtatactcacc- gggctgtcagtttccttgctcg taatttctataatcttaaatatgtagcattggttttggctttctgcattaactttattttattgttctacaagg- taacaacattgggtgaagatgatgatg ctgctagcggagaagggagtgttgaacaactaatggaagaattaacaggcgaaggtgatgatgtgagtggcgga- ggaagtagtggtgga gaaagtggtgaagaggatccaattgaaatggttcatgtggatgaggatttcttttatatggcacatgttatgcg- attggctgcaatcctacattct cttgtttctttagctatgttgattgcatattatcatttgaaggtccctctagctatattcaagagagaaaaaga- aatagctcgtcgacttgagtttga tggtttgtacattgctgagcaaccagaagatgatgatattaaatcacattgggataaactggttatctgtgcaa- aatcatttcctgttaattactgg gataaatttgtgaagaaaaaggttcgacagaaatacagtgaaacttatgactttgattcaataagtaatctttt- gggaatggaaaaaacatctttc agtgcccaagatactgaagaaggatcgggacttattcattacattttgaactttgactggaggtatcagctttg- gaaagcaggagtcacaatc acagataatgcatttttgtacagtttattatacttcatcttttcaattttgggaaacttcaataactttttctt- tgctgcccatttacttgatgttgcagttg gttttaaaacattgaggactattttgcaatcagtcacacacaatggaaaacagcttgtattgactgtaatgctg- ctaaccatcatagtatacatct atactgtcattgctttcaacttcttccgaaaattttatgtccaagaagaggatgaggaagtggataaaaaatgc- cacgatatgttaacttgttttgt attccacctttacaaaggagttagagctggtggtggtattggtgatgagattgaacctcctgatggtgatgatt- atgaagtttacaggataatgt ttgatattacgtttttcttttttgttattgtcatcttgctagccatcattcaaggtttgatcattgatgcattt- ggtgaattgagagatcagttagaaagt gtaaaagaagacatggaatctaactgcttcatttgtgggataggaaaagattattttgataaagttccccatgg- ttttgacactcatgttcaacaa gaacataacttggctaattacatgttctttcttatgcatctgattaacaagccagatactgaatacacaggtca- agaaacctatgtctggaacat gtatcagcaacgttgttgggatttcttcccagttggtgactgttttcgtaaacagtatgaagatgaactgggag- gtggtggtggttaattcatttg ggtgggtggtggctaaatttatattattaaaacaaaattaatgctgggaactatcaaacatccttcaattttat- taaaatttcagctaaattcaacaa tatatcttatgatattgtatttgtctaatgaaggaatagaactatcgtgttatgaatcagtgaagttttcactt- gtttagcataatttatgctaagtttac tattgcaaaatactttctttatatccgaaaatgttgtaaaataaatgtaaatggtgtggccttaaatataatg SEQ ID 10 inv2c.pk011.b22.f aatttagaatcaaatattattgatactatttctttttcatactttacattaatattcttcaaaattaaaaatgc- caggagtagagcatgttactaacaaa gtcgttgttcatcctttagttctattaagtgttgttgatcatttcaatagaatgggtaaaattgggaatcagaa- gagagtagttggcgtattattagg atgctggaaggcaaaaggtgttttagacgtatctaatagttttgcagtgccatttgatgaagatgataaagaca- aatcagtttggtttttagacca tgattatttagaaaatatgtatggcatgtttaagaaagttaatgcaagagaaaaagttgttggctggtatcata- caggcccaaagttacatcaaa atgatgttgcaattaatgaacttatacgccgttactgccctaactcagttcttgttattatcgatgcaaaacca- aaggatcttggtttacctacaga agcatatagagcagttgaagaagtacatgatgatggttctcctacgacaaaaacatttgagcatgttcccagtg- aaataggggctgaagaag cagaggaagtgggtgttgaacatctgctgagagatataaaagatacaactgtcggctcactttcgcaaagggtt- actaatcaatttcttggtct caaaggccttaatcaacaaattcaagacatcagggattaccttatgcaggttgttgaaggaaaattgcccatca- accatcaaataatatatcag cttcaagacatatttaatctccttcctgacatgaaccatgggaactttgttgattcattatacataaaaacaaa- tgatcagatgcttgtcgtttatct cgctgccctcgttagagctattgttgccttgcataatctgatcaataataaactcagtaatcgtgatgccgaaa- aaaaaaaaaaaaaaaaaaa aaaaaaaa SEQ ID 11 inv2c.pk020.119.f tacttcattgtcataaaggggtaacattgctgaatccagcgtaaaggttacagtgactctcacctggttataac- agttttgctttgtaatcatgggt tctgagagatatagcttttctttgactactttcagtccatctggaaaattagttcaaattgagtatgcacttgc- cgcagtcgcagctggagctcca tcaatcggtatcagagcatccaatggagttgtattggctactgaaaacaaatacaaatcaattttatatgaaga- acatactattcaaaaagtaga aatgataactaaacacattggaatggtctacagtggaatgggacctgattataggctactagtgaagagagcta- gaaaaatggctcaacaat aacagttagtttacggtgagcctattcctactgcacagcttgttcaacgagttgccatggttatgcaggagtac- actcaatctggaggtgttag accttttggagtttctttactcattgccgggtgggatggggataaaccatctctgtttcaatgtgatccatctg- gagcatactttgcctggaaagc tactgcaatgggaaaaaattttgtcactggcaaaacatttctagaaaagaggtacagtgaaactttagagctgg- atgatgcagtacatactgc aattctcactcttaaagaaaactttgaaggccaaatgacttcggacaatatcgaggtcggagtttgtgatgatc- aagggttcagagttttagatc ctacaacagtgaaggattatctggctaatattccataaatttattattaaaatttgattttataattaataaaa- aggtgattgcttatggatatgtgtga tgcctaaataaaatattattttttattggtttaatgctaaaaaaaaaaaaaaaaaaaaaaaaaaaa SEQ ID 12 inv3c.pk002.i8.f.fis atcattgatgatggttgagaaagttccagactctacatatgaaatggttggaggtcttgataagcaaattaagg- aaatcaaagaagtaattgaa cctcctgtaaaacatccagaactgtttgatgcactaggaatagctcagcccaaaggagttttattatatggacc- acctggaacaggtaaaaca cttttggcaagagcagttgcccatcacactgagtgcacgttcattcgtgtgtcaggatctgagttggttcagaa- attcattggggaaggatcca gaatggttagagaattgttcgtcatggcaagggaacatgctccatctatcatatttatggatgaaatcgattca- ataggttcatcacgtatcgaat ctgggagtggtggtgattctgaagtccagagaacaatgttagagttattgaaccaattggatggcttcgaagcc- acaaaaaatattaaggtca taatggccactaataggattgatattttggaccctgctcttctgcgtcctggaaggatagatcgtaagattgag- ttccccccaccaaatgagga agctcgtttagatatccttagaattcattcacgtaaaatgaatcttacccggggtatcaacttgcgtaaaattg- ccgagctcatgcctggagctt caggtgcagaagtaaagggtgtctgtactgaagcagggatgtatgccctgagggagaggagaatccatgtcacc- caagaagatttcgaaa tggctgtggccaaggttatgcaaaaggactccgagaagaatatgtcaatcaagaaattatggaaataaacgact- cacttatttttttttttttttac tctgtttaaaaagctttaaatatatagatgtttgtgaggttttgttaaaaataaatatatactataatcataaa- aaaaaaaaaaaaaaaaaaaaaaa aaa SEQ ID 13 ATT B4 recombinase sequence CAACTTTGTATAGAAAAGTTG SEQ ID 14 ATT B3 recombinase sequence CAACTTTGTATAATAAAGTTG SEQ ID 15 PHP36164 agattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatata- tgagtaaacttggtctgacagtta ccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccg- tcgtgtagataactacgatacg ggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcag- caataaaccagccagc cggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaag- ctagagtaagtagttcgc cagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggct- tcattcagctccggttcccaac gatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtc- agaagtaagttggccgca gtgttatcactcatggttatggcagcacttacggatggcatgacagtaagagaattatgcagatgcttttctgt- gactggtgagtactcaacca agtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgcca- catagcagaactttaaaa gtgctcatcattggaaaacgttatcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatg- taacccactcgtgcaccc aactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaa- aaagggaataagggcga cacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatg- atgatatatttttatcttgtgcaatgta acatcagagattttgagacacgggccagagctgccaggaaacagctatgaccatgtaatacgactcactatagg- ggatatcgcggccgcc ctgcagctggatggcaaataatgattttattttgactgatagtgacctgttcgttgcaacaaattgataagcaa- tgctttcttataatgccaactttg tatagaaaagttgaacgagaaacgtaaaatgatataaatatcaatatattaaattagattttgcataaaaaaca- gactacataatactgtaaaac acaacatatccagtcactatgaatcaactacttagatggtattagtgacctgtagtcgactaagttggcagcat- cacccgacgcactttgcgcc gaataaatacctgtgacggaagatcacttcgcagaataaataaatcctggtgtccctgttgataccgggaagcc- ctgggccaacttttggcg aaaatgagacgttgatcggcacgtaagaggttccaactttcaccataatgaaataagatcactaccgggcgtat- tttttgagttatcgagatttt caggagctaaggaagctaaaatggagaaaaaaatcactggatataccaccgttgatatatcccaatggcatcgt- aaagaacattttgaggca tttcagtcagttgctcaatgtacctataaccagaccgttcagctggatattacggcctttttaaagaccgtaaa- gaaaaataagcacaagttttat ccggcctttattcacattcttgcccgcctgatgaatgctcatccggaattccgtatggcaatgaaagacggtga- gctggtgatatgggatagt gttcaccatgttacaccgttttccatgagcaaactgaaacgttttcatcgctctggagtgaataccacgacgat- ttccggcagtttctacacata tattcgcaagatgtggcgtgttacggtgaaaacctggcctatttccctaaagggtttattgagaatatgttttt- cgtctcagccaatccctgggtg agtttcaccagttttgatttaaacgtggccaatatggacaacttcttcgcccccgttttcaccatgggcaaata- ttatacgcaaggcgacaaggt gctgatgccgctggcgattcaggttcatcatgccgtttgtgatggcttccatgtcggcagaatgataatgaatt- acaacagtactgcgatgag tggcagggggggcgtaaacgccgcgtggatccggcttactaaaagccagataacagtatgcgtatttgcgcgct- gatttttgcggtataag aatatatactgatatgtatacccgaagtatgtcaaaaagaggtatgctatgaagcagcgtattacagtgacagt- tgacagcgacagctatcag ttgctcaaggcatatatgatgtcaatatctccggtctggtaagcacaaccatgcagaatgaagcccgtcgtctg- cgtgccgaacgctggaaa gcggaaaatcaggaagggatggctgaggtcgcccggtttattgaaatgaacggctcttttgctgacgagaacag- gggctggtgaaatgca gtttaaggtttacacctataaaagagagagccgttatcgtctgtttgtggatgtacagagtgatattattgaca- cgcccgggcgacggatggtg atccccctggccagtgcacgtctgctgtcagataaagtctcccgtgaactttacccggtggtgcatatcgggga- tgaaagctggcgcatgat gaccaccgatatggccagtgtgccggtctccgttatcggggaagaagtggctgatctcagccaccgcgaaaatg- acatcaaaaacgccat taacctgatgttctggggaatataaatgtcaggctcccttatacacagccagtctgcaggtcgatacagtagaa- attacagaaactttatcacg tttagtaagtatagaggctgaaaatccagatgaagccgaacgacttgtaagagaaaagtataagagttgtgaaa- ttgttcttgatgcagatgat tttcaggactatgacactagcgtatatgaataggtagatgtttttattttgtcacacaaaaaagaggctcgcac- ctctttttcttatttctttttatgatt taatacggcattgaggacaatagcgagtaggctggatacgacgattccgtttgagaagaacatttggaaggctg- tcggtcgactaagttggc agcatcacccgaagaacatttggaaggctgtcggtcgactacaggtcactaataccatctaagtagttgattca- tagtgactggatatgttgtg ttttacagtattatgtagtctgttttttatgcaaaatctaatttaatatattgatatttatatcattttacgtt- tctcgttcaactttattatacaaagttggc attataaaaaagcattgctcatcaatttgttgcaacgaacaggtcactatcagtcaaaataaaatcattatttg- gggcccgagcttaagactggcc
gtcgttttacaacgtcgtgactgggaaaacatccatgctagcgttaacgcgagagtagggaactgccaggcatc- aaataaaacgaaaggct cagtcggaagactgggcctttcgttttatctgttgtttgtcggtgaacgctctcctgagtaggacaaatccgcc- gggagcggatttgaacgttg tgaagcaacggcccggagggtggcgggcaggacgcccgccataaactgccaggcatcaaactaagcagaaggcc- atcctgacggatg gcattttgcgtttctacaaactcttcctggctagcggtacgcgtattaattgcgttgcgctcactgcccgcttt- ccagtcgggaaacctgtcgtg ccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgc- tcactgactcgctgcgct cggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggat- aacgcaggaaagaac atgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccg- cccccctgacgagca tcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctg- gaagctccctcgtgcg ctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctc- atagctcacgctgtaggtatc tcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgcc- ttatccggtaactatcgt cttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgag- gtatgtaggcggtgct acagagttcttgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgctgaa- gccagttaccttcggaaaa agagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagat- tacgcgcagaaaaaaagg atctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgacgcgtaactcacgttaaggg- attttggtcatg SEQ ID 16 PHP59032 cgtaccggccggcctctgcctgcgttctgctgtggaagttcctattccgaagttcctattctccagaaagtata- ggaacttcacatgctgcctc gtgcaagtcacgatctcgagttctatagtgtcacctaaatcgtatgtgtatgatacataaggttatgtattaat- tgtagccgcgttctaacgacaa tatgtccatatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagccccgacacccgccaa- cacccgctgacgcgccct gacgggcttgtctgctcccggcatccgcttacagacaagctgtgaccgtctccgggagctgcatgtgtcagagg- ttttcaccgtcatcaccg aaacgcgcgagacgaaagggcctcgtgatacgcctatttttataggttaatgtcatgaccaaaatcccttaacg- tgagttttcgttccactgag cgtcagaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgctgcttgcaa- acaaaaaaaccaccgctacc agcggtggtttgtttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgcaga- taccaaatactgtccttcta gtgtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctctgctaatcctgtt- accagtggctgctgccagt ggcgataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtcgggctgaac- ggggggttcgtgcac acagcccagcttggagcgaacgacctacaccgaactgagatacctacagcgtgagcattgagaaagcgccacgc- ttcccgaagggaga aaggcggacaggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaacgc- ctggtatctttata gtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatgg- aaaaacgccagcaacgc ggcctttttacggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctgattctg- tggataaccgtattaccgcctttga gtgagctgataccgctcgccgcagccgaacgaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgcc- caatacgcaaac cgcctctccccgcgcgttggccgattcattaatgcaggttgatcagatctcgatcccgcgaaattaatacgact- cactatagggagaccaca acggtttccctctagaaataattttgtttaactttaagaaggagatatacccatggaaaagcctgaactcaccg- cgacgtctgtcgagaagtttc tgatcgaaaagttcgacagcgtctccgacctgatgcagctctcggagggcgaagaatctcgtgctttcagcttc- gatgtaggagggcgtgg atatgtcctgcgggtaaatagctgcgccgatggtttctacaaagatcgttatgtttatcggcactttgcatcgg- ccgcgctcccgattccggaa gtgcttgacattggggaattcagcgagagcctgacctattgcatctcccgccgtgcacagggtgtcacgttgca- agacctgcctgaaaccg aactgcccgctgttctgcagccggtcgcggaggctatggatgcgatcgctgcggccgatcttagccagacgagc- gggttcggcccattcg gaccgcaaggaatcggtcaatacactacatggcgtgatttcatatgcgcgattgctgatccccatgtgtatcac- tggcaaactgtgatggacg acaccgtcagtgcgtccgtcgcgcaggctctcgatgagctgatgctttgggccgaggactgccccgaagtccgg- cacctcgtgcacgcg gatttcggctccaacaatgtcctgacggacaatggccgcataacagcggtcattgactggagcgaggcgatgtt- cggggattcccaatacg aggtcgccaacatcttcttctggaggccgtggttggcttgtatggagcagcagacgcgctacttcgagcggagg- catccggagcttgcag gatcgccgcggctccgggcgtatatgctccgcattggtcttgaccaactctatcagagcttggttgacggcaat- ttcgatgatgcagcttggg cgcagggtcgatgcgacgcaatcgtccgatccggagccgggactgtcgggcgtacacaaatcgcccgcagaagc- gcggccgtctgga ccgatggctgtgtagaagtactcgccgatagtggaaaccgacgccccagcactcgtccgagggcaaaggaatag- tgaggtacagcttgg atcgatccggctgctaacaaagcccgaaaggaagctgagttggctgctgccaccgctgagcaataactagcata- accccttggggcctct aaacgggtcttgaggggttttttgctgaaaggaggaactatatccggatgatcgtcgaggcctcacgtgttaac- agaagttcctattccgaag ttcctattctctagaaagtataggaacttccaccacacaacacaatggcggccaccgcttccagaaccacccga- ttctcttcttcctcttcacac cccaccttccccaaacgcattactagatccaccctccctctctctcatcaaaccctcaccaaacccaaccacgc- tctcaaaatcaaatgttcca tctccaaaccccccacggcggcgcccttcaccaaggaagcgccgaccacggagcccttcgtgtcacggttcgcc- tccggcgaacctcgc aagggcgcggacatccttgtggaggcgctggagaggcagggcgtgacgacggtgttcgcgtaccccggcggtgc- gtcgatggagatc caccaggcgctcacgcgctccgccgccatccgcaacgtgctcccgcgccacgagcagggcggcgtcttcgccgc- cgaaggctacgcg cgttcctccggcctccccggcgtctgcattgccacctccggccccggcgccaccaacctcgtgagcggcctcgc- cgacgctttaatggac agcgtcccagtcgtcgccatcaccggccaggtcagccgccggatgatcggcaccgacgccttccaagaaacccc- gatcgtggaggtga gcagatccatcacgaagcacaactacctcatcctcgacgtcgacgacatcccccgcgtcgtcgccgaggctttc- ttcgtcgccacctccgg ccgccccggtccggtcctcatcgacattcccaaagacgttcagcagcaactcgccgtgcctaattgggacgagc- ccgttaacctccccggt tacctcgccaggctgcccaggccccccgccgaggcccaattggaacacattgtcagactcatcatggaggccca- aaagcccgttctctac gtcggcggtggcagtttgaattccagtgctgaattgaggcgctttgttgaactcactggtattcccgttgctag- cactttaatgggtcttggaac ttttcctattggtgatgaatattcccttcagatgctgggtatgcatggtactgtttatgctaactatgctgttg- acaatagtgatttgttgcttgccttt ggggtaaggtttgatgaccgtgttactgggaagcttgaggcttttgctagtagggctaagattgttcacattga- tattgattctgccgagattgg gaagaacaagcaggcgcacgtgtcggtttgcgcggatttgaagttggccttgaagggaattaatatgattttgg- aggagaaaggagtgga gggtaagtttgatcttggaggttggagagaagagattaatgtgcagaaacacaagtttccattgggttacaaga- cattccaggacgcgatttc tccgcagcatgctatcgaggttcttgatgagttgactaatggagatgctattgttagtactggggttgggcagc- atcaaatgtgggctgcgca gttttacaagtacaagagaccgaggcagtggttgacctcagggggtcttggagccatgggttttggattgcctg- cggctattggtgctgctgt tgctaaccctggggctgttgtggttgacattgatggggatggtagtttcatcatgaatgttcaggagttggcca- ctataagagtggagaatctc ccagttaagatattgttgttgaacaatcagcatttgggtatggtggttcagtgggaggataggttctacaagtc- caatagagctcacacctatct tggagatccgtctagcgagagcgagatattcccaaacatgctcaagtttgctgatgcttgtgggataccggcag- cgcgagtgacgaagaa ggaagagcttagagcggcaattcagagaatgttggacacccctggcccctaccttcttgatgtcattgtgcccc- atcaggagcatgtgttgc cgatgattcccagtaatggatccttcaaggatgtgataactgagggtgatggtagaacgaggtactgattgcct- agaccaaatgttccttgat gcttgttttgtacaatatatataagataatgctgtcctagttgcaggatttggcctgtggtgagcatcatagtc- tgtagtagttttggtagcaagac attttattttccttttatttaacttactacatgcagtagcatctatctatctctgtagtctgatatctcctgtt- gtctgtattgtgccgttggattttttgctg tagtgagactgaaaatgatgtgctagtaataatatttctgttagaaatctaagtagagaatctgttgaagaagt- caaaagctaatggaatcaggt tacatattcaatgtttttctttttttagcggttggtagacgtgtagattcaacttctcttggagctcacctagg- caatcagtaaaatgcatattccttttt taacttgccatttatttacttttagtggaaattgtgaccaatttgttcatgtagaacggatttggaccattgcg- tccacaaaacgtctcttttgctcga tcttcacaaagcgataccgaaatccagagatagttttcaaaagtcagaaatggcaaagttataaatagtaaaac- agaatagatgctgtaatcg acttcaataacaagtggcatcacgtttctagttctagacccatcagctgaggtacaccggtgatcctcgaagag- aagggttaataacacacttt tttaacatttttaacacaaattttagttatttaaaaatttattaaaaaatttaaaataagaagaggaactatta- aataaatctaacttacaaaatttatg atttttaataagttttcaccaataaaaaatgtcataaaaatatgttaaaaagtatattatcaatattctcttta- tgataaataaaaagaaaaaaaaaat aaaagttaagtgaaaatgagattgaagtgactttaggtgtgtataaatatatcaaccccgccaacaatttattt- aatccaaatatattgaagtatat tattccatagcctttatttatttatatatttattatataaaagctttatttgttctaggttgttcatgaaatat- ttttttggttttatctccgttgtaagaaaa tcatgtgctttgtgtcgccactcactattgcagctttttcatgcattggtcagattgacggttgattgtatttt- tgttttttatggttttgtgttatgacttaa gtcttcatctctttatctcttcatcaggtttgatggttacctaatatggtccatgggtacatgcatggttaaat- taggtggccaactttgttgtgaac gatagaatttttttttatattaagtaaactatttttatattatgaaataataataaaaaaaatattttatcatt- attaacaaaatcatattagttaatttgtta actctataataaaagaaatactgtaacattcacattacatggtaacatctttccaccctttcatttgttttttg- tttgatgactttttttcttgtttaaattta tttcccttcttttaaatttggaatacattatcatcatatataaactaaaatactaaaaacaggattacacaaat- gataaataataacacaaatatttat aaatctagctgcaatatatttaaactagctatatcgatattgtaaaataaaactagctgcattgatactgataa- aaaaatatcatgtgctttctgga ctgatgatgcagtatacttttgacattgcctttattttatttttcagaaaagctttcttagttctgggttcttc- attatttgtttcccatctccattgtgaatt gaatcatttgcttcgtgtcacaaatacatttagtttaggtacatgcattggtcagattcacggtttattatgtc- atgacttaagttcatggtagtacat tacctgccacgcatgcattatattggttagatttgataggcaaatttggttgtcaacaatataaatataaataa- tgtttttatattacgaaataacagt gatcaaaacaaacagttttatctttattaacaagattttgtttttgtttgatgacgttttttaatgtttacgct- ttcccccttcttttgaatttagaacacttt atcatcataaaatcaaatactaaaaaaattacatatttcataaataataacacaaatatttttaaaaaatctga- aataataatgaacaatattacata ttatcacgaaaattcattaataaaaatattatataaataaaatgtaatagtagttatatgtaggaaaaaagtac- tgcacgcataatatatacaaaaa gattaaaatgaactattataaataataacactaaattaatggtgaatcatatcaaaataatgaaaaagtaaata- aaatttgtaattaacttctatatg tattacacacacaaataataaataatagtaaaaaaaattatgataaatatttaccatctcataaagatatttaa- aataatgataaaaatatagattat tttttatgcaactagctagccaaaaagagaacacgggtatatataaaaagagtacctttaaattctactgtact- tcctttattcctgacgtttttatat caagtggacatacgtgaagattttaattatcagtctaaatatttcattagcacttaatacttttctgttttatt- cctatcctataagtagtcccgattctc ccaacattgcttattcacacaactaactaagaaagtcttccatagccccccaagccctaggcgctatcaacttt- gtatagaaaagttgaacga gaaacgtaaaatgatataaatatcaatatattaaattagattttgcataaaaaacagactacataatactgtaa- aacacaacatatccagtcacta tggtcgacattttcaggagctaaggaagctaaaatggagaaaaaaatcactggatataccaccgttgatatatc- ccaatggcatcgtaaaga acattttgaggcatttcagtcagttgctcaatgtacctataaccagaccgttcagctggatattacggcctttt- taaagaccgtaaagaaaaata agcacaagttttatccggcctttattcacattcttgcccgcctgatgaatgctcatccggaattccgtatggca- atgaaagacggtgagctggt gatatgggatagtgttcacccttgttacaccgttttccatgagcaaactgaaacgttttcatcgctctggagtg- aataccacgacgatttccggc agtttctacacatatattcgcaagatgtggcgtgttacggtgaaaacctggcctatttccctaaagggtttatt- gagaatatgtttttcgtctcagc caatccctgggtgagtttcaccagttttgatttaaacgtggccaatatggacaacttcttcgcccccgttttca- ccatgggcaaatattatacgca aggcgacaaggtgctgatgccgctggcgattcaggttcatcatgccgtctgtgatggcttccatgtcggcagaa- tgcttaatgaattacaaca gtactgcgatgagtggcagggcggggcgtaaacgcgtggatccggcttactaaaagccagataacagtatgcgt- atttgcgcgctgattttt gcggtataagaatatatactgatatgtatacccgaagtatgtcaaaaagaggtgtgctatgaagcagcgtatta- cagtgacagttgacagcga cagctatcagttgctcaaggcatatatgatgtcaatatctccggtctggtaagcacaaccatgcagaatgaagc- ccgtcgtctgcgtgccga acgctggaaagcggaaaatcaggaagggatggctgaggtcgcccggtttattgaaatgaacggctcttttgctg- acgagaacagggactg gtgaaatgcagtttaaggtttacacctataaaagagagagccgttatcgtctgtttgtggatgtacagagtgat- attattgacacgccagggcg acggatggtgatccccctggccagtgcacgtctgctgtcagataaagtctcccgtgaactttacccggtggtgc- atatcggggatgaaagct ggcgcatgatgaccaccgatatggccagtgtgccggtctccgttatcggggaagaagtggctgatctcagccac- cgcgaaaatgacatca aaaacgccattaacctgatgttctggggaatataaatgtcaggctcccttatacacaggcggccgccatagtga- ctggatatgttgtgttttac agtattatgtagtctgttttttatgcaaaatctaatttaatatattgatatttatatcattttacgtttctcgt- tcaactttattatacaaagttgatagatat cggtccgagatccatcaggtaagtttctgcttctacctttgatatatatataataattatcattaattagtagt- aatataatatttcaaatatttttttcaa aataaaagaatgtagtatatagcaattgcttttctgtagtttataagtgtgtatattttaatttataacttttc- taatatatgaccaaaacatggtgatgt gcaggtccatggtggagctcgaccgatatctatcaactttgtataataaagttgaacgagaaacgtaaaatgat- ataaatatcaatatattaaat tagattttgcataaaaaacagactacataatactgtaaaacacaacatatccagtcactatggcggccgcatta- gggcaccccaggctttaca ctttatgcttccggctcgtataatgtgtggattttgagttaggatccgtcgagattttcaggagctaaggaagc- taaaatggagaaaaaaatca ctggatataccaccgttgatatatcccaatggcatcgtaaagaacattttgaggcatttcagtcagttgctcaa- tgtacctataaccagaccgtt cagctggatattacggcctttttaaagaccgtaaagaaaaataagcacaagttttatccggcctttattcacat- tcttgcccgcctgatgaatgct catccggaattccgtatggcaatgaaagacggtgagctggtgatatgggatagtgttcacccttgttacaccgt- tttccatgagcaaactgaa acgttttcatcgctctggagtgaataccacgacgatttccggcagtttctacacatatattcgcaagatgtggc- gtgttacggtgaaaacctgg cctatttccctaaagggtttattgagaatatgtttttcgtctcagccaatccctgggtgagtttcaccagtttt- gatttaaacgtggccaatatggac aacttcttcgcccccgttttcaccatgggcaaatattatacgcaaggcgacaaggtgctgatgccgctggcgat- tcaggttcatcatgccgttt gtgatggcttccatgtcggcagaatgcttaatgaattacaacagtactgcgatgagtggcaggcggggcgtaat- ctagaggatccggcttac taaaagccagataacagtatgcgtatttgcgcgctgatttttgcggtataagaatatatactgatatgtatacc- cgaagtatgtcaaaaagaggt atgctatgaagcagcgtattacagtgacagttgacagcgacagctatcagttgctcaaggcatatatgatgtca- atatctccggttcggtaag
cacaaccatgcagaatgaagcccgtcgtctgcgtgccgaacgctggaaagcggaaaatcaggaagggatggctg- aggtcgcccggttta ttgaaatgaacggctcttttgccgacgagaacaggggctggtgaaatgcagtttaaggtttacacctataaaag- agagagccgttatcgtctg tttgtggatgtacagagtgatattattgacacgccagggcgacggatggtgatccccctggccagtgcacgtct- gctgtcagataaagtccc ccgtgaactttacccggtggtgcatatcggggatgaaagctggcgcatgatgaccaccgatatggccagtgtgc- cggtctccgttatcggg gaagaagtggctgatctcagccaccgcgaaaatgacatcaaaaacgccattaacctgatgttctggggaatata- aatgtcaggctcccttat acacagccagtctgcaggtcgaccatagtgactggatatgttgtgttttacagtattatgtagtctgtttttta- tgcaaaatctaatttaatatattga tatttatatcattttacgtttctcgttcaacttttctatacaaagttgatagcgttaacccgggtaactgtacc- taaagaaggagtgcgtcgaagca gatcgttcaaacatttggcaataaagtttcttaagattgaatcctgttgccggtcttgcgatgattatcatata- atttctgttgaattacgttaagcat gtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattagagtcccgcaattatacat- ttaatacgcgatagaaaacaaaa tatagcgcgcaaactaggataaattatcgcgcgcggtgtcatctatgttactagatcgatgtcgaatcgatggg- cc SEQ ID 17 PHP62151 acaaagttgatagcgttaacccgggtaactgtacctaaagaaggagtgcgtcgaagcagatcgttcaaacattt- ggcaataaagtttcttaag attgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaattacgttaagcatgtaataatta- acatgtaatgcatgacgttatttatg agatgggtttttatgattagagtcccgcaattatacatttaatacgcgatagaaaacaaaatatagcgcgcaaa- ctaggataaattatcgcgcg cggtgtcatctatgttactagatcgatgtcgaatcgatgggcccgtaccggccggcctctgcctgcgttctgct- gtggaagttcctattccgaa gttcctattctccagaaagtataggaacttcacatgctgcctcgtgcaagtcacgatctcgagttctatagtgt- cacctaaatcgtatgtgtatga tacataaggttatgtattaattgtagccgcgttctaacgacaatatgtccatatggtgcactctcagtacaatc- tgctctgatgccgcatagttaa gccagccccgacacccgccaacacccgctgacgcgccctgacgggcttgtctgctcccggcatccgcttacaga- caagctgtgaccgtc tccgggagctgcatgtgtcagaggttttcaccgtcatcaccgaaacgcgcgagacgaaagggcctcgtgatacg- cctatttttataggttaat gtcatgaccaaaatcccttaacgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatc- ttcttgagatcctttttttctg cgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctacc- aactctttttccgaaggta actggcttcagcagagcgcagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaa- ctctgtagcaccgcctac atacctcgctctgctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggact- caagacgatagttaccggat aaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaact- gagatacctacag cgtgagcattgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcgg- aacaggagagcg cacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagc- gtcgatttttgtgatgctcg tcaggggggcggagcctatggaaaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttt- tgctcacatgttctttcctg cgttatcccctgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacg- accgagcgcagcgagtc agtgagcgaggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgca- ggttgatcagatctcg atcccgcgaaattaatacgactcactatagggagaccacaacggtttccctctagaaataattttgtttaactt- taagaaggagatatacccatg gaaaagcctgaactcaccgcgacgtctgtcgagaagtttctgatcgaaaagttcgacagcgtctccgacctgat- gcagctctcggagggc gaagaatctcgtgctttcagcttcgatgtaggagggcgtggatatgtcctgcgggtaaatagctgcgccgatgg- tttctacaaagatcgttat gtttatcggcactttgcatcggccgcgctcccgattccggaagtgcttgacattggggaattcagcgagagcct- gacctattgcatctcccgc cgtgcacagggtgtcacgttgcaagacctgcctgaaaccgaactgcccgctgttctgcagccggtcgcggaggc- tatggatgcgatcgct gcggccgatcttagccagacgagcgggttcggcccattcggaccgcaaggaatcggtcaatacactacatggcg- tgatttcatatgcgcg attgctgatccccatgtgtatcactggcaaactgtgatggacgacaccgtcagtgcgtccgtcgcgcaggctct- cgatgagctgatgctttgg gccgaggactgccccgaagtccggcacctcgtgcacgcggatttcggctccaacaatgtcctgacggacaatgg- ccgcataacagcggt cattgactggagcgaggcgatgttcggggattcccaatacgaggtcgccaacatcttcttctggaggccgtggt- tggcttgtatggagcagc agacgcgctacttcgagcggaggcatccggagcttgcaggatcgccgcggctccgggcgtatatgctccgcatt- ggtcttgaccaactcta tcagagcttggttgacggcaatttcgatgatgcagcttgggcgcagggtcgatgcgacgcaatcgtccgatccg- gagccgggactgtcgg gcgtacacaaatcgcccgcagaagcgcggccgtctggaccgatggctgtgtagaagtactcgccgatagtggaa- accgacgccccagc actcgtccgagggcaaaggaatagtgaggtacagcttggatcgatccggctgctaacaaagcccgaaaggaagc- tgagttggctgctgc caccgctgagcaataactagcataaccccttggggcctctaaacgggtcttgaggggttttttgctgaaaggag- gaactatatccggatgat cgtcgaggcctcacgtgttaacagaagttcctattccgaagttcctattctctagaaagtataggaacttccac- cacacaacacaatggcggc caccgcttccagaaccacccgattctcttcttcctcttcacaccccaccttccccaaacgcattactagatcca- ccctccctctctctcatcaaa ccctcaccaaacccaaccacgctctcaaaatcaaatgttccatctccaaaccccccacggcggcgcccttcacc- aaggaagcgccgacca cggagcccttcgtgtcacggttcgcctccggcgaacctcgcaagggcgcggacatccttgtggaggcgctggag- aggcagggcgtgac gacggtgttcgcgtaccccggcggtgcgtcgatggagatccaccaggcgctcacgcgctccgccgccatccgca- acgtgctcccgcgc cacgagcagggcggcgtcttcgccgccgaaggctacgcgcgttcctccggcctccccggcgtctgcattgccac- ctccggccccggcg ccaccaacctcgtgagcggcctcgccgacgctttaatggacagcgtcccagtcgtcgccatcaccggccaggtc- agccgccggatgatc ggcaccgacgccttccaagaaaccccgatcgtggaggtgagcagatccatcacgaagcacaactacctcatcct- cgacgtcgacgacat cccccgcgtcgtcgccgaggctttcttcgtcgccacctccggccgccccggtccggtcctcatcgacattccca- aagacgttcagcagcaa ctcgccgtgcctaattgggacgagcccgttaacctccccggttacctcgccaggctgcccaggccccccgccga- ggcccaattggaaca cattgtcagactcatcatggaggcccaaaagcccgttctctacgtcggcggtggcagtttgaattccagtgctg- aattgaggcgctttgttga actcactggtattcccgttgctagcactttaatgggtcttggaacttttcctattggtgatgaatattcccttc- agatgctgggtatgcatggtactg tttatgctaactatgctgttgacaatagtgatttgttgcttgcctttggggtaaggtttgatgaccgtgttact- gggaagcttgaggcttttgctagt agggctaagattgttcacattgatattgattctgccgagattgggaagaacaagcaggcgcacgtgtcggtttg- cgcggatttgaagttggcc ttgaagggaattaatatgattttggaggagaaaggagtggagggtaagtttgatcttggaggttggagagaaga- gattaatgtgcagaaaca caagtttccattgggttacaagacattccaggacgcgatttctccgcagcatgctatcgaggttcttgatgagt- tgactaatggagatgctattg ttagtactggggttgggcagcatcaaatgtgggctgcgcagttttacaagtacaagagaccgaggcagtggttg- acctcagggggtcttgg agccatgggttttggattgcctgcggctattggtgctgctgttgctaaccctggggctgttgtggttgacattg- atggggatggtagtttcatca tgaatgttcaggagttggccactataagagtggagaatctcccagttaagatattgttgttgaacaatcagcat- ttgggtatggtggttcagtgg gaggataggttctacaagtccaatagagctcacacctatcttggagatccgtctagcgagagcgagatattccc- aaacatgctcaagtttgct gatgcttgtgggataccggcagcgcgagtgacgaagaaggaagagcttagagcggcaattcagagaatgttgga- cacccctggccccta ccttcttgatgtcattgtgccccatcaggagcatgtgttgccgatgattcccagtaatggatccttcaaggatg- tgataactgagggtgatggta gaacgaggtactgattgcctagaccaaatgttccttgatgcttgttttgtacaatatatataagataatgctgt- cctagttgcaggatttggcctgt ggtgagcatcatagtctgtagtagttttggtagcaagacattttattttccttttatttaacttactacatgca- gtagcatctatctatctctgtagtct gatatctcctgttgtctgtattgtgccgttggattttttgctgtagtgagactgaaaatgatgtgctagtaata- atatttctgttagaaatctaagtag agaatctgttgaagaagtcaaaagctaatggaatcaggttacatattcaatgtttttctttttttagcggttgg- tagacgtgtagattcaacttctctt ggagctcacctaggcaatcagtaaaatgcatattccttttttaacttgccatttatttacttttagtggaaatt- gtgaccaatttgttcatgtagaacg gatttggaccattgcgtccacaaaacgtctcttttgctcgatcttcacaaagcgataccgaaatccagagatag- ttttcaaaagtcagaaatgg caaagttataaatagtaaaacagaatagatgctgtaatcgacttcaataacaagtggcatcacgtttctagttc- tagacccatcagctgaggta caccggtgatcctcgaagagaagggttaataacacacttttttaacatttttaacacaaattttagttatttaa- aaatttattaaaaaatttaaaataa gaagaggaactctttaaataaatctaacttacaaaatttatgatttttaataagttttcaccaataaaaaatgt- cataaaaatatgttaaaaagtatat tatcaatattctctttatgataaataaaaagaaaaaaaaaataaaagttaagtgaaaatgagattgaagtgact- ttaggtgtgtataaatatatca accccgccaacaatttatttaatccaaatatattgaagtatattattccatagcctttatttatttatatattt- attatataaaagctttatttgttctaggtt gttcatgaaatatttttttggttttatctccgttgtaagaaaatcatgtgctttgtgtcgccactcactattgc- agctttttcatgcattggtcagattga cggttgattgtatttttgttttttatggttttgtgttatgacttaagtcttcatctctttatctcttcatcagg- tttgatggttacctaatatggtccatgggt acatgcatggttaaattaggtggccaactttgttgtgaacgatagaatttttttttatattaagtaaactattt- ttatattatgaaataataataaaaaa aatattttatcattattaacaaaatcatattagttaatttgttaactctataataaaagaaatactgtaacatt- cacattacatggtaacatctttccac cctttcatttgttttttgtttgatgactttttttcttgtttaaatttatttcccttcttttaaatttggaatac- attatcatcatatataaactaaaatactaa aaacaggattacacaaatgataaataataacacaaatatttataaatctagctgcaatatatttaaactagcta- tatcgatattgtaaaataaaactag ctgcattgatactgataaaaaaatatcatgtgctttctggactgatgatgcagtatacttttgacattgccttt- attttatttttcagaaaagctttctta gttctgggttcttcattatttgtttcccatctccattgtgaattgaatcatttgcttcgtgtcacaaatacatt- tagtttaggtacatgcattggtcagat tcacggtttattatgtcatgacttaagttcatggtagtacattacctgccacgcatgcattatattggttagat- ttgataggcaaatttggttgtcaa caatataaatataaataatgtttttatattacgaaataacagtgatcaaaacaaacagttttatctttattaac- aagattttgtttttgtttgatgacgttt tttaatgtttacgctttcccccttcttttgaatttagaacactttatcatcataaaatcaaatactaaaaaaat- tacatatttcataaataataacacaa atatttttaaaaaatctgaaataataatgaacaatattacatattatcacgaaaattcattaataaaaatatta- tataaataaaatgtaatagtagttat atgtaggaaaaaagtactgcacgcataatatatacaaaaagattaaaatgaactattataaataataacactaa- attaatggtgaatcatatcaa aataatgaaaaagtaaataaaatttgtaattaacttctatatgtattacacacacaaataataaataatagtaa- aaaaaattatgataaatatttacc atctcataaagatatttaaaataatgataaaaatatagattattttttatgcaactagctagccaaaaagagaa- cacgggtatatataaaaagagt acctttaaattctactgtacttcctttattcctgacgtttttatatcaagtggacatacgtgaagattttaatt- atcagtctaaatatttcattagcactta atacttttctgttttattcctatcctataagtagtcccgattctcccaacattgcttattcacacaactaacta- agaaagtcttccatagccccccaa gccctaggcgctatcaactttgtatagaaaagttgaagcatcacttcgacatcttcaagcatataatcttcttc- ataaccatcatctgagttaggg ataccggtcattgggtcgcaatcctttacaacaaacctcagagtagcagcaaagtttgtgacagtaagattagg- ggcatctggaaattgcag ggcaatataagtaggtgatgtttccgaacaggctaatctttggcaggggacctcagctacgatctggtaacctt- cgctcacttctaactgcact cgaactttctccagtagctggtcactcatggtatttgtacagtcaaactgaagaacaatatgatttttgaaaac- atgcttcgtgactctaacttgat attctgtctcagattccgtaagactgataggatcggaaccaactttattatacaaagttgatagatatcggtcc- gagatccatcaggtaagtttct gcttctacctttgatatatatataataattatcattaattagtagtaatataatatttcaaatatttttttcaa- aataaaagaatgtagtatatagcaattg cttttctgtagtttataagtgtgtatattttaatttataacttttctaatatatgaccaaaacatggtgatgtg- caggtccatggtggagctcgaccga tatctatcaactttgtataataaagttggttccgatcctatcagtcttacggaatctgagacagaatatcaagt- tagagtcacgaagcatgttttca aaaatcatattgttcttcagtttgactgtacaaataccatgagtgaccagctactggagaaagttcgagtgcag- ttagaagtgagcgaaggtta ccagatcgtagctgaggtcccctgccaaagattagcctgttcggaaacatcacctacttatattgccctgcaat- ttccagatgcccctaatctta ctgtcacaaactttgctgctactctgaggtttgttgtaaaggattgcgacccaatgaccggtatccctaactca- gatgatggttatgaagaagat tatatgcttgaagatgtcgaagtgatgcttcaacttttctat SEQ ID 18 nezvi_22408.WL.2 caacttcctaacgacgaggtagttcttggatatgtaatacgggagagccatccacttctttcactggtctgcta- agtagagaggatggccgac agcgaaggaggatccgagcaggacgatgtttcgttcctgaggacggaggatatggtgtgcctatcatgcacagc- aactggagagagagtt tgcttagcagctgagggctttggtaaccgtcactgttttctagaaaatattgctgataagaatataccaccaga- tctttcaacatgtgtatttgttat tgaacaagctctatcagtaagagcacttcaggagttagttacagcagctggatctgaagagggaaagggaactg- gatctggtcacaggact cttctttatggaaatgctatactactccggcaccaaaacagtgacatgtatctggcttgtttatctaccagttc- atcaaatgacaagctctcatttg atgttggtttacaagaacattcccaaggggaagcttgttggtggaccgtacaccctgcttctaaacagagatca- gaaggtgaaaaagtgaga gttggtgatgatttaattcttgtgtctgtagccactgaaagatatttgcatactgctaaagaaaacgatcaatc- tattgtaaatgcatctttccatgt aactcattggtctgttcagccttatggaactggtatcagcaaaatgaagtatgttggttatgtgttcggaggag- atgtgttaagatttttccatggt ggggatgaatgccttaccattccatcaacttggagtgaaacccctggacaaaatgtggtagtttatgaaggagg- gagtgttttgagtcaagct cgttcactttggagattggaactggctaggacaaaatggtctggtggtttcattaattggtatcatccaatgag- gatacgacatctcaccactg gtagatacttaggagttaatgaaaataatgaattacacctcgttgttagggaggaagccacaacagcattatct- acattcattttaagacaaga aaaagatgaccaaaaagtagtaatggaagataaggatttagaagtaataggagctccaataataaaatatggtg- acagtactgttttagtcca acattcagaaagtggtttatggttaacttataagtcattcgaaactaagaaaaaaggtgtgggtaaagtagaag- aaaaacaagctgtacttcat gaggagggaaaaatggatgatggattagactttagtagaagtcaagaagaagaatcaaggactgctagagtaat- aaggaaatgttcgtcac ttttcactcaatttattaggggtctagaaactctgcaaatgaatcgaagacattctctgttttgcgctagtgta- aatttaaatgaaatggtcatgtgt ttagaagatttaattaattactttgcccagcctgaggaagatatggaacatgaggaaaaacaaaaccggttaag- agctttgagaaacagaca agatttgttccaagaagaaggaattttaaatcttatcttagaagccattgataaaattaatgttataacatccc- aaggtttcttagtcagtttagctg gagatgagtctggacagagctgggatataatctcaggatatttgtatcaactgctagctgccatcataaaagga- aatcatactaattgtgctca gtttgctaacacaaatagattaaactggttatttagcagactaggttctcaagcttcaagtgagggcacaggta- tgttggatgtacttcattgcgt
cttaattgattctccagaagctttgaatatgatgagagatgaacatataaaagtaatcatttcactgctagaaa- aacatgggcgagatccaaga gttttagatgtactttgttcactttgtgttggtaatggtgtagcagtccgtagctcacaaaacaacatctgtga- tttccttctgccaggaaaaaactt gcttctacaaacgcaacttgtggatcatgttgccagtgtcaggccaaatatttttgtgggtcgagtcgaaggtt- ctgctgtttatcaaaaatggta ttttgaagtgactttagatcatatggagcaaaccacccatatgacaccgcatctaagaattggctgggctaaca- cttctggttatgttccctttcc tggcggtggtgaaaaatggggcggtaatggagttggtgatgatctctactcttttggttttgatggagctgcat- tatggacaggtggaagaaa aactgtagtccttcctcatgctatggaaccttacataagaaagggagatgttattggttgtgctttcgatctga- ctgttccaattattacatttacttt taatggaacattaatccgaggatcatttagggattttaatcttcaaggaatgttctttccagttataagctgtt- cctcaaaacttagttgtcgtttttta ctgggaggtgatcatggaagattaaaatatgcacctcctgaagaattttctcctctcgttgaaagtttgcttcc- tcaacaagtgctttctattgatc catgtttttattttggcaacctgaataaatgtgtattggctggtccttatcctgttgaagatgattgtgctttt- gttccagttccagttgacacatctat ggtaaatttacccgttcatgttgatacaatacgcgatcgtttagctgaaaacatccatgaaatgtgggctatga- ataaaattgaagcaggatgg atttatggagatgtaagagatgatataagaagaatacatccatgtcttgtgcaatttgaaaaactacctcctgc- agaaaagcgatatgacactc aacttgctgtacaaactttaaaaaccatcattgcactgggctaccatataacaatggaaaaaccaccatctaga- ataaagaacattcgtttgcc gaatgaaccatttttacaatctaatggttacaagccagctcctcttgatctcagtgccataacactaataccta- aaatggaggaacttgttgacc aactcgctgaaaatactcacaacttgtgggcaaaagaaagaatccaacaaggctggacctatggtcttaatgag- gatcctgatttgtcccga agtcctcacctcgtcccttacagtaaagttgatgatttaattaaaaaagccaacagggataccgcaagtgaaac- tgtcaggactcttcttgttta tggttataatttagaccctcctacaggtgaacaaactgaagctctcttagcagaagcaagccgtttgaagcaga- tgcagtttagaacctatcg ggctgaaaagacatatgcagtaaccagtggcaaatggtattttgaatttgaaattcttactgctgggccaatga- gagtaggttgggccattgct gattataatccaggttcccagatcggaagtgatgaagcatcctgggcatatgatggttataatgaggaaaaggt- ttattctggggttgctgaaa cgtttggaagacaatggcaagttggagacgttgtaggagtttttcttgatctattggatcatactattagtttc- tctctaaatggtgaactgcttatg gatgcacttgggggagaaacatcttttgcagatgttcagggagaaggatttgttccagcatttacacttggagt- aggacaaaaagcaaaatta gtgtttgggcaagatgttaactcacttaagttctttactacctgtggtttgcaagaaggttatgaacctttctg- tgtaaacatgaacagggcagtt accttttggtacaccaaagatcatcctatatttgaaaatactgatgattatattgatactaaaattgatgcaac- gcgtattcctgctggttctgaca caccaccatgtcttaaaattagtcataatacttttgagacaatggagaaagccaattgggaatttcttagactt- tctttacctgttcaatgtttacca tcattcataaatgaacaagaaaaagtacgtaggtggcaagaaataaggataagacaacacagacttcttgtgga- agctgaccaaaccactc ctgctcacattgaacagattatgaagtctggttttagtatgagtgatattaagggtcttcaaagaagttataca- gaagatggaatggaaggaga agaaggattggcaccaagctcatcaccacttacaaggactaagtcaaaagtgactccagctcgtccacctagga- aaggctccttaccacga aatggagatgttattaatatgaacgggacattagaaccaggtggaggaaaaatgaaccgttctaatagtgagct- tgatttccaacgtttcaatg gtgaaatgcccgatggcgataacaagaaaaagcgtgggagatctccatttaggttcttttcaagaaaaaagggg- gagcgtgatactagtgg agaaaatgcaaaaaatgtacatatgtctgagcctatgggtaatttccttgagcctccaaggactccaatgcagc- aaagaggtggaagtgctc tgcgttcttctcctcaacctaaagtacaggagttaactaagccaccatccccattagttgaaagaagtggaccc- aaagcaatgtctgtgcctgt tggaactggcatcgaaactattggaaatgaaatatttgatgtagagtgtttgaaattgattaatgaatacttct- acggtgtcaggatatttccagg tcaagacccaactcatgtatatgtcggttgggttacaactcaattccatctacgtagtaaagactttaatcaga- atcgagtgctaaagagcact gtagtagtatgtgatgaattcaatcgtgtaatagacagtattcagcggcagagttgttttatggtaagagctga- tgaattatacaatcaagtaact caggatgcctctggtaaaggtgcttcacaaggaatgtttattggatgtttcctggatactgctactggttatgt- gacgttcacatgtgaaggaaa agaaactaaccacaagtataagatggaacctgatacaaaattatttccagctatatttgttgaagctacaagca- aagaaattctacaaattgag cttggtcgtacatcaactacactgcctttatcagcagctgttctccaaaattcagaaagacatgtcattcctca- gtttccaccaagacttaaagtt cagtgtctaaaaccacatcagtgggcacgtgttcctaatatttcattgcatgtccacgctctgaaattatcaga- tataagaggttggagtatgctt tgtgaagatccagtttcaatgttagcattacatatacctgaagaagatagatgtattgatattttagaacttat- tgaaatggacaaactactttcatt ccatgctcatacattgacactttatgcagcactatgttaccaatccaattatcgtgcaggacatgttctctgca- aacatgtagaccaaaagcaac ttcagtatgctattaggtctgaattcatatctggatctttacgcttgggattttatgacctcttgattgcttta- cacattgaatcacatgcaacaacaa tggaagtttgtaaaaatgaattcataataccccttggtctagacttgaaagatttatatgaagatccagatatg- aagcacagcttacgatctttaa aaactgtctctattttacctcaaatgagtatgacagacattacggaaaatattgaaagcatcaatacattatat- agtccttattttcctcttgatgca gttaaggattatggaatgactgcattagaagaggctgtaagcatgaatcaacttcacaatagagaccctgtagg- tggttcaaatgaaaacttg tttctacccttgttgaaactggtagatagattattgcttgttgggatactacgagatgaagatgttacaaagct- actaattatgtttgatcctgaaac ttgggattcaaattttgaaaaggatggcaaagatgaacatcgtaagggtttacttcaaatgaaaatggcagagg- gggcaaaactacagatgt gctatctcttacagcatttatgcgatatacaattgcggcatcgggttgaagccattattaattttagttatgac- tatattgctgatcttcagcaggat cagttgagaagatatgttgatattaagcagtctgatcttccatcatcagttgctgcaagaaaaacaagagagtt- tcgttgccctccaagagaac agatgaatgctatcataaattttaaaaatttagaagaagatgacaaagaaaactgtccatgtggtgaagaactg- agggagagattaaacacat ttcatgaagaaactatgagtaaagtttcacttgttgctctccaagagccacaagaagatgagaacggtgaaaca- ccagaaaagccgggtgtt ttcaaaaaattatacaattttattaatgctgttaaagaattggaagaacctcctaaaatagaagaagaacctgt- taagaaaactcctgaagaaat atttagaaaagtattaattagtacaattgttagatgggctgaagaatcccagattgaaacaccaaaattagtca- gagaaatgttcagtctattgg taaggcagtacgacactgtaggtgaattaatcagatctcttggaaacacttatgtgataaatgacaaaacgaaa- gaagatgtagctcagatgt gggtagggttgagccagatcagagctctcctacctgttcaaatgtctcaagatgaagaaggtcttatgcgaatg- aggctatggaaattagtta acaatcacacattctttcaacatcctgatttgattagagttcttcgtgttcatgaaaatgttatggctgttatg- atcaataccttgggtagaagatca caagcacaatctgatgcttctcaagctggtcaagaaggtgaacctgcagctaaggagaaagatacgtcccatga- aatggtggtagcatgtt gtcgtttcctgtgttatttttgcagaacttcacgtcaaaatcagaaagcaatgtttgaccatttaacattttta- ttagaaaacagtaatattttactttc aagaccttcacttagaggaagtacccctcttgatgttgcctattcctctctcatggaaaataccgaactggcat- tagctcttagagaacattattt agagaagatagctgtttacttgtctcgctgtggattacaatctaattcagaattggtagaaaagggttaccctg- atttgggttgggatccagttg agggagaaagatatttagactttttacgcttctgtgtttgggttaacggtgaaagtgttgaagaaaatgcaaat- ctggttatacggctccttatac gtcgaccagaatgtttgggtcctgcacttcgtggagaaggtgaaggattactgagagcaattatagatgctaat- aagatgtctgaaagaattt cagatcgcagaaaaatgatggaggaacctgaaaattctgcccatcatcagtttgaacatccacttcctgagtct- gatgaagatgaggactata ttgatacaggagcagcaatactggcattctattgtactctggtcgatcttttaggtcgctgtgctccagatgct- agtgtgattgctcagggaaag aatgagtctcttagagctagagctattttgagatctttagtacctcttgaagatttatttggtgtcttgagttt- aaagtttacacttaccaatccagct attggagaagaaaggccaaaaagtgatataccatctggtctaataccatctcataagcaaagtattgttttatt- tttagagagagtatatggtatt gaacagcaagatctcttcttcagattactcgaggaagcatttttacctgatttaagagcagcaactatgctaga- tagaactgatggttctgaatc agaaatggcattagctatgaatcgctatattggaaattctattctccctttgttgataaagcattaccagtttt- atagtggtgcagataactatgca agtcttttagatgctacacttcatacagtgtatcgcctatcaaaaaatcgaatgctaactaaaggtcagcgaga- ggcagtatcagattttttggtt gctctcacaagtcaattacagccaagcatgttactcaaacttcttcgaaagttaacagttgatgtatcaaagct- ttctgagtataccacagtcgct ttaaggttgcttactttacactatgagcgttgtgcaaaatattatggaactactggaggacaagctggtggatc- tagtgatgaagaaaaaaggc tcactatgttactcttcagtaatatttttgattctttatcaaaaatggattatgatcctgaattatttggaaaa- gcgcttccctgcttgagtgctatagg atgtgcacttccacccgattattcactgtccaagaattatgatgaagaatggtatagctcaaagggttcagaac- cgactgatgggccttataat ccactgcccatcaatacttctatggtttctctaaataatgatttaaacacaattgttcaaaaattttctgaaca- ttatcatgatgcatgggctagtcg aaaaatggaaaatggttgggtatatggcgaacagtggtctgacagctctaaaactcatcctcgtttaaaacctt- atacattgcttaatgattatg aaaaagagagatacaaagaaccggttagagagtcattgaaagctctgttagctataggatggaatgtagagcat- actgaagttgatattcctt ctaataacagaggatcatcagtcagaagatcttctaaagcaaatacatctgatggttcaacaccatttaattat- catcccaacccaattgatatg actaatttaacattgagtagagaaatgcaaaatatggcagagaggttagctgaaaactcacatgatatttgggc- aaaaaagaagaaagaag aacttgtttcatgtggtggtggtatacacccacagcttgttccatatgatcttttaacagacaaagagaagagg- aaagatagagaaagatctca agaatttttgaaatatttacaatatcaaggatacaaactccacaggcctactcgaggaagtgctgatgagcaac- aggccgctgcagctgctg ccacaggagagtccagatttgcttacagtctactcgagaaacttatacaatatactgataaagcttctattaat- atgaaactactaaagccttctg gtacattcagtagacgctccagttttaaaacttgttcaagagacataaaattcttttccaaagtggtattgcta- ttggttgagaagtatttcagcac tcacagaaattacttcattgctgttgccactgcttctaataatgtaggagcagcctctttaaaagaaaaagaaa- tggttgccagtttgttctgtaa gctggcaaatttaattcgaacaaagctggctgcttttggtgcagatgttcgaattactgtccgttgtctacaag- tgctagtgaaagctatagatg ccaagtcattggtaaagaattgtcctgaatttataaggacttcaatgctgacatttttcaataatacagctgat- gacttaggccaaactattcagt gtttgcaagagggtcgttacagtcaccttagaggcactcatcttaaaacatctacttctttattttatataaat- gatgttgtactacctgttctcactt ctatgtttgatcatttggctgtgtgtgattatggtagcgacttgttacttgatgaaattcaagtggcctcatat- agaatgttgggtagtttatataattt aggaattgatccaactttaactcatgacagaaaatatttaaaaacagaaattgaaaggcataggcctgccattg- gtgcttgtcttggtgcatttt catcaacatttccagtcgcttatcttgaaccccatttaaataaacataatcagttttcattagttaatagaatt- gctgaacattctcttgaagcacag gatattctagctagaatggaaaacaccatgcctacattggatgcgatcctttctgaagttgatcagttcattga- atccgaaaagagtcatacttc agcaccacatgttattgatgtgattttgcctctgctttgtgcttatttgccaagttggtggagtcaaggtcctg- ataatgtcagtctcacagcagg gaattatgtaacaatggttactagtgatcatatgaatcaactcctaaaaaatgtactaaaattaatcaaaaata- atattggaaatgaaaatgctcc ctggatgacgagaatagcagcttacacccagcagatcatcataaactcttctgaagaactgttgaaagatccat- tccttccattaacacaagtt gttaagaagaggatagacaatatgtttcaccgtgaagaatctcttcgaggatttctaaaatcttcaactgaaga- tacctctcaagttgaagcag aaattcaggagggctggcatcttattgttagagatatatattctttttatccactactaattaaatatgttgat- ttacaaagaaatcactggttacgta ataatattccggaagctgaatacttgtatactcatgttgctgatatatttaatatttggtctaaatcacagtac- tttctaaaagaagaacagaatttc atatctgccaacgaaatagacaatatggctctaattatgcccactgcaactaggagatctgcagttgttttgga- tggaacagctcctgctggag gtggaaagaagaaaaagaagcatcgtgataagaaaagagataagaataaagaaatccaagcaagcttaatggta- gcttgcttaaaacgttt attaccagttggtcttaacctattcgctggaagagaacaagagttagttcagcattgtaaagacagatatttga- agaaaatgccagaatatgaa atagtggattttgccaaaatccaattaactcttcctgacaagatagatcctggagatgagatgtcttggcagca- ttatttgtactcaaaactggg aaataaaaaagatatcagctctgaaaaaccacagcaaatcgatgaggtagttgataggattgtggctatggcaa- aagttctttttgggcttcat atgattgatcatccacaactacagagcaagacacaatacagatctgttgtatccacacagagaaagcgtgctgt- catagcttgtttccggcaa ctatcactacatgccttaccaagcatgcaaataaacctccacctcaccaatctggatggaaaagagttctttca- gcagcgagaaaacgggct gctattgcttgtcttagaactcaacctttgtatacccttccaaggcatcgagtaattaacatatttgctcgcgc- ttattgtgagctgtggctgcaag aagagaatgttggtcaagaaatcatgattgaagatcttacacaaacttttgaagatgctgaattgaaaaaaaga- gattctgaagaagatgaaa gcaaacctgatccacttacccaattagttacaacattttgtcggggtgcaatgactgaaaggagtggagctttg- caagaagacccactttatat gtcctatgcagaaattactgcaaaatcatgtggagaagaagaagaagaaggtggagatgaggaagaaggtggag- acgaagaaggagg ggcatctatccatgaacaagaaatggaaaaacagaaactcttattccatcaagctcggctagccaacagaggtg- ttgcagaaatggtattgtt acatatttcagcttgtaaaggtgttcccagtgaaatggttatgaaaactctccagctgggtatttctgttttac- gtggtggtaatcttgatattcaaa tgggtatgctaaatcatttgaaagaaaaaaaggatgttggattttttacttctatagctggcttgatgaactcc- tgcagtgtgttggatttagatgc atttgaaagaaacacaaaagctgaaggcttaggagttggttcagaaggtgctgctggtgaaaagaacatgcatg- atgctgaattcacctgta ctcttttcagatttattcaacttacctgtgaagggcataacttagaatggcagaattatcttagaacccaagct- ggaaatacaacaacagttaat gttgttatttgtactgttgattaccttttgagattacaggaatcaattatggacttctattggcactattcgag- taaagaattaattgatcctgctgga aaagccaactttttcaaagcaattggtgtggctagtcaagtatttaatacactctctgaagtaattcaagggcc- ttgcccacaaaatcaacaag ctctggctcattcaagattgtgggatgctgttggaggatttttgtttcttttctctcatatgcaagataagcta- tcaaaacattctagtcaagtagac ttactgaaagaacttttgaatttacagaaagatatgataacaatgatgctatcaatgttggaaggtaatgttgt- gaatggtactattggaaaacag atggtagacacattagttgaatctgcctcaaatgtggaattgattttgaagtacttcgacatgtttttgaaatt- gaaagatttgacatcctctgctag atcttggaacttgatccaaaccatgaaggctgggtaacacctaaagattttaaagaaaaaatggaacagcagaa- aagttatactccagaag aaatagacttcatgttacagtgctgtgaaaccaatcatgacggtaaaattgactatgttggcttcacggataga- ttccatgagccggccaagg aaattggttttaacctagctgttcttctcacaaatttatctgagcatatgccaaatgaaccgagacttgctcgc- tttttagaaacagctggtagtgt tcttaactactttgaacctttcctgggacgaattgaaatattaggtagtagtaaacgaatcgagcgtgtatatt- tcgagattaaagaatcaaatatt gaacagtgggaaaaacctcaaatcaaggaatctaaacgagcatttttctattcaattgtcactgaaggaggtga- caaagaaaaattggaagc ttttgttaatttttgtgaagatgccatatttgagatgacacatgccagtgggcttatggcaactgatgatggta- caggctctggaggaggaaaa caaagagcatcctcttattcttatatggaagatgaagatgaagaaaggaatccaatcagacgtggttggcaagc- aactaaagatggaatttac tttatgttctcaatgttatctcctagcaatattaaacataaaattattgaaatgcaacaaatgtcaattattga- actaatgattggttttataaaactatt tttctacatgttttattactcaggatattctgtatcagttgtactgaagtatattggtggtattatattttcat- tgatgaggggaccacaaattgaaga gccagttgtagaagttaaagaggaagaaaaatctggacctctgaggataatgcctgctttgccaccacctgaag- atagctctctgcttccatc tgatgggtcaagagacatgaaaaaagaagacagtcagcctccatcaaaagtcatagaaggggctattcccatag-
aagaaggaggtgaga ggagctcagaggaacatgcgggagaccatgtaaaaccagaaaatgaagagcaacctccaacaccaacacttgct- gatatattgggtgga gaagcagcaagaaaagaagcagcacaaagagcagaagtcgctgctgaacaagaagcagttatggctgcttttga- ggcagaatctaaaat agaaaaagtttcagagccttctgctgtctctcaaattgattttaacaagtatactcaccgggctgtcagtttcc- ttgctcgtaatttctataatcttaa atatgtagcattggttttggctttctgcattaactttattttattgttctacaaggtaacaacattgggtgaag- atgatgatgctgctagcggagaa gggagtgttgaacaactaatggaagaattaacaggcgaaggtgatgatgtgagtggcggaggaagtagtggtgg- agaaagtggtgaaga ggatccaattgaaatggttcatgtggatgaggatttcttttatatggcacatgttatgcgattggctgcaatcc- tacattctcttgtttctttagctat gttgattgcatattatcatttgaaggtccctctagctatattcaagagagaaaaagaaatagctcgtcgacttg- agtttgatggtttgtacattgct gagcaaccagaagatgatgatattaaatcacattgggataaactggttatctgtgcaaaatcatttcctgttaa- ttactgggataaatttgtgaag aaaaaggttcgacagaaatacagtgaaacttatgactttgattcaataagtaatcttttgggaatggaaaaaac- atctttcagtgcccaagatac tgaagaaggatcgggacttattcattacattttgaactttgactggaggtatcagctttggaaagcaggagtca- caatcacagataatgcatttt tgtacagtttattatacttcatcttttcaattttgggaaacttcaataactttttctttgctgcccatttactt- gatgttgcagttggttttaaaacattgag gactattttgcaatcagtcacacacaatggaaaacagcttgtattgactgtaatgctgctaaccatcatagtat- acatctatactgtcattgctttc aacttcttccgaaaattttatgtccaagaagaggatgaggaagtggataaaaaatgccacgatatgttaacttg- ttttgtattccacctttacaaa ggagttagagctggtggtggtattggtgatgagattgaacctcctgatggtgatgattatgaagtttacaggat- aatgtttgatattacgtttttct tttttgttattgtcatcttgctagccatcattcaaggtttgatcattgatgcatttggtgaattgagagatcag- ttagaaagtgtaaaagaagacatg gaatctaactgcttcatttgtgggataggaaaagattattttgataaagttccccatggttttgacactcatgt- tcaacaagaacataacttggcta attacatgttctttcttatgcatctgattaacaagccagatactgaatacacaggtcaagaaacctatgtctgg- aacatgtatcagcaacgttgtt gggatttcttcccagttggtgactgttttcgtaaacagtatgaagatgaactgggaggtggtggtggttaattc- atttgggtgggtggtggctaa atttatattattaaaacaaaattaatgctgggaactatcaaacatccttcaattttattaaaatttcagctaaa- ttcaacaatatatcttatgatattgta tttgtctaatgaaggaatagaactatcgtgttatgaatcagtgaagttttcacttgtttagcataatttatgct- aagtttactattgcaaaatactttctt tatatccgaaaatgttgtaaaataaatgtaaatggtgtggccttaaatataatg SEQ ID 19 nezvi_22408.WL.3 caacttcctaacgacgaggtagttcttggatatgtaatacgggagagccatccacttctttcactggtctgcta- agtagagaggatggccgac agcgaaggaggatccgagcaggacgatgtttcgttcctgaggacggaggatatggtgtgcctatcatgcacagc- aactggagagagagtt tgcttagcagctgagggctttggtaaccgtcactgttttctagaaaatattgctgataagaatataccaccaga- tctttcaacatgtgtatttgttat tgaacaagctctatcagtaagagcacttcaggagttagttacagcagctggatctgaagagggaaagggaactg- gatctggtcacaggact cttctttatggaaatgctatactactccggcaccaaaacagtgacatgtatctggcttgtttatctaccagttc- atcaaatgacaagctctcatttg atgttggtttacaagaacattcccaaggggaagcttgttggtggaccgtacaccctgcttctaaacagagatca- gaaggtgaaaaagtgaga gttggtgatgatttaattcttgtgtctgtagccactgaaagatatttgcatactgctaaagaaaacgatcaatc- tattgtaaatgcatctttccatgt aactcattggtctgttcagccttatggaactggtatcagcaaaatgaagtatgttggttatgtgttcggaggag- atgtgttaagatttttccatggt ggggatgaatgccttaccattccatcaacttggagtgaaacccctggacaaaatgtggtagtttatgaaggagg- gagtgttttgagtcaagct cgttcactttggagattggaactggctaggacaaaatggtctggtggtttcattaattggtatcatccaatgag- gatacgacatctcaccactg gtagatacttaggagttaatgaaaataatgaattacacctcgttgttagggaggaagccacaacagcattatct- acattcattttaagacaaga aaaagatgaccaaaaagtagtaatggaagataaggatttagaagtaataggagctccaataataaaatatggtg- acagtactgttttagtcca acattcagaaagtggtttatggttaacttataagtcattcgaaactaagaaaaaaggtgtgggtaaagtagaag- aaaaacaagctgtacttcat gaggagggaaaaatggatgatggattagactttagtagaagtcaagaagaagaatcaaggactgctagagtaat- aaggaaatgttcgtcac ttttcactcaatttattaggggtctagaaactctgcaaatgaatcgaagacattctctgttttgcgctagtgta- aatttaaatgaaatggtcatgtgt ttagaagatttaattaattactttgcccagcctgaggaagatatggaacatgaggaaaaacaaaaccggttaag- agctttgagaaacagaca agatttgttccaagaagaaggaattttaaatcttatcttagaagccattgataaaattaatgttataacatccc- aaggtttcttagtcagtttagctg gagatgagtctggacagagctgggatataatctcaggatatttgtatcaactgctagctgccatcataaaagga- aatcatactaattgtgctca gtttgctaacacaaatagattaaactggttatttagcagactaggttctcaagcttcaagtgagggcacaggta- tgttggatgtacttcattgcgt cttaattgattctccagaagctttgaatatgatgagagatgaacatataaaagtaatcatttcactgctagaaa- aacatgggcgagatccaaga gttttagatgtactttgttcactttgtgttggtaatggtgtagcagtccgtagctcacaaaacaacatctgtga- tttccttctgccaggaaaaaactt gcttctacaaacgcaacttgtggatcatgttgccagtgtcaggccaaatatttttgtgggtcgagtcgaaggtt- ctgctgtttatcaaaaatggta ttttgaagtgactttagatcatatggagcaaaccacccatatgacaccgcatctaagaattggctgggctaaca- cttctggttatgttccctttcc tggcggtggtgaaaaatggggcggtaatggagttggtgatgatctctactcttttggttttgatggagctgcat- tatggacaggtggaagaaa aactgtagtccttcctcatgctatggaaccttacataagaaagggagatgttattggttgtgctttcgatctga- ctgttccaattattacatttacttt taatggaacattaatccgaggatcatttagggattttaatcttcaaggaatgttctttccagttataagctgtt- cctcaaaacttagttgtcgtttttta ctgggaggtgatcatggaagattaaaatatgcacctcctgaagaattttctcctctcgttgaaagtttgcttcc- tcaacaagtgctttctattgatc catgtttttattttggcaacctgaataaatgtgtattggctggtccttatcctgttgaagatgattgtgctttt- gttccagttccagttgacacatctat ggtaaatttacccgttcatgttgatacaatacgcgatcgtttagctgaaaacatccatgaaatgtgggctatga- ataaaattgaagcaggatgg atttatggagatgtaagagatgatataagaagaatacatccatgtcttgtgcaatttgaaaaactacctcctgc- agaaaagcgatatgacactc aacttgctgtacaaactttaaaaaccatcattgcactgggctaccatataacaatggaaaaaccaccatctaga- ataaagaacattcgtttgcc gaatgaaccatttttacaatctaatggttacaagccagctcctcttgatctcagtgccataacactaataccta- aaatggaggaacttgttgacc aactcgctgaaaatactcacaacttgtgggcaaaagaaagaatccaacaaggctggacctatggtcttaatgag- gatcctgatttgtcccga agtcctcacctcgtcccttacagtaaagttgatgatttaattaaaaaagccaacagggataccgcaagtgaaac- tgtcaggactcttcttgttta tggttataatttagaccctcctacaggtgaacaaactgaagctctcttagcagaagcaagccgtttgaagcaga- tgcagtttagaacctatcg ggctgaaaagacatatgcagtaaccagtggcaaatggtattttgaatttgaaattcttactgctgggccaatga- gagtaggttgggccattgct gattataatccaggttcccagatcggaagtgatgaagcatcctgggcatatgatggttataatgaggaaaaggt- ttattctggggttgctgaaa cgtttggaagacaatggcaagttggagacgttgtaggagtttttcttgatctattggatcatactattagtttc- tctctaaatggtgaactgcttatg gatgcacttgggggagaaacatcttttgcagatgttcagggagaaggatttgttccagcatttacacttggagt- aggacaaaaagcaaaatta gtgtttgggcaagatgttaactcacttaagttctttactacctgtggtttgcaagaaggttatgaacctttctg- tgtaaacatgaacagggcagtt accttttggtacaccaaagatcatcctatatttgaaaatactgatgattatattgatactaaaattgatgcaac- gcgtattcctgctggttctgaca caccaccatgtcttaaaattagtcataatacttttgagacaatggagaaagccaattgggaatttcttagactt- tctttacctgttcaatgtttacca tcattcataaatgaacaagaaaaagtacgtaggtggcaagaaataaggataagacaacacagacttcttgtgga- agctgaccaaaccactc ctgctcacattgaacagattatgaagtctggttttagtatgagtgatattaagggtcttcaaagaagttataca- gaagatggaatggaaggaga agaaggattggcaccaagctcatcaccacttacaaggactaagtcaaaagtgactccagctcgtccacctagga- aaggctccttaccacga aatggagatgttattaatatgaacgggacattagaaccaggtggaggaaaaatgaaccgttctaatagtgagct- tgatttccaacgtttcaatg gtgaaatgcccgatggcgataacaagaaaaagcgtgggagatctccatttaggttcttttcaagaaaaaagggg- gagcgtgatactagtgg agaaaatgcaaaaaatgtacatatgtctgagcctatgggtaatttccttgagcctccaaggactccaatgcagc- aaagaggtggaagtgctc tgcgttcttctcctcaacctaaagtacaggagttaactaagccaccatccccattagttgaaagaagtggaccc- aaagcaatgtctgtgcctgt tggaactggcatcgaaactattggaaatgaaatatttgatgtagagtgtttgaaattgattaatgaatacttct- acggtgtcaggatatttccagg tcaagacccaactcatgtatatgtcggttgggttacaactcaattccatctacgtagtaaagactttaatcaga- atcgagtgctaaagagcact gtagtagtatgtgatgaattcaatcgtgtaatagacagtattcagcggcagagttgttttatggtaagagctga- tgaattatacaatcaagtaact caggatgcctctggtaaaggtgcttcacaaggaatgtttattggatgtttcctggatactgctactggttatgt- gacgttcacatgtgaaggaaa agaaactaaccacaagtataagatggaacctgatacaaaattatttccagctatatttgttgaagctacaagca- aagaaattctacaaattgag cttggtcgtacatcaactacactgcctttatcagcagctgttctccaaaattcagaaagacatgtcattcctca- gtttccaccaagacttaaagtt cagtgtctaaaaccacatcagtgggcacgtgttcctaatatttcattgcatgtccacgctctgaaattatcaga- tataagaggttggagtatgctt tgtgaagatccagtttcaatgttagcattacatatacctgaagaagatagatgtattgatattttagaacttat- tgaaatggacaaactactttcatt ccatgctcatacattgacactttatgcagcactatgttaccaatccaattatcgtgcaggacatgttctctgca- aacatgtagaccaaaagcaac ttcagtatgctattaggtctgaattcatatctggatctttacgcttgggattttatgacctcttgattgcttta- cacattgaatcacatgcaacaacaa tggaagtttgtaaaaatgaattcataataccccttggtctagacttgaaagatttatatgaagatccagatatg- aagcacagcttacgatctttaa aaactgtctctattttacctcaaatgagtatgacagacattacggaaaatattgaaagcatcaatacattatat- agtccttattttcctcttgatgca gttaaggattatggaatgactgcattagaagaggctgtaagcatgaatcaacttcacaatagagaccctgtagg- tggttcaaatgaaaacttg tttctacccttgttgaaactggtagatagattattgcttgttgggatactacgagatgaagatgttacaaagct- actaattatgtttgatcctgaaac ttgggattcaaattttgaaaaggatggcaaagatgaacatcgtaagggtttacttcaaatgaaaatggcagagg- gggcaaaactacagatgt gctatctcttacagcatttatgcgatatacaattgcggcatcgggttgaagccattattaattttagttatgac- tatattgctgatcttcagcaggat cagttgagaagatatgttgatattaagcagtctgatcttccatcatcagttgctgcaagaaaaacaagagagtt- tcgttgccctccaagagaac agatgaatgctatcataaattttaaaaatttagaagaagatgacaaagaaaactgtccatgtggtgaagaactg- agggagagattaaacacat ttcatgaagaaactatgagtaaagtttcacttgttgctctccaagagccacaagaagatgagaacggtgaaaca- ccagaaaagccgggtgtt ttcaaaaaattatacaattttattaatgctgttaaagaattggaagaacctcctaaaatagaagaagaacctgt- taagaaaactcctgaagaaat atttagaaaagtattaattagtacaattgttagatgggctgaagaatcccagattgaaacaccaaaattagtca- gagaaatgttcagtctattgg taaggcagtacgacactgtaggtgaattaatcagatctcttggaaacacttatgtgataaatgacaaaacgaaa- gaagatgtagctcagatgt gggtagggttgagccagatcagagctctcctacctgttcaaatgtctcaagatgaagaaggtcttatgcgaatg- aggctatggaaattagtta acaatcacacattctttcaacatcctgatttgattagagttcttcgtgttcatgaaaatgttatggctgttatg- atcaataccttgggtagaagatca caagcacaatctgatgcttctcaagctggtcaagaaggtgaacctgcagctaaggagaaagatacgtcccatga- aatggtggtagcatgtt gtcgtttcctgtgttatttttgcagaacttcacgtcaaaatcagaaagcaatgtttgaccatttaacattttta- ttagaaaacagtaatattttactttc aagaccttcacttagaggaagtacccctcttgatgttgcctattcctctctcatggaaaataccgaactggcat- tagctcttagagaacattattt agagaagatagctgtttacttgtctcgctgtggattacaatctaattcagaattggtagaaaagggttaccctg- atttgggttgggatccagttg agggagaaagatatttagactttttacgcttctgtgtttgggttaacggtgaaagtgttgaagaaaatgcaaat- ctggttatacggctccttatac gtcgaccagaatgtttgggtcctgcacttcgtggagaaggtgaaggattactgagagcaattatagatgctaat- aagatgtctgaaagaattt cagatcgcagaaaaatgatggaggaacctgaaaattctgcccatcatcagtttgaacatccacttcctgagtct- gatgaagatgaggactata ttgatacaggagcagcaatactggcattctattgtactctggtcgatcttttaggtcgctgtgctccagatgct- agtgtgattgctcagggaaag aatgagtctcttagagctagagctattttgagatctttagtacctcttgaagatttatttggtgtcttgagttt- aaagtttacacttaccaatccagct attggagaagaaaggccaaaaagtgatataccatctggtctaataccatctcataagcaaagtattgttttatt- tttagagagagtatatggtatt gaacagcaagatctcttcttcagattactcgaggaagcatttttacctgatttaagagcagcaactatgctaga- tagaactgatggttctgaatc agaaatggcattagctatgaatcgctatattggaaattctattctccctttgttgataaagcattaccagtttt- atagtggtgcagataactatgca agtcttttagatgctacacttcatacagtgtatcgcctatcaaaaaatcgaatgctaactaaaggtcagcgaga- ggcagtatcagattttttggtt gctctcacaagtcaattacagccaagcatgttactcaaacttcttcgaaagttaaccgttgatgtatcaaagct- ttctgagtataccacagttgct ttaaggttgcttactttacactatgagcgttgtgcaaaatattatggaactactggtggacaagctggtggatc- tagtgatgaagaaaaaaggc tcactatgttactcttcagtaatatttttgattctttatcaaaaatggattatgatcctgaattatttggaaaa- gcgcttccctgcttgagtgctatagg atgtgcacttccacccgattattcactgtccaagaattatgatgaagaatggtatagttcaaagggttcagaac- cgactgatgggccttataatc cactgcccatcaatacttctatggtttctctaaataatgatttaaacacaattgttcaaaaattttctgaacat- tatcatgatgcatgggctagtcga aaaatggaaaatggttgggtatatggtgagcagtggtctgacagctctaaaactcatcctcgtttaaaacctta- tacattgcttaatgattatgaa aaagagagatacaaagaaccggttagagagtcattgaaagctctgttagctataggatggaatgtagagcatac- tgaagttgatattccttct aataacagaggatcatcagtcagaagatcttctaaagcaaatacatctgatggttcaacaccatttaattatca- tcccaacccaattgatatgac taatttaacattgagtagagaaatgcaaaatatggcagagaggttagctgaaaactcacatgatatttgggcaa- aaaagaagaaagaagaa cttgtttcatgtggtggtggtatacacccacagcttgttccatatgatcttttaacagacaaagagaagaggaa- agatagagaaagatctcaag aatttttgaaatatttacaatatcaaggatacaaactccacaggcctactcgaggaagtgctgatgagcaacag- gccgctgcagctgctgcc acaggagagtccagatttgcttacagtctactcgagaaacttatacaatatactgataaagcttctattaatat- gaaactactaaagccttctggt acattcagtagacgctccagttttaaaacttgttcaagagacataaaattcttttccaaagtggtattgctatt- ggttgagaagtatttcagcactc acagaaattacttcattgctgttgccactgcttctaataatgtaggagcagcctctttaaaagaaaaagaaatg- gttgccagtttgttctgtaagc tggcaaatttaattcgaacaaagctggctgcttttggtgcagatgttcgaattactgtccgttgtctacaagtg- ctagtgaaagctatagatgcc aagtcattggtaaagaattgtcctgaatttataaggacttcaatgctgacatttttcaataatacagctgatga- cttaggccaaactattcagtgttt gcaagagggtcgttacagtcaccttagaggcactcatcttaaaacatctacttctttattttatataaatgatg- ttgtactacctgttctcacttctat
gtttgatcatttggctgtgtgtgattatggtagcgacttgttacttgatgaaattcaagtggcctcatatagaa- tgttgggtagtttatataatttagg aattgatccaactttaactcatgacagaaaatatttaaaaacagaaattgaaaggcataggcctgccattggtg- cttgtcttggtgcattttcatc aacatttccagtcgcttatcttgaaccccatttaaataaacataatcagttttcattagttaatagaattgctg- aacattctcttgaagcacaggata ttctagctagaatggaaaacaccatgcctacattggatgcgatcctttctgaagttgatcagttcattgaatcc- gaaaagagtcatacttcagca ccacatgttattgatgtgattttgcctctgctttgtgcttatttgccaagttggtggagtcaaggtcctgataa- tgtcagtctcacagcagggaatt atgtaacaatggttactagtgatcatatgaatcaactcctaaaaaatgtactaaaattaatcaaaaataatatt- ggaaatgaaaatgctccctgg atgacgagaatagcagcttacacccagcagatcatcataaactcttctgaagaactgttgaaagatccattcct- tccattaacacaagttgttaa gaagaggatagacaatatgtttcaccgtgaagaatctcttcgaggatttctaaaatcttcaactgaagatacct- ctcaagttgaagcagaaatt caggagggctggcatcttattgttagagatatatattctttttatccactactaattaaatatgttgatttaca- aagaaatcactggttacgtaataat attccggaagctgaatacttgtatactcatgttgctgatatatttaatatttggtctaaatcacagtactttct- aaaagaagaacagaatttcatatct gccaacgaaatagacaatatggctctaattatgcccactgcaactaggagatctgcagttgttttggatggaac- agctcctgctggaggtgg aaagaagaaaaagaagcatcgtgataagaaaagagataagaataaagaaatccaagcaagcttaatggtagctt- gcttaaaacgtttattac cagttggtcttaacctattcgctggaagagaacaagagttagttcagcattgtaaagacagatatttgaagaaa- atgccagaatatgaaatagt ggattttgccaaaatccaattaactcttcctgacaagatagatcctggagatgagatgtcttggcagcattatt- tgtactcaaaactgggaaata aaaaagatatcagctctgaaaaaccacagcaaatcgatgaggtagttgataggattgtggctatggcaaaagtt- ctttttgggcttcatatgatt gatcatccacaactacagagcaagacacaatacagatctgttgtatccacacagagaaagcgtgctgtcatagc- ttgtttccggcaactatca ctacatgccttaccaaggcatcgagtaattaacatatttgctcgcgcttattgtgagctgtggctgcaagaaga- gaatgttggtcaagaaatca tgattgaagatcttacacaaacttttgaagatgctgaattgaaaaaaagagattctgaagaagatgaaagcaaa- cctgatccacttacccaatt agttacaacattttgtcggggtgcaatgactgaaaggagtggagctttgcaagaagacccactttatatgtcct- atgcagaaattactgcaaaa tcatgtggagaagaagaagaagaaggtggagatgaggaagaaggtggagacgaagaaggaggggcatctatcca- taagacaatggca aaattagtggaacaagaaatggaaaaacagaaactcttattccatcaagctcggctagccaacagaggtgttgc- agaaatggtattgttacat atttcagcttgtaaaggtgttcccagtgaaatggttatgaaaactctccagctgggtatttctgttttacgtgg- tggtaatcttgatattcaaatgg gtatgctaaatcatttgaaagaaaaaaaggatgttggattttttacttctatagctggcttgatgaactcctgc- agtgtgttggatttagatgcattt gaaagaaacacaaaagctgaaggcttaggagttggttcagaaggtgctgctggtgaaaagaacatgcatgatgc- tgaattcacctgtactct tttcagatttattcaacttacctgtgaagggcataacttagaatggcagaattatcttagaacccaagctggaa- atacaacaacagttaatgttgt tatttgtactgttgattaccttttgagattacaggaatcaattatggacttctattggcactattcgagtaaag- aattaattgatcctgctggaaaag ccaactttttcaaagcaattggtgtggctagtcaagtatttaatacactctctgaagtaattcaagggccttgc- ccacaaaatcaacaagctctg gctcattcaagattgtgggatgctgttggaggatttttgtttcttttctctcatatgcaagataagctatcaaa- acattctagtcaagtagacttact gaaagaacttttgaatttacagaaagatatgataacaatgatgctatcaatgttggaaggtaatgttgtgaatg- gtactattggaaaacagatgg tagacacattagttgaatctgcctcaaatgtggaattgattttgaagtacttcgacatgtttttgaaattgaaa- gatttgacatcctctgctagcttct tggaacttgatccaaaccatgaaggctgggtaacacctaaagattttaaagaaaaaatggaacagcagaaaagt- tatactccagaagaaat agacttcatgttacagtgctgtgaaaccaatcatgacggtaaaattgactatgttggcttcacggatagattcc- atgagccggccaaggaaatt ggttttaacctagctgttcttctcacaaatttatctgagcatatgccaaatgaaccgagacttgctcgcttttt- agaaacagctggtagtgttctta actactttgaacctttcctgggacgaattgaaatattaggtagtagtaaacgaatcgagcgtgtatatttcgag- attaaagaatcaaatattgaa cagtgggaaaaacctcaaatcaaggaatctaaacgagcatttttctattcaattgtcactgaaggaggtgacaa- agaaaaattggaagctttt gttaatttttgtgaagatgccatatttgagatgacacatgccagtgggcttatggcaactgatgatggtacagg- ctctggaggaggaaaacaa agagcatcctcttattcttatatggaagatgaagatgaagaaaggaatccaatcagacgtggttggcaagcaac- taaagatggaatttacttta tgttctcaatgttatctcctagcaatattaaacataaaattattgaaatgcaacaaatgtcaattattgaacta- atgattggttttataaaactatttttc tacatgttttattactcaggatattctgtatcagttgtactgaagtatattggtggtattatattttcattgat- gaggggaccacaaattgaagagcc agttgtagaagttaaagaggaagaaaaatctggacctctgaggataatgcctgctttgccaccacctgaagata- gctctctgcttccatctgat gggtcaagagacatgaaaaaagaagacagtcagcctccatcaaaagtcatagaaggggctattcccatagaaga- aggaggtgagagga gctcagaggaacatgcgggagaccatgtaaaaccagaaaatgaagagcaacctccaacaccaacacttgctgat- atattgggtggagaa gcagcaagaaaagaagcagcacaaagagcagaagtcgctgctgaacaagaagcagttatggctgcttttgaggc- agaatctaaaataga aaaagtttcagagccttctgctgtctctcaaattgattttaacaagtatactcaccgggctgtcagtttccttg- ctcgtaatttctataatcttaaata tgtagcattggttttggctttctgcattaactttattttattgttctacaaggtaacaacattgggtgaagatg- atgatgctgctagcggagaaggg agtgttgaacaactaatggaagaattaacaggcgaaggtgatgatgtgagtggcggaggaagtagtggtggaga- aagtggtgaagagga tccaattgaaatggttcatgtggatgaggatttcttttatatggcacatgttatgcgattggctgcaatcctac- attctcttgtttctttagctatgttg attgcatattatcatttgaaggtccctctagctatattcaagagagaaaaagaaatagctcgtcgacttgagtt- tgatggtttgtacattgctgag caaccagaagatgatgatattaaatcacattgggataaactggttatctgtgcaaaatcatttcctgttaatta- ctgggataaatttgtgaagaaa aaggttcgacagaaatacagtgaaacttatgactttgattcaataagtaatcttttgggaatggaaaaaacatc- tttcagtgcccaagatactga agaaggatcgggacttattcattacattttgaactttgactggaggtatcagctttggaaagcaggagtcacaa- tcacagataatgcatttttgt acagtttattatacttcatcttttcaattttgggaaacttcaataactttttctttgctgcccatttacttgat- gttgcagttggttttaaaacattgagga ctattttgcaatcagtcacacacaatggaaaacagcttgtattgactgtaatgctgctaaccatcatagtatac- atctatactgtcattgctttcaa cttcttccgaaaattttatgtccaagaagaggatgaggaagtggataaaaaatgccacgatatgttaacttgtt- ttgtattccacctttacaaagg agttagagctggtggtggtattggtgatgagattgaacctcctgatggtgatgattatgaagtttacaggataa- tgtttgatattacgtttttcttttt tgttattgtcatcttgctagccatcattcaaggtttgatcattgatgcatttggtgaattgagagatcagttag- aaagtgtaaaagaagacatgga atctaactgcttcatttgtgggataggaaaagattattttgataaagttccccatggttttgacactcatgttc- aacaagaacataacttggctaatt acatgttctttcttatgcatctgattaacaagccagatactgaatacacaggtcaagaaacctatgtctggaac- atgtatcagcaacgttgttgg gatttcttcccagttggtgactgttttcgtaaacagtatgaagatgaactgggaggtggtggtggttaattcat- ttgggtgggtggtggctaaatt tatattattaaaacaaaattaatgctgggaactatcaaacatccttcaattttattaaaatttcagctaaattc- aacaatatatcttatgatattgtattt gtctaatgaaggaatagaactatcgtgttatgaatcagtgaagttttcacttgtttagcataatttatgctaag- tttactattgcaaaatactttcttta tatccgaaaatgttgtaaaataaatgtaaatggtgtggccttaaatataatg SEQ ID 20 nezvi_3755.WL.1 cagtgatctacttctgggtcaacttatgttttgtttatggttttcattaaatttacgagacattaaaaactaag- aatattgattgcttatgaagttatca atgataactaatattgttatttcgatgctgttatgttggatacattgttggtgactggcattagcttatgcgtg- aaaccttcttcgtaaatattcaaatt tagaatcaaatattattgatactatttctttttcatactttacattaatattcttcaaaattaaaaatgccagg- agtagagcatgttactaacaaagtc gttgttcatcctttagttctattaagtgttgttgatcatttcaatagaatgggtaaaattgggaatcagaagag- agtagttggcgtattattaggatg ctggaaggcaaaaggtgttttagacgtatctaatagttttgcagtgccatttgatgaagatgataaagacaaat- cagtttggtttttagaccatga ttatttagaaaatatgtatggcatgtttaagaaagttaatgcaagagaaaaagttgttggctggtatcatacag- gcccaaagttacatcaaaatg atgttgcaattaatgaacttatacgccgttactgccctaactcagttatgttattatcgatgcaaaaccaaagg- atcttggtttacctacagaagc atatagagcagttgaagaagtacatgatgatggttctcctacgacaaaaacatttgagcatgttcccagtgaaa- taggggctgaagaagcag aggaagtgggtgttgaacatctgctgagagatataaaagatacaactgtcggctcactttcgcaaagggttact- aatcaatttcttggtctcaa aggccttaatcaacaaattcaagacatcagggattaccttatgcaggttgttgaaggaaaattgcccatcaacc- atcaaataatatatcagctt caagacatatttaatctccttcctgacatgaaccatgggaactttgttgattcattatacataaaaacaaatga- tcagatgcttgtcgtttatctcgc tgccctcgttagagctattgttgccttgcataatctgatcaataataaactcagtaatcgtgatgccgaaaaaa- aagaaagcaccaaaaaaga agaaaaacctaaagaagaagaaagtgtaaaaaaagaattgaaggctaagtaaatgatgccagttcattctcagg- attgaacagatgttattta ttgtaagatttaattataatcttttatacatatgtgtacattaatagtatatacatcgttttcaacaaatcaga- tttataatttgtaaaaaaaaaagaaa agggaacaaaatgatatttaaatatttaactatttatacattttttttgtgagtacaattaaaccatttagttg- aacttgtgaactacaaaaattaattt gtaataaaaccagtctaatttcttaattttaaaaaaa SEQ ID 21 >ta01222.002 Fragment 1 gatgttggcttctttatacaatggaaacgttcatatttggaatcatgagacccagcagctagtaaagtcttttg- aagtatgcgaccaaccagttc gtgctgcagtatttgttcctcgcaagaactggattgtaacagggtcagatgatatgcagatcagagtttttaat- tacaatactcttgaaagagta aatgcatttgaagctcattcagactatgtcagatgtatagcagttcacccagcccatccttatattctgacatc- atcagatgatatgttaatcaaat tgtggaattggtctaaggcttgggtctgccaacaaatatttgaaggacatacccattatgtaatgcaagttgtt- a SEQ ID 22 >ta01222.002 Fragment 2 aacgtcttgctttagctccaaaagaaatgggaccatgtgaaatatatcctcaaagtatttcacataatccaaat- ggaagatttgtcgttgtttgtg gagatggtgaatacataatttatactgctatggctttaagaaacaaaagttttggatcagcccaagaatttgta- tgggcacaagatagttctgac tatgctataagagaaggaacatctactgtaaaactatttagacagttcaaggagcgcaagacacttaagccaga- gtttggtgctgaaggtata tttggtggacaattgcttggtgtcagatcagtctcaggattatgtttatatgattgggaaactctggaattaat- cagaagaatag SEQ ID 23 >ta01222.002 Fragment 3 tgaattagcaacgagacaagccaggcttgatgttgcgcaagcagctcttcacagagcccaacattatggtggac- ttctgcttctctccacatc agcaggaaatcgggaaatgatggaaaaacaggaaagagttcaggagaaaatggaaaaaataatgttagcttcct- tgcatatttcctgcttgg agaccttgccaaatgtatcaaattatattgacactgatcgcattccagaagctgccttttttgccaggacatat- ttgccgagtgaggttcctcg agttgttgggttatggcgaggtttagcaaaggcaggacagagccttgcagatccttcgcagtatctaatctctt- tccaggttatgcagatgct SEQ ID 24 >ta02948.001 Fragment 1 gatggctgtggtggaacaaccttgttatactctgatcaattttccatctgatttagagcctcctaatgaaatgc- agctaaaatctgatttagaaaat ggagacactaaagcgaaaattgaagctttgaaaaatattattcatttaattgcaaatggagagcgtctacctgg- tttacttatgcatatcatacgt tttgttttgccatcacaagaccataccataaaaaaattactgatatattttgggaaatcgttcctaaaactact- ccagatggcaaacttctccagg aaatgattttggtttgtgatgcctatcggaaggacttacaacatcctaatgaatttgtcagggga SEQ ID 25 >ta02948.001 Fragment 2 gtgataataatgttaaattaattgtattagatcgtcttatatctttaaaagaaattcctactcatgaacgggtt- cttcaagatttagttatggatatatt acgtgtgctagccagtcctgacatggaagtaaagaaaaaagccttaagcctagcactggatctcactacttcac- ggtgtgttgaagaaatgg ttttaatgttaaaaaaagaagttgctaagacacataacttgacagaacatgaagatgctggaaaatatcgtcaa- cttcttgttagaactcttcatt cctgttgcatgaagtttccagatgttgctgcttcagttataccagtattaatggaatttctctcagatacaagt- gaactagcttcgtatgatgt SEQ ID 26 >ta02948.001 Fragment 3 tcaacactgattgtcgcaatgctcttgctaatatgttagttgctcaacagaatgaggagtactcacttattaag- gccaaagaaaaatccgtccat accatccaagttgatgatcctgtatcatttttacaattatcaacgatacgatcatctgattttggttcagaaaa- tgtttttgagcttagtttaaatcaa gctgtcggggggccaaatacagctacaaacacagctgaacttccattttcagccagtaaattgaataaagtaac- tcagctgacagggttttca gatccagtttatgcagaagcatatgttcatgtcaaccagtatgacattgtacttgatgttttgatcgttaatca- aacaggtgatacact SEQ ID 27 >ta00781.001 Fragment 1 attcgaattgcatgtaaattattggaagaagaaagctctggagaatatgcagactctccactttttgattttat- tgaagcatgtttacgccacaaa agtgaaacagttgtttatgaagcagctgctgctcttgtaaacttacgccacactactaccagacaaatcacgcc- tgcagtaagtgttcttcaatt attttgttcttctccaaaaccagcgcttcgttttgctgctgtgagaactcttaataaggtagcaatgacacatc- ccactgctgtaacgtcatgcaa tattgacttagagaaccttataacggattcaaatcggtccatagctaccttgg SEQ ID 28 >ta00781.001 Fragment 2 gttccgatcctatcagtatacggaatctgagacagaatatcaagttagagtcacgaagcatgttttcaaaaatc- atattgttcttcagtttgactg tacaaataccatgagtgaccagctactggagaaagttcgagtgcagttagaagtgagcgaaggttaccagatcg- tagctgaggtcccctgc caaagattagcctgttcggaaacatcacctacttatattgccctgcaatttccagatgcccctaatcttactgt- cacaaactttgctgctactctga ggtttgttgtaaaggattgcgacccaatgaccggtatccctaactcagatgatggttatgaagaagattatatg- cttgaagatgtcgaagtgat gctt SEQ ID 29 >ta00781.001 Fragment 3 gacttacgaagagcaacttcggtgctgcatgggaggaaggcgaatcgtatagtgagctagaggacacttataac- ttgtcaggaataaacag cctcgaagaggcagtgaggagtgttgtcagtttcatggggatgcagcctgctgacaggagcgacagggtacagc- ctgataaatcttcaca
cactgtctacctcggaggcatgttccgtggtggagttgaagtgttagctagagctaaactggccatgggtaatt- ccccaggcgttgccatgc aacttacagtccgctctccaaatccagatatttgtgaactgattatttctgtagtcgggtaaaaaaaatatata- aatatatttgagaagtacacagt ttcctctcagatgttgta SEQ ID 30 >nezvi_22408.WL.1 Fragment 3 acttgtggatcatgttgccagtgtcaggccaaatatttttgtgggtcgagtcgaaggttctgctgtttatcaaa- aatggtattttgaagtgacttta gatcatatggagcaaaccacccatatgacaccgcatctaagaattggctgggctaacacttctggttatgttcc- ctttcctggcggtggtgaaa aatggggcggtaatggagttggtgatgatctctactcttttggttttgatggagctgcattatggacaggtgga- agaaaaactgtagtccttcct catgctatggaaccttacataagaaagggagatgttattggttgtgctttcgatctgactgttccaattattac- atttacttttaatggaacattaatc cgaggatcatttagggattttaatctt SEQ ID 31 >nezvi_22408.WL.1 Fragment 6 atgtgaaggaaaagaaactaaccacaagtataagatggaacctgatacaaaattatttccagctatatttgttg- aagctacaagcaaagaaatt ctacaaattgagcttggtcgtacatcaactacactgcctttatcagcagctgttctccaaaattcagaaagaca- tgtcattcctcagtttccacca agacttaaagttcagtgtctaaaaccacatcagtgggcacgtgttcctaatatttcattgcatgtccacgctct- gaaattatcagatataagaggt tggagtatgctttgtgaagatccagtttcaatgttagcattacatatacctgaaga SEQ ID 32 >nezvi_22408.WL.1 Fragment 7 aggatggcaaagatgaacatcgtaagggtttacttcaaatgaaaatggcagagggggcaaaactacagatgtgc- tatctcttacagcatttat gcgatatacaattgcggcatcgggttgaagccattattaattttagttatgactatattgctgatcttcagcag- gatcagttgagaagatatgttg atattaagcagtctgatcttccatcatcagttgctgcaagaaaaacaagagagtttcgttgccctccaagagaa- cagatgaatgctatcataaa ttttaaaaatttagaagaagatgacaaagaaaactgtccatgtggtgaagaactgagggagagattaaacacat- ttcatgaagaaactatgag taaagtttcacttgttgctctccaagagccacaagaagatgagaacggtgaaacac SEQ ID 33 >nezvi_22408.WL.1 Fragment 9 tctccctttgttgataaagcattaccagttttatagtggtgcagataactatgcaagtcttttagatgctacac- ttcatacagtgtatcgcctatcaa aaaatcgaatgctaactaaaggtcagcgagaggcagtatcagattttttggttgctctcacaagtcaattacag- ccaagcatgttactcaaact tatcgaaagttaaccgttgatgtatcaaagattctgagtataccacagttgattaaggttgatactttacacta- tgagcgttgtgcaaaatatt atggaactactggtggacaagctggtggatctagtgatgaagaaaaaaggctcactatgttactcttcagtaat- att SEQ ID_34 >nezvi 22408.WL.1 Fragment 14 gacttgctcgctttttagaaacagctggtagtgttcttaactactttgaacctttcctgggacgaattgaaata- ttaggtagtagtaaacgaatcg agcgtgtatatttcgagattaaagaatcaaatattgaacagtgggaaaaacctcaaatcaaggaatctaaacga- gcatttttctattcaattgtc actgaaggaggtgacaaagaaaaattggaagatttgttaatttttgtgaagatgccatatttgagatgacacat- gccagtgggcttatggcaa ctgatgatggtacaggctctggaggaggaaaacaaagagcatcctcttattcttatatggaagatgaagatgaa- gaaaggaatccaatcag acgtggttggcaagcaacta SEQ ID_35 >inv2c.pk011.b22.f Fragment 1 aatttagaatcaaatattattgatactatttctttttcatactttacattaatattcttcaaaattaaaaatgc- caggagtagagcatgttactaacaaa gtcgttgttcatcctttagttctattaagtgttgttgatcatttcaatagaatgggtaaaattgggaatcagaa- gagagtagttggcgtattattagg atgctggaaggcaaaaggtgttttagacgtatctaatagttttgcagtgccatttgatgaagatgataaagaca- aatcagtttggtttttagacca tgattatttagaaaatatgtatggcatgtttaagaaagttaatgcaagagaaaaagttgttggctggtatcata- caggcccaaagttacatcaaa atgatgttgcaattaatgaacttatacgccgttactgccctaactcagttcttgttattatcgatgcaaaacca- aaggatcttggtttacctacaga agcatatagagcagttgaagaagtacatgatgatggttctcctacgacaaaaacatttgagcatgttccca SEQ ID 36 >inv2c.pk011.b22f Fragment 2 atgaacttatacgccgttactgccctaactcagttcttgttattatcgatgcaaaaccaaaggatcttggttta- cctacagaagcatatagagca gttgaagaagtacatgatgatggttctcctacgacaaaaacatttgagcatgttcccagtgaaataggggctga- agaagcagaggaagtgg gtgttgaacatctgctgagagatataaaagatacaactgtcggctcactttcgcaaagggttactaatcaattt- cttggtctcaaaggccttaat caacaaattcaagacatcagggattaccttatgcaggttgttgaaggaaaattgcccatcaaccatcaaataat- atatcagcttcaagacatatt taatctccttcctgacatgaaccatgggaactttgttgattcattatacataaaaacaaatgatcagatgcttg- tcgtttatctcgctgccctcgtta gagctattgttgccttgcataatctgatcaataataaactcagtaatcgtgatgcc SEQ ID 37 >inv2c.pk020.119.f Fragment 1 tacttcattgtcataaaggggtaacattgctgaatccagcgtaaaggttacagtgactctcacctggttataac- agttttgctttgtaatcatgggt tctgagagatatagcttttctttgactactttcagtccatctggaaaattagttcaaattgagtatgcacttgc- cgcagtcgcagctggagctcca tcaatcggtatcagagcatccaatggagttgtattggctactgaaaacaaatacaaatcaattttatatgaaga- acatactattcaaaaagtaga aatgataactaaacacattggaatggtctacagtggaatgggacctgattataggctactagtgaagagagcta- gaaaaatggctcaacaat aacagttagtttacggtgagcctattcctactgcacagcttgttcaacgagttgccatggttatgcaggagtac- actcaatctggaggtgttag accttttggagtttctttactcattgccgggtgggatggggataaaccatctctgtttcaatgtgatcca SEQ ID 38 >inv2c.pk020.119.f Fragment 2 acacattggaatggtctacagtggaatgggacctgattataggctactagtgaagagagctagaaaaatggctc- aacaataacagttagttta cggtgagcctattcctactgcacagcttgttcaacgagttgccatggttatgcaggagtacactcaatctggag- gtgttagaccttttggagttt ctttactcattgccgggtgggatggggataaaccatctctgtttcaatgtgatccatctggagcatactttgcc- tggaaagctactgcaatggg aaaaaattttgtcactggcaaaacatttctagaaaagaggtacagtgaaactttagagctggatgatgcagtac- atactgcaattctcactctta aagaaaactttgaaggccaaatgacttcggacaatatcgaggtcggagtttgtgatgatcaagggttcagagtt- ttagatcctacaacagtga aggattatctggctaatattccataaatttattattaaaatttgattttataattaataaaaaggtgattgctt- atggatatgtgtgatgcctaaataaa atattattttttattgg SEQ ID 39 >inv3c.pk002.18.f Fragment 1 atcattgatgatggttgagaaagttccagactctacatatgaaatggttggaggtcttgataagcaaattaagg- aaatcaaagaagtaattgaa cctcctgtaaaacatccagaactgtttgatgcactaggaatagctcagcccaaaggagttttattatatggacc- acctggaacaggtaaaaca cttttggcaagagcagttgcccatcacactgagtgcacgttcattcgtgtgtcaggatctgagttggttcagaa- attcattggggaaggatcca gaatggttagagaattgttcgtcatggcaagggaacatgctccatctatcatatttatggatgaaatcgattca- ataggttcatcacgtatcgaat ctgggagtggtggtgattctgaagtccagagaacaatgttagagttattgaaccaattggatggcttcgaagcc- acaaaaaatattaaggtca taatggccactaataggattgatattttggaccctgctcttctgcgtcctggaaggatagatcgtaagattgag- ttcccc SEQ ID 40 >inv3c.pk002. 18.f Fragment 2 Tccatctatcatatttatggatgaaatcgattcaataggttcatcacgtatcgaatctgggagtggtggtgatt- ctgaagtccagagaacaatgt tagagttattgaaccaattggatggcttcgaagccacaaaaaatattaaggtcataatggccactaataggatt- gatattttggaccctgctctt ctgcgtcctggaaggatagatcgtaagattgagttccccccaccaaatgaggaagctcgtttagatatccttag- aattcattcacgtaaaatga atcttacccggggtatcaacttgcgtaaaattgccgagctcatgcctggagcttcaggtgcagaagtaaagggt- gtctgtactgaagcaggg atgtatgccctgagggagaggagaatccatgtcacccaagaagatttcgaaatggctgtggccaaggttatgca- aaaggactccgagaag aatatgtcaatcaagaaattatggaaataaacgactcacttatttttttttttttttactctgtttaaaaagct- ttaaatatatagatgtttgtgaggtttt gttaaaaataaa
REFERENCES
[0162] Marcel H. Schulz, Daniel R. Zerbino, Martin Vingron, and Ewan Birney. Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels Bioinformatics (2012) 28(8): 1086-1092 first published online Feb. 24, 2012 doi:10.1093/bioinformatics/bts094.
[0163] The article "a" and "an" are used herein to refer to one or more than one (i.e., to at least one) of the grammatical object of the article. By way of example, "an element" means one or more element.
[0164] All publications and patent applications mentioned in the specification are indicative of the level of those skilled in the art to which this invention pertains. All publications and patent applications are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
[0165] Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, certain changes and modifications may be practiced within the scope of the appended claims.
Sequence CWU
1
1
40120DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 1taatacgact cactataggg
20244DNAArtificial SequenceDescription of artificial
sequence note = synthetic construct 2taatacgact cactataggg
atgcccggga attcggccat tacg 44346DNAArtificial
SequenceDescription of artificial sequence note = synthetic
construct 3taatacgact cactataggg cgcgccaaac gaatggtcta gaaagc
46444DNAArtificial SequenceDescription of artificial sequence note
= synthetic construct 4taatacgact cactataggg ctagtaacgg ccgccagtgt
gctg 44544DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 5taatacgact
cactataggg ggccgccagt gtgatggata tctg
4462969DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 6cagagatggc agagatggga agttgatagc cagatctgac
agagtcaagt gtgttgactt 60acatccatca gaaccatgga tgttggcttc tttatacaat
ggaaacgttc atatttggaa 120tcatgagacc cagcagctag taaagtcttt tgaagtatgc
gaccaaccag ttcgtgctgc 180agtatttgtt cctcgcaaga actggattgt aacagggtca
gatgatatgc agatcagagt 240ttttaattac aatactcttg aaagagtaaa tgcatttgaa
gctcattcag actatgtcag 300atgtatagca gttcacccag cccatcctta tattctgaca
tcatcagatg atatgttaat 360caaattgtgg aattggtcta aggcttgggt ctgccaacaa
atatttgaag gacataccca 420ttatgtaatg caagttgtta taaatccaaa agataataat
acatttgcat ctgcttcatt 480agatcggact gttaaagttt ggcagttagg ctctgctgct
ccaaatttta ctttagaagg 540tcatgaaaaa ggagttaatt ctgtcgatta ttatcatggt
ggtgacaaac cttatctcat 600atctggcgcc gacgatcatc ttgtcaaaat atgggattat
caaaataaga cttgtgttca 660aaccttggag ggccatgccc agaatattac tgcagtttgt
tttcacactg aactacctat 720tataataact gggtcagaag atggaactgt tcgattatgg
cactcagcaa cttacagatt 780ggaatcatct cttaactatg gcctagaacg tgtttggact
attgcgaggc tgaaaggatc 840aaacaatata gctcttggat atgatgaagg gagtatcatg
gtgaagatag gacgtgaaga 900accagcaatt tcaatggatg tgaatggtga aaaaatagtt
tgggccagac attctgaaat 960tgaacaggta aacttgaagc aagtttcagg agaagtaaga
gatggcgaac gtcttgcttt 1020agctccaaaa gaaatgggac catgtgaaat atatcctcaa
agtatttcac ataatccaaa 1080tggaagattt gtcgttgttt gtggagatgg tgaatacata
atttatactg ctatggcttt 1140aagaaacaaa agttttggat cagcccaaga atttgtatgg
gcacaagata gttctgacta 1200tgctataaga gaaggaacat ctactgtaaa actatttaga
cagttcaagg agcgcaagac 1260acttaagcca gagtttggtg ctgaaggtat atttggtgga
caattgcttg gtgtcagatc 1320agtctcagga ttatgtttat atgattggga aactctggaa
ttaatcagaa gaatagaaat 1380tcaggcaaaa tctctccatt ggtctgattc tggacatctt
cttgctattg taacggatga 1440ttcctattat atattgaagt atgattcatc cgcaatcgcc
agtgctcaag agagaactcc 1500tgatggtgtt gaagctgcat tttctcttgt cggagaagta
aatgacacag taaagacagg 1560tttgtgggtt ggcgattgtt ttatttacac caatgctgtt
gggcgaataa attattacgt 1620tggaggagaa atagtgactg ttgctcactt ggattgcact
atgtacctgt tgggatatgt 1680ggctaggcaa aatcttttat acctttctga taaacatcac
aatattgttt gttatacatt 1740attactttct gttcttgaat atcaaacagc tgttatgaga
ggagattttg aaacagctga 1800ccgtgtgttg ccaacaattc cagttcagca tcgttcccgg
tagcccactt cttggaaaaa 1860cagggcttta aaaaagaagc tctggctgta tctactgatc
cagaacataa atttgaatta 1920gctcttggac taaaagagct cgatacggtt gttcagttag
ctgaggaaat aggtagcaca 1980gccaagtggg gtcaagccgc tgaattagca acgagacaag
ccaggcttga tgttgcgcaa 2040gcagctcttc acagagccca acattatggt ggacttctgc
ttctctccac atcagcagga 2100aatcgggaaa tgatggaaaa acaggaaaga gttcaggaga
aaatggaaaa aataatgtta 2160gcttccttgc atatttcctg cttggagacc ttgccaaatg
tcttcaaatt cttattgaca 2220ctgatcgcat tccagaagct gccttttttg ccaggacata
tttgccgagt gaggttcctc 2280gagttgttgg gttatggcga ggtttagcaa aggcaggaca
gagccttgca gatccttcgc 2340agtatctaat ctctttccag gttatgcaga tgctttaaaa
actgaacagt atttagcaaa 2400gaatcctgtg tgactaaacc agcttcttat cataaaaata
ttaaggttat atttttatta 2460gtattattca tatatattat gtatattata tagcatgtaa
ttgggtactt gagcagaaaa 2520aaatacatgt caatttgaga catagtagaa tataagtgac
aaagagcata tataacattg 2580agatagcatt ttttaaatta caaaaaaaag agctcatatt
tgactaaaaa cttgaaataa 2640cagtgtgcct gggggctacc aaagtggggc tggggtgtcc
tgagaacaca cctgaaaaat 2700attttagtat gaatgaaatt tctaggtaat gaaaaaatat
atcaatatac attttatttg 2760taaaaaaaaa gtggaaaaac ataaaatgta attttcatct
aaaagaattc ttagtgtaca 2820tttataaaaa tgggccatta ttaaattatt tcataaagcc
atgaaaattc tatgtagaga 2880ttttttttta aactttcgag atacagaggt ttgactattc
ttcaggtcta accaatattt 2940ttgtttgcta gaaacaacct aatcgtaat
296974838DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 7aatattaaat
tatgagagtt ttttttgttg atatgaaata acaagtgctt ggctgtttta 60tttcccaaag
aagtattgag tgaataaata tcaagatatt gaattataat ttcctattta 120aggatggctg
tggtggaaca accttgttat actctgatca attttccatc tgatttagag 180cctcctaatg
aaatgcagct aaaatctgat ttagaaaatg gagacactaa agcgaaaatt 240gaagctttga
aaaatattat tcatttaatt gcaaatggag agcgtctacc tggtttactt 300atgcatatca
tacgttttgt tttgccatca caagaccata ccataaaaaa attactgctt 360atattttggg
aaatcgttcc taaaactact ccagatggca aacttctcca ggaaatgatt 420ttggtttgtg
atgcctatcg gaaggactta caacatccta atgaatttgt caggggatct 480acattacgtt
ttctatgtaa acttaaagaa cctgaattgc ttgagccttt aatgcctgct 540ataagagctt
gtttagagca tcgggtttca tatgtacgaa gaaatgcagt acttgcaata 600tttaccattt
ataggaattt tgaattctta gctcctgatg caccagaact tattgctaat 660ttcttagatg
gggagcaaga catgtcatgt aaaagaaatg ctttcttaat gctcctacat 720gctgaccaag
aacgtgcctt atcctactta gcttcatgtc ttgatcaagt gacttccttt 780ggcgatatac
ttcaattagt tattgttgaa ttaatttata aggtttgcca tgctaaccct 840tctgaacgtt
ctcgatttat acgttgcatt tataatttac tcaattctaa cagtcctgct 900gtgcgatatg
aagctgctgg aactttaatc acactttcga atgctcctac tgcaataaaa 960gctgctgctt
cttgttacat tgatttgata ataaaggaaa gtgataataa tgttaaatta 1020attgtattag
atcgtcttat atctttaaaa gaaattccta ctcatgaacg ggttcttcaa 1080gatttagtta
tggatatatt acgtgtgcta gccagtcctg acatggaagt aaagaaaaaa 1140gccttaagcc
tagcactgga tctcactact tcacggtgtg ttgaagaaat ggttttaatg 1200ttaaaaaaag
aagttgctaa gacacataac ttgacagaac atgaagatgc tggaaaatat 1260cgtcaacttc
ttgttagaac tcttcattcc tgttgcatga agtttccaga tgttgctgct 1320tcagttatac
cagtattaat ggaatttctc tcagatacaa gtgaactagc ttcgtatgat 1380gttcttatat
ttgtccgaga agcaattcat aagtttgatt ctttaagggt tttgatcata 1440gagaaattat
tagaagcgtt tccaaccata aaatctatga aagttcatcg agctgctctt 1500tggatattgg
gtgaatatac tacttcagtt acagatatta aagaagtcat gaaacaaata 1560aaacatgccc
ttggagagat accacttgtc gatgatgaaa taaaaagagc ttctggagag 1620aaagttgagg
aagttgatca tcgagatcaa gtaaaactgg ttacatctga tggaacatat 1680gctacacaat
caatatttaa caccattctg gcaattaaaa aagaggatcg acctcctctc 1740agacaatact
tgattgatgg agactttttt attggtgtat ctgtggcttc tacgcttgtg 1800aaattagcat
tacgttataa agagcttgtt cagcaggaaa atatgtacca taaatttttt 1860gctgagtgta
tgctaatcat ttcatctata gttcgtctgg gtaaatctgg atatccttcg 1920aaacagctga
gctatgatga ttatgaacga atgttacttt gtctaaaggt tctctctgaa 1980aataatgcac
ctattgtaaa aattttcaac actgattgtc gcaatgctct tgctaatatg 2040ttagttgctc
aacagaatga ggagtactca cttattaagg ccaaagaaaa atccgtccat 2100accatccaag
ttgatgatcc tgtatcattt ttacaattat caacgatacg atcatctgat 2160tttggttcag
aaaatgtttt tgagcttagt ttaaatcaag ctgtcggggg gccaaataca 2220gctacaaaca
cagctgaact tccattttca gccagtaaat tgaataaagt aactcagctg 2280acagggtttt
cagatccagt ttatgcagaa gcatatgttc atgtcaacca gtatgacatt 2340gtacttgatg
ttttgatcgt taatcaaaca ggtgatacac ttcaaaactg tacgcttgaa 2400ctagcaactc
ttggtgatct gaagcttgtc gaaaaacctc aaccttgtgt tctagctcct 2460tatgatttct
gcaacattaa agcaaatgtg aaggttgcat ctactgaaaa tggaatcata 2520tttggcaata
ttgtttatga tattagtgga gctgcttctg acagaaacgt tgttgttctt 2580aatgatatcc
atatagatat catggattat attgttcctg ctatctgttc agacacagaa 2640tttcgtcaga
tgtgggctga atttgagtgg gaaaacaagg tgtcagttaa cacttatttg 2700gtcgatcttc
atgaatatct tggccattta ttgaagagca ctaatatgaa atgtttaaca 2760ccagaaaaag
ctctatgtgg gcaatgtgga tttatggctg ctaatatgta tgcccgttca 2820atttttggtg
aagatgcact tgctaactta agcatcgaaa aaccgtttaa taagcctaat 2880gcacctgtta
ctgggcatat tagaatcaga gctaaaagcc agggtatggc attaagctta 2940ggagataaaa
ttaatatgac ccagaagaaa cctactatca tggctcagtg aaacataata 3000gtttcattgt
taaaatgcat ttcaagattg tttagacttt ttatatctat tggtttataa 3060gtatttggaa
ttatgggatt cacaactctg aatttgttaa agtattttaa atcaagttat 3120caaaaattat
ttttacttca atctaatagt tgtacattat tattagatgt gagtacctac 3180aaatatatag
attttttgta cctttctatg acttattaaa atatttcatt tgatgtacat 3240tatatattgt
tgacctaaat taaaaaagct tatgtatctt attcttaaat ttgtttttat 3300tattcttaga
ataatgctat tatattttgt gatatctcaa tttgaaaata gtatatatgt 3360gtgtgtgtta
atatacatgt gtatattatt aatattctaa taataaatat ttatatttaa 3420aagtcagtaa
aattatatgt atgtttgtat aactatactg ggtgtcctgt aaatagagag 3480acatttttgt
aggagttata gcagatctca agaactacaa aaaaattcat ataaacaatg 3540ggccttaatt
ttcagctgaa aagtaggaaa ttcaatttct ttttttttag caacctacac 3600aaaatttacc
tcaaattaaa atacagtttg tcatcgttaa aaccatttaa agaggacttt 3660aaaatggcta
aattaagctt aaaaaaatca taaataacta gtgatttttt tttgcaatta 3720ttcatcatct
aaagtggtac tttgttttta ataagatcta tattattgga attaattcat 3780acttttaagt
gatgcaataa ctgtttctag tcgtaacatt tacatttaaa ataaagaaac 3840ctgctgattg
cttcaattat tttcatgaaa acgtaaatat taggatcagg aacatttatt 3900ttatcactta
taaacacgaa tctattccta taatataata ttcctataat atagatctta 3960gtaaaaaacg
aatttttagt accactttag atgatgaacc atcgcaaaaa cattatttta 4020aacttttttt
taaaattttt atttatgcat ttatatttat tataatacat cccagatcat 4080ggaaaaacat
gaaattaaag ttttgcaaca gtttattgta gtctttttgc tgaaaatacg 4140ggtatttttt
ggatgactgg ttaaaaaaac acaaattgtt tttacataac ttgatgtgga 4200tctcaaatgt
ttgcctaata aacctcaact taattaatat tgacttaaag gttattaaac 4260ctcggaagtt
tttctaaaag tgaggtatct gtagtacagt tagaggatga taaaaacaaa 4320tcgtgaataa
cttatgatta tttttaaaaa acagaactta ggttagttta gccatttaaa 4380gatcctcttt
aaatggtttt agcaatgaca aattgtattt tgatttgagg taaattttgt 4440gtagtttgct
gaggagcaat tgttaattat ataaataaaa tattttttta gttgtcgctt 4500tattttacta
aaaaagaggc ttcattaaaa aatatataac ttggatttcc tacttttctg 4560ctgaaaatta
gcccattgtt tgtataacat ttttgtagct cttgaggtct gctatctctc 4620ctataaaaat
ttccctctac ttacaggaca tcctgtatat gtgttgaagt ttgcatgaat 4680gtttactttg
ttttttgttt ttttaatttt tcaggtcaag ttaatacata tattattaat 4740tttatataaa
tatatatatg aattattttt ggaccattat taaaaatatg ttgtaactaa 4800aataaataaa
aataatttat taaaagtcca aaaaaaaa
483883171DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 8ctgttgacgt tgacgtggga tgtgtagtta atgtttaata
attatttgtg taatttttaa 60tttgtaatat attaataaca tatttataac caataaaaat
ggcaataaaa cgagataaga 120aagaagaaga agatggtgga aacccctttc agagtcttga
taagaccagt gttcttcagg 180atgccagaac ttttaatgaa acaccagttg aacctcgcaa
atgcacccca atattgacca 240aaattctgta tcttttaaac caaggagaac agcttggtcc
tgctgaagca acagaaacat 300tttttgctgt tacgaagctt tttcaatcaa ataatacttt
gcttcgacga atggtatatc 360ttggcataaa agagttatct ctaattgctc aagatgttat
catcgttact tctagcctta 420caaaagacat gactgggaaa gaagatttat atcgagcagc
tgcaattcga gcattatgca 480gtataacaga tgctactatg ctgcagacga ttgaaagata
tatgaaacaa gcaatcgttg 540atagaaaccc agctgttgct agtgctgctc ttgttagttc
actgcatatg agtaggatcg 600ctagcgatgt cgtcaagaga tgggttaatg aagcacaaga
agctgttaat tctgacagta 660taatggtcca atatcatgct ctgggcctcc ttttccatat
taggaaaaat gacagattgg 720ctgtaacaaa attagttgct aaattaacta gaatgtcgtt
gaaatctcca tttcgcagtt 780tgtatgttga ttcgaattgc atgtaaatta ttggaagaag
aaagctctgg agaatatgca 840gactctccac tttttgattt tattgaagca tgtttacgcc
acaaaagtga aacagttgtt 900tatgaagcag ctgctgctct tgtaaactta cgccacacta
ctaccagaca aatcacgcct 960gcagtaagtg ttcttcaatt attttgttct tctccaaaac
cagcgcttcg ttttgctgct 1020gtgagaactc ttaataaggt agcaatgaca catcccactg
ctgtaacgtc atgcaatatt 1080gacttagaga accttataac ggattcaaat cggtccatag
ctaccttggc cataactact 1140cttctaaaaa ctggagctga atcagctgtg gacagactta
tgaagcagat agcatctttc 1200gtttcagaaa taagtgatga attcaagatt gttgtagtgc
aggccattag agcactatgc 1260ttgaaattcc ctcgaaaaca tggaacactc atgacgtttt
tgtctgcgat gctgagggat 1320gagggaggat tggagtataa ggcttcaatc gccgatacac
ttatatctct tatcgaaggg 1380aaccctgaag cgaaagagtc tggcctcgct catttgtgtg
aattcatcga ggattgtgag 1440catacttccc tggctgtcag gatactgcat ctgcttggta
aagaaggacc aaaaacaaaa 1500cagccttcta ggtacataag atttatctat aatagagtca
ttctggaaaa tgcagtagta 1560cgagcagctg ctgtttctgc attgtctcaa tttggagctc
agtgccctga tcttcttgag 1620aacatactag tcctcctcgc ccggtgccaa atggatacag
acgatgaagt tagggacagg 1680gccacatatt atttcagtat tttacaaaat caagatcgac
atttgattaa taattacata 1740gttgaaccac ctcaggtgtg tgtttccagt ttagaaaaag
ccttaatgct gcatttgatg 1800gaaactccag aagaagtatt tgacttgagt tctgttccgt
tggcaccccc tcctctatcc 1860gacgaagttc aggctgctcc aactgttgta caggaaccat
tagcggcttt gggacgtcct 1920gcggtctcca aagaagagag tgcttctgat agacttcgag
ctattccaga actttcttgg 1980attcagggtc cactcttcaa aagttccgat cctatcagtc
ttacggaatc tgagacagaa 2040tatcaagtta gagtcacgaa gcatgttttc aaaaatcata
ttgttcttca gtttgactgt 2100acaaatacca tgagtgacca gctactggag aaagttcgag
tgcagttaga agtgagcgaa 2160ggttaccaga tcgtagctga ggtcccctgc caaagattag
cctgttcgga aacatcacct 2220acttatattg ccctgcaatt tccagatgcc cctaatctta
ctgtcacaaa ctttgctgct 2280actctgaggt ttgttgtaaa ggattgcgac ccaatgaccg
gtatccctaa ctcagatgat 2340ggttatgaag aagattatat gcttgaagat gtcgaagtga
tgcttgctga ccaaatgcag 2400cgacttacga agagcaactt cggtgctgca tgggaggaag
gcgaatcgta tagtgagcta 2460gaggacactt ataacttgtc aggaataaac agcctcgaag
aggcagtgag gagtgttgtc 2520agtttcatgg ggatgcagcc tgctgacagg agcgacaggg
tacagcctga taaatcttca 2580cacactgtct acctcggagg catgttccgt ggtggagttg
aagtgttagc tagagctaaa 2640ctggccatgg gtaattcccc aggcgttgcc atgcaactta
cagtccgctc tccaaatcca 2700gatatttgtg aactgattat ttctgtagtc gggtaaaaaa
aatatataaa tatatttgag 2760aagtacacag tttcctctca gatgttgtac agaatcaaac
attgaacata aagtatatat 2820catatgaact gtattagttg actagctgct tgggaaaatt
ttggttacgc aataatcaat 2880cttttatatg tatcagattt taattaaagt atttaaaata
caagtgttgc tgtataaaat 2940gatgttttga aacattttta aagtatttaa gttatatgtt
ttaatttaag caacccagtt 3000attttttatg ttatgcttat gggaatttta ttttatataa
aatacatttt ttttctttcg 3060agataggtgt aaatttaaac ttgaattttt tccaaaggca
tttgtctaat ttattaaata 3120atatatgatt tattatatat attttttatt aatccaataa
atacttataa g 3171915780DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 9caacttccta
acgacgaggt agttcttgga tatgtaatac gggagagcca tccacttctt 60tcactggtct
gctaagtaga gaggatggcc gacagcgaag gaggatccga gcaggacgat 120gtttcgttcc
tgaggacgga ggatatggtg tgcctatcat gcacagcaac tggagagaga 180gtttgcttag
cagctgaggg ctttggtaac cgtcactgtt ttctagaaaa tattgctgat 240aagaatatac
caccagatct ttcaacatgt gtatttgtta ttgaacaagc tctatcagta 300agagcacttc
aggagttagt tacagcagct ggatctgaag agggaaaggg aactggatct 360ggtcacagga
ctcttcttta tggaaatgct atactactcc ggcaccaaaa cagtgacatg 420tatctggctt
gtttatctac cagttcatca aatgacaagc tctcatttga tgttggttta 480caagaacatt
cccaagggga agcttgttgg tggaccgtac accctgcttc taaacagaga 540tcagaaggtg
aaaaagtgag agttggtgat gatttaattc ttgtgtctgt agccactgaa 600agatatttgc
atactgctaa agaaaacgat caatctattg taaatgcatc tttccatgta 660actcattggt
ctgttcagcc ttatggaact ggtatcagca aaatgaagta tgttggttat 720gtgttcggag
gagatgtgtt aagatttttc catggtgggg atgaatgcct taccattcca 780tcaacttgga
gtgaaacccc tggacaaaat gtggtagttt atgaaggagg gagtgttttg 840agtcaagctc
gttcactttg gagattggaa ctggctagga caaaatggtc tggtggtttc 900attaattggt
atcatccaat gaggatacga catctcacca ctggtagata cttaggagtt 960aatgaaaata
atgaattaca cctcgttgtt agggaggaag ccacaacagc attatctaca 1020ttcattttaa
gacaagaaaa agatgaccaa aaagtagtaa tggaagataa ggatttagaa 1080gtaataggag
ctccaataat aaaatatggt gacagtactg ttttagtcca acattcagaa 1140agtggtttat
ggttaactta taagtcattc gaaactaaga aaaaaggtgt gggtaaagta 1200gaagaaaaac
aagctgtact tcatgaggag ggaaaaatgg atgatggatt agactttagt 1260agaagtcaag
aagaagaatc aaggactgct agagtaataa ggaaatgttc gtcacttttc 1320actcaattta
ttaggggtct agaaactctg caaatgaatc gaagacattc tctgttttgc 1380gctagtgtaa
atttaaatga aatggtcatg tgtttagaag atttaattaa ttactttgcc 1440cagcctgagg
aagatatgga acatgaggaa aaacaaaacc ggttaagagc tttgagaaac 1500agacaagatt
tgttccaaga agaaggaatt ttaaatctta tcttagaagc cattgataaa 1560attaatgtta
taacatccca aggtttctta gtcagtttag ctggagatga gtctggacag 1620agctgggata
taatctcagg atatttgtat caactgctag ctgccatcat aaaaggaaat 1680catactaatt
gtgctcagtt tgctaacaca aatagattaa actggttatt tagcagacta 1740ggttctcaag
cttcaagtga gggcacaggt atgttggatg tacttcattg cgtcttaatt 1800gattctccag
aagctttgaa tatgatgaga gatgaacata taaaagtaat catttcactg 1860ctagaaaaac
atgggcgaga tccaagagtt ttagatgtac tttgttcact ttgtgttggt 1920aatggtgtag
cagtccgtag ctcacaaaac aacatctgtg atttccttct gccaggaaaa 1980aacttgcttc
tacaaacgca acttgtggat catgttgcca gtgtcaggcc aaatattttt 2040gtgggtcgag
tcgaaggttc tgctgtttat caaaaatggt attttgaagt gactttagat 2100catatggagc
aaaccaccca tatgacaccg catctaagaa ttggctgggc taacacttct 2160ggttatgttc
cctttcctgg cggtggtgaa aaatggggcg gtaatggagt tggtgatgat 2220ctctactctt
ttggttttga tggagctgca ttatggacag gtggaagaaa aactgtagtc 2280cttcctcatg
ctatggaacc ttacataaga aagggagatg ttattggttg tgctttcgat 2340ctgactgttc
caattattac atttactttt aatggaacat taatccgagg atcatttagg 2400gattttaatc
ttcaaggaat gttctttcca gttataagct gttcctcaaa acttagttgt 2460cgttttttac
tgggaggtga tcatggaaga ttaaaatatg cacctcctga agaattttct 2520cctctcgttg
aaagtttgct tcctcaacaa gtgctttcta ttgatccatg tttttatttt 2580ggcaacctga
ataaatgtgt attggctggt ccttatcctg ttgaagatga ttgtgctttt 2640gttccagttc
cagttgacac atctatggta aatttacccg ttcatgttga tacaatacgc 2700gatcgtttag
ctgaaaacat ccatgaaatg tgggctatga ataaaattga agcaggatgg 2760atttatggag
atgtaagaga tgatataaga agaatacatc catgtcttgt gcaatttgaa 2820aaactacctc
ctgcagaaaa gcgatatgac actcaacttg ctgtacaaac tttaaaaacc 2880atcattgcac
tgggctacca tataacaatg gaaaaaccac catctagaat aaagaacatt 2940cgtttgccga
atgaaccatt tttacaatct aatggttaca agccagctcc tcttgatctc 3000agtgccataa
cactaatacc taaaatggag gaacttgttg accaactcgc tgaaaatact 3060cacaacttgt
gggcaaaaga aagaatccaa caaggctgga cctatggtct taatgaggat 3120cctgatttgt
cccgaagtcc tcacctcgtc ccttacagta aagttgatga tttaattaaa 3180aaagccaaca
gggataccgc aagtgaaact gtcaggactc ttcttgttta tggttataat 3240ttagaccctc
ctacaggtga acaaactgaa gctctcttag cagaagcaag ccgtttgaag 3300cagatgcagt
ttagaaccta tcgggctgaa aagacatatg cagtaaccag tggcaaatgg 3360tattttgaat
ttgaaattct tactgctggg ccaatgagag taggttgggc cattgctgat 3420tataatccag
gttcccagat cggaagtgat gaagcatcct gggcatatga tggttataat 3480gaggaaaagg
tttattctgg ggttgctgaa acgtttggaa gacaatggca agttggagac 3540gttgtaggag
tttttcttga tctattggat catactatta gtttctctct aaatggtgaa 3600ctgcttatgg
atgcacttgg gggagaaaca tcttttgcag atgttcaggg agaaggattt 3660gttccagcat
ttacacttgg agtaggacaa aaagcaaaat tagtgtttgg gcaagatgtt 3720aactcactta
agttctttac tacctgtggt ttgcaagaag gttatgaacc tttctgtgta 3780aacatgaaca
gggcagttac cttttggtac accaaagatc atcctatatt tgaaaatact 3840gatgattata
ttgatactaa aattgatgca acgcgtattc ctgctggttc tgacacacca 3900ccatgtctta
aaattagtca taatactttt gagacaatgg agaaagccaa ttgggaattt 3960cttagacttt
ctttacctgt tcaatgttta ccatcattca taaatgaaca agaaaaagta 4020cgtaggtggc
aagaaataag gataagacaa cacagacttc ttgtggaagc tgaccaaacc 4080actcctgctc
acattgaaca gattatgaag tctggtttta gtatgagtga tattaagggt 4140cttcaaagaa
gttatacaga agatggaatg gaaggagaag aaggattggc accaagctca 4200tcaccactta
caaggactaa gtcaaaagtg actccagctc gtccacctag gaaaggctcc 4260ttaccacgaa
atggagatgt tattaatatg aacgggacat tagaaccagg tggaggaaaa 4320atgaaccgtt
ctaatagtga gcttgatttc caacgtttca atggtgaaat gcccgatggc 4380gataacaaga
aaaagcgtgg gagatctcca tttaggttct tttcaagaaa aaagggggag 4440cgtgatacta
gtggagaaaa tgcaaaaaat gtacatatgt ctgagcctat gggtaatttc 4500cttgagcctc
caaggactcc aatgcagcaa agaggtggaa gtgctctgcg ttcttctcct 4560caacctaaag
tacaggagtt aactaagcca ccatccccat tagttgaaag aagtggaccc 4620aaagcaatgt
ctgtgcctgt tggaactggc atcgaaacta ttggaaatga aatatttgat 4680gtagagtgtt
tgaaattgat taatgaatac ttctacggtg tcaggatatt tccaggtcaa 4740gacccaactc
atgtatatgt cggttgggtt acaactcaat tccatctacg tagtaaagac 4800tttaatcaga
atcgagtgct aaagagcact gtagtagtat gtgatgaatt caatcgtgta 4860atagacagta
ttcagcggca gagttgtttt atggtaagag ctgatgaatt atacaatcaa 4920gtaactcagg
atgcctctgg taaaggtgct tcacaaggaa tgtttattgg atgtttcctg 4980gatactgcta
ctggttatgt gacgttcaca tgtgaaggaa aagaaactaa ccacaagtat 5040aagatggaac
ctgatacaaa attatttcca gctatatttg ttgaagctac aagcaaagaa 5100attctacaaa
ttgagcttgg tcgtacatca actacactgc ctttatcagc agctgttctc 5160caaaattcag
aaagacatgt cattcctcag tttccaccaa gacttaaagt tcagtgtcta 5220aaaccacatc
agtgggcacg tgttcctaat atttcattgc atgtccacgc tctgaaatta 5280tcagatataa
gaggttggag tatgctttgt gaagatccag tttcaatgtt agcattacat 5340atacctgaag
aagatagatg tattgatatt ttagaactta ttgaaatgga caaactactt 5400tcattccatg
ctcatacatt gacactttat gcagcactat gttaccaatc caattatcgt 5460gcaggacatg
ttctctgcaa acatgtagac caaaagcaac ttcagtatgc tattaggtct 5520gaattcatat
ctggatcttt acgcttggga ttttatgacc tcttgattgc tttacacatt 5580gaatcacatg
caacaacaat ggaagtttgt aaaaatgaat tcataatacc ccttggtcta 5640gacttgaaag
atttatatga agatccagat atgaagcaca gcttacgatc tttaaaaact 5700gtctctattt
tacctcaaat gagtatgaca gacattacgg aaaatattga aagcatcaat 5760acattatata
gtccttattt tcctcttgat gcagttaagg attatggaat gactgcatta 5820gaagaggctg
taagcatgaa tcaacttcac aatagagacc ctgtaggtgg ttcaaatgaa 5880aacttgtttc
tacccttgtt gaaactggta gatagattat tgcttgttgg gatactacga 5940gatgaagatg
ttacaaagct actaattatg tttgatcctg aaacttggga ttcaaatttt 6000gaaaaggatg
gcaaagatga acatcgtaag ggtttacttc aaatgaaaat ggcagagggg 6060gcaaaactac
agatgtgcta tctcttacag catttatgcg atatacaatt gcggcatcgg 6120gttgaagcca
ttattaattt tagttatgac tatattgctg atcttcagca ggatcagttg 6180agaagatatg
ttgatattaa gcagtctgat cttccatcat cagttgctgc aagaaaaaca 6240agagagtttc
gttgccctcc aagagaacag atgaatgcta tcataaattt taaaaattta 6300gaagaagatg
acaaagaaaa ctgtccatgt ggtgaagaac tgagggagag attaaacaca 6360tttcatgaag
aaactatgag taaagtttca cttgttgctc tccaagagcc acaagaagat 6420gagaacggtg
aaacaccaga aaagccgggt gttttcaaaa aattatacaa ttttattaat 6480gctgttaaag
aattggaaga acctcctaaa atagaagaag aacctgttaa gaaaactcct 6540gaagaaatat
ttagaaaagt attaattagt acaattgtta gatgggctga agaatcccag 6600attgaaacac
caaaattagt cagagaaatg ttcagtctat tggtaaggca gtacgacact 6660gtaggtgaat
taatcagatc tcttggaaac acttatgtga taaatgacaa aacgaaagaa 6720gatgtagctc
agatgtgggt agggttgagc cagatcagag ctctcctacc tgttcaaatg 6780tctcaagatg
aagaaggtct tatgcgaatg aggctatgga aattagttaa caatcacaca 6840ttctttcaac
atcctgattt gattagagtt cttcgtgttc atgaaaatgt tatggctgtt 6900atgatcaata
ccttgggtag aagatcacaa gcacaatctg atgcttctca agctggtcaa 6960gaaggtgaac
ctgcagctaa ggagaaagat acgtcccatg aaatggtggt agcatgttgt 7020cgtttcctgt
gttatttttg cagaacttca cgtcaaaatc agaaagcaat gtttgaccat 7080ttaacatttt
tattagaaaa cagtaatatt ttactttcaa gaccttcact tagaggaagt 7140acccctcttg
atgttgccta ttcctctctc atggaaaata ccgaactggc attagctctt 7200agagaacatt
atttagagaa gatagctgtt tacttgtctc gctgtggatt acaatctaat 7260tcagaattgg
tagaaaaggg ttaccctgat ttgggttggg atccagttga gggagaaaga 7320tatttagact
ttttacgctt ctgtgtttgg gttaacggtg aaagtgttga agaaaatgca 7380aatctggtta
tacggctcct tatacgtcga ccagaatgtt tgggtcctgc acttcgtgga 7440gaaggtgaag
gattactgag agcaattata gatgctaata agatgtctga aagaatttca 7500gatcgcagaa
aaatgatgga ggaacctgaa aattctgccc atcatcagtt tgaacatcca 7560cttcctgagt
ctgatgaaga tgaggactat attgatacag gagcagcaat actggcattc 7620tattgtactc
tggtcgatct tttaggtcgc tgtgctccag atgctagtgt gattgctcag 7680ggaaagaatg
agtctcttag agctagagct attttgagat ctttagtacc tcttgaagat 7740ttatttggtg
tcttgagttt aaagtttaca cttaccaatc cagctattgg agaagaaagg 7800ccaaaaagtg
atataccatc tggtctaata ccatctcata agcaaagtat tgttttattt 7860ttagagagag
tatatggtat tgaacagcaa gatctcttct tcagattact cgaggaagca 7920tttttacctg
atttaagagc agcaactatg ctagatagaa ctgatggttc tgaatcagaa 7980atggcattag
ctatgaatcg ctatattgga aattctattc tccctttgtt gataaagcat 8040taccagtttt
atagtggtgc agataactat gcaagtcttt tagatgctac acttcataca 8100gtgtatcgcc
tatcaaaaaa tcgaatgcta actaaaggtc agcgagaggc agtatcagat 8160tttttggttg
ctctcacaag tcaattacag ccaagcatgt tactcaaact tcttcgaaag 8220ttaaccgttg
atgtatcaaa gctttctgag tataccacag ttgctttaag gttgcttact 8280ttacactatg
agcgttgtgc aaaatattat ggaactactg gtggacaagc tggtggatct 8340agtgatgaag
aaaaaaggct cactatgtta ctcttcagta atatttttga ttctttatca 8400aaaatggatt
atgatcctga attatttgga aaagcgcttc cctgcttgag tgctatagga 8460tgtgcacttc
cacccgatta ttcactgtcc aagaattatg atgaagaatg gtatagttca 8520aagggttcag
aaccgactga tgggccttat aatccactgc ccatcaatac ttctatggtt 8580tctctaaata
atgatttaaa cacaattgtt caaaaatttt ctgaacatta tcatgatgca 8640tgggctagtc
gaaaaatgga aaatggttgg gtatatggtg agcagtggtc tgacagctct 8700aaaactcatc
ctcgtttaaa accttataca ttgcttaatg attatgaaaa agagagatac 8760aaagaaccgg
ttagagagtc attgaaagct ctgttagcta taggatggaa tgtagagcat 8820actgaagttg
atattccttc taataacaga ggatcatcag tcagaagatc ttctaaagca 8880aatacatctg
atggttcaac accatttaat tatcatccca acccaattga tatgactaat 8940ttaacattga
gtagagaaat gcaaaatatg gcagagaggt tagctgaaaa ctcacatgat 9000atttgggcaa
aaaagaagaa agaagaactt gtttcatgtg gtggtggtat acacccacag 9060cttgttccat
atgatctttt aacagacaaa gagaagagga aagatagaga aagatctcaa 9120gaatttttga
aatatttaca atatcaagga tacaaactcc acaggcctac tcgaggaagt 9180gctgatgagc
aacaggccgc tgcagctgct gccacaggag agtccagatt tgcttacagt 9240ctactcgaga
aacttataca atatactgat aaagcttcta ttaatatgaa actactaaag 9300ccttctggta
cattcagtag acgctccagt tttaaaactt gttcaagaga cataaaattc 9360ttttccaaag
tggtattgct attggttgag aagtatttca gcactcacag aaattacttc 9420attgctgttg
ccactgcttc taataatgta ggagcagcct ctttaaaaga aaaagaaatg 9480gttgccagtt
tgttctgtaa gctggcaaat ttaattcgaa caaagctggc tgcttttggt 9540gcagatgttc
gaattactgt ccgttgtcta caagtgctag tgaaagctat agatgccaag 9600tcattggtaa
agaattgtcc tgaatttata aggacttcaa tgctgacatt tttcaataat 9660acagctgatg
acttaggcca aactattcag tgtttgcaag agggtcgtta cagtcacctt 9720agaggcactc
atcttaaaac atctacttct ttattttata taaatgatgt tgtactacct 9780gttctcactt
ctatgtttga tcatttggct gtgtgtgatt atggtagcga cttgttactt 9840gatgaaattc
aagtggcctc atatagaatg ttgggtagtt tatataattt aggaattgat 9900ccaactttaa
ctcatgacag aaaatattta aaaacagaaa ttgaaaggca taggcctgcc 9960attggtgctt
gtcttggtgc attttcatca acatttccag tcgcttatct tgaaccccat 10020ttaaataaac
ataatcagtt ttcattagtt aatagaattg ctgaacattc tcttgaagca 10080caggatattc
tagctagaat ggaaaacacc atgcctacat tggatgcgat cctttctgaa 10140gttgatcagt
tcattgaatc cgaaaagagt catacttcag caccacatgt tattgatgtg 10200attttgcctc
tgctttgtgc ttatttgcca agttggtgga gtcaaggtcc tgataatgtc 10260agtctcacag
cagggaatta tgtaacaatg gttactagtg atcatatgaa tcaactccta 10320aaaaatgtac
taaaattaat caaaaataat attggaaatg aaaatgctcc ctggatgacg 10380agaatagcag
cttacaccca gcagatcatc ataaactctt ctgaagaact gttgaaagat 10440ccattccttc
cattaacaca agttgttaag aagaggatag acaatatgtt tcaccgtgaa 10500gaatctcttc
gaggatttct aaaatcttca actgaagata cctctcaagt tgaagcagaa 10560attcaggagg
gctggcatct tattgttaga gatatatatt ctttttatcc actactaatt 10620aaatatgttg
atttacaaag aaatcactgg ttacgtaata atattccgga agctgaatac 10680ttgtatactc
atgttgctga tatatttaat atttggtcta aatcacagta ctttctaaaa 10740gaagaacaga
atttcatatc tgccaacgaa atagacaata tggctctaat tatgcccact 10800gcaactagga
gatctgcagt tgttttggat ggaacagctc ctgctggagg tggaaagaag 10860aaaaagaagc
atcgtgataa gaaaagagat aagaataaag aaatccaagc aagcttaatg 10920gtagcttgct
taaaacgttt attaccagtt ggtcttaacc tattcgctgg aagagaacaa 10980gagttagttc
agcattgtaa agacagatat ttgaagaaaa tgccagaata tgaaatagtg 11040gattttgcca
aaatccaatt aactcttcct gacaagatag atcctggaga tgagatgtct 11100tggcagcatt
atttgtactc aaaactggga aataaaaaag atatcagctc tgaaaaacca 11160cagcaaatcg
atgaggtagt tgataggatt gtggctatgg caaaagttct ttttgggctt 11220catatgattg
atcatccaca actacagagc aagacacaat acagatctgt tgtatccaca 11280cagagaaagc
gtgctgtcat agcttgtttc cggcaactat cactacatgc cttaccaagc 11340atgcaaataa
acctccacct caccaatctg gatggaaaag agttctttca gcagcgagaa 11400aacgggctgc
tattgcttgt cttagaactc aacctttgta tacccttcca aggcatcgag 11460taattaacat
atttgctcgc gcttattgtg agctgtggct gcaagaagag aatgttggtc 11520aagaaatcat
gattgaagat cttacacaaa cttttgaaga tgctgaattg aaaaaaagag 11580attctgaaga
agatgaaagc aaacctgatc cacttaccca attagttaca acattttgtc 11640ggggtgcaat
gactgaaagg agtggagctt tgcaagaaga cccactttat atgtcctatg 11700cagaaattac
tgcaaaatca tgtggagaag aagaagaaga aggtggagat gaggaagaag 11760gtggagacga
agaaggaggg gcatctatcc ataagacaat ggcaaaatta gtggaacaag 11820aaatggaaaa
acagaaactc ttattccatc aagctcggct agccaacaga ggtgttgcag 11880aaatggtatt
gttacatatt tcagcttgta aaggtgttcc cagtgaaatg gttatgaaaa 11940ctctccagct
gggtatttct gttttacgtg gtggtaatct tgatattcaa atgggtatgc 12000taaatcattt
gaaagaaaaa aaggatgttg gattttttac ttctatagct ggcttgatga 12060actcctgcag
tgtgttggat ttagatgcat ttgaaagaaa cacaaaagct gaaggcttag 12120gagttggttc
agaaggtgct gctggtgaaa agaacatgca tgatgctgaa ttcacctgta 12180ctcttttcag
atttattcaa cttacctgtg aagggcataa cttagaatgg cagaattatc 12240ttagaaccca
agctggaaat acaacaacag ttaatgttgt tatttgtact gttgattacc 12300ttttgagatt
acaggaatca attatggact tctattggca ctattcgagt aaagaattaa 12360ttgatcctgc
tggaaaagcc aactttttca aagcaattgg tgtggctagt caagtattta 12420atacactctc
tgaagtaatt caagggcctt gcccacaaaa tcaacaagct ctggctcatt 12480caagattgtg
ggatgctgtt ggaggatttt tgtttctttt ctctcatatg caagataagc 12540tatcaaaaca
ttctagtcaa gtagacttac tgaaagaact tttgaattta cagaaagata 12600tgataacaat
gatgctatca atgttggaag gtaatgttgt gaatggtact attggaaaac 12660agatggtaga
cacattagtt gaatctgcct caaatgtgga attgattttg aagtacttcg 12720acatgttttt
gaaattgaaa gatttgacat cctctgctag cttcttggaa cttgatccaa 12780accatgaagg
ctgggtaaca cctaaagatt ttaaagaaaa aatggaacag cagaaaagtt 12840atactccaga
agaaatagac ttcatgttac agtgctgtga aaccaatcat gacggtaaaa 12900ttgactatgt
tggcttcacg gatagattcc atgagccggc caaggaaatt ggttttaacc 12960tagctgttct
tctcacaaat ttatctgagc atatgccaaa tgaaccgaga cttgctcgct 13020ttttagaaac
agctggtagt gttcttaact actttgaacc tttcctggga cgaattgaaa 13080tattaggtag
tagtaaacga atcgagcgtg tatatttcga gattaaagaa tcaaatattg 13140aacagtggga
aaaacctcaa atcaaggaat ctaaacgagc atttttctat tcaattgtca 13200ctgaaggagg
tgacaaagaa aaattggaag cttttgttaa tttttgtgaa gatgccatat 13260ttgagatgac
acatgccagt gggcttatgg caactgatga tggtacaggc tctggaggag 13320gaaaacaaag
agcatcctct tattcttata tggaagatga agatgaagaa aggaatccaa 13380tcagacgtgg
ttggcaagca actaaagatg gaatttactt tatgttctca atgttatctc 13440ctagcaatat
taaacataaa attattgaaa tgcaacaaat gtcaattatt gaactaatga 13500ttggttttat
aaaactattt ttctacatgt tttattactc aggatattct gtatcagttg 13560tactgaagta
tattggtggt attatatttt cattgatgag gggaccacaa attgaagagc 13620cagttgtaga
agttaaagag gaagaaaaat ctggacctct gaggataatg cctgctttgc 13680caccacctga
agatagctct ctgcttccat ctgatgggtc aagagacatg aaaaaagaag 13740acagtcagcc
tccatcaaaa gtcatagaag gggctattcc catagaagaa ggaggtgaga 13800ggagctcaga
ggaacatgcg ggagaccatg taaaaccaga aaatgaagag caacctccaa 13860caccaacact
tgctgatata ttgggtggag aagcagcaag aaaagaagca gcacaaagag 13920cagaagtcgc
tgctgaacaa gaagcagtta tggctgcttt tgaggcagaa tctaaaatag 13980aaaaagtttc
agagccttct gctgtctctc aaattgattt taacaagtat actcaccggg 14040ctgtcagttt
ccttgctcgt aatttctata atcttaaata tgtagcattg gttttggctt 14100tctgcattaa
ctttatttta ttgttctaca aggtaacaac attgggtgaa gatgatgatg 14160ctgctagcgg
agaagggagt gttgaacaac taatggaaga attaacaggc gaaggtgatg 14220atgtgagtgg
cggaggaagt agtggtggag aaagtggtga agaggatcca attgaaatgg 14280ttcatgtgga
tgaggatttc ttttatatgg cacatgttat gcgattggct gcaatcctac 14340attctcttgt
ttctttagct atgttgattg catattatca tttgaaggtc cctctagcta 14400tattcaagag
agaaaaagaa atagctcgtc gacttgagtt tgatggtttg tacattgctg 14460agcaaccaga
agatgatgat attaaatcac attgggataa actggttatc tgtgcaaaat 14520catttcctgt
taattactgg gataaatttg tgaagaaaaa ggttcgacag aaatacagtg 14580aaacttatga
ctttgattca ataagtaatc ttttgggaat ggaaaaaaca tctttcagtg 14640cccaagatac
tgaagaagga tcgggactta ttcattacat tttgaacttt gactggaggt 14700atcagctttg
gaaagcagga gtcacaatca cagataatgc atttttgtac agtttattat 14760acttcatctt
ttcaattttg ggaaacttca ataacttttt ctttgctgcc catttacttg 14820atgttgcagt
tggttttaaa acattgagga ctattttgca atcagtcaca cacaatggaa 14880aacagcttgt
attgactgta atgctgctaa ccatcatagt atacatctat actgtcattg 14940ctttcaactt
cttccgaaaa ttttatgtcc aagaagagga tgaggaagtg gataaaaaat 15000gccacgatat
gttaacttgt tttgtattcc acctttacaa aggagttaga gctggtggtg 15060gtattggtga
tgagattgaa cctcctgatg gtgatgatta tgaagtttac aggataatgt 15120ttgatattac
gtttttcttt tttgttattg tcatcttgct agccatcatt caaggtttga 15180tcattgatgc
atttggtgaa ttgagagatc agttagaaag tgtaaaagaa gacatggaat 15240ctaactgctt
catttgtggg ataggaaaag attattttga taaagttccc catggttttg 15300acactcatgt
tcaacaagaa cataacttgg ctaattacat gttctttctt atgcatctga 15360ttaacaagcc
agatactgaa tacacaggtc aagaaaccta tgtctggaac atgtatcagc 15420aacgttgttg
ggatttcttc ccagttggtg actgttttcg taaacagtat gaagatgaac 15480tgggaggtgg
tggtggttaa ttcatttggg tgggtggtgg ctaaatttat attattaaaa 15540caaaattaat
gctgggaact atcaaacatc cttcaatttt attaaaattt cagctaaatt 15600caacaatata
tcttatgata ttgtatttgt ctaatgaagg aatagaacta tcgtgttatg 15660aatcagtgaa
gttttcactt gtttagcata atttatgcta agtttactat tgcaaaatac 15720tttctttata
tccgaaaatg ttgtaaaata aatgtaaatg gtgtggcctt aaatataatg
1578010965DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 10aatttagaat caaatattat tgatactatt tctttttcat
actttacatt aatattcttc 60aaaattaaaa atgccaggag tagagcatgt tactaacaaa
gtcgttgttc atcctttagt 120tctattaagt gttgttgatc atttcaatag aatgggtaaa
attgggaatc agaagagagt 180agttggcgta ttattaggat gctggaaggc aaaaggtgtt
ttagacgtat ctaatagttt 240tgcagtgcca tttgatgaag atgataaaga caaatcagtt
tggtttttag accatgatta 300tttagaaaat atgtatggca tgtttaagaa agttaatgca
agagaaaaag ttgttggctg 360gtatcataca ggcccaaagt tacatcaaaa tgatgttgca
attaatgaac ttatacgccg 420ttactgccct aactcagttc ttgttattat cgatgcaaaa
ccaaaggatc ttggtttacc 480tacagaagca tatagagcag ttgaagaagt acatgatgat
ggttctccta cgacaaaaac 540atttgagcat gttcccagtg aaataggggc tgaagaagca
gaggaagtgg gtgttgaaca 600tctgctgaga gatataaaag atacaactgt cggctcactt
tcgcaaaggg ttactaatca 660atttcttggt ctcaaaggcc ttaatcaaca aattcaagac
atcagggatt accttatgca 720ggttgttgaa ggaaaattgc ccatcaacca tcaaataata
tatcagcttc aagacatatt 780taatctcctt cctgacatga accatgggaa ctttgttgat
tcattataca taaaaacaaa 840tgatcagatg cttgtcgttt atctcgctgc cctcgttaga
gctattgttg ccttgcataa 900tctgatcaat aataaactca gtaatcgtga tgccgaaaaa
aaaaaaaaaa aaaaaaaaaa 960aaaaa
96511924DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 11tacttcattg
tcataaaggg gtaacattgc tgaatccagc gtaaaggtta cagtgactct 60cacctggtta
taacagtttt gctttgtaat catgggttct gagagatata gcttttcttt 120gactactttc
agtccatctg gaaaattagt tcaaattgag tatgcacttg ccgcagtcgc 180agctggagct
ccatcaatcg gtatcagagc atccaatgga gttgtattgg ctactgaaaa 240caaatacaaa
tcaattttat atgaagaaca tactattcaa aaagtagaaa tgataactaa 300acacattgga
atggtctaca gtggaatggg acctgattat aggctactag tgaagagagc 360tagaaaaatg
gctcaacaat aacagttagt ttacggtgag cctattccta ctgcacagct 420tgttcaacga
gttgccatgg ttatgcagga gtacactcaa tctggaggtg ttagaccttt 480tggagtttct
ttactcattg ccgggtggga tggggataaa ccatctctgt ttcaatgtga 540tccatctgga
gcatactttg cctggaaagc tactgcaatg ggaaaaaatt ttgtcactgg 600caaaacattt
ctagaaaaga ggtacagtga aactttagag ctggatgatg cagtacatac 660tgcaattctc
actcttaaag aaaactttga aggccaaatg acttcggaca atatcgaggt 720cggagtttgt
gatgatcaag ggttcagagt tttagatcct acaacagtga aggattatct 780ggctaatatt
ccataaattt attattaaaa tttgatttta taattaataa aaaggtgatt 840gcttatggat
atgtgtgatg cctaaataaa atattatttt ttattggttt aatgctaaaa 900aaaaaaaaaa
aaaaaaaaaa aaaa
92412946DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 12atcattgatg atggttgaga aagttccaga ctctacatat
gaaatggttg gaggtcttga 60taagcaaatt aaggaaatca aagaagtaat tgaacctcct
gtaaaacatc cagaactgtt 120tgatgcacta ggaatagctc agcccaaagg agttttatta
tatggaccac ctggaacagg 180taaaacactt ttggcaagag cagttgccca tcacactgag
tgcacgttca ttcgtgtgtc 240aggatctgag ttggttcaga aattcattgg ggaaggatcc
agaatggtta gagaattgtt 300cgtcatggca agggaacatg ctccatctat catatttatg
gatgaaatcg attcaatagg 360ttcatcacgt atcgaatctg ggagtggtgg tgattctgaa
gtccagagaa caatgttaga 420gttattgaac caattggatg gcttcgaagc cacaaaaaat
attaaggtca taatggccac 480taataggatt gatattttgg accctgctct tctgcgtcct
ggaaggatag atcgtaagat 540tgagttcccc ccaccaaatg aggaagctcg tttagatatc
cttagaattc attcacgtaa 600aatgaatctt acccggggta tcaacttgcg taaaattgcc
gagctcatgc ctggagcttc 660aggtgcagaa gtaaagggtg tctgtactga agcagggatg
tatgccctga gggagaggag 720aatccatgtc acccaagaag atttcgaaat ggctgtggcc
aaggttatgc aaaaggactc 780cgagaagaat atgtcaatca agaaattatg gaaataaacg
actcacttat tttttttttt 840ttttactctg tttaaaaagc tttaaatata tagatgtttg
tgaggttttg ttaaaaataa 900atatatacta taatcataaa aaaaaaaaaa aaaaaaaaaa
aaaaaa 9461321DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 13caactttgta
tagaaaagtt g
211421DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 14caactttgta taataaagtt g
21154855DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 15agattatcaa
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 60atctaaagta
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 120cctatctcag
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 180ataactacga
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 240ccacgctcac
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 300agaagtggtc
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 360agagtaagta
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc 420gtggtgtcac
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 480cgagttacat
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 540gttgtcagaa
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc acttacggat 600ggcatgacag
taagagaatt atgcagatgc ttttctgtga ctggtgagta ctcaaccaag 660tcattctgag
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat 720aataccgcgc
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 780cgaaaactct
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 840cccaactgat
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 900aggcaaaatg
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 960ttcctttttc
aatattattg aagcatttat cagggttatt gtctcatgat gatatatttt 1020tatcttgtgc
aatgtaacat cagagatttt gagacacggg ccagagctgc caggaaacag 1080ctatgaccat
gtaatacgac tcactatagg ggatatcgcg gccgccctgc agctggatgg 1140caaataatga
ttttattttg actgatagtg acctgttcgt tgcaacaaat tgataagcaa 1200tgctttctta
taatgccaac tttgtataga aaagttgaac gagaaacgta aaatgatata 1260aatatcaata
tattaaatta gattttgcat aaaaaacaga ctacataata ctgtaaaaca 1320caacatatcc
agtcactatg aatcaactac ttagatggta ttagtgacct gtagtcgact 1380aagttggcag
catcacccga cgcactttgc gccgaataaa tacctgtgac ggaagatcac 1440ttcgcagaat
aaataaatcc tggtgtccct gttgataccg ggaagccctg ggccaacttt 1500tggcgaaaat
gagacgttga tcggcacgta agaggttcca actttcacca taatgaaata 1560agatcactac
cgggcgtatt ttttgagtta tcgagatttt caggagctaa ggaagctaaa 1620atggagaaaa
aaatcactgg atataccacc gttgatatat cccaatggca tcgtaaagaa 1680cattttgagg
catttcagtc agttgctcaa tgtacctata accagaccgt tcagctggat 1740attacggcct
ttttaaagac cgtaaagaaa aataagcaca agttttatcc ggcctttatt 1800cacattcttg
cccgcctgat gaatgctcat ccggaattcc gtatggcaat gaaagacggt 1860gagctggtga
tatgggatag tgttcaccct tgttacaccg ttttccatga gcaaactgaa 1920acgttttcat
cgctctggag tgaataccac gacgatttcc ggcagtttct acacatatat 1980tcgcaagatg
tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg gtttattgag 2040aatatgtttt
tcgtctcagc caatccctgg gtgagtttca ccagttttga tttaaacgtg 2100gccaatatgg
acaacttctt cgcccccgtt ttcaccatgg gcaaatatta tacgcaaggc 2160gacaaggtgc
tgatgccgct ggcgattcag gttcatcatg ccgtttgtga tggcttccat 2220gtcggcagaa
tgcttaatga attacaacag tactgcgatg agtggcaggg ggggcgtaaa 2280cgccgcgtgg
atccggctta ctaaaagcca gataacagta tgcgtatttg cgcgctgatt 2340tttgcggtat
aagaatatat actgatatgt atacccgaag tatgtcaaaa agaggtatgc 2400tatgaagcag
cgtattacag tgacagttga cagcgacagc tatcagttgc tcaaggcata 2460tatgatgtca
atatctccgg tctggtaagc acaaccatgc agaatgaagc ccgtcgtctg 2520cgtgccgaac
gctggaaagc ggaaaatcag gaagggatgg ctgaggtcgc ccggtttatt 2580gaaatgaacg
gctcttttgc tgacgagaac aggggctggt gaaatgcagt ttaaggttta 2640cacctataaa
agagagagcc gttatcgtct gtttgtggat gtacagagtg atattattga 2700cacgcccggg
cgacggatgg tgatccccct ggccagtgca cgtctgctgt cagataaagt 2760ctcccgtgaa
ctttacccgg tggtgcatat cggggatgaa agctggcgca tgatgaccac 2820cgatatggcc
agtgtgccgg tctccgttat cggggaagaa gtggctgatc tcagccaccg 2880cgaaaatgac
atcaaaaacg ccattaacct gatgttctgg ggaatataaa tgtcaggctc 2940ccttatacac
agccagtctg caggtcgata cagtagaaat tacagaaact ttatcacgtt 3000tagtaagtat
agaggctgaa aatccagatg aagccgaacg acttgtaaga gaaaagtata 3060agagttgtga
aattgttctt gatgcagatg attttcagga ctatgacact agcgtatatg 3120aataggtaga
tgtttttatt ttgtcacaca aaaaagaggc tcgcacctct ttttcttatt 3180tctttttatg
atttaatacg gcattgagga caatagcgag taggctggat acgacgattc 3240cgtttgagaa
gaacatttgg aaggctgtcg gtcgactaag ttggcagcat cacccgaaga 3300acatttggaa
ggctgtcggt cgactacagg tcactaatac catctaagta gttgattcat 3360agtgactgga
tatgttgtgt tttacagtat tatgtagtct gttttttatg caaaatctaa 3420tttaatatat
tgatatttat atcattttac gtttctcgtt caactttatt atacaaagtt 3480ggcattataa
aaaagcattg ctcatcaatt tgttgcaacg aacaggtcac tatcagtcaa 3540aataaaatca
ttatttgggg cccgagctta agactggccg tcgttttaca acgtcgtgac 3600tgggaaaaca
tccatgctag cgttaacgcg agagtaggga actgccaggc atcaaataaa 3660acgaaaggct
cagtcggaag actgggcctt tcgttttatc tgttgtttgt cggtgaacgc 3720tctcctgagt
aggacaaatc cgccgggagc ggatttgaac gttgtgaagc aacggcccgg 3780agggtggcgg
gcaggacgcc cgccataaac tgccaggcat caaactaagc agaaggccat 3840cctgacggat
ggcctttttg cgtttctaca aactcttcct ggctagcggt acgcgtatta 3900attgcgttgc
gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 3960tgaatcggcc
aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 4020ctcactgact
cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 4080gcggtaatac
ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 4140ggccagcaaa
aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 4200cgcccccctg
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 4260ggactataaa
gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 4320accctgccgc
ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 4380catagctcac
gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 4440gtgcacgaac
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 4500tccaacccgg
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 4560agagcgaggt
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 4620actagaagaa
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 4680gttggtagct
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 4740aagcagcaga
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 4800gggtctgacg
ctcagtggaa cgacgcgtaa ctcacgttaa gggattttgg tcatg
48551611321DNAArtificial SequenceDescription of artificial sequence note
= synthetic constructmisc_feature(6511)..(6511)n is a, c, g, or t
16cgtaccggcc ggcctctgcc tgcgttctgc tgtggaagtt cctattccga agttcctatt
60ctccagaaag tataggaact tcacatgctg cctcgtgcaa gtcacgatct cgagttctat
120agtgtcacct aaatcgtatg tgtatgatac ataaggttat gtattaattg tagccgcgtt
180ctaacgacaa tatgtccata tggtgcactc tcagtacaat ctgctctgat gccgcatagt
240taagccagcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc
300cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt
360caccgtcatc accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg
420ttaatgtcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg
480tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc
540aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc
600tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt
660agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc
720taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact
780caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac
840agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag
900aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg
960gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg
1020tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga
1080gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt
1140ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct
1200ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg
1260aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt
1320aatgcaggtt gatcagatct cgatcccgcg aaattaatac gactcactat agggagacca
1380caacggtttc cctctagaaa taattttgtt taactttaag aaggagatat acccatggaa
1440aagcctgaac tcaccgcgac gtctgtcgag aagtttctga tcgaaaagtt cgacagcgtc
1500tccgacctga tgcagctctc ggagggcgaa gaatctcgtg ctttcagctt cgatgtagga
1560gggcgtggat atgtcctgcg ggtaaatagc tgcgccgatg gtttctacaa agatcgttat
1620gtttatcggc actttgcatc ggccgcgctc ccgattccgg aagtgcttga cattggggaa
1680ttcagcgaga gcctgaccta ttgcatctcc cgccgtgcac agggtgtcac gttgcaagac
1740ctgcctgaaa ccgaactgcc cgctgttctg cagccggtcg cggaggctat ggatgcgatc
1800gctgcggccg atcttagcca gacgagcggg ttcggcccat tcggaccgca aggaatcggt
1860caatacacta catggcgtga tttcatatgc gcgattgctg atccccatgt gtatcactgg
1920caaactgtga tggacgacac cgtcagtgcg tccgtcgcgc aggctctcga tgagctgatg
1980ctttgggccg aggactgccc cgaagtccgg cacctcgtgc acgcggattt cggctccaac
2040aatgtcctga cggacaatgg ccgcataaca gcggtcattg actggagcga ggcgatgttc
2100ggggattccc aatacgaggt cgccaacatc ttcttctgga ggccgtggtt ggcttgtatg
2160gagcagcaga cgcgctactt cgagcggagg catccggagc ttgcaggatc gccgcggctc
2220cgggcgtata tgctccgcat tggtcttgac caactctatc agagcttggt tgacggcaat
2280ttcgatgatg cagcttgggc gcagggtcga tgcgacgcaa tcgtccgatc cggagccggg
2340actgtcgggc gtacacaaat cgcccgcaga agcgcggccg tctggaccga tggctgtgta
2400gaagtactcg ccgatagtgg aaaccgacgc cccagcactc gtccgagggc aaaggaatag
2460tgaggtacag cttggatcga tccggctgct aacaaagccc gaaaggaagc tgagttggct
2520gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg
2580ggttttttgc tgaaaggagg aactatatcc ggatgatcgt cgaggcctca cgtgttaaca
2640gaagttccta ttccgaagtt cctattctct agaaagtata ggaacttcca ccacacaaca
2700caatggcggc caccgcttcc agaaccaccc gattctcttc ttcctcttca caccccacct
2760tccccaaacg cattactaga tccaccctcc ctctctctca tcaaaccctc accaaaccca
2820accacgctct caaaatcaaa tgttccatct ccaaaccccc cacggcggcg cccttcacca
2880aggaagcgcc gaccacggag cccttcgtgt cacggttcgc ctccggcgaa cctcgcaagg
2940gcgcggacat ccttgtggag gcgctggaga ggcagggcgt gacgacggtg ttcgcgtacc
3000ccggcggtgc gtcgatggag atccaccagg cgctcacgcg ctccgccgcc atccgcaacg
3060tgctcccgcg ccacgagcag ggcggcgtct tcgccgccga aggctacgcg cgttcctccg
3120gcctccccgg cgtctgcatt gccacctccg gccccggcgc caccaacctc gtgagcggcc
3180tcgccgacgc tttaatggac agcgtcccag tcgtcgccat caccggccag gtcagccgcc
3240ggatgatcgg caccgacgcc ttccaagaaa ccccgatcgt ggaggtgagc agatccatca
3300cgaagcacaa ctacctcatc ctcgacgtcg acgacatccc ccgcgtcgtc gccgaggctt
3360tcttcgtcgc cacctccggc cgccccggtc cggtcctcat cgacattccc aaagacgttc
3420agcagcaact cgccgtgcct aattgggacg agcccgttaa cctccccggt tacctcgcca
3480ggctgcccag gccccccgcc gaggcccaat tggaacacat tgtcagactc atcatggagg
3540cccaaaagcc cgttctctac gtcggcggtg gcagtttgaa ttccagtgct gaattgaggc
3600gctttgttga actcactggt attcccgttg ctagcacttt aatgggtctt ggaacttttc
3660ctattggtga tgaatattcc cttcagatgc tgggtatgca tggtactgtt tatgctaact
3720atgctgttga caatagtgat ttgttgcttg cctttggggt aaggtttgat gaccgtgtta
3780ctgggaagct tgaggctttt gctagtaggg ctaagattgt tcacattgat attgattctg
3840ccgagattgg gaagaacaag caggcgcacg tgtcggtttg cgcggatttg aagttggcct
3900tgaagggaat taatatgatt ttggaggaga aaggagtgga gggtaagttt gatcttggag
3960gttggagaga agagattaat gtgcagaaac acaagtttcc attgggttac aagacattcc
4020aggacgcgat ttctccgcag catgctatcg aggttcttga tgagttgact aatggagatg
4080ctattgttag tactggggtt gggcagcatc aaatgtgggc tgcgcagttt tacaagtaca
4140agagaccgag gcagtggttg acctcagggg gtcttggagc catgggtttt ggattgcctg
4200cggctattgg tgctgctgtt gctaaccctg gggctgttgt ggttgacatt gatggggatg
4260gtagtttcat catgaatgtt caggagttgg ccactataag agtggagaat ctcccagtta
4320agatattgtt gttgaacaat cagcatttgg gtatggtggt tcagtgggag gataggttct
4380acaagtccaa tagagctcac acctatcttg gagatccgtc tagcgagagc gagatattcc
4440caaacatgct caagtttgct gatgcttgtg ggataccggc agcgcgagtg acgaagaagg
4500aagagcttag agcggcaatt cagagaatgt tggacacccc tggcccctac cttcttgatg
4560tcattgtgcc ccatcaggag catgtgttgc cgatgattcc cagtaatgga tccttcaagg
4620atgtgataac tgagggtgat ggtagaacga ggtactgatt gcctagacca aatgttcctt
4680gatgcttgtt ttgtacaata tatataagat aatgctgtcc tagttgcagg atttggcctg
4740tggtgagcat catagtctgt agtagttttg gtagcaagac attttatttt ccttttattt
4800aacttactac atgcagtagc atctatctat ctctgtagtc tgatatctcc tgttgtctgt
4860attgtgccgt tggatttttt gctgtagtga gactgaaaat gatgtgctag taataatatt
4920tctgttagaa atctaagtag agaatctgtt gaagaagtca aaagctaatg gaatcaggtt
4980acatattcaa tgtttttctt tttttagcgg ttggtagacg tgtagattca acttctcttg
5040gagctcacct aggcaatcag taaaatgcat attccttttt taacttgcca tttatttact
5100tttagtggaa attgtgacca atttgttcat gtagaacgga tttggaccat tgcgtccaca
5160aaacgtctct tttgctcgat cttcacaaag cgataccgaa atccagagat agttttcaaa
5220agtcagaaat ggcaaagtta taaatagtaa aacagaatag atgctgtaat cgacttcaat
5280aacaagtggc atcacgtttc tagttctaga cccatcagct gaggtacacc ggtgatcctc
5340gaagagaagg gttaataaca cactttttta acatttttaa cacaaatttt agttatttaa
5400aaatttatta aaaaatttaa aataagaaga ggaactcttt aaataaatct aacttacaaa
5460atttatgatt tttaataagt tttcaccaat aaaaaatgtc ataaaaatat gttaaaaagt
5520atattatcaa tattctcttt atgataaata aaaagaaaaa aaaaataaaa gttaagtgaa
5580aatgagattg aagtgacttt aggtgtgtat aaatatatca accccgccaa caatttattt
5640aatccaaata tattgaagta tattattcca tagcctttat ttatttatat atttattata
5700taaaagcttt atttgttcta ggttgttcat gaaatatttt tttggtttta tctccgttgt
5760aagaaaatca tgtgctttgt gtcgccactc actattgcag ctttttcatg cattggtcag
5820attgacggtt gattgtattt ttgtttttta tggttttgtg ttatgactta agtcttcatc
5880tctttatctc ttcatcaggt ttgatggtta cctaatatgg tccatgggta catgcatggt
5940taaattaggt ggccaacttt gttgtgaacg atagaatttt tttttatatt aagtaaacta
6000tttttatatt atgaaataat aataaaaaaa atattttatc attattaaca aaatcatatt
6060agttaatttg ttaactctat aataaaagaa atactgtaac attcacatta catggtaaca
6120tctttccacc ctttcatttg ttttttgttt gatgactttt tttcttgttt aaatttattt
6180cccttctttt aaatttggaa tacattatca tcatatataa actaaaatac taaaaacagg
6240attacacaaa tgataaataa taacacaaat atttataaat ctagctgcaa tatatttaaa
6300ctagctatat cgatattgta aaataaaact agctgcattg atactgataa aaaaatatca
6360tgtgctttct ggactgatga tgcagtatac ttttgacatt gcctttattt tatttttcag
6420aaaagctttc ttagttctgg gttcttcatt atttgtttcc catctccatt gtgaattgaa
6480tcatttgctt cgtgtcacaa atacatttag ntaggtacat gcattggtca gattcacggt
6540ttattatgtc atgacttaag ttcatggtag tacattacct gccacgcatg cattatattg
6600gttagatttg ataggcaaat ttggttgtca acaatataaa tataaataat gtttttatat
6660tacgaaataa cagtgatcaa aacaaacagt tttatcttta ttaacaagat tttgtttttg
6720tttgatgacg ttttttaatg tttacgcttt cccccttctt ttgaatttag aacactttat
6780catcataaaa tcaaatacta aaaaaattac atatttcata aataataaca caaatatttt
6840taaaaaatct gaaataataa tgaacaatat tacatattat cacgaaaatt cattaataaa
6900aatattatat aaataaaatg taatagtagt tatatgtagg aaaaaagtac tgcacgcata
6960atatatacaa aaagattaaa atgaactatt ataaataata acactaaatt aatggtgaat
7020catatcaaaa taatgaaaaa gtaaataaaa tttgtaatta acttctatat gtattacaca
7080cacaaataat aaataatagt aaaaaaaatt atgataaata tttaccatct cataaagata
7140tttaaaataa tgataaaaat atagattatt ttttatgcaa ctagctagcc aaaaagagaa
7200cacgggtata tataaaaaga gtacctttaa attctactgt acttccttta ttcctgacgt
7260ttttatatca agtggacata cgtgaagatt ttaattatca gtctaaatat ttcattagca
7320cttaatactt ttctgtttta ttcctatcct ataagtagtc ccgattctcc caacattgct
7380tattcacaca actaactaag aaagtcttcc atagcccccc aagccctagg cgctatcaac
7440tttgtataga aaagttgaac gagaaacgta aaatgatata aatatcaata tattaaatta
7500gattttgcat aaaaaacaga ctacataata ctgtaaaaca caacatatcc agtcactatg
7560gtcgacattt tcaggagcta aggaagctaa aatggagaaa aaaatcactg gatataccac
7620cgttgatata tcccaatggc atcgtaaaga acattttgag gcatttcagt cagttgctca
7680atgtacctat aaccagaccg ttcagctgga tattacggcc tttttaaaga ccgtaaagaa
7740aaataagcac aagttttatc cggcctttat tcacattctt gcccgcctga tgaatgctca
7800tccggaattc cgtatggcaa tgaaagacgg tgagctggtg atatgggata gtgttcaccc
7860ttgttacacc gttttccatg agcaaactga aacgttttca tcgctctgga gtgaatacca
7920cgacgatttc cggcagtttc tacacatata ttcgcaagat gtggcgtgtt acggtgaaaa
7980cctggcctat ttccctaaag ggtttattga gaatatgttt ttcgtctcag ccaatccctg
8040ggtgagtttc accagttttg atttaaacgt ggccaatatg gacaacttct tcgcccccgt
8100tttcaccatg ggcaaatatt atacgcaagg cgacaaggtg ctgatgccgc tggcgattca
8160ggttcatcat gccgtctgtg atggcttcca tgtcggcaga atgcttaatg aattacaaca
8220gtactgcgat gagtggcagg gcggggcgta aacgcgtgga tccggcttac taaaagccag
8280ataacagtat gcgtatttgc gcgctgattt ttgcggtata agaatatata ctgatatgta
8340tacccgaagt atgtcaaaaa gaggtgtgct atgaagcagc gtattacagt gacagttgac
8400agcgacagct atcagttgct caaggcatat atgatgtcaa tatctccggt ctggtaagca
8460caaccatgca gaatgaagcc cgtcgtctgc gtgccgaacg ctggaaagcg gaaaatcagg
8520aagggatggc tgaggtcgcc cggtttattg aaatgaacgg ctcttttgct gacgagaaca
8580gggactggtg aaatgcagtt taaggtttac acctataaaa gagagagccg ttatcgtctg
8640tttgtggatg tacagagtga tattattgac acgccagggc gacggatggt gatccccctg
8700gccagtgcac gtctgctgtc agataaagtc tcccgtgaac tttacccggt ggtgcatatc
8760ggggatgaaa gctggcgcat gatgaccacc gatatggcca gtgtgccggt ctccgttatc
8820ggggaagaag tggctgatct cagccaccgc gaaaatgaca tcaaaaacgc cattaacctg
8880atgttctggg gaatataaat gtcaggctcc cttatacaca ggcggccgcc atagtgactg
8940gatatgttgt gttttacagt attatgtagt ctgtttttta tgcaaaatct aatttaatat
9000attgatattt atatcatttt acgtttctcg ttcaacttta ttatacaaag ttgatagata
9060tcggtccgag atccatcagg taagtttctg cttctacctt tgatatatat ataataatta
9120tcattaatta gtagtaatat aatatttcaa atattttttt caaaataaaa gaatgtagta
9180tatagcaatt gcttttctgt agtttataag tgtgtatatt ttaatttata acttttctaa
9240tatatgacca aaacatggtg atgtgcaggt ccatggtgga gctcgaccga tatctatcaa
9300ctttgtataa taaagttgaa cgagaaacgt aaaatgatat aaatatcaat atattaaatt
9360agattttgca taaaaaacag actacataat actgtaaaac acaacatatc cagtcactat
9420ggcggccgca ttagggcacc ccaggcttta cactttatgc ttccggctcg tataatgtgt
9480ggattttgag ttaggatccg tcgagatttt caggagctaa ggaagctaaa atggagaaaa
9540aaatcactgg atataccacc gttgatatat cccaatggca tcgtaaagaa cattttgagg
9600catttcagtc agttgctcaa tgtacctata accagaccgt tcagctggat attacggcct
9660ttttaaagac cgtaaagaaa aataagcaca agttttatcc ggcctttatt cacattcttg
9720cccgcctgat gaatgctcat ccggaattcc gtatggcaat gaaagacggt gagctggtga
9780tatgggatag tgttcaccct tgttacaccg ttttccatga gcaaactgaa acgttttcat
9840cgctctggag tgaataccac gacgatttcc ggcagtttct acacatatat tcgcaagatg
9900tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg gtttattgag aatatgtttt
9960tcgtctcagc caatccctgg gtgagtttca ccagttttga tttaaacgtg gccaatatgg
10020acaacttctt cgcccccgtt ttcaccatgg gcaaatatta tacgcaaggc gacaaggtgc
10080tgatgccgct ggcgattcag gttcatcatg ccgtttgtga tggcttccat gtcggcagaa
10140tgcttaatga attacaacag tactgcgatg agtggcaggc ggggcgtaat ctagaggatc
10200cggcttacta aaagccagat aacagtatgc gtatttgcgc gctgattttt gcggtataag
10260aatatatact gatatgtata cccgaagtat gtcaaaaaga ggtatgctat gaagcagcgt
10320attacagtga cagttgacag cgacagctat cagttgctca aggcatatat gatgtcaata
10380tctccggttc ggtaagcaca accatgcaga atgaagcccg tcgtctgcgt gccgaacgct
10440ggaaagcgga aaatcaggaa gggatggctg aggtcgcccg gtttattgaa atgaacggct
10500cttttgccga cgagaacagg ggctggtgaa atgcagttta aggtttacac ctataaaaga
10560gagagccgtt atcgtctgtt tgtggatgta cagagtgata ttattgacac gccagggcga
10620cggatggtga tccccctggc cagtgcacgt ctgctgtcag ataaagtccc ccgtgaactt
10680tacccggtgg tgcatatcgg ggatgaaagc tggcgcatga tgaccaccga tatggccagt
10740gtgccggtct ccgttatcgg ggaagaagtg gctgatctca gccaccgcga aaatgacatc
10800aaaaacgcca ttaacctgat gttctgggga atataaatgt caggctccct tatacacagc
10860cagtctgcag gtcgaccata gtgactggat atgttgtgtt ttacagtatt atgtagtctg
10920ttttttatgc aaaatctaat ttaatatatt gatatttata tcattttacg tttctcgttc
10980aacttttcta tacaaagttg atagcgttaa cccgggtaac tgtacctaaa gaaggagtgc
11040gtcgaagcag atcgttcaaa catttggcaa taaagtttct taagattgaa tcctgttgcc
11100ggtcttgcga tgattatcat ataatttctg ttgaattacg ttaagcatgt aataattaac
11160atgtaatgca tgacgttatt tatgagatgg gtttttatga ttagagtccc gcaattatac
11220atttaatacg cgatagaaaa caaaatatag cgcgcaaact aggataaatt atcgcgcgcg
11280gtgtcatcta tgttactaga tcgatgtcga atcgatgggc c
11321178851DNAArtificial SequenceDescription of artificial sequence note
= synthetic constructmisc_feature(6841)..(6841)n is a, c, g, or t
17acaaagttga tagcgttaac ccgggtaact gtacctaaag aaggagtgcg tcgaagcaga
60tcgttcaaac atttggcaat aaagtttctt aagattgaat cctgttgccg gtcttgcgat
120gattatcata taatttctgt tgaattacgt taagcatgta ataattaaca tgtaatgcat
180gacgttattt atgagatggg tttttatgat tagagtcccg caattataca tttaatacgc
240gatagaaaac aaaatatagc gcgcaaacta ggataaatta tcgcgcgcgg tgtcatctat
300gttactagat cgatgtcgaa tcgatgggcc cgtaccggcc ggcctctgcc tgcgttctgc
360tgtggaagtt cctattccga agttcctatt ctccagaaag tataggaact tcacatgctg
420cctcgtgcaa gtcacgatct cgagttctat agtgtcacct aaatcgtatg tgtatgatac
480ataaggttat gtattaattg tagccgcgtt ctaacgacaa tatgtccata tggtgcactc
540tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg
600ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg
660tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgagacgaa
720agggcctcgt gatacgccta tttttatagg ttaatgtcat gaccaaaatc ccttaacgtg
780agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc
840ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg
900tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag
960cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact
1020ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg
1080gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc
1140ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg
1200aactgagata cctacagcgt gagcattgag aaagcgccac gcttcccgaa gggagaaagg
1260cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag
1320ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc
1380gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct
1440ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc
1500ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc
1560gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaaac
1620cgcctctccc cgcgcgttgg ccgattcatt aatgcaggtt gatcagatct cgatcccgcg
1680aaattaatac gactcactat agggagacca caacggtttc cctctagaaa taattttgtt
1740taactttaag aaggagatat acccatggaa aagcctgaac tcaccgcgac gtctgtcgag
1800aagtttctga tcgaaaagtt cgacagcgtc tccgacctga tgcagctctc ggagggcgaa
1860gaatctcgtg ctttcagctt cgatgtagga gggcgtggat atgtcctgcg ggtaaatagc
1920tgcgccgatg gtttctacaa agatcgttat gtttatcggc actttgcatc ggccgcgctc
1980ccgattccgg aagtgcttga cattggggaa ttcagcgaga gcctgaccta ttgcatctcc
2040cgccgtgcac agggtgtcac gttgcaagac ctgcctgaaa ccgaactgcc cgctgttctg
2100cagccggtcg cggaggctat ggatgcgatc gctgcggccg atcttagcca gacgagcggg
2160ttcggcccat tcggaccgca aggaatcggt caatacacta catggcgtga tttcatatgc
2220gcgattgctg atccccatgt gtatcactgg caaactgtga tggacgacac cgtcagtgcg
2280tccgtcgcgc aggctctcga tgagctgatg ctttgggccg aggactgccc cgaagtccgg
2340cacctcgtgc acgcggattt cggctccaac aatgtcctga cggacaatgg ccgcataaca
2400gcggtcattg actggagcga ggcgatgttc ggggattccc aatacgaggt cgccaacatc
2460ttcttctgga ggccgtggtt ggcttgtatg gagcagcaga cgcgctactt cgagcggagg
2520catccggagc ttgcaggatc gccgcggctc cgggcgtata tgctccgcat tggtcttgac
2580caactctatc agagcttggt tgacggcaat ttcgatgatg cagcttgggc gcagggtcga
2640tgcgacgcaa tcgtccgatc cggagccggg actgtcgggc gtacacaaat cgcccgcaga
2700agcgcggccg tctggaccga tggctgtgta gaagtactcg ccgatagtgg aaaccgacgc
2760cccagcactc gtccgagggc aaaggaatag tgaggtacag cttggatcga tccggctgct
2820aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata actagcataa
2880ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg aactatatcc
2940ggatgatcgt cgaggcctca cgtgttaaca gaagttccta ttccgaagtt cctattctct
3000agaaagtata ggaacttcca ccacacaaca caatggcggc caccgcttcc agaaccaccc
3060gattctcttc ttcctcttca caccccacct tccccaaacg cattactaga tccaccctcc
3120ctctctctca tcaaaccctc accaaaccca accacgctct caaaatcaaa tgttccatct
3180ccaaaccccc cacggcggcg cccttcacca aggaagcgcc gaccacggag cccttcgtgt
3240cacggttcgc ctccggcgaa cctcgcaagg gcgcggacat ccttgtggag gcgctggaga
3300ggcagggcgt gacgacggtg ttcgcgtacc ccggcggtgc gtcgatggag atccaccagg
3360cgctcacgcg ctccgccgcc atccgcaacg tgctcccgcg ccacgagcag ggcggcgtct
3420tcgccgccga aggctacgcg cgttcctccg gcctccccgg cgtctgcatt gccacctccg
3480gccccggcgc caccaacctc gtgagcggcc tcgccgacgc tttaatggac agcgtcccag
3540tcgtcgccat caccggccag gtcagccgcc ggatgatcgg caccgacgcc ttccaagaaa
3600ccccgatcgt ggaggtgagc agatccatca cgaagcacaa ctacctcatc ctcgacgtcg
3660acgacatccc ccgcgtcgtc gccgaggctt tcttcgtcgc cacctccggc cgccccggtc
3720cggtcctcat cgacattccc aaagacgttc agcagcaact cgccgtgcct aattgggacg
3780agcccgttaa cctccccggt tacctcgcca ggctgcccag gccccccgcc gaggcccaat
3840tggaacacat tgtcagactc atcatggagg cccaaaagcc cgttctctac gtcggcggtg
3900gcagtttgaa ttccagtgct gaattgaggc gctttgttga actcactggt attcccgttg
3960ctagcacttt aatgggtctt ggaacttttc ctattggtga tgaatattcc cttcagatgc
4020tgggtatgca tggtactgtt tatgctaact atgctgttga caatagtgat ttgttgcttg
4080cctttggggt aaggtttgat gaccgtgtta ctgggaagct tgaggctttt gctagtaggg
4140ctaagattgt tcacattgat attgattctg ccgagattgg gaagaacaag caggcgcacg
4200tgtcggtttg cgcggatttg aagttggcct tgaagggaat taatatgatt ttggaggaga
4260aaggagtgga gggtaagttt gatcttggag gttggagaga agagattaat gtgcagaaac
4320acaagtttcc attgggttac aagacattcc aggacgcgat ttctccgcag catgctatcg
4380aggttcttga tgagttgact aatggagatg ctattgttag tactggggtt gggcagcatc
4440aaatgtgggc tgcgcagttt tacaagtaca agagaccgag gcagtggttg acctcagggg
4500gtcttggagc catgggtttt ggattgcctg cggctattgg tgctgctgtt gctaaccctg
4560gggctgttgt ggttgacatt gatggggatg gtagtttcat catgaatgtt caggagttgg
4620ccactataag agtggagaat ctcccagtta agatattgtt gttgaacaat cagcatttgg
4680gtatggtggt tcagtgggag gataggttct acaagtccaa tagagctcac acctatcttg
4740gagatccgtc tagcgagagc gagatattcc caaacatgct caagtttgct gatgcttgtg
4800ggataccggc agcgcgagtg acgaagaagg aagagcttag agcggcaatt cagagaatgt
4860tggacacccc tggcccctac cttcttgatg tcattgtgcc ccatcaggag catgtgttgc
4920cgatgattcc cagtaatgga tccttcaagg atgtgataac tgagggtgat ggtagaacga
4980ggtactgatt gcctagacca aatgttcctt gatgcttgtt ttgtacaata tatataagat
5040aatgctgtcc tagttgcagg atttggcctg tggtgagcat catagtctgt agtagttttg
5100gtagcaagac attttatttt ccttttattt aacttactac atgcagtagc atctatctat
5160ctctgtagtc tgatatctcc tgttgtctgt attgtgccgt tggatttttt gctgtagtga
5220gactgaaaat gatgtgctag taataatatt tctgttagaa atctaagtag agaatctgtt
5280gaagaagtca aaagctaatg gaatcaggtt acatattcaa tgtttttctt tttttagcgg
5340ttggtagacg tgtagattca acttctcttg gagctcacct aggcaatcag taaaatgcat
5400attccttttt taacttgcca tttatttact tttagtggaa attgtgacca atttgttcat
5460gtagaacgga tttggaccat tgcgtccaca aaacgtctct tttgctcgat cttcacaaag
5520cgataccgaa atccagagat agttttcaaa agtcagaaat ggcaaagtta taaatagtaa
5580aacagaatag atgctgtaat cgacttcaat aacaagtggc atcacgtttc tagttctaga
5640cccatcagct gaggtacacc ggtgatcctc gaagagaagg gttaataaca cactttttta
5700acatttttaa cacaaatttt agttatttaa aaatttatta aaaaatttaa aataagaaga
5760ggaactcttt aaataaatct aacttacaaa atttatgatt tttaataagt tttcaccaat
5820aaaaaatgtc ataaaaatat gttaaaaagt atattatcaa tattctcttt atgataaata
5880aaaagaaaaa aaaaataaaa gttaagtgaa aatgagattg aagtgacttt aggtgtgtat
5940aaatatatca accccgccaa caatttattt aatccaaata tattgaagta tattattcca
6000tagcctttat ttatttatat atttattata taaaagcttt atttgttcta ggttgttcat
6060gaaatatttt tttggtttta tctccgttgt aagaaaatca tgtgctttgt gtcgccactc
6120actattgcag ctttttcatg cattggtcag attgacggtt gattgtattt ttgtttttta
6180tggttttgtg ttatgactta agtcttcatc tctttatctc ttcatcaggt ttgatggtta
6240cctaatatgg tccatgggta catgcatggt taaattaggt ggccaacttt gttgtgaacg
6300atagaatttt tttttatatt aagtaaacta tttttatatt atgaaataat aataaaaaaa
6360atattttatc attattaaca aaatcatatt agttaatttg ttaactctat aataaaagaa
6420atactgtaac attcacatta catggtaaca tctttccacc ctttcatttg ttttttgttt
6480gatgactttt tttcttgttt aaatttattt cccttctttt aaatttggaa tacattatca
6540tcatatataa actaaaatac taaaaacagg attacacaaa tgataaataa taacacaaat
6600atttataaat ctagctgcaa tatatttaaa ctagctatat cgatattgta aaataaaact
6660agctgcattg atactgataa aaaaatatca tgtgctttct ggactgatga tgcagtatac
6720ttttgacatt gcctttattt tatttttcag aaaagctttc ttagttctgg gttcttcatt
6780atttgtttcc catctccatt gtgaattgaa tcatttgctt cgtgtcacaa atacatttag
6840ntaggtacat gcattggtca gattcacggt ttattatgtc atgacttaag ttcatggtag
6900tacattacct gccacgcatg cattatattg gttagatttg ataggcaaat ttggttgtca
6960acaatataaa tataaataat gtttttatat tacgaaataa cagtgatcaa aacaaacagt
7020tttatcttta ttaacaagat tttgtttttg tttgatgacg ttttttaatg tttacgcttt
7080cccccttctt ttgaatttag aacactttat catcataaaa tcaaatacta aaaaaattac
7140atatttcata aataataaca caaatatttt taaaaaatct gaaataataa tgaacaatat
7200tacatattat cacgaaaatt cattaataaa aatattatat aaataaaatg taatagtagt
7260tatatgtagg aaaaaagtac tgcacgcata atatatacaa aaagattaaa atgaactatt
7320ataaataata acactaaatt aatggtgaat catatcaaaa taatgaaaaa gtaaataaaa
7380tttgtaatta acttctatat gtattacaca cacaaataat aaataatagt aaaaaaaatt
7440atgataaata tttaccatct cataaagata tttaaaataa tgataaaaat atagattatt
7500ttttatgcaa ctagctagcc aaaaagagaa cacgggtata tataaaaaga gtacctttaa
7560attctactgt acttccttta ttcctgacgt ttttatatca agtggacata cgtgaagatt
7620ttaattatca gtctaaatat ttcattagca cttaatactt ttctgtttta ttcctatcct
7680ataagtagtc ccgattctcc caacattgct tattcacaca actaactaag aaagtcttcc
7740atagcccccc aagccctagg cgctatcaac tttgtataga aaagttgaag catcacttcg
7800acatcttcaa gcatataatc ttcttcataa ccatcatctg agttagggat accggtcatt
7860gggtcgcaat cctttacaac aaacctcaga gtagcagcaa agtttgtgac agtaagatta
7920ggggcatctg gaaattgcag ggcaatataa gtaggtgatg tttccgaaca ggctaatctt
7980tggcagggga cctcagctac gatctggtaa ccttcgctca cttctaactg cactcgaact
8040ttctccagta gctggtcact catggtattt gtacagtcaa actgaagaac aatatgattt
8100ttgaaaacat gcttcgtgac tctaacttga tattctgtct cagattccgt aagactgata
8160ggatcggaac caactttatt atacaaagtt gatagatatc ggtccgagat ccatcaggta
8220agtttctgct tctacctttg atatatatat aataattatc attaattagt agtaatataa
8280tatttcaaat atttttttca aaataaaaga atgtagtata tagcaattgc ttttctgtag
8340tttataagtg tgtatatttt aatttataac ttttctaata tatgaccaaa acatggtgat
8400gtgcaggtcc atggtggagc tcgaccgata tctatcaact ttgtataata aagttggttc
8460cgatcctatc agtcttacgg aatctgagac agaatatcaa gttagagtca cgaagcatgt
8520tttcaaaaat catattgttc ttcagtttga ctgtacaaat accatgagtg accagctact
8580ggagaaagtt cgagtgcagt tagaagtgag cgaaggttac cagatcgtag ctgaggtccc
8640ctgccaaaga ttagcctgtt cggaaacatc acctacttat attgccctgc aatttccaga
8700tgcccctaat cttactgtca caaactttgc tgctactctg aggtttgttg taaaggattg
8760cgacccaatg accggtatcc ctaactcaga tgatggttat gaagaagatt atatgcttga
8820agatgtcgaa gtgatgcttc aacttttcta t
88511815759DNAArtificial SequenceDescription of artificial sequence note
= synthetic construct 18caacttccta acgacgaggt agttcttgga tatgtaatac
gggagagcca tccacttctt 60tcactggtct gctaagtaga gaggatggcc gacagcgaag
gaggatccga gcaggacgat 120gtttcgttcc tgaggacgga ggatatggtg tgcctatcat
gcacagcaac tggagagaga 180gtttgcttag cagctgaggg ctttggtaac cgtcactgtt
ttctagaaaa tattgctgat 240aagaatatac caccagatct ttcaacatgt gtatttgtta
ttgaacaagc tctatcagta 300agagcacttc aggagttagt tacagcagct ggatctgaag
agggaaaggg aactggatct 360ggtcacagga ctcttcttta tggaaatgct atactactcc
ggcaccaaaa cagtgacatg 420tatctggctt gtttatctac cagttcatca aatgacaagc
tctcatttga tgttggttta 480caagaacatt cccaagggga agcttgttgg tggaccgtac
accctgcttc taaacagaga 540tcagaaggtg aaaaagtgag agttggtgat gatttaattc
ttgtgtctgt agccactgaa 600agatatttgc atactgctaa agaaaacgat caatctattg
taaatgcatc tttccatgta 660actcattggt ctgttcagcc ttatggaact ggtatcagca
aaatgaagta tgttggttat 720gtgttcggag gagatgtgtt aagatttttc catggtgggg
atgaatgcct taccattcca 780tcaacttgga gtgaaacccc tggacaaaat gtggtagttt
atgaaggagg gagtgttttg 840agtcaagctc gttcactttg gagattggaa ctggctagga
caaaatggtc tggtggtttc 900attaattggt atcatccaat gaggatacga catctcacca
ctggtagata cttaggagtt 960aatgaaaata atgaattaca cctcgttgtt agggaggaag
ccacaacagc attatctaca 1020ttcattttaa gacaagaaaa agatgaccaa aaagtagtaa
tggaagataa ggatttagaa 1080gtaataggag ctccaataat aaaatatggt gacagtactg
ttttagtcca acattcagaa 1140agtggtttat ggttaactta taagtcattc gaaactaaga
aaaaaggtgt gggtaaagta 1200gaagaaaaac aagctgtact tcatgaggag ggaaaaatgg
atgatggatt agactttagt 1260agaagtcaag aagaagaatc aaggactgct agagtaataa
ggaaatgttc gtcacttttc 1320actcaattta ttaggggtct agaaactctg caaatgaatc
gaagacattc tctgttttgc 1380gctagtgtaa atttaaatga aatggtcatg tgtttagaag
atttaattaa ttactttgcc 1440cagcctgagg aagatatgga acatgaggaa aaacaaaacc
ggttaagagc tttgagaaac 1500agacaagatt tgttccaaga agaaggaatt ttaaatctta
tcttagaagc cattgataaa 1560attaatgtta taacatccca aggtttctta gtcagtttag
ctggagatga gtctggacag 1620agctgggata taatctcagg atatttgtat caactgctag
ctgccatcat aaaaggaaat 1680catactaatt gtgctcagtt tgctaacaca aatagattaa
actggttatt tagcagacta 1740ggttctcaag cttcaagtga gggcacaggt atgttggatg
tacttcattg cgtcttaatt 1800gattctccag aagctttgaa tatgatgaga gatgaacata
taaaagtaat catttcactg 1860ctagaaaaac atgggcgaga tccaagagtt ttagatgtac
tttgttcact ttgtgttggt 1920aatggtgtag cagtccgtag ctcacaaaac aacatctgtg
atttccttct gccaggaaaa 1980aacttgcttc tacaaacgca acttgtggat catgttgcca
gtgtcaggcc aaatattttt 2040gtgggtcgag tcgaaggttc tgctgtttat caaaaatggt
attttgaagt gactttagat 2100catatggagc aaaccaccca tatgacaccg catctaagaa
ttggctgggc taacacttct 2160ggttatgttc cctttcctgg cggtggtgaa aaatggggcg
gtaatggagt tggtgatgat 2220ctctactctt ttggttttga tggagctgca ttatggacag
gtggaagaaa aactgtagtc 2280cttcctcatg ctatggaacc ttacataaga aagggagatg
ttattggttg tgctttcgat 2340ctgactgttc caattattac atttactttt aatggaacat
taatccgagg atcatttagg 2400gattttaatc ttcaaggaat gttctttcca gttataagct
gttcctcaaa acttagttgt 2460cgttttttac tgggaggtga tcatggaaga ttaaaatatg
cacctcctga agaattttct 2520cctctcgttg aaagtttgct tcctcaacaa gtgctttcta
ttgatccatg tttttatttt 2580ggcaacctga ataaatgtgt attggctggt ccttatcctg
ttgaagatga ttgtgctttt 2640gttccagttc cagttgacac atctatggta aatttacccg
ttcatgttga tacaatacgc 2700gatcgtttag ctgaaaacat ccatgaaatg tgggctatga
ataaaattga agcaggatgg 2760atttatggag atgtaagaga tgatataaga agaatacatc
catgtcttgt gcaatttgaa 2820aaactacctc ctgcagaaaa gcgatatgac actcaacttg
ctgtacaaac tttaaaaacc 2880atcattgcac tgggctacca tataacaatg gaaaaaccac
catctagaat aaagaacatt 2940cgtttgccga atgaaccatt tttacaatct aatggttaca
agccagctcc tcttgatctc 3000agtgccataa cactaatacc taaaatggag gaacttgttg
accaactcgc tgaaaatact 3060cacaacttgt gggcaaaaga aagaatccaa caaggctgga
cctatggtct taatgaggat 3120cctgatttgt cccgaagtcc tcacctcgtc ccttacagta
aagttgatga tttaattaaa 3180aaagccaaca gggataccgc aagtgaaact gtcaggactc
ttcttgttta tggttataat 3240ttagaccctc ctacaggtga acaaactgaa gctctcttag
cagaagcaag ccgtttgaag 3300cagatgcagt ttagaaccta tcgggctgaa aagacatatg
cagtaaccag tggcaaatgg 3360tattttgaat ttgaaattct tactgctggg ccaatgagag
taggttgggc cattgctgat 3420tataatccag gttcccagat cggaagtgat gaagcatcct
gggcatatga tggttataat 3480gaggaaaagg tttattctgg ggttgctgaa acgtttggaa
gacaatggca agttggagac 3540gttgtaggag tttttcttga tctattggat catactatta
gtttctctct aaatggtgaa 3600ctgcttatgg atgcacttgg gggagaaaca tcttttgcag
atgttcaggg agaaggattt 3660gttccagcat ttacacttgg agtaggacaa aaagcaaaat
tagtgtttgg gcaagatgtt 3720aactcactta agttctttac tacctgtggt ttgcaagaag
gttatgaacc tttctgtgta 3780aacatgaaca gggcagttac cttttggtac accaaagatc
atcctatatt tgaaaatact 3840gatgattata ttgatactaa aattgatgca acgcgtattc
ctgctggttc tgacacacca 3900ccatgtctta aaattagtca taatactttt gagacaatgg
agaaagccaa ttgggaattt 3960cttagacttt ctttacctgt tcaatgttta ccatcattca
taaatgaaca agaaaaagta 4020cgtaggtggc aagaaataag gataagacaa cacagacttc
ttgtggaagc tgaccaaacc 4080actcctgctc acattgaaca gattatgaag tctggtttta
gtatgagtga tattaagggt 4140cttcaaagaa gttatacaga agatggaatg gaaggagaag
aaggattggc accaagctca 4200tcaccactta caaggactaa gtcaaaagtg actccagctc
gtccacctag gaaaggctcc 4260ttaccacgaa atggagatgt tattaatatg aacgggacat
tagaaccagg tggaggaaaa 4320atgaaccgtt ctaatagtga gcttgatttc caacgtttca
atggtgaaat gcccgatggc 4380gataacaaga aaaagcgtgg gagatctcca tttaggttct
tttcaagaaa aaagggggag 4440cgtgatacta gtggagaaaa tgcaaaaaat gtacatatgt
ctgagcctat gggtaatttc 4500cttgagcctc caaggactcc aatgcagcaa agaggtggaa
gtgctctgcg ttcttctcct 4560caacctaaag tacaggagtt aactaagcca ccatccccat
tagttgaaag aagtggaccc 4620aaagcaatgt ctgtgcctgt tggaactggc atcgaaacta
ttggaaatga aatatttgat 4680gtagagtgtt tgaaattgat taatgaatac ttctacggtg
tcaggatatt tccaggtcaa 4740gacccaactc atgtatatgt cggttgggtt acaactcaat
tccatctacg tagtaaagac 4800tttaatcaga atcgagtgct aaagagcact gtagtagtat
gtgatgaatt caatcgtgta 4860atagacagta ttcagcggca gagttgtttt atggtaagag
ctgatgaatt atacaatcaa 4920gtaactcagg atgcctctgg taaaggtgct tcacaaggaa
tgtttattgg atgtttcctg 4980gatactgcta ctggttatgt gacgttcaca tgtgaaggaa
aagaaactaa ccacaagtat 5040aagatggaac ctgatacaaa attatttcca gctatatttg
ttgaagctac aagcaaagaa 5100attctacaaa ttgagcttgg tcgtacatca actacactgc
ctttatcagc agctgttctc 5160caaaattcag aaagacatgt cattcctcag tttccaccaa
gacttaaagt tcagtgtcta 5220aaaccacatc agtgggcacg tgttcctaat atttcattgc
atgtccacgc tctgaaatta 5280tcagatataa gaggttggag tatgctttgt gaagatccag
tttcaatgtt agcattacat 5340atacctgaag aagatagatg tattgatatt ttagaactta
ttgaaatgga caaactactt 5400tcattccatg ctcatacatt gacactttat gcagcactat
gttaccaatc caattatcgt 5460gcaggacatg ttctctgcaa acatgtagac caaaagcaac
ttcagtatgc tattaggtct 5520gaattcatat ctggatcttt acgcttggga ttttatgacc
tcttgattgc tttacacatt 5580gaatcacatg caacaacaat ggaagtttgt aaaaatgaat
tcataatacc ccttggtcta 5640gacttgaaag atttatatga agatccagat atgaagcaca
gcttacgatc tttaaaaact 5700gtctctattt tacctcaaat gagtatgaca gacattacgg
aaaatattga aagcatcaat 5760acattatata gtccttattt tcctcttgat gcagttaagg
attatggaat gactgcatta 5820gaagaggctg taagcatgaa tcaacttcac aatagagacc
ctgtaggtgg ttcaaatgaa 5880aacttgtttc tacccttgtt gaaactggta gatagattat
tgcttgttgg gatactacga 5940gatgaagatg ttacaaagct actaattatg tttgatcctg
aaacttggga ttcaaatttt 6000gaaaaggatg gcaaagatga acatcgtaag ggtttacttc
aaatgaaaat ggcagagggg 6060gcaaaactac agatgtgcta tctcttacag catttatgcg
atatacaatt gcggcatcgg 6120gttgaagcca ttattaattt tagttatgac tatattgctg
atcttcagca ggatcagttg 6180agaagatatg ttgatattaa gcagtctgat cttccatcat
cagttgctgc aagaaaaaca 6240agagagtttc gttgccctcc aagagaacag atgaatgcta
tcataaattt taaaaattta 6300gaagaagatg acaaagaaaa ctgtccatgt ggtgaagaac
tgagggagag attaaacaca 6360tttcatgaag aaactatgag taaagtttca cttgttgctc
tccaagagcc acaagaagat 6420gagaacggtg aaacaccaga aaagccgggt gttttcaaaa
aattatacaa ttttattaat 6480gctgttaaag aattggaaga acctcctaaa atagaagaag
aacctgttaa gaaaactcct 6540gaagaaatat ttagaaaagt attaattagt acaattgtta
gatgggctga agaatcccag 6600attgaaacac caaaattagt cagagaaatg ttcagtctat
tggtaaggca gtacgacact 6660gtaggtgaat taatcagatc tcttggaaac acttatgtga
taaatgacaa aacgaaagaa 6720gatgtagctc agatgtgggt agggttgagc cagatcagag
ctctcctacc tgttcaaatg 6780tctcaagatg aagaaggtct tatgcgaatg aggctatgga
aattagttaa caatcacaca 6840ttctttcaac atcctgattt gattagagtt cttcgtgttc
atgaaaatgt tatggctgtt 6900atgatcaata ccttgggtag aagatcacaa gcacaatctg
atgcttctca agctggtcaa 6960gaaggtgaac ctgcagctaa ggagaaagat acgtcccatg
aaatggtggt agcatgttgt 7020cgtttcctgt gttatttttg cagaacttca cgtcaaaatc
agaaagcaat gtttgaccat 7080ttaacatttt tattagaaaa cagtaatatt ttactttcaa
gaccttcact tagaggaagt 7140acccctcttg atgttgccta ttcctctctc atggaaaata
ccgaactggc attagctctt 7200agagaacatt atttagagaa gatagctgtt tacttgtctc
gctgtggatt acaatctaat 7260tcagaattgg tagaaaaggg ttaccctgat ttgggttggg
atccagttga gggagaaaga 7320tatttagact ttttacgctt ctgtgtttgg gttaacggtg
aaagtgttga agaaaatgca 7380aatctggtta tacggctcct tatacgtcga ccagaatgtt
tgggtcctgc acttcgtgga 7440gaaggtgaag gattactgag agcaattata gatgctaata
agatgtctga aagaatttca 7500gatcgcagaa aaatgatgga ggaacctgaa aattctgccc
atcatcagtt tgaacatcca 7560cttcctgagt ctgatgaaga tgaggactat attgatacag
gagcagcaat actggcattc 7620tattgtactc tggtcgatct tttaggtcgc tgtgctccag
atgctagtgt gattgctcag 7680ggaaagaatg agtctcttag agctagagct attttgagat
ctttagtacc tcttgaagat 7740ttatttggtg tcttgagttt aaagtttaca cttaccaatc
cagctattgg agaagaaagg 7800ccaaaaagtg atataccatc tggtctaata ccatctcata
agcaaagtat tgttttattt 7860ttagagagag tatatggtat tgaacagcaa gatctcttct
tcagattact cgaggaagca 7920tttttacctg atttaagagc agcaactatg ctagatagaa
ctgatggttc tgaatcagaa 7980atggcattag ctatgaatcg ctatattgga aattctattc
tccctttgtt gataaagcat 8040taccagtttt atagtggtgc agataactat gcaagtcttt
tagatgctac acttcataca 8100gtgtatcgcc tatcaaaaaa tcgaatgcta actaaaggtc
agcgagaggc agtatcagat 8160tttttggttg ctctcacaag tcaattacag ccaagcatgt
tactcaaact tcttcgaaag 8220ttaacagttg atgtatcaaa gctttctgag tataccacag
tcgctttaag gttgcttact 8280ttacactatg agcgttgtgc aaaatattat ggaactactg
gaggacaagc tggtggatct 8340agtgatgaag aaaaaaggct cactatgtta ctcttcagta
atatttttga ttctttatca 8400aaaatggatt atgatcctga attatttgga aaagcgcttc
cctgcttgag tgctatagga 8460tgtgcacttc cacccgatta ttcactgtcc aagaattatg
atgaagaatg gtatagctca 8520aagggttcag aaccgactga tgggccttat aatccactgc
ccatcaatac ttctatggtt 8580tctctaaata atgatttaaa cacaattgtt caaaaatttt
ctgaacatta tcatgatgca 8640tgggctagtc gaaaaatgga aaatggttgg gtatatggcg
aacagtggtc tgacagctct 8700aaaactcatc ctcgtttaaa accttataca ttgcttaatg
attatgaaaa agagagatac 8760aaagaaccgg ttagagagtc attgaaagct ctgttagcta
taggatggaa tgtagagcat 8820actgaagttg atattccttc taataacaga ggatcatcag
tcagaagatc ttctaaagca 8880aatacatctg atggttcaac accatttaat tatcatccca
acccaattga tatgactaat 8940ttaacattga gtagagaaat gcaaaatatg gcagagaggt
tagctgaaaa ctcacatgat 9000atttgggcaa aaaagaagaa agaagaactt gtttcatgtg
gtggtggtat acacccacag 9060cttgttccat atgatctttt aacagacaaa gagaagagga
aagatagaga aagatctcaa 9120gaatttttga aatatttaca atatcaagga tacaaactcc
acaggcctac tcgaggaagt 9180gctgatgagc aacaggccgc tgcagctgct gccacaggag
agtccagatt tgcttacagt 9240ctactcgaga aacttataca atatactgat aaagcttcta
ttaatatgaa actactaaag 9300ccttctggta cattcagtag acgctccagt tttaaaactt
gttcaagaga cataaaattc 9360ttttccaaag tggtattgct attggttgag aagtatttca
gcactcacag aaattacttc 9420attgctgttg ccactgcttc taataatgta ggagcagcct
ctttaaaaga aaaagaaatg 9480gttgccagtt tgttctgtaa gctggcaaat ttaattcgaa
caaagctggc tgcttttggt 9540gcagatgttc gaattactgt ccgttgtcta caagtgctag
tgaaagctat agatgccaag 9600tcattggtaa agaattgtcc tgaatttata aggacttcaa
tgctgacatt tttcaataat 9660acagctgatg acttaggcca aactattcag tgtttgcaag
agggtcgtta cagtcacctt 9720agaggcactc atcttaaaac atctacttct ttattttata
taaatgatgt tgtactacct 9780gttctcactt ctatgtttga tcatttggct gtgtgtgatt
atggtagcga cttgttactt 9840gatgaaattc aagtggcctc atatagaatg ttgggtagtt
tatataattt aggaattgat 9900ccaactttaa ctcatgacag aaaatattta aaaacagaaa
ttgaaaggca taggcctgcc 9960attggtgctt gtcttggtgc attttcatca acatttccag
tcgcttatct tgaaccccat 10020ttaaataaac ataatcagtt ttcattagtt aatagaattg
ctgaacattc tcttgaagca 10080caggatattc tagctagaat ggaaaacacc atgcctacat
tggatgcgat cctttctgaa 10140gttgatcagt tcattgaatc cgaaaagagt catacttcag
caccacatgt tattgatgtg 10200attttgcctc tgctttgtgc ttatttgcca agttggtgga
gtcaaggtcc tgataatgtc 10260agtctcacag cagggaatta tgtaacaatg gttactagtg
atcatatgaa tcaactccta 10320aaaaatgtac taaaattaat caaaaataat attggaaatg
aaaatgctcc ctggatgacg 10380agaatagcag cttacaccca gcagatcatc ataaactctt
ctgaagaact gttgaaagat 10440ccattccttc cattaacaca agttgttaag aagaggatag
acaatatgtt tcaccgtgaa 10500gaatctcttc gaggatttct aaaatcttca actgaagata
cctctcaagt tgaagcagaa 10560attcaggagg gctggcatct tattgttaga gatatatatt
ctttttatcc actactaatt 10620aaatatgttg atttacaaag aaatcactgg ttacgtaata
atattccgga agctgaatac 10680ttgtatactc atgttgctga tatatttaat atttggtcta
aatcacagta ctttctaaaa 10740gaagaacaga atttcatatc tgccaacgaa atagacaata
tggctctaat tatgcccact 10800gcaactagga gatctgcagt tgttttggat ggaacagctc
ctgctggagg tggaaagaag 10860aaaaagaagc atcgtgataa gaaaagagat aagaataaag
aaatccaagc aagcttaatg 10920gtagcttgct taaaacgttt attaccagtt ggtcttaacc
tattcgctgg aagagaacaa 10980gagttagttc agcattgtaa agacagatat ttgaagaaaa
tgccagaata tgaaatagtg 11040gattttgcca aaatccaatt aactcttcct gacaagatag
atcctggaga tgagatgtct 11100tggcagcatt atttgtactc aaaactggga aataaaaaag
atatcagctc tgaaaaacca 11160cagcaaatcg atgaggtagt tgataggatt gtggctatgg
caaaagttct ttttgggctt 11220catatgattg atcatccaca actacagagc aagacacaat
acagatctgt tgtatccaca 11280cagagaaagc gtgctgtcat agcttgtttc cggcaactat
cactacatgc cttaccaagc 11340atgcaaataa acctccacct caccaatctg gatggaaaag
agttctttca gcagcgagaa 11400aacgggctgc tattgcttgt cttagaactc aacctttgta
tacccttcca aggcatcgag 11460taattaacat atttgctcgc gcttattgtg agctgtggct
gcaagaagag aatgttggtc 11520aagaaatcat gattgaagat cttacacaaa cttttgaaga
tgctgaattg aaaaaaagag 11580attctgaaga agatgaaagc aaacctgatc cacttaccca
attagttaca acattttgtc 11640ggggtgcaat gactgaaagg agtggagctt tgcaagaaga
cccactttat atgtcctatg 11700cagaaattac tgcaaaatca tgtggagaag aagaagaaga
aggtggagat gaggaagaag 11760gtggagacga agaaggaggg gcatctatcc atgaacaaga
aatggaaaaa cagaaactct 11820tattccatca agctcggcta gccaacagag gtgttgcaga
aatggtattg ttacatattt 11880cagcttgtaa aggtgttccc agtgaaatgg ttatgaaaac
tctccagctg ggtatttctg 11940ttttacgtgg tggtaatctt gatattcaaa tgggtatgct
aaatcatttg aaagaaaaaa 12000aggatgttgg attttttact tctatagctg gcttgatgaa
ctcctgcagt gtgttggatt 12060tagatgcatt tgaaagaaac acaaaagctg aaggcttagg
agttggttca gaaggtgctg 12120ctggtgaaaa gaacatgcat gatgctgaat tcacctgtac
tcttttcaga tttattcaac 12180ttacctgtga agggcataac ttagaatggc agaattatct
tagaacccaa gctggaaata 12240caacaacagt taatgttgtt atttgtactg ttgattacct
tttgagatta caggaatcaa 12300ttatggactt ctattggcac tattcgagta aagaattaat
tgatcctgct ggaaaagcca 12360actttttcaa agcaattggt gtggctagtc aagtatttaa
tacactctct gaagtaattc 12420aagggccttg cccacaaaat caacaagctc tggctcattc
aagattgtgg gatgctgttg 12480gaggattttt gtttcttttc tctcatatgc aagataagct
atcaaaacat tctagtcaag 12540tagacttact gaaagaactt ttgaatttac agaaagatat
gataacaatg atgctatcaa 12600tgttggaagg taatgttgtg aatggtacta ttggaaaaca
gatggtagac acattagttg 12660aatctgcctc aaatgtggaa ttgattttga agtacttcga
catgtttttg aaattgaaag 12720atttgacatc ctctgctagc ttcttggaac ttgatccaaa
ccatgaaggc tgggtaacac 12780ctaaagattt taaagaaaaa atggaacagc agaaaagtta
tactccagaa gaaatagact 12840tcatgttaca gtgctgtgaa accaatcatg acggtaaaat
tgactatgtt ggcttcacgg 12900atagattcca tgagccggcc aaggaaattg gttttaacct
agctgttctt ctcacaaatt 12960tatctgagca tatgccaaat gaaccgagac ttgctcgctt
tttagaaaca gctggtagtg 13020ttcttaacta ctttgaacct ttcctgggac gaattgaaat
attaggtagt agtaaacgaa 13080tcgagcgtgt atatttcgag attaaagaat caaatattga
acagtgggaa aaacctcaaa 13140tcaaggaatc taaacgagca tttttctatt caattgtcac
tgaaggaggt gacaaagaaa 13200aattggaagc ttttgttaat ttttgtgaag atgccatatt
tgagatgaca catgccagtg 13260ggcttatggc aactgatgat ggtacaggct ctggaggagg
aaaacaaaga gcatcctctt 13320attcttatat ggaagatgaa gatgaagaaa ggaatccaat
cagacgtggt tggcaagcaa 13380ctaaagatgg aatttacttt atgttctcaa tgttatctcc
tagcaatatt aaacataaaa 13440ttattgaaat gcaacaaatg tcaattattg aactaatgat
tggttttata aaactatttt 13500tctacatgtt ttattactca ggatattctg tatcagttgt
actgaagtat attggtggta 13560ttatattttc attgatgagg ggaccacaaa ttgaagagcc
agttgtagaa gttaaagagg 13620aagaaaaatc tggacctctg aggataatgc ctgctttgcc
accacctgaa gatagctctc 13680tgcttccatc tgatgggtca agagacatga aaaaagaaga
cagtcagcct ccatcaaaag 13740tcatagaagg ggctattccc atagaagaag gaggtgagag
gagctcagag gaacatgcgg 13800gagaccatgt aaaaccagaa aatgaagagc aacctccaac
accaacactt gctgatatat 13860tgggtggaga agcagcaaga aaagaagcag cacaaagagc
agaagtcgct gctgaacaag 13920aagcagttat ggctgctttt gaggcagaat ctaaaataga
aaaagtttca gagccttctg 13980ctgtctctca aattgatttt aacaagtata ctcaccgggc
tgtcagtttc cttgctcgta 14040atttctataa tcttaaatat gtagcattgg ttttggcttt
ctgcattaac tttattttat 14100tgttctacaa ggtaacaaca ttgggtgaag atgatgatgc
tgctagcgga gaagggagtg 14160ttgaacaact aatggaagaa ttaacaggcg aaggtgatga
tgtgagtggc ggaggaagta 14220gtggtggaga aagtggtgaa gaggatccaa ttgaaatggt
tcatgtggat gaggatttct 14280tttatatggc acatgttatg cgattggctg caatcctaca
ttctcttgtt tctttagcta 14340tgttgattgc atattatcat ttgaaggtcc ctctagctat
attcaagaga gaaaaagaaa 14400tagctcgtcg acttgagttt gatggtttgt acattgctga
gcaaccagaa gatgatgata 14460ttaaatcaca ttgggataaa ctggttatct gtgcaaaatc
atttcctgtt aattactggg 14520ataaatttgt gaagaaaaag gttcgacaga aatacagtga
aacttatgac tttgattcaa 14580taagtaatct tttgggaatg gaaaaaacat ctttcagtgc
ccaagatact gaagaaggat 14640cgggacttat tcattacatt ttgaactttg actggaggta
tcagctttgg aaagcaggag 14700tcacaatcac agataatgca tttttgtaca gtttattata
cttcatcttt tcaattttgg 14760gaaacttcaa taactttttc tttgctgccc atttacttga
tgttgcagtt ggttttaaaa 14820cattgaggac tattttgcaa tcagtcacac acaatggaaa
acagcttgta ttgactgtaa 14880tgctgctaac catcatagta tacatctata ctgtcattgc
tttcaacttc ttccgaaaat 14940tttatgtcca agaagaggat gaggaagtgg ataaaaaatg
ccacgatatg ttaacttgtt 15000ttgtattcca cctttacaaa ggagttagag ctggtggtgg
tattggtgat gagattgaac 15060ctcctgatgg tgatgattat gaagtttaca ggataatgtt
tgatattacg tttttctttt 15120ttgttattgt catcttgcta gccatcattc aaggtttgat
cattgatgca tttggtgaat 15180tgagagatca gttagaaagt gtaaaagaag acatggaatc
taactgcttc atttgtggga 15240taggaaaaga ttattttgat aaagttcccc atggttttga
cactcatgtt caacaagaac 15300ataacttggc taattacatg ttctttctta tgcatctgat
taacaagcca gatactgaat 15360acacaggtca agaaacctat gtctggaaca tgtatcagca
acgttgttgg gatttcttcc 15420cagttggtga ctgttttcgt aaacagtatg aagatgaact
gggaggtggt ggtggttaat 15480tcatttgggt gggtggtggc taaatttata ttattaaaac
aaaattaatg ctgggaacta 15540tcaaacatcc ttcaatttta ttaaaatttc agctaaattc
aacaatatat cttatgatat 15600tgtatttgtc taatgaagga atagaactat cgtgttatga
atcagtgaag ttttcacttg 15660tttagcataa tttatgctaa gtttactatt gcaaaatact
ttctttatat ccgaaaatgt 15720tgtaaaataa atgtaaatgg tgtggcctta aatataatg
157591915667DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 19caacttccta
acgacgaggt agttcttgga tatgtaatac gggagagcca tccacttctt 60tcactggtct
gctaagtaga gaggatggcc gacagcgaag gaggatccga gcaggacgat 120gtttcgttcc
tgaggacgga ggatatggtg tgcctatcat gcacagcaac tggagagaga 180gtttgcttag
cagctgaggg ctttggtaac cgtcactgtt ttctagaaaa tattgctgat 240aagaatatac
caccagatct ttcaacatgt gtatttgtta ttgaacaagc tctatcagta 300agagcacttc
aggagttagt tacagcagct ggatctgaag agggaaaggg aactggatct 360ggtcacagga
ctcttcttta tggaaatgct atactactcc ggcaccaaaa cagtgacatg 420tatctggctt
gtttatctac cagttcatca aatgacaagc tctcatttga tgttggttta 480caagaacatt
cccaagggga agcttgttgg tggaccgtac accctgcttc taaacagaga 540tcagaaggtg
aaaaagtgag agttggtgat gatttaattc ttgtgtctgt agccactgaa 600agatatttgc
atactgctaa agaaaacgat caatctattg taaatgcatc tttccatgta 660actcattggt
ctgttcagcc ttatggaact ggtatcagca aaatgaagta tgttggttat 720gtgttcggag
gagatgtgtt aagatttttc catggtgggg atgaatgcct taccattcca 780tcaacttgga
gtgaaacccc tggacaaaat gtggtagttt atgaaggagg gagtgttttg 840agtcaagctc
gttcactttg gagattggaa ctggctagga caaaatggtc tggtggtttc 900attaattggt
atcatccaat gaggatacga catctcacca ctggtagata cttaggagtt 960aatgaaaata
atgaattaca cctcgttgtt agggaggaag ccacaacagc attatctaca 1020ttcattttaa
gacaagaaaa agatgaccaa aaagtagtaa tggaagataa ggatttagaa 1080gtaataggag
ctccaataat aaaatatggt gacagtactg ttttagtcca acattcagaa 1140agtggtttat
ggttaactta taagtcattc gaaactaaga aaaaaggtgt gggtaaagta 1200gaagaaaaac
aagctgtact tcatgaggag ggaaaaatgg atgatggatt agactttagt 1260agaagtcaag
aagaagaatc aaggactgct agagtaataa ggaaatgttc gtcacttttc 1320actcaattta
ttaggggtct agaaactctg caaatgaatc gaagacattc tctgttttgc 1380gctagtgtaa
atttaaatga aatggtcatg tgtttagaag atttaattaa ttactttgcc 1440cagcctgagg
aagatatgga acatgaggaa aaacaaaacc ggttaagagc tttgagaaac 1500agacaagatt
tgttccaaga agaaggaatt ttaaatctta tcttagaagc cattgataaa 1560attaatgtta
taacatccca aggtttctta gtcagtttag ctggagatga gtctggacag 1620agctgggata
taatctcagg atatttgtat caactgctag ctgccatcat aaaaggaaat 1680catactaatt
gtgctcagtt tgctaacaca aatagattaa actggttatt tagcagacta 1740ggttctcaag
cttcaagtga gggcacaggt atgttggatg tacttcattg cgtcttaatt 1800gattctccag
aagctttgaa tatgatgaga gatgaacata taaaagtaat catttcactg 1860ctagaaaaac
atgggcgaga tccaagagtt ttagatgtac tttgttcact ttgtgttggt 1920aatggtgtag
cagtccgtag ctcacaaaac aacatctgtg atttccttct gccaggaaaa 1980aacttgcttc
tacaaacgca acttgtggat catgttgcca gtgtcaggcc aaatattttt 2040gtgggtcgag
tcgaaggttc tgctgtttat caaaaatggt attttgaagt gactttagat 2100catatggagc
aaaccaccca tatgacaccg catctaagaa ttggctgggc taacacttct 2160ggttatgttc
cctttcctgg cggtggtgaa aaatggggcg gtaatggagt tggtgatgat 2220ctctactctt
ttggttttga tggagctgca ttatggacag gtggaagaaa aactgtagtc 2280cttcctcatg
ctatggaacc ttacataaga aagggagatg ttattggttg tgctttcgat 2340ctgactgttc
caattattac atttactttt aatggaacat taatccgagg atcatttagg 2400gattttaatc
ttcaaggaat gttctttcca gttataagct gttcctcaaa acttagttgt 2460cgttttttac
tgggaggtga tcatggaaga ttaaaatatg cacctcctga agaattttct 2520cctctcgttg
aaagtttgct tcctcaacaa gtgctttcta ttgatccatg tttttatttt 2580ggcaacctga
ataaatgtgt attggctggt ccttatcctg ttgaagatga ttgtgctttt 2640gttccagttc
cagttgacac atctatggta aatttacccg ttcatgttga tacaatacgc 2700gatcgtttag
ctgaaaacat ccatgaaatg tgggctatga ataaaattga agcaggatgg 2760atttatggag
atgtaagaga tgatataaga agaatacatc catgtcttgt gcaatttgaa 2820aaactacctc
ctgcagaaaa gcgatatgac actcaacttg ctgtacaaac tttaaaaacc 2880atcattgcac
tgggctacca tataacaatg gaaaaaccac catctagaat aaagaacatt 2940cgtttgccga
atgaaccatt tttacaatct aatggttaca agccagctcc tcttgatctc 3000agtgccataa
cactaatacc taaaatggag gaacttgttg accaactcgc tgaaaatact 3060cacaacttgt
gggcaaaaga aagaatccaa caaggctgga cctatggtct taatgaggat 3120cctgatttgt
cccgaagtcc tcacctcgtc ccttacagta aagttgatga tttaattaaa 3180aaagccaaca
gggataccgc aagtgaaact gtcaggactc ttcttgttta tggttataat 3240ttagaccctc
ctacaggtga acaaactgaa gctctcttag cagaagcaag ccgtttgaag 3300cagatgcagt
ttagaaccta tcgggctgaa aagacatatg cagtaaccag tggcaaatgg 3360tattttgaat
ttgaaattct tactgctggg ccaatgagag taggttgggc cattgctgat 3420tataatccag
gttcccagat cggaagtgat gaagcatcct gggcatatga tggttataat 3480gaggaaaagg
tttattctgg ggttgctgaa acgtttggaa gacaatggca agttggagac 3540gttgtaggag
tttttcttga tctattggat catactatta gtttctctct aaatggtgaa 3600ctgcttatgg
atgcacttgg gggagaaaca tcttttgcag atgttcaggg agaaggattt 3660gttccagcat
ttacacttgg agtaggacaa aaagcaaaat tagtgtttgg gcaagatgtt 3720aactcactta
agttctttac tacctgtggt ttgcaagaag gttatgaacc tttctgtgta 3780aacatgaaca
gggcagttac cttttggtac accaaagatc atcctatatt tgaaaatact 3840gatgattata
ttgatactaa aattgatgca acgcgtattc ctgctggttc tgacacacca 3900ccatgtctta
aaattagtca taatactttt gagacaatgg agaaagccaa ttgggaattt 3960cttagacttt
ctttacctgt tcaatgttta ccatcattca taaatgaaca agaaaaagta 4020cgtaggtggc
aagaaataag gataagacaa cacagacttc ttgtggaagc tgaccaaacc 4080actcctgctc
acattgaaca gattatgaag tctggtttta gtatgagtga tattaagggt 4140cttcaaagaa
gttatacaga agatggaatg gaaggagaag aaggattggc accaagctca 4200tcaccactta
caaggactaa gtcaaaagtg actccagctc gtccacctag gaaaggctcc 4260ttaccacgaa
atggagatgt tattaatatg aacgggacat tagaaccagg tggaggaaaa 4320atgaaccgtt
ctaatagtga gcttgatttc caacgtttca atggtgaaat gcccgatggc 4380gataacaaga
aaaagcgtgg gagatctcca tttaggttct tttcaagaaa aaagggggag 4440cgtgatacta
gtggagaaaa tgcaaaaaat gtacatatgt ctgagcctat gggtaatttc 4500cttgagcctc
caaggactcc aatgcagcaa agaggtggaa gtgctctgcg ttcttctcct 4560caacctaaag
tacaggagtt aactaagcca ccatccccat tagttgaaag aagtggaccc 4620aaagcaatgt
ctgtgcctgt tggaactggc atcgaaacta ttggaaatga aatatttgat 4680gtagagtgtt
tgaaattgat taatgaatac ttctacggtg tcaggatatt tccaggtcaa 4740gacccaactc
atgtatatgt cggttgggtt acaactcaat tccatctacg tagtaaagac 4800tttaatcaga
atcgagtgct aaagagcact gtagtagtat gtgatgaatt caatcgtgta 4860atagacagta
ttcagcggca gagttgtttt atggtaagag ctgatgaatt atacaatcaa 4920gtaactcagg
atgcctctgg taaaggtgct tcacaaggaa tgtttattgg atgtttcctg 4980gatactgcta
ctggttatgt gacgttcaca tgtgaaggaa aagaaactaa ccacaagtat 5040aagatggaac
ctgatacaaa attatttcca gctatatttg ttgaagctac aagcaaagaa 5100attctacaaa
ttgagcttgg tcgtacatca actacactgc ctttatcagc agctgttctc 5160caaaattcag
aaagacatgt cattcctcag tttccaccaa gacttaaagt tcagtgtcta 5220aaaccacatc
agtgggcacg tgttcctaat atttcattgc atgtccacgc tctgaaatta 5280tcagatataa
gaggttggag tatgctttgt gaagatccag tttcaatgtt agcattacat 5340atacctgaag
aagatagatg tattgatatt ttagaactta ttgaaatgga caaactactt 5400tcattccatg
ctcatacatt gacactttat gcagcactat gttaccaatc caattatcgt 5460gcaggacatg
ttctctgcaa acatgtagac caaaagcaac ttcagtatgc tattaggtct 5520gaattcatat
ctggatcttt acgcttggga ttttatgacc tcttgattgc tttacacatt 5580gaatcacatg
caacaacaat ggaagtttgt aaaaatgaat tcataatacc ccttggtcta 5640gacttgaaag
atttatatga agatccagat atgaagcaca gcttacgatc tttaaaaact 5700gtctctattt
tacctcaaat gagtatgaca gacattacgg aaaatattga aagcatcaat 5760acattatata
gtccttattt tcctcttgat gcagttaagg attatggaat gactgcatta 5820gaagaggctg
taagcatgaa tcaacttcac aatagagacc ctgtaggtgg ttcaaatgaa 5880aacttgtttc
tacccttgtt gaaactggta gatagattat tgcttgttgg gatactacga 5940gatgaagatg
ttacaaagct actaattatg tttgatcctg aaacttggga ttcaaatttt 6000gaaaaggatg
gcaaagatga acatcgtaag ggtttacttc aaatgaaaat ggcagagggg 6060gcaaaactac
agatgtgcta tctcttacag catttatgcg atatacaatt gcggcatcgg 6120gttgaagcca
ttattaattt tagttatgac tatattgctg atcttcagca ggatcagttg 6180agaagatatg
ttgatattaa gcagtctgat cttccatcat cagttgctgc aagaaaaaca 6240agagagtttc
gttgccctcc aagagaacag atgaatgcta tcataaattt taaaaattta 6300gaagaagatg
acaaagaaaa ctgtccatgt ggtgaagaac tgagggagag attaaacaca 6360tttcatgaag
aaactatgag taaagtttca cttgttgctc tccaagagcc acaagaagat 6420gagaacggtg
aaacaccaga aaagccgggt gttttcaaaa aattatacaa ttttattaat 6480gctgttaaag
aattggaaga acctcctaaa atagaagaag aacctgttaa gaaaactcct 6540gaagaaatat
ttagaaaagt attaattagt acaattgtta gatgggctga agaatcccag 6600attgaaacac
caaaattagt cagagaaatg ttcagtctat tggtaaggca gtacgacact 6660gtaggtgaat
taatcagatc tcttggaaac acttatgtga taaatgacaa aacgaaagaa 6720gatgtagctc
agatgtgggt agggttgagc cagatcagag ctctcctacc tgttcaaatg 6780tctcaagatg
aagaaggtct tatgcgaatg aggctatgga aattagttaa caatcacaca 6840ttctttcaac
atcctgattt gattagagtt cttcgtgttc atgaaaatgt tatggctgtt 6900atgatcaata
ccttgggtag aagatcacaa gcacaatctg atgcttctca agctggtcaa 6960gaaggtgaac
ctgcagctaa ggagaaagat acgtcccatg aaatggtggt agcatgttgt 7020cgtttcctgt
gttatttttg cagaacttca cgtcaaaatc agaaagcaat gtttgaccat 7080ttaacatttt
tattagaaaa cagtaatatt ttactttcaa gaccttcact tagaggaagt 7140acccctcttg
atgttgccta ttcctctctc atggaaaata ccgaactggc attagctctt 7200agagaacatt
atttagagaa gatagctgtt tacttgtctc gctgtggatt acaatctaat 7260tcagaattgg
tagaaaaggg ttaccctgat ttgggttggg atccagttga gggagaaaga 7320tatttagact
ttttacgctt ctgtgtttgg gttaacggtg aaagtgttga agaaaatgca 7380aatctggtta
tacggctcct tatacgtcga ccagaatgtt tgggtcctgc acttcgtgga 7440gaaggtgaag
gattactgag agcaattata gatgctaata agatgtctga aagaatttca 7500gatcgcagaa
aaatgatgga ggaacctgaa aattctgccc atcatcagtt tgaacatcca 7560cttcctgagt
ctgatgaaga tgaggactat attgatacag gagcagcaat actggcattc 7620tattgtactc
tggtcgatct tttaggtcgc tgtgctccag atgctagtgt gattgctcag 7680ggaaagaatg
agtctcttag agctagagct attttgagat ctttagtacc tcttgaagat 7740ttatttggtg
tcttgagttt aaagtttaca cttaccaatc cagctattgg agaagaaagg 7800ccaaaaagtg
atataccatc tggtctaata ccatctcata agcaaagtat tgttttattt 7860ttagagagag
tatatggtat tgaacagcaa gatctcttct tcagattact cgaggaagca 7920tttttacctg
atttaagagc agcaactatg ctagatagaa ctgatggttc tgaatcagaa 7980atggcattag
ctatgaatcg ctatattgga aattctattc tccctttgtt gataaagcat 8040taccagtttt
atagtggtgc agataactat gcaagtcttt tagatgctac acttcataca 8100gtgtatcgcc
tatcaaaaaa tcgaatgcta actaaaggtc agcgagaggc agtatcagat 8160tttttggttg
ctctcacaag tcaattacag ccaagcatgt tactcaaact tcttcgaaag 8220ttaaccgttg
atgtatcaaa gctttctgag tataccacag ttgctttaag gttgcttact 8280ttacactatg
agcgttgtgc aaaatattat ggaactactg gtggacaagc tggtggatct 8340agtgatgaag
aaaaaaggct cactatgtta ctcttcagta atatttttga ttctttatca 8400aaaatggatt
atgatcctga attatttgga aaagcgcttc cctgcttgag tgctatagga 8460tgtgcacttc
cacccgatta ttcactgtcc aagaattatg atgaagaatg gtatagttca 8520aagggttcag
aaccgactga tgggccttat aatccactgc ccatcaatac ttctatggtt 8580tctctaaata
atgatttaaa cacaattgtt caaaaatttt ctgaacatta tcatgatgca 8640tgggctagtc
gaaaaatgga aaatggttgg gtatatggtg agcagtggtc tgacagctct 8700aaaactcatc
ctcgtttaaa accttataca ttgcttaatg attatgaaaa agagagatac 8760aaagaaccgg
ttagagagtc attgaaagct ctgttagcta taggatggaa tgtagagcat 8820actgaagttg
atattccttc taataacaga ggatcatcag tcagaagatc ttctaaagca 8880aatacatctg
atggttcaac accatttaat tatcatccca acccaattga tatgactaat 8940ttaacattga
gtagagaaat gcaaaatatg gcagagaggt tagctgaaaa ctcacatgat 9000atttgggcaa
aaaagaagaa agaagaactt gtttcatgtg gtggtggtat acacccacag 9060cttgttccat
atgatctttt aacagacaaa gagaagagga aagatagaga aagatctcaa 9120gaatttttga
aatatttaca atatcaagga tacaaactcc acaggcctac tcgaggaagt 9180gctgatgagc
aacaggccgc tgcagctgct gccacaggag agtccagatt tgcttacagt 9240ctactcgaga
aacttataca atatactgat aaagcttcta ttaatatgaa actactaaag 9300ccttctggta
cattcagtag acgctccagt tttaaaactt gttcaagaga cataaaattc 9360ttttccaaag
tggtattgct attggttgag aagtatttca gcactcacag aaattacttc 9420attgctgttg
ccactgcttc taataatgta ggagcagcct ctttaaaaga aaaagaaatg 9480gttgccagtt
tgttctgtaa gctggcaaat ttaattcgaa caaagctggc tgcttttggt 9540gcagatgttc
gaattactgt ccgttgtcta caagtgctag tgaaagctat agatgccaag 9600tcattggtaa
agaattgtcc tgaatttata aggacttcaa tgctgacatt tttcaataat 9660acagctgatg
acttaggcca aactattcag tgtttgcaag agggtcgtta cagtcacctt 9720agaggcactc
atcttaaaac atctacttct ttattttata taaatgatgt tgtactacct 9780gttctcactt
ctatgtttga tcatttggct gtgtgtgatt atggtagcga cttgttactt 9840gatgaaattc
aagtggcctc atatagaatg ttgggtagtt tatataattt aggaattgat 9900ccaactttaa
ctcatgacag aaaatattta aaaacagaaa ttgaaaggca taggcctgcc 9960attggtgctt
gtcttggtgc attttcatca acatttccag tcgcttatct tgaaccccat 10020ttaaataaac
ataatcagtt ttcattagtt aatagaattg ctgaacattc tcttgaagca 10080caggatattc
tagctagaat ggaaaacacc atgcctacat tggatgcgat cctttctgaa 10140gttgatcagt
tcattgaatc cgaaaagagt catacttcag caccacatgt tattgatgtg 10200attttgcctc
tgctttgtgc ttatttgcca agttggtgga gtcaaggtcc tgataatgtc 10260agtctcacag
cagggaatta tgtaacaatg gttactagtg atcatatgaa tcaactccta 10320aaaaatgtac
taaaattaat caaaaataat attggaaatg aaaatgctcc ctggatgacg 10380agaatagcag
cttacaccca gcagatcatc ataaactctt ctgaagaact gttgaaagat 10440ccattccttc
cattaacaca agttgttaag aagaggatag acaatatgtt tcaccgtgaa 10500gaatctcttc
gaggatttct aaaatcttca actgaagata cctctcaagt tgaagcagaa 10560attcaggagg
gctggcatct tattgttaga gatatatatt ctttttatcc actactaatt 10620aaatatgttg
atttacaaag aaatcactgg ttacgtaata atattccgga agctgaatac 10680ttgtatactc
atgttgctga tatatttaat atttggtcta aatcacagta ctttctaaaa 10740gaagaacaga
atttcatatc tgccaacgaa atagacaata tggctctaat tatgcccact 10800gcaactagga
gatctgcagt tgttttggat ggaacagctc ctgctggagg tggaaagaag 10860aaaaagaagc
atcgtgataa gaaaagagat aagaataaag aaatccaagc aagcttaatg 10920gtagcttgct
taaaacgttt attaccagtt ggtcttaacc tattcgctgg aagagaacaa 10980gagttagttc
agcattgtaa agacagatat ttgaagaaaa tgccagaata tgaaatagtg 11040gattttgcca
aaatccaatt aactcttcct gacaagatag atcctggaga tgagatgtct 11100tggcagcatt
atttgtactc aaaactggga aataaaaaag atatcagctc tgaaaaacca 11160cagcaaatcg
atgaggtagt tgataggatt gtggctatgg caaaagttct ttttgggctt 11220catatgattg
atcatccaca actacagagc aagacacaat acagatctgt tgtatccaca 11280cagagaaagc
gtgctgtcat agcttgtttc cggcaactat cactacatgc cttaccaagg 11340catcgagtaa
ttaacatatt tgctcgcgct tattgtgagc tgtggctgca agaagagaat 11400gttggtcaag
aaatcatgat tgaagatctt acacaaactt ttgaagatgc tgaattgaaa 11460aaaagagatt
ctgaagaaga tgaaagcaaa cctgatccac ttacccaatt agttacaaca 11520ttttgtcggg
gtgcaatgac tgaaaggagt ggagctttgc aagaagaccc actttatatg 11580tcctatgcag
aaattactgc aaaatcatgt ggagaagaag aagaagaagg tggagatgag 11640gaagaaggtg
gagacgaaga aggaggggca tctatccata agacaatggc aaaattagtg 11700gaacaagaaa
tggaaaaaca gaaactctta ttccatcaag ctcggctagc caacagaggt 11760gttgcagaaa
tggtattgtt acatatttca gcttgtaaag gtgttcccag tgaaatggtt 11820atgaaaactc
tccagctggg tatttctgtt ttacgtggtg gtaatcttga tattcaaatg 11880ggtatgctaa
atcatttgaa agaaaaaaag gatgttggat tttttacttc tatagctggc 11940ttgatgaact
cctgcagtgt gttggattta gatgcatttg aaagaaacac aaaagctgaa 12000ggcttaggag
ttggttcaga aggtgctgct ggtgaaaaga acatgcatga tgctgaattc 12060acctgtactc
ttttcagatt tattcaactt acctgtgaag ggcataactt agaatggcag 12120aattatctta
gaacccaagc tggaaataca acaacagtta atgttgttat ttgtactgtt 12180gattaccttt
tgagattaca ggaatcaatt atggacttct attggcacta ttcgagtaaa 12240gaattaattg
atcctgctgg aaaagccaac tttttcaaag caattggtgt ggctagtcaa 12300gtatttaata
cactctctga agtaattcaa gggccttgcc cacaaaatca acaagctctg 12360gctcattcaa
gattgtggga tgctgttgga ggatttttgt ttcttttctc tcatatgcaa 12420gataagctat
caaaacattc tagtcaagta gacttactga aagaactttt gaatttacag 12480aaagatatga
taacaatgat gctatcaatg ttggaaggta atgttgtgaa tggtactatt 12540ggaaaacaga
tggtagacac attagttgaa tctgcctcaa atgtggaatt gattttgaag 12600tacttcgaca
tgtttttgaa attgaaagat ttgacatcct ctgctagctt cttggaactt 12660gatccaaacc
atgaaggctg ggtaacacct aaagatttta aagaaaaaat ggaacagcag 12720aaaagttata
ctccagaaga aatagacttc atgttacagt gctgtgaaac caatcatgac 12780ggtaaaattg
actatgttgg cttcacggat agattccatg agccggccaa ggaaattggt 12840tttaacctag
ctgttcttct cacaaattta tctgagcata tgccaaatga accgagactt 12900gctcgctttt
tagaaacagc tggtagtgtt cttaactact ttgaaccttt cctgggacga 12960attgaaatat
taggtagtag taaacgaatc gagcgtgtat atttcgagat taaagaatca 13020aatattgaac
agtgggaaaa acctcaaatc aaggaatcta aacgagcatt tttctattca 13080attgtcactg
aaggaggtga caaagaaaaa ttggaagctt ttgttaattt ttgtgaagat 13140gccatatttg
agatgacaca tgccagtggg cttatggcaa ctgatgatgg tacaggctct 13200ggaggaggaa
aacaaagagc atcctcttat tcttatatgg aagatgaaga tgaagaaagg 13260aatccaatca
gacgtggttg gcaagcaact aaagatggaa tttactttat gttctcaatg 13320ttatctccta
gcaatattaa acataaaatt attgaaatgc aacaaatgtc aattattgaa 13380ctaatgattg
gttttataaa actatttttc tacatgtttt attactcagg atattctgta 13440tcagttgtac
tgaagtatat tggtggtatt atattttcat tgatgagggg accacaaatt 13500gaagagccag
ttgtagaagt taaagaggaa gaaaaatctg gacctctgag gataatgcct 13560gctttgccac
cacctgaaga tagctctctg cttccatctg atgggtcaag agacatgaaa 13620aaagaagaca
gtcagcctcc atcaaaagtc atagaagggg ctattcccat agaagaagga 13680ggtgagagga
gctcagagga acatgcggga gaccatgtaa aaccagaaaa tgaagagcaa 13740cctccaacac
caacacttgc tgatatattg ggtggagaag cagcaagaaa agaagcagca 13800caaagagcag
aagtcgctgc tgaacaagaa gcagttatgg ctgcttttga ggcagaatct 13860aaaatagaaa
aagtttcaga gccttctgct gtctctcaaa ttgattttaa caagtatact 13920caccgggctg
tcagtttcct tgctcgtaat ttctataatc ttaaatatgt agcattggtt 13980ttggctttct
gcattaactt tattttattg ttctacaagg taacaacatt gggtgaagat 14040gatgatgctg
ctagcggaga agggagtgtt gaacaactaa tggaagaatt aacaggcgaa 14100ggtgatgatg
tgagtggcgg aggaagtagt ggtggagaaa gtggtgaaga ggatccaatt 14160gaaatggttc
atgtggatga ggatttcttt tatatggcac atgttatgcg attggctgca 14220atcctacatt
ctcttgtttc tttagctatg ttgattgcat attatcattt gaaggtccct 14280ctagctatat
tcaagagaga aaaagaaata gctcgtcgac ttgagtttga tggtttgtac 14340attgctgagc
aaccagaaga tgatgatatt aaatcacatt gggataaact ggttatctgt 14400gcaaaatcat
ttcctgttaa ttactgggat aaatttgtga agaaaaaggt tcgacagaaa 14460tacagtgaaa
cttatgactt tgattcaata agtaatcttt tgggaatgga aaaaacatct 14520ttcagtgccc
aagatactga agaaggatcg ggacttattc attacatttt gaactttgac 14580tggaggtatc
agctttggaa agcaggagtc acaatcacag ataatgcatt tttgtacagt 14640ttattatact
tcatcttttc aattttggga aacttcaata actttttctt tgctgcccat 14700ttacttgatg
ttgcagttgg ttttaaaaca ttgaggacta ttttgcaatc agtcacacac 14760aatggaaaac
agcttgtatt gactgtaatg ctgctaacca tcatagtata catctatact 14820gtcattgctt
tcaacttctt ccgaaaattt tatgtccaag aagaggatga ggaagtggat 14880aaaaaatgcc
acgatatgtt aacttgtttt gtattccacc tttacaaagg agttagagct 14940ggtggtggta
ttggtgatga gattgaacct cctgatggtg atgattatga agtttacagg 15000ataatgtttg
atattacgtt tttctttttt gttattgtca tcttgctagc catcattcaa 15060ggtttgatca
ttgatgcatt tggtgaattg agagatcagt tagaaagtgt aaaagaagac 15120atggaatcta
actgcttcat ttgtgggata ggaaaagatt attttgataa agttccccat 15180ggttttgaca
ctcatgttca acaagaacat aacttggcta attacatgtt ctttcttatg 15240catctgatta
acaagccaga tactgaatac acaggtcaag aaacctatgt ctggaacatg 15300tatcagcaac
gttgttggga tttcttccca gttggtgact gttttcgtaa acagtatgaa 15360gatgaactgg
gaggtggtgg tggttaattc atttgggtgg gtggtggcta aatttatatt 15420attaaaacaa
aattaatgct gggaactatc aaacatcctt caattttatt aaaatttcag 15480ctaaattcaa
caatatatct tatgatattg tatttgtcta atgaaggaat agaactatcg 15540tgttatgaat
cagtgaagtt ttcacttgtt tagcataatt tatgctaagt ttactattgc 15600aaaatacttt
ctttatatcc gaaaatgttg taaaataaat gtaaatggtg tggccttaaa 15660tataatg
15667201486DNAArtificial SequenceDescription of artificial sequence note
= synthetic construct 20cagtgatcta cttctgggtc aacttatgtt ttgtttatgg
ttttcattaa atttacgaga 60cattaaaaac taagaatatt gattgcttat gaagttatca
atgataacta atattgttat 120ttcgatgctg ttatgttgga tacattgttg gtgactggca
ttagcttatg cgtgaaacct 180tcttcgtaaa tattcaaatt tagaatcaaa tattattgat
actatttctt tttcatactt 240tacattaata ttcttcaaaa ttaaaaatgc caggagtaga
gcatgttact aacaaagtcg 300ttgttcatcc tttagttcta ttaagtgttg ttgatcattt
caatagaatg ggtaaaattg 360ggaatcagaa gagagtagtt ggcgtattat taggatgctg
gaaggcaaaa ggtgttttag 420acgtatctaa tagttttgca gtgccatttg atgaagatga
taaagacaaa tcagtttggt 480ttttagacca tgattattta gaaaatatgt atggcatgtt
taagaaagtt aatgcaagag 540aaaaagttgt tggctggtat catacaggcc caaagttaca
tcaaaatgat gttgcaatta 600atgaacttat acgccgttac tgccctaact cagttcttgt
tattatcgat gcaaaaccaa 660aggatcttgg tttacctaca gaagcatata gagcagttga
agaagtacat gatgatggtt 720ctcctacgac aaaaacattt gagcatgttc ccagtgaaat
aggggctgaa gaagcagagg 780aagtgggtgt tgaacatctg ctgagagata taaaagatac
aactgtcggc tcactttcgc 840aaagggttac taatcaattt cttggtctca aaggccttaa
tcaacaaatt caagacatca 900gggattacct tatgcaggtt gttgaaggaa aattgcccat
caaccatcaa ataatatatc 960agcttcaaga catatttaat ctccttcctg acatgaacca
tgggaacttt gttgattcat 1020tatacataaa aacaaatgat cagatgcttg tcgtttatct
cgctgccctc gttagagcta 1080ttgttgcctt gcataatctg atcaataata aactcagtaa
tcgtgatgcc gaaaaaaaag 1140aaagcaccaa aaaagaagaa aaacctaaag aagaagaaag
tgtaaaaaaa gaattgaagg 1200ctaagtaaat gatgccagtt cattctcagg attgaacaga
tgttatttat tgtaagattt 1260aattataatc ttttatacat atgtgtacat taatagtata
tacatcgttt tcaacaaatc 1320agatttataa tttgtaaaaa aaaaagaaaa gggaacaaaa
tgatatttaa atatttaact 1380atttatacat tttttttgtg agtacaatta aaccatttag
ttgaacttgt gaactacaaa 1440aattaatttg taataaaacc agtctaattt cttaatttta
aaaaaa 148621362DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 21gatgttggct
tctttataca atggaaacgt tcatatttgg aatcatgaga cccagcagct 60agtaaagtct
tttgaagtat gcgaccaacc agttcgtgct gcagtatttg ttcctcgcaa 120gaactggatt
gtaacagggt cagatgatat gcagatcaga gtttttaatt acaatactct 180tgaaagagta
aatgcatttg aagctcattc agactatgtc agatgtatag cagttcaccc 240agcccatcct
tatattctga catcatcaga tgatatgtta atcaaattgt ggaattggtc 300taaggcttgg
gtctgccaac aaatatttga aggacatacc cattatgtaa tgcaagttgt 360ta
36222369DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 22aacgtcttgc tttagctcca aaagaaatgg gaccatgtga
aatatatcct caaagtattt 60cacataatcc aaatggaaga tttgtcgttg tttgtggaga
tggtgaatac ataatttata 120ctgctatggc tttaagaaac aaaagttttg gatcagccca
agaatttgta tgggcacaag 180atagttctga ctatgctata agagaaggaa catctactgt
aaaactattt agacagttca 240aggagcgcaa gacacttaag ccagagtttg gtgctgaagg
tatatttggt ggacaattgc 300ttggtgtcag atcagtctca ggattatgtt tatatgattg
ggaaactctg gaattaatca 360gaagaatag
36923374DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 23tgaattagca
acgagacaag ccaggcttga tgttgcgcaa gcagctcttc acagagccca 60acattatggt
ggacttctgc ttctctccac atcagcagga aatcgggaaa tgatggaaaa 120acaggaaaga
gttcaggaga aaatggaaaa aataatgtta gcttccttgc atatttcctg 180cttggagacc
ttgccaaatg tcttcaaatt cttattgaca ctgatcgcat tccagaagct 240gccttttttg
ccaggacata tttgccgagt gaggttcctc gagttgttgg gttatggcga 300ggtttagcaa
aggcaggaca gagccttgca gatccttcgc agtatctaat ctctttccag 360gttatgcaga
tgct
37424355DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 24gatggctgtg gtggaacaac cttgttatac tctgatcaat
tttccatctg atttagagcc 60tcctaatgaa atgcagctaa aatctgattt agaaaatgga
gacactaaag cgaaaattga 120agctttgaaa aatattattc atttaattgc aaatggagag
cgtctacctg gtttacttat 180gcatatcata cgttttgttt tgccatcaca agaccatacc
ataaaaaaat tactgcttat 240attttgggaa atcgttccta aaactactcc agatggcaaa
cttctccagg aaatgatttt 300ggtttgtgat gcctatcgga aggacttaca acatcctaat
gaatttgtca gggga 35525382DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 25gtgataataa
tgttaaatta attgtattag atcgtcttat atctttaaaa gaaattccta 60ctcatgaacg
ggttcttcaa gatttagtta tggatatatt acgtgtgcta gccagtcctg 120acatggaagt
aaagaaaaaa gccttaagcc tagcactgga tctcactact tcacggtgtg 180ttgaagaaat
ggttttaatg ttaaaaaaag aagttgctaa gacacataac ttgacagaac 240atgaagatgc
tggaaaatat cgtcaacttc ttgttagaac tcttcattcc tgttgcatga 300agtttccaga
tgttgctgct tcagttatac cagtattaat ggaatttctc tcagatacaa 360gtgaactagc
ttcgtatgat gt
38226376DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 26tcaacactga ttgtcgcaat gctcttgcta atatgttagt
tgctcaacag aatgaggagt 60actcacttat taaggccaaa gaaaaatccg tccataccat
ccaagttgat gatcctgtat 120catttttaca attatcaacg atacgatcat ctgattttgg
ttcagaaaat gtttttgagc 180ttagtttaaa tcaagctgtc ggggggccaa atacagctac
aaacacagct gaacttccat 240tttcagccag taaattgaat aaagtaactc agctgacagg
gttttcagat ccagtttatg 300cagaagcata tgttcatgtc aaccagtatg acattgtact
tgatgttttg atcgttaatc 360aaacaggtga tacact
37627340DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 27attcgaattg
catgtaaatt attggaagaa gaaagctctg gagaatatgc agactctcca 60ctttttgatt
ttattgaagc atgtttacgc cacaaaagtg aaacagttgt ttatgaagca 120gctgctgctc
ttgtaaactt acgccacact actaccagac aaatcacgcc tgcagtaagt 180gttcttcaat
tattttgttc ttctccaaaa ccagcgcttc gttttgctgc tgtgagaact 240cttaataagg
tagcaatgac acatcccact gctgtaacgt catgcaatat tgacttagag 300aaccttataa
cggattcaaa tcggtccata gctaccttgg
34028383DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 28gttccgatcc tatcagtctt acggaatctg agacagaata
tcaagttaga gtcacgaagc 60atgttttcaa aaatcatatt gttcttcagt ttgactgtac
aaataccatg agtgaccagc 120tactggagaa agttcgagtg cagttagaag tgagcgaagg
ttaccagatc gtagctgagg 180tcccctgcca aagattagcc tgttcggaaa catcacctac
ttatattgcc ctgcaatttc 240cagatgcccc taatcttact gtcacaaact ttgctgctac
tctgaggttt gttgtaaagg 300attgcgaccc aatgaccggt atccctaact cagatgatgg
ttatgaagaa gattatatgc 360ttgaagatgt cgaagtgatg ctt
38329388DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 29gacttacgaa
gagcaacttc ggtgctgcat gggaggaagg cgaatcgtat agtgagctag 60aggacactta
taacttgtca ggaataaaca gcctcgaaga ggcagtgagg agtgttgtca 120gtttcatggg
gatgcagcct gctgacagga gcgacagggt acagcctgat aaatcttcac 180acactgtcta
cctcggaggc atgttccgtg gtggagttga agtgttagct agagctaaac 240tggccatggg
taattcccca ggcgttgcca tgcaacttac agtccgctct ccaaatccag 300atatttgtga
actgattatt tctgtagtcg ggtaaaaaaa atatataaat atatttgaga 360agtacacagt
ttcctctcag atgttgta
38830412DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 30acttgtggat catgttgcca gtgtcaggcc aaatattttt
gtgggtcgag tcgaaggttc 60tgctgtttat caaaaatggt attttgaagt gactttagat
catatggagc aaaccaccca 120tatgacaccg catctaagaa ttggctgggc taacacttct
ggttatgttc cctttcctgg 180cggtggtgaa aaatggggcg gtaatggagt tggtgatgat
ctctactctt ttggttttga 240tggagctgca ttatggacag gtggaagaaa aactgtagtc
cttcctcatg ctatggaacc 300ttacataaga aagggagatg ttattggttg tgctttcgat
ctgactgttc caattattac 360atttactttt aatggaacat taatccgagg atcatttagg
gattttaatc tt 41231342DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 31atgtgaagga
aaagaaacta accacaagta taagatggaa cctgatacaa aattatttcc 60agctatattt
gttgaagcta caagcaaaga aattctacaa attgagcttg gtcgtacatc 120aactacactg
cctttatcag cagctgttct ccaaaattca gaaagacatg tcattcctca 180gtttccacca
agacttaaag ttcagtgtct aaaaccacat cagtgggcac gtgttcctaa 240tatttcattg
catgtccacg ctctgaaatt atcagatata agaggttgga gtatgctttg 300tgaagatcca
gtttcaatgt tagcattaca tatacctgaa ga
34232432DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 32aggatggcaa agatgaacat cgtaagggtt tacttcaaat
gaaaatggca gagggggcaa 60aactacagat gtgctatctc ttacagcatt tatgcgatat
acaattgcgg catcgggttg 120aagccattat taattttagt tatgactata ttgctgatct
tcagcaggat cagttgagaa 180gatatgttga tattaagcag tctgatcttc catcatcagt
tgctgcaaga aaaacaagag 240agtttcgttg ccctccaaga gaacagatga atgctatcat
aaattttaaa aatttagaag 300aagatgacaa agaaaactgt ccatgtggtg aagaactgag
ggagagatta aacacatttc 360atgaagaaac tatgagtaaa gtttcacttg ttgctctcca
agagccacaa gaagatgaga 420acggtgaaac ac
43233367DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 33tctccctttg
ttgataaagc attaccagtt ttatagtggt gcagataact atgcaagtct 60tttagatgct
acacttcata cagtgtatcg cctatcaaaa aatcgaatgc taactaaagg 120tcagcgagag
gcagtatcag attttttggt tgctctcaca agtcaattac agccaagcat 180gttactcaaa
cttcttcgaa agttaaccgt tgatgtatca aagctttctg agtataccac 240agttgcttta
aggttgctta ctttacacta tgagcgttgt gcaaaatatt atggaactac 300tggtggacaa
gctggtggat ctagtgatga agaaaaaagg ctcactatgt tactcttcag 360taatatt
36734396DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 34gacttgctcg ctttttagaa acagctggta gtgttcttaa
ctactttgaa cctttcctgg 60gacgaattga aatattaggt agtagtaaac gaatcgagcg
tgtatatttc gagattaaag 120aatcaaatat tgaacagtgg gaaaaacctc aaatcaagga
atctaaacga gcatttttct 180attcaattgt cactgaagga ggtgacaaag aaaaattgga
agcttttgtt aatttttgtg 240aagatgccat atttgagatg acacatgcca gtgggcttat
ggcaactgat gatggtacag 300gctctggagg aggaaaacaa agagcatcct cttattctta
tatggaagat gaagatgaag 360aaaggaatcc aatcagacgt ggttggcaag caacta
39635557DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 35aatttagaat
caaatattat tgatactatt tctttttcat actttacatt aatattcttc 60aaaattaaaa
atgccaggag tagagcatgt tactaacaaa gtcgttgttc atcctttagt 120tctattaagt
gttgttgatc atttcaatag aatgggtaaa attgggaatc agaagagagt 180agttggcgta
ttattaggat gctggaaggc aaaaggtgtt ttagacgtat ctaatagttt 240tgcagtgcca
tttgatgaag atgataaaga caaatcagtt tggtttttag accatgatta 300tttagaaaat
atgtatggca tgtttaagaa agttaatgca agagaaaaag ttgttggctg 360gtatcataca
ggcccaaagt tacatcaaaa tgatgttgca attaatgaac ttatacgccg 420ttactgccct
aactcagttc ttgttattat cgatgcaaaa ccaaaggatc ttggtttacc 480tacagaagca
tatagagcag ttgaagaagt acatgatgat ggttctccta cgacaaaaac 540atttgagcat
gttccca
55736530DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 36atgaacttat acgccgttac tgccctaact cagttcttgt
tattatcgat gcaaaaccaa 60aggatcttgg tttacctaca gaagcatata gagcagttga
agaagtacat gatgatggtt 120ctcctacgac aaaaacattt gagcatgttc ccagtgaaat
aggggctgaa gaagcagagg 180aagtgggtgt tgaacatctg ctgagagata taaaagatac
aactgtcggc tcactttcgc 240aaagggttac taatcaattt cttggtctca aaggccttaa
tcaacaaatt caagacatca 300gggattacct tatgcaggtt gttgaaggaa aattgcccat
caaccatcaa ataatatatc 360agcttcaaga catatttaat ctccttcctg acatgaacca
tgggaacttt gttgattcat 420tatacataaa aacaaatgat cagatgcttg tcgtttatct
cgctgccctc gttagagcta 480ttgttgcctt gcataatctg atcaataata aactcagtaa
tcgtgatgcc 53037544DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 37tacttcattg
tcataaaggg gtaacattgc tgaatccagc gtaaaggtta cagtgactct 60cacctggtta
taacagtttt gctttgtaat catgggttct gagagatata gcttttcttt 120gactactttc
agtccatctg gaaaattagt tcaaattgag tatgcacttg ccgcagtcgc 180agctggagct
ccatcaatcg gtatcagagc atccaatgga gttgtattgg ctactgaaaa 240caaatacaaa
tcaattttat atgaagaaca tactattcaa aaagtagaaa tgataactaa 300acacattgga
atggtctaca gtggaatggg acctgattat aggctactag tgaagagagc 360tagaaaaatg
gctcaacaat aacagttagt ttacggtgag cctattccta ctgcacagct 420tgttcaacga
gttgccatgg ttatgcagga gtacactcaa tctggaggtg ttagaccttt 480tggagtttct
ttactcattg ccgggtggga tggggataaa ccatctctgt ttcaatgtga 540tcca
54438587DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 38acacattgga atggtctaca gtggaatggg acctgattat
aggctactag tgaagagagc 60tagaaaaatg gctcaacaat aacagttagt ttacggtgag
cctattccta ctgcacagct 120tgttcaacga gttgccatgg ttatgcagga gtacactcaa
tctggaggtg ttagaccttt 180tggagtttct ttactcattg ccgggtggga tggggataaa
ccatctctgt ttcaatgtga 240tccatctgga gcatactttg cctggaaagc tactgcaatg
ggaaaaaatt ttgtcactgg 300caaaacattt ctagaaaaga ggtacagtga aactttagag
ctggatgatg cagtacatac 360tgcaattctc actcttaaag aaaactttga aggccaaatg
acttcggaca atatcgaggt 420cggagtttgt gatgatcaag ggttcagagt tttagatcct
acaacagtga aggattatct 480ggctaatatt ccataaattt attattaaaa tttgatttta
taattaataa aaaggtgatt 540gcttatggat atgtgtgatg cctaaataaa atattatttt
ttattgg 58739550DNAArtificial SequenceDescription of
artificial sequence note = synthetic construct 39atcattgatg
atggttgaga aagttccaga ctctacatat gaaatggttg gaggtcttga 60taagcaaatt
aaggaaatca aagaagtaat tgaacctcct gtaaaacatc cagaactgtt 120tgatgcacta
ggaatagctc agcccaaagg agttttatta tatggaccac ctggaacagg 180taaaacactt
ttggcaagag cagttgccca tcacactgag tgcacgttca ttcgtgtgtc 240aggatctgag
ttggttcaga aattcattgg ggaaggatcc agaatggtta gagaattgtt 300cgtcatggca
agggaacatg ctccatctat catatttatg gatgaaatcg attcaatagg 360ttcatcacgt
atcgaatctg ggagtggtgg tgattctgaa gtccagagaa caatgttaga 420gttattgaac
caattggatg gcttcgaagc cacaaaaaat attaaggtca taatggccac 480taataggatt
gatattttgg accctgctct tctgcgtcct ggaaggatag atcgtaagat 540tgagttcccc
55040580DNAArtificial SequenceDescription of artificial sequence note =
synthetic construct 40tccatctatc atatttatgg atgaaatcga ttcaataggt
tcatcacgta tcgaatctgg 60gagtggtggt gattctgaag tccagagaac aatgttagag
ttattgaacc aattggatgg 120cttcgaagcc acaaaaaata ttaaggtcat aatggccact
aataggattg atattttgga 180ccctgctctt ctgcgtcctg gaaggataga tcgtaagatt
gagttccccc caccaaatga 240ggaagctcgt ttagatatcc ttagaattca ttcacgtaaa
atgaatctta cccggggtat 300caacttgcgt aaaattgccg agctcatgcc tggagcttca
ggtgcagaag taaagggtgt 360ctgtactgaa gcagggatgt atgccctgag ggagaggaga
atccatgtca cccaagaaga 420tttcgaaatg gctgtggcca aggttatgca aaaggactcc
gagaagaata tgtcaatcaa 480gaaattatgg aaataaacga ctcacttatt tttttttttt
tttactctgt ttaaaaagct 540ttaaatatat agatgtttgt gaggttttgt taaaaataaa
580
User Contributions:
Comment about this patent or add new information about this topic: